| Literature DB >> 25206928 |
Jan P Buchmann1, Ari Löytynoja2, Thomas Wicker3, Alan H Schulman4.
Abstract
BACKGROUND: CACTA elements are DNA transposons and are found in numerous organisms. Despite their low activity, several thousand copies can be identified in many genomes. CACTA elements transpose using a 'cut-and-paste' mechanism, which is facilitated by a DDE transposase. DDE transposases from CACTA elements contain, despite their conserved function, different exon numbers among various CACTA families. While earlier studies analyzed the ancestral history of the DDE transposases, no studies have examined exon loss and gain with a view of mechanisms that could drive the changes.Entities:
Keywords: DNA transposons; Intron loss; Molecular evolution; Plants; Transposases
Year: 2014 PMID: 25206928 PMCID: PMC4158355 DOI: 10.1186/1759-8753-5-24
Source DB: PubMed Journal: Mob DNA
Exon/intron boundaries of the 34 analyzed transposases with more than one exon.
| EnSpm12_Fves | 462 | 564G | | | | |
| C | 718 | 771I | | | | |
| EnSpm10_Fves | 826 | 846 | | | | |
| Joey | 842 | 893II | | | | |
| Janus | 837 | 894II | | | | |
| F | 846 | 894II | | | | |
| G | 847 | 894II | | | | |
| Norman | 879 | 921III | | | | |
| En1 | 879 | 925III | | | | |
| Alfred | 885 | 925III | | | | |
| H | 838 | 972 | | | | |
| EnSpm3_Vvin | 821 | 894II | 856 | 925III | | | |
| EnSpm8_Sbic | 754 | 783I | | | | |
| Storm | 827 | 782I | | | | |
| Sherman | 831 | 782I | | | | |
| J | 495 | 521 | 750 | 885 | | | |
| EnSpm2_Mdom | 755 | 782I | 886 | 893II | | | |
| Baldur | 731 | 782I | 837 | 895II | | | |
| I | 834 | 782I | 954 | 910 | | | |
| Isidor | 857 | 894II | 892 | 920III | | | |
| Radon | 841 | 894II | 877 | 921III | | | |
| Rufus | 851 | 894II | 887 | 921III | | | |
| EnSpm13_Vvin | 821 | 894II | 856 | 925III | | | |
| EnSpm5_Vvin | 824 | 894II | 859 | 925III | | | |
| Isaac | 861 | 894II | 900 | 925III | | | |
| Sandro | 744 | 782I | 851 | 928III | | | |
| Balduin | 850 | 895II | 890 | 930III | | | |
| DOPPIA | 843 | 894II | 890 | 936III | | | |
| K | 744 | 7,82I | 850 | 936III | | | |
| Horace | 712 | 711 | 981 | 1,054 | | | |
| EnSpm4_Fves | | | |||
| EnSpm3_Fves | 770 | 781I | | | ||
| Seamus | 730 | 782I | 833 | 892II | 878 | 925III | | |
| Dario | 726 | 711 | 842 | 839 | 895 | 890III | | |
| Aron | 851 | 833 | 899 | 879II | 1,013 | 1,060 | | |
| Korbin | 510 | 567G | 718 | 782I | 814 | 894II | | |
| Chester | 520 | 563G | 728 | 777I | 823 | 889II | 858 | 920III | |
| Baron | 522 | 568G | 730 | 781I | 825 | 893II | 861 | 925III | |
| EnSpm8_Fves | 158 | 163 | 830 | 893II | |||
| ATENSPM6_Athal | 802 | 809 | 918 | 922III | 978 | 981 | 1,011 | 1,012 |
The positions are relative to the beginning of the transcription start and given as follows:On the protein sequence | on the trimmed multiple sequence alignment (MSA). 0 and numbers in italic indicate boundaries with GUIDANCE scores below 0.804 and removed in the final MSA. Superscripts indicate Regions I to III and G cluster, respectively (Figure 1).
Figure 1Multiple sequence alignment based on protein sequences of the 64 analyzed transposases. Colored boxes indicate amino acids, gray boxes indicate residues with a GUIDANCE score below 0.804, and white boxes indicate gaps in the multiple sequence alignment (MSA). The plot below the MSA shows GUIDANCE scores for the corresponding position in the MSA. Columns with a score below 0.804 are indicated in light blue while columns with a score of 0.804 and above in dark blue. Positions relative to the MSA and corresponding GUIDANCE score are shown between the MSA and the plot. Highly conserved DDE transposase motifs as described in [22] are depicted on top. In the phylogenetic tree, colors indicate the host as shown in the legend. Major clades are depicted α to θ. Exon/intron boundaries are depicted as blue circles if their GUIDANCE score was above 0.804 and red otherwise. The number in the boundary indicates the boundary number on the corresponding transposase. Regions I to III are indicated by dashed lines and corresponding roman capitals. Positions of putative intron gain are depicted as described in the legend.
Figure 2Majority-rule based phylogram of the 64 analyzed transposases. The phylogenetic tree is the same as in Figure 1. Bootstrap values represent the percentage out of 1,000 bootstrap replicates. Only bootstraps below 100% are indicated. Transposase hosts are colored as indicated in the legend. Numbers in parentheses indicate the number of exons. Clades are indicated by dashed lines and labeled α to θ.
Distances between exon/intron boundaries within Region I
| Baron2 | 1 | Baron2 | | | | | | | | | | |
| C1 | 11 | 10 | C1 | | | | | | | | | |
| Chester2 | 5 | 4 | 6 | Chester2 | | | | | | | | |
| EnSpm2_Mdom1 | 0 | 1 | 11 | 5 | EnSpm2_Mdom1 | | | | | | | |
| EnSpm3_Fves2 | 1 | 0 | 10 | 4 | 1 | EnSpm3_Fves2 | | | | | | |
| I1 | 0 | 1 | 11 | 5 | 0 | 1 | I1 | | | | | |
| K1 | 0 | 1 | 11 | 5 | 0 | 1 | 0 | K1 | | | | |
| Korbin2 | 0 | 1 | 11 | 5 | 0 | 1 | 0 | 0 | Korbin2 | | | |
| Sandro1 | 0 | 1 | 11 | 5 | 0 | 1 | 0 | 0 | 0 | Sandro1 | | |
| Seamus1 | 0 | 1 | 11 | 5 | 0 | 1 | 0 | 0 | 0 | 0 | Seamus1 | |
| Sherman1 | 0 | 1 | 11 | 5 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | Sherman1 |
| Storm1 | 0 | 1 | 11 | 5 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
Distances between exon/intron boundaries in the MSA within Region I (depicted in Figure 1). The distances are given in residues in the alignment.
Distances between exon/intron boundaries within Region III
| Balduin2 | 5 | Balduin2 | | | | | | | | | | | | |
| Baron4 | 0 | 5 | Baron4 | | | | | | | | | | | |
| Chester4 | 5 | 10 | 5 | Chester4 | | | | | | | | | | |
| En11 | 0 | 5 | 0 | 5 | En11 | | | | | | | | | |
| EnSpm13_Vvin2 | 0 | 5 | 0 | 5 | 0 | EnSpm13_Vvin2 | | | | | | | | |
| EnSpm3_Vvin2 | 0 | 5 | 0 | 5 | 0 | 0 | EnSpm3_Vvin2 | | | | | | | |
| EnSpm5_Vvin2 | 0 | 5 | 0 | 5 | 0 | 0 | 0 | EnSpm5_Vvin2 | | | | | | |
| Isaac2 | 0 | 5 | 0 | 5 | 0 | 0 | 0 | 0 | Isaac2 | | | | | |
| Isidor2 | 5 | 10 | 5 | 0 | 5 | 5 | 5 | 5 | 5 | Isidor2 | | | | |
| Norman1 | 4 | 9 | 4 | 1 | 4 | 4 | 4 | 4 | 4 | 1 | Norman1 | | ||
| Radon2 | 4 | 9 | 4 | 1 | 4 | 4 | 4 | 4 | 4 | 1 | 0 | Radon2 | | |
| Rufus2 | 4 | 9 | 4 | 1 | 4 | 4 | 4 | 4 | 4 | 1 | 0 | 0 | Rufus2 | |
| Sandro2 | 3 | 2 | 3 | 8 | 3 | 3 | 3 | 3 | 3 | 8 | 7 | 7 | 7 | Sandro2 |
| Seamus3 | 0 | 5 | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 5 | 4 | 4 | 4 | 3 |
Distances between exon/intron boundaries in the MSA within Region III (depicted in Figure 1). The distances are given in residues in the alignment.
Distances between exon/intron boundaries within Cluster G
| Chester1 | 5 | Chester1 | |
| EnSpm12_Fves1 | 4 | 1 | EnSpm12_Fves1 |
| Korbin1 | 1 | 4 | 3 |
Distances between exon/intron boundaries in the MSA within Cluster G (depicted in Figure 1). The distances are given in residues in the alignment.
Figure 3Model for the loss and gain of introns in transposases. Simplified phylogenetic tree based on the consensus exon numbers per clade as described in the text. Below the tree the putative ancestor transposase with four exons is depicted. Exons are depicted as gray rectangles with introns as colored lines. Blue, red and green depict introns conserved in Regions I to III, G indicates cluster G with the putative intron gain. Conserved introns share the same color band. Intron loss is depicted by its corresponding color and circled −, intron gain by an encircled +. Gray balloons indicate how the observed configuration arose from the putative ancestor.