| Literature DB >> 31788035 |
Yatong Sang1, Bo Gao1,2, Mohamed Diaby1, Wencheng Zong1, Cai Chen1, Dan Shen1, Saisai Wang1, Yali Wang1, Zoltán Ivics2, Chengyi Song1.
Abstract
BACKGROUND: The Tc1/mariner superfamily might represent the most diverse and widely distributed group of DNA transposons. Several families have been identified; however, exploring the diversity of this superfamily and updating its classification is still ongoing in the life sciences.Entities:
Keywords: DD36E; Horizontal transfer; Tc1/mariner transposons
Year: 2019 PMID: 31788035 PMCID: PMC6875036 DOI: 10.1186/s13100-019-0188-x
Source DB: PubMed Journal: Mob DNA
Fig. 1Taxonomic distribution of IC elements among Eukaryota. IC elements identified in the branches are shown as red stars, and the numbers in parentheses represent the number of species that with IC elements in each branch
Taxonomic distribution of IC elements
| Taxonomic distribution | Mammalia (Bat) | Amphibia (Frog) | Actinopterygii | Myxini | Arachnida | Insecta |
|---|---|---|---|---|---|---|
| Number of species containing IC | 4 | 4 | 132 | 1 | 9 | 4 |
| Number of species containing full IC | 4 | 4 | 76 | 1 | 7 | 4 |
| Number of Species containing intact IC | 0 | 3 | 33 | 1 | 3 | 3 |
| Length of full IC | 1005–1487 | 1194–1225 | 1099–1467 | 1226 | 1200–1234 | 1213–1226 |
| Length of intact IC | N | 1194–1225 | 1169–1229 | 1226 | 1220–1224 | 1217–1226 |
| Transposase length of intact IC | N | 346 | 314–356 | 346 | 346–382 | 346–366 |
| TIR length of intact IC | N | 28–29 | 24–32 | 28 | 25–27 | 25–28 |
| TSD | TA | TA | TA | TA | TA | TA |
Fig. 2Structural organization of ICs. The blue arrows represent TIRs; the black rectangles represent HTH motifs; green rectangles represents GRPR sequences; the orange wire frame represents the NLS; the red rectangles represents catalytic domains, D:aspartic acid, E:glutamic acid; and the gray region represents transposases. The dotted box represents the portion of the transposases that can be deleted in the particular species. Clho:Clitarchus hookeri; Crse:Cryptotermes secundus;Mahr: Machilis hrabei; Xebr:Xenocatantops brachycerus; Lahe:Latrodectus hesperus; Pate:Parasteatoda tepidariorum; Stmi:Stegodyphus mimosarum; Lewa:Leuciscus waleckii; Fuhe:Fundulus heteroclitus; Eslu:Esox lucius; Taru:Takifugu rubripes; Epbu:Eptatretus burgeri; Rhma:Rhinella marina; Napa:Nanorana parkeri; Raca:Rana catesbeiana; Xetr:Xenopus tropicalis. Mylu: Myotis lucifugus;Epfu: Eptesicus fuscus; Myda:Myotis davidii; Mybr:Myotis brandtii
Incomer in bats
| Species | TE name | Length | Transposase length | Copy number | % Ave. Divergence |
|---|---|---|---|---|---|
| IC-MITE1-Mylu | 1220 | 235 | 407 | 4.4 ± 0.08 | |
| IC-MITE2-Mylu | 810 | 235 | 951 | 5.2 ± 0.09 | |
| IC-MITE1-Epfu | 1487 | 235 | 1 | NAa | |
| IC-MITE2-Epfu | 810 | 235 | 367 | 3.9 ± 0.05 | |
| IC-MITE1-Myda | 1056 | 235 | 4 | NA | |
| IC-MITE2-Myda | 810 | 235 | 147 | 5.3 ± 0.6 | |
| IC-MITE1-Mybr | 1005 | 235 | 48 | 4.0 ± 0.15 | |
| IC-MITE2-Mybr | 810 | 235 | 215 | 3.5 ± 0.06 |
SE standard error
a Average percent divergence could not be determined for full-length IC elements due to low copy number
Fig. 3Phylogenetic tree of IC elements identified in this study with eight other members of the Tc1/mariner superfamily based on their transposases. Bootstrapped (1000 replicates) phylogenetic trees were inferred by using the Maximum Likelihood method in IQ-TREE [27]. Each sequence (except the DD37E and DD41D subclasses) contains the name of the transposon, the gene sequence number corresponding to the transposon, and the Latin abbreviation of the species in which the transposon is located. Pp:Pleuronectes platessa;Rp:Rana pipiens; Aa:Anopheles albimanus; Bm:Bombyx mori; Mh:Misgolas hubbardi; Bt:Bactrocera tryoni; Pe:Phyllostachys edulis; Os:Oryza sativa; Gm:Glycine max; Ls:Lepeophtheirus salmonis; Bd:Bactrocera dorsalis; Aa:Aedes atropalpus; Oe:Ochlerotatus epactius; Ag:Anopheles gambiae; Serratia odorifera Sf:Shigella flexneri; Ss:Shigella sonnei; Hs:Homo sapiens; Cc:Ceratitis capitata; Dm:Drosophila mauritiana; Dm:Drosophila melanogaster; At:Arabidopsis thaliana; Aa:Aspergillus awamori; Fo:Fusarium oxysporum; Mg:Magnaporthe grisea
Fig. 4Sequence identities between IC family and eight other families. The sequence identities were measured by pairwise comparisons of full-length transposases
Fig. 5Pairwise distances of IC elements and RAG1. The distances are obtained from all possible pairwise comparisons (n = 196; labeled on the x axis) between the four (Cluster A) and 20 (Cluster B) species in which ICs were identified and complete. The coding sequence (CDS) regions of the RAG1 gene in the NCBI database are available (Additional file 7: Text S3 and Additional file 8: Text S4)
Fig. 6Sequence identities between IC elements among species. The sequence identities were measured by pairwise comparisons of full-length IC consensus sequences. Auli: Austrofundulus limnaeus; Pipr: Pimephales promelas; Hete: Helostoma temminkii; Anja: Anguilla japonica; Pema: Periophthalmus magnuspinnatus; Pyna: Pygocentrus nattereri; Phph: Phycis phycis