| Literature DB >> 33871607 |
Abstract
LTR retrotransposons comprise a major component of the genomes of eukaryotes. On occasion, retrotransposon genes can be recruited by their hosts for diverse functions, a process formally referred to as co-option. However, a comprehensive picture of LTR retrotransposon gag gene co-option in eukaryotes is still lacking, with several documented cases exclusively involving Ty3/Gypsy retrotransposons in animals. Here, we use a phylogenomic approach to systemically unearth co-option of retrotransposon gag genes above the family level of taxonomy in 2,011 eukaryotes, namely co-option occurring during the deep evolution of eukaryotes. We identify a total of 14 independent gag gene co-option events across more than 740 eukaryote families, eight of which have not been reported previously. Among these retrotransposon gag gene co-option events, nine, four, and one involve gag genes of Ty3/Gypsy, Ty1/Copia, and Bel-Pao retrotransposons, respectively. Seven, four, and three co-option events occurred in animals, plants, and fungi, respectively. Interestingly, two co-option events took place in the early evolution of angiosperms. Both selective pressure and gene expression analyses further support that these co-opted gag genes might perform diverse cellular functions in their hosts, and several co-opted gag genes might be subject to positive selection. Taken together, our results provide a comprehensive picture of LTR retrotransposon gag gene co-option events that occurred during the deep evolution of eukaryotes and suggest paucity of LTR retrotransposon gag gene co-option during the deep evolution of eukaryotes.Entities:
Keywords: LTR retrotransposon; co-option; phylogenetics
Year: 2021 PMID: 33871607 PMCID: PMC8321522 DOI: 10.1093/molbev/msab101
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Fig. 1.The phylogenetic relationship between LTR retrotransposons that are closely related to Crtg proteins and representative LTR retrotransposons. This phylogenetic tree was reconstructed based on RT protein sequences. The LTR retrotransposons that are closely related to Crtg proteins in animals, plants and fungi were highlighted in purple, green, and blue, respectively.
Fig. 2.The evolutionary history of Crtg genes in animals. The host phylogenetic relationship is based on TimeTree and literature (Wiegmann et al. 2011; Kumar et al. 2017; Zhang et al. 2018; Martin et al. 2019), and gene syntenies flanking each Crtg gene are shown near the corresponding species.
Fig. 3.The evolutionary history of Crtg genes in plants. The host phylogenetic relationship is based on TimeTree and literature (Kumar et al. 2017; Janssens et al. 2020), and gene syntenies flanking each Crtg gene are shown near the corresponding species.
Fig. 4.The evolutionary history of Crtg genes in fungi. The host phylogenetic relationship is based on TimeTree and literature (Hirayama et al. 2010; Raja et al. 2011; Li et al. preprint; Tian et al. 2015), and gene syntenies flanking each Crtg gene are shown near the corresponding species.
Selection Analyses of Crtg Genes.
| Gene | No. of Sequences(used in selectionanalyses/total) | No. of Sites | d | M1a Versus M2a | M8a Versus M8 | BUSTED | FUBAR | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2Δlnl |
|
|
|
| 2Δlnl |
| Codons withd |
| No. of Sitesunder PositiveSelection | No. of Sitesunder NegativeSelection | ||||
|
| 7/7 | 596 | 0.039 | 0 | 1 | 0.699 | 0.301 | 0 | 1.014 | 0.314 | — |
| 0 | 354 |
|
| 82/108 | 190 | 0.209 | 0 | 1 | 0.808 | 0.192 | 0 | 0.845 | 0.358 | — |
| 0 | 156 |
|
| 27/47 | 258 | 0.343 | 0 | 1 | 0.606 | 0.394 | 0 | 0.944 | 0.331 | — |
| 1 | 92 |
|
| 120/152 | 285 | 0.352 | 69.112 | 9.990 × 10-16 | 0.573 | 0.386 | 0.041 | 28.113 | 0 | 27Q*; 233F*;235S**;236A**;237N**; 241T* |
| 1 | 161 |
|
| 4/4 | 259 | 0.44 | 0 | 1 | 0.64 | 0.36 | 0 | 0.861 | 0.353 | — |
| 1 | 13 |
|
| 2/2 | 217 | 0.996 | 0.001 | 0.999 | 0.005 | 0.995 | 0 | 0.001 | 0.972 | — | — | — | — |
|
| 62/63 | 293 | 0.115 | 0 | 1 | 0.844 | 0.156 | 0 | 0 | 1 | — |
| 0 | 285 |
|
| 14/22 | 236 | 0.113 | 0 | 1 | 0.71 | 0.29 | 0 | 0.032 | 0.859 | — |
| 0 | 145 |
The dN/dS values of the Ctrg genes were estimated using the one-ratio model (M0) in PAML.
2Δl represents twice of the difference in the natural logs of the likelihoods of the pairs of models (M1a vs. M2a and M8a vs. M8) being compared.
The P-value indicates the confidence with which the neutral models (M1a and M8a) can be rejected in favor of the positive selection models (M2a and M8), respectively.
Proportion of sites with omega < 1 (P0), omega = 1 (P1), and omega > 1 (P2).
Codons under positive selection with a posterior probability > 95% and 99% by Bayes empirical Bayes (BEB) analysis are labeled with one and two asterisks, respectively.
Fig. 5.The distribution of Crtg genes across eukaryotes. Phylogenetic tree of eukaryotes is based on literatures (Eichinger et al. 2005; Steenkamp et al. 2006; Burki et al. 2020).