| Literature DB >> 22675029 |
Abstract
L1 elements are mammalian non-long terminal repeat retrotransposons, or long interspersed elements (LINEs), that significantly influence the dynamics and fluidity of the genome. A series of observations suggest that plant L1-clade LINEs, just as mammalian L1s, mobilize both short interspersed elements (SINEs) and certain messenger RNA by recognizing the 3'-poly(A) tail of RNA. However, one L1 lineage in monocots was shown to possess a conserved 3'-end sequence with a solid RNA structure also observed in maize and sorghum SINEs. This strongly suggests that plant LINEs require a particular 3'-end sequence during initiation of reverse transcription. As one L1-clade LINE was also found to share the 3'-end sequence with a SINE in a green algal genome, I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized the specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22675029 PMCID: PMC3472496 DOI: 10.1093/molbev/mss147
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
FThe number of LINE families belonging to each LINE clade according to biological taxa. LINE clades in which the partner LINE of a SINE was identified are shown. Remaining clades are grouped as “Others” (Repbase 16.10). “other vertebrates”: nonmammalian vertebrates; “land plants”: mostly flowering plants.
3′-Repeats of Plant LINE Families.
| Species | LINE clade | Families | 3′-repeat | ||
|---|---|---|---|---|---|
| (A) | Other repeats | None | |||
| Flowering plants | L1 | 233 | 224 | 0 | 9 |
| RTE | 7 | 0 | 7 | 0 | |
| Green algae | L1 | 15 | 2 | 8 | 5 |
| RandI | 8 | 0 | 8 | 0 | |
| RTEX | 6 | 0 | 6 | 0 | |
a(TTG)n and (TTGATG)n.
bL1-1_CR (Chlamydomonas) and Zepp (Chlorella).
c(CATA)n, (CA)n, (CAA)n, and (TAA)n.
d(ATT)n and (CTATTT)n.
e(CA)n, (CAA)n, (CCAT)n, (ACAATG)n, and (CTTGTAA)n.
FPhylogenetic relationships among the L1-clade LINEs. LINE-clades are shown in bold italics. Several lineages in which a stringent or relaxed L1 was found are indicated by asterisks: (*1) LINE1-1_ZM (stringent), (*2) L1-1_CR (stringent), and (*3) L1HS (relaxed). The phylogenetic relationships among 146 LINEs were inferred using the amino acid sequences of ORF2 proteins from plant L1 entries in the database (Repbase 15.08; Viridiplantae) and from other LINEs (Ohshima and Okada 2005). A total of 404 positions made up the final data set. The linearized NJ consensus tree obtained from bootstrap analysis with 1,000 replications is shown (an ML consensus tree formed with the same data set is available as supplementary fig. S5, Supplementary Material online). The evolutionary distances were computed using the Jones-Taylor-Thornton (JTT) matrix-based method. For clarity, some clades were collapsed with filled triangles, the widths of which were in proportion to the number of LINEs. The full expanded tree is shown in supplementary figure S4, Supplementary Material online. Bootstrap values are only shown for nodes with scores > 45.
FProposed model for the 3′-end recognition of L1-clade LINEs. The ancestral L1-clade LINE in the ancestral green plant possessed a stringent, nonmammalian-type RNA recognition property. During the course of plant evolution, a L1 lineage(s) lost the ability to specifically recognize the RNA template for reverse transcription, introducing relaxed 3′-end recognition in land plants. Processed pseudogenes have been reported in eudicots, monocots, and mammals. ME1-3: plant L1 lineages; (e): eudicots; (m): monocots; M, F: vertebrate L1 lineages; (m): mammals; (f): fish.