| Literature DB >> 27098918 |
Zhenglin Zhu1, Shengjun Tan2, Yaqiong Zhang2, Yong E Zhang2,3.
Abstract
RNA-based duplicated genes or functional retrocopies (retrogenes) are known to drive phenotypic evolution. Retrogenes emerge via retroposition, which is mainly mediated by long interspersed nuclear element 1 (LINE-1 or L1) retrotransposons in mammals. By contrast, long terminal repeat (LTR) retrotransposons appear to be the major player in plants, although an L1-like mechanism has also been hypothesized to be involved in retroposition. We tested this hypothesis by searching for young retrocopies, as these still retain the sequence features associated with the underlying retroposition mechanism. Specifically, we identified polymorphic retrocopies (retroCNVs) by analyzing public Arabidopsis (Arabidopsis thaliana) resequencing data. Furthermore, we searched for recently originated retrocopies encoded by the reference genome of Arabidopsis and Manihot esculenta. Across these two datasets, we found cases with L1-like hallmarks, namely, the expected target site sequence, a polyA tail and target site duplications. Such data suggest that an L1-like mechanism could operate in plants, especially dicots.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27098918 PMCID: PMC4838847 DOI: 10.1038/srep24755
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
RetroCNVs in Arabidopsis.
| PG | AS | E-E | E-I | I-E | IS | N | Flank | PolyA | TSD | TTAAAA | Mechanism |
|---|---|---|---|---|---|---|---|---|---|---|---|
| AT3G06040.1 | Can-0 | 7 | 17 | 14 | Chr2(+): 7.9M | 3 | N | N | Y | Uncertain | |
| AT3G08580.2 | No-0 | 25 | 22 | 25 | Chr2(−): 11.7M | 14 | LTR/LTR | N | N | N | LTR |
| AT5G58720.1 | Oy-0 | 108 | 73 | 72 | Chr4(+): 7.6M | 3 | Y | Y | Y | L1-like | |
| AT5G51410.2 | No-0 | 26 | 18 | 26 | Chr1(+): 12.3M | 1 | N | N | N | Uncertain |
“PG” means the parental gene from which the retroCNV is derived. Since parental gene could encode multiple isoforms, only one transcript model (“0.1” or “0.2”) with the highest sequence similarity to the retroCNV is listed. “AS” denotes the accession in which the retroCNV is assembled, while “N” denotes the number of accessions in which the retroCNV is present. In the columns entitled “E-E” (reads mapped to exon-exon junctions), “E-I” (reads mapped to exon-intron junctions) or “I-E” (reads mapped to intron-exon junctions), the numbers refer to the total count of reads mapped to the corresponding junctions. “IS” is the coordinate of the insertion site with “+/−” showing the orientation of the retroCNV relative to the sense strand of the inserted chromosome (Chr). “Flank” shows the retrotransposons in the 5′/3′ flanking region of the retroCNV. Only the retroCNV derived from AT3G08580.2 is flanked by LTR retrotransposon at both sides whereas no recognizable retrotransposon is associated with the other three cases. The next three columns “PolyA”, “TSD” and “TTAAAA” list whether there is polyA, TSD or TTAAAA-like sequences in the flanking region, respectively. Given these sequence features, the mechanism is inferred in the last column.
Figure 1Schematic representation of three retrocopies.
Thick and thin boxes stand for the coding region and the untranslated region, respectively. The symbol like “H” refers to intron. The exon size is roughly drawn to scale. The arrow means the transcription direction. The dashed line shows the correspondance of sequences between the parental gene and the retroCNV. The retroposed segment is marked in purple with the other in light blue. For the retroCNV, the candidate target site, target site duplication and polyA tail are marked in orange, green, and red, respectively. Panel (A) shows the retroCNV derived from the parental gene AT5G58720.1 in Arabidopsis where a partial sequence derived from the last three exons of AT5G58720.1 was retroposed and inserted into Chromosome 4 (Chr 4). Panel (B) shows a retrocopy encoded by the Arabidopsis reference genome, and Panel (C) shows a retrocopy encoded by the M. esculenta reference genome. Interestingly, in Panel (B), an insertion of LTR retrotransposon (Copia) occurrs in the middle of the retrocopy, which is marked in dark yellow.
Figure 2The pipeline for the identification and assembly of retroCNVs.