| Literature DB >> 25573958 |
Cristian Chaparro1, Thomas Gayraud2, Rogerio Fernandes de Souza3, Douglas Silva Domingues4, Sélastique Akaffou5, Andre Luis Laforga Vanzela3, Alexandre de Kochko2, Michel Rigoreau6, Dominique Crouzillat6, Serge Hamon2, Perla Hamon2, Romain Guyot7.
Abstract
A novel structure of nonautonomous long terminal repeat (LTR) retrotransposons called terminal repeat with GAG domain (TR-GAG) has been described in plants, both in monocotyledonous, dicotyledonous and basal angiosperm genomes. TR-GAGs are relatively short elements in length (<4 kb) showing the typical features of LTR-retrotransposons. However, they carry only one open reading frame coding for the GAG precursor protein involved for instance in transposition, the assembly, and the packaging of the element into the virus-like particle. GAG precursors show similarities with both Copia and Gypsy GAG proteins, suggesting evolutionary relationships of TR-GAG elements with both families. Despite the lack of the enzymatic machinery required for their mobility, strong evidences suggest that TR-GAGs are still active. TR-GAGs represent ubiquitous nonautonomous structures that could be involved in the molecular diversities of plant genomes.Entities:
Keywords: GAG; LTR-retrotransposons; conservation in plant genomes; nonautonomous elements
Mesh:
Substances:
Year: 2015 PMID: 25573958 PMCID: PMC4350172 DOI: 10.1093/gbe/evv001
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FConserved structures of nonautonomous LTR-retrotransposons documented in plant genomes. Autonomous refers to the structure of complete LTR-retrotransposons (here Copia-like): The coding regions are in gray; the PBS motif is represented as a black triangle and the PPT is represented as a white triangle; GAG, capsid; AP, aspartic protease; INT, integrase; RNAse, RNAse H. BARE-2 refers to the BARE-2 nonautonomous found in barley (Tanskanem et al. 2007).
List of Primers Used for RT-PCR Analysis
| Primers | Sequences (5′–3′) | Product Size (bp) |
|---|---|---|
| TRIM-1-S-F | CACCTCCAACGGTTGATTCT | 361 |
| TRIM-1-S-R | ATGTGTAGTTGCCCCGAGTC | |
| TR-GAG1-F | GCAGCAGACCTCTGGAAAAA | 328 |
| TR-GAG1-R | TGGTTTGCCTTCCTTTGTTT | |
| G3-F | ACGAGTGGGTTTCCTGAGTG | — |
| G3-R | TGGGTCTCTGGAACTTACCG |
aControl primers used as in Guyot et al. (2009).
FStructure and graphical alignments of the nonautonomous LTR-retrotransposons TRIM-1 family. (A) Schematic representation of the TRIM-1-S element and alignment of five different C. canephora TRIM-1-S genomic copies against themselves using Dotter (Sonnhammer and Durbin 1995). (B) Schematic representation of the TR-GAG1 element and alignment of five different C. canephora TR-GAG1 genomic copies against themselves using Dotter. (C) Dotter alignment between TR-GAG1 (horizontal sequence) and TRIM-1-S (vertical sequence).
FSchematic representation of the TR-GAG1 structure. The TR-GAG1 element contains the following sequence characteristics: LTR, PBS, PPT, and an ORF harboring known GAG motifs (here UBN2 and Zf-C2HC motifs). The element shown is located on “Chr. 0” positions 113020990–113023502 from the C. canephora draft genome (http://coffee-genome.org).
FCharacterization of TR-GAG families in the C. canephora draft genome. (A) Dot-plot of 130 predicted TR-GAG sequences against themselves. TR-GAGs were predicted by LTR_STRUC and filter out according to features described for TR-GAG1. Sequences were clustered by similarity. (B) Detailed structure of one copy (Chr. 4, positions 21003142–21006851) of the TR-GAG2 family.
Estimation of the TR-GAG Family’s Copy Number in Coffea Genomes Using 454 Sequencing Survey
| Species | Ploidy Level | Estimated Genome Size (Mb) | #454 Sequences | Produced Bases (Mb) | Genome Coverage (%) | TR-GAG1 Copies | TR-GAG2 Copies | TR-GAG3 Copies | TR-GAG4 Copies | TR-GAG5 Copies |
|---|---|---|---|---|---|---|---|---|---|---|
| 2n | 700 | 106,459 | 45.05 | 6.40 | 172,48 | 563,07 | 6,74 | 8,18 | 27,28 | |
| 2n | 700 | 149,196 | 67.08 | 9.58 | 69,61 | 390,62 | 14,85 | 22,20 | 44,88 | |
| 4n | 1,240 | 122,258 | 54.5 | 4.39 | 111,55 | 1168,72 | 55,40 | 10,21 | 35,21 | |
| 2n | 645 | 101,309 | 42.1 | 6.52 | 62,56 | 659,44 | 28,64 | 26,14 | 22,42 | |
| 2n | 863 | 194,3 | 60.511 | 2.25 | 97,94 | 696,71 | 13,97 | 9,00 | 24,68 | |
| 2n | 506 | 88,498 | 34.19 | 5.7 | 54,02 | 103,02 | 2,96 | 0,00 | 16,04 | |
| 2n | 593 | 215,117 | 91.7 | 15.4 | 59,76 | 157,79 | 1,12 | 7,34 | 13,67 | |
| 2n | 469 | 160,479 | 67.99 | 14.49 | 26,77 | 80,00 | 0,00 | 0,00 | 13,64 | |
| 2n | 513 | 160,107 | 72.66 | 14.10 | 48,45 | 34,35 | 0,92 | 0,00 | 21,63 | |
| 2n | 682 | 163,873 | 76.65 | 11.23 | 61,91 | 144,93 | 0,00 | 0,00 | 18,40 | |
| 2n | 593 | 112,793 | 46.25 | 7.8 | 43,56 | 336,74 | 1,35 | 0,00 | 24,50 | |
| 2n | 748 | 49,789 | 19.44 | 2.94 | 5,07 | 6,96 | 0,00 | 0,00 | 0,00 |
Note.—Only 454 reads with a minimum of 80% of nucleotide identity over 80% of the read length were considered. Genome sizes were listed in Noirot et al. (2003) and Razafinarivo et al. (2012).
FIdentification of TR-GAG families in available plant genomes. (A) Dot-plot of predicted TR-GAG sequences from 23 plant genomes against themselves. TR-GAGs were predicted by LTR_STRUC and filter out according to features described for TR-GAG1. Sequences were clustered by plant genomes. (B) Detailed structure of one TR-GAG family for seven different plant genomes.