| Literature DB >> 23595021 |
Natacha Senerchia1, Thomas Wicker, François Felber, Christian Parisod.
Abstract
Transposable elements (TEs) represent a major fraction of plant genomes and drive their evolution. An improved understanding of genome evolution requires the dynamics of a large number of TE families to be considered. We put forward an approach bypassing the required step of a complete reference genome to assess the evolutionary trajectories of high copy number TE families from genome snapshot with high-throughput sequencing. Low coverage sequencing of the complex genomes of Aegilops cylindrica and Ae. geniculata using 454 identified more than 70% of the sequences as known TEs, mainly long terminal repeat (LTR) retrotransposons. Comparing the abundance of reads as well as patterns of sequence diversity and divergence within and among genomes assessed the dynamics of 44 major LTR retrotransposon families of the 165 identified. In particular, molecular population genetics on individual TE copies distinguished recently active from quiescent families and highlighted different evolutionary trajectories of retrotransposons among related species. This work presents a suite of tools suitable for current sequencing data, allowing to address the genome-wide evolutionary dynamics of TEs at the family level and advancing our understanding of the evolution of nonmodel genomes.Entities:
Keywords: 454 pyrosequencing; Aegilops; molecular population genetics; repetitive fraction composition; transposable elements; whole-genome snapshot
Mesh:
Substances:
Year: 2013 PMID: 23595021 PMCID: PMC4104650 DOI: 10.1093/gbe/evt064
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FOverview of the approach used here to identify and classify 454 reads into families of TEs, and then investigate their evolutionary dynamics through molecular population genetics.
Proportions of 454 Reads Corresponding to Known TEs, Organelle, Coding, and Low Complexity (SSR) Sequences
| Genome fraction | ||
|---|---|---|
| TEs | 71.3% | 72% |
| Organelle | 4.6% | 4.7% |
| Coding sequences | 1.4% | 1.4% |
| SSR | 0.3% | 0.6% |
| Unclassified | 22.4% | 21.3% |
| Total number of 454 reads | 667,485 | 646,327 |
FContribution of 44 major LTR retrotransposon families to 454 reads in Aegilops cylindrica (CY, dark gray) and Ae. geniculata (GE, light gray) with 95% confidence interval estimated by resampling. Families are grouped according to their superfamilies: CO, copia; GY, gypsy; X, unclassified retrotransposon. *,†mark TE families making at least 50% of the genome complement of Ae. cylindrica and Ae. geniculata, respectively. Proportions for BARE1, Fatima, and Sabrina are not at scale and are indicated by values.
Evolutionary Genetics of Copies of LTR Retrotransposons in Aegilops cylindrica (CY) and Ae. geniculata (GE)
| Name | Alignment | Tree | CY Mismatch Distribution | GE Mismatch Distribution | τ | π CY | π GE | |
|---|---|---|---|---|---|---|---|---|
| 300/23/27 | 0.7921 | I | 1 | 2 | — | 0.04 | 0.10 | |
| 180/47/35 | 0.6505 | I | 1 | 2 | — | 0.14 | 0.18 | |
| 150/14/37 | 0.4449 | I | 2 | 1 | — | 0.12 | 0.15 | |
| 300/286/133 | 0.3013 | II | 1 | 2 | GE | 0.06 | 0.03 | |
| 300/28/27 | 0.2166 | II | 1 | 2 | GE | 0.14 | 0.12 | |
| 300/23/59 | 0.2121 | II | 2 | 1 | CY | 0.09 | 0.09 | |
| 300/187/147 | 0.1386 | II | 1 | 2 | GE | 0.04 | 0.05 | |
| 295/58/149 | 0.3162 | II | 1 | 1 | GE | 0.19 | 0.16 | |
| 250/20/29 | 0.3126 | 1 | 1 | GE | 0.12 | 0.06 | ||
| 300/65/90 | 0.2056 | II | 1 | 1 | — | 0.08 | 0.06 | |
| 300/71/71 | 0.2606 | III | 1 | 1 | — | 0.11 | 0.10 | |
| 300/97/180 | 0.1117 | III | 1 | 1 | — | 0.05 | 0.06 | |
| 300/162/204 | 0.0915 | III | 1 | 1 | — | 0.04 | 0.06 | |
| 276/50/43 | 0.076 | III | 1 | 1 | — | 0.10 | 0.11 | |
| 250/22/11 | 0.0528 | III | 1 | 1 | — | 0.10 | 0.10 | |
| 300/82/97 | 0.0376 | III | 1 | 1 | — | 0.09 | 0.13 | |
| 200/35/20 | 0.0362 | III | 1 | 1 | — | 0.07 | 0.08 | |
| 300/22/21 | 0.0306 | III | 1 | 1 | — | 0.06 | 0.10 | |
| 300/43/75 | 0.0249 | III | 1 | 1 | — | 0.06 | 0.06 | |
| 250/51/46 | 0.0238 | III | 2 | 1 | — | 0.08 | 0.08 | |
| 285/24/44 | 0.0227 | III | 1 | 1 | — | 0.09 | 0.12 | |
| 280/85/166 | 0.0194 | III | 2 | 1 | — | 0.11 | 0.10 | |
| 300/32/51 | 0.0141 | III | 2 | 1 | — | 0.08 | 0.08 | |
| 288/48/83 | 0.0230 | IV | 1 | 1 | — | 0.09 | 0.10 | |
| 225/307/245 | 0.0078 | IV | 1 | 1 | — | 0.06 | 0.07 | |
| 195/40/11 | 0 | IV | 1 | 1 | — | 0.18 | 0.18 | |
| 300/128/5 | 0 | IV | 1 | 1 | — | 0.06 | NA |
aLength of the sequence alignment from both species/number of reads in CY/number of reads in GE.
bTopology of the maximum likelihood tree.
cNumber of significant peaks detected in the mismatch distribution identified in the species-specific alignments.
dSpecies with a significantly lower time since expansion (τ) than the other species and the total alignment, nonsignificant (—).
eNucleotide diversity (π) among copies within each species.
*P < 0.05.
FExamples of phylogenetic relationships among copies of LTR retrotransposon families in Aegilops cylindrica (green) and Ae. geniculata (orange). Unrooted accelerated maximum likelihood trees distinguish four main topologies: (a) trees with only few species-specific clades of insertions as shown by the Lila family (referred as Tree I), (b) Tree II is composed of several species-specific clades of insertions, such as the family BARE1, (c) Tree III with only few species-specific clades but preponderant mixed-species clades of insertions, as shown by the Maximus family, and (d) families such as Hawi showing only mixed-species clades of insertions (Tree IV). Scale bar represents the branch lengths. In each panel, the distribution of distances among sequences from species-specific alignments (i.e., mismatch distribution) is shown using blue and red for significantly positive and negative slopes respectively.