| Literature DB >> 25896861 |
Tyler A Elliott1, T Ryan Gregory2.
Abstract
BACKGROUND: The genomes of eukaryotes vary enormously in size, with much of this diversity driven by differences in the abundances of transposable elements (TEs). There is also substantial structural and phylogenetic diversity among TEs, such that they can be classified into distinct classes, superfamilies, and families. Possible relationships between TE diversity (and not just abundance) and genome size have not been investigated to date, though there are reasons to expect either a positive or a negative correlation. This study compares data from 257 species of animals, plants, fungi, and "protists" to determine whether TE diversity at the superfamily level is related to genome size.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25896861 PMCID: PMC4438587 DOI: 10.1186/s12862-015-0339-8
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Number of superfamilies (TE diversity) and log-scale genome size (Mbp) in 257 eukaryote genomes. Brown points represent animal genomes, green points represent land plant genomes, purple points represent fungal genomes and red points represent “protist” genomes. This includes all available data, regardless of TE discovery and annotation method (cf. Figure 6).
Figure 2TE diversity versus genome size separated into the two TE classes. (A) Number of superfamilies’ (TE diversity) of DNA transposons and log-scale genome size (Mbp) in 257 eukaryote genomes. (B) Number of superfamilies’ (TE diversity) of retrotransposons and log-scale genome size (Mbp) in 257 eukaryote genomes. Brown points represent animal genomes, green points represent land plant genomes, purple points represent fungal genomes and red points represent “protist’ genomes.
Figure 6Effects of TE discovery method. The overall pattern of TE diversity versus genome size among eukaryotes according to whether TE discovery was A) based only on sequence similarity against an existing database or B) based on both sequence similarity and de novo discovery. (A much smaller number of studies used only de novo methods, and are not shown in a separate analysis). Importantly, the general patterns are the same regardless of TE discovery method(s) used (see also Figure 1). Brown points represent animal genomes, green points represent land plant genomes, purple points represent fungal genomes and red points represent “protist” genomes.
Figure 3Number of superfamilies (TE diversity) and genome size (Mbp) in 75 animal genomes. There was no linear relationship across all animals (r = −0.12, p > 0.3).
Figure 4Number of superfamilies (TE diversity) and genome size (Mbp) in 80 land plant genomes. The line represents the significant negative correlation between TE diversity and genome size among plants (r = −0.44, p < 0.0001).
Figure 5TE diversity versus genome size in fungi. Number of superfamilies (TE diversity) and genome size (Mbp) in 77 fungal genomes. The line represents the significant positive correlation between TE diversity and genome size in fungi (r = 0.764, p < 0.0001).
Summary statistics for TE diversity (number of superfamilies) in each of the taxonomic groups studied
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
| 18.32 | 14.91 | 10.53 | 5.66 | 7.96 |
|
| 9.04 | 6.58 | 3.12 | 3.32 | 4.58 |
|
| 38 | 28 | 25 | 17 | 16 |
|
| 81.79 | 43.36 | 9.75 | 11.02 | 20.96 |
TE superfamilies found in all taxonomic groups studied
|
|
|
|---|---|
|
| Tc1/ |
TE superfamilies found in only one taxonomic group
|
|
|
|
|---|---|---|
|
|
| RTETP, |
Percentage of species found with each superfamily
|
|
|
|
|
|
|---|---|---|---|---|
|
|
|
|
|
|
| CR1/L3 (78.67%) |
|
|
|
|
|
|
| Tc1/ | Tc1/ |
|
| L1 (68%) |
|
| L1 and |
|
| RTE (58.67%) | PIF/ |
| DIRS (32%) |
|
The top 5 percentage superfamily hits for each taxonomic group.