| Literature DB >> 31857829 |
Radka Symonová1, Alexander Suh2,3.
Abstract
BACKGROUND: Teleost fish genome size has been repeatedly demonstrated to positively correlate with the proportion of transposable elements (TEs). This finding might have far-reaching implications for our understanding of the evolution of nucleotide composition across vertebrates. Genomes of fish and amphibians are GC homogenous, with non-teleost gars being the single exception identified to date, whereas birds and mammals are AT/GC heterogeneous. The exact reason for this phenomenon remains controversial. Since TEs make up significant proportions of genomes and can quickly accumulate across genomes, they can potentially influence the host genome with their own GC content (GC%). However, the GC% of fish TEs has so far been neglected.Entities:
Keywords: GC content; Genome evolution; Nucleotide composition; Teleost fish; Transposon
Year: 2019 PMID: 31857829 PMCID: PMC6909575 DOI: 10.1186/s13100-019-0195-y
Source DB: PubMed Journal: Mob DNA
Fig. 1Genome size, transposable elements, and nucleotide composition. a Abundance of transposable elements in 29 teleosts, one non-teleost ray-finned fish (spotted gar, L. oculatus; Loc) with a AT/GC heterogeneous genome, one lobe-finned fish (L. chalumnae; Lch), one lamprey (P. marinus; Pma) and one shark (C. milii; Cmi) species related to their host genome size (genome size as C-value in picograms, pg), data from [3]. b GC percentage (GC%) of 46 fish genomes with available genome assemblies (excluding salmonids with their rediploidized genomes exceptionally enriched in extremely GC-rich rRNA genes [31]) negatively correlates with fish genome size based on averaged cytological measurements (C-value in pg, multiple C-value records were averaged). C-value data from the Animal Genome Size Database [32], GC% data from GenBank [33]. c GC% of TE consensus sequences (not accounting for their copy number within genomes) positively correlates with the overall GC% of the host genome in 25 ray-finned fish species, one lancelet (Branchiostoma belcheri; Bbe), one lamprey (Pma), one shark (Cmi) and one coelacanth included in FishTEDB [34]. Genomic GC% data are from GenBank [33], GC% of TEs was calculated from species-specific TE consensus sequence libraries from FishTEDB [34]. d GC% of genome assemblies (in Mb) of 58 fish species listed GenBank [33]
Fig. 2Comparison of GC% of repetitive and non-repetitive genomic fractions in 54 fish species from the Ensembl database (Release 98). The Y-axis shows GCREP, i.e. GC% of repeats (including low-complexity regions) masked with the RepeatMasker tool, while the X-axis shows GCNONREP of the non-repetitive fraction of each assembly. Data used for this analysis are available in the Additional file 2: Table S2
Fig. 3GCTE in the major groups of Class I and Class II TEs, calculated as sum of GC% for all 28 fish species available in the FishTEDB database. TE consensus sequences for these calculations are from the “Browse” section of the FishTEDB database [34]
Fig. 4Comparison GC% between TE consensus sequences from Class I (retrotransposons) and Class II (DNA transposons) in six selected fish species (highlighted in the main text) listed in the FishTEDB database [34]