| Literature DB >> 22628461 |
Eugénie Pessia1, Alexandra Popa, Sylvain Mousset, Clément Rezvoy, Laurent Duret, Gabriel A B Marais.
Abstract
GC-biased gene conversion (gBGC) is a process that tends to increase the GC content of recombining DNA over evolutionary time and is thought to explain the evolution of GC content in mammals and yeasts. Evidence for gBGC outside these two groups is growing but is still limited. Here, we analyzed 36 completely sequenced genomes representing four of the five major groups in eukaryotes (Unikonts, Excavates, Chromalveolates and Plantae). gBGC was investigated by directly comparing GC content and recombination rates in species where recombination data are available, that is, half of them. To study all species of our dataset, we used chromosome size as a proxy for recombination rate and compared it with GC content. Among the 17 species showing a significant relationship between GC content and chromosome size, 15 are consistent with the predictions of the gBGC model. Importantly, the species showing a pattern consistent with gBGC are found in all the four major groups of eukaryotes studied, which suggests that gBGC may be widespread in eukaryotes.Entities:
Mesh:
Year: 2012 PMID: 22628461 PMCID: PMC5635611 DOI: 10.1093/gbe/evs052
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FPhylogenetic tree of the 36 species studied. Major groups in eukaryotes (see Keeling et al. 2005) are indicated. Green circles indicate significant positive correlations between GC content (total GC content and/or GC3) and recombination rates (measured directly or using chromosome size as a proxy), consistent with gBGC (this work and others). Red circles indicate significant negative correlations between GC content and recombination rates, not consistent with gBGC. Filled circles indicate new observations from the present study. The “?” indicates when results using direct or indirect measures of recombination rates are not fully consistent.
Correlation between Recombination Rates and GC Content among Eukaryotes
| Species | Eukaryotic groups | Chromosome number | Genetic map | Total GC/rec rates | Chrom size/rec rates |
|---|---|---|---|---|---|
|
| Unikonts | 16 | 861 | 0.62* (*) | −0.6* (*) |
|
| Unikonts | 13 | 285 | 0.04 ns (ns) | −0.14 ns (ns) |
|
| Unikonts | 8 | 150 | 0.29 ns (ns) | −0.05 ns (ns) |
|
| Unikonts | 19 | 10195 | 0.68* (*) | −0.5* (ns) |
|
| Unikonts | 22 | 28121 | 0.75*** (**) | −0.87*** (***) |
|
| Unikonts | 27 | 9268 | 0.89*** (***) | −0.97*** (***) |
|
| Unikonts | 13 | 276 | −0.59* (ns) | 0.21 ns (ns) |
|
| Unikonts | 5 | 780 | 0.5 ns (ns) | −1* (*) |
|
| Unikonts | 4 | 67 | 0.8 ns (ns) | −0.4 ns (ns) |
|
| Unikonts | 16 | 2008 | 0.74* (*) | −0.35 ns (ns) |
|
| Excavates | 11 | 119 | 0.14 ns (ns) | −0.09 ns (ns) |
|
| Chromal | 14 | 3438 | 0.37 ns (ns) | −0.54* (ns) |
|
| Plantae | 5 | 676 | 0 ns (ns) | −0.2 ns (ns) |
|
| Plantae | 19 | 540 | 0.06 ns (ns) | −0.28 ns (ns) |
|
| Plantae | 19 | 515 | −0.33 ns (ns) | −0.56* (*) |
|
| Plantae | 12 | 1202 | −0.18 ns (ns) | 0.53 ns (ns) |
|
| Plantae | 10 | 2029 | 0.5 ns (ns) | 0.21 ns (ns) |
aThe eukaryotic groups relate to those shown in figure 1. Chromal, Chromalveolates.
bNumber of markers in genetic maps.
cValues are Spearman correlation coefficients, then come P values: ns, nonsignificant, * <0.05, ** <10−3, *** <10−4 and q values (from FDR corrections for multiple tests) are indicated in parentheses.
dHere is indicated the number of chromosome arms instead of the number of chromosomes.
Correlation between Chromosome Size and GC Content among Eukaryotes
| Species | Eukaryotic groups | Chromosome number | Mean GC content (%) | Statistical power | GC total/chrom size | GC3/chrom size |
|---|---|---|---|---|---|---|
|
| Unikonts | 11 | 47 | 41 | 0.3 ns (ns) | 0.06 ns (ns) |
|
| Unikonts | 3 | 36 | 0 | −0.5 ns (ns) | −0.5 ns (ns) |
|
| Unikonts | 16 | 38 | 70 | −0.83 | −0.87*** (***) |
|
| Unikonts | 13 | 39 | 52 | −0.69* (*) | −0.71* (*) |
|
| Unikonts | 8 | 41 | 25 | 0.24 ns (ns) | 0.71 ns (ns) |
|
| Unikonts | 6 | 49 | 14 | 0.77 ns (ns) | −0.09 ns (ns) |
|
| Unikonts | 8 | 50 | 25 | 0.71 ns (ns) | 0.26 ns (ns) |
|
| Unikonts | 7 | 52 | 22 | −0.11 ns (ns) | 0.07 ns (ns) |
|
| Unikonts | 4 | 48 | 0 | 0.4 ns (ns) | 0.4 ns (ns) |
|
| Unikonts | 23 | 54 | 100 | −0.46* (ns) | −0.47* (*) |
|
| Unikonts | 14 | 49 | 58 | −0.33 ns (ns) | −0.72* (*) |
|
| Unikonts | 13 | 52 | 52 | −0.91*** (***) | −0.68* (*) |
|
| Unikonts | 9 | 38 | 28 | −0.1 ns (ns) | −0.07 ns (ns) |
|
| Unikonts | 20 | 42 | 99 | −0.28 ns (ns) | −0.26 ns (ns) |
|
| Unikonts | 23 | 41 | 100 | −0.57* (*) | −0.54* (*) |
|
| Unikonts | 29 | 41 | 100 | −0.93*** (***) | −0.97*** (***) |
|
| Unikonts | 13 | 36 | 52 | 0.2 ns (ns) | 0.29 ns (ns) |
|
| Unikonts | 6 | 35 | 14 | −0.54 ns (ns) | −0.26 ns (ns) |
|
| Unikonts | 5 | 42 | 7 | −0.3 ns (ns) | −0.5 ns (ns) |
|
| Unikonts | 16 | 35 | 70 | −0.03 ns (ns) | −0.16 ns (ns) |
|
| Unikonts | 6 | 22 | 14 | −0.94* (*) | −0.6 ns (ns) |
|
| Excavates | 11 | 46 | 41 | 0.73* (*) | 0.48 ns (ns) |
|
| Excavates | 36 | 60 | 100 | −0.85*** (***) | −0.82*** (***) |
|
| Chromal | 3 | 26 | 0 | −1 ns (ns) | −1 ns (ns) |
|
| Chromal | 114 | 28 | 100 | −0.84*** (***) | −0.89*** (***) |
|
| Chromal | 14 | 19 | 58 | −0.8 | −0.77* (*) |
|
| Chromal | 8 | 30 | 25 | 0.1 ns (ns) | −0.1 ns (ns) |
|
| Chromal | 23 | 47 | 100 | −0.06 ns (ns) | −0.87*** (***) |
|
| Chromal | 33 | 49 | 100 | −0.18 ns (ns) | −0.46* (*) |
|
| Plantae | 19 | 60 | 94 | −0.72** (*) | −0.66* (*) |
|
| Plantae | 15 | 64 | 63 | −0.83 | −0.42 ns (ns) |
|
| Plantae | 5 | 36 | 7 | −0.2 ns (ns) | −0.3 ns (ns) |
|
| Plantae | 19 | 34 | 94 | 0.37 ns (ns) | −0.31 ns (ns) |
|
| Plantae | 19 | 33 | 94 | 0.22 ns (ns) | 0.36 ns (ns) |
|
| Plantae | 12 | 44 | 46 | 0.47 ns (ns) | 0.48 ns (ns) |
|
| Plantae | 10 | 44 | 36 | 0.3 ns (ns) | 0.68* (*) |
aThe eukaryotic groups relate to those shown in figure 1. Chromal, Chromalveolates.
bStatistical power for chromosome number ≥23 is set to 100%.
cValues are Spearman correlation coefficients, then come P values: ns, nonsignificant, * <0.05,** <10−3,*** <10−4 and q values (from FDR corrections for multiple tests) are indicated in parentheses.
dHere is indicated the number of chromosome arms instead of the number of chromosomes.
FExamples of relationships between chromosome size and total GC content. (a) L. major. (b) T. brucei. (c) G. theta. The values above the plots indicate the number of similar observations that were made among the 36 species (e.g., N = 12 for [a] means 12 significant positive correlations). ρ = Spearman coefficient. Statistical significance: ns, nonsignificant, * <0.05, ** <10−3, *** <10−4.