| Literature DB >> 30590692 |
Marjolaine Rousselle1, Alexandre Laverré1, Emeric Figuet1, Benoit Nabholz1, Nicolas Galtier1.
Abstract
Recombination is expected to affect functional sequence evolution in several ways. On the one hand, recombination is thought to improve the efficiency of multilocus selection by dissipating linkage disequilibrium. On the other hand, natural selection can be counteracted by recombination-associated transmission distorters such as GC-biased gene conversion (gBGC), which tends to promote G and C alleles irrespective of their fitness effect in high-recombining regions. It has been suggested that gBGC might impact coding sequence evolution in vertebrates, and particularly the ratio of nonsynonymous to synonymous substitution rates (dN/dS). However, distinctive gBGC patterns have been reported in mammals and birds, maybe reflecting the documented contrasts in evolutionary dynamics of recombination rate between these two taxa. Here, we explore how recombination and gBGC affect coding sequence evolution in mammals and birds by analyzing proteome-wide data in six species of Galloanserae (fowls) and six species of catarrhine primates. We estimated the dN/dS ratio and rates of adaptive and nonadaptive evolution in bins of genes of increasing recombination rate, separately analyzing AT → GC, GC → AT, and G ↔ C/A ↔ T mutations. We show that in both taxa, recombination and gBGC entail a decrease in dN/dS. Our analysis indicates that recombination enhances the efficiency of purifying selection by lowering Hill-Robertson effects, whereas gBGC leads to an overestimation of the adaptive rate of AT → GC mutations. Finally, we report a mutagenic effect of recombination, which is independent of gBGC.Entities:
Keywords: GC-biased gene conversion; adaptive substitution rate; birds; coding sequence evolution; primates; recombination
Mesh:
Year: 2019 PMID: 30590692 PMCID: PMC6389324 DOI: 10.1093/molbev/msy243
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
. 1.dN, dS, and dN/dS ratio against GC3 for each species for all substitutions taken together. Statistics are estimated under a model assuming base composition nonstationarity.
Spearman Correlation Coefficients between GC3 and Divergence Estimates Obtained with a Model Assuming Nonstationarity.
| Species | dN | dS | dN/dS | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| All | WS | SW | GC-conservative | All | WS | SW | GC-conservative | All | WS | SW | GC-conservative | |
| 0.867 | 0.915 | 0.115 | 0.952 | 1 | 1 | 1 | 0.952 | –0.952 | –0.988 | –0.769 | –0.855 | |
| 0.879 | 0.139 | 0.818 | 0.721 | 0.988 | 0.976 | 0.964 | 0.988 | –0.709 | –0.952 | 0.188 | –0.721 | |
| –0.103 | –0.515 | 0.030 | –0.067 | 0.939 | 0.891 | 0.539 | 0.903 | –1 | –0.976 | –0.539 | –0.939 | |
| 0.927 | 0.818 | 0.939 | 0.867 | 1 | 0.988 | 0.891 | 0.988 | –0.564 | –0.939 | 0.648 | –0.636 | |
| –0.467 | –0.612 | –0.636 | 0.091 | 0.867 | 0.952 | 0.661 | 0.769 | –1 | –1 | –0.879 | –0.939 | |
| 0.830 | –0.356 | 0.830 | 0.891 | 1 | 1 | 1 | 1 | –1 | –1 | –0.2 | –0.939 | |
| –0.369 | –0.491 | –0.042 | –0.164 | 0.479 | 0.794 | –0.285 | 0.297 | –0.564 | –0.903 | 0.309 | –0.6 | |
| –0.127 | –0.806 | 0.115 | 0.103 | 0.636 | 0.952 | –0.297 | 0.6 | –0.430 | –0.927 | 0.236 | –0.467 | |
| –0.236 | –0.648 | 0.042 | 0.467 | 0.369 | 0.964 | 0.552 | 0.769 | –0.273 | –0.927 | –0.176 | –0.624 | |
| 0.055 | –0.212 | 0.321 | 0.079 | 0.745 | 0.891 | 0.345 | 0.455 | –0.491 | –0.806 | 0.067 | –0.297 | |
| 0.588 | 0.624 | 0.467 | 0.673 | 0.879 | 0.988 | 0.709 | 0.964 | –0.479 | –0.939 | –0.151 | –0.891 | |
| 0.527 | 0.479 | 0.503 | 0.612 | 0.891 | 0.976 | 0.333 | 0.927 | –0.527 | –0.964 | 0.055 | –0.927 | |
Note.—Significance levels are showed with * before the FDR correction (False Discovery Rate), and with two shades of red (if the correlation is positive) or green (if the correlation in negative) after FDR correction (light: P-value < 0.05, dark: P-value < 0.01).
P-value < 0.05.
P-value < 0.01.
. 2.dN, dS, and dN/dS ratio against GC3 for each species and each type of substitutions (W → S, S → W, and GC-conservative substitutions). Statistics are estimated under a model assuming base composition nonstationarity.
. 3.π n/πs ratio against GC3 for each species (A: primates, B: Galloanserae) and each type of mutations (W → S, S → W, and GC-conservative mutations) and average nonsynonymous (C) and synonymous (D) allele frequency for each species for W → S, S → W, and GC-conservative SNPs (statistics estimated without masking CpG sites).
Spearman Correlation Coefficients between GC3 and πn, πs, πn/πs Obtained without Masking CpG Sites.
| Species | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| All | WS | SW | GC-conservative | All | WS | SW | GC-conservative | All | WS | SW | GC-conservative | |
| 0.22 | –0.34 | 0.34 | 0.23 | 0.90 | 0.842 | 0.6 | 0.70 | –0.77 | −0.78 | –0.22 | –0.36 | |
| 0.61 | 0.50 | 0.69 | 0.13 | 0.93 | 0.38 | 0.83 | 0.73 | –0.69 | –0.01 | –0.45 | –0.45 | |
| 0.18 | 0.12 | 0.12 | 0.18 | –0.43 | –0.57 | 0.28 | –0.01 | 0.35 | 0.39 | –0.054 | 0.2 | |
| –0.63 | –0.79 | 0.09 | –0.39 | 0.91 | 0.56 | 0.64 | –0.41 | –0.96 | –0.86 | –0.66 | 0.24 | |
| –0.5 | –0.83 | 0.06 | 0.2 | 0.93 | 0.50 | 0.64 | 0.57 | –0.84 | –0.80 | –0.43 | –0.29 | |
| 0.63 | 0.15 | 0.68 | 0.442 | 0.93 | 0.47 | 0.93 | 0.90 | –0.72 | –0.44 | –0.44 | –0.76 | |
| –0.1 | –0.78 | 0.17 | –0.55 | 0.83 | 0.90 | 0.24 | –0.32 | –0.75 | –0.93 | 0.11 | –0.33 | |
| 0.71 | –0.44 | 0.91 | 0.47 | 0.96 | 0.96 | 0.92 | 0.95 | –0.85 | –0.89 | –0.46 | –0.61 | |
| –0.12 | –0.44 | 0.05 | –0.45 | 0.86 | 0.87 | 0.90 | 0.51 | –0.88 | –0.83 | –0.55 | –0.81 | |
| –0.7 | –0.90 | 0.33 | –0.76 | 0.90 | 0.63 | 0.73 | –0.56 | –0.96 | –0.91 | –0.26 | –0.56 | |
| 0.34 | –0.33 | 0.52 | –0.22 | 0.91 | 0.96 | 0.90 | 0.80 | –0.81 | –0.85 | –0.15 | –0.78 | |
| 0.87 | –0.31 | 0.86 | 0.52 | 0.89 | 0.92 | 0.89 | 0.96 | –0.95 | –0.92 | –0.80 | –0.89 | |
Note.—Significance levels are showed with * before the FDR correction (False Discovery Rate), and with two shades of red (if the correlation is positive) or green (if the correlation in negative) after FDR correction (light: P-value < 0.05, dark: P-value < 0.01).
P-value < 0.05.
P-value < 0.01.
. 4.α, ωa and ωna estimates for each species and each type of mutations (all mutations, W→S, S→W and GC-conservative) using all genes. Statistics are obtained using the model “GammaZero”.
Spearman Correlation Coefficients between GC3 and α, ωa, and ωna Estimates Obtained without Masking CpG Sites.
| Species | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| All | WS | SW | GC-conservative | All | WS | SW | GC-conservative | All | WS | SW | GC-conservative | |
| 0.41 | 0.24 | 0.0060 | 0.490 | 0.2 | –0.24 | –0.030 | 0.466 | –0.27 | –0.21 | 0.0424 | –0.46 | |
| 0.28 | 0.32 | 0.35 | 0.28 | 0.33 | 0.24 | 0.357 | 0.381 | –0.2 | –0.34 | –0.21 | –0.38 | |
| –0.06 | 0.12 | 0.52 | 0.042 | –16 | –0.090 | 0.49 | 0.17 | –0.21 | –0.054 | –0.50 | –0.17 | |
| 0.38 | 0.393 | 0.381 | 0.35 | 0.6 | 0.187 | 0.49 | 0.35 | –0.41 | –0.33 | –0.15 | –0.35 | |
| 0.09 | 0.0424 | 0.22 | 0.57 | –0.08 | 0.22 | 0.187 | 0.57 | –0.3 | –0.15 | –0.33 | –0.57 | |
| 0.26 | 0.22 | 0.32 | –0.078 | 0.23 | 0.15 | 0.27 | –0.29 | –0.41 | –0.2 | –0.39 | 0.29 | |
| 0.45 | –0.20 | –0.031 | 0.082 | 0.1 | –0.15 | 0.0060 | –0.28 | –0.5 | 0.012 | –0.11 | –0.096 | |
| 0.03 | 0.143 | –0.14 | –0.13 | –0.16 | 0.078 | –0.30 | –0.16 | –0.13 | 0.056 | 0.14 | 0.093 | |
| 0.52 | 0.13 | –0.032 | 0.30 | 0.62 | –0.066 | –0.22 | 0.22 | –0.70 | –0.10 | 0.0064 | –0.27 | |
| 0.04 | 0.20 | –0.18 | 0.079 | –0.39 | –0.06 | –0.16 | 0.030 | –0.32 | –0.27 | 0.018 | –0.06 | |
| 0.03 | –0.055 | –0.28 | 0.10 | –0.18 | 0.006 | –0.29 | 0.13 | –0.19 | –0.11 | –0.10 | –0.12 | |
| 0.04 | 0.06 | –0.03 | 0.57 | –0.24 | –0.53 | –0.16 | 0.57 | –0.56 | –0.38 | 0.019 | –0.57 | |
Note.—α, ωa, and ωna are obtained using the model GammaZero. Significance levels are showed with * before the FDR correction (False Discovery Rate), and with two shades of red (if the correlation is positive) or green (if the correlation in negative) after FDR correction (light: P-value < 0.05, dark: P-value < 0.01).
P-value < 0.05.
P-value < 0.01.