| Literature DB >> 36099311 |
Ana Filipa Moutinho1,2, Adam Eyre-Walker2, Julien Y Dutheil1,3.
Abstract
Understanding the dynamics of species adaptation to their environments has long been a central focus of the study of evolution. Theories of adaptation propose that populations evolve by "walking" in a fitness landscape. This "adaptive walk" is characterised by a pattern of diminishing returns, where populations further away from their fitness optimum take larger steps than those closer to their optimal conditions. Hence, we expect young genes to evolve faster and experience mutations with stronger fitness effects than older genes because they are further away from their fitness optimum. Testing this hypothesis, however, constitutes an arduous task. Young genes are small, encode proteins with a higher degree of intrinsic disorder, are expressed at lower levels, and are involved in species-specific adaptations. Since all these factors lead to increased protein evolutionary rates, they could be masking the effect of gene age. While controlling for these factors, we used population genomic data sets of Arabidopsis and Drosophila and estimated the rate of adaptive substitutions across genes from different phylostrata. We found that a gene's evolutionary age significantly impacts the molecular rate of adaptation. Moreover, we observed that substitutions in young genes tend to have larger physicochemical effects. Our study, therefore, provides strong evidence that molecular evolution follows an adaptive walk model across a large evolutionary timescale.Entities:
Mesh:
Year: 2022 PMID: 36099311 PMCID: PMC9470001 DOI: 10.1371/journal.pbio.3001775
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 9.593
Kendall’s correlation coefficients for the relationship between ω, ω, and ω and gene age, for the analysis of gene age and the combined analyses of gene age with the respective cofactors: protein length, gene expression, protein intrinsic disorder, and the mean relative solvent accessibility per gene.
The combined probabilities for each cofactor within and across species are presented in the fields “Weighted Z” and “Weighted Z across species,” respectively, for ω, ω, and ω.
|
|
| Weighted Z across species | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| ω |
|
| ω |
|
| ω |
|
| ||
|
| 0.962 *** | 0.848 *** | 0.733 *** | 0.727 *** | 0.697 ** | 0.636 ** | ||||
|
|
| 1.000 | 0.867 | −0.200 | 0.867 * | 0.600 | 0.867 * | 1.56 × 10−4 *** | 7.71 × 10−5 *** | 7.98 × 10−3 ** |
|
| 1.000 | 0.867 * | 0.600 | 0.733 * | 0.867 * | 0.467 | ||||
|
| 6.46 × 10−4 | 1.61 × 10−3 | 0.133 | 2.64 × 10−3 ** | 5.29 × 10−3 ** | 0.0105 * | ||||
|
|
| 0.867 | 0.867 * | 0.467 | 0.867 * | 1.000 ** | 0.600 | 6.93 × 10−5 *** | 6.89 × 10−6 *** | 3.53 × 10−3 ** |
|
| 0.867 | 1.000 | 0.333 | 0.867 * | 0.733 * | 1.000 ** | ||||
|
| 1.51 × 10−3 | 3.71 × 10−4 | 0.186 | 1.09 × 10−3 ** | 1.68 × 10−3 ** | 2.24 × 10−3 ** | ||||
|
|
| 1.000 | 0.939 | 0.636 ** | 0.670 ** | 0.303 | 0.515 * | <2 × 10−216 *** | 6.60 × 10−6 *** | 2.53 × 10−3 ** |
|
| 0.970 | 0.909 | 0.454 * | 0.630 ** | 0.576 ** | 0.273 | ||||
|
| <2 × 10−216 | <2 × 10−216 | 1.20 × 10−3 ** | 3.85 × 10−5 *** | 5.80 × 10−3 ** | 4.18 × 10−2 * | ||||
|
|
| 0.944 | 0.889 | 0.722 ** | 0.636 ** | 0.673 ** | 0.564 * | 1.00 × 10−7 *** | 9.00 × 10−7 *** | 1.37 × 10−5 *** |
|
| 1.000 | 0.778 | 0.667 * | 0.636 ** | 0.491 * | 0.564 * | ||||
|
| 6.20 × 10−6 | 1.41 × 10−5 | 1.24 × 10−3 ** | 3.67 × 10−4 *** | 7.76 × 10−4 *** | 1.55 × 10−3 ** | ||||
For each variable, the correlation coefficient and the combined probabilities are shown with the respective significance (*P < 0.05; **P < 0.01; ***P < 0.001; “.” 0.05 ≤ P < 0.10) for ω, ω, and ω in Arabidopsis and Drosophila. As the effect of gene age was assessed by combining genes in each age class, these correlation coefficients do not measure the intrinsic gene-level strength of correlation between gene age and molecular rates of evolution.
Fig 1(a) Phylogenetic definition of the strata used in the analyses for A. thaliana (top) and D. melanogaster (bottom). The number of genes mapped to each clade is shown. (b) Relationship between the rate of protein evolution (ω), nonadaptive nonsynonymous substitutions (ω), and adaptive nonsynonymous substitutions (ω) with gene age in A. thaliana (top) and in D. melanogaster (bottom). Clades are ordered according to (a). In D. melanogaster, the results for X-linked, autosomal, and total genes are shown. Mean values of ω, ω, and ω for each category are represented with the black points. Error bars denote for the 95% confidence interval for each category, computed over 100 bootstrap replicates. The data (S1 Data) and code needed to generate this table can be found at https://gitlab.gwdg.de/molsysevol/supplementarydata_geneage and https://zenodo.org/record/6828430.
Fig 2Estimates of ω, ω, and ω plotted as a function of (a) protein length and (b) mean expression levels, (c) RSA, and (d) protein intrinsic disorder with gene age in A. thaliana (top) and D. melanogaster (bottom). Analyses were performed by comparing short and long genes (a), lowly and highly expressed genes (b), proteins with low and high mean RSA values (c), and proteins with low and high average intrinsic disorder (d) across age categories (see Material and methods). Legend as in Fig 1. Significance levels are shown for each correlation between gene age and the rates of protein evolution (*P < 0.05; **P < 0.01; ***P < 0.001; “.” 0.05 ≤ P < 0.10; “ns” P > 0.10). The data (S6–S9 Data tables) and code needed to generate this table can be found at https://gitlab.gwdg.de/molsysevol/supplementarydata_geneage and https://zenodo.org/record/6828430.
Fig 3Relationship between gene age and Grantham’s distance between amino acids for A. thaliana (a) and D. melanogaster (b). For each clade, the median value of Grantham’s distance between residues is depicted with the black dot. The shaded area represents the physicochemical distances within the first and third quartile. The data (S27 Data) and code needed to generate this table can be found at https://gitlab.gwdg.de/molsysevol/supplementarydata_geneage and https://zenodo.org/record/6828430.