| Literature DB >> 34132772 |
Susan F Bailey1, Luz Angela Alonso Morales2, Rees Kassen2.
Abstract
Synonymous mutations are often assumed to be neutral with respect to fitness because they do not alter the encoded amino acid and so cannot be "seen" by natural selection. Yet a growing body of evidence suggests that synonymous mutations can have fitness effects that drive adaptive evolution through their impacts on gene expression and protein folding. Here, we review what microbial experiments have taught us about the contribution of synonymous mutations to adaptation. A survey of site-directed mutagenesis experiments reveals the distributions of fitness effects for nonsynonymous and synonymous mutations are more similar, especially for beneficial mutations, than expected if all synonymous mutations were neutral, suggesting they should drive adaptive evolution more often than is typically observed. A review of experimental evolution studies where synonymous mutations have contributed to adaptation shows they can impact fitness through a range of mechanisms including the creation of illicit RNA polymerase binding sites impacting transcription and changes to mRNA folding stability that modulate translation. We suggest that clonal interference in evolving microbial populations may be the reason synonymous mutations play a smaller role in adaptive evolution than expected based on their observed fitness effects. We finish by discussing the impacts of falsely assuming synonymous mutations are neutral and discuss directions for future work exploring the role of synonymous mutations in adaptive evolution.Entities:
Keywords: distribution of fitness effects; experimental evolution; positive selection; synonymous mutations
Mesh:
Year: 2021 PMID: 34132772 PMCID: PMC8410137 DOI: 10.1093/gbe/evab141
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Characteristics of DFEs from Studies with Published Fitness Data for Synonymous Mutations
| No. of Mutations Tested | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Study | Organism | Fitness Assay Environment | WG/focal Gene | All | Range in Relative Fitness (min–max) | All | GPD Domain for Ben. Mutations | |||||
| Viruses |
| A | WG | S | 11 | 3 | 0–3.133 | 0.0245 | 0.0163 | Gumbel | ||
| N | 55 | 5 | 0–2.393 | |||||||||
| B | Qβ | WG | S | 36 | 12 | 0–1.03 | 0.0001 | 0.0001 | Weibull | |||
| N | 32 | 3 | 0–1.035 | |||||||||
| C | ΦX174 | WG | S | 38 | 21 | 0.809–1.068 | 0.0001 | >0.05 | Gumbel | |||
| N | 32 | 7 | 0–1.061 | |||||||||
|
| D | f1 | WG | S | 30 | 15 | 0.886–1.055 | 0.0001 | >0.05 | Weibull | ||
| N | 60 | 7 | 0–1.074 | |||||||||
|
| E | Baby hamster kidney host cells (BHK21) | WG | S | 9 | 3 | 0–1.055 | 0.0064 | >0.05 | Gumbel | ||
| N | 80 | 13 | 0–1.16 | |||||||||
|
| F | Human lung carcinoma host cells (A549) |
| S | 532 | 158 | 0.011–11.249 | 0.0001 | 0.0235 | Gumbel | ||
| N | 2478 | 242 | 0–20.216 | |||||||||
| Bacteria |
| G |
| LB-agar+ampicillin | S | 611 | 368 | 0.003–1.640 | 0.0001 | >0.05 | Weibull | |
| N | 1925 | 434 | 0.001–1.670 | |||||||||
|
| H |
| M9 + glucose media |
| S | 39 | 27 | 0.925–1.080 | 0.0001 | >0.05 | Weibull | |
| N | 71 | 29 | 0.622–1.0514 | |||||||||
|
| I |
| LB |
| S | 17 | 2 | 0.738–1.005 | >0.05 | — | — | |
| N | 39 | 2 | 0.754–1.004 | |||||||||
| J | LB |
| S | 21 | 0 | 0.606–0.989 | >0.05 | — | — | |||
| N | 49 | 0 | 0.72–1 | |||||||||
|
| K |
| LB agar+cefotaxime | S | — | 10 | 1.1–2.3 | — | 0.017 | S: Weibull | ||
| N | — | 38 | 1.1–27 | N: Fréchet | ||||||||
| Fungi |
| L |
| Dextrose media+G418 + ampicillin | 9-AA regionof Hsp90 | S | 15 | 14 | 0.937–1.014 | 0.0001 | 0.015 | S: Weibull |
| N | 545 | 41 | 0–1.042 | N: Gumbel | ||||||||
Note.—Studies are grouped by kingdom and then ordered alphabetically by author. WG, whole genome; S, synonymous mutations; NS, nonsynonymous mutations; GPD, Generalized Pareto Distribution; Gumbel domain describes exponential distributions, Weibull domain describes right-truncated distributions, Fréchet domain describes heavy-tailed distributions. Data from Domingo-Calap et al. (2009) and Cuevas et al. (2012) are pooled together (as in Cuevas et al. [2012]), and Schenk et al. (2012) focused on only beneficial mutations. Fitness values reported are relative to an ancestor, thus neutral mutations have a fitness of 1. ω > 1 indicates mutations with a relative fitness greater than one.
Only reported separately for S and N mutation when they fit two different GPD domains.
Too few mutations (df≤2) to be confident in the model fit.
Not strictly a fitness measure, fold increase in minimum inhibitory concentration (MIC) relative to the ancestor.
. 1.Distribution of fitness effects of all mutations, with fitness relative to the ancestor (ω) along the x axis and counts of mutations along the y axis. Blue represents nonsynonymous mutations, red represents synonymous mutations. Bars indicate mutation count data. The vertical black solid lines at ω = 1 indicate the fitness of the ancestor and the dashed vertical lines on either side indicate an estimate of 95% CI around that estimate based on mean measurement error reported. Study K did not report measurement error for its fitness estimates and so no dashed line is plotted. Blue and red curves indicate smoothed density fits of the nonsynonymous and synonymous mutations, respectively, using the “density” function in R. Letter labels correspond to study letter labels in table 1. The shapes of the DFEs of synonymous and nonsynonymous mutations are significantly different in all panels except I and J (K–S test, P < 0.05). Note.—The x axis in panel K is not strictly a fitness measure, but instead fold-increase in minimum inhibitory concentration (MIC) relative to the ancestor.
. 2.Distribution of fitness effects of mutations with relative fitness greater than 1 (ω > 1). Relative fitness is shown along the x axis and counts of mutations along the y axis. Blue represents nonsynonymous mutations, red represents synonymous mutations. Bars indicate mutation count data. The vertical black solid lines at ω = 1 indicate the fitness of the ancestor and the dashed vertical lines indicate the estimated 95% CI based on mean measurement error reported. Study K did not report measurement error for its fitness estimates and so no dashed line is plotted. Blue and red curves indicate smoothed density fits of the nonsynonymous and synonymous mutations, respectively, using the “density” function in R. Letters correspond to studies summarized in table 1. Panel J is blank because this study did not observe beneficial mutations. The shapes of the DFEs of synonymous and nonsynonymous mutations are significantly different in panels A, B, F, K, and L (K–S test, P < 0.05). Note.—The x axis in panel K is not strictly a fitness measure, but instead fold-increase in minimum inhibitory concentration (MIC) relative to the ancestor.
Experimental Evolution Studies that Report Clear Evidence for Beneficial Fitness Effects of Synonymous Mutations Arising De Novo
| Study | Organism | Experimental Conditions | No. MutationsObserved | Range of FitnessEffects (min–max) | Evidence for Fitness Effects | Molecular Mechanisms Proposed/Tested | ||
|---|---|---|---|---|---|---|---|---|
| Viruses |
| ΦX174 | S | 11 | ↑ | Parallel evolution | No mechanisms proposed. | |
| N | 63 | |||||||
|
| S | 4 | ↑ | Rapid adaptation; Parallel evolution | Proposed RNA genome secondary structure. | |||
| N | 2 | |||||||
|
| S | 2 | ↑ | Rapid adaptation | No mechanisms proposed. | |||
| N | 14 | |||||||
|
| Baby hamster kidney (BHK-21) and sand fly (LL-5) host cells | S | 5 | ↑ | Parallel evolution | Unknown. Tested change in codon usage and RNA structure/stability. | ||
| N | 16 | |||||||
|
| Ancestral genotypes had suboptimal codons in 10A gene; | S | 5 | ↑ | Rapid adaptation; Parallel evolution | Not mRNA structure. Possibly codon usage. | ||
| N | 12 | |||||||
|
| Madin–Darby canine kidney (MDCK) host cells | S | 7 | 0.05–0.29 | Allele frequency dynamics | No mechanisms proposed. | ||
| N | 10 | 0.06–0.22 | ||||||
|
| Bacteriophage Qβ (+ssRNA) | S | 14 | ↑ | Rapid adaptation; Amplification rate of reconstructed mutants | Proposed RNA genome secondary structure. | ||
| N | 17 | |||||||
| Bacteria |
| M9 + either mannose, glucose, or xylose | S | 2 | 0.07–0.09 | Competitive fitness of reconstructed mutants | Increased expression. No consistent change in optimal codon usage or mRNA structure/stability. | |
| N | 50 | ↑ | ||||||
|
|
| Four ancestral genotypes, each with different codon usage in | S | 4 | 0.11–0.19 | Parallel evolution; Growth rate of reconstructed mutants | Increased expression. Not change in codon usage or RNA structure/stability. Not anti-SD affinity. Proposed change in transcription binding site. | |
| N | 5 | 0.04–0.17 | ||||||
|
|
| Ancestral genotype has deletion (ΔargC) that severely limits growth and modified | S | 1 | 5.1 fold | Growth rate of reconstructed mutants | Strengthens an inefficient promotor for downstream gene, | |
| N | 2 | 3.2–4.7 fold | ||||||
|
|
| Ancestral genotypes each had a single deleterious S mutation | S | 18 | 0.08–0.73 | Growth rate of reconstructed mutants | Changes in predicted mRNA structure/stability. | |
| N | 31 | 0.05–0.72 | ||||||
|
|
| Ancestral genotype has deletion (ΔargC) that severely limits growth; M9 + glucose media | S | 2 | 0.41–0.67 | Growth rate of reconstructed mutants | Changes in predicted mRNA structure/stability and translation efficiency. | |
| N | 1 | 0.32 | ||||||
| Fungi |
|
| Fitness measures are from one asexsual population, and one sexual population; | S | 2 | −0.01 to 0.018 | Allele frequency dynamics; Competitive fitness of reconstructed mutants | No mechanisms proposed. |
| N | 13 | −0.01 to 0.08 |
Note.—Studies are grouped by kingdom and then ordered by publication date. S, synonymous mutations; N, nonsynonymous mutations; ↑, indirect evidence of increased fitness driven in part by synonymous mutations.
We only report those mutations with fitness significantly different from the ancestor.
Nonsynonymous mutants were not reconstructed and tested.
. 3.Probability that the next mutation fixed during an adaptive is synonymous over the number of unique clones competing for fixation. Points along each line represent the outcomes of random draws of a range of different number of beneficial clones with fitness drawn from experimentally quantified distributions of fitness effects from the studies summarized in table 1 (legend letter labels correspond to letter labels in table 1).