| Literature DB >> 24891070 |
Thomas Bataillon1, Susan F Bailey.
Abstract
The rates and properties of new mutations affecting fitness have implications for a number of outstanding questions in evolutionary biology. Obtaining estimates of mutation rates and effects has historically been challenging, and little theory has been available for predicting the distribution of fitness effects (DFE); however, there have been recent advances on both fronts. Extreme-value theory predicts the DFE of beneficial mutations in well-adapted populations, while phenotypic fitness landscape models make predictions for the DFE of all mutations as a function of the initial level of adaptation and the strength of stabilizing selection on traits underlying fitness. Direct experimental evidence confirms predictions on the DFE of beneficial mutations and favors distributions that are roughly exponential but bounded on the right. A growing number of studies infer the DFE using genomic patterns of polymorphism and divergence, recovering a wide range of DFE. Future work should be aimed at identifying factors driving the observed variation in the DFE. We emphasize the need for further theory explicitly incorporating the effects of partial pleiotropy and heterogeneity in the environment on the expected DFE.Entities:
Keywords: distribution of fitness effects; experimental evolution; mutation; mutational landscape models; population genomics
Mesh:
Year: 2014 PMID: 24891070 PMCID: PMC4282485 DOI: 10.1111/nyas.12460
Source DB: PubMed Journal: Ann N Y Acad Sci ISSN: 0077-8923 Impact factor: 5.691
Figure 1Hypothetical whole distributions of fitness effects. The whole distribution of fitness effect can comprise, in principle, a continuum of fitness effects ranging from lethal to strongly or mildly deleterious to beneficial. Three distributions are pictured here that comprise different amounts of deleterious and beneficial mutations. The dotted and dashed distributions are arbitrarily chosen, while the continuous distribution (solid gray) is one of the type predicted by a Fisherian fitness landscape (a displaced and reflected Γ distribution).
Figure 2Alternative distribution of beneficial fitness effects as predicted by extreme-value theory (EVT). All distributions displayed here are specific cases of a generalized Pareto distribution9 that differ by their shape. A shape of 1 corresponds to an exponential distribution (gray), a shape parameter >1 will yield a DFE with a much heavier tail (so-called Frechet domain, in blue). Alternatively, a shape parameter <1 will yield distributions that decay much more rapidly than the exponential (so-called Weibull domain), and the beneficial mutations cannot exceed a maximum beneficial effect.
DFE inferred from experimental studies relying on the isolation of individual mutations
| Strategy used to isolate mutations | Organism | Mutational target | Number of beneficial mutations | Mutations characterized | DFE inferred | References |
|---|---|---|---|---|---|---|
| Resistance to antibiotic | gyrA and others | 18 | Beneficial | Exponential | ||
| Resistance to antibiotic | rpoB | 15 | Beneficial | Exponential | ||
| Reporter construct | 11 genes total | 100 | Beneficial | Normal | ||
| Increased growth rate | ssDNA bacteriophage Id11 | WG | 9 | Beneficial | Weibull | |
| Novel host growth | RNA phage ϕ6 | P3 (host attachment gene) | 16 | Beneficial | Weibull | |
| Site-directed mutagenesis | VSV (RNA virus) | WG | 16 | (A) Beneficial | (A) Γ, significantly leptokurtic | |
| (B) Deleterious | (B) log-normal + uniform | |||||
| Site-directed mutagenesis | RNA bacteriophage Qβ; ssDNA bacteriophage ΦX174 | WG | 0 | All | Γ, but β and exponential also fit well | |
| Site-directed mutagenesis | RNA virus pTEV-7DA | WG | 0 | Viable mutations only | β, but all distributions tested were significant | |
| Site-directed mutagenesis | ssDNA bacteriophage f1 | WG | 2 | All | Log-normal or Weibull | |
| Site-directed mutagenesis | β-lactamase TEM-1 | 0 | All | Γ; biophysical model of protein stability | ||
| Site-directed mutagenesis | rpsT and rplA | 0 | All | Γ | ||
| Site-directed mutagenesis (EMPIRIC) | iG170D | 9 AAs in | 0 | All | Bimodal (nearly neutral + deleterious) | |
| Tn insertion | WG | 0 | All | Γ + uniform |
WG, whole genome.
Summaries of studies inferring mutational properties through fitness/marker trajectories over time
| Method | Organism | Sample size (# of mutations) | DFE | Beneficial mutation rate | References |
|---|---|---|---|---|---|
| Marker frequencies | 66 | Exponential or Γ | 4 × 10−9 | ||
| Marker frequencies | 30 | Peaked, unimodal | 5.9 × 10−8 | ||
| Marker frequencies | 72 | Exponential, uniform, and Dirac δ | 2 × 10−7 | ||
| Marker frequencies | 75 and 87 | Γ | 10−5 | ||
| Marker frequencies | K43N: 81; K88E: 102 | K43N: cannot reject log-normal (Weibull domain) | K43N: 5 × 10−5; K88N: 4 × 10−5 | ||
| K88E: β (reject Weibull) | |||||
| Marker frequencies | 68 | Weibull | 3.8 × 10−8 | ||
| Mean fitness over time | 260 | Unimodal, positively skewed | – | ||
| Mean fitness over time | 288 | Unimodal for one genotype, bimodal for the other two. | 6.6 × 10−8 |
Figure 3Expected amounts of polymorphism and divergence contributed by nonsynonymous mutations as a function of the scaled selection coefficient of a mutation. The expected amount of polymorphism in a genomic region can be summarized as the total number of polymorphic sites for both nonsynonymous (Pn) and synonymous sites (Ps, lower horizontal line), and the number of rare mutations using, for example, the number of singletons (mutations seen only once) at nonsynonymous sites (Sn). Amounts of divergence are quantified using the number of divergent synonymous (Ds, upper horizontal line) and nonsynonymous sites (Dn) relative to an out-group sequence. For a given mutation rate and effective population size, sample size n of chromosomes resequenced and size of a genomic fragment population genetic theory and diffusion results on the Wright-Fisher model can be used to compute these quantities as a function of the scaled mutation effect S of a nonsynonymous mutation. Here for illustration, we assume 1000 synonymous neutral sites, 3000 nonsynonymous nucleotide sites with a scaled mutation rate θ = 4Nμ = 0.01, a scaled divergence to the out-group of λ = 0.05, and a sample comprising n = 10 chromosomes.
Figure 4An example of site frequency spectrum (SFS) data in chimpanzee (Pan troglodytes troglodytes). Data comprising the resequencing of the exome of 12 individuals (n = 24 chromosomes) is redrawn using the data of Ref. 76. Circle size is proportional to counts of polymorphism. Synonymous counts are in gray (upper set of L-shaped dots) and nonsynonymous counts in black (lower set of L-shaped dots).
Summary of studies inferring the distribution of scaled fitness effects, Ns, of nonsynonymous mutations
| Organism | Method/dem model | –1 < | –10 < | –10 < | Distribution(s) fitted | References |
|---|---|---|---|---|---|---|
| Human | Diffusion + complex demography | 0.27 | 0.30 | 0.43 | Mix of normal exponential/neutral | |
| Human | EWK2009 | 0.35 | 0.09 | 0.56 | Γ | |
| K&K | 0.19 | 0 | 0.81 | LN, Γ, β, Spikes | ||
| EWK2009 | 0.09 | 0.06 | 0.74 | Γ | ||
| EWK2009 | 0.06 | 0.07 | 0.87 | Γ | ||
| 0.25 | 0.25 | 0.5 | Γ | |||
| 0.2 | 0.2 | 0.6 | ||||
| Angiosperms | EWK2009 | 0.1–0.35 | 0.05–0.15 | 0.7–0.8 | Γ | |
| EWK2009 | 20–35 | 12–15 | 50–65 | Γ |
EWK200956: diffusion based, simple demographic model fitted featuring a possible step change from population size N1 to population size N2 at some time t in the past (N1, N2, and t become “nuisance parameters” estimated alongside DFE and the fraction of favorable mutations).
K&K: discrete W–F matrix based, demographic model identical to EWK2009.
Dem, demographic; LN, log normal; spikes, spikes at different Ns class values.