| Literature DB >> 31242179 |
Katharina B Böndel1, Susanne A Kraemer1,2, Toby Samuels1, Deirdre McClean3, Josianne Lachapelle4, Rob W Ness4, Nick Colegrave1, Peter D Keightley1.
Abstract
Spontaneous mutations are the source of new genetic variation and are thus central to the evolutionary process. In molecular evolution and quantitative genetics, the nature of genetic variation depends critically on the distribution of effects of mutations on fitness and other quantitative traits. Spontaneous mutation accumulation (MA) experiments have been the principal approach for investigating the overall rate of occurrence and cumulative effect of mutations but have not allowed the phenotypic effects of individual mutations to be studied directly. Here, we crossed MA lines of the green alga Chlamydomonas reinhardtii with its unmutated ancestral strain to create haploid recombinant lines, each carrying an average of 50% of the accumulated mutations in a large number of combinations. With the aid of the genome sequences of the MA lines, we inferred the genotypes of the mutations, assayed their growth rate as a measure of fitness, and inferred the distribution of fitness effects (DFE) using a Bayesian mixture model. We infer that the DFE is highly leptokurtic (L-shaped). Of mutations with absolute fitness effects exceeding 1%, about one-sixth increase fitness in the laboratory environment. The inferred distribution of effects for deleterious mutations is consistent with a strong role for nearly neutral evolution. Specifically, such a distribution predicts that nucleotide variation and genetic variation for quantitative traits will be insensitive to change in the effective population size.Entities:
Mesh:
Year: 2019 PMID: 31242179 PMCID: PMC6615632 DOI: 10.1371/journal.pbio.3000192
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Data overview.
| MA line cross | Number of mutations | Number of RLs | Number of haplotypes |
|---|---|---|---|
| L03 | 39 | 247 | 214 |
| L06 | 69 | 238 | 109 |
| L07 | 59 | 261 | 69 |
| L09 | 98 | 272 | 68 |
| L11 | 66 | 272 | 154 |
| L14 | 55 | 236 | 67 |
| Combined | 386 | 1,526 | 681 |
Abbreviations: MA, mutation accumulation; RL, recombinant line.
Fig 1Relationship between growth rate and number of mutations carried by an RL or ancestor for the six CC-2931 MA line crosses.
Linear regression lines are shown. MA, mutation accumulation; RL, recombinant line.
Likelihood ratio tests for mixed-model analysis of growth rate as a function of number of mutations of all kinds with 1 degree of freedom.
| MA line cross | Chi square | |
|---|---|---|
| L03 | 11.2 | 0.00080 |
| L06 | 0.77 | 0.38 |
| L07 | 0.18 | 0.67 |
| L09 | 0.072 | 0.79 |
| L11 | 2.4 | 0.12 |
| L14 | 0.72 | 0.40 |
| Whole data set | 3.1 | 0.080 |
Abbreviation: MA, mutation accumulation.
Bayesian MCMC estimates based on modes of the posterior distributions and 95% credible intervals for mutation effect (e) and mutation frequency (q) parameters under two- or three-category models along with BIC relative to the model with two categories of mutation effects.
Both models include a class of mutations with zero effect on the trait.
| Parameter estimate (95% credible interval) | |||||
|---|---|---|---|---|---|
| Model (no. mutation categories) | e1 | q1 | e2 | q2 | BIC |
| 2 | −0.031 (−0.044, −0.023) | 0.042 (0.020, 0.079) | - | - | 0 |
| 3 | −0.024 (−0.043, −0.011) | 0.071 (0.031, 0.42) | 0.021 (0.010, 0.068) | 0.048 (0.010, 0.41) | −147 |
Abbreviations: BIC, Bayesian information criterion; MCMC, Markov chain Monte Carlo.
Bayesian estimates obtained from the modes of the posterior distributions and 95% credible intervals for parameters of gamma distributions of negative and positive mutation effects (indexed by 0 and 1, respectively), under two-sided gamma distribution models with the same or different means for negative- and positive-effect mutations.
For example, e1 is the estimated mean of the gamma distribution of positive-effect mutations, and q1 is their frequency.
| Parameter | Model | Estimate | 95% credible interval | |
|---|---|---|---|---|
| β | Two-sided gamma, same means | 0.32 | 0.26 | 0.70 |
| 0.0049 | 0.0037 | 0.0070 | ||
| 0.48 | 0.39 | 0.58 | ||
| β | Two-sided gamma, different means | 0.30 | 0.24 | 0.71 |
| −0.0092 | −0.020 | −0.0060 | ||
| 0.0021 | 0.0013 | 0.0032 | ||
| 0.84 | 0.73 | 0.90 | ||
Fig 2Inferred DFE assuming a two-sided gamma model (smooth line) and a point mass DFE for the three-category model (transparent blue rectangles).
DFE, distribution of fitness effects.
Average squared effects of mutations (×1,000) of certain mutation type classifications (S1 Data) estimated under the two-sided gamma distribution model.
For example, in the row labelled ‘SNP versus Indel’, e2(−) and e2(+) are the average squared effects for SNP and indel mutations, respectively. P values for the difference between the squared effects of mutations were obtained by bootstrapping mutations 1,000 times.
| Mutation type | e2(−) | e2(+) | |
|---|---|---|---|
| SNP versus indel | 0.074 | 0.073 | 0.84 |
| Nonexonic versus exonic | 0.061 | 0.083 | 0.15 |
| Nonintronic versus intronic | 0.081 | 0.059 | 0.17 |
| Nonintergenic versus intergenic | 0.074 | 0.067 | 0.92 |
Fig 3The estimated reflected gamma distribution of effects (inferred gamma distribution) compared to the distribution of posterior mean estimates for the effects of the individual mutations (individual estimates).
Fig 4Relationship between estimated fitness effects of mutations obtained by MCMC and estimates obtained from the difference in mean growth rate between recombinant lines carrying the mutant and wild-type allele (raw difference).
Raw difference estimates were calculated within MA line genotypes, excluding the ancestral lines (which are homozygous mutant or wild type for all mutations carried by a MA line). MA, mutation accumulation; MCMC, Markov chain Monte Carlo.
Representation of the data from a MA line crossing experiment.
| Description | Symbol | Dimension |
|---|---|---|
| Number of observations | Scalar | |
| Overall number of mutations | Scalar | |
| Mutation matrix | ||
| Number of fixed effects | Scalar | |
| Fixed effects matrix | ||
| Number of plates | Scalar | |
| Plate number vector | ||
| Phenotypic value vector |
Variables in the MCMC model specific to the multicategory model.
| Description of variable | Symbol | Dimension | Constraints on value |
|---|---|---|---|
| Vector of mutation categories | 1‥ | Integer, 0‥ 1 – | |
| Vector of mutation effects | 0‥ | – | |
| Vector of mutation frequencies | 0‥ |
Abbreviation: MCMC, Markov chain Monte Carlo.
Variables in the model specific to the two-sided gamma distribution model.
| Description of variable | Symbol | Dimension | Constraints on value |
|---|---|---|---|
| Vector of mutation sign indicators | 1‥ | Integer, 0‥1 | |
| Matrix of mutation effects | [0‥1] × [1‥ | Positive real number | |
| Vector of frequencies of negative and positive-effect mutations | 0‥1 | 0 < | |
| Vector of shape parameters | 0‥1 | Positive real number | |
| Vector of scale parameters | 0‥1 | Positive real number |
Variables of the model common to the multicategory and two-sided gamma distribution models.
| Description of variable | Symbol | Dimension | Constraints on value |
|---|---|---|---|
| Vector of fixed effects | 1‥ | – | |
| Vector of plate effects | 1‥ | – | |
| Random plate effect variance | Scalar | – | |
| Overall mean | Scalar | – | |
| Residual variance | Scalar | – |