| Literature DB >> 21705473 |
Jeffrey D Jensen1, Doris Bachtrog.
Abstract
Characterizing the role of effective population size in dictating the rate of adaptive evolution remains a major challenge in evolutionary biology. Depending on the underlying distribution of fitness effects of new mutations, populations of different sizes may differ vastly in their rate of adaptation. Here, we collect polymorphism data at over 100 loci for two closely related Drosophila species with different current effective population sizes (N(e)), Drosophila miranda and D. pseudoobscura, to evaluate the prevalence of adaptive evolution versus genetic drift in molecular evolution. Utilizing these large and consistently sampled data sets, we obtain greatly improved estimates of the demographic histories of both species. Specifically, although current N(e) differs between these species, their ancestral sizes were much more similar. We find that statistical approaches capturing recent adaptive evolution (using patterns of polymorphisms) detect higher rates of adaptive evolution in the larger D. pseudoobscura population. In contrast, methods aimed at detecting selection over longer time periods (i.e., those relying on divergence data) estimate more similar rates of adaptation between the two species. Thus, our results suggest an important role of effective population size in dictating rates of adaptation and highlight how complicated population histories--as is probably the case for most species--can effect rates of adaptation. Additionally, we also show how different methodologies to detect positive selection can reveal information about different timescales of adaptive evolution.Entities:
Mesh:
Year: 2011 PMID: 21705473 PMCID: PMC3157839 DOI: 10.1093/gbe/evr063
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Summary Statistics of Synonymous (and nonsynonymous) Patterns of Variation in Drosophila miranda and D. pseudoobscura
| average | 14 | 14 |
| πsyn (πNS) | 0.006 (0.0003) | 0.014 (0.0011) |
| θsyn (θNS) | 0.007 (0.0004) | 0.019 (0.0013) |
| Taj | −0.38 (−1.1) | −0.37 (−0.26) |
| F&W | 0.10 (0.09) | 1.81 (1.57) |
Estimated Selection Parameters across Both Species for the Multilocus Polymorphism-and Divergence-Based Recurrent Selection Statistics Used Here
| 2 × 10−3 | 9 × 10−4 | |
| 2 | 1 × 10−4 | 5 × 10−3 |
| α | 0.78 | 0.83 |
| 2 | −1.32(−1.95) | −2.67(−3.36) |
Mean selection coefficient; estimation procedure of Jensen, Thornton, and Andolfatto (2008).
Mean rate of adaptation; estimation procedure of Jensen, Thornton, and Andolfatto (2008).
Fraction of positively selected loci; estimation procedure of Eyre-Walker and Keightley (2009).
Strength of purifying selection acting on nonsynonymous sites; estimation procedure of Williamson et al. (2005).
Strength of purifying selection acting on nonsynonymous sites; estimation procedure of Loewe et al. (2006).
FA cartoon schematic of the demographic models estimated for D. miranda and D. pseudoobscura. The demographic history of D. miranda is characterized by a severe bottleneck, while D. pseudoobscura is inferred to have had a relatively long-term stable population size, followed be recent growth. Thus, while the recent demographic history has served to exaggerate differences in size, the ancestral population size after the split of the two species may have been similar. T and T are the estimated times of the population size changes in 4N generations, and F and F are the changes in population size associated with the event (where F is the population size fraction relative to the ancestral size). The vertical black dotted line indicates the present time of sampling (i.e., t = 0).
Summary of Demographic Models Estimated Using Different Procedures
| dadi | Bottleneck: | Bottleneck: | — |
| Codon usage | — | — | 0.89 |
| DFE-α | Growth: | Growth: | — |
| Williamson et al. | Bottleneck: | Bottleneck: | — |
| Jensen et al. | — | 0.26 |
Estimation procedure of Gutenkunst et al. (2009). Values indicate the reduction in variation at the time of the size change (e.g., population is reduced to 0.0005 of the ancestral size), the period of reduction in 4N generations (e.g., reduction at 0.12 4N generations, lasting 0.02 4N generations), and the size to which the population recovers after the reduction (e.g., population recovers to 0.48 of the ancestral size).
Estimation procedure of Bulmer (1991), in which a relative size is estimated based on patterns of codon usage.
Estimation procedure of Eyre-Walker and Keightley (2009) coestimated with the fraction of selected sites. A stepwise growth in population size is estimated (e.g., 6-fold growth relative to the ancestral size) at a given time (e.g., 0.37 4N generations in the past).
Estimation procedure of Williamson et al. (2005), coestimated with the fraction of selected sites. A stepwise reduction in population size is estimated (e.g., reduction to 0.001 of the ancestral size) at a given time (e.g., 0.1 4N generations in the past).
Estimation of neutral θ from the estimation procedure of Jensen, Thornton, and Andolfatto (2008)—the relative size is presented as estimated from patterns of polymorphism.
Estimated Demographic Models and the Fit to the Observed Synonymous Frequency Spectrum
| Program | Command Line | Fit to Data |
| dadi | ms 20 100000 -t 10 -eN 0 0.48 -eN 0.1 0.0005 -eN 0.12 1 | 0.79 |
| dadi, pse | ms 20 100000 -t 10 -eN 0 1.35 -eN 0.09 0.81 -eN 0.18 1 | 0.86 |
| DFE | ms 20 100000 -t 10 -eG 0.37 6 | 0.09 |
| DFE, pse | ms 20 100000 -t 10 -eG 0.08 6.6 | 0.11 |
| Williamson | ms 20 100000 -t 10 -eN 0 0.001 -eN 0.1 1 | 0.54 |
| Williamson, pse | ms 20 100000 -t 10 -eN 0 0.71 -eN 0.12 1 | 0.38 |
Fraction of replicates within σ = 0.01 of empirically observed values of both mean Tajima's Dsyn and πsyn.
Gutenkunst et al. (2009).
Eyre-Walker and Keightley (2006).
Williamson et al. (2005).
FDistribution of fitness effects of newly arising amino-acid mutations estimated in the D. miranda and D. pseudoobscura lineages. Consistent with a smaller N in D. miranda, a larger fraction of newly arising nonsynonymous mutations are estimated to be under weak purifying selection (i.e., 1 < N <100).
The Number of Significant Test Rejections across Both Species, after a Multiple Test Correction, for the Single-Locus Polymorphism and Divergence-Based Statistics Used Here
| CLRT | 3 | 27 |
| GOF | 1 | 11 |
| ωmax | 1 | 4 |
| MK | 2 | 18 |
| HKA | 6 | 12 |
FApproximate Bayesian estimation of both the strength and rate of recurrent positive selection, for randomly selected homologous genes from D. miranda and D. pseudoobscura. Estimation is based on 106 draws from the prior. Given are the marginal distributions, with D. pseudoobscura in black and D. miranda in gray. Consistent with an important role of effective population size driving adaptive evolution, roughly an order of magnitude greater rate of fixation is estimated for the currently larger D. pseudoobscura population using this polymorphism-based statistic.
Summary of Forward Simulation Results
| Severitybneck | 0.0005 | 0.81 |
| Timebneck | 0.12 | 0.18 |
| Durationbneck | 0.02 | 0.09 |
| Recoverybneck | 0.48 | 1.35 |
| 9 × 10−4 | 9 × 10−4 | |
| 2 | 1 × 10−4 | 5 × 10−3 |
| 0.38 | 0.81 |
Severity of the simulated reduction in population size relative to the ancestral size.
Time of the simulated reduction in population size in 4N generations.
Duration of the simulated reduction in population size in 4N generations.
Simulated size to which the population recover postbottleneck, relative to ancestral size.
Simulated selection coefficeint.
Simulated rate of fixation.
Estimated rate of fixation for the above parameters.
Power of the CLRT/GOF combination for the above parameters.
ineage-Specific Estimates of α, the Fraction of Adaptive Amino Acid Substitutions (and 95% confidence intervals), Using Drosophila affinis or D. athabasca as a Second Outgroup
| Method | ||
| All sites | ||
| α | 0.66 (0.55–0.74) | 0.57 (0.38–0.70) |
| α | 0.57 (0.43–0.67) | 0.45 (0.23–0.61) |
| α | 0.56 (0.43–0.66) | 0.49 (0.29–0.64) |
| α | 0.72 (0.63–0.80) | 0.70 (0.55–0.80) |
| α | 0.68 (0.58–0.76) | 0.69 (0.54–0.79) |
| α | 0.65 (0.52–0.75) | 0.69 (0.53–0.80) |
Fraction of adaptive amino acid mutations; estimation procedure of Fay et al. (2001).
Fraction of adaptive amino acid mutations; estimation procedure of Smith and Eyre-Walker (2002).
Fraction of adaptive amino acid mutations; estimation procedure of Bierne and Eyre-Walker (2004).
Fraction of adaptive amino acid mutations ignoring polymorphism at a frequency < 0.1.
FPlot of p versus K for D. miranda (the pooled loci of both this study, as well as the randomly selected genes of Bachtrog et al. 2009), D. pseudoobscura (the pooled loci of homologous genes across the X-chromosome) and D. melanogaster (Bachtrog 2008). The solid line indicates the significant correlation between these measures of synonymous polymorphism and non-synonymous divergence — a prediction consistent with both genetic hitchhiking and background selection - among these species of differing effective population sizes (with D. melanogaster thought to be of intermediate size between D. miranda and D. pseudoobscura).
FPlot of CLRT P values with K. (A) Drosophila miranda: The pooled loci of both this study, as well as the randomly selected genes of Bachtrog et al. (2009), are shown. (B) Drosophila pseudoobscura: The pooled loci of homologous genes across the X chromosome. The dotted line indicates the 5% significance cutoff for the CLRT. The solid line indicates the significant correlation between the observed divergence measure K and the calculated P value of this polymorphism-based test statistic. Results indicate a significant correlation between this polymorphism-based test of selection and this divergence-based measure, in both species—despite roughly an order of magnitude difference in effective population size. This result suggests that at least a portion of the correlation observed in figure 4 owes to hitchhiking effects.