| Literature DB >> 35821220 |
David B Stern1,2, Nathan W Anderson3, Juanita A Diaz3, Carol Eunmi Lee4.
Abstract
The role of epistasis in driving adaptation has remained an unresolved problem dating back to the Evolutionary Synthesis. In particular, whether epistatic interactions among genes could promote parallel evolution remains unexplored. To address this problem, we employ an Evolve and Resequence (E&R) experiment, using the copepod Eurytemora affinis, to elucidate the evolutionary genomic response to rapid salinity decline. Rapid declines in coastal salinity at high latitudes are a predicted consequence of global climate change. Based on time-resolved pooled whole-genome sequencing, we uncover a remarkably parallel, polygenic response across ten replicate selection lines, with 79.4% of selected alleles shared between lines by the tenth generation of natural selection. Using extensive computer simulations of our experiment conditions, we find that this polygenic parallelism is consistent with positive synergistic epistasis among alleles, far more so than other mechanisms tested. Our study provides experimental and theoretical support for a novel mechanism promoting repeatable polygenic adaptation, a phenomenon that may be common for selection on complex physiological traits.Entities:
Mesh:
Year: 2022 PMID: 35821220 PMCID: PMC9276764 DOI: 10.1038/s41467-022-31622-8
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Fig. 1Genomic signatures of laboratory selection in response to salinity decline.
a Manhattan plot of single-nucleotide polymorphism (SNP) signatures of selection on one arbitrarily selected genomic scaffold. SNPs above the dotted red line were deemed significant (adjusted P < 0.05). SNPs are colored according to the selected haplotype block in which they were grouped, and shaded gray boxes delineate the 26 haplotype blocks on this scaffold. Top—Cochran-Mantel-Haezel (CMH) test for significant allele frequency changes beyond expectations from genetic drift. Bottom—linear mixed model (LMM) test to distinguish allele frequency trajectories between treatment and control lines. b Selected allele frequency trajectories during laboratory selection. Gray lines represent mean allele frequencies across the ten replicate lines (eight at generation ten) for each selected allele (i.e., haplotype block, N = 121). The purple line shows the average frequency for all selected alleles in the treatment lines. The yellow line shows the average frequency for all selected alleles in the control lines. The blue shaded area is the 1% and 99% quantile range of allele frequencies from 10,000 neutral simulations starting from the average starting frequency of the selected alleles. c Histogram of selected allele frequencies in the starting population, polarized to show the rising allele. The dotted line represents the mean frequency. d Histogram of selection coefficients of selected alleles, estimated using the change in frequency after ten generations across the replicate treatment lines. The dotted line represents the mean.
Gene paralogues with putative ion-transporter and osmoregulatory function found on haplotype blocks showing signatures of selection to salinity decline in the laboratory evolution experiment.
| Gene Symbola | Gene Description | Haplotype Block Numberb | Starting Frequency | Selection Coefficient |
|---|---|---|---|---|
| Carbonic Anhydrase, paralogue 7 | 6 | 0.254 | 0.107 | |
| Carbonic Anhydrase, paralogue 3 | 6 | 0.254 | 0.107 | |
| Carbonic Anhydrase, paralogue 8 | 19 | 0.178 | 0.114 | |
| Ammonia Transporter, paralogue 1 | 43 | 0.098 | 0.137 | |
| V-type H+ ATPase, complex V1, subunit C | 43 | 0.098 | 0.137 | |
| Arginine kinase | 55 | 0.106 | 0.194 | |
| Carbonic Anhydrase, paralogue 9 | 67 | 0.440 | 0.089 | |
| Carbonic Anhydrase, paralogue 10 | 67 | 0.440 | 0.089 | |
| V-type H+ ATPase, complex V1, subunit A | 71 | 0.446 | 0.098 | |
| Na+/K+-ATPase, subunit α, paralogues 1,2,4 | 76 | 0.472 | 0.101 | |
| Na+/K+-ATPase, subunit | 83 | 0.229 | 0.119 | |
| Carbonic Anhydrase, paralogue 5 | 91 | 0.372 | 0.129 | |
| Rh Protein, paralogue 1 | 92 | 0.330 | 0.097 | |
| V-type H+ ATPase, complex V1, subunit G | 94 | 0.283 | 0.127 | |
| Na+,HCO3− Cotransporter, paralogue unknown | 94 | 0.283 | 0.127 | |
| Na+/H+ Antiporter, paralogues 1-7 | 97 | 0.268 | 0.169 | |
| Ammonia Transporter, paralogue 7 | 114 | 0.206 | 0.100 | |
| Rh Protein, paralogue 4 | 114 | 0.206 | 0.100 | |
| Carbonic Anhydrase, paralogue 14 | 117 | 0.378 | 0.102 |
The gene paralogues have also been implicated in previous studies of adaptation in the E. affinis species complex[15,28].
aGenes names and paralogue number are based on manual annotations of the copepod E. affinis complex genome[28,94].
bAdditional details regarding the selected haplotype blocks can be found in Supplementary Data 1.
Fig. 2Genomic parallelism across replicate treatment lines during laboratory selection.
a Pairwise overlap (Jaccard index) of selected alleles between experimental replicate lines (dotted brown lines) relative to simulations (box plots). Box plots display median (middle line), 25th and 75th percentile (box), 5th and 95th percentile (vertical line), and data beyond the 5th and 95th percentile (single points) of mean Jaccard indices at generations six (dark blue) and ten (yellow) from 1000 simulation iterations. Empirical values most closely match those of data simulated under the “positive epistasis” model with an α parameter of 36.5 (generation six: P = 0.69; generation ten: P = 0.58), and are significantly higher than those simulated under the standard population genetic “multiplicative fitness” model (P = 0) and the quantitative genetic “directional epistasis” model (P = 0), based on N = 1000 simulations for each model. The “multiplicative fitness” model provides a null expectation for parallelism given the allelic selection coefficients and effective population sizes. b Relationship between number of loci contributing to the adaptive response and levels of parallel evolution. For 100 simulation iterations, we calculated the mean Jaccard index between simulated populations and display the mean (triangle and circle points) and interquartile range (vertical lines and shaded area) of Jaccard indices across simulation iterations. Epistatic models (Table 2), particularly quantitative fitness models, predict that the degree of parallelism among replicate lines increases as the number of loci contributing to an adaptive response declines. Levels of parallelism in our real data (horizontal dotted lines) can be replicated either by reducing the number of effective alleles in the “directional epistasis” model (teal) or increasing the α parameter of the “positive epistasis” model (orange). c The distribution of selected alleles in terms of the proportion of replicate lines in which the selected allele experienced a significant frequency shift at generation ten (i.e., “Replicate frequency spectrum” ref. [41]). A far greater proportion of replicate lines show higher proportions of the selected alleles in the empirical data (brown) than in the “multiplicative” (green) or “directional epistasis” (teal) model simulations, but closely match simulations under the “positive epistasis” model (orange). See Table 2 for details on the models.
Descriptions of evolutionary genetic models used in computer simulations of epistasis.
| (a) Population genetic framework | ||
|---|---|---|
| Model Type | Fitness Function | Description |
| Multiplicative (no epistasis) | The traditional population genetics model, in which a beneficial allele increases an individual’s fitness by a fixed ratio. | |
| Positive Epistasis | The effect of an allele is increased by the presence of other selected alleles. | |
| Negative Epistasis | The effect of an allele is decreased by the presence of other selected alleles. | |
The Phenotype:
Horizontal Shift:
Fig. 3Experiment-selected loci also exhibit signatures of selection in the wild populations from the Baltic Sea.
a Locations sampled for pooled genomic sequencing across the Baltic Sea. Sampling sites are colored according to their mean annual salinity estimated from the International Council for the Exploration of the Sea (Supplementary Data 3). The map was generated using data from https://www.naturalearthdata.com (Accessed June 2020). b SNPs underlying experiment-selected alleles harbor a strong signature of polygenic selection across the wild populations, as captured by the QX statistic. The QX statistic measures how quickly the experiment-selected loci evolve and how strongly SNP frequencies covary across wild populations. The empirical QX estimate for the experiment-selected SNPs (red dotted line) is far greater than the null distribution (gray histogram) based on the neutral population history and genomic background. c, d Minor allele frequencies (i.e., folded allele frequencies) for SNPs underlying experiment-selected alleles (purple) are significantly higher than non-selected SNPs (yellow) in two geographically divergent Baltic Sea populations from highly different salinity conditions. Vertical dotted lines display mean minor allele frequency for each category of SNP.