| Literature DB >> 34971382 |
Jeremy D Lange1, Héloïse Bastide1, Justin B Lack1, John E Pool1.
Abstract
Population genetics seeks to illuminate the forces shaping genetic variation, often based on a single snapshot of genomic variation. However, utilizing multiple sampling times to study changes in allele frequencies can help clarify the relative roles of neutral and non-neutral forces on short time scales. This study compares whole-genome sequence variation of recently collected natural population samples of Drosophila melanogaster against a collection made approximately 35 years prior from the same locality-encompassing roughly 500 generations of evolution. The allele frequency changes between these time points would suggest a relatively small local effective population size on the order of 10,000, significantly smaller than the global effective population size of the species. Some loci display stronger allele frequency changes than would be expected anywhere in the genome under neutrality-most notably the tandem paralogs Cyp6a17 and Cyp6a23, which are impacted by structural variation associated with resistance to pyrethroid insecticides. We find a genome-wide excess of outliers for high genetic differentiation between old and new samples, but a larger number of adaptation targets may have affected SNP-level differentiation versus window differentiation. We also find evidence for strengthening latitudinal allele frequency clines: northern-associated alleles have increased in frequency by an average of nearly 2.5% at SNPs previously identified as clinal outliers, but no such pattern is observed at random SNPs. This project underscores the scientific potential of using multiple sampling time points to investigate how evolution operates in natural populations, by quantifying how genetic variation has changed over ecologically relevant timescales.Entities:
Keywords: zzm321990 Drosophila melanogasterzzm321990 ; adaptive evolution; insecticide resistance; temporal population genomics
Mesh:
Year: 2022 PMID: 34971382 PMCID: PMC8826484 DOI: 10.1093/molbev/msab368
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Fig. 1.Principle components analysis reflects expected relationships between old Providence genomes and other sequenced genomes, with no obvious outlier individuals. X chromosome SNP data from seven additional North American, European, and African populations were used in this analysis.
Fig. 2.Inversion frequencies have shifted across time and space. (A) Inversion frequencies at eight well-studied chromosomal inversions. The asterisks underneath In(2R)Ns, In(3R)Mo, and In(3R)P indicate statistically significant frequencies between seasons. (B) Four inversions showed significant frequency differences between sampling locations (x axis) in the spring, as detailed in supplementary table S2, Supplementary Material online. No inversions displayed significant frequency differences between locations in the fall samples.
Fig. 3.Population ancestry appears to have not shifted appreciably between time points. African versus European ancestry was estimated along chromosomes using the hidden Markov model of Corbett-Detig and Nielsen (2017). African ancestry proportion was averaged across inversion-free strains in the original samples (black line), averaged across six pools in the fall sample (blue), and 12 pools in the spring sample (yellow). Because inversions can affect ancestry, and we observed shifts in inversions frequencies between time points, we weighted ancestry in the old samples to match inversion frequencies between the time points. This weighted ancestry along the genome is shown here.
Fig. 4.Northern-associated alleles at clinal outlier SNPs have tended to increase in frequency over time. The top histogram depicts 1,671 frequency differences of northern-associated alleles at clinal outlier SNPs. The bottom histogram depicts the frequency difference at 1,671 SNPs chosen at random across the genome.
Fig. 5.Population branch statistic and GO enrichment. (A) PBS at each window genome wide. Gene names are discussed in the text. (B) The top ten categories in our window PBS GO enrichment analysis.
Outlier Regions Containing the Top Population Branch Statistic Values Genome Wide, Indicating Candidates for Temporal Evolution.
| Chr. Arm | Start | Stop | Windows | Top Window PBS | Top Window | Genes within 10 kb of Peak |
|---|---|---|---|---|---|---|
| 2R | 10450311 | 11002594 | 78 | 0.2077 | 4.0E−07 |
|
| X | 8549577 | 8715140 | 26 | 0.1440 | 1.2E−06 |
|
| 2R | 7935691 | 8252784 | 42 | 0.1019 | 3.6E−05 |
|
| 3R | 24459660 | 25160620 | 88 | 0.1015 | 4.7E−05 |
|
| X | 3159369 | 3404026 | 35 | 0.0837 | 8.9E−05 |
|
| 3R | 19412432 | 19432122 | 4 | 0.0872 | 1.4E−04 |
|
| X | 16919722 | 17013929 | 14 | 0.0849 | 1.6E−04 |
|
| 3R | 8856384 | 8882684 | 3 | 0.0848 | 2.2E−04 |
|
| X | 18020585 | 18110534 | 14 | 0.0782 | 2.3E−04 |
|
| X | 9526895 | 9557458 | 4 | 0.0712 | 3.6E−04 |
|
| 3L | 10456489 | 10564065 | 20 | 0.0732 | 4.1E−04 |
|
| 2L | 22212037 | 22940523 | 16 | 0.0742 | 6.2E−04 |
|
| 2R | 6988767 | 7043417 | 10 | 0.0677 | 9.6E−04 |
|
| X | 18302387 | 18449417 | 21 | 0.0647 | 9.7E−04 |
|
| 2R | 9169961 | 9219512 | 7 | 0.0688 | 1.1E−03 |
|
| X | 10212150 | 10403274 | 22 | 0.0585 | 1.2E−03 |
|
| X | 12025462 | 12179467 | 23 | 0.0569 | 1.5E−03 |
|
| 2R | 6448156 | 6451900 | 1 | 0.0657 | 1.5E−03 |
|
| X | 12439233 | 12514400 | 10 | 0.0557 | 1.7E−03 |
|
| 3L | 4892820 | 4920140 | 4 | 0.0643 | 1.7E−03 |
|
Notes.—Regions were defined as described in Materials and Methods. Raw P values based on demographic simulations are also provided. Coordinates reflect release 5 of the Drosophila melanogaster reference genome.
Fig. 6.The frequency of the intact Cyp6a17 + Cyp6a23 allele shifted rapidly under strong selection. (A) The frequencies of the intact allele in each of the six sampling years and an estimate of the frequency in modern populations are shown. (B) Results of an ABC analysis to infer the selection strength and starting frequency of the intact allele that best recapitulate our empirical sampling results between 1975 and 1983.
Windows Containing the Highest SNP PBS Values Genome Wide Are Shown, Excluding Those Associated with Window PBS Outlier Regions, Based on Our “Pseudo-Pool” Treatment of the Old Sample Data.
| Chr. Arm | Window Start | Window Stop | Window PBS | Max SNP PBS | Max SNP Position | Freq. Old | Freq. Fall | Freq. Spring | Location of Top SNP |
|---|---|---|---|---|---|---|---|---|---|
| X | 12427571 | 12433229 | 0.0240 | 0.4909 | 12428485 | 0.3545 | 0.0063 | 0.0584 | Intron of |
| 2R | 9039862 | 9049014 | 0.0322 | 0.4554 | 9047945 | 0.2071 | 0.0000 | 0.0000 | Exon of |
| 2L | 11494112 | 11498367 | 0.0115 | 0.4334 | 11497953 | 0.3383 | 0.0267 | 0.0380 | Intron of |
| 2R | 9022225 | 9039861 | 0.0268 | 0.4101 | 9038358 | 0.1857 | 0.0000 | 0.0000 | Intron of |
| 2R | 11997670 | 12004051 | 0.0105 | 0.4004 | 12000155 | 0.1949 | 0.0032 | 0.0040 | Exon of |
| 3R | 15348249 | 15355833 | 0.0172 | 0.3973 | 15353076 | 0.4902 | 0.1768 | 0.0430 | Intron of |
| 2L | 6009244 | 6014164 | 0.0184 | 0.3871 | 6011436 | 0.3515 | 0.0662 | 0.0311 | Intron of |
| X | 17218725 | 17225201 | 0.0200 | 0.3792 | 17221539 | 0.4603 | 0.1121 | 0.1054 | Intergenic, close to |
| 2L | 12722716 | 12729408 | 0.0289 | 0.3491 | 12725414 | 0.6217 | 0.2521 | 0.1833 | Intron of |
| 2R | 11931245 | 11941772 | 0.0180 | 0.3457 | 11935869 | 0.1558 | 0.0000 | 0.0000 | Exon of |
| 2R | 6608168 | 6617713 | 0.0176 | 0.3419 | 6616156 | 0.3338 | 0.0729 | 0.0370 | 5′-UTR ofr |
| X | 19627456 | 19638362 | 0.0179 | 0.3370 | 19629237 | 0.7745 | 0.1975 | 0.4863 | Intergenic, between |
| X | 10416049 | 10430412 | 0.0232 | 0.3342 | 10418289 | 0.3500 | 0.0634 | 0.0344 | Intron of |
| 2L | 16801631 | 16808037 | 0.0300 | 0.3276 | 16801968 | 0.3699 | 0.0961 | 0.0328 | Exon of |
Note.—Coordinates reflect release 5 of the Drosophila melanogaster reference genome.
Fig. 7.A genome-wide enrichment of elevated genetic differentiation between old and new population samples was observed. The enrichment of low raw P values, indicating higher PBS values than expected under the neutral demographic model, is depicted for (A) window PBS, and (B) maximum SNP PBS per window. As noted in the main text, the window enrichment can be accounted for by just a few broad outlier regions, whereas approximately 20 SNP PBS outlier regions must be removed to account for the latter enrichment.