| Literature DB >> 32413142 |
Martin Kapun1,2,3,4,5, Maite G Barrón1,6, Fabian Staubach1,7, Darren J Obbard1,8, R Axel W Wiberg1,9,10, Jorge Vieira1,11,12, Clément Goubert1,13,14, Omar Rota-Stabelli1,15, Maaria Kankare1,16, María Bogaerts-Márquez1,6, Annabelle Haudry1,13, Lena Waidele1,7, Iryna Kozeretska1,17,18, Elena G Pasyukova1,19, Volker Loeschcke1,20, Marta Pascual1,21,22, Cristina P Vieira1,11,12, Svitlana Serga1,17, Catherine Montchamp-Moreau1,23, Jessica Abbott1,24, Patricia Gibert1,13, Damiano Porcelli1,25, Nico Posnien1,26, Alejandro Sánchez-Gracia1,21,22, Sonja Grath1,27, Élio Sucena1,28,29, Alan O Bergland1,30, Maria Pilar Garcia Guerreiro1,31, Banu Sebnem Onder1,32, Eliza Argyridou1,27, Lain Guio1,6, Mads Fristrup Schou1,20,24, Bart Deplancke1,33, Cristina Vieira1,13, Michael G Ritchie1,9, Bas J Zwaan1,34, Eran Tauber1,35,36, Dorcas J Orengo1,21,22, Eva Puerma1,21,22, Montserrat Aguadé1,21,22, Paul Schmidt1,37, John Parsch1,27, Andrea J Betancourt1,38, Thomas Flatt1,2,3, Josefa González1,6.
Abstract
Genetic variation is the fuel of evolution, with standing genetic variation especially important for short-term evolution and local adaptation. To date, studies of spatiotemporal patterns of genetic variation in natural populations have been challenging, as comprehensive sampling is logistically difficult, and sequencing of entire populations costly. Here, we address these issues using a collaborative approach, sequencing 48 pooled population samples from 32 locations, and perform the first continent-wide genomic analysis of genetic variation in European Drosophila melanogaster. Our analyses uncover longitudinal population structure, provide evidence for continent-wide selective sweeps, identify candidate genes for local climate adaptation, and document clines in chromosomal inversion and transposable element frequencies. We also characterize variation among populations in the composition of the fly microbiome, and identify five new DNA viruses in our samples.Entities:
Keywords: SNPs; adaptation, demography; clines; population genomics; selection; structural variants
Mesh:
Year: 2020 PMID: 32413142 PMCID: PMC7475034 DOI: 10.1093/molbev/msaa120
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Fig. 1.The geographic distribution of population samples. Locations of all samples in the 2014 DrosEU data set. The color of the circles indicates the sampling season for each location: ten of the 32 locations were sampled at least twice, once in summer and once in fall (see table 1 and supplementary table S1, Supplementary Material online). Note that some of the 12 Ukrainian locations overlap in the map.
Sample Information for All Populations in the DrosEU Data Set.
| ID | Country | Location | Coll. Date | Number ID | Lat (°) | Lon (°) | Alt (m) | Season |
| Coll. Name |
|---|---|---|---|---|---|---|---|---|---|---|
| AT_Mau_14_01 | Austria | Mauternbach | 2014-07-20 | 1 | 48.38 | 15.56 | 572 | S | 80 | Andrea J. Betancourt |
| AT_Mau_14_02 | Austria | Mauternbach | 2014-10-19 | 2 | 48.38 | 15.56 | 572 | F | 80 | Andrea J. Betancourt |
| TR_Yes_14_03 | Turkey | Yesiloz | 2014-08-31 | 3 | 40.23 | 32.26 | 680 | S | 80 | Banu Sebnem Onder |
| TR_Yes_14_04 | Turkey | Yesiloz | 2014-10-23 | 4 | 40.23 | 32.26 | 680 | F | 80 | Banu Sebnem Onder |
| FR_Vil_14_05 | France | Viltain | 2014-08-18 | 5 | 48.75 | 2.16 | 153 | S | 80 | Catherine Montchamp-Moreau |
| FR_Vil_14_07 | France | Viltain | 2014-10-27 | 7 | 48.75 | 2.16 | 153 | F | 80 | Catherine Montchamp-Moreau |
| FR_Got_14_08 | France | Gotheron | 2014-07-08 | 8 | 44.98 | 4.93 | 181 | S | 80 | Cristina Vieira |
| UK_She_14_09 | United Kingdom | Sheffield | 2014-08-25 | 9 | 53.39 | −1.52 | 100 | S | 80 | Damiano Porcelli |
| UK_Sou_14_10 | United Kingdom | South Queensferry | 2014-07-14 | 10 | 55.97 | −3.35 | 19 | S | 80 | Darren Obbard |
| CY_Nic_14_11 | Cyprus | Nicosia | 2014-08-10 | 11 | 35.07 | 33.32 | 263 | S | 80 | Eliza Argyridou |
| UK_Mar_14_12 | United Kingdom | Market Harborough | 2014-10-20 | 12 | 52.48 | −0.92 | 80 | F | 80 | Eran Tauber |
| UK_Lut_14_13 | United Kingdom | Lutterworth | 2014-10-20 | 13 | 52.43 | −1.10 | 126 | F | 80 | Eran Tauber |
| DE_Bro_14_14 | Germany | Broggingen | 2014-06-26 | 14 | 48.22 | 7.82 | 173 | S | 80 | Fabian Staubach |
| DE_Bro_14_15 | Germany | Broggingen | 2014-10-15 | 15 | 48.22 | 7.82 | 173 | F | 80 | Fabian Staubach |
| UA_Yal_14_16 | Ukraine | Yalta | 2014-06-20 | 16 | 44.50 | 34.17 | 72 | S | 80 | Iryna Kozeretska |
| UA_Yal_14_18 | Ukraine | Yalta | 2014-08-27 | 18 | 44.50 | 34.17 | 72 | S | 80 | Iryna Kozeretska |
| UA_Ode_14_19 | Ukraine | Odesa | 2014-07-03 | 19 | 46.44 | 30.77 | 54 | S | 80 | Iryna Kozeretska |
| UA_Ode_14_20 | Ukraine | Odesa | 2014-07-22 | 20 | 46.44 | 30.77 | 54 | S | 80 | Iryna Kozeretska |
| UA_Ode_14_21 | Ukraine | Odesa | 2014-08-29 | 21 | 46.44 | 30.77 | 54 | S | 80 | Iryna Kozeretska |
| UA_Ode_14_22 | Ukraine | Odesa | 2014-10-10 | 22 | 46.44 | 30.77 | 54 | F | 80 | Iryna Kozeretska |
| UA_Kyi_14_23 | Ukraine | Kyiv | 2014-08-09 | 23 | 50.34 | 30.49 | 179 | S | 80 | Iryna Kozeretska |
| UA_Kyi_14_24 | Ukraine | Kyiv | 2014-09-08 | 24 | 50.34 | 30.49 | 179 | F | 80 | Iryna Kozeretska |
| UA_Var_14_25 | Ukraine | Varva | 2014-08-18 | 25 | 50.48 | 32.71 | 125 | S | 80 | Oleksandra Protsenko |
| UA_Pyr_14_26 | Ukraine | Pyriatyn | 2014-08-20 | 26 | 50.25 | 32.52 | 114 | S | 80 | Oleksandra Protsenko |
| UA_Dro_14_27 | Ukraine | Drogobych | 2014-08-24 | 27 | 49.33 | 23.50 | 275 | S | 80 | Iryna Kozeretska |
| UA_Cho_14_28 | Ukraine | Chornobyl | 2014-09-13 | 28 | 51.37 | 30.14 | 121 | F | 80 | Iryna Kozeretska |
| UA_Cho_14_29 | Ukraine | Chornobyl Yaniv | 2014-09-13 | 29 | 51.39 | 30.07 | 121 | F | 80 | Iryna Kozeretska |
| SE_Lun_14_30 | Sweden | Lund | 2014-07-31 | 30 | 55.69 | 13.20 | 51 | S | 80 | Jessica Abbott |
| DE_Mun_14_31 | Germany | Munich | 2014-06-19 | 31 | 48.18 | 11.61 | 520 | S | 80 | John Parsch |
| DE_Mun_14_32 | Germany | Munich | 2014-09-03 | 32 | 48.18 | 11.61 | 520 | F | 80 | John Parsch |
| PT_Rec_14_33 | Portugal | Recarei | 2014-09-26 | 33 | 41.15 | −8.41 | 175 | F | 80 | Jorge Vieira |
| ES_Gim_14_34 | Spain | Gimenells (Lleida) | 2014-10-20 | 34 | 41.62 | 0.62 | 173 | F | 80 | Lain Guio |
| ES_Gim_14_35 | Spain | Gimenells (Lleida) | 2014-08-13 | 35 | 41.62 | 0.62 | 173 | S | 80 | Lain Guio |
| FI_Aka_14_36 | Finland | Akaa | 2014-07-25 | 36 | 61.10 | 23.52 | 88 | S | 80 | Maaria Kankare |
| FI_Aka_14_37 | Finland | Akaa | 2014-08-27 | 37 | 61.10 | 23.52 | 88 | S | 80 | Maaria Kankare |
| FI_Ves_14_38 | Finland | Vesanto | 2014-07-26 | 38 | 62.55 | 26.24 | 121 | S | 66 | Maaria Kankare |
| DK_Kar_14_39 | Denmark | Karensminde | 2014-09-01 | 39 | 55.95 | 10.21 | 15 | F | 80 | Mads Fristrup Schou |
| DK_Kar_14_41 | Denmark | Karensminde | 2014-11-25 | 41 | 55.95 | 10.21 | 15 | F | 80 | Mads Fristrup Schou |
| CH_Cha_14_42 | Switzerland | Chalet à Gobet | 2014-07-24 | 42 | 46.57 | 6.70 | 872 | S | 80 | Martin Kapun |
| CH_Cha_14_43 | Switzerland | Chalet à Gobet | 2014-10-05 | 43 | 46.57 | 6.70 | 872 | F | 80 | Martin Kapun |
| AT_See_14_44 | Austria | Seeboden | 2014-08-17 | 44 | 46.81 | 13.51 | 591 | S | 80 | Martin Kapun |
| UA_Kha_14_45 | Ukraine | Kharkiv | 2014-07-26 | 45 | 49.82 | 36.05 | 141 | S | 80 | Svitlana Serga |
| UA_Kha_14_46 | Ukraine | Kharkiv | 2014-09-14 | 46 | 49.82 | 36.05 | 141 | F | 80 | Svitlana Serga |
| UA_Cho_14_47 | Ukraine | Chornobyl Applegarden | 2014-09-13 | 47 | 51.27 | 30.22 | 121 | F | 80 | Svitlana Serga |
| UA_Cho_14_48 | Ukraine | Chornobyl Polisske | 2014-09-13 | 48 | 51.28 | 29.39 | 121 | F | 70 | Svitlana Serga |
| UA_Kyi_14_49 | Ukraine | Kyiv | 2014-10-11 | 49 | 50.34 | 30.49 | 179 | F | 80 | Svitlana Serga |
| UA_Uma_14_50 | Ukraine | Uman | 2014-10-01 | 50 | 48.75 | 30.21 | 214 | F | 80 | Svitlana Serga |
| RU_Val_14_51 | Russia | Valday | 2014-08-17 | 51 | 57.98 | 33.24 | 217 | S | 80 | Elena Pasyukova |
Note.—Origin, collection date, season, and sample size (number of chromosomes: n) of the 48 samples in the DrosEU 2014 data set. Additional information can be found in supplementary table S1, Supplementary Material online.
Clinality of Genetic Variation and Population Structure.
| Factor | Latitude | Longitude | Altitude | Season | Moran’s |
|---|---|---|---|---|---|
| π(X) | 4.11 | 1.62 |
| 1.65 | 0.86 |
| π(Aut) | 0.91 | 2.54 |
| 0.16 | −0.86 |
| θ(X) | 2.65 | 1.31 |
| 2.22 | 0.24 |
| θ(Aut) | 0.48 | 1.44 |
| 0.37 | −1.13 |
|
| 0.02 | 0.38 | 5.93 | 3.26 | −2.08 |
|
| 0.09 | 0.76 | 5.33 | 0.71 | −1.45 |
| PC1 | 0.63 |
| 3.64 | 0.75 |
|
| PC2 |
|
|
| 1.68 | −0.32 |
| PC3 | 0.39 | 0.23 |
| 0.28 | 1.38 |
Note.—Effects of geographic variables and/or seasonality on genome-wide average levels of diversity (π, θ, and Tajima’s D; top rows) and on the first three axes of a PCA based on allele frequencies at neutrally evolving sites (bottom rows). The values represent F ratios from general linear models. Italic type indicates F ratios that are significant after Bonferroni correction (adjusted α′=0.0055). Asterisks in parentheses indicate significance when accounting for spatial autocorrelation by spatial error models. These models were only calculated when Moran’s I test, as shown in the last column, was significant.
P < 0.05;
P < 0.01;
P < 0.001.
Fig. 2.Candidate signals of selective sweeps in European populations. The central panel shows the distribution of Tajima’s D in 50-kb sliding windows with 40-kb overlap, with red and green dashed lines indicating Tajima’s D = 0 and −1, respectively. The top panel shows a detail of a genomic region on chromosomal arm 2R in the vicinity of Cyp6g1 and Hen1 (highlighted in red), genes reportedly involved in pesticide resistance. This strong sweep signal is characterized by an excess of low-frequency SNP variants and overall negative Tajima’s D in all samples. Colored solid lines depict Tajima’s D for each sample (see supplementary fig. S2, Supplementary Material online, for color codes, Supplementary Material online); the black dashed line shows Tajima’s D averaged across all samples. The bottom panel shows a region on 3L previously identified as a potential target of selection, which shows a similar strong sweep signature. Notably, both regions show strongly reduced genetic variation (supplementary fig. S1, Supplementary Material online).
Fig. 3.Genetic differentiation among European populations. (A) Average FST among populations at putatively neutral sites. The center plot shows the distribution of FST values for all 1,128 pairwise population comparisons, with the FST values for each comparison obtained from the mean across all 4,034 SNPs used in the analysis. Plots on the left and the right show population pairs in the lower (blue) and upper (red) 5% tails of the FST distribution. (B) PCA analysis of allele frequencies at the same SNPs reveals population substructuring in Europe. Hierarchical model fitting using the first four PCs showed that the populations fell into two clusters (indicated by red and blue), with cluster assignment of each population subsequently estimated by k-means clustering. (C) Admixture proportions for each population inferred by model-based clustering with ConStruct are highlighted as pie charts (left plot) or structure plots (center). The optimal number of three spatial layers (K) was inferred by cross-validation (right plot).
Fig. 4.Manhattan plots of SNPs with q values <0.05 in BayeScEnv association tests with PC1 or PC2 of bioclimatic variables. Vertical lines denote the breakpoints of common inversions. The gene names highlight some candidate genes found in our study and which have previously been identified as varying clinally by Fabian et al. (2012) and Machado et al. (2016) along the North American east coast. Note that q values of 0 (which are infinite on a log-scale) are plotted at the top of each figure, above the gray dash-dotted horizontal lines in order to separate them from the other candidates with q values >0. These zero values are unlikely to be spurious as the densities of these infinite values tend to line up with peaks of log10(q) below the dashed line, suggesting that they represent highly significant continuations of these peaks.
Fig. 5.Mitochondrial haplotypes. (A) Network showing the relationship of five common mitochondrial haplotypes. (B) Estimated frequency of each mitochondrial haplotype in 48 European samples.
Fig. 6.Geographic patterns of structural variants. The upper panel shows stacked bar plots with the relative abundances of transposable elements (TEs) in all 48 population samples. The proportion of each repeat class was estimated from sampled reads with dnaPipeTE (two samples per run, 0.1× coverage per sample). The lower panel shows stacked bar plots depicting absolute frequencies of six cosmopolitan inversions in all 48 population samples.
Clinality and/or Seasonality of Chromosomal Inversions.
| Factor | Latitude | Longitude | Altitude | Season | Moran’s |
|---|---|---|---|---|---|
|
| 2.2 |
|
| 0.89 | −0.92 |
|
| 0.25 |
| 2.88 | 2.43 | 1.25 |
|
|
| 2.82 | 0.62 | 3.6 | −1.61 |
|
|
| 0.75 | 1.42 | 0.04 | 2.79 |
|
| 0.3 | 0.09 | 0.35 | 0.03 | −0.9 |
|
|
| 0.66 | 1.69 | 1.55 | −0.89 |
Note.—The values represent F ratios from binomial generalized linear models to account for frequency data. Underlined italic type indicates deviance values that were significant after Bonferroni correction (adjusted α′=0.0071). Asterisks in parentheses indicate significance when accounting for spatial autocorrelation by spatial error models. These models were only calculated when Moran’s I test, as shown in the last column, was significant.
P < 0.01;
P < 0.001.
Fig. 7.Microbiota associated with Drosophila. Relative abundance of Drosophila-associated microbes as assessed by MGRAST classified shotgun sequences. Microbes had to reach at least 3% relative abundance in one of the samples to be represented.