Literature DB >> 29255303

Genomic basis and evolutionary potential for extreme drought adaptation in Arabidopsis thaliana.

Moises Exposito-Alonso1, François Vasseur1,2, Wei Ding1, George Wang1,3, Hernán A Burbano1,4, Detlef Weigel5.   

Abstract

As Earth is currently experiencing dramatic climate change, it is of critical interest to understand how species will respond to it. The chance of a species withstanding climate change is likely to depend on the diversity within the species and, particularly, whether there are sub-populations that are already adapted to extreme environments. However, most predictive studies ignore that species comprise genetically diverse individuals. We have identified genetic variants in Arabidopsis thaliana that are associated with survival of an extreme drought event-a major consequence of global warming. Subsequently, we determined how these variants are distributed across the native range of the species. Genetic alleles conferring higher drought survival showed signatures of polygenic adaptation and were more frequently found in Mediterranean and Scandinavian regions. Using geo-environmental models, we predicted that Central European, but not Mediterranean, populations might lag behind in adaptation by the end of the twenty-first century. Further analyses showed that a population decline could nevertheless be compensated by natural selection acting efficiently over standing variation or by migration of adapted individuals from populations at the margins of the species' distribution. These findings highlight the importance of within-species genetic heterogeneity in facilitating an evolutionary response to a changing climate.

Entities:  

Mesh:

Year:  2017        PMID: 29255303      PMCID: PMC5777624          DOI: 10.1038/s41559-017-0423-0

Source DB:  PubMed          Journal:  Nat Ecol Evol        ISSN: 2397-334X            Impact factor:   15.460


Ongoing climate change has already shifted latitudinal and altitudinal distributions of many plant species1. Future changes in distributions by local extinctions and migrations are most commonly inferred from niche models that are based on current climate across species ranges2,3. Such approaches, however, ignore that an adaptive response can occur also in situ if there is sufficient variation in genes responsible for local adaptation4–6. The plant Arabidopsis thaliana is found under a wide range of contrasting environments, making it distinctively suited for studying evolutionary adaptation to a changing climate7–9. For the next 50 to 100 years, extreme drought events, potentially one of the strongest climate change-related selective pressures10, are predicted to become pervasive across the Eurasian range of A. thaliana2,11. An attractive hypothesis is that populations from the Southern edge of the species’ range12 provide a reservoir of genetic variants that can make individuals resistant to future, more extreme, climate conditions12,13. To investigate the potential of A. thaliana to adapt to extreme drought events, we first linked genetic variation to survival under an experimental extreme-drought treatment. By combining genome-wide association (GWA) techniques that capture signals of local and/or polygenic adaptation14 with environmental niche models8,15, we then predicted genetic changes of populations under future climate change scenarios. An unexpected result of our predictions is that populations at both the Northern and Southern margins of the species’ range will likely more easily adapt to increased extreme drought events, due to these populations carrying a greater spectrum of drought survival alleles.

Results and Discussion

Differential survival to an extreme drought event

We began by exposing a high-quality subset of 211 geo-referenced natural inbred A. thaliana accessions16 to an experimental extreme drought event during the vegetative phase, which killed the plants before they could reproduce (Table S1). After two weeks of normal growth, plants were challenged by a terminal severe drought for over six weeks and imaged every 2-4 days (Fig. 1A) (see Supplementary Methods section 2). To quantify the rate of leaf senescence, a polynomial linear mixed model was fit to the time series of green pixels per pot (Fig. 1B-D, Video S1). The average genotype deviations from the mean quadratic-term in the model provided the best estimate of this survivorship trait in late stages of drought (Supplementary Fig. 3, see details in Supplementary Methods), ranging from -5 to +5 x 10-4 green pixels/day2. The most sensitive genotypes survived only about 32 days, while the most resilient plants survived about 15 days longer. Genotype-dependent survival probably reflects both constitutive as well as induced drought responses, i.e., both environment-dependent and -independent behaviors of the tested accessions. Additional environments need to be examined in order to disentangle these two types of responses.
Figure 1

Terminal drought treatment and phenotyping of 211 accessions.

(A) Soil water content as measured by sensors in three well spaced experimental trays. Vertical lines indicate dates of image acquisition. (B) Trajectories of total rosette area of 200 randomly chosen pots (see Video S1). Color index according to quadratic parameter in (D). (C) Map projection of the environmental niche model prediction of the quadratic parameter (the drought-survival index) in (D). (D) Decay trajectory modeled with a polynomial regression, with genotypes as random factors, from the day of maximum number of green pixels until the end of the experiment. Each line corresponds to one genotype.

The amount of water available during our drought experiment translates to only about 30-40 mm of monthly rainfall, and as expected, accessions with higher survival come from regions with low precipitation during the warmest season (correlation with climate variable bio18 [www.worldclim.org, ref. 17]: Pearson correlation, r=-0.19, p=0.005), and specifically with low precipitation during May and June (r≤-0.19, p≤0.005) (see Fig. 2A). To further exploit current climatic data, we used 19 bioclimatic variables and random forest models18 for environmental niche modeling (ENM) to predict the geographic distribution of the drought-survival index across Europe (Fig. 1C). Surprisingly, we found that individuals with higher drought survival were not only likely to be present around the Mediterranean, but also at the opposite end of the species’ range in Sweden19 (Fig. 1C, ENM cross-validation accuracy=89%, Table S10). In contrast to the warm-dry Mediterranean climate, Scandinavian dry periods occur on average at freezing temperatures (Supplementary Fig. 12). Consequently, precipitation might occur as snow and soil water content is frozen, thus water is not accessible to plants, producing a physiological drought response20.
Figure 2

Population structure and history of 762 high-quality genomes.

(A) Geographic locations and 11 genetic clusters estimated by ADMIXTURE (k=11 having the lowest cross-validation error). Black indicates less than 40 mm of June rainfall (1960 to 1990 average), which corresponds to the amount of water provided in our drought experiment (Fig. 1). Note areas of very low June rainfall in the Mediterranean basin and along the coast in Scandinavia (partially obscured by colored circles). Cape Verde Islands are shown as inset. (B) Principal Component Analysis of genome-wide SNPs. (C) Effective population sizes in time estimated from MSMC. (D) Population ancestral graph and the first migration trajectory from Treemix.

Survival across geographically structured population lineages

We then studied whether the different genetic lineages of A. thaliana are locally adapted6 to low precipitation regimes via increased drought-survival. Using an extended panel of 762 A. thaliana accessions (Table S1) we carried out genetic clustering21 and studied population size trajectories22 (Fig. 2). This corroborated the existence of a so-called Mediterranean ‘relict’ group12 and ten other derived groups of relictual (e.g. Spanish groups) or other (e.g. Central Europe) origin, as an apparent result of complex migration and admixture processes23. A generalized linear model indicated that genetic group membership explained a significant amount of drought-survival variance (GLM: R2=12.8%; p=4 x 10-5), with the North (N) Swedish and Northeastern (NE) Spanish groups each having on average higher survival than the other groups (t-test p≤0.01). A population graph estimated by Treemix24 suggested a gene flow edge between the Mediterranean and Scandinavian drought-resistant genetic groups, potentially indicative of historical sharing of drought survival alleles (Fig. 2D). Finally, an ENM of the genetic group membership with climatic variables from the accession’s geographic origin confirmed that the most important predictive variable of genetic structure was precipitation during the warmest quarter (bio18), followed by mean temperature of the driest quarter (bio9), and minimum temperature of the coldest month (bio6) (ENM accuracy > 95%. Supplementary Fig. 8 and Table S10). As our results indicate that the deepest genetic split parallels contrasts in local precipitation regimes and ability to survive drought, we expect that decline in rainfall could lead to a future loss of certain genetic groups and/or to turnover of genetic diversity11 (see Fig.12 Supplementary Fig. 8).

The genomic basis of survival

Because the potential of populations to adapt to drought will ultimately depend on specific genetic variants and the selected trait architecture, we identified drought-associated loci with EMMAX25, a genome-wide association (GWA) method. Although genotype-associated variance25 h2 was relatively high, 50%, no individual SNP was significantly associated with drought survival (minimum p~10-7, after FDR or Bonferroni corrections p>0.05) (Supplementary Fig. 5, Table S3). Significant associations in multiple phenotypes have been detected in similarly powered A. thaliana experiments26. While multiple testing adjustment can over-correct p-values and obscure true associations, the absence of significant associations may also be due to (i) polygenic trait architecture, with many small-effect loci27, and/or (ii) confounding by strong population structure, consistent with the association of drought survival with genetic group membership.

Polygenic signal of adaptation

To test for polygenic adaptation, we repeated the GWA analyses with a model that specifically handles both oligo- and polygenic architectures, BSLMM28. BSLMM estimates, among other parameters, the probability that each SNP comes from a group of major-effect loci. Around half of the top non-significant EMMAX SNPs were found to have over 99% probability of belonging to such a major-effect group (Fisher’s exact test of overlap, p=3x10-7; see Supplementary Methods 3.3). We further tested the polygenic hypothesis using the population genetic approach of Berg & Coop14. The test is based on the principle that if populations diverge in a specific trait such as drought-survival that is due to many loci, there should be an orchestrated shift in their allele frequencies. After testing some 60 groups of EMMAX SNP hits of variable size and at different ranks, we detected the most significant signal of polygenic adaptation with the group that included the 151 top SNPs (Table S9). The signal was lost for ranks below the top 300-400 EMMAX SNPs (Table S9). We then compared summary statistics of the top 151 SNPs with background SNPs matched in frequency to avoid GWA discovery biases. The top 151 SNPs showed high Fst values, consistent with allele frequency differentiation between populations (Supplementary Fig. 5). Tajima’s D values were positive (U Mann-Whitney p<0.05), indicating intermediate allele frequencies at the GWA loci (Supplementary Fig. 5), which could be a result of selection favoring alternative alleles in different ecological niches of the species29. The genomic regions containing the top SNPs did not show any evidence for precipitous reductions of haplotypic diversity, as would be expected for hard selective sweeps30 (Supplementary Fig. 5). Together these patterns fit the expectations of local adaptation from a polygenic trait controlled by some hundred loci31 — a scenario that should enable a fast response to new environmental shifts.

Ancestry associations suggest a Mediterranean origin of survival alleles

During local adaptation, the relevant loci diverge due to natural selection across populations, which generates a statistical correlation with population groups32. In this situation, the default correction of population structure applied in GWA might obscure some of the true associations. There are cases where Fst scans can be useful to identify overly divergent loci that could be involved in local adaptation. However, in cases of strong population structure, the mean genome-wide Fst is high32, complicating outlier detection (Supplementary Fig. 4). One can recover relevant variants that are deeply divergent across populations and therefore invisible to conventional GWA by first assigning ancestry to each SNP. Using ChromoPainter33, which relies on linkage disequilibrium information, we segmented each genome in question into its different population ancestries (here 11 groups). The first outcome of this analysis was that individuals from NW and NE Spain and, to a lesser extent, the Southern Mediterranean (Fig. 2A), have inherited many DNA segments from relictual individuals (Supplementary Fig. 7). In a generalized linear model framework, we then tested whether the ancestries of individuals at a SNP coincided with the observed phenotypic differences in drought-survival. Performing this “ancestry” genome-wide association (aGWA) and using a permutation correction of p-values (see Supplementary Methods 3.6), we detected 8 distinct peaks (p<0.001, Fig. 3A) including over 1,000 significant SNPs (70 SNPs after linkage disequilibrium pruning) (Table S4). The most prominent peak was located on chromosome 5 and explained over 20% of the variance in drought survival (Table S4). There was no overlap in top SNPs between GWA and aGWA because they search for different association signals. Our aGWA resembles other admixture mapping techniques34, and might be most useful for associations in scenarios of adaptive introgression and local adaptation. Although we do not know yet whether our observations can be generalized, our work demonstrates the power of using alternative GWA approaches in situations where adaptive variation is expected to be tightly linked to population history and structure.
Figure 3

Ancestry GWA of drought survival and environmental predictions.

(A) Manhattan plot of SNPs from ancestry GWA (aGWA) after permutation correction of p-values. Dashed lines indicate significant thresholds at p<0.05, 0.01, and 0.001. (B) Top, Neighbour-joining phylogeny of 1,000 concatenated genome-wide SNPs compared with a phylogeny of all significant aGWA SNPs (ca. 1,000). Colors indicate population clusters (Fig. 2). Relicts and N. Swedish groups are highlighted. Bottom, genetic distances for genome-background SNPs or aGWA SNPs. (C) Environmental niche models of 70 top aGWA SNPs (after LD pruning), trained with climate averages from 1960-1990, and then (D) used to forecast gain or loss of alleles in 2070 under free migration. (E) Discrepancy of alleles that can be gained by 2070 between the geographically constrained (PCA control) model and the free migration model.

To understand the origin of aGWA-identified SNPs, we constructed trees for all concatenated aGWA SNPs and for genome-wide background SNPs. Although the individuals from both the warm (Iberia and relicts) and cold (Scandinavia) edges of the species distribution are far apart in genome-wide SNPs, they are closely related in drought-associated SNPs (Fig. 3B). Overall, this is consistent with a common Mediterranean origin of drought-adaptive genetic variants of both Northern and Southern individuals (Fig. 2D, Fig. 3B), and highlights the relevance of populations at the latitudinal extremes of the species range as a possible genetic reservoir for future climate change adaptation12.

Drought survival is a resilience trait independent on phenology

Drought adaptation can be accomplished by diverse mechanisms, with cross-stress resistance being pervasive35. An annual life history enables drought survival through an escape strategy based on the acceleration of the life cycle from germination to flowering and seed production. An alternative strategy, the avoidance strategy, is employed by many xeric perennials with increased water efficiency36. Previous drought experiments with A. thaliana have shown that both strategies exist, although early flowering, which is associated with an escape strategy, was more favourable under water-limiting conditions37,38. In our experiment, drought-survival was not negatively correlated with flowering time in unstressed conditions39 (Pearson correlation, r=0.07, p=0.12). Although a correlation was not significant at the individual ecotype level, the GWA effect sizes of drought-survival for the top 151 SNPs were positively correlated with the ability of the same SNPs to delay flowering (Pearson correlation, r=0.51, p=1x10-11, see Supplementary Methods 3.4). Given the described trade-off between escape by flowering and water use efficiency in A. thaliana37,40,41, our drought-survival index might be related to the avoidance strategy, although this needs to be tested with specific physiological experiments (Supplementary Fig. 11, Table S6). Gene enrichment analysis revealed a weak signal for membrane transport (see Supplementary Methods 3.7). Adjustment of osmotic balance through cell membrane transport is a drought avoidance mechanism42 that might also confer cross-tolerance to other abiotic stresses43. Therefore, it might be of relevance for Scandinavian A. thaliana accessions or other populations in extreme environments (Supplementary Fig. 12)19.

Forecast of genetic changes to global warming reveals regional differences in evolutionary potential

It is expected that populations with increased survival to severe abiotic stresses should have an evolutionary advantage in face of the predicted increase in drought frequency and intensity both around the Mediterranean and in Europe, which will constitute a critical hazard for many plants2,11, including A. thaliana. Surprisingly, environmental niche models (ENM) of species distributions, which have been used to predict future changes of species’ ranges 2,3, do not usually include information of within-species diversity that can lead to adaptation from standing variation44–46. This could in turn lead to overestimates of extinction rates47–49. By fitting ENMs of current climate with SNP data, using a similar rationale as for the “climate GWA” of Hancock and colleagues7, we attempted to forecast the most likely genetic makeup under current and future climate conditions. We trained one ENM for each of the 151 GWA and 70 aGWA drought-associated SNPs to predict which allele, either the high or the low survival one, is more likely, given a set of environmental variables (all ENM 5CV accuracy >92%; Table S3-4, Supplementary Fig. 13-16). Consequently, from each model, we geographically mapped the potential distribution of the high survival allele using available environmental datasets (www.worldclim.org, ref. 17). Finally, concatenating the resulting 221 maps, we inferred the most likely individual genotype at each location. At present, individuals from both northern and southern edges of the species’ Eurasian and N. African range are predicted to harbor more drought-survival alleles than those located in between (Fig. 3C, Supplementary Fig. 15-16, with the quadratic term in a regression of allele count on latitude being positive at p=10-3), corroborating our previous observations. Using the trained ENM, we also forecast the distribution of the 221 drought-survival alleles in 2070 (rpc 8.5, IPCC, www.ipcc.ch, ref. 17). While it was expected that populations in the Mediterranean Basin need to become more drought resistant11, our predictions anticipate a greater increase in the total number of drought-survival alleles for Central Europe (Fig. 3, Supplementary Fig. 14-15). This is because by 2070 rainfall in Central Europe will likely become more similar to that in the Mediterranean2,11 (Supplementary Fig. 12). Because some drought-survival alleles are currently not present in Central Europe, we speculated that gene migration might be necessary to facilitate adaptation to future conditions50. An underlying assumption of the ENM is that alleles will be present wherever required by the environment, but this assumption of “universal migration” may not be realistic for future predictions if the presence of alleles is currently geographically restricted. We therefore included two geographic boundary conditions in the ENM to generate alternative models that were either more or less “migration-limited” (see Supplementary Methods 4.2). After fitting all possible models and predicting allele distributions with future climate, we calculated the difference of predicted allele presence per map grid cell between the naïve, free migration ENM and the two geographically constrained ones (Fig. 3D-E). If an allele has currently a narrow distribution or is specific to a certain genetic background, its future presence in an area might not be predicted by the constrained models, even though the climate variables coincide with the SNP’s environmental range. Such a scenario seems to apply to Central Europe, as the deficit in drought-survival alleles predicted by the free over the constrained models was 8-30% (18-66 out of 221) (Fig. 3E; with the quadratic term in a regression of the allele count difference on latitude being negative at p<10-10). Central European populations may therefore be under threat of lagging adaptation by the end of the 21st century. In the end, for a population to persist, not only must drought-survival alleles be present locally, but they also need to increase in frequency51. The chance of this occurring will depend on current local allele frequencies and the strength of natural selection favouring the drought-survival alleles. Therefore, we studied current allele frequencies at three representative locations with the highest sampling density in our dataset (40 samples within a 50 kilometer area): Madrid (Spain), Tübingen (Germany) and Malmö (Sweden), which are near the southern edge, center and northern edge of the Eurasian and N. African range, respectively. Based on ENM predictions, we calculated allele frequency changes from present to 2070. Frequencies are predicted to increase significantly only in the Tübingen population (Student's t test, p<10-16, Table S11), but not in Madrid and Malmö, indicating that these two populations might already be adapted to the future local climate. Although not all drought-associated alleles are found in Tübingen (32 of 70 aGWA SNPs and 136 of 151 GWA SNPs), increasing the number of the alleles in single genotypes should be feasible, since there are already single genotypes that have 24 (aGWA) and 123 (GWA) of these alleles (see Supplementary Methods 4.2). Running 50-generations simulations starting at the present Tübingen frequency of each of the drought-survival alleles and assuming a range of selection coefficients, we estimated that a 1-3% of fitness advantage on average would be necessary to increase frequencies to match those of the adapted Madrid and Malmö populations (Supplementary Fig. 17, see Supplementary Methods 4.2). Such selection could take place efficiently when populations are large, as is typical for highly-proliferative weeds51,52.

Conclusion

Leveraging the genetic resources available for A. thaliana, we have begun to address the question of how climate change will affect biodiversity. We provide evidence for the possibility of adaptive genetic variation to extreme drought events from standing variation. Specifically, we found that drought survival in A. thaliana has a polygenic basis and that favorable alleles are more abundant toward the edges of the species’ distribution range. Extreme adaptation at range edges might thus be critical for a species’ persistence under climate change. Although many aspects of future adaptation are not considered here, namely non-drought related or seasonal climate change51, biotic interactions, phenotypic plasticity, or novel adaptive mutations, our spatially explicit analyses emphasize the potential of adaptive evolution from standing variation to mitigate climate change’s detrimental effects.

Methods

Study populations

211 natural inbred lines from the 1001 Genomes project16 were grown in a terminal drought experiment, and 762 lines were analyzed for genetic structure and genome-environment models. These two subsets were selected based on sequence quality and homogeneity of geographic distribution (see Supplementary Methods 1.1). We retrieved the genomes corresponding to the above natural lines from http://1001genomes.org/data/GMI-MPI/releases/v3.1/ and extracted the biallelic SNPs with >95% calling rate. This resulted in keeping ~4M SNP.

Genetic structure

To understand the genetic structure of Arabidopsis thaliana we ran, on the 762 samples, the software ADMIXTURE v1.2 (ref. 21) assuming two to 20 groups and using a 5-fold cross-validation procedure. The number of groups with the smallest cross-validation error was 11 (Fig. 2, Fig. Supplementary Table 1, Supplementary Fig. 8). We computed a genomic PCA using PLINK v1.9 (ref. 53). The three first PC axes explained 33.5% of the genomic variance (see Supplementary Methods 3). We used genomes with probability >0.9 of assignment to one of the 11 ADMIXTURE groups to run MSMC v.3 (ref. 22). This was done in quartets of genomes, i.e. four genomes for within-population coalescent mode, and two genomes of each of two populations for the cross-coalescent mode (Fig. 2, Supplementary Fig. 5). Using the 11 genetic groups as population lineages, we run Treemix assuming zero to five migration edges24 (Fig. 2, Supplementary Fig. 5)

Terminal drought experiment

Stratified seeds from the selected 211 natural lines were sown in greenhouse pots and abundantly watered every three days during two weeks. Thereafter watering only occurred every three weeks, which dramatically reduced soil water content (Fig. 1, Supplementary Methods 1.2). Top-view photographs of the potting trays were done at 20 timepoints during the whole experiment with a high resolution Panasonic DMC-TZ61 digital camera mounted in a closed black box setting to ensure image consistency (Supplementary Methods 2). Using customized Python scripts and the module Open Computer Vision, we segmented the green plant-leave pixels from the brown soil background to monitor plant area over time (Supplementary Video). Starting from the day with the largest rosettes areas, until the end of the experiment, we modeled the decay of green area (i.e. # pixels) using a polynomial generalized linear mixed model with Poisson link as described in the MCMCglmm R package v.2.25 (see Supplementary Methods 2). The random genotype effects captured the average deviation of each genotype from a general intercept, slope and quadratic curvature. After calculating the heritability of each of the three coefficient deviations and their correlation with the genotype’s climate variables of origin, we understood that it was the quadratic curvature that was the most suitable to use as index of survival (Supplementary Methods 2).

Genome-Wide Association (GWA)

Using the index of survival per genotype as the trait and the SNPs with a minimum allele frequency > 5% as predictors (n=879,654 SNPs), we carried out associations using the linear mixed model implemented in EMMAX software25 to find SNPs that excessively contributed to the prediction of survival of genotypes (Supplementary Table 3) (see Supplementary Methods 3.3). To corroborate the identified top SNPs we also performed a Bayesian Sparse Linear Mixed Model (BSLMM) with GEMMA software28. Both approaches, EMMAX and GEMMA, fit a model as: Y = X + Zu + є, where Y is the vector of trait values, X is the alternative allele dosage at SNP i and β the allelic effect of SNP i on the trait. Population structure is corrected with a random genotype term (of 211 levels) represented by u, which follows a Multivariate Normal distribution where is A the relationship matrix between all individual genotypes built from SNP information and is the genotype-associated variance. Different from EMMAX, the BSLMM model considers that the β coefficients follow a mixture of two distributions, one that expects many small effects and another that generates few strong effects. To determine whether the top SNPs identified in the GWA might have been subject to polygenic adaptation, we used the method from Berg & Coop14. We did this for several groupings of top SNPs and reported the group that yielded the strongest signal (see all results in Supplementary Table 9). Using painted chromosomes generated using ChromoPainter v. 2.0.7 (ref. 33), we carried out another set of associations between the survival trait and the local ancestry category (11 groups) of a chunk of the genome. We used a linear model, Y = μ + Xβ + є, and reported the positions in the genome with the least mean square error (i.e. highest R2) (Supplementary Table 4). To compute p-values, we took an empirical p-value distribution approach based on 1,000 random permutation runs (see Supplementary Methods 3.6). To understand the ancestry of the associated genomic positions, we concatenated the SNP genotypes of the top-associated positions, computed genetic distances between natural lines and generated a Neighbour Joining tree. This tree was compared with a tree built from an equal number of randomly-picked background SNPs.

Genome-wide diversity and selection summary statistics

We calculated genome-wide Fst among the ADMIXTURE-defined groups and Tajima’s D with PLINK v1.9 (ref. 53) and likelihood of a selective sweep with SweeD (ref. 30). We investigated the enrichment of the top SNPs in the upper tail of the distributions of those statistics by calculating a right-tailed t-test in contrast with genome-background SNPs with the same frequency values (Supplementary Fig. 4, Supplementary Table 3, rank columns).

Environmental Niche Models

We used classification and regression Random Forest models implemented in the randomForest R package, available environmental databases www.worldclim.com v.1.4 (ref. 17, 19 bioclimatic variables at 2.5 arc-minutes resolution), and geographic locations of GWA-identified alleles, to fit environmental niche models (ENM). To evaluate model’s predictive ability for each allele, we used a 5-fold cross-validation procedure in which ⅘ parts of the data were used to train the model and ⅕ was used to test it. This enabled us to assign a percentage of successful assignment of an allele given the environmental variables at a location (Supplementary Tables 3-4). The fitted Random Forest model was used to generate potential geographic distributions of survival-associated alleles which, all overlapped, provided a geographic map of density of survival alleles. Using existing predictions of the same 19 bioclimatic variables to 2050 and 2070 under both low (2.6 rcp) and high (8.5 rcp) CO2 accumulation scenarios, we re-predicted the distribution of alleles to the different future scenarios using the previously fitted Random Forest models. Because of the implicit assumption of free movement of alleles, we generated two additional models per SNP: (1) ENM including the latitude and longitude variables in the Random Forest models and (2) ENM including the three first PC axes geographically modeled with present day climate (see below). By repeating predictions with future climate data, but keeping the latitude, longitude and PC components constant, some alleles would not be predicted in areas where the appropriate environment exists but which are outside of the current geographic distribution (1) or current local genomic background (2) (see Supplementary Methods 4, Supplementary Fig. 13-16). Apart from the potential distribution of putatively adaptive alleles, we also modeled the geographic distribution of continuous traits, namely the aforementioned PCA components of population structure or the index of survival under drought itself. In those cases the Random Forest was of the regression type and the predictive ability was computed for the test data calculating the squared Pearson’s correlation coefficient between predicted and true values (see Supplementary Methods 4). To complement observations of presence and absence of alleles from ENM predictions, we carried out Wright-Fisher simulations of single biallelic SNPs (for details see Supplementary Methods 4.2.4). We ran simulations for 50 discrete generations. The population size was assumed of 300,000 plants, as inferred from diversity data, and was constant over time. Fitness was only determined by the selection coefficient of the drought alleles, which varied from 0 to 20% in an array of simulation runs. The starting frequency of the allele was set equal to the present day frequency of all natural lines sampled in a given geographic area (e.g., Tübingen). These simulations could be extended in the future to incorporate joint fitness effects from multiple adaptive mutations and complex environment-driven demographic processes (Supplementary Methods 4.2.4).

Code availability

Code for the image analysis pipeline available at http://github.com/MoisesExpositoAlonso/hippo with DOI: https://doi.org/10.5281/zenodo.1039888, code for ancestryGWA is available at https://github.com/MoisesExpositoAlonso/aGWA with DOI: https://doi.org/10.5281/zenodo.1039882, code for Wright-Fisher population simulations at http://github.com/MoisesExpositoAlonso/popgensim with DOI: https://doi.org/10.5281/zenodo.1039886.

Data availability

Phenotypic datasets available in the Supplementary Dataset. Processed genome matrices are available at http://1001genomes.org/data/GMI-MPI/releases/v3.1/. Raw reads are stored in the www.ncbi.nlm.nih.gov/sra archive under the ID number: SRP056687.
  43 in total

1.  A quantitative survey of local adaptation and fitness trade-offs.

Authors:  Joe Hereford
Journal:  Am Nat       Date:  2009-05       Impact factor: 3.926

Review 2.  Climate change and evolutionary adaptation.

Authors:  Ary A Hoffmann; Carla M Sgrò
Journal:  Nature       Date:  2011-02-24       Impact factor: 49.962

3.  Conserving biodiversity under climate change: the rear edge matters.

Authors:  Arndt Hampe; Rémy J Petit
Journal:  Ecol Lett       Date:  2005-05       Impact factor: 9.492

4.  Genetics of water use physiology in locally adapted Arabidopsis thaliana.

Authors:  Julius P Mojica; Jack Mullen; John T Lovell; J Grey Monroe; John R Paul; Christopher G Oakley; John K McKay
Journal:  Plant Sci       Date:  2016-04-19       Impact factor: 4.729

5.  The association among gene expression responses to nine abiotic stress treatments in Arabidopsis thaliana.

Authors:  William R Swindell
Journal:  Genetics       Date:  2006-10-08       Impact factor: 4.562

6.  Ecological genomics meets community-level modelling of biodiversity: mapping the genomic landscape of current and future environmental adaptation.

Authors:  Matthew C Fitzpatrick; Stephen R Keller
Journal:  Ecol Lett       Date:  2014-09-30       Impact factor: 9.492

7.  Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines.

Authors:  Susanna Atwell; Yu S Huang; Bjarni J Vilhjálmsson; Glenda Willems; Matthew Horton; Yan Li; Dazhe Meng; Alexander Platt; Aaron M Tarone; Tina T Hu; Rong Jiang; N Wayan Muliyati; Xu Zhang; Muhammad Ali Amer; Ivan Baxter; Benjamin Brachi; Joanne Chory; Caroline Dean; Marilyne Debieu; Juliette de Meaux; Joseph R Ecker; Nathalie Faure; Joel M Kniskern; Jonathan D G Jones; Todd Michael; Adnane Nemri; Fabrice Roux; David E Salt; Chunlao Tang; Marco Todesco; M Brian Traw; Detlef Weigel; Paul Marjoram; Justin O Borevitz; Joy Bergelson; Magnus Nordborg
Journal:  Nature       Date:  2010-03-24       Impact factor: 49.962

Review 8.  Mapping of disease-associated variants in admixed populations.

Authors:  Daniel Shriner; Adebowale Adeyemo; Edward Ramos; Guanjie Chen; Charles N Rotimi
Journal:  Genome Biol       Date:  2011-05-30       Impact factor: 13.583

Review 9.  Membrane transporters and drought resistance - a complex issue.

Authors:  Karolina M Jarzyniak; Michał Jasiński
Journal:  Front Plant Sci       Date:  2014-12-04       Impact factor: 5.753

10.  Image-based methods for phenotyping growth dynamics and fitness components in Arabidopsis thaliana.

Authors:  François Vasseur; Justine Bresson; George Wang; Rebecca Schwab; Detlef Weigel
Journal:  Plant Methods       Date:  2018-07-26       Impact factor: 4.993

View more
  33 in total

1.  Fitness variation across subtle environmental perturbations reveals local modularity and global pleiotropy of adaptation.

Authors:  Grant Kinsler; Kerry Geiler-Samerotte; Dmitri A Petrov
Journal:  Elife       Date:  2020-12-02       Impact factor: 8.140

2.  What processes must we understand to forecast regional-scale population dynamics?

Authors:  Jesse R Lasky; Mevin B Hooten; Peter B Adler
Journal:  Proc Biol Sci       Date:  2020-12-09       Impact factor: 5.349

3.  Climate change is predicted to disrupt patterns of local adaptation in wild and cultivated maize.

Authors:  Jonás A Aguirre-Liguori; Santiago Ramírez-Barahona; Peter Tiffin; Luis E Eguiarte
Journal:  Proc Biol Sci       Date:  2019-07-10       Impact factor: 5.349

Review 4.  The evolutionary genomics of species' responses to climate change.

Authors:  Jonás A Aguirre-Liguori; Santiago Ramírez-Barahona; Brandon S Gaut
Journal:  Nat Ecol Evol       Date:  2021-08-09       Impact factor: 15.460

5.  Plant adaptation to climate change - Where are we?

Authors:  Jill Anderson; Bao-Hua Song
Journal:  J Syst Evol       Date:  2020-06-18       Impact factor: 4.098

6.  Evolutionary footprints of a cold relic in a rapidly warming world.

Authors:  Eva Wolf; Emmanuel Gaquerel; Mathias Scharmann; Levi Yant; Marcus A Koch
Journal:  Elife       Date:  2021-12-21       Impact factor: 8.140

7.  Into the range: a latitudinal gradient or a center-margins differentiation of ecological strategies in Arabidopsis thaliana?

Authors:  Aurélien Estarague; François Vasseur; Kevin Sartori; Cristina C Bastias; Denis Cornet; Lauriane Rouan; Gregory Beurier; Moises Exposito-Alonso; Stéphane Herbette; Justine Bresson; Denis Vile; Cyrille Violle
Journal:  Ann Bot       Date:  2022-02-11       Impact factor: 4.357

Review 8.  Arabis alpina: A perennial model plant for ecological genomics and life-history evolution.

Authors:  Stefan Wötzel; Marco Andrello; Maria C Albani; Marcus A Koch; George Coupland; Felix Gugerli
Journal:  Mol Ecol Resour       Date:  2021-09-07       Impact factor: 8.678

9.  Climate as a driver of adaptive variations in ecological strategies in Arabidopsis thaliana.

Authors:  François Vasseur; Kevin Sartori; Etienne Baron; Florian Fort; Elena Kazakou; Jules Segrestin; Eric Garnier; Denis Vile; Cyrille Violle
Journal:  Ann Bot       Date:  2018-11-30       Impact factor: 4.357

10.  Adaptive diversification of growth allometry in the plant Arabidopsis thaliana.

Authors:  François Vasseur; Moises Exposito-Alonso; Oscar J Ayala-Garay; George Wang; Brian J Enquist; Denis Vile; Cyrille Violle; Detlef Weigel
Journal:  Proc Natl Acad Sci U S A       Date:  2018-03-14       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.