| Literature DB >> 35991553 |
Andrés J Cortés1, Felipe López-Hernández1, Matthew W Blair2.
Abstract
Leveraging innovative tools to speed up prebreeding and discovery of genotypic sources of adaptation from landraces, crop wild relatives, and orphan crops is a key prerequisite to accelerate genetic gain of abiotic stress tolerance in annual crops such as legumes and cereals, many of which are still orphan species despite advances in major row crops. Here, we review a novel, interdisciplinary approach to combine ecological climate data with evolutionary genomics under the paradigm of a new field of study: genome-environment associations (GEAs). We first exemplify how GEA utilizes in situ georeferencing from genotypically characterized, gene bank accessions to pinpoint genomic signatures of natural selection. We later discuss the necessity to update the current GEA models to predict both regional- and local- or micro-habitat-based adaptation with mechanistic ecophysiological climate indices and cutting-edge GWAS-type genetic association models. Furthermore, to account for polygenic evolutionary adaptation, we encourage the community to start gathering genomic estimated adaptive values (GEAVs) for genomic prediction (GP) and multi-dimensional machine learning (ML) models. The latter two should ideally be weighted by de novo GWAS-based GEA estimates and optimized for a scalable marker subset. We end the review by envisioning avenues to make adaptation inferences more robust through the merging of high-resolution data sources, such as environmental remote sensing and summary statistics of the genomic site frequency spectrum, with the epigenetic molecular functionality responsible for plastic inheritance in the wild. Ultimately, we believe that coupling evolutionary adaptive predictions with innovations in ecological genomics such as GEA will help capture hidden genetic adaptations to abiotic stresses based on crop germplasm resources to assist responses to climate change. "I shall endeavor to find out how nature's forces act upon one another, and in what manner the geographic environment exerts its influence on animals and plants. In short, I must find out about the harmony in nature" Alexander von Humboldt-Letter to Karl Freiesleben, June 1799.Entities:
Keywords: abiotic stress; gene banks; genome-wide environmental scans; genome-wide selection scans (GWSS); genome–environment associations (GEA); genomic prediction (GP); germplasm collections; landraces
Year: 2022 PMID: 35991553 PMCID: PMC9389289 DOI: 10.3389/fgene.2022.910386
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Examples of GEA studies carried out in plant species. This compilation of previous studies explicitly refers to genome–environment association studies (GEA) using the Scopus database https://www.scopus.com/ with the following search parameters: TITLE-ABS-KEY (“Genome – Environment Associations”) AND (LIMIT-TO (DOCTYPE, “ar”)). The table is sorted chronologically. Method abbreviations are shown at the bottom of the table.
| Speciesapproach | Sampling data | Genotypic data | Targeted stress | Environmental data | Analytical | Main finding | References |
|---|---|---|---|---|---|---|---|
| GLM | Tolerance to aridity |
| 3,059 SNPs and 23 SSRs | Aridity index and Thornthwaite index using biovariables | 622 trees | Environmental and genetic data for the identification of functionally important genetic variation within natural populations |
|
| Genome-wide scans | Local adaptation to climate gradients |
| ∼215,000 SNPs | Aridity, temperature, precipitation, radiation, and day length | 948 accessions | Natural adaptive genetic variation in |
|
| Redundancy analysis (RDA) | Local adaptation to climate gradients |
| 214,051 SNPs | Potential evapotranspiration using annual precipitation and a measure of aridity | 1,003 accessions | The climatic structure of SNP correlations is due to changes in coding sequence that may underlie local adaptation |
|
| MLM | Drought and heat stress |
| 1,918,637 SNPs | Biovariables | 202 accessions | Genetic basis of adaptation to drought and heat stress disclosed in |
|
| EMMA | Tolerance to aluminum toxicity and drought stress |
| 404,627 SNPs | Potential evapotranspiration using annual precipitation and a measure of aridity | 1,943 accessions | Genomic signatures of environmental adaptation may be useful for crop improvement, enhancing germplasm identification, and marker-assisted selection |
|
| BayeScan | Local adaptation to climate gradients |
| ∼170,000 SNPs | Variables from | 391 trees | Physical proximity of genes in coadapted complexes may buffer against the movement of maladapted alleles from geographically proximal but climatically distinct populations |
|
| LFMMs, GLM | Local adaptation to climate gradients |
| 144 SNPs—12 SSRs | Environmental index from raw variables in | 79 natural populations | Local adaptation to climate gradients |
|
| Bayenv, Bayescan | Local adaptation to climate gradients |
| 87,218 SNPs | Variables from | 762 trees | Outlier loci putatively under selection detected in populations at the extremity of climatic gradients, and tested |
|
| MLM, GLM | Drought stress |
| 22,845 SNPs | Drought index using Thornthwaite model and annual precipitation | 86 accessions | Genomic signatures of adaptation are useful for germplasm characterization, potentially enhancing future marker-assisted selection, and crop improvement |
|
| BayPass | Abiotic stresses |
| 1,638,649 SNPs | Mean annual temperature; mean coldest month temperature; and precipitations in winter, spring, summer, and autumn | 168 natural populations | The identification of climate-adaptive genetic loci at a micro-geographic scale also highlights the importance to include within-species genetic diversity in ecological niche models for projecting potential species distributional shifts |
|
| LFMMs and GLM | Drought stress |
| 14,409 SNPs | Aridity index using biovariables | 1,249 accessions | Wild individuals have higher ability to resist stress-aridity conditions and could be used to improve the resistance of cultivated varieties |
|
| LFMMs and MSOD-MSR | Drought stress |
| 43,515 SNPs | Biovariables | 202 accessions | The importance of soil in driving adaptation in the system and elucidate the basis of evolutionary potential of |
|
| GLM | Abiotic stress |
| 355,442 SNPs |
| 1,143 accessions | Combining large-scale genomic and ecological data in this diverse maize panel, this study supports a polygenic adaptation model of maize and offers a framework to enhance the understanding of maize adaptation |
|
| SUPERFarmCPU, BLINK, GLM, and MLM | Heat stress |
| 23,373 SNPs | PCA from temperature biovariables, modified heat Thornthwaite index, and heat index | 78 accessions | It is feasible to identify genome-wide environmental associations with modest sample sizes by using a combination of various carefully chosen environmental indices and last-generation GWAS algorithms |
|
| Bayenv2 | Drought and heat stress |
| 14,889 SNPs | Biovariables | 130 accessions | Significant correlation between the number of loci associated with each environmental variable in the GEA, and the importance of each variable in environmental niche modeling |
|
| LFMMs | Drought and heat stress |
| 381 SNPs | Isothermality, mean temperature of the driest quarter, precipitation during the dry season, and precipitation during the wet season | 60 accessions | Genetic variation in |
|
| CANCOR | Drought stress |
| 189,968 SNPs | Environment index | 469 natural populations | CANCOR retrieved 633 outlier loci associated with two climatic gradients, characterized by cold–dry vs. mild–wet winter, and long rainy season vs. long summer, pointing out traits putatively conferring adaptation at the extremes of these gradients |
|
| BLINK | Drought and heat stress |
| 72,190 SNPs | Altitude, annual temperature, and precipitation of accessions’ passport data from | 1,425 accessions | Candidate loci identified with the GEA will have potential utilization for germplasm identification and sorghum breeding for stress |
|
| BAYESCENV | Drought and heat stress |
| 6,120 SNPs | Isothermality, evapotranspiration, temperature seasonality, temperature annual range, annual precipitation, and seasonality precipitation | 139 accessions | Genome-wide data provide new insights into the important role of environmental heterogeneity in accessing the footprints of local adaptation in an ancient relictual species |
|
| LFMMs | Abiotic stress |
| 8,007,303 SNPs | Abiotic index ENVIREM | 94 trees | Climate adaptation in |
|
| BLINK | Drought and heat stress |
| 54,080 SNPs | Altitude, annual temperature, and precipitation | 940 accessions | The current study aimed to better understand the GEA of a large collection of Ethiopian sorghum landraces, characterized with genome-wide SNP markers, to investigate key traits related to adaptation |
|
| FarmCPU | Cold stress |
| 2,936,477 SNPs | Frost-free period and other climatic information | 134 accessions | Significant selective regions and candidate genes were identified, and the potential molecular mechanism of local adaptation to low temperature in woody plants was discussed |
|
| MLM | Drought stress |
| Genes | Drought index using Thornthwaite model and annual precipitation | 52 accessions | The results suggested that tepary bean, specially wild accessions, could be sources of novel alleles for drought tolerance |
|
| MLM | Abiotic stress |
| 28,823 SNPs | Biovariables from | 110 accessions | SNP markers and candidate genes associated with bio-climatic variables should be validated in segregating populations for water MAS |
|
| LFMMs | Abiotic stress |
| 14,160 SNPs | Worldclim.org (WC), The Climatic Research Unit (University of East Anglia) (CRU), The Satellite Application Facility on Climate Monitoring, and The NASA Distributed Active Archive Centre for Biogeochemical Dynamics (DAAC) | 675 accessions | Authors identified a set of candidate genes for adaptation associated with environmental gradients along the distribution range |
|
| LFMMs | Abiotic stress |
| 10,478 SNPs | 202 accessions |
FarmCPU, fixed and random model circulating probability unification; BLINK, Bayesian-information and linkage-disequilibrium iteratively nested keyway; LFMMs, latent factor mixed models; CANCOR, canonical correlation analysis; SUPER, settlement of MLM under progressively exclusive relationship; MSOD-MSR, Moran spectral outlier detection/randomization; EMMA, mixed linear association model; MAS, marker-assisted selection.
Symbol † indicates studies in legume species.
FIGURE 1Analytical pipeline to infer genome-wide signatures of environmental adaptation in crop wild relatives (CWR) and landraces that span heterogeneous environments. The green shaded box refers to gene bank collections, while white, red and blue shaded boxes represent input data, analytical models and output inferences, respectively (Cortés et al., 2020a; Cortés et al., 2020b; Cortés and López-Hernández, 2021). Genomic prediction (GP) and genomic-estimated adaptation values (GEAVs) promise speeding up plant breeding goals.
FIGURE 2An integrated case study inspired in common bean (Phaseolus vulgaris L.) accessions exemplifies how to identify and harness natural signatures of environmental adaptation across diverse genepools. First, (A) genome-wide patterns of genetic divergence, as measured by the F and delta divergence (Roesti et al., 2014) statistics, inform underlying confounding demographic processes across wild accessions (Blair et al., 2012) and landraces (Blair et al., 2009). Even though highly polymorphic markers have traditionally been preferred for demographic inferences (Blair et al., 2009; Kwak and Gepts, 2009), modern SNP genotyping technologies also enable reconstructing the genomic landscape of divergence at a higher resolution (Cortés et al., 2011; Wu et al., 2020). Once potential confounding demographic patterns have been accounted for, (B) it is then feasible to disentangle genuine (in red) signatures of environmental adaptation (Cortés et al., 2018b) from spurious concurring genetic drift due to genomic constraining features such as low recombining regions, reduced effective population size, and translocations (Blair et al., 2018). In order to improve genome–environment associations (GEA), the field has moved from the candidate gene approach (Cortés et al., 2012a; Cortés et al., 2012b; Blair et al., 2016; Buitrago-Bitar et al., 2021) into full genomic scans (Cortés and Blair, 2018a; López-Hernández and Cortés, 2022), which better account for linkage disequilibrium (LD) heterogeneity. It is also advisable to target discrete abiotic pressures by explicitly relying on the mechanistic ecophysiological models (Cortés et al., 2013) such as drought (Cortés and Blair, 2018a) and heat stress (López-Hernández and Cortés, 2022). Finally, (C) these combined summary statistics (i.e., ecophysiological indices, population stratification, and LD) can ultimately redound in prebreeding efforts aiming to introgress exotic adaptive variation into elite lines, for instance, via backcrossing (BC) schemes for abiotic (Muñoz et al., 2003; Blair et al., 2006; Blair and Izquierdo, 2012; Burbano-Erazo et al., 2021) and biotic (Garzon et al., 2008) stresses, all guided with indirect (Miklas et al., 2006) genomic selection tools such as marker-assisted selection (MAS) and genomic selection, GS (Cortés et al., 2020a; Cortés et al., 2020b; Cortés and López-Hernández, 2021) within a moder enviromics approach (Cooper et al., 2021). Different line colors stand for hypothetical distinct chromosomes. Dashed horizontal lines mark significance thresholds.