| Literature DB >> 34950235 |
Trevor M Faske1,2, Alison C Agneray1,2, Joshua P Jahner1, Lana M Sheta1, Elizabeth A Leger1,2, Thomas L Parchman1,2.
Abstract
The spatial structure of genomic and phenotypic variation across populations reflects historical and demographic processes as well as evolution via natural selection. Characterizing such variation can provide an important perspective for understanding the evolutionary consequences of changing climate and for guiding ecological restoration. While evidence for local adaptation has been traditionally evaluated using phenotypic data, modern methods for generating and analyzing landscape genomic data can directly quantify local adaptation by associating allelic variation with environmental variation. Here, we analyze both genomic and phenotypic variation of rubber rabbitbrush (Ericameria nauseosa), a foundational shrub species of western North America. To quantify landscape genomic structure and provide perspective on patterns of local adaptation, we generated reduced representation sequencing data for 17 wild populations (222 individuals; 38,615 loci) spanning a range of environmental conditions. Population genetic analyses illustrated pronounced landscape genomic structure jointly shaped by geography and environment. Genetic-environment association (GEA) analyses using both redundancy analysis (RDA) and a machine-learning approach (Gradient Forest) indicated environmental variables (precipitation seasonality, slope, aspect, elevation, and annual precipitation) influenced spatial genomic structure and were correlated with allele frequency shifts indicative of local adaptation at a consistent set of genomic regions. We compared our GEA-based inference of local adaptation with phenotypic data collected by growing seeds from each population in a greenhouse common garden. Population differentiation in seed weight, emergence, and seedling traits was associated with environmental variables (e.g., precipitation seasonality) that were also implicated in GEA analyses, suggesting complementary conclusions about the drivers of local adaptation across different methods and data sources. Our results provide a baseline understanding of spatial genomic structure for E. nauseosa across the western Great Basin and illustrate the utility of GEA analyses for detecting the environmental causes and genetic signatures of local adaptation in a widely distributed plant species of restoration significance.Entities:
Keywords: common garden; genetic‐environment association; genotyping‐by‐sequencing; landscape genomics; local adaptation; seedling traits
Year: 2021 PMID: 34950235 PMCID: PMC8674890 DOI: 10.1111/eva.13323
Source DB: PubMed Journal: Evol Appl ISSN: 1752-4571 Impact factor: 5.183
FIGURE 1Landscape genomic structure strongly mirrors geography. (a) Map of the sampled localities for the 17 E. nauseosa populations. (b) Principal components analysis (PCA) of genotype probabilities illustrates spatial genomic structure. A Procrustes rotation of the first 2 PC axes onto latitude/longitude displays how spatial genetic structure is strongly predicted by geographic proximity. Colors of points in B match those of the population labels in “a”
Environmental variable abbreviations with descriptions and units used in the text and figures, and the range of values for each variable. Data for these variables are represented more fully in Table S3. Both genetic and phenotypic associations with each environmental variable are given for RDA (loading on the first axis), GF (split importance R 2), and RF (importance R 2). Rank represents the rank of that variable/analysis combination (i.e., 1 is most important, 10 is least). Number of loci is the number most highly correlated with each environmental variable with percent (%) out of the total outlier loci overlapping between RDA and GF (92 loci). Variables are ordered by their loading on the first RDA axis
| Env. variable | Description (units) | Variable range | Genetic‐environment association | Common garden | ||
|---|---|---|---|---|---|---|
| RDA1 (rank) | GF | # loci (%) | RF | |||
| PrcpSeas | Precipitation seasonality | 0.18–0.64 | 0.666 (1) | 0.026 (1) | 52 (56.5%) | 20.82 (1) |
| Slope | Slope (degrees) | 0.68–17.39 | 0.538 (2) | 0.012 (5) | 6 (6.5%) | 1.15 (8) |
| Elevation | Elevation (m) | 1277–2408 | 0.526 (3) | 0.014 (2) | 13 (14.1%) | 4.91 (7) |
| MinVPD | Min. vapor pressure deficit (hPa) | 1.28–3.30 | 0.497 (4) | 0.009 (6) | 3 (3.3%) | 6.59 (6) |
| PrcpAnn | Annual precip. (mm) | 215.19–388.44 | 0.431 (5) | 0.014 (3) | 1 (1.1%) | 6.69 (5) |
| Aspect | Aspect (degrees from North) | 9.46–356.19 | 0.409 (6) | 0.012 (4) | 11 (12%) | 12.31 (2) |
| SoilMax | Max. soil water capacity (cm) | 89.77–163.34 | 0.362 (7) | 0.007 (10) | 1 (1.1%) | 0.00 (9; tie) |
| CumlAET | Cuml. actual evapotranspiration (mm per year) | 185.29–275.46 | 0.299 (8) | 0.008 (9) | 4 (4.3%) | 0.00 (9; tie) |
| TempAnn | Annual temp. (℃) | 7.1–10.7 | 0.267 (9) | 0.008 (8) | 1 (1.1%) | 10.75 (3) |
| HL | Heat load index | 0.81–1.03 | 0.24 (10) | 0.008 (7) | NA | 9.87 (4) |
PrcpSeas classes designated by Walsh and Lawler (1981).
HL calculated from McCune and Keon (2002).
FIGURE 2Neutral and adaptive processes shape landscape genomic structure. Positive relationships between genetic distance and both (a) geographic and (b) environmental distances provide evidence for IBD and IBE. (c) Less evidence for an association between geographic and environmental distance is most likely due to the heterogeneous environments of the Great Basin. (d) Relative contributions of environmental and geographic distance to genetic distance were estimated using a piecewiseSEM. Geographic distance had a slightly larger relative impact on genetic distance (β = 0.379; or 0.293 + [0.286 × 0.300]) than on environmental distance when accounting for indirect effects. Conditional estimates (includes random effects) of variance explained are represented by the r 2 under each response variable
Relevant environmental information for the sampled populations and summary statistics for sampling effort and genetic diversity (θ π). Inbreeding coefficients (F) are not reported, as most are effectively zero (mean = 0.0033). Gray rows indicate populations included in genetic analyses but not in the common garden
| Site name, State | Site abbr. | Precip. seasonality | Annual mean temp. (℃) | Aspect | Elev. (m) |
|
|
|---|---|---|---|---|---|---|---|
| Austin Summit, NV | AS | 0.322 | 7.72 | 244.80 | 2408 | 15 | 0.0206 |
| Bald Mountain, NV | BM | 0.191 | 6.85 | 158.20 | 2245 | 14 | 0.0211 |
| Buena Vista, OR | BV | 0.286 | 8.02 | 127.88 | 1277 | 15 | 0.0209 |
| Dayton Hill, NV | DH | 0.600 | 10.54 | 342.90 | 1400 | 15 | 0.0210 |
| Diamond Crater, OR | DC | 0.267 | 8.01 | 138.37 | 1295 | 14 | 0.0211 |
| East Walker, CA | EW | 0.531 | 8.15 | 82.69 | 2043 | 10 | 0.0195 |
| Finger Rock, NV | FR | 0.185 | 8.45 | 9.46 | 2129 | 15 | 0.0205 |
| Hwy 140, NV | HO | 0.322 | 9.01 | 135.00 | 1423 | 13 | 0.0212 |
| Jones Canyon, NV | JC | 0.406 | 9.25 | 205.02 | 1440 | 15 | 0.0209 |
| Long Valley, CA | LV | 0.579 | 9.02 | 37.41 | 1660 | 12 | 0.0194 |
| Modoc, CA | MD | 0.403 | 8.51 | 315.00 | 1333 | 15 | 0.0202 |
| Patagonia, NV | PT | 0.469 | 10.72 | 24.78 | 1421 | 7 | 0.0204 |
| Peavine Low, NV | PL | 0.643 | 9.09 | 38.66 | 1724 | 14 | 0.0205 |
| Petrified Wash, NV | PW | 0.186 | 9.03 | 266.99 | 1871 | 10 | 0.0213 |
| Smith Creek, NV | SC | 0.175 | 7.73 | 170.75 | 2050 | 14 | 0.0202 |
| Spanish Springs, CA | SS | 0.307 | 7.54 | 356.19 | 1645 | 9 | 0.0195 |
| Virginia Mountains, NV | VM | 0.561 | 9.64 | 145.01 | 1503 | 15 | 0.0209 |
FIGURE 3Environmental variation predicted population structure and was associated with pronounced allele frequency shifts at loci likely to be under selection (a) Redundancy analysis (RDA) of the genotype probabilities for each individual associated with environmental predictor variables (Table 2). Direction and length of arrows correspond to the loadings of each variable on the same two RDA axes. Point colors correspond to population colors in Figure 1. (b) The loadings of each locus onto the same two RDA axes and environmental predictor variables (vectors scaled 8.45× and 0.376× for graphical representation in “a” and “b”, respectively). The colored points indicate loci identified as outliers (±3.5 SD; p < 0.0005) on the first RDA axis. The top‐right inset illustrates the proportion of variance explained (PVE) for each of the first six RDA axes, with the first RDA axis overwhelmingly explaining the most variance. Points are colored to match the environmental predictor of greatest association in both RDA and GF analyses with white representing outliers unique to RDA. The number of outlier loci associated with each variable is represented parenthetically. Environmental correlations can be positive or negative, thus the same colors can exist for outliers on the left and right
FIGURE 4Local adaptation to specific environmental predictors illustrated across both GEA approaches and analyses of phenotypic data from a common garden. (a) Relative importance of each environmental variable (scaled from 0% to 100%, with 100% having the largest contribution) for both the phenotype‐environment and genetic‐environment associations. GEA analyses are represented in solid colors with loadings of the first RDA axis (dark) and split importance (R 2) from GF (light). Relative importance of each environmental variable in the phenotypic analysis is represented by the importance R 2 from RF and represented by the striped bar. The composite variable of seedling emergence, seed weight, and shoot biomass was used in the RF analysis to represent phenotypic variance within the common garden. The rank of each environmental variable within its analysis is displayed to the right of each bar. Concordance among the two GEA approaches is illustrated by strong correlation between the absolute value of the first RDA axis loadings and GF R 2 for each environmental variable (b) as well as across the individual outlier loci (c). Concordance of environment variable association in the common garden with both GEA analyses is displayed in panels “d” and “e”
FIGURE 5Phenotypic data from a common garden suggest local adaptation to precipitation seasonality. (a) Strong associations between seedling emergence and precipitation seasonality suggest local adaptation. Seedling emergence is calculated as the percentage of plants emerged and survived 40 days in each population. (b) Monthly precipitation averages from 30‐year PRISM normals for three populations with the highest (PL), lowest (SC), and median (MD) precipitation seasonality. (c, d) Seed weight and shoot biomass vary by orders of magnitude among the populations. Additionally, their associations with precipitation seasonality (described by Pearson's r and p‐values) are consistent with local adaptation. The line corresponds to the precipitation seasonality estimate of that population. The mean phenotype and 95% bootstrapped confidence intervals are represented by the squares and error bars. The populations are ordered by seedling emergence, illustrating correlation among the phenotypes. Colors are the same as those in the Figure 1 map
Variance partitioning of the individual and shared contributions of genetic, environmental, and geographic variation on phenotypic variation from the common garden experiment. All three predictors captured 69.84% of the total phenotypic variation. Adjusted r 2 represents the individual contribution of the predictor with all others partialled out and the proportion of variance explained (PVE) represents the overall contribution without controlling for interactive effects among the predictors. Models are ordered by the greatest PVE
| Model | Adj. |
|
|---|---|---|
| Pheno ~ Gen + Env +Geo | 20.36 | 69.84 |
| Pheno ~ Gen + Env | 19.52 | 67.28 |
| Pheno ~ Env + Geo | 0.00 | 56.90 |
| Pheno ~ Env | 16.12 | 54.50 |
| Pheno ~ Gen + Geo | 0.00 | 53.72 |
| Pheno ~ Gen | 12.94 | 52.66 |
| Pheno ~ Geo | 2.57 | 21.26 |
| Residuals | 30.16 | – |
Model significance cannot be tested.
Model significance (p ≥ 0.05) using RDA/pRDA.