| Literature DB >> 28035257 |
Simon Nadeau1, Patrick G Meirmans2, Sally N Aitken3, Kermit Ritland3, Nathalie Isabel4.
Abstract
Accurately detecting signatures of local adaptation using genetic-environment associations (GEAs) requires controlling for neutral patterns of population structure to reduce the risk of false positives. However, a high degree of collinearity between climatic gradients and neutral population structure can greatly reduce power, and the performance of GEA methods in such case is rarely evaluated in empirical studies. In this study, we attempted to disentangle the effects of local adaptation and isolation by environment (IBE) from those of isolation by distance (IBD) and isolation by colonization from glacial refugia (IBC) using range-wide samples in two white pine species. For this, SNPs from 168 genes, including 52 candidate genes for growth and phenology, were genotyped in 133 and 61 populations of Pinus strobus and P. monticola, respectively. For P. strobus and using all 153 SNPs, climate (IBE) did not significantly explained among-population variation when controlling for IBD and IBC in redundancy analyses (RDAs). However, 26 SNPs were significantly associated with climate in single-locus GEA analyses (Bayenv2 and LFMM), suggesting that local adaptation took place in the presence of high gene flow. For P. monticola, we found no evidence of IBE using RDAs and weaker signatures of local adaptation using GEA and FST outlier tests, consistent with adaptation via phenotypic plasticity. In both species, the majority of the explained among-population variation (69 to 96%) could not be partitioned between the effects of IBE, IBD, and IBC. GEA methods can account differently for this confounded variation, and this could explain the small overlap of SNPs detected between Bayenv2 and LFMM. Our study illustrates the inherent difficulty of taking into account neutral structure in natural populations and the importance of sampling designs that maximize climatic variation, while minimizing collinearity between climatic gradients and neutral structure.Entities:
Keywords: Pinus; genetic‐environment associations; isolation by colonization; isolation by environment; landscape genetics; local adaptation
Year: 2016 PMID: 28035257 PMCID: PMC5192886 DOI: 10.1002/ece3.2550
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1White pine tree (Pinus strobus) along the road (Maine, USA)
Figure 2Sampling locations for Pinus strobus and P. monticola. Populations are colored according to their genetic group membership detected using STRUCTURE for K = 2 (Nadeau et al., 2015)
Description of climatic variables obtained for all sampled populations
| Climatic variable | Units | |
|---|---|---|
| DD5 | Degree‐days above 5°C | °C |
| TD | Temperature difference between mean warmest month temperature and coldest month temperature, or continentality | °C |
| bFFP | Beginning of frost‐free period | Julian date |
| eFFP | End of frost‐free period | Julian date |
| MSP | Mean summer precipitation | mm |
| PAS | Precipitation as snow | mm |
| CMD | Hargreaves climatic moisture deficit | mm |
| Elev | Elevation | m |
Figure 3(a) Pinus strobus and (b) P. monticola: principal component analysis (PCA) including seven climatic variables obtained for available samples in seed banks and provenance trials (see Nadeau et al., 2015). Variation along PC1 (x‐axis) and PC2 (y‐axis) was used to select samples for genotyping in order to cover a wide range of environmental variation. Genotyped populations are colored according to their genetic group membership as in Figure 2. Available populations that were not genotyped (gray dots) were either not sampled or failed genotyping. Ellipses represent the 95% confidence intervals for each group. Insets show the proportion of variation explained by each PC
Number of outlier SNPs detected using BayeScan, Bayenv2, and LFMM in Pinus strobus and P. monticola. A false discovery rate of 5% was used for BayeScan and LFMM, and Bayes factor >3 was used for Bayenv2
|
|
| |
|---|---|---|
| BayeScan | ||
| Divergent | 2 | 1 |
| Balancing | 3 | 0 |
| Total (%) | 5 (3.3) | 1 (0.6) |
| Bayenv2 (%) | 12 (7.8) | 12 (7.6) |
| LFMM (%) | 19 (12.4) | 6 (3.8) |
| Total (%) | 29 (19.0) | 18 (11.4) |
aNumbers in parentheses indicate the proportion of outlier SNPs (number of outlier SNPs/number of SNPs tested).
Figure 4(a) Pinus strobus and (b) P. monticola: proportion of tested SNPs associated with each climatic variable by Bayenv2 (Bayes factor >3) and LFMM (q < 0.05)
Highly supported candidate SNPs, that is, detected by a minimum of two different methods. Variable names in the “Bayenv2” and “LFMM” columns refer to climatic variables that were significantly associated with the SNPs
| SNP | Gene | Bayescan | Bayenv2 | LFMM | SNP annotation | Putative gene function | Candidate for growth/phenology in |
|---|---|---|---|---|---|---|---|
|
| |||||||
| N‐029 | 0_6047_02 | div*** | DD5, TD, bFFP, eFFP, PAS, CMD**** | DD5, bFFP, eFFP, PAS, TD, CMD, MSP, Elev**** | na | basic helix‐loop‐helix (bHLH) DNA‐binding superfamily protein | No |
| G‐014 | GQ0081.BR.1_D09 | div*** | DD5, MSP, PAS, Elev** | DD5* | NS | Plastid movement impaired1‐related1 (PMIR1); plant‐specific C2 domain containing gene family | No |
| M‐015 | 0_8683_01 | ns | DD5, bFFP, PAS, CMD**** | PAS, CMD, DD5, bFFP*** | NS | Serine–threonine‐protein kinase at1g18390‐like | Yes |
| M‐016 | 0_8683_01 | ns | TD* | CMD, PAS* | NS | Serine–threonine‐protein kinase at1g18390‐like | Yes |
| M‐017 | 0_8844_01 | ns | eFFP, bFFP, DD5** | DD5* | Intron | Galacturonosyltransferase 13‐like | Yes |
|
| |||||||
| S‐007 | CL3539‐Contig1_01 | div* | Elev* | ns | Intron | TOM1‐like protein 2 | Yes |
ns, nonsignificant; S, synonymous SNP; NS, nonsynonymous SNP; na, not annotated (no blast hit).
BayeScan and LFMM: *q < 0.05; **q < 0.01; ***q < 0.001, ****q < 0.0001. Bayenv2: *BF > 3; **BF > 10; ***BF > 32; ****BF > 100.
aBased on a blastn against the P. glauca gene catalogue (see “Materials and Methods”).
bRefSeq annotation; cTAIR annotation of the Picea glauca best ortholog is provided when there was no significant hit on RefSeq.
Genes containing outlier SNPs (any of BayeScan, Bayenv2, or LFMM) in both Pinus monticola and Pinus strobus
| SNP | Gene |
|
| SNP annotation | Putative gene function (RefSeq) | Candidate for growth/phenology in | ||
|---|---|---|---|---|---|---|---|---|
|
| GEA |
| GEA | |||||
| T‐019 | 2_4724_01 | ns | DD5, bFFP, eFFP*,
| ns | ns | Intron | Serine–threonine‐protein kinase‐se HT1‐like | Yes |
| S‐021 | 2_4724_01 | – | – | ns | bFFP, Elev*,
| Intron | Serine–threonine‐protein kinase‐se HT1‐like | Yes |
| N‐033 | 0_7001_01 | ns | DD5, eFFP, bFFP, PAS**,
| – | – | NS | NADPH‐dependent diflavin oxidoreductase ATR3‐like isoform 2 | No |
| P‐034 | 0_7001_01 | – | – | ns | TD, eFFP*,
| S | NADPH‐dependent diflavin oxidoreductase ATR3‐like isoform 2 | No |
| O‐027 | 2_9665_01 | ns | bFFP*,
| – | – | NS | Interferon‐induced guanylate‐binding protein | No |
| Q‐032 | 2_9665_01 | – | – | ns | PAS*,
| S | Interferon‐induced guanylate‐binding protein | No |
ns, nonsignificant; “–”, not tested because the SNP was not genotyped or was monomorphic in this species; S, synonymous SNP; NS, nonsynonymous SNP; na, not annotated (no blast hit).
aBased on a blastn against the P. glauca gene catalogue (see “Materials and Methods”).
bSNP detected by Bayenv2; *BF > 3; **BF > 10; ***BF > 32; ****BF > 100; ****BF > 100: cSNP detected by LFMM: *q < 0.05; **q < 0.01; ***q < 0.001, ****q < 0.0001.
Mantel and partial Mantel tests in Pinus strobus and P. monticola. Correlation coefficients (r) between (1) genetic distance (Y) and geographic distance (D); and (2) between genetic distance (Y) and each of the eight climatic variables after controlling for D
| Test |
|
| ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| Y ~ D | .274 | .001*** | .014* | .339 | .001*** | .009** |
| Y ~ DD5 | D | .073 | .162 | .208 | .081 | .149 | .447 |
| Y ~ TD | D | .158 | .008** | .036* | −.164 | 1 | 1 |
| Y ~ bFFP | D | .106 | .048* | .108 | .017 | .362 | .684 |
| Y ~ eFFP | D | .107 | .032* | .096● | −.022 | .608 | .684 |
| Y ~ MSP | D | .081 | .133 | .200 | −.010 | .469 | .684 |
| Y ~ PAS | D | .042 | .233 | .262 | −.032 | .573 | .684 |
| Y ~ CMD | D | −.041 | .698 | .698 | .007 | .394 | .684 |
| Y ~ Elev | D | .077 | .120 | .200 | .223 | .017* | .077● |
Populations including five or more genotyped individuals were used in this analysis.
aY = genetic distances calculated as the pairwise Slatkin's linearized F ST between populations.
b●p < .10; *p < .05; **p < .01; ***p < .001.
cFalse discovery rate: ●q < 0.10; *q < 0.05; **q < 0.01; ***q < 0.001.
Redundancy analyses (RDAs) to partition among‐population genetic variation (F) in Pinus strobus and P. monticola into three components: climate (IBE); geography (IBD); and north–south ancestry (IBC)
| Combined fractions |
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| All (153) SNPs | Bayenv2 outlier (12) SNPs | LFMM outlier (19) SNPs | All (158) SNPs | Bayenv2 outlier (12) SNPs | LFMM outlier (6) SNPs | |||||||
|
|
|
|
|
|
|
|
|
|
|
|
| |
| F~clim. | .059 | .001*** | .295 | .001*** | .193 | .001*** | .089 | .001*** | .317 | .001*** | .109 | .002** |
| F~geog. | .064 | .001*** | .327 | .001*** | .198 | .001*** | .152 | .001*** | .400 | .001*** | .215 | .001*** |
| F~anc. | .045 | .001*** | .171 | .001*** | .139 | .001*** | .101 | .001*** | .386 | .001*** | .088 | .001*** |
|
| ||||||||||||
| F~clim. | (geog. + anc.) | .001 | .382 | .016 | .091● | .025 | .010** | −.006 | .722 | −.005 | .613 | −.048 | .962 |
| F~geog. | (clim. + anc.) | .007 | .023* | .034 | .002** | .026 | .001*** | .025 | .006** | .012 | .145 | .010 | .310 |
| F~anc. | (clim. + geog.) | .018 | .001*** | .005 | .136 | .021 | .002** | .021 | .001*** | .059 | .001*** | −.006 | .646 |
| F~clim.+geog. | anc. | .029 | – | .125 | – | .051 | – | .049 | – | .072 | – | .124 | – |
| F~geog.+anc. | clim. | −.001 | – | .012 | – | .001 | – | .034 | – | .077 | – | .060 | – |
| F~clim.+anc. | geog. | −.001 | – | −.002 | – | −.003 | – | .002 | – | .011 | – | .012 | – |
| F~clim. + anc. + geog. | .029 | – | .156 | – | .120 | – | .045 | – | .240 | – | .022 | – |
| Total explained | .084 | .348 | .244 | .176 | .471 | .228 | ||||||
| Total confounded | .058 | .293 | .172 | .130 | .400 | .218 | ||||||
| Total unexplained | .916 | .652 | .756 | .824 | .529 | .772 | ||||||
| Total | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | ||||||
aF = Independent matrix of population alleles frequencies; RDA tests are of the form: F~dependent matrices | covariate matrices. Clim. = climate (eight climatic variables); geog. = geography (P. monticola: x, y, xy, y 2; P. strobus: x, y, xy); anc. = north–south ancestry (Q‐values from STRUCTURE). Populations including five or more genotyped individual were used in this analysis.
bSubsets of SNPs detected by Bayenv2 (BF > 3) and by LFMM (q < 0.05). The number of SNPs for each subset is given in parentheses.
c●p < .10; *p < .05; **p < .01; ***p < .001. Significance of confounded fractions between climate, geography, and north–south ancestry was not tested.
dTotal explained = total adjusted R 2 of individual fractions. Total confounded = Total of individual fractions confounded between various combinations of climate, geography, and north–south ancestry. Negative R 2 values were considered null for this calculation.
Figure 5(a, b, c) Pinus strobus and (d, e, f) P. monticola: Venn diagrams showing the proportion of among‐population genetic variation explained by climate (IBE, eight climatic variables), geography (IBD, P. monticola: x, y, xy, y 2; P. strobus: x, y, xy), and north–south ancestry (IBC, Q‐values from STRUCTURE) in redundancy analyses (RDAs) using (a, d) all SNPs; or subsets of SNPs detected by (b, e) Bayenv2; and (c, f) LFMM. Circles in Venn diagrams are not proportional to the amount of explained variation by each factor. Significance codes: *p < .05; **p < .01; ***p < .001. Significance of confounded fractions between climate, geography, and north–south ancestry (overlap in circles) was not tested