| Literature DB >> 24405978 |
Lily M Blair1, Julie M Granka, Marcus W Feldman.
Abstract
BACKGROUND: Phenotypic variation along environmental gradients has been documented among and within many species, and in some cases, genetic variation has been shown to be associated with these gradients. Bayenv is a relatively new method developed to detect patterns of polymorphisms associated with environmental gradients. Using a Bayesian Markov Chain Monte Carlo (MCMC) approach, Bayenv evaluates whether a linear model relating population allele frequencies to environmental variables is more probable than a null model based on observed frequencies of neutral markers. Although this method has been used to detect environmental adaptation in a number of species, including humans, plants, fish, and mosquitoes, stability between independent runs of this MCMC algorithm has not been characterized. In this paper, we explore the variability of results between runs and the factors contributing to it.Entities:
Mesh:
Year: 2014 PMID: 24405978 PMCID: PMC3896655 DOI: 10.1186/1479-7364-8-1
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Pairs of Bayenv runs compared
| Hancock1 | | | | | | | | | | | | | | | | | |
| Hancock2 | a | | | | | | | | | | | | | | | | |
| Blair1 | | a | | | | | | | | | | | | | | | |
| Blair2 | | a | a | | | | | | | | | | | | | | |
| Blair3 | | a | a | a | | | | | | | | | | | | | |
| Blair4 | | a | a | a | a | | | | | | | | | | | | |
| Blair5 | | a | a | a | a | a | | | | | | | | | | | |
| LongBlair1 | | | | | | | | | | | | | | | | | |
| LongBlair2 | | | | | | | | b | | | | | | | | | |
| LongBlair3 | | | | | | | | b | b | | | | | | | | |
| LongBlair4 | | | | | | | | b | b | b | | | | | | | |
| LongBlair5 | | | | | | | | b | b | b | b | | | | | | |
| W/O_Sib1 | | d | | | | | | | | | | | | | | | |
| W/O_Sib2 | | d | | | | | | | | | | | c | | | | |
| W/O_Sib3 | | d | | | | | | | | | | | c | c | | | |
| W/O_Sib4 | | d | | | | | | | | | | | c | c | c | | |
| W/O_Sib5 | d | c | c | c | c |
Letters indicate the parts of Tables 2 and 3 in which results for the indicated comparisons are averaged and presented. In Table 2, letters a, b, c, and d also correspond to parts e, f, g, and h, respectively. Hancock1 refers to the original Climate file posted on the dbCline website and published in Hancock et al. [40]; Hancock2 refers to file Climate.2 posted on dbCline website in autumn 2012 that uses the appropriate MCMC burn-in (see ‘Methods’ section); Blair1-5 refer to our five separate runs using 100,000 MCMC iterations; LongBlair1-5 refer to our five separate runs using 500,000 MCMC iterations; W/O_Sib1-5 refer to our five runs using a modified data set excluding the Siberian populations.
Average correlations of the log of the Bayes factors (BF) and average correlations of the empirical values between runs for each climate variable
| Log of BF | | | | | | | | | | | |
| (a) All populations (100,000 iterations) incl Hancock | 0.87 | 0.88 | 0.66 | 0.83 | 0.84 | 0.73 | 0.75 | 0.74 | 0.89 | 0.73 | 0.79 |
| (b) All populations (500,000 iterations) | 0.93 | 0.92 | 0.82 | 0.88 | 0.90 | 0.85 | 0.85 | 0.83 | 0.93 | 0.85 | 0.88 |
| (c) Without Siberia (150,000 iterations) | 0.87 | 0.87 | 0.74 | 0.82 | 0.86 | 0.79 | 0.81 | 0.78 | 0.87 | 0.80 | 0.85 |
| (d) Without Siberia (150,000 iterations) vs. Hancock2 | 0.79 | 0.82 | 0.70 | 0.81 | 0.81 | 0.70 | 0.72 | 0.75 | 0.84 | 0.78 | 0.76 |
| Empirical | | | | | | | | | | | |
| (e) All populations (100,000 iterations) incl Hancock | 0.81 | 0.84 | 0.63 | 0.79 | 0.81 | 0.68 | 0.70 | 0.70 | 0.85 | 0.70 | 0.75 |
| (f) All populations (500,000 iterations) | 0.91 | 0.91 | 0.81 | 0.88 | 0.90 | 0.82 | 0.83 | 0.82 | 0.92 | 0.84 | 0.86 |
| (g) Without Siberia (150,000 iterations) | 0.82 | 0.81 | 0.70 | 0.80 | 0.82 | 0.73 | 0.75 | 0.74 | 0.82 | 0.76 | 0.80 |
| (h) Without Siberia (150,000 iterations) vs. Hancock2 | 0.61 | 0.65 | 0.67 | 0.74 | 0.66 | 0.62 | 0.68 | 0.70 | 0.67 | 0.72 | 0.75 |
‘Win’ refers to winter, ‘sum’ refers to summer, ‘prec. rate’ refers to precipitation rate, ‘rad. flux’ refers to radiation flux, and ‘rel. humid’ refers to relative humidity. Runs compared in each part are shown in Table 1. Incl Hancock means that the average is taken over pairs of Blair1-5 (100,000 iterations) and Hancock1-2, shown by a's in Table 1. Hancock1-2 were run with 150,000 iterations.
Empirical value tail comparisons between indicated runs
| (a) All populations (100,000 iterations) - Hancock2 vs. Blair1 | 0.55 | 0.79 | 0.83 | 0.88 | |
| 0.75 | 0.40 | 0.51 | 0.66 | ||
| | 0.78 | 0.49 | 0.34 | 0.54 | |
| | 0.83 | 0.62 | 0.51 | 0.24 | |
| (b) All populations (500,000 iterations) - LongBlair1 vs. LongBlair3 | 0.64 | 0.85 | 0.89 | 0.93 | |
| 0.88 | 0.51 | 0.62 | 0.78 | ||
| | 0.91 | 0.65 | 0.46 | 0.67 | |
| | 0.94 | 0.80 | 0.70 | 0.37 | |
| (c) Without Siberia (150,000 iterations) - W/O_Sib1 vs. W/O_Sib3 | 0.60 | 0.83 | 0.86 | 0.92 | |
| 0.81 | 0.45 | 0.56 | 0.73 | ||
| | 0.85 | 0.56 | 0.40 | 0.61 | |
| | 0.90 | 0.71 | 0.59 | 0.30 | |
| (d) Without Siberia (150,000 iterations) vs. Hancock2 - Hancock2 vs. W/O_Sib1 | 0.51 | 0.72 | 0.75 | 0.80 | |
| 0.75 | 0.36 | 0.44 | 0.56 | ||
| | 0.81 | 0.46 | 0.30 | 0.44 | |
| 0.87 | 0.62 | 0.48 | 0.20 | ||
The table reports the fraction of SNPs from the smaller tail of one run that is present in the larger tail of the second run, averaged over climate variables. Two identical runs produce values of 1. Within each part (a to d), all pairs of runs (see Table 1) have similar fractions of overlap, and representative tables are shown here. For a given part, the first run's empirical tails are shown as rows (e.g., Hancock2 in part a), and the second run's are shown as columns (e.g., Blair1 in part a). Also, 623,318 SNPs are included in this analysis.
Overlap of ranked SNPs (ranked by empirical value from lowest to highest and binned in groups of 1,000) between runs
| 1–1,000 | 0.44 ± 0.020 |
| 100,001–101,000 | 0.0073 ± 0.0033 |
| 200,001–201,000 | 0.0051 ± 0.0023 |
| 300,001–301,000 | 0.0043 ± 0.0016 |
| 400,001–401,000 | 0.0042 ± 0.0014 |
| 500,001–501,000 | 0.0035 ± 0.0014 |
| 600,001–601,000 | 0.014 ± 0.0098 |
For each bin, we calculated the proportion of overlapping SNPs between each corresponding bin for all ten pairs of runs (Blair1-5). Table reports the mean and standard deviation of the overlap proportion across all pairs of runs. Only 1 bin per 100,000 SNPs is shown. Also, 623,318 total SNPs were used in the analysis.
Genic/non-genic enrichments for each climate variable in the indicated empirical tails
| Absolute latitude | Hancock1 (published results) | 3 | 3 | 3 |
| | Hancock1 (our analysis) | 1 | 2 | 2 |
| | Hancock2 | 3 | 3 | 3 |
| | Blair1 | 0 | 2 | 3 |
| | Blair2 | 0 | 3 | 3 |
| | Blair3 | 1 | 1 | 2 |
| | Blair4 | 1 | 3 | 3 |
| | Blair5 | 1 | 1 | 3 |
| | LongBlair1 | 3 | 3 | 3 |
| | LongBlair2 | 2 | 2 | 1 |
| | LongBlair3 | 1 | 2 | 2 |
| | LongBlair4 | 3 | 2 | 1 |
| | LongBlair5 | 2 | 2 | 3 |
| | W/O_Sib1 | 1 | 0 | 0 |
| | W/O_Sib2 | 0 | 1 | 2 |
| | W/O_Sib3 | 1 | 0 | 1 |
| | W/O_Sib4 | 0 | 0 | 0 |
| | W/O_Sib5 | 0 | 0 | 0 |
| Summer maximum temperature | Hancock1 (published results) | 0 | 0 | 2 |
| | Hancock1 (our analysis) | 0 | 2 | 3 |
| | Hancock2 | 2 | 3 | 3 |
| | Blair1 | 0 | 0 | 0 |
| | Blair2 | 0 | 0 | 0 |
| | Blair3 | 0 | 1 | 0 |
| | Blair4 | 0 | 0 | 0 |
| | Blair5 | 1 | 0 | 0 |
| | LongBlair1 | 0 | 0 | 1 |
| | LongBlair2 | 0 | 1 | 0 |
| | LongBlair3 | 0 | 0 | 1 |
| | LongBlair4 | 0 | 2 | 2 |
| | LongBlair5 | 0 | 0 | 1 |
| | W/O_Sib1 | 3 | 2 | 2 |
| | W/O_Sib2 | 0 | 0 | 0 |
| | W/O_Sib3 | 0 | 0 | 0 |
| | W/O_Sib4 | 0 | 2 | 3 |
| | W/O_Sib5 | 3 | 0 | 0 |
| Summer precipitation rate | Hancock1 (published results) | 0 | 0 | 0 |
| | Hancock1 (our analysis) | 0 | 0 | 0 |
| | Hancock2 | 0 | 0 | 0 |
| | Blair1 | 0 | 0 | 0 |
| | Blair2 | 0 | 0 | 0 |
| | Blair3 | 0 | 0 | 0 |
| | Blair4 | 0 | 0 | 0 |
| | Blair5 | 0 | 0 | 0 |
| | LongBlair1 | 0 | 0 | 0 |
| | LongBlair2 | 0 | 0 | 0 |
| | LongBlair3 | 0 | 0 | 0 |
| | LongBlair4 | 0 | 0 | 0 |
| | LongBlair5 | 0 | 0 | 0 |
| | W/O_Sib1 | 0 | 0 | 0 |
| | W/O_Sib2 | 0 | 0 | 0 |
| | W/O_Sib3 | 0 | 0 | 0 |
| | W/O_Sib4 | 0 | 0 | 0 |
| | W/O_Sib5 | 0 | 0 | 0 |
| Summer solar radiation | Hancock1 (published results) | 3 | 3 | 3 |
| | Hancock1 (our analysis) | 3 | 3 | 3 |
| | Hancock2 | 0 | 1 | 3 |
| | Blair1 | 2 | 2 | 0 |
| | Blair2 | 0 | 0 | 0 |
| | Blair3 | 1 | 0 | 0 |
| | Blair4 | 0 | 0 | 0 |
| | Blair5 | 3 | 0 | 0 |
| | LongBlair1 | 2 | 3 | 3 |
| | LongBlair2 | 0 | 3 | 3 |
| | LongBlair3 | 2 | 0 | 0 |
| | LongBlair4 | 0 | 2 | 1 |
| | LongBlair5 | 0 | 3 | 3 |
| | W/O_Sib1 | 3 | 3 | 3 |
| | W/O_Sib2 | 1 | 0 | 0 |
| | W/O_Sib3 | 0 | 0 | 0 |
| | W/O_Sib4 | 1 | 0 | 0 |
| | W/O_Sib5 | 3 | 3 | 2 |
| Summer relative humidity | Hancock1 (published results) | 2 | 3 | 3 |
| | Hancock1 (our analysis) | 2 | 3 | 3 |
| | Hancock2 | 0 | 3 | 3 |
| | Blair1 | 0 | 2 | 1 |
| | Blair2 | 2 | 0 | 0 |
| | Blair3 | 0 | 0 | 0 |
| | Blair4 | 0 | 2 | 0 |
| | Blair5 | 1 | 0 | 0 |
| | LongBlair1 | 0 | 0 | 0 |
| | LongBlair2 | 1 | 2 | 3 |
| | LongBlair3 | 0 | 2 | 1 |
| | LongBlair4 | 0 | 0 | 1 |
| | LongBlair5 | 1 | 1 | 1 |
| | W/O_Sib1 | 0 | 3 | 3 |
| | W/O_Sib2 | 2 | 3 | 2 |
| | W/O_Sib3 | 0 | 0 | 0 |
| | W/O_Sib4 | 1 | 2 | 1 |
| | W/O_Sib5 | 0 | 0 | 1 |
| Winter minimum temperature | Hancock1 (published results) | 1 | 0 | 0 |
| | Hancock1 (our analysis) | 1 | 0 | 0 |
| | Hancock2 | 0 | 3 | 3 |
| | Blair1 | 0 | 1 | 2 |
| | Blair2 | 0 | 2 | 2 |
| | Blair3 | 0 | 1 | 2 |
| | Blair4 | 0 | 0 | 1 |
| | Blair5 | 0 | 1 | 1 |
| | LongBlair1 | 0 | 3 | 2 |
| | LongBlair2 | 0 | 2 | 2 |
| | LongBlair3 | 0 | 3 | 2 |
| | LongBlair4 | 0 | 2 | 2 |
| | LongBlair5 | 0 | 3 | 3 |
| | W/O_Sib1 | 0 | 0 | 0 |
| | W/O_Sib2 | 0 | 2 | 0 |
| | W/O_Sib3 | 1 | 0 | 2 |
| | W/O_Sib4 | 0 | 1 | 3 |
| | W/O_Sib5 | 0 | 1 | 2 |
| Winter precipitation rate | Hancock1 (published results) | 0 | 3 | 3 |
| | Hancock1 (our analysis) | 1 | 3 | 3 |
| | Hancock2 | 2 | 0 | 0 |
| | Blair1 | 2 | 0 | 0 |
| | Blair2 | 0 | 1 | 0 |
| | Blair3 | 2 | 1 | 0 |
| | Blair4 | 2 | 3 | 3 |
| | Blair5 | 0 | 0 | 0 |
| | LongBlair1 | 0 | 0 | 0 |
| | LongBlair2 | 0 | 0 | 0 |
| | LongBlair3 | 0 | 0 | 0 |
| | LongBlair4 | 0 | 0 | 0 |
| | LongBlair5 | 0 | 2 | 0 |
| | W/O_Sib1 | 0 | 0 | 2 |
| | W/O_Sib2 | 0 | 2 | 3 |
| | W/O_Sib3 | 0 | 0 | 0 |
| | W/O_Sib4 | 0 | 3 | 2 |
| | W/O_Sib5 | 3 | 1 | 2 |
| Winter relative humidity | Hancock1 (published results) | 3 | 3 | 3 |
| | Hancock1 (our analysis) | 3 | 2 | 2 |
| | Hancock2 | 3 | 3 | 3 |
| | Blair1 | 3 | 3 | 3 |
| | Blair2 | 3 | 3 | 2 |
| | Blair3 | 3 | 1 | 3 |
| | Blair4 | 3 | 3 | 3 |
| | Blair5 | 3 | 1 | 0 |
| | LongBlair1 | 3 | 3 | 0 |
| | LongBlair2 | 3 | 3 | 2 |
| | LongBlair3 | 3 | 3 | 2 |
| | LongBlair4 | 2 | 0 | 0 |
| | LongBlair5 | 3 | 2 | 1 |
| | W/O_Sib1 | 3 | 2 | 2 |
| | W/O_Sib2 | 3 | 2 | 3 |
| | W/O_Sib3 | 3 | 3 | 2 |
| | W/O_Sib4 | 3 | 3 | 3 |
| | W/O_Sib5 | 3 | 3 | 0 |
| Winter solar radiation | Hancock1 (published results) | 3 | 1 | 1 |
| | Hancock1 (our analysis) | 3 | 0 | 2 |
| | Hancock2 | 3 | 2 | 0 |
| | Blair1 | 3 | 1 | 0 |
| | Blair2 | 3 | 2 | 1 |
| | Blair3 | 3 | 0 | 0 |
| | Blair4 | 3 | 0 | 0 |
| | Blair5 | 2 | 1 | 0 |
| | LongBlair1 | 3 | 0 | 0 |
| | LongBlair2 | 3 | 2 | 0 |
| | LongBlair3 | 3 | 2 | 1 |
| | LongBlair4 | 3 | 3 | 2 |
| | LongBlair5 | 3 | 1 | 0 |
| | W/O_Sib1 | 3 | 0 | 0 |
| | W/O_Sib2 | 3 | 1 | 2 |
| | W/O_Sib3 | 3 | 0 | 0 |
| | W/O_Sib4 | 3 | 3 | 2 |
| W/O_Sib5 | 2 | 0 | 1 | |
The table shows the number of stars of significance for genic/non-genic enrichments in the 0.05, 0.01, and 0.005 tails of the empirical p value distribution for each indicated run (defined as in Table 1 and in the ‘Methods’ section). ‘1’ denotes 95%–97.5% of bootstraps had an enrichment of genic compared to non-genic SNPs in that empirical tail, ‘2’ denotes 97.5%–99%, and ‘3’ denotes >99%. ‘0’ denotes that a genic enrichment is not significant (i.e., that <95% of the bootstraps had an enrichment). Enrichments for the nine environmental variables that were shown in Hancock et al. [40] are shown here.