| Literature DB >> 21533023 |
Angela M Hancock1, David B Witonsky, Gorka Alkorta-Aranburu, Cynthia M Beall, Amha Gebremedhin, Rem Sukernik, Gerd Utermann, Jonathan K Pritchard, Graham Coop, Anna Di Rienzo.
Abstract
Humans inhabit a remarkably diverse range of environments, and adaptation through natural selection has likely played a central role in the capacity to survive and thrive in extreme climates. Unlike numerous studies that used only population genetic data to search for evidence of selection, here we scan the human genome for selection signals by identifying the SNPs with the strongest correlations between allele frequencies and climate across 61 worldwide populations. We find a striking enrichment of genic and nonsynonymous SNPs relative to non-genic SNPs among those that are strongly correlated with these climate variables. Among the most extreme signals, several overlap with those from GWAS, including SNPs associated with pigmentation and autoimmune diseases. Further, we find an enrichment of strong signals in gene sets related to UV radiation, infection and immunity, and cancer. Our results imply that adaptations to climate shaped the spatial distribution of variation in humans.Entities:
Mesh:
Year: 2011 PMID: 21533023 PMCID: PMC3080864 DOI: 10.1371/journal.pgen.1001375
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Climate variables used for the analysis.
(A) Maps show the distributions of summer and winter climate variables: maximum summer temperature, minimum winter temperature and solar radiation, precipitation rate and relative humidity in the summer and winter. (B) A heatmap shows the absolute values of Spearman rank correlation coefficients between pairs of climate variables.
Proportions of genic and nonsynonymous SNPs relative to the proportion of non-genic SNPs in the tail of the minimum rank distribution.
| Variable category | Population set | Genic:Nongenic | NS:Nongenic | ||||
| tail cutoff | tail cutoff | ||||||
| 0.05 | 0.01 | 0.005 | 0.05 | 0.01 | 0.005 | ||
| Climate | Worldwide | 1.08 | 1.14 | 1.18 | 1.25 | 1.58 | 1.63 |
| AWE | 1.06 | 1.12 | 1.12 | 1.17 | 1.37 | 1.61 | |
| AEA | 1.03 | 1.09 | 1.17 | 1.06 | 1.23 | 1.20 | |
Symbols *, ** and *** denote support from >95%, >97.5% and >99% of bootstrap replicate, respectively.
Proportions of genic and nonsynonymous SNPs relative to the proportion of non-genic SNPs in the tails of the individual variable distributions.
| Season | Variable | genic:non-genic | NS:non-genic | ||||
| tail cut-off: | tail cut-off: | ||||||
| 0.05 | 0.01 | 0.005 | 0.05 | 0.01 | 0.005 | ||
| Latitude | 1.07 *** | 1.14*** | 1.19*** | 1.19*** | 1.60*** | 1.56*** | |
| summer | Maximum Temperature | 1.02 | 1.06 | 1.13 | 1.17 | 1.33 | 1.56 |
| Precipitation Rate | 1.00 | 1.02 | 1.02 | 1.03 | 1.14 | 1.24 | |
| Relative Humidity | 1.05 | 1.20 | 1.22 | 1.06 | 1.21 | 1.40* | |
| Solar Radiation | 1.05 | 1.15 | 1.16 | 1.17 | 1.35 | 1.48 | |
| winter | Minimum Temperature | 1.04 | 1.04 | 1.05 | 1.24 | 1.37 | 1.75 |
| Precipitation Rate | 1.04 | 1.12 | 1.17 | 1.10 | 1.23 | 1.36 | |
| Relative Humidity | 1.07 | 1.13 | 1.14 | 1.24 | 1.20 | 1.24 | |
| Solar Radiation | 1.09 | 1.08 | 1.13 | 1.26 | 1.45 | 1.24 | |
Symbols *, ** and *** denote support from >95%, >97.5% and >99% of bootstrap replicate, respectively.
Figure 2Mean-centered allele frequency plotted against population for SNPs with the strongest signals (transformed rank statistic <10−5).
The variables shown are: (A) winter solar radiation in the worldwide analysis, (B) summer precipitation rate in the worldwide analysis, and winter solar radiation in (C) the AWE population subset and (D) the AEA population subset. Since the particular patterns that result in strong correlations in the worldwide analysis are diverse, SNPs for these variables were split into two clusters using the results of an eigen analysis of the matrix of SNPs and populations. SNPs were assigned to clusters based on the eigenvector term for the eigenvector corresponding to the first eigenvalue [91]. Mean-centered allele frequencies were computed by subtracting the mean allele frequency across populations. SNPs with rank statistics less than 10−5 are included in the plots. Population names and means are colored based on membership in one of five major geographical regions (sub-Saharan Africa, Western Eurasia, East Asia, Oceania, and the Americas) and ordered, within each region, so that the climate variable values increase from left to right across the x-axis. Alleles are polarized based on the signs of the Spearman correlations with the climate variable. Each gray dot represents an individual SNP and fitted lines (obtained using the lm function in R) for each region are shown in color. The ranges of the climate variable values across each geographic region are shown above the horizontal axis.
Figure 3Global variation in allele frequencies for SNPs with strong signals with climate.
Two NS SNPs from the worldwide analysis: (A) A SNP (rs3782489) in keratin 77 (KRT77), is strongly correlated with summer solar radiation, and (B) a SNP (rs2075756) in the thyroid receptor interacting protein (TRIP6) is strongly correlated with absolute latitude. Two SNPs from the population subset analysis: (C) A SNP (rs4558836) in CORIN has a signal in the AEA population subset with winter minimum temperature, but not in the AWE subset, and (D) a NS SNP (rs5743810) in TLR6 has a signal in the AWE population subset with winter solar radiation, but not in the AEA subset. Two SNPs that are associated with autoimmune disease from GWAS: (E) A SNP (rs2313132) upstream of PCDH18 that is associated with SLE is strongly correlated with summer solar radiation, and (F) a SNP (rs6074022) upstream of CD40 that is associated with multiple sclerosis is strongly correlated with minimum winter temperature. For each plot, gray points represent individual SNPs and colored lines represent fitted lines (obtained using the lm function in R) for each region. The ranges of the climate variable values for each region are shown at the bottom of the corresponding segment of the plot.
Figure 4Venn diagrams showing the overlap between lower tails of rank statistics from the worldwide analysis and each population subset analysis.
The Venn diagram on the right shows the overlap expected between the results of the worldwide analysis and a set of randomly drawn SNPs.
SNPs with the strongest signals of selection among those associated with phenotypic traits in GWAS.
| Trait category | Strongest disease or trait association | Ref SNP ID | Most significant climate correlation | Nearby genes | |||
| Pop Set | Variable | log10BF | Rank Statistic | ||||
| Pigmentation and tanning | Hair Color | rs12913832 | WW | Summer Maximum Temperature | 7.06 | 2.08×10−5 |
|
| Hair Color | rs12913832 | WW | Summer Relative Humidity | 8.11 | 2.08×10−5 |
| |
| Hair Color | rs28777 | AWE | Winter Solar Radiation | 10.4 | 4.22×10−5 |
| |
| Hair Color | rs28777 | WW | Winter Relative Humidity | 4.26 | 3.29×10−4 |
| |
| Eye Color | rs1667394 | AWE | Winter Solar Radiation | 8.33 | 4.99×10−5 |
| |
| Hair Color | rs1667394 | AWE | Winter Solar Radiation | 8.33 | 4.99×10−5 |
| |
| Tanning | rs35391 | WW | Summer Relative Humidity | 7.27 | 6.81×10−5 |
| |
| Tanning | rs35391 | AWE | Winter Solar Radiation | 6.63 | 3.50×10−4 |
| |
| Immune and Autoimmune | Multiple sclerosis | rs6074022 | AEA | Summer Precipitation Rate | 6.98 | 4.00×10−4 |
|
| Multiple sclerosis | rs6074022 | WW | Winter Minimum Temperature | 11.1 | 2.40×10−4 |
| |
| SLE | rs2313132 | WW | Summer Solar Radiation | 2.05 | 4.52×10−4 |
| |
| SLE | rs2187668 | AWE | Summer Relative Humidity | 8.25 | 1.82×10−5 |
| |
| Celiac Disease | rs2187668 | AWE | Summer Relative Humidity | 8.25 | 1.82×10−5 |
| |
| Crohn's disease | rs4613763 | WW | Summer Relative Humidity | 2.19 | 2.26×10−4 |
| |
| Psoriasis | rs10484554 | AEA | Summer Precipitation Rate | 7.23 | 1.80×10−4 |
| |
| AIDS progression | rs10484554 | AEA | Summer Precipitation Rate | 7.23 | 1.80×10−4 |
| |
| Height | Height | rs185819 | AEA | Summer Maximum Temperature | 5.55 | 4.79×10−4 |
|
| Cardiovascular | Stroke | rs10486776 | AWE | Winter Solar Radiation | 2.3 | 2.94×10−4 |
|
| Factor VII | rs10488360 | AWE | Summer Precipitation Rate | 6.76 | 2.06×10−4 |
| |
| Other | Bone Mineral Density (Hip) | rs10490823 | AWE | Winter Solar Radiation | 5.53 | 4.54×10−4 |
|
| Other | Testicular germ cell tumor | rs210138 | AEA | Summer Precipitation Rate | 8.14 | 1.50×10−4 |
|
This table contains SNPs with an empirical rank less than 5×10−4 and a GWAS p-value of less than 1×10−5.
Disease classes enriched in the 1% and 5% tails of the minimum rank distribution.
| Variable | Geographic Region | Disease Class | SNPs in gene set: other genic SNPs | ||
| tail cutoff: | |||||
| 0.05 | 0.01 | 0.005 | |||
| Climate | Worldwide | Cardiovascular | 1.27*** | 1.45* | 1.69* |
| AWE | Cancer | 1.24 | 1.54 | 1.76 | |
| Cardiovascular | 1.39 | 1.49 | 1.50 | ||
| Immune | 1.33 | 1.58 | 1.85 | ||
| Infection | 1.32 | 1.65 | 2.19 | ||
| AEA | Cardiovascular | 1.25 | 1.69 | 2.05 | |
| Immune | 1.14 | 1.51 | 1.88 | ||
Symbols *, ** and *** denote support from >95%, >97.5% and >99% of bootstrap replicate, respectively.
A subset of the strongest results for chemical and genetic perturbations.
| Related Trait/Disease | Pop Set | Description | SNPs in gene set: other genic SNPs | ||
| cutoff: | |||||
| 0.05 | 0.01 | 0.005 | |||
| Response to UV radiation | AEA | Down-regulated at 6 hours following treatment of WS1 human skin fibroblasts with UVC at a low dose (10 J/m∧2) | 2.37** | 4.56*** | 4.58* |
| AEA | Down-regulated at any time-point following treatment of both XPB/CS and XPB/TTD fibroblasts with 3 J/m∧2 UVC | 1.22* | 1.57** | 1.65* | |
| AEA | Down-regulated at 8 hours following treatment of XPB/CS fibroblasts with 3 J/m∧2 UVC | 1.18*** | 1.43** | 1.63*** | |
| AEA | Down-regulated at any time-point following treatment of XPB/CS fibroblasts with 3 J/m∧2 UVC | 1.16** | 1.41** | 1.57*** | |
| Thermo-regulation | AEA | Up-regulated in brown preadipocytes from Irs1-knockout mice, which display severe defects in adipocyte differentiation | 1.43** | 1.60* | 2.07* |
| Cancer/Cell proliferation | WW | Down-regulated with stable, ectopic overexpression of BRCA1 in human prostate cancer cell lines | 2.01*** | 3.24*** | 4.31** |
| WW | Genes concomitantly modulated by activated Notch1 in mouse and human primary keratinocytes | 1.40* | 1.90* | 2.50** | |
| WW | Genes up-regulated in kras knockdown vs control in a human cell line | 1.46*** | 2.08*** | 2.74*** | |
| AEA | Genes up-regulated in kras knockdown vs control in a human cell line | 1.34*** | 2.11*** | 2.24*** | |
| AEA | Gene set that can be used to differentiate | 1.48** | 2.45** | 3.49** | |
| AEA | Up-regulated by butyrate at 24 hrs in SW260 colon carcinoma cells | 1.61** | 2.30** | 2.88** | |
| AWE | Up-regulated by sulindac at 48 hrs in SW260 colon carcinoma cells | 1.45** | 2.10* | 2.52* | |
| AWE | Genes up-regulated by NF-kappa B | 1.30* | 1.96* | 2.37** | |
| AWE | Down-regulated in cells undergoing IL-3-dependent proliferative self-renewal | 1.54*** | 2.47*** | 2.74*** | |
| AWE | Up-regulated in human dermal (foreskin) microvascular endothelial cells that were stimulated to proliferate with prolonged EGF treatment | 1.81** | 2.88** | 5.34*** | |
| Infection/Immunity | AWE | Genes up-regulated by NF-kappa B | 1.30* | 1.96* | 2.37** |
| AEA | Down-regulated in fibroblasts following infection with human cytomegalovirus | 2.20* | 3.59* | 6.03** | |
| AEA | Genes down-reglated in peripheral blood lymphocytes (PBLs) of immunosuppressed patients with a well functioning kidney transplant | 1.84*** | 3.37** | 5.10** | |
| AWE | Down-regulated in cells undergoing IL-3-dependent proliferative self-renewal | 1.54*** | 2.47*** | 2.74*** | |
| AWE | Up-regulated in human dermal (foreskin) microvascular endothelial cells that were stimulated to proliferate with prolonged EGF treatment | 1.81** | 2.88** | 5.34*** | |
| AWE | Genes up-regulated in peripheral blood lymphocytes (PBLs) of stable, immunosuppressed patients with a well functioning kidney transplant | 1.74*** | 2.29*** | 2.58** | |