| Literature DB >> 23236293 |
Gorka Alkorta-Aranburu1, Cynthia M Beall, David B Witonsky, Amha Gebremedhin, Jonathan K Pritchard, Anna Di Rienzo.
Abstract
Although hypoxia is a major stress on physiological processes, several human populations have survived for millennia at high altitudes, suggesting that they have adapted to hypoxic conditions. This hypothesis was recently corroborated by studies of Tibetan highlanders, which showed that polymorphisms in candidate genes show signatures of natural selection as well as well-replicated association signals for variation in hemoglobin levels. We extended genomic analysis to two Ethiopian ethnic groups: Amhara and Oromo. For each ethnic group, we sampled low and high altitude residents, thus allowing genetic and phenotypic comparisons across altitudes and across ethnic groups. Genome-wide SNP genotype data were collected in these samples by using Illumina arrays. We find that variants associated with hemoglobin variation among Tibetans or other variants at the same loci do not influence the trait in Ethiopians. However, in the Amhara, SNP rs10803083 is associated with hemoglobin levels at genome-wide levels of significance. No significant genotype association was observed for oxygen saturation levels in either ethnic group. Approaches based on allele frequency divergence did not detect outliers in candidate hypoxia genes, but the most differentiated variants between high- and lowlanders have a clear role in pathogen defense. Interestingly, a significant excess of allele frequency divergence was consistently detected for genes involved in cell cycle control and DNA damage and repair, thus pointing to new pathways for high altitude adaptations. Finally, a comparison of CpG methylation levels between high- and lowlanders found several significant signals at individual genes in the Oromo.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23236293 PMCID: PMC3516565 DOI: 10.1371/journal.pgen.1003110
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Hemoglobin and oxygen saturation measurements.
Box plots describe variation in the Amhara 05 (dark grey boxes), Amhara 95 (grey boxes) and Oromo (white boxes) for Hb concentration (g/dL) among males (A) and females (B) and for O2 sat also among males (E) and females (F). Box plots show the median (horizontal line), interquartile range (box), and range (whiskers), except the extreme values represented by circles. Statistically significant differences after multiple test correction between groups (unpaired two-sided two-sample t-test) are bolded in C, D, G, and H.
Figure 2Hemoglobin association test within Amhara.
The QQplot represents the excess of strong association with Hb among Amhara individuals (A). The observed −log10 p-value distribution is ranked from smallest to largest and plotted (y-axis) against the expected −log10 p-value (y-axis) in black. The grey area indicates the 95% confidence interval (see methods). Genome-wide (GW) significance level (after multiple test correction) is indicated by the dashed line. The Manhattan plot (B) shows the GW significance achieved by a set of high-LD SNPs in chromosome 1. The box plots describe the correlation between hemoglobin levels and the 3 genotypes of the top and GW significant SNP (rs10803083) among high (C) and low altitude (D) Amhara.
Association test of Tibetan EGLN1 and EPAS1 SNPs within Amhara, Oromo, and combined Ethiopians.
| Tibetans | Amhara | Oromo | Ethiopians | |||||||||||
| SNP | Gene | β | MAF | β | p | Power | MAF | β | p | Power | MAF | β | p | Power |
| rs961154 |
| 1.70 | 0.39 | 0.08 | 0.58 | 100 | 0.41 | 0.13 | 0.57 | 100 | 0.40 | 0.12 | 0.34 | 100 |
| rs2790859 |
| 1.70 | 0.39 | 0.08 | 0.58 | 100 | 0.41 | 0.13 | 0.57 | 100 | 0.40 | 0.12 | 0.34 | 100 |
| rs1992846 |
| 0.84 | 0.36 | 0.00 | 0.99 | 100 | 0.42 | −0.39 | 0.13 | 100 | 0.38 | −0.14 | 0.29 | 100 |
| rs7594278 |
| 0.52 | 0.31 | 0.06 | 0.71 | 98 | 0.24 | −0.08 | 0.77 | 80 | 0.28 | −0.01 | 0.93 | 100 |
| rs6544887 |
| 0.79 | 0.34 | −0.05 | 0.74 | 100 | 0.34 | −0.11 | 0.63 | 100 | 0.34 | −0.07 | 0.58 | 100 |
| rs17035010 |
| 0.84 | 0.39 | −0.07 | 0.66 | 100 | 0.30 | −0.02 | 0.93 | 100 | 0.36 | −0.04 | 0.80 | 100 |
| rs3768729 |
| 0.80 | 0.44 | 0.10 | 0.52 | 100 | 0.44 | −0.08 | 0.74 | 100 | 0.44 | 0.04 | 0.77 | 100 |
| rs7583554 |
| 0.94 | 0.46 | 0.10 | 0.52 | 100 | 0.41 | 0.16 | 0.53 | 100 | 0.44 | 0.13 | 0.34 | 100 |
| rs7583088 |
| 0.92 | 0.30 | 0.10 | 0.51 | 100 | 0.22 | −0.07 | 0.79 | 100 | 0.27 | 0.02 | 0.87 | 100 |
| rs11678465 |
| 0.85 | 0.30 | 0.10 | 0.51 | 100 | 0.22 | −0.06 | 0.83 | 100 | 0.27 | 0.03 | 0.84 | 100 |
| rs6712143 |
| 0.94 | 0.40 | 0.16 | 0.28 | 100 | 0.32 | −0.09 | 0.74 | 100 | 0.37 | 0.09 | 0.52 | 100 |
| rs4953342 |
| 0.90 | 0.21 | −0.12 | 0.50 | 100 | 0.12 | −0.19 | 0.57 | 99 | 0.18 | −0.16 | 0.33 | 100 |
| rs2121266 |
| 1.02 | 0.38 | 0.14 | 0.34 | 100 | 0.30 | −0.20 | 0.43 | 100 | 0.35 | 0.01 | 0.95 | 100 |
| rs9973653 |
| 0.52 | 0.35 | 0.01 | 0.95 | 100 | 0.30 | −0.18 | 0.47 | 100 | 0.33 | −0.05 | 0.70 | 100 |
| rs1374749 |
| 0.88 | 0.47 | −0.18 | 0.18 | 100 | 0.49 | −0.29 | 0.20 | 100 | 0.48 | −0.22 | 0.07 | 100 |
| rs4953353 |
| 0.97 | 0.31 | −0.01 | 0.93 | 100 | 0.31 | −0.19 | 0.46 | 100 | 0.31 | −0.08 | 0.54 | 100 |
| rs6756667 |
| 0.93 | 0.37 | 0.02 | 0.89 | 100 | 0.32 | 0.04 | 0.86 | 100 | 0.35 | 0.03 | 0.82 | 100 |
| rs7571218 |
| 0.71 | 0.50 | 0.17 | 0.22 | 100 | 0.49 | −0.19 | 0.46 | 100 | 0.50 | 0.06 | 0.64 | 100 |
Genotype-phenotype association β coefficients for EGLN1 were obtained from Simonson et al [16] while those for EPAS1 were obtained from Beall et al [17].
β indicates the observed linear coefficient for the relationship between SNP genotype and Hb levels.
Power refers to the probability of detecting a significant association (p<0.05) between SNP genotype and Hb level given the MAF and the sample size in the Ethiopian populations assuming that the β coefficient is as high or higher as that observed in Tibetans.
Figure 3Power plots.
The effect of β and MAF on the power of association tests based on the Ethiopian sample size (corrected for the number of SNPs tested within 10 kb from gene) is illustrated for EPAS1 (A, 72 SNPs)), EGLN1 (B, 38 SNPs) and any gene within the Response to Hypoxia gene ontology category (C, 1309 SNPs).
Biological pathways for which a significant excess of genic relative to all other genic SNPs are observed for all three tail cut-offs of the Amhara PBS, Amhara High PBS, or Amhara High MR distributions, respectively.
| Amhara PBS tail cut-off | Amhara High PBS tail cut-off | Amhara High MR tail cut-off | ||||||||
| Classification | Gene Set | 0.01 | 0.01 | 0.05 | 0.01 | 0.01 | 0.05 | 0.01 | 0.01 | 0.05 |
| GO BP | RESPONSE TO HYPOXIA | 3.40 | 3.41 | 1.84 | 3.31 | 2.9 | 1.92 | 2.58 | 2.2 | 1.75 |
| GO BP | CELL CYCLE CHECKPOINT | 5.72 | 3.40 | 1.83 | 0.48 | 0.96 | 0.96 | 0.68 | 0.36 | 0.9 |
| GO BP | DNA INTEGRITY CHECKPOINT | 6.30 | 3.86 | 2.00 | 0 | 0.82 | 0.94 | 0 | 0.07 | 0.85 |
| GO BP | DNA DAMAGE RESPONSE AND SIGNAL TRANSDUCTION | 4.83 | 2.91 | 1.59 | 2.02 | 1.79 | 1.07 | 1.12 | 1.26 | 1.18 |
| GO BP | DNA DAMAGE CHECKPOINT | 6.47 | 4.00 | 1.97 | 0 | 0.9 | 0.94 | 0 | 0.09 | 0.81 |
| GO BP | SENSORY ORGAN DEVELOPMENT | 10.29 | 6.08 | 2.31 | 2.72 | 1.36 | 0.87 | 1.9 | 0.95 | 1.04 |
| BioCarta | SALMONELLA PATHWAY | 15.61 | 9.63 | 2.89 | 0 | 0 | 0.91 | 0 | 0 | 0.31 |
| KEGG | CELL CYCLE | 1.1 | 1.1 | 1.39 | 2.93 | 2.70 | 1.60 | 4.39 | 3.35 | 1.34 |
| KEGG | CHRONIC MYELOID LEUKEMIA | 1.23 | 1.24 | 1.34 | 2.72 | 2.02 | 1.44 | 2.67 | 2.09 | 1.1 |
| GO BP | CELLULAR RESPIRATION | 0 | 1.01 | 0.69 | 4.55 | 3.07 | 2.20 | 1.11 | 2.25 | 2.42 |
| GO BP | AXON GUIDANCE | 0.35 | 0.62 | 0.8 | 1.97 | 1.72 | 1.28 | 0.47 | 0.95 | 0.93 |
| GO BP | CHROMOSOME ORGANIZATION AND BIOGENESIS | 0.69 | 0.77 | 0.87 | 1.46 | 2.22 | 1.46 | 3.93 | 3.17 | 1.99 |
| GO BP | DNA REPAIR | 1.76 | 1.2 | 1.22 | 1.83 | 1.6 | 1.36 | 2.23 | 2.41 | 1.64 |
| GO BP | ESTABLISHMENT AND OR MAINTENANCE OF CHROMATIN ARCHITECTURE | 0.73 | 0.91 | 0.83 | 1.78 | 2.09 | 1.39 | 3.31 | 2.57 | 1.74 |
| GO BP | HISTONE MODIFICATION | 1.76 | 1.76 | 0.95 | 1.58 | 2.28 | 1.26 | 5.58 | 4.57 | 2.34 |
| GO BP | CHROMATIN MODIFICATION | 0.96 | 0.97 | 0.84 | 2.4 | 2.4 | 1.37 | 4.29 | 3.34 | 1.75 |
| GO BP | ALCOHOL METABOLIC PROCESS | 1.44 | 1.18 | 0.93 | 2.1 | 1.84 | 1.15 | 3.48 | 2.51 | 1.53 |
denote support from ≥95% and 99% of bootstrap replicates, respectively.