| Literature DB >> 35296751 |
Helmut Schaschl1, Tobias Göllner2, David L Morris3.
Abstract
ALDH2 is a key enzyme in alcohol metabolism that protects cells from acetaldehyde toxicity. Using iHS, iSAFE and FST statistics, we identified regulatory acting variants affecting ALDH2 gene expression under positive selection in populations of European ancestry. Several SNPs (rs3184504, rs4766578, rs10774625, rs597808, rs653178, rs847892, rs2013002) that function as eQTLs for ALDH2 in various tissues showed evidence of strong positive selection. Very large pairwise FST values indicated high genetic differentiation at these loci between populations of European ancestry and populations of other global ancestries. Estimating the timing of positive selection on the beneficial alleles suggests that these variants were recently adapted approximately 3000-3700 years ago. The derived beneficial alleles are in complete linkage disequilibrium with the derived ALDH2 promoter variant rs886205, which is associated with higher transcriptional activity. The SNPs rs4766578 and rs847892 are located in binding sequences for the transcription factor HNF4A, which is an important regulatory element of ALDH2 gene expression. In contrast to the missense variant ALDH2 rs671 (ALDH2*2), which is common only in East Asian populations and is associated with greatly reduced enzyme activity and alcohol intolerance, the beneficial alleles of the regulatory variants identified in this study are associated with increased expression of ALDH2. This suggests adaptation of Europeans to higher alcohol consumption.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35296751 PMCID: PMC8927298 DOI: 10.1038/s41598-022-08588-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(a) iHS p-values plotted across the human chromosomal region 12.q24.12 for the population GBR (European genetic ancestry); red/green lines: threshold for significant (p < 0.01; Bonferroni correction p < 1 × 10–5) iHS scores; (b) pairwise F (GBR–LWK); red line: significant outlier loci with F > 0.3. Bottom: position of genes and SNPs from Table 1.
SNPs under positive selection at the human chromosomal region 12q24.12 in populations with European ancestry (GBR, TSI, FIN). Given are iHS scores and the calculated (-log) p-values (in bold Bonferroni correction with p < 1 × 10–5), the timing (t) of positive selection on the derived beneficial allele in thousand years ago (kya) and 95% credible interval (CI) (rounded to one decimal figure), the estimated selection coefficients (s) in GBR, average allele frequency in % for the derived beneficial allele/ancestral allele in the different ancestries and global locus-specific F values (sd = standard deviation) calculated across all analysed populations.
| Beneficial allele/ ancestral allele | Location | iHS | iHS | 95% CI | Allele frequency in % | Locus-specific | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GBR | TSI | FIN | AFR | EUR | SAS | EAS | |||||||
| rs3184504-T/C | Exon, | − 3.2 | − 3.3 | − 2.9 | 3.2; 3.3; 2.7 | 0.1 (93.7) | 3.7 | 3.2–4.3 | 0.2/99.8 | 46/54 | 7/93 | 0.2/99.8 | 0.351 (0.060) |
| rs4766578-T/A | Intron, | − 3.8 | − 3.1 | − 3.1 | 4.1; 3.0; 3.0 | 0.1 (89.3) | 3.5 | 3.0–4.0 | 0.2/99.8 | 48/52 | 7/93 | 0.2/99.8 | 0.366 (0.063) |
| rs10774625-A/G | Intron, | − 3.8 | − 3.1 | − 3.0 | 4.1; 3.0; 2.9 | 0.1 (89.9) | 3.0 | 2.7–3.4 | 0.2/99.8 | 48/52 | 7/93 | 0.2/99.8 | 0.366 (0.062) |
| rs597808-A/G | Intron, | − 3.8 | − 4.2 | − 3.1 | 4.1; 4.8; 3.0 | 0.09 (69.9) | 3.5 | 3.0–4.1 | 0.2/99.8 | 47/53 | 7/93 | 0.2/99.8 | 0.352 (0.064) |
| rs653178-C/T | Intron, | − 3.5 | − 4.4 | − 3.3 | 3.7; | 0.05 (51.7) | 3.1 | 2.6–3.7 | 0.3/99.7 | 47/53 | 7/93 | 0/100 | 0.356 (0.063) |
| rs847892-G/A | Intron, | − 2.7 | − 2.7 | − 2.7 | 2.5; 2.5; 2.5 | 0.04 (49.2) | 6.0 | 5.1–7.0 | 0.4/99.6 | 69/31 | 30/70 | 7/93 | 0.405 (0.080) |
| rs2013002-T/C | Intron, | − 2.7 | − 2.1 | − 2.4 | 2.5; 1.8; 2.1 | 0.08 (53.2) | 3.1 | 2.8–3.6 | 0.3/99.7 | 41/59 | 6/94 | 0.3/99.7 | 0.315 (0.053) |
Figure 2iSAFE scores plotted for SNPs surrounding the chr12q24.12 region (5.6 Mbp window) for the population GBR (European genetic ancestry); also indicated are the SNPs identified by the iHS statistics as being under positive selection; the top-ranked SNPs are above the threshold sores iSAFE > 0.304.
Pairwise LD (D′) of SNPs under positive selection in populations of European ancestry (GBR + TSI + FIN) and the ALDH2 promoter (*) variant rs886205; all calculated D′ values with p-value < 0.0001 (χ2 statistics).
| Chr:pos | SNP | LD ( | ||||||
|---|---|---|---|---|---|---|---|---|
| rs3184504 | rs4766578 | rs10774625 | rs597808 | rs653178 | rs847892 | rs2013002 | ||
| chr12:111,884,608 | rs3184504 | – | ||||||
| chr12:111,904,371 | rs4766578 | 1.0 | – | |||||
| chr12:111,910,219 | rs10774625 | 1.0 | 1.0 | – | ||||
| chr12:111,973,358 | rs597808 | 0.98 | 0.986 | 0.986 | – | |||
| chr12:112,007,756 | rs653178 | 0.98 | 0.979 | 0.979 | 0.859 | |||
| chr12:112,141,570 | rs847892 | 0.846 | 0.851 | 0.851 | 0.846 | 1.0 | – | |
| chr12:112,200,150 | rs2013002 | 0.956 | 0.970 | 0.97 | 1.0 | 1.0 | 1.0 | – |
| chr12:112,204,427 | *rs886205 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 |
GTEx and RegulomeDB data on SNPs under positive selection in European populations (GBR, TSI, FIN). Given is also a summary of reported traits from the NHGRI-EBI GWAS catalogue. GTEx eQTLs–eGene interaction with p < 0.0001. RegulomeDB rank: 2b: TF binding + any motif + DNase Footprint + DNase peak; 3a: TF binding + any motif + DNase peak; 4–5: TF binding + DNase peak; 6: motif hit. The RegulomeDB probability score ranges from 0 to 1, with 1 being most likely to be a regulatory variant (for further details see[54]). Transcription factor HNF4A, an important regulatory element of the ALDH2 gene expression, is given in bold.
| GTEx | RegulomeDB | GWAS reported traits | ||||
|---|---|---|---|---|---|---|
| eQTL | eGene | Rank | Score | Chromatin state | Motif | |
| rs3184504 | 3a | 0.67022 | Strong transcription; enhancers | Cardiovascular disease, blood pressure, ischemic stroke, glaucoma, rheumatoid arthritis, cancer, celiac disease, type I diabetes mellitus, parental longevity, inflammatory bowel disease, multiple sclerosis, blood cell count, hypothyroidism, haemoglobin measurement | ||
| rs4766578 | 2b | 0.63936 | Strong transcription; enhancers | Sjögren's syndrome, reticulocyte fraction of red cells, arthritis, vitiligo, HDL cholesterol, smoking status, coronary artery disease | ||
| rs10774625 | 5 | 0 | Strong transcription; enhancers | Hypertension, myocardial infarction, coronary artery disease, asthma, cholesterol levels, systemic lupus erythematosus, urate measurement, life span, systolic blood pressure, hypothyroidism, glomerular filtration rate | ||
| rs597808 | 5 | 0.13454 | Strong transcription | Systolic blood pressure, alcohol drinking, diastolic blood pressure, cholesterol levels, apolipoprotein B levels, colorectal cancer, allergic diseases, haematocrit, systemic lupus erythematosus, allergy | ||
| rs653178 | 4 | 0.60906 | Active TSS; strong transcription; enhancers | Allergic disease, asthma, celiac disease, cholesterol level, eczema, Crohn's disease, chronic kidney disease, blood pressure, eosinophil counts, inflammatory bowel disease, type 1 diabetes, urate level | ||
| rs847892 | 6 | 0.20016 | Active TSS; strong transcription; enhancers | No data | ||
| rs2013002 | 6 | 0.55195 | Active TSS; enhancers | Alcohol drinking and blood pressure | ||