| Literature DB >> 31527586 |
Jiyuan An1, Puya Gharahkhani1, Matthew H Law1, Jue-Sheng Ong1, Xikun Han1, Catherine M Olsen2, Rachel E Neale3,4,5, John Lai6, Tom L Vaughan7, Ines Gockel8, René Thieme8, Anne C Böhmer9,10, Janusz Jankowski11, Rebecca C Fitzgerald12, Johannes Schumacher9,10,13, Claire Palles14, David C Whiteman2, Stuart MacGregor15.
Abstract
Gastroesophageal reflux disease (GERD) is caused by gastric acid entering the esophagus. GERD has high prevalence and is the major risk factor for Barrett's esophagus (BE) and esophageal adenocarcinoma (EA). We conduct a large GERD GWAS meta-analysis (80,265 cases, 305,011 controls), identifying 25 independent genome-wide significant loci for GERD. Several of the implicated genes are existing or putative drug targets. Loci discovery is greatest with a broad GERD definition (including cases defined by self-report or medication data). Further, 91% of the GERD risk-increasing alleles also increase BE and/or EA risk, greatly expanding gene discovery for these traits. Our results map genes for GERD and related traits and uncover potential new drug targets for these conditions.Entities:
Mesh:
Year: 2019 PMID: 31527586 PMCID: PMC6746768 DOI: 10.1038/s41467-019-11968-2
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Genetic correlation between phenotypes. The lines with two arrows show the genetic correlation (standard error in brackets) from the result of LD-score regression r, genetic correlation estimates from the LD score regression. There is sample overlap between UKBB cases with either ICD10, self-report or medicine based GERD and the numbers do not add up to the total UKBB samples size. As the correlation is computed as the estimated covariance divided by product of the estimated standard error of the two traits, the correlation estimates may be slightly >1 when the correlation is high. NA* sample size is too small to estimate r
Identified SNPs for GERD (Chromosomes 1–6)
| Chr | SNP | BP | A1/A2 | OR [95% CI] | GERD adj. | Meta | BE/EA | Gene context | eQTL gene | PheWAS |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | rs1937450 | 66M | T/G | 0.97 [0.96–0.98] | 1.63 × 10−8 | --- | 0.73 | [ | Obesity (3.0 × 10−9) | |
| 1 | rs7552188 | 98M | T/C | 1.04 [1.03–1.06] | 3.76 × 10−10 | ++? | 0.54 | [ | Obesity (2.1 × 10−10) | |
| 2 | rs11901649 | 21M | G/A | 1.03 [1.02–1.04] | 2.14 × 10−8 | ++? | 1.03 × 10−7 | [ | ||
| 2 | rs4362541 | 68M | A/T | 1.05 [1.03–1.06] | 3.38 × 10−10 | +++ | 0.0033 | |||
| 2 | rs7609078 | 100M | G/A | 1.04 [1.02–1.05] | 2.54 × 10−9 | +++ | 0.22 | [ | Height (1.9 × 10−5) | |
| 3 | rs74652506 | 43M | C/T | 0.96 [0.94–0.97] | 4.16 × 10−8 | --- | 0.95 | [ | Obesity (2.0 × 10−7), Height (5.3 × 10−5) | |
| 3 | rs7613875 | 50M | C/A | 0.97 [0.96–0.98] | 3.67 × 10−8 | --- | 0.068 | BMI (6.9 × 10−33), EDU (3.6 × 10−28) | ||
| 3 | rs4676893 | 71M | A/T | 1.04 [1.03–1.05] | 5.24 × 10−10 | +++ | 8.4 × 10−12 | Obesity (5.4 × 10−9) | ||
| 4 | rs809955 | 141M | G/A | 1.04 [1.03–1.05] | 4.69 × 10−10 | +++ | 0.014 | [ | Obesity (7.7 × 10−12), smoking (9.7 × 10−7) | |
| 5 | rs10940767 | 29M | T/A | 0.96 [0.95–0.98] | 3.45 × 10−10 | --- | 1.56 × 10−6 | |||
| 5 | rs72771256 | 83M | G/A | 1.04 [1.03–1.06] | 2.96 × 10−8 | +?+ | 0.046 | Obesity (5.3 × 10−8) | ||
| 6 | rs7763910 | 26M | A/G | 0.96 [0.95–0.98] | 2.41 × 10−8 | --+ | 0.040 | [ | EDU (4.6 × 10−7), Height (2.8 × 10−6) | |
| 6 | rs9266237 | 31M | G/C | 0.96 [0.95–0.97] | 1.46 × 10−9 | --? | 0.19 |
Chr: (chromosome) and bp: base-pair position of SNP in hg19; A1/A2 refers to the effect allele and noneffect allele for the SNP; OR: GERD meta-analysis odds ratio and 95% confidence interval (confidence intervals were inflated by the square root of the LD-score regression intercept to account for potential residual stratification); GERD adj. P: adjusted P-value of SNP association in meta-analysis of UKBB, i.e., each SNP’s chi-squared value was divided by the intercept (1.04) from LD-score regression. 23andMe and QSkin for GERD; The column “Meta” shows the direction of risk association for the effect allele in order of the UKBB, 23andMe and QSkin GERD GWASs where “+” denotes increased risk, and “-” denotes decreased risk and “?” denotes that the result is missing for that study. P-value BE/EA shows P-value of SNP association for analysis of BE/EA samples. The “Gene context” column refers to the genes nearby (<200 kb) the most associated GERD SNP; brackets indicate that the peak SNP or those in LD (r2 > 0.8) are within the specified genes. The “-” sign after the genes indicates that the genes are located upstream of the top SNP, while the “+” after genes shows that the genes are located downstream of the top SNP. If any gene in the column has significant eQTL-association with the SNP from any of the 44 GTEx Tissues (https://gtexportal.org/), the gene is shown in eQTL column and the P-value quoted describes the strength of the relationship to the corresponding SNP. GWAS results for BMI, obesity, education, major depression, height and smoking were retrieved from https://genetics.opentargets.org, and are listed in the PheWAS column. More detailed eQTL and phewas results are in Supplementary Table 1. For each significant independent association a locuszoom plot is given in Supplementary Fig. 3.
Identified SNPs for GERD (Chromosomes 7–22))
| Chr | SNP | BP | A1/A2 | OR [95% CI] | GERD adj. | Meta | BE/EA | Gene context | eQTL gene | PheWAS |
|---|---|---|---|---|---|---|---|---|---|---|
| 7 | rs4721096 | 1.8M | T/C | 0.96 [0.94–0.97] | 2.24 × 10−8 | --- | 0.020 | [ | BMI (1.1 × 10−12) | |
| 7 | rs10242223 | 3.5M | A/G | 1.04 [1.02–1.05] | 2.54 × 10−8 | +++ | 0.0040 | [ | Smoking (2.2 × 10−8) | |
| 7 | rs10228350 | 114M | A/T | 0.97 [0.96–0.98] | 3.87 × 10−8 | --- | 0.024 | [ | MDD (3.3 × 10−9) | |
| 7 | rs12706746 | 126M | G/A | 0.97 [0.95–0.98] | 1.98 × 10−8 | --- | 0.45 | [ | MDD (1.5 × 10−7) | |
| 11 | rs12792379 | 6M | G/A | 0.96 [0.95–0.98] | 3.91 × 10−8 | --- | 0.00023 | |||
| 12 | rs11171710 | 56M | G/A | 0.97 [0.96–0.98] | 1.80 × 10−8 | --- | 0.078 | [ | Obesity (3.3 × 10−9) | |
| 12 | rs597808 | 112M | A/G | 1.03 [1.02–1.04] | 1.33 × 10−8 | +++ | 0.78 | [ | Obesity (1.2 × 10−23) | |
| 17 | rs34796998 | 50M | C/G | 0.96 [0.95–0.97] | 2.98 × 10−11 | --- | 0.073 | MDD (7.2 × 10−6) | ||
| 19 | rs1363119 | 18M | A/G | 1.04 [1.03–1.05] | 5.81 × 10−10 | +++ | 0.00032 | |||
| 19 | rs12974777 | 19M | C/T | 1.04 [1.02–1.05] | 6.92 × 10−9 | +?+ | 0.0014 | [ | ||
| 21 | rs1297211 | 16M | C/G | 1.03 [1.02–1.04] | 3.64 × 10−8 | +++ | 0.31 | Obesity (7.5 × 10−7) | ||
| 21 | rs7282609 | 34M | A/G | 0.97 [0.95–0.98] | 4.18 × 10−8 | --- | 0.077 | EDU (1.4 × 10−7) |
Chr: (chromosome) and bp: base-pair position of SNP in hg19; A1/A2 refers to the effect allele and noneffect allele for the SNP; OR: GERD meta-analysis odds ratio and 95% confidence interval (confidence intervals were inflated by the square root of the LD-score regression intercept to account for potential residual stratification); GERD adj. P: adjusted P-value of SNP association in meta-analysis of UKBB, i.e., each SNP’s chi-squared value was divided by the intercept (1.04) from LD-score regression. 23andMe and QSkin for GERD; The column “Meta” shows the direction of risk association for the effect allele in order of the UKBB, 23andMe and QSkin GERD GWASs where “+” denotes increased risk, and “-” denotes decreased risk and “?” denotes that the result is missing for that study. P-value BE/EA shows P-value of SNP association for analysis of BE/EA samples. The “Gene context” column refers to the genes nearby (<200 kb) the most associated GERD SNP; brackets indicate that the peak SNP or those in LD (r2 > 0.8) are within the specified genes. The “-” sign after the genes indicates that the genes are located upstream of the top SNP, while the “+” after genes shows that the genes are located downstream of the top SNP. If any gene in the column has significant eQTL-association with the SNP from any of the 44 GTEx Tissues (https://gtexportal.org/), the gene is shown in eQTL column and the P-value quoted describes the strength of the relationship to the corresponding SNP. GWAS results for BMI, obesity, education, major depression, height and smoking were retrieved from https://genetics.opentargets.org, and are listed in the PheWAS column. More detailed eQTL and phewas results are in Supplementary Table 1. For each significant independent association a locuszoom plot is given in Supplementary Fig. 3 (note for rs12974777 the SNP shown is the top SNP in the region after conditioning on nearby SNP rs1363119).
Fig. 2Manhattan plot for GERD from meta-analysis of 81,077 GERD cases and 307,284 controls. The x-axis shows genomic position (chromosome 1–22) and the y-axis shows the log10 (P-value) of the SNP association. The threshold for genome-wide significance is set at P = 5.0 × 10−8 (the red-dotted horizontal line)
Additional GERD genes identified via gene-based tests
| Gene | Gene interval | Top SNP | SNP- | Gene- | eQTL | PheWAS |
|---|---|---|---|---|---|---|
| 1:3,320,749–33,240,571 | rs61798840 | 2.27 × 10−5 | 1.61 × 10−6 | Obesity (6.5 × 10−6) | ||
| 1:66,258,193–66,840.262 | rs1937450 | 1.63 × 10−8 | 7.01 × 10−9 | Obesity (3.0 × 10−9) | ||
| 1:174,128,552–174,964,445 | rs333423 | 4.70 × 10−7 | 6.04 × 10−9 | 1.7 × 10−30 | MDD (2.8 × 10−6) | |
| 3:49,924,435–49,941,311 | rs7613875 | 4.89 × 10−8 | 6.33 × 10−8 | 7.9 × 10−12 | BMI(6.9 × 10−33 EDU(3.6 × 10−28) | |
| 3:50126341–50156397 | rs7613875 | 4.89 × 10−8 | 5.28 × 10−8 | 9.0 × 10−60 | BMI(6.9 × 10−33 EDU(3.6 × 10−28) | |
| 3:50192562–50226508 | rs7613875 | 4.89 × 10−8 | 3.49 × 10−7 | 9.2 × 10−6 | BMI(6.9 × 10−33 EDU(3.6 × 10−28) | |
| 3:65,339,200–66,024,511 | rs9815340 | 2.36 × 10−6 | 2.06 × 10−6 | |||
| 4:17,844,839–18,023,483 | rs1993638 | 1.06 × 10−6 | 1.73 × 10−6 | |||
| 5:122,110,691–122,170,234 | rs7707685 | 1.26 × 10−7 | 1.30 × 10−6 | NA | ||
| 5:155,462,147–156,194,799 | rs56785833 | 2.13 × 10−6 | 1.88 × 10−6 | 2.4 × 10−18 | ||
| 6:129,204,286–129,837,711 | rs9492232 | 5.97 × 10−7 | 2.67 × 10−7 | NA | Obesity (9.4 × 10−9), Smoking (3.4 × 10−6) | |
| 7:26,331,515–26,413,949 | rs12700707 | 2.48 × 10−6 | 2.32 × 10−7 | NA | ||
| 7:158,820,866–158,937,649 | rs2730263 | 4.95 × 10−7 | 1.81 × 10−6 | 4.3 × 10−17 | Smoking (4.7 × 10−5) | |
| 9:129,376,722–129,463,311 | rs10760444 | 9.01 × 10−7 | 1.87 × 10−7 | NA | Obesity (1.3 × 10−8) | |
| 12:112,212,3857–112,194,911 | rs597808 | 1.33 × 10−8 | 2.52 × 10−7 | NA | Obesity (1.3 × 10−23) Smoking(1.1 × 10−10) | |
| 14:75,230,025-75,304,013 | rs17183201 | 2.08 × 10−6 | 4.98 × 10−7 | NA | ||
| 16:89,574,802–89,624,174 | rs57696383 | 2.50 × 10−7 | 8.86 × 10−7 | 1.4 × 10−21 | Height(2.1 × 10−16), Obesity (1.2 × 10−12) | |
| 18:42,258,849–42,648,475 | rs2028653 | 9.73 × 10−6 | 1.18 × 10−6 | NA | Obesity (1.6 × 10−4), Height (1.5 × 10−5) | |
| 18:49,866,542–51,062,273 | rs56796226 | 8,62 × 10−6 | 4.98 × 10−10 | 2.6 × 10−5 | MDD (1.0 × 10−8), EDU (1.1 × 10−6), Height (1.2 × 10−5) | |
| 18:52,889,562–5,330,322 | rs77262239 | 1.02 × 10−6 | 8.99 × 10−8 | NA | MDD (4.0 × 10−6) |
Gene-based test results from MAGMA software. Regions which already exceeded per-SNP genome-wide significance in Tables 1 and 2 are not shown here. The gene interval in the form Chromosome:start–stop bp is shown mapped to hg19. Top SNP refers to the SNP with the lowest P-value in the region, with SNP-P the P-value for that SNP. Gene-based-P is the gene-based P-value. The eQTL column shows the lowest P-value for testing the correlation between gene expression and genotype of the top SNP in 44 GTEx tissues. PheWAS column shows any strong associations for the top SNP with any of BMI, obesity, education, major depression, height, and smoking. The PheWAS results were retrieved from the online Open-Targets database https://genetics.opentargets.org. They are listed in the PheWAS column alongside its P-value. More detailed eQTL and PheWAS results are in Supplementary Table 14.
Fig. 3Traits with significant genetic correlation with GERD. Vertical axis displays genetic correlation from LD-score regression. Error bars denote ±1 standard error
Putative drug target genes from GERD GWAS
| Gene | chr | Start, stop | Evidence for this particular gene | Drug(s) | Currently used for GERD/EA/BE | Highly expressed tissues(TPM) |
|---|---|---|---|---|---|---|
| 11 | 6,280,841–6,293,363 | Peak SNP rs12792379 is an eQTL for | PROGLUMIDE, NETAZEPIDE | Yes | Stomach (40), brain frontal cortex (22), brain cortex (22), brain cerebellum (22), brain cerebellar hemisphere (22) | |
| 3 | 134,514,099–134,851,891 | Peak SNP rs60716777 is between | VANDETANIB | Yes | Brain-cerebellar hemisphere (33), brain-cerebellum (23), brain-caudate (10), brain-putamen (9.5), cells-EBV (9.3) | |
| 12 | 56,360,556–56,366,568 | See paragraph in discussion section | Multiple | Yes | EBV (64), esophagus-muscularis (32), lung (32), ovary (32), skin (28) | |
| 1 | 97,543,300–98,386,615 | Peak SNP rs7552188 is an eQTL (esophageal mucosa) for | ENILURACIL | No | Transformed fibroblasts (40), lung (27), whole blood (20), spleen (19), ovary (19) | |
| 1 | 97,543,300–98,386,615 | Signal wholly over | Multiple | No | Brain-spinal cord (29), brain-nucleus (17), brain-hypothalamus (14), brain-frontal (14), brain-cortex (14) | |
| 3 | 49,924,436–49,941,306 | Peak SNP rs7613875 is an eQTL (esophagus mucosa) for | NARNATUMAB, MK-8033, MGCD-265 | No | Brain-spinal cord (30), brain-nucleus (17), brain-substantia (16), brain-frontal (16), spleen (16) | |
| 6 | 129,204,286–129,837,710 | Signal wholly over | OCRIPLASMIN | No | Ovary (121), nerve-tibial (91), fallopian tube (74), transformed fibroblasts (72), cervix-endocervix (55) |
Two further genes (KCNB2 on chromosome 8, SLC9A3 on chromosome 5) are drug targets, but we have not listed them in Table 4 as the link from the peak SNP to the putative target gene was weaker than for those in Table 4 (e.g., there are several equally likely candidate genes in the region). The top five expressed genes obtained from GTEx portal (https://gtexportal.org/home/gene, TPM is transcripts per kilobase million).