| Literature DB >> 28680059 |
Xin Song1, Wen-Qing Li2,3,4, Nan Hu2, Xue Ke Zhao1, Zhaoming Wang2,5,6, Paula L Hyland2, Tao Jiang1, Guo Qiang Kong1, Hua Su2, Chaoyu Wang2, Lemin Wang2, Li Sun1, Zong Min Fan1, Hui Meng1, Tang Juan Zhang1, Ling Fen Ji1, Shou Jia Hu1, Wei Li Han1, Min Jie Wu7, Peng Yuan Zheng7, Shuang Lv7, Xue Min Li8, Fu You Zhou9, Laurie Burdett2,5, Ti Ding10, You-Lin Qiao11, Jin-Hu Fan11, Xiao-You Han10, Carol Giffen12, Margaret A Tucker2, Sanford M Dawsey2, Neal D Freedman2, Stephen J Chanock2, Christian C Abnet2, Philip R Taylor2, Li-Dong Wang13, Alisa M Goldstein14.
Abstract
Based on our initial genome-wide association study (GWAS) on esophageal squamous cell carcinoma (ESCC) in Han Chinese, we conducted a follow-up study to examine the single nucleotide polymorphisms (SNPs) associated with family history (FH) of upper gastrointestinal cancer (UGI) cancer in cases with ESCC. We evaluated the association between SNPs and FH of UGI cancer among ESCC cases in a stage-1 case-only analysis of the National Cancer Institute (NCI, 541 cases with FH and 1399 without FH) and Henan GWAS (493 cases with FH and 869 without FH) data (discovery phase). The top SNPs (or their surrogates) from discovery were advanced to a stage-2 evaluation in additional Henan subjects (2801 cases with FH and 3136 without FH, replication phase). A total of 19 SNPs were associated with FH of UGI cancer in ESCC cases with P < 10-5 in the stage-1 meta-analysis of NCI and Henan GWAS data. In stage-2, the association for rs79747906 (located at 18p11.31, P = 5.79 × 10-6 in discovery) was replicated (P = 0.006), with a pooled-OR of 1.59 (95%CI: 1.11-2.28). We identified potential genetic variants associated with FH of UGI cancer. Our findings may provide important insights into new low-penetrance susceptibility regions involved in the susceptibility of families with multiple UGI cancer cases.Entities:
Mesh:
Year: 2017 PMID: 28680059 PMCID: PMC5498542 DOI: 10.1038/s41598-017-04822-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
The association between rs79747906 (T-C) and family history in ESCC cases.
| No. of cases with FH | No. of cases without FH | MAF for cases with FH | MAF for cases without FH | OR |
| |
|---|---|---|---|---|---|---|
|
| ||||||
| NCI studyb,c | 541 | 1399 | 0.068 | 0.051 | 1.70 (1.15–2.51) | 0.007 |
| Henan studyb | 493 | 869 | 0.081 | 0.053 | 2.16 (1.44–3.23) | 0.0002 |
| Meta-analysisd | 1034 | 2268 | 0.074 | 0.052 | 1.91 (1.44–2.52) | 5.79 × 10−6 |
|
| ||||||
| Model 1e | 2801 | 3136 | 0.076 | 0.062 | 1.23 (1.06–1.42) | 0.006 |
| Model 2f | 1937 | 3136 | 0.076 | 0.062 | 1.24 (1.06–1.46) | 0.008 |
FH: family history; MAF: minor allele frequency; odds ratio: OR.
aThe P-values and ORs per one effect allele (the minor allele C) were calculated from unconditional logistic regression models using genotype-trend tests adjusted for age, sex and sub-study (for the analysis of NCI study, which includes NCI Shanxi and NIT).
bESCC cases with FH of UGI cancer (FH+) were defined as those having one or more relative(s) diagnosed with UGI cancer within three generations for NCI/Shanxi study and Henan study.
cFor the smaller NCI study (NIT), FH was limited to those having one or more relative(s) diagnosed with UGI cancer within first-degree relatives.
dCalculated using the fixed-effects model as no heterogeneity was found between the two studies (P = 0.41).
eESCC cases with family history were defined as those that have one or more relative(s) within three generations diagnosed with UGI cancer.
fESCC cases with family history were defined as those that have one or more relative(s) within first-degree diagnosed with UGI cancer.
Top SNPs in the NCI and Henan GWAS associated with family history of UGI cancer in ESCC casesa.
| Gene | Chr. | Discovery phase (Meta-analysis of NCI and Henan GWAS) | Replication phase (Henan) | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SNP (major, minor) | n1 | n2 | A1 | A2 | ORb |
| SNP (major, minor) | n1 | n2 | A1 | A2 | ORb |
| ||
|
| 11q23.3 | rs140792366 (C, G) | 1032 | 2266 | 0.027 | 0.015 | 4.23 | 7.65 × 10-7 | rs140792366 (C, G) | 2801 | 3135 | 0.006 | 0.005 | N/Af | N/Af |
|
| 2q37.1 | rs117453803 (A, T) | 1032 | 2266 | 0.075 | 0.052 | 2.07 | 9.74 × 10-7 | rs73997003 (G, A)c | 2801 | 3136 | 0.074 | 0.067 | 1.10 | 0.19 |
| 17p13.2 | rs187481103 (A, C) | 1032 | 2267 | 0.024 | 0.013 | 4.08 | 1.27 × 10−6 | rs184911713 (G)d | 2801 | 3135 | 0 | 0 | N/Af | N/Af | |
|
| 6p12.1 | rs9357885 (A, T) | 1032 | 2266 | 0.179 | 0.139 | 1.49 | 1.48 × 10−6 | rs6459122 (C, T)c | 2799 | 3135 | 0.091 | 0.094 | 0.97 | 0.65 |
| 2p24.1 | rs2049728 (T, C) | 1032 | 2268 | 0.255 | 0.310 | 0.76 | 3.81 × 10−6 | rs2049728 (T, C) | 2799 | 3135 | 0.289 | 0.296 | 0.97 | 0.38 | |
| 2q32.1 | rs57921607 (A, C) | 1032 | 2266 | 0.163 | 0.131 | 1.60 | 4.07 × 10−6 | N/Ae | N/Ae | N/Ae | N/Ae | N/Ae | N/Ae | N/Ae | |
|
| 15q21.2 | rs186503151 (C, T) | 1033 | 2268 | 0.026 | 0.014 | 3.63 | 4.14 × 10−6 | rs141703242 (C)d | 2801 | 3136 | 0 | 0 | N/Af | N/Af |
| 7q35 | rs2372415 (C, G) | 1032 | 2266 | 0.282 | 0.230 | 1.35 | 4.28 × 10−6 | rs2372415 (C, G) | 2799 | 3135 | 0.255 | 0.258 | 0.99 | 0.79 | |
| 3p22.3 | rs12631160 (T, C) | 1033 | 2267 | 0.047 | 0.027 | 2.10 | 5.35 × 10−6 | rs12631160 (T, C) | 2799 | 3135 | 0.047 | 0.047 | 1.02 | 0.85 | |
|
| 18p11.31 | rs79747906 (T, C) | 1032 | 2266 | 0.074 | 0.052 | 1.91 | 5.79 × 10−6 | rs79747906 (T, C) | 2801 | 3136 | 0.076 | 0.062 |
|
|
|
| 7p15.3 | rs1549391 (C, T) | 1033 | 2267 | 0.040 | 0.069 | 0.59 | 6.32 × 10−6 | rs1549391 (C, T) | 2801 | 3136 | 0.052 | 0.051 | 1.05 | 0.59 |
|
| 7p21.3 | rs2285970 (T, C) | 1032 | 2266 | 0.037 | 0.019 | 2.43 | 6.41 × 10−6 | rs2285970 (T, C) | 2799 | 3135 | 0.030 | 0.036 | 0.83 | 0.08 |
| 3p22.3 | rs1073209 (T, C) | 1033 | 2268 | 0.095 | 0.067 | 1.61 | 6.52 × 10−6 | rs4955227 (T, G)c | 2801 | 3136 | 0.074 | 0.071 | 1.04 | 0.54 | |
|
| 12p13.32 | rs1046165 (T, C) | 1034 | 2268 | 0.529 | 0.465 | 1.28 | 7.14 × 10−6 | rs1046165 (T, C) | 2801 | 3136 | 0.484 | 0.479 | 1.02 | 0.60 |
| 20q13.31 | rs59635584 (G, T) | 1032 | 2266 | 0.035 | 0.020 | 2.54 | 7.37 × 10−6 | rs59635584 (G, T) | 2801 | 3136 | 0.019 | 0.020 | 0.97 | 0.80 | |
|
| 17p13.1 | rs1050533 (T, C) | 1032 | 2266 | 0.508 | 0.457 | 1.30 | 8.56 × 10−6 | rs1050541 (T, G)c | 2801 | 3135 | 0.424 | 0.410 | 1.05 | 0.19 |
|
| 19p13.12 | rs12461816 (C, T) | 1032 | 2266 | 0.218 | 0.176 | 1.39 | 8.91 × 10-6 | rs12461816 (C, T) | 2801 | 3136 | 0.209 | 0.202 | 1.04 | 0.37 |
|
| 6p12.1 | rs2745751 (A, C) | 1033 | 2267 | 0.112 | 0.080 | 1.54 | 8.96 × 10-6 | rs1883703 (A, T)c | 2799 | 3135 | 0.080 | 0.082 | 0.98 | 0.72 |
|
| 17p13.1 | rs58614441 (T, C) | 1032 | 2266 | 0.186 | 0.148 | 1.43 | 9.11 × 10-6 | rs34416693 (G, A)c | 2800 | 3135 | 0.237 | 0.230 | 1.03 | 0.57 |
an1 is the number of cases with family history while n2 is the number of cases without family history. A1 is the allele frequency of the minor allele (effect allele) in cases with family history and A2 is the allele frequency of the minor allele (effect allele) in cases without family history. SNPs are ordered based on the increasing P-values in the discovery phase.
bThe P-values and ORs for the SNP (per one minor allele) were calculated from unconditional logistic regression models using genotype-trend tests adjusted for age, sex and sub-study (for the analysis of NCI study, which includes NCI Shanxi and NIT).
cSurrogate SNPs were selected to replace the original SNP from the meta-analysis of the NCI and Henan GWAS. The surrogates were located within 200 kb on either side of each targeted SNP and were selected based on the r2 with the targeted SNP in the genotype data from 1000 Genomes project JPT + CHB population. The linkage r2 was 1.00 for rs73997003 with rs117453803, 0.62 for rs6459122 with rs9357885, 0.93 for rs4955227 with rs1073209, 0.58 for rs1050541 with rs1050533, 0.94 for rs1883703 with rs2745751, and 0.59 for rs34416693 with rs58614441.
dFor rs187481103 and rs186503151, we were unable to find good surrogates that could survive the assay design. Given the monoallelic nature of the tested surrogates (rs184911713 and rs141703242) in our study population, we were essentially unable to test/replicate the findings for those two SNPs.
eThis SNP (rs57921607) and its surrogate(s) failed in the follow-up study.
fORs and P values not calculated because of the extremely rare minor allele.
Functional annotation of rs79747906 and rs12461816.
| SNP | rs79747906 (risk allele C, 18p11.31) | rs12461816 (risk allele T, 19p13.12) |
|---|---|---|
| Gene Annotation |
|
|
| Chromatin Regulatory Statesa | None | Stomach Smooth Muscle (ReprPC_W) |
| DNaseI Siteb | No | No |
| Alters CpGc | No | Yes, methylated CpG in normal esophagus and gastric tissues |
| Protein(s) boundd | Weak CTCF binding | None |
| DNA Motif(s) alterede | CDP, Nanog, PAX4 and SMAD2 | MSX-1 |
| eQTL (risk allele association, | No significant associationg |
|
| eQTL (risk allele association, | No significant associationg |
|
| eQTL (risk allele association, | No significant associationg |
|
| Number of eQTL gene tests (significance | 17 tests ( | 57 tests ( |
aChromatin State Segmentation using a Hidden Markov Model (ChromHMM) by Auxillary Core Marks + K27Ac in adipose derived mesenchymal stem cells (ASC), adipose nuclei (AN), blood cells (CD19, CD8 and CD4), stomach mucosa (SM), stomach smooth muscle (SMM), gastric and esophagus tissues issues from Roadmap (http://www.roadmapepigenomics.org/). Repressed histone methylation (Re), Repressed polycomb weak (ReprPC_W), Repressed polycomb(ReprPC), Heterochromatin (H), Transcription (T), Quiescent (Q), and Enhancer (E).
bDNaseI Hypersensitivity sites were assigned by DNase I hypersensitive assay in 125 cell types in ENCODE (http://genome.ucsc.edu/ENCODE/) and in NIH Roadmap data.
cDNA CpG altered in esophagus and stomach tissues from NIH Roadmap data.
dTranscription binding sites and binding motif defined by chromatin immunoprecipitation sequencing for 161 factors were obtained from ENCODE and HaploReg v4.
eDNA binding motifs for transcription factors (TF) and proteins obtained from HaploReg v4. DNA motifs were assigned to a factor group based on TF ChIP experiments and known DNA motifs as described by Keheradpour P and Kellis M[25].
fExpression quantitative trait loci (eQTL) analysis were conducted in normal esophageal tissues: esophageal mucosa[Mucosa]; esophageal muscularis [Muscularis]; and gastroesophageal junction [GJE], normal stomach mucosa, and the whole blood. We examined coding and non-coding genes in cis or located within a 1MB of the signal and known to be expressed at the mRNA in the target tissue. Results were obtained from the Genotype-Tissue Expression (GTEx) Project (http://www.gtexportal.org/home/). P-values were calculated based on linear regression between log and quantile normalized RNA-seq expression values and imputation-based genotype with 3 genotyping principal components, 15 peer factors, and gender as covariates.
gNo significant association after considering multiple comparisons for number of tests. Suggestive associations with P < 0.05 are shown in SI Table 3.