Juncheng Dai1, Meng Zhu2, Cheng Wang2, Wei Shen2, Wen Zhou2, Jie Sun2, Jia Liu2, Guangfu Jin1, Hongxia Ma1, Zhibin Hu3, Dongxin Lin4, Hongbing Shen3.
Abstract
Genetic susceptibility variants identified by genome-wide association studies mostly lie outside protein-coding regions. This suggests that variants located in transcriptional regulatory regions may play an important role in carcinogenesis, including that of lung cancer. In the present study, we systematically investigated the associations between variants in the binding sites of the widespread transcription factor CTCF and lung cancer risk in a Chinese population. A two-stage case-control design was used to evaluate the variants located in the uniform CTCF ChIP-seq peaks (2,331 cases vs 3,077 controls; 1,115 cases vs 1,346 controls). The ChIP-seq data for CTCF in the lung cancer cell line A549 were downloaded from the ENCODE database. Imputation was performed to increase genome coverage in the CTCF binding regions. Three variants in CTCF binding sites were associated with lung cancer risk in the first stage. Further replication revealed that a novel single nucleotide polymorphism, rs60507107, was significantly associated with an increased risk of lung cancer across the two stages (additive model: OR = 1.19, 95% CI = 1.11-1.27, P = 6.98 × 10−7). Our results indicate that rs60507107, located in a CTCF binding site, is associated with an increased risk of lung cancer. This finding may further advance our understanding of regulatory DNA sequences in cancer development.
Year: 2015 PMID: 25592173 PMCID: PMC4296290 DOI: 10.1038/srep07833
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Associations of the 15 SNPs in the discovery stage
| SNP | Chr. | BP | Related gene | Info | Allele | MAF (1000 Genome) | MAF (Cases) | MAF (Controls) | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| rs75772117 | 4 | 6744829 | | 0.81 | G > A | 0.07 | 0.08 | 0.10 | 3.37 × 10−2 | 1.26 × 10−5 | 1.09 × 10−6 |
| rs10072980 | 5 | 33181214 | | 0.90 | T > C | 0.14 | 0.15 | 0.18 | 2.88 × 10−3 | 9.80 × 10−5 | 2.77 × 10−7 |
| rs1632447 | 6 | 29688886 | | 0.99 | C > A | 0.02 | 0.03 | 0.02 | 3.57 × 10−2 | 3.79 × 10−7 | 2.05 × 10−6 |
| rs114731497 | 6 | 29653824 | | 0.98 | C > G | 0.02 | 0.04 | 0.02 | 8.65 × 10−4 | 3.53 × 10−7 | 3.22 × 10−8 |
| rs115786093 | 6 | 30451515 | | 0.92 | G > A | 0.05 | 0.04 | 0.03 | 1.45 × 10−5 | 1.57 × 10−3 | 1.31 × 10−6 |
| rs115364068 | 6 | 30325531 | | 0.98 | T > C | 0.07 | 0.07 | 0.05 | 1.62 × 10−4 | 2.15 × 10−8 | 1.88 × 10−9 |
| rs115299438 | 6 | 31164828 | | 1.00 | C > A | 0.09 | 0.10 | 0.13 | 4.88 × 10−4 | 8.94 × 10−5 | 7.32 × 10−6 |
| rs2252937 | 6 | 31461613 | | 0.89 | T > C | 0.08 | 0.14 | 0.11 | 3.38 × 10−3 | 7.89 × 10−5 | 3.86 × 10−6 |
| rs3124203 | 10 | 30799716 | | 0.92 | C > G | 0.10 | 0.07 | 0.10 | 3.04 × 10−2 | 3.67 × 10−4 | 9.54 × 10−6 |
| rs11617518 | 13 | 24290267 | | 1.00 | C > T | 0.32 | 0.32 | 0.28 | 6.97 × 10−4 | 9.03 × 10−4 | 4.07 × 10−6 |
| rs12100587 | 14 | 106004323 | | 0.88 | T > G | 0.44 | 0.41 | 0.47 | 1.65 × 10−3 | 1.16 × 10−9 | 1.76 × 10−10 |
| rs2836333 | 21 | 39724700 | | 0.80 | A > G | 0.24 | 0.24 | 0.27 | 1.63 × 10−2 | 2.71 × 10−4 | 5.42 × 10−6 |
a: all selected SNPs were identified by Imputation;
b: imputed quality info;
c: minor allele frequency of ASN in 1000 Genome;
d: adjusted by age, gender, pack-year of smoking and pca1.
Figure 1. rs60507107 in CTCF TFBS ChIP-seq peaks.
Associations of the 2 replicated SNPs in the GWAS and replication stages
| SNP (cytoband, BP) | Genotype | GWAS: Cases/Controls | GWAS: OR (95% CI) | GWAS: P | Validation: Cases/Controls | Validation: OR (95% CI) | Validation: P | Combined: OR (95% CI) | Combined: P |
|---|---|---|---|---|---|---|---|---|---|
| rs2002059 | AA | 2020/2587 | 1.00 | | 929/1141 | 1.00 | | 1.00 | |
| 10q24.2, 101130855 | AG | 306/469 | 0.73 (0.63–0.85) | 6.75 × 10−5 | 160/185 | 1.06 (0.85–1.34) | 5.90 × 10−1 | 0.91 (0.80–1.04) | 1.62 × 10−1 |
| | GG | 5/21 | 0.23 (0.10–0.55) | 9.27 × 10−4 | 10/7 | 1.77 (0.67–4.68) | 2.51 × 10−1 | 0.73 (0.38–1.39) | 3.36 × 10−1 |
| | Additive | | 0.62 (0.53–0.73) | 1.07 × 10−8 | | 1.11 (0.90–1.37) | 3.39 × 10−1 | 0.90 (0.80–1.02) | 1.01 × 10−1 |
BP: base position;
a: Adjusted by age, gender, pack-year of smoking and pca1.
b: Adjusted by age, gender and pack-year of smoking.
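As a rough illustration of how the genotype-specific odds ratios relate to the counts in the table, the sketch below computes an unadjusted (crude) odds ratio with a Woolf log-based 95% confidence interval from the GWAS-stage AG and AA counts for rs2002059. The function name `crude_odds_ratio` is hypothetical. This is not the authors' method: the reported estimates are adjusted for age, gender, pack-years of smoking and pca1, so the crude value is expected to differ from the tabulated 0.73.

```python
import math

def crude_odds_ratio(case_exposed, control_exposed, case_ref, control_ref, z=1.96):
    """Unadjusted odds ratio with a Woolf (log-based) confidence interval.

    Hypothetical helper for illustration only; it ignores all covariates.
    """
    odds_ratio = (case_exposed * control_ref) / (control_exposed * case_ref)
    # Standard error of log(OR) from the four cell counts (Woolf's method).
    se_log_or = math.sqrt(
        1 / case_exposed + 1 / control_exposed + 1 / case_ref + 1 / control_ref
    )
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# GWAS-stage counts for rs2002059 taken from the table:
# AG = 306 cases / 469 controls, AA (reference) = 2020 cases / 2587 controls.
or_ag, lo_ag, hi_ag = crude_odds_ratio(306, 469, 2020, 2587)
print(f"crude OR (AG vs AA) = {or_ag:.2f} ({lo_ag:.2f}-{hi_ag:.2f})")
```

The crude estimate points in the same direction as the adjusted one (below 1), but the gap between the two reflects the covariate adjustment that a simple 2 × 2 calculation cannot capture.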
Figure 2. Regional plot of rs60507107 and rs37010.
Stratification analysis on rs60507107
| Variables | GWAS: Cases/Controls | GWAS: OR (95% CI) | GWAS: P | GWAS: P for heterogeneity | Validation: Cases/Controls | Validation: OR (95% CI) | Validation: P | Validation: P for heterogeneity |
|---|---|---|---|---|---|---|---|---|
| Age | | | | 1.000 | | | | 0.893 |
| ≤60 | 1142/1521 | 1.22 (1.08–1.38) | 2.00E-03 | | 515/689 | 1.14 (0.97–1.35) | 1.19E-01 | |
| >60 | 1189/1556 | 1.22 (1.08–1.38) | 1.00E-03 | | 600/657 | 1.12 (0.95–1.32) | 1.64E-01 | |
| Gender | | | | 0.368 | | | | 0.182 |
| Male | 1711/2086 | 1.20 (1.08–1.33) | 1.00E-03 | | 731/875 | 1.07 (0.92–1.24) | 3.66E-01 | |
| Female | 620/991 | 1.29 (1.09–1.53) | 3.00E-03 | | 384/471 | 1.27 (1.04–1.56) | 1.70E-02 | |
| Smoking status | | | | 0.143 | | | | 0.773 |
| Current | 1252/1083 | 1.21 (1.07–1.37) | 2.00E-03 | | 437/515 | 1.14 (0.94–1.38) | 1.75E-01 | |
| Former | 254/226 | 0.95 (0.73–1.24) | 7.30E-01 | | 102/114 | 0.98 (0.65–1.48) | 9.15E-01 | |
| Never | 825/1768 | 1.28 (1.12–1.46) | 3.59E-04 | | 576/717 | 1.15 (0.98–1.35) | 8.80E-02 | |
| Smoking level | | | | 0.888 | | | | 0.405 |
| ≤25 | 1245/2327 | 1.19 (1.07–1.33) | 2.00E-03 | | 747/974 | 1.17 (1.02–1.35) | 2.50E-02 | |
| >25 | 1086/750 | 1.22 (1.06–1.41) | 6.00E-03 | | 368/372 | 1.05 (0.85–1.30) | 6.34E-01 | |
| Histology | | | | 0.275 | | | | 0.819 |
| SC | 822 | 1.14 (1.00–1.30) | 4.20E-02 | | 332 | 1.15 (0.96–1.39) | 1.29E-01 | |
| AC | 1304 | 1.29 (1.17–1.43) | 6.63E-07 | | 783 | 1.12 (0.98–1.27) | 8.60E-02 | |
| other | 205 | 1.14 (0.91–1.41) | 2.47E-01 | | NA | NA | NA | |
a: Adjusted by age, gender, pack-year of smoking and pca1 where appropriate;
b: P value for Cochran's chi-square-based heterogeneity test;
c: Adjusted by age, gender and pack-year of smoking where appropriate;
NA: not available.
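To make the heterogeneity column more concrete, here is a minimal sketch of a Cochran's Q test across strata, assuming each stratum's standard error can be approximated from its rounded 95% confidence interval on the log scale. The helper names `chi2_sf` and `cochran_q` are hypothetical, and the calculation is only an approximation: it works from the rounded values printed in the table, so it is not expected to reproduce the reported heterogeneity P values exactly.

```python
import math

def chi2_sf(q, df):
    """Upper-tail probability of a chi-square variable with integer df.

    Uses Q(a+1, x) = Q(a, x) + x**a * exp(-x) / Gamma(a+1) with a = df/2, x = q/2.
    """
    x = q / 2.0
    if df % 2 == 1:
        a, sf = 0.5, math.erfc(math.sqrt(x))  # df = 1 base case
    else:
        a, sf = 1.0, math.exp(-x)             # df = 2 base case
    while a < df / 2.0:
        sf += x ** a * math.exp(-x) / math.gamma(a + 1.0)
        a += 1.0
    return sf

def cochran_q(strata, z=1.96):
    """Cochran's Q and its P value from (OR, lower, upper) tuples per stratum."""
    log_ors = [math.log(o) for o, _, _ in strata]
    # Approximate SE of log(OR) from the width of the 95% CI (an assumption).
    ses = [(math.log(hi) - math.log(lo)) / (2 * z) for _, lo, hi in strata]
    weights = [1.0 / se ** 2 for se in ses]
    pooled = sum(w * b for w, b in zip(weights, log_ors)) / sum(weights)
    q = sum(w * (b - pooled) ** 2 for w, b in zip(weights, log_ors))
    return q, chi2_sf(q, len(strata) - 1)

# GWAS-stage gender strata for rs60507107 as printed in the table.
q_gender, p_gender = cochran_q([(1.20, 1.08, 1.33), (1.29, 1.09, 1.53)])
print(f"Q = {q_gender:.3f}, P for heterogeneity = {p_gender:.3f}")
```

A large P value here, as in the table, indicates no evidence that the association differs between the strata; the difference between this approximation and the tabulated 0.368 is attributable to rounding and to details of the original analysis that the table does not expose.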