| Literature DB >> 28304396 |
Yongchu Pan1,2,3, Hongliang Liu1,3, Yanru Wang1,3, Xiaozheng Kang1,3, Zhensheng Liu1,3, Kouros Owzar1,4, Younghun Han5, Li Su6,7, Yongyue Wei6,7, Rayjean J Hung8, Yonathan Brhane8, John McLaughlin9, Paul Brennan10, Heike Bickeböller11, Albert Rosenberger11, Richard S Houlston12, Neil Caporaso13, Maria Teresa Landi13, Joachim Heinrich14, Angela Risch15, Xifeng Wu16, Yuanqing Ye16, David C Christiani6,7, Christopher I Amos5, Qingyi Wei1,3.
Abstract
mRNA splicing is an important mechanism to regulate mRNA expression. Abnormal regulation of this process may lead to lung cancer. Here, we investigated the associations of 11,966 single-nucleotide polymorphisms (SNPs) in 206 mRNA splicing-related genes with lung cancer risk by using the summary data from six published genome-wide association studies (GWASs) of Transdisciplinary Research in Cancer of the Lung (TRICL) (12,160 cases and 16,838 controls) and another two lung cancer GWASs of Harvard University (984 cases and 970 controls) and deCODE (1,319 cases and 26,380 controls). We found that a total of 12 significant SNPs with false discovery rate (FDR) ≤0.05 were mapped to one novel gene PRPF6 and two previously reported genes (DHX16 and LSM2) that were also confirmed in this study. The six novel SNPs in PRPF6 were in high linkage disequilibrium and associated with PRPF6 mRNA expression in lymphoblastoid cells from 373 Europeans in the 1000 Genomes Project. Taken together, our studies shed new light on the role of mRNA splicing genes in the development of lung cancer.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28304396 PMCID: PMC5356340 DOI: 10.1038/srep44634
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Association results of 11,966 SNPs in 206 mRNA splicing-related genes and lung cancer risk in the TRICL Consortium.
SNPs are plotted on the X-axis according to their positions on each chromosome. The association P values with lung cancer risk are shown on the Y-axis (as −log10 P values). The 12 SNPs from three genes (DHX16, LSM2 and PRPF6) were identified after the FDR correction.
Characteristic information of the study populations.
| Study | Controls | Lung cancer patients | ||
|---|---|---|---|---|
| All | AD | SQ | ||
| ICR | 5200 | 1952 | 465 | 611 |
| MDACC | 1134 | 1150 | 619 | 306 |
| IARC | 3791 | 2533 | 517 | 911 |
| NCI | 5736 | 5713 | 1841 | 1447 |
| Toronto | 499 | 331 | 90 | 50 |
| GLC | 478 | 481 | 186 | 97 |
| 16838 | 12160 | 3718 | 3422 | |
| Harvard | 970 | 984 | 597 | 216 |
| deCODE | 26380 | 1319 | 547 | 259 |
| 44188 | 14463 | 4862 | 3897 | |
1ICR: the Institute of Cancer Research Genome-wide Association Study, UK;
2MDACC: the MD Anderson Cancer Center Genome-wide Association Study, US;
3IARC: the International Agency for Research on Cancer Genome-wide Association Study, France;
4NCI: the National Cancer Institute Genome-wide Association Study, US;
5Toronto: the Lunenfeld-Tanenbaum Research Institute Genome-wide Association Study, Toronto, Canada;
6GLC: German Lung Cancer Study, Germany;
7TRICL: GWASs datasets combined by six GWASs of ICR, MDACC, IARC, NCI, Toronto and GLC;
8Harvard: Harvard Lung Cancer Study, US;
9deCODE: Icelandic Lung Cancer Study, Iceland;
10AD: adenocarcinoma;
11SQ: squamous cell carcinoma.
mRNA splicing-related genes SNPs and lung cancer risk in the TRICL1 Consortium with FDR corrected P ≤ 0.05.
| ID | SNP | Chr: Position | GENE | Allele | EAF | Q | Effects | OR (95% CI) | FDR | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | rs115420460 | 6: 30618906 | A/G | 0.11 | 0.67 | 0.00 | ++++++ | 1.18 (1.12–1.25) | 3.93E-09 | <0.0001 | |
| 2 | rs114312980 | 6: 31768799 | A/C | 0.11 | 0.23 | 26.77 | ++++++ | 1.20 (1.12–1.29) | 2.19E-07 | 0.0007 | |
| 3 | rs115489726 | 6: 31766660 | C/T | 0.11 | 0.24 | 25.69 | ++++++ | 1.20 (1.12–1.29) | 2.54E-07 | 0.0007 | |
| 4 | rs115801685 | 6: 31772093 | C/A | 0.11 | 0.23 | 27.36 | ++++++ | 1.20 (1.12–1.29) | 2.61E-07 | 0.0007 | |
| 5 | rs114637560 | 6: 31765864 | T/A | 0.15 | 0.27 | 22.42 | +−++++ | 1.14 (1.07–1.21) | 3.04E-07 | 0.0007 | |
| 6 | rs115834633 | 6: 31765984 | G/A | 0.11 | 0.20 | 30.79 | ++++++ | 1.20 (1.12–1.30) | 5.75E-07 | 0.0011 | |
| 7 | rs116165844 | 20: 62610556 | G/T | 0.13 | 0.89 | 0.00 | −−−−−− | 0.89 (0.85–0.94) | 1.62E-05 | 0.0197 | |
| 8 | rs8126213 | 20: 62611478 | G/A | 0.13 | 0.89 | 0.00 | −−−−−− | 0.89 (0.85–0.94) | 1.64E-05 | 0.0197 | |
| 9 | rs147176547 | 20: 62613001 | C/G | 0.14 | 0.98 | 0.00 | −−−−−− | 0.90 (0.85–0.94) | 1.65E-05 | 0.0197 | |
| 10 | rs112219537 | 20: 62620029 | G/A | 0.13 | 0.96 | 0.00 | −−−−−− | 0.90 (0.85–0.94) | 2.37E-05 | 0.0237 | |
| 11 | rs113450630 | 20: 62623703 | C/T | 0.14 | 0.98 | 0.00 | −−−−−− | 0.90 (0.85–0.94) | 1.93E-05 | 0.021 | |
| 12 | rs75100087 | 20: 62636139 | C/T | 0.14 | 0.97 | 0.00 | −−−−−− | 0.89 (0.85–0.94) | 6.83E-06 | 0.0117 |
1TRICL: GWASs datasets combined by six GWASs of ICR, MDACC, IARC, NCI, Toronto and GLC.
2Based on NCBI build 37 of the human genome.
3Reference allele/effect allele.
4EAF, effect allele frequency.
5Fixed-effects models were used when no heterogeneity was found between studies (Q > 0.10 and I2 < 25.0); otherwise, random-effects models were used.
6+ means positive association, and − means negative association.
7Meta-analysis of additive results from six lung cancer GWASs.
8FDR, false discovery rate.
Figure 2Regional plots and recombination rates of PRPF6 (rs8126213 ± 500 kb) in the TRICL Consortium.
PRPF6 rs8126213 was shown in purple and the linkage disequilibrium (LD) values (r2) with the other SNPs are indicated by the heat scale. Other five PRPF6 SNPs are shown in red color, which meant that they were in high LD with rs8126213 (r2 > 0.8).
Summary association results of PRPF6 rs8126213 (G > A) in all of the eight lung cancer GWASs.
| Study population | Sample size | Overall (N = 14463) | AD (N = 4862) | SQ (N = 3897) | ||||
|---|---|---|---|---|---|---|---|---|
| Cases | Controls | OR (95% CI)* | OR (95% CI)* | OR (95% CI)* | ||||
| 12160 | 16838 | 0.89 (0.85–0.94) | 1.65E-05 | 0.88 (0.81–0.95) | 9.0E-04 | 0.88 (0.82–0.96) | 2.20E-03 | |
| ICR | 1952 | 5200 | 0.94 (0.85–1.05) | 0.303 | 0.94 (0.77–1.15) | 0.572 | 0.92 (0.77–1.20) | 0.374 |
| MDACC | 1150 | 1134 | 0.87 (0.73–1.04) | 0.120 | 0.87 (0.71–1.07) | 0.199 | 0.84 (0.64–1.11) | 0.219 |
| IARC | 2533 | 3791 | 0.91 (0.81–1.01) | 0.075 | 0.93 (0.76–1.13) | 0.456 | 0.91 (0.78–1.06) | 0.238 |
| NCI | 5713 | 5736 | 0.87 (0.81–0.95) | 8.36E-04 | 0.86 (0.77–0.97) | 0.012 | 0.85 (0.75–0.96) | 7.92E-03 |
| Toronto | 331 | 499 | 0.86 (0.62–1.20) | 0.379 | 0.77 (0.47–1.27) | 0.314 | 0.97 (0.51–1.83) | 0.923 |
| GLC | 481 | 478 | 0.83 (0.63–1.10) | 0.195 | 0.67 (0.45–0.98) | 0.041 | 1.01 (0.64–1.58) | 0.979 |
| 2303 | 27350 | 0.98 (0.89–1.09) | 0.772 | 0.97 (0.84–1.12) | 0.683 | 1.02 (0.82–1.26) | 0.875 | |
| Harvard | 984 | 970 | 0.97 (0.79–1.20) | 0.801 | 0.94 (0.74–1.18) | 0.576 | 0.97 (0.68–1.40) | 0.886 |
| deCODE | 1319 | 26380 | 0.99 (0.87–1.12) | 0.850 | 0.99 (0.82–1.20) | 0.940 | 1.04 (0.80–1.37) | 0.761 |
| 14463 | 44188 | 0.91 (0.87–0.95) | 6.36E-05 | 0.91 (0.87–0.95) | 1.70E-03 | 0.90 (0.83–0.97) | 4.80E-03 | |
Abbreviations: TRICL: GWASs datasets combined by six GWASs of ICR, MDACC, IARC, NCI, Toronto and GLC; ICR: the Institute of Cancer Research Genome-wide Association Study, UK; MDACC: the MD Anderson Cancer Center Genome-wide Association Study, US; IARC: the International Agency for Research on Cancer Genome-wide Association Study, France; NCI: the National Cancer Institute Genome-wide Association Study, US; Toronto: the Lunenfeld-Tanenbaum Research Institute Genome-wide Association Study, Toronto, Canada; GLC: German Lung Cancer Study, Germany; Harvard: Harvard Lung Cancer Study, US; deCODE: Icelandic Lung Cancer Study, Iceland; AD: adenocarcinoma; SQ: squamous cell carcinoma; *Meta-analysis of results from additive model.