| Literature DB >> 30104567 |
Xuemei Ji1, Yohan Bossé2,3, Maria Teresa Landi4, Jiang Gui1, Xiangjun Xiao1, David Qian1, Philippe Joubert3, Maxime Lamontagne3, Yafang Li1, Ivan Gorlov1, Mariella de Biasi5,6, Younghun Han1, Olga Gorlova1, Rayjean J Hung7, Xifeng Wu8, James McKay9, Xuchen Zong7, Robert Carreras-Torres9, David C Christiani10,11, Neil Caporaso4, Mattias Johansson9, Geoffrey Liu7, Stig E Bojesen12,13,14, Loic Le Marchand15, Demetrios Albanes4, Heike Bickeböller16, Melinda C Aldrich17, William S Bush17,18, Adonina Tardon19,20, Gad Rennert21,22, Chu Chen23, M Dawn Teare24, John K Field25, Lambertus A Kiemeney26, Philip Lazarus27, Aage Haugen28, Stephen Lam29, Matthew B Schabath30, Angeline S Andrew31, Hongbing Shen32, Yun-Chul Hong33, Jian-Min Yuan34, Pier A Bertazzi35,36, Angela C Pesatori35,36, Yuanqing Ye8, Nancy Diao10, Li Su10, Ruyang Zhang10,32, Yonathan Brhane7, Natasha Leighl37, Jakob S Johansen38, Anders Mellemgaard38, Walid Saliba21,22, Christopher Haiman39, Lynne Wilkens15, Ana Fernandez-Somoano19,20, Guillermo Fernandez-Tardon19,20, Erik H F M van der Heijden26, Jin Hee Kim40, Juncheng Dai32, Zhibin Hu32, Michael P A Davies25, Michael W Marcus25, Hans Brunnström41, Jonas Manjer42, Olle Melander42, David C Muller43, Kim Overvad42, Antonia Trichopoulou44, Rosario Tumino45, Jennifer Doherty31,46, Gary E Goodman46,47, Angela Cox48, Fiona Taylor48, Penella Woll48, Irene Brüske49, Judith Manz49, Thomas Muley50,51, Angela Risch52, Albert Rosenberger16, Kjell Grankvist53, Mikael Johansson54, Frances Shepherd55, Ming-Sound Tsao55, Susanne M Arnold56, Eric B Haura57, Ciprian Bolca58, Ivana Holcatova59, Vladimir Janout59, Milica Kontic60, Jolanta Lissowska61, Anush Mukeria62, Simona Ognjanovic63, Tadeusz M Orlowski64, Ghislaine Scelo9, Beata Swiatkowska65, David Zaridze62, Per Bakke66, Vidar Skaug28, Shanbeh Zienolddiny28, Eric J Duell67, Lesley M Butler34, Woon-Puay Koh68,69, Yu-Tang Gao70, Richard Houlston71, John McLaughlin72, Victoria Stevens73, David C Nickle74, Ma'en Obeidat75, Wim Timens76, Bin Zhu4, Lei Song4, María Soler Artigas77,78, Martin D Tobin77,78, Louise V Wain77,78, Fangyi Gu4, Jinyoung Byun1, Ahsan Kamal1, Dakai Zhu1, Rachel F Tyndale79,80,81, Wei-Qi Wei82, Stephen Chanock4, Paul Brennan9, Christopher I Amos83,84.
Abstract
Genome-wide association studies (GWAS) identified the chromosome 15q25.1 locus as a leading susceptibility region for lung cancer. However, the pathogenic pathways, through which susceptibility SNPs within chromosome 15q25.1 affects lung cancer risk, have not been explored. We analyzed three cohorts with GWAS data consisting 42,901 individuals and lung expression quantitative trait loci (eQTL) data on 409 individuals to identify and validate the underlying pathways and to investigate the combined effect of genes from the identified susceptibility pathways. The KEGG neuroactive ligand receptor interaction pathway, two Reactome pathways, and 22 Gene Ontology terms were identified and replicated to be significantly associated with lung cancer risk, with P values less than 0.05 and FDR less than 0.1. Functional annotation of eQTL analysis results showed that the neuroactive ligand receptor interaction pathway and gated channel activity were involved in lung cancer risk. These pathways provide important insights for the etiology of lung cancer.Entities:
Mesh:
Year: 2018 PMID: 30104567 PMCID: PMC6089967 DOI: 10.1038/s41467-018-05074-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Schematic overview of the study design. (1) In the discovery phase, a total of 310,276 SNPs were the same in both the 1st and 2nd discovery cohorts and were applied for association analyses and meta-analyses. (2) SNPs within the 15q25.1 locus, which were associated with lung cancer risk with logistic regression P values of less than 5 × 10−8 in the 1st discovery cohort and in the meta-analysis of the discovery cohorts, were selected as index SNPs. (3) The epistasis test between SNPs in the whole genome and the index SNPs within chromosome 15q25.1 locus were conducted for both discovery cohorts and a meta-analysis was performed to combine the epistasis results. (4) The SNPs, which interacted with the index SNPs with an epistasis P value of less than 0.05 in the 1st discovery cohort and in the meta-analysis of both discovery cohorts, and less than 0.10 in the 2nd discovery cohort, were selected as the candidate SNPs. (5) The index SNPs and the candidate SNPs with the logistic regression P values in the meta-analysis of discovery cohorts were applied for GWAS pathway analysis. (6) In the replication phase, the index SNPs and the candidate SNPs with the logistic regression P values in an independent cohort were applied for GWAS pathway analysis to validate the susceptibility pathway enriched in step 5. (7) The most significant genes in the whole genome regulated by SNPs in chromosome 15q25.1, which were selected with the eQTL study, were employed for pathway analysis. (8) The individual and combined effects of genes in the pathways on lung cancer risk were calculated. (9) A similar process to select index SNPs and candidate SNPs and to carry out GWAS pathway analyses in the subgroups of smokers and non-smokers were conducted
Participant characteristics of lung cancer cases and controls in GWAS cohorts
| Variants | 1st Discovery cohort ( | 2nd Discovery cohort ( | Replication Cohort ( | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Control ( | Case ( | Control ( | Case ( | Control ( | Case ( | ||||||||||
| No. | % | No. | % | No. | % | No. | % | No. | % | No. | % | ||||
| Age (years) | |||||||||||||||
| 0–64 | 502 | 25.4 | 420 | 21.8 | 0.009 | 2304 | 64.39 | 1825 | 60.9 | 0.004 | 8449 | 60.2 | 9513 | 51.6 | <0.0001 |
| ≥65 | 1475 | 74.6 | 1503 | 78.2 | 1274 | 35.61 | 1170 | 39.1 | 5577 | 39.8 | 8926 | 48.4 | |||
| Gender | |||||||||||||||
| Male | 1514 | 76.6 | 1520 | 79.0 | 0.06 | 2417 | 67.55 | 2093 | 69.9 | 0.04 | 8639 | 61.6 | 11,495 | 62.3 | 0.37 |
| Female | 463 | 23.4 | 403 | 21.0 | 1161 | 32.45 | 902 | 30.1 | 5384 | 38.4 | 6941 | 37.6 | |||
| Omitted | 3 | 0.02 | 3 | 0.02 | |||||||||||
| Smoking status | |||||||||||||||
| Never | 633 | 32.0 | 138 | 7.1 | <0.0001 | 867 | 24.23 | 137 | 4.6 | <0.0001 | 4415 | 31.5 | 1800 | 9.8 | <0.0001 |
| Ever | 1339 | 67.7 | 1774 | 92.3 | 2702 | 75.52 | 2854 | 95.3 | 9930 | 66.5 | 16,341 | 88.6 | |||
| Omitted | 5 | 0.3 | 11 | 0.6 | 9 | 0.25 | 4 | 0.1 | 281 | 2.0 | 298 | 1.6 | |||
| Histology | |||||||||||||||
| Squamous | 488 | 25.4 | 307 | 10.3 | 4490 | 24.3 | |||||||||
| Adenocarcinoma | 788 | 40.9 | 620 | 20.7 | 6819 | 37.0 | |||||||||
| Other | 613 | 31.9 | 226 | 7.5 | 5487 | 29.8 | |||||||||
| Omitted | 34 | 1.8 | 1842 | 61.5 | 1643 | 8.9 | |||||||||
Index SNPs in the chromosome 15q25.1 locus which were associated with lung cancer with P < 5.00E-8 in the 1st discovery cohort and in meta-analysis of the discovery cohorts
| SNP | Gene | Predicted function | A1 | A2 | 1st discovery cohort | 2nd discovery cohort | Meta-analysis of discovery cohorts | replication cohort | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BONFa | BONFa | BONFa | BONFa | |||||||||
| rs1051730 | CHRNA3 | coding | T | C | 2.28E-14 | 7.77E-11 | 3.03E-13 | 1.04E-09 | 1.64E-25 | 5.09E-20 | 3.11E-49 | 1.06E-45 |
| rs1996371 | CHRNB4 | intronic | G | A | 9.08E-12 | 3.10E-08 | 1.15E-05 | 3.93E-02 | 2.05E-14 | 6.36E-09 | 2.83E-24 | 9.65E-21 |
| rs6495314 | CHRNB4 | intronic | C | A | 1.47E-11 | 5.01E-08 | 7.29E-06 | 2.49E-02 | 1.47E-14 | 4.56E-09 | 8.54E-24 | 2.91E-20 |
| rs8034191 | HYKK | intronic | C | T | 3.05E-11 | 1.04E-07 | 8.98E-14 | 3.07E-10 | 2.40E-23 | 7.45E-18 | 2.12E-46 | 7.23E-43 |
| rs11638372 | CHRNB4 | intronic | T | C | 3.14E-10 | 1.07E-06 | 2.95E-05 | 1.01E-01 | 8.11E-13 | 2.52E-07 | 5.28E-24 | 1.80E-20 |
| rs2036534 | HYKK | 3downstream | C | T | 3.81E-10 | 1.30E-06 | 4.29E-06 | 1.47E-02 | 7.81E-14 | 2.42E-08 | 4.85E-32 | 1.65E-28 |
| rs4887077 | CHRNB4 | intronic | T | C | 4.16E-10 | 1.42E-06 | 2.39E-05 | 8.17E-02 | 7.72E-13 | 2.40E-07 | 2.23E-23 | 7.61E-20 |
| rs6495309 | CHRNB4 | 3downstream | T | C | 3.57E-08 | 1.22E-04 | 4.29E-06 | 1.47E-02 | 2.18E-12 | 6.76E-07 | 9.34E-29 | 3.19E-25 |
aP-value was adjusted for multiple comparisons using Bonferroni correction.
Pathways and GO terms in discovery and replication with a threshold of FDR < 0.25 in both phase
| Source | Pathway/gene set name | Meta-analysis of discovery cohorts | Replication cohort | ||
|---|---|---|---|---|---|
| FDR | FDR | ||||
| KEGG | neuroactive ligand receptor interaction | 0.001 | 0.006 | 0.013 | 0.042 |
| Reactome | neuronal system | 0.001 | 0.015 | 0.014 | 0.082 |
| transmission across chemical synapses | 0.003 | 0.023 | 0.003 | 0.028 | |
| Gene Oncology | substrate-specific channel activity | <0.001 | 0.005 | 0.002 | 0.004 |
| ion channel activity | <0.001 | 0.005 | 0.002 | 0.004 | |
| substrate-specific transporter activity | 0.001 | 0.006 | 0.010 | 0.013 | |
| cation channel activity | 0.002 | 0.006 | 0.002 | 0.008 | |
| ion transmembrane transporter activity | 0.002 | 0.006 | 0.004 | 0.009 | |
| metal ion transmembrane transporter activity | 0.001 | 0.006 | 0.002 | 0.003 | |
| transmembrane transporter activity | <0.001 | 0.006 | 0.007 | 0.012 | |
| gated channel activity | 0.002 | 0.006 | 0.001 | 0.016 | |
| substrate-specific transmembrane transporter activity | <0.001 | 0.006 | 0.006 | 0.012 | |
| cation transmembrane transporter activity | 0.001 | 0.006 | 0.003 | 0.006 | |
| transmembrane receptor activity | 0.001 | 0.007 | <0.001 | 0.006 | |
| receptor activity | 0.017 | 0.021 | 0.002 | 0.007 | |
| macromolecular complex | 0.001 | 0.008 | 0.006 | 0.037 | |
| protein complex | 0.001 | 0.012 | 0.006 | 0.080 | |
| intrinsic to membrane | 0.003 | 0.022 | 0.002 | 0.025 | |
| intrinsic to plasma membrane | 0.004 | 0.024 | 0.002 | 0.030 | |
| integral to membrane | 0.003 | 0.027 | 0.002 | 0.027 | |
| membrane part | 0.005 | 0.028 | 0.013 | 0.050 | |
| membrane | 0.006 | 0.032 | 0.024 | 0.071 | |
| plasma membrane part | 0.009 | 0.032 | 0.005 | 0.030 | |
| integral to plasma membrane | 0.003 | 0.035 | 0.002 | 0.051 | |
| plasma membrane | 0.015 | 0.044 | 0.034 | 0.085 | |
Functional annotation of eQTL study results for our susceptibility GWAS GO terms with a threshold of P value < 0.1
| GO term | |
|---|---|
| gated channel activity | 0.029 |
| ion channel activity | 0.071 |
| cation channel activity | 0.073 |
| substrate-specific channel activity | 0.08 |
| cation transmembrane transporter activity | 0.098 |
Fig. 2Gene network of the susceptibility pathways/GO terms. Each node is a gene, which was the selected gene in our GWAS pathway analysis and is shown in Supplementary Data 4. The connecting lines are drawn if the two genes have a relationship such as co-expression, shared protein domain, physical interactions, co-localization, or pathway. The thickness of the lines represents the degree of similarity between two genes. Gene network is produced using GeneMANIA. a The network of KEGG neuroactive ligand receptor interaction pathway consists of 23 genes and only TRPV1 in the associated gene list identified by our GWAS enrichment was shown no direct relationship with other associated genes. b The network of gated channel activity GO term consists all of 22 genes identified by our GWAS enrichment. c The network of ion channel activity GO term consists all of 24 genes identified by our GWAS enrichment. d The network of cation channel activity GO term consists all of 22 genes identified by our GWAS enrichment. e The network of substrate-specific channel activity GO term consists all of 24 genes identified by our GWAS enrichment. f The network of cation transmembrane transporter activity GO term consists 29 and only SLC4A4 in the associated gene list genes identified by our GWAS enrichment was shown no direct relationship with other associated genes
Fig. 3Association among susceptibility pathways and GO terms. a Each node is a subcomponent. Connecting lines are drawn if the overlapping coefficient between the two nodes is greater than 0.8. This picture was drawn by Cytoscape with EnrichmentMap plugin, using the standard gene set file. b The relationship between the susceptibility GO terms. The relationship between the GO terms was based on the knowledge from the European Bioinformatics Institute. All gene sets which were enriched in our GWAS pathway analyses were colored. Of the GO terms, 10 terms belonged to a transporter activity term. Both substrate-specific transporter activity term and transmembrane transporter activity term are part of the transporter activity term and have a child term of substrate-specific transmembrane transporter activity. Both terms of substrate-specific channel activity and ion transmembrane transporter activity are part of the substrate-specific transmembrane transporter activity term, and have a child term of ion channel activity which had a child term of cation channel activity. In addition, cation transmembrane transporter activity was part of ion transmembrane transporter activity, and had a child term of metal ion transmembrane transporter activity. Gated channel activity term was a child term of transmembrane transporter activity and shared several child terms with ion channel activity term
Individual and combined effects of SNPs from our susceptibility pathways on lung cancer risk in the meta-analysis of Discovery Cohorts
| Univariate analysis | Multivariate analysis* | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| OR | L95 | U95 |
| OR | L95 | U95 |
| ||||
| CHRNA3 | |||||||||||
| rs1051730 | 0 | 1 | 1.29E-25 | 2.68E-24 | |||||||
| 1 | 1.31 | 1.21 | 1.43 | 3.46E-10 | 1.32 | 1.21 | 1.44 | 1.23E-09 | |||
| 2 | 1.86 | 1.65 | 2.09 | 8.80E-25 | 1.89 | 1.67 | 2.14 | 7.44E-24 | |||
| CHRNB4 | |||||||||||
| rs6495309 | 0 | 1 | 2.35E-12 | 4.56E-11 | |||||||
| 1 | 1.22 | 1 | 1.5 | 0.05 | 1.21 | 0.98 | 1.49 | 0.08 | |||
| 2 | 1.58 | 1.3 | 1.93 | 5.21E-06 | 1.56 | 1.27 | 1.91 | 2.67E-05 | |||
| KCNJ4 | |||||||||||
| rs138396 | 0 | 1 | 0.06 | 0.04 | |||||||
| 1 | 1 | 0.91 | 1.09 | 0.94 | 1.01 | 0.92 | 1.1 | 0.89 | |||
| 2 | 1.13 | 1.01 | 1.26 | 0.03 | 1.15 | 1.02 | 1.29 | 0.02 | |||
| SCN2B | |||||||||||
| rs7944321 | 0 | 1 | 0.07 | 0.14 | |||||||
| 1 | 1.07 | 0.98 | 1.16 | 0.13 | 1.06 | 0.97 | 1.15 | 0.19 | |||
| 2 | 1.12 | 0.94 | 1.34 | 0.22 | 1.09 | 0.91 | 1.32 | 0.35 | |||
| neuroactive ligand receptor interaction pathway | |||||||||||
| (CHRNA3 rs1051730 and CHRNB4 rs6495309) | |||||||||||
| 0-1 | 1 | 2.80E-26 | 1 | 1.55E-24 | |||||||
| 2 | 1.32 | 1.18 | 1.47 | 8.45E-07 | 1.32 | 1.18 | 1.48 | 1.82E-06 | |||
| 3 | 1.48 | 1.32 | 1.65 | 4.94E-12 | 1.47 | 1.31 | 1.65 | 6.14E-11 | |||
| 4 | 2.04 | 1.79 | 2.33 | 2.22E-26 | 2.07 | 1.8 | 2.38 | 4.99E-25 | |||
| gated channel activity term | |||||||||||
| (CHRNA3 rs1051730, CHRNB4 rs6495309, KCNJ4 rs138396 and SCN2B rs7944321) | |||||||||||
| 0-1 | 1 | 4.39E-19 | 2.46E-18 | ||||||||
| 2-3 | 1.29 | 1.08 | 1.53 | 5.50E-03 | 1.3 | 1.08 | 1.56 | 5.50E-03 | |||
| 4-5 | 1.6 | 1.34 | 1.9 | 1.84E-07 | 1.63 | 1.36 | 1.96 | 1.65E-07 | |||
| 6-8 | 2.15 | 1.74 | 2.65 | 9.31E-13 | 2.17 | 1.74 | 2.7 | 4.01E-12 | |||
*Adjusted by age sex smoke status in the Logistic Models.
Individual and combined effects of SNPs from our susceptibility pathways on lung cancer risk in Replication Cohort
| Univariate analysis | Multivariate analysis* | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| OR | L95 | U95 |
|
| OR | L95 | U95 |
|
| ||
| CHRNA3 | |||||||||||
| rs1051730 | 0 | 1 | 1.70E-51 | 1 | 1.82E-44 | ||||||
| 1 | 1.28 | 1.22 | 1.34 | 3.43E-23 | 1.27 | 1.21 | 1.33 | 2.62E-20 | |||
| 2 | 1.65 | 1.54 | 1.77 | 2.47E-46 | 1.63 | 1.52 | 1.75 | 1.22E-40 | |||
| CHRNB4 | |||||||||||
| rs6495309 | 0 | 1 | 1.40E-29 | 1 | 1.55E-24 | ||||||
| 1 | 1.14 | 1.02 | 1.28 | 0.02 | 1.12 | 1 | 1.26 | 0.05 | |||
| 2 | 1.46 | 1.31 | 1.63 | 1.32E-11 | 1.42 | 1.27 | 1.6 | 2.13E-09 | |||
| KCNJ4 | |||||||||||
| rs138396 | 0 | 1 | 2.00E-04 | 1 | 5.00E-04 | ||||||
| 1 | 1.06 | 1.01 | 1.11 | 0.03 | 1.05 | 1 | 1.11 | 0.08 | |||
| 2 | 1.13 | 1.06 | 1.21 | 2.00E-04 | 1.13 | 1.06 | 1.21 | 4.00E-04 | |||
| SCN2B | |||||||||||
| rs7944321 | 0 | 1 | 8.90E-03 | 1 | 0.01 | ||||||
| 1 | 1.06 | 1.01 | 1.11 | 0.026 | 1.05 | 1 | 1.11 | 0.04 | |||
| 2 | 1.1 | 0.99 | 1.22 | 0.083 | 1.1 | 0.99 | 1.23 | 0.08 | |||
| neuroactive ligand receptor interaction pathway | |||||||||||
| (CHRNA3 rs1051730 and CHRNB4 rs6495309) | |||||||||||
| 0-1 | 1 | 1.11E-58 | 1 | 4.80E-50 | |||||||
| 2 | 1.21 | 1.14 | 1.28 | 1.79E-09 | 1.21 | 1.14 | 1.29 | 2.96E-09 | |||
| 3 | 1.44 | 1.36 | 1.54 | 1.44E-30 | 1.42 | 1.33 | 1.52 | 2.43E-26 | |||
| 4 | 1.77 | 1.64 | 1.91 | 2.65E-49 | 1.74 | 1.61 | 1.89 | 3.32E-43 | |||
| gated channel activity term | |||||||||||
| (CHRNA3 rs1051730, CHRNB4 rs6495309, KCNJ4 rs138396 and SCN2B rs7944321) | |||||||||||
| 0-1 | 1 | 3.36E-44 | 1 | 2.09E-37 | |||||||
| 2-3 | 1.18 | 1.07 | 1.3 | 8.00E-04 | 1.15 | 1.04 | 1.28 | 5.40E-03 | |||
| 4-5 | 1.51 | 1.37 | 1.67 | 4.89E-17 | 1.47 | 1.33 | 1.63 | 7.77E-14 | |||
| 6-8 | 1.79 | 1.59 | 2.02 | 2.84E-22 | 1.72 | 1.52 | 1.95 | 8.27E-18 | |||
*Adjusted by age sex smoke status in the Logistic Model