| Literature DB >> 29859855 |
Zhaoxi Wang1, Yongyue Wei2, Ruyang Zhang3, Li Su1, Stephanie M Gogarten4, Geoffrey Liu5, Paul Brennan6, John K Field7, James D McKay6, Jolanta Lissowska8, Beata Swiatkowska9, Vladimir Janout10, Ciprian Bolca11, Milica Kontic12, Ghislaine Scelo6, David Zaridze13, Cathy C Laurie4, Kimberly F Doheny14, Elizabeth K Pugh14, Beth A Marosy14, Kurt N Hetrick14, Xiangjun Xiao15, Claudio Pikielny15, Rayjean J Hung16, Christopher I Amos15, Xihong Lin17, David C Christiani18.
Abstract
Recent technological advancements have permitted high-throughput measurement of the human genome, epigenome, metabolome, transcriptome, and proteome at the population level. We hypothesized that subsets of genes identified from omic studies might have closely related biological functions and thus might interact directly at the network level. Therefore, we conducted an integrative analysis of multi-omic datasets of non-small cell lung cancer (NSCLC) to search for association patterns beyond the genome and transcriptome. A large, complex, and robust gene network containing well-known lung cancer-related genes, including EGFR and TERT, was identified from combined gene lists for lung adenocarcinoma. Members of the hypoxia-inducible factor (HIF) gene family were at the center of this network. Subsequent sequencing of network hub genes within a subset of samples from the Transdisciplinary Research in Cancer of the Lung-International Lung Cancer Consortium (TRICL-ILCCO) consortium revealed a SNP (rs12614710) in EPAS1 associated with NSCLC that reached genome-wide significance (OR = 1.50; 95% CI: 1.31-1.72; p = 7.75 × 10-9). Using imputed data, we found that this SNP remained significant in the entire TRICL-ILCCO consortium (p = .03). Additional functional studies are warranted to better understand interrelationships among genetic polymorphisms, DNA methylation status, and EPAS1 expression.Entities:
Keywords: Hypoxia-inducible factor; Integrated analysis; Lung adenocarcinoma; Network analysis; Non-small cell lung cancer
Mesh:
Substances:
Year: 2018 PMID: 29859855 PMCID: PMC6021270 DOI: 10.1016/j.ebiom.2018.05.024
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 8.143
Demographic distribution of study populations.
| 65.5 ± 10.6 | 59.4 ± 11.6 | <0.001 | 67.1 ± 9.6 | |
| ≥65 | 540 (54.9%) | 351 (36.2%) | <0.001 | 20 (71.4%) |
| 477 (48.5%) | 528 (54.4%) | 0.010 | 12 (42.9%) | |
| 49.7 ± 35.7 | 25.0 ± 26.7 | <0.001 | 53.3 ± 34.8 | |
| ≥30 | 664 (55.1%) | 326 (33.6%) | <0.001 | 22 (78.6%) |
| <0.001 | ||||
| Never | 92 (9.4%) | 161 (16.6%) | 2 (7.1%) | |
| Former | 502 (51.0%) | 555 (57.2%) | 18 (64.3%) | |
| Current | 390 (39.6%) | 254 (26.2%) | 8 (28.6%) | |
| Adenocarcinoma | 597 (60.7%) | 18 (64.3%) | ||
| Squamous cell | 216 (22.0%) | 7 (25.0%) | ||
| Small cell | 0 (0.0%) | 0 (0.0%) | ||
| Other | 171 (17.3%) | 3 (10.7%) | ||
Including adenocarcinoma in situ.
Common genes between GWAS and transcriptomic profiling (p < .05).
| Gene | Chr | Gene ID | Description | Transcriptome | GWAS | ||
|---|---|---|---|---|---|---|---|
| 9 | 216 | Aldehyde dehydrogenase 1 family, member A1 | −2.1 | 0.01294 | 29 | 0.02264 | |
| 9 | 23,452 | Angiopoietin-like 2 | −2.1 | 0.01364 | 5 | 0.00878 | |
| 15 | 585 | Bardet-Biedl syndrome 4 | −2.1 | 0.01240 | 5 | 0.04792 | |
| 1 | 79,630 | Chromosome 1 open reading frame 54 | −2.2 | 0.01047 | 3 | 0.03904 | |
| 5 | 821 | Calnexin | −2.5 | 0.00366 | 1 | 0.04936 | |
| 17 | 9720 | Coiled-coil domain containing 144A | −2.0 | 0.02563 | 1 | 0.04716 | |
| 3 | 51,208 | Claudin 18 | −2.0 | 0.00403 | 7 | 0.03534 | |
| 21 | 9946 | Crystallin, zeta (quinone reductase)-like 1 | −2.3 | 0.03630 | 3 | 0.03622 | |
| 1 | 1520 | Cathepsin S | −2.2 | 0.01058 | 2 | 0.00318 | |
| 1 | 84,301 | DDI1, DNA-damage inducible 1, homolog 2 ( | −2.4 | 0.00707 | 2 | 0.00074 | |
| 13 | 160,851 | Diacylglycerol kinase, eta | −2.5 | 0.01353 | 71 | 0.04530 | |
| 18 | 1829 | Desmoglein 2 | −2.3 | 0.00147 | 12 | 0.01370 | |
| 7 | 1956 | Epidermal growth factor receptor [erythroblastic leukemia viral (v-erb-b) oncogene homolog, avian) | −2.4 | 0.01896 | 1 | 0.01781 | |
| 2 | 2034 | Endothelial PAS domain protein 1 | −2.6 | 0.04683 | 35 | 0.00262 | |
| 14 | 2353 | V-fos FBJ murine osteosarcoma viral oncogene homolog | −2.9 | 0.02894 | 2 | 0.03475 | |
| 19 | 2828 | G protein-coupled receptor 4 | −2.2 | 0.02893 | 5 | 0.01226 | |
| 2 | 2995 | Glycophorin C (Gerbich blood group) | −2.9 | 0.00198 | 23 | 0.03402 | |
| 14 | 3091 | Hypoxia-inducible factor 1, alpha subunit (basic helix-loop-helix transcription factor) | −2.0 | 0.01108 | 8 | 0.00307 | |
| 6 | 3024 | Histone cluster 1, H1a | 2.1 | 0.00948 | 1 | 0.01148 | |
| 16 | 83,693 | Hydroxysteroid dehydrogenase like 1 | 2.1 | 0.02955 | 1 | 0.01549 | |
| 12 | 28,981 | Intraflagellar transport 81 homolog ( | −2.2 | 0.04329 | 6 | 0.02961 | |
| 3 | 57,577 | KIAA1407 | −3.1 | 0.01488 | 6 | 0.00387 | |
| 1 | 3953 | Leptin receptor | −2.0 | 0.02291 | 1 | 0.03057 | |
| 11 | 10,894 | Lymphatic vessel endothelial hyaluronan receptor 1 | −2.5 | 0.02175 | 2 | 0.02959 | |
| 17 | 51,003 | Mediator complex subunit 31 | −2.6 | 0.01772 | 1 | 0.02361 | |
| 11 | 64,231 | Membrane-spanning 4-domains, subfamily A, member 6A | −2.0 | 0.04791 | 4 | 0.00202 | |
| 3 | 4638 | Myosin, light chain kinase | −3.5 | 0.00032 | 1 | 0.00935 | |
| 7 | 10,135 | nicotinamide phosphoribosyltransferase | −2.1 | 0.01653 | 5 | 0.00979 | |
| 22 | 10,762 | Nucleoporin 50 kDa | −2.6 | 0.00218 | 4 | 0.03813 | |
| 11 | 80,227 | Proteasomal ATPase-associated factor 1 | −2.0 | 0.00515 | 8 | 0.00595 | |
| 6 | 135,138 | PARK2 co-regulated | −2.0 | 0.00203 | 1 | 0.04082 | |
| 1 | 142 | Poly (ADP-ribose) polymerase family, member 1 | 2.2 | 0.01837 | 8 | 0.02273 | |
| 6 | 5134 | Programmed cell death 2 | −2.3 | 0.03616 | 3 | 0.04564 | |
| 11 | 58,473 | Pleckstrin homology domain containing, family B (evectins) member 1 | −2.5 | 0.00524 | 5 | 0.04509 | |
| 7 | 51,422 | Protein kinase, AMP-activated, gamma 2 non-catalytic subunit | −2.3 | 0.02595 | 1 | 0.03415 | |
| 10 | 5588 | Protein kinase C, theta | −2.3 | 0.00363 | 83 | 0.02882 | |
| 2 | 5937 | RNA binding motif, single stranded interacting protein 1 | −2.3 | 0.00746 | 26 | 0.01284 | |
| 8 | 50,484 | Ribonucleotide reductase M2 B (TP53 inducible) | −2.0 | 0.01162 | 10 | 0.02509 | |
| 19 | 53,637 | Sphingosine-1-phosphate receptor 5 | −2.0 | 0.00713 | 2 | 0.01977 | |
| 5 | 57,556 | Sema domain, transmembrane domain, and cytoplasmic domain, (semaphorin) 6A | −4.4 | 2.03E-06 | 60 | 0.00432 | |
| 16 | 23,049 | PI-3-kinase-related kinase SMG-1 | −2.1 | 0.01381 | 11 | 0.00747 | |
| 4 | 10,460 | Transforming, acidic coiled-coil containing protein 3 | 2.2 | 0.03279 | 5 | 0.02719 | |
| 11 | 10,312 | T-cell, immune regulator 1, ATPase, H+ transporting, lysosomal V0 subunit A3 | −2.7 | 0.02007 | 1 | 0.02523 | |
| 3 | 131,540 | Zinc finger, DHHC-type containing 19 | −2.6 | 0.01204 | 8 | 0.04905 | |
| 19 | 10,782 | Zinc finger protein 274 | −3.1 | 0.00116 | 7 | 0.04410 | |
| 1 | 9406 | Zinc finger, RAN-binding domain containing 2 | −2.3 | 0.00502 | 4 | 0.00498 | |
Fold change indicates difference between tumor tissue and non-affected adjacent tissue. P-values are from paired t-tests.
Fig. 1Networks built from GWAS and transcriptomic profiling data. a, Significant network built from combining top-ranked GWAS genes (103 genes with p < .005), top-ranked transcriptomic profiling genes (75 genes with p < .001), and a core list of genes (37 non-overlapping genes out of 46 core genes). Blue indicates genes only identified from transcriptomic profiling; green indicates genes only identified from GWAS; and red indicates genes from the core list. b, Network of 13 hub genes (≥5 connections within the network).
Fig. 2Distribution of networks built by randomly selected genes. From a total of 24,847 genes, we randomly generated 6 sets of gene lists, with each set containing 100 gene lists. Each set contained 50, 100, 200, 300, 400, or 500 genes, denoted by R50, R100, R200, R300, R400, and R500, respectively. Networks were built from each gene list using MetaCore GeneGo database and were used to calculate a 95% CI curve. GWAS denotes network from GWAS data; FFPE denotes network from transcriptomic profiling data; and GWAS+FFPE+core denotes networks built from combining GWAS data, transcriptomic profiling data, and a core gene list.
Network evaluations.
| 215 | 198 | 217 | 191 | |
| Genomic list | GWAS | GWAS-AC | GWAS | GWAS |
| Transcriptomic list | FFPE | FFPE | GSE10072 | Common of FFPE & GSE10072 |
| Core list | 37 | 33 | 29 | − |
| Genes | 46 | 43 | 46 | 40 |
| Connections | 70 | 62 | 69 | 58 |
| Complexity | 1.556 | 1.476 | 1.533 | 1.487 |
| Gene ratio | 0.214 | 0.217 | 0.212 | 0.209 |
| | + | − | + | − |
| | − | + | − | − |
| | − | − | + | − |
| | + | + | − | + |
| | + | + | − | − |
| | − | − | + | − |
| | + | + | + | − |
| | − | − | + | − |
| | + | − | + | + |
| | + | + | + | + |
| | + | + | + | + |
| | − | − | − | + |
| + | + | − | − | |
| + | + | − | − |
Fig. 3Manhattan plot of EPAS1 in targeted sequencing project.