| Literature DB >> 27738493 |
Meng Chen1, Nathaniel Rothman2, Yuanqing Ye1, Jian Gu1, Paul A Scheet1, Maosheng Huang1, David W Chang1, Colin P Dinney3, Debra T Silverman2, Jonine D Figueroa2, Stephen J Chanock2, Xifeng Wu1.
Abstract
Genome-wide association studies (GWAS) are designed to identify individual regions associated with cancer risk, but only explain a small fraction of the inherited variability. Alternative approach analyzing genetic variants within biological pathways has been proposed to discover networks of susceptibility genes with additional effects. The gene set enrichment analysis (GSEA) may complement and expand traditional GWAS analysis to identify novel genes and pathways associated with bladder cancer risk. We selected three GSEA methods: Gen-Gen, Aligator, and the SNP Ratio Test to evaluate cellular signaling pathways involved in bladder cancer susceptibility in a Texas GWAS population. The candidate genetic polymorphisms from the significant pathway selected by GSEA were validated in an independent NCI GWAS. We identified 18 novel pathways (P < 0.05) significantly associated with bladder cancer risk. Five of the most promising pathways (P ≤ 0.001 in any of the three GSEA methods) among the 18 pathways included two cell cycle pathways and neural cell adhesion molecule (NCAM), platelet-derived growth factor (PDGF), and unfolded protein response pathways. We validated the candidate polymorphisms in the NCI GWAS and found variants of RAPGEF1, SKP1, HERPUD1, CACNB2, CACNA1C, CACNA1S, COL4A2, SRC, and CACNA1C were associated with bladder cancer risk. Two CCNE1 variants, rs8102137 and rs997669, from cell cycle pathways showed the strongest associations; the CCNE1 signal at 19q12 has already been reported in previous GWAS. These findings offer additional etiologic insights highlighting the specific genes and pathways associated with bladder cancer development. GSEA may be a complementary tool to GWAS to identify additional loci of cancer susceptibility.Entities:
Keywords: GWAS; bladder cancer; gene set enrichment analysis; pathway analysis; susceptibility loci
Year: 2016 PMID: 27738493 PMCID: PMC5059113 DOI: 10.18632/genesandcancer.113
Source DB: PubMed Journal: Genes Cancer ISSN: 1947-6019
Figure 1Q-Q plot of observed versus expected chi2 test statistics in Texas population
Significant pathways from the three gene set enrichment analyses
| Gen-Gen | Aligator | SNP Ratio Test | ||||
|---|---|---|---|---|---|---|
| Pathway | P | Rank | P | Rank | P | Rank |
| <0.001 | 1 | 0.0264 | 25 | 0.005994 | 5 | |
| <0.001 | 1 | 0.0198 | 21 | 0.013986 | 12 | |
| 0.001 | 2 | 0.0012 | 2 | 0.001998 | 2 | |
| 0.001 | 2 | 0.0016 | 3 | 0.011988 | 10 | |
| BIOCARTA_NDKDYNAMIN_PATHWAY | 0.003 | 4 | 0.007 | 8 | 0.08991 | 56 |
| REACTOME_NCAM1_INTERACTIONS | 0.005 | 6 | 0.009 | 11 | 0.025974 | 21 |
| BIOCARTA_P27_PATHWAY | 0.005 | 6 | 0.0044 | 6 | 0.021978 | 18 |
| 0.007 | 8 | 0.0008 | 1 | 0.000999 | 1 | |
| REACTOME_INACTIVATION_OF_APC_VIA_DIRECT_INHIBITION_OF_THE_APCOMPLEX | 0.008 | 9 | 0.0368 | 30 | 0.002997 | 3 |
| BIOCARTA_BAD_PATHWAY | 0.009 | 10 | 0.0496 | 38 | 0.008991 | 7 |
| REACTOME_CONVERSION_FROM_APC_CDC20_TO_APC_CDH1_IN_LATE_ANAPHASE | 0.013 | 13 | 0.0484 | 37 | 0.016983 | 14 |
| BIOCARTA_NFAT_PATHWAY | 0.017 | 16 | 0.0036 | 4 | 0.013986 | 12 |
| REACTOME_CTLA4_INHIBITORY_SIGNALING | 0.018 | 17 | 0.0392 | 32 | 0.027972 | 22 |
| REACTOME_PHOSPHORYLATION_OF_THE_APC | 0.018 | 17 | 0.0376 | 31 | 0.010989 | 9 |
| REACTOME_APCDC20_MEDIATED_DEGRADATION_OF_CYCLIN_B | 0.019 | 18 | 0.0392 | 32 | 0.00999 | 8 |
| KEGG_PROSTATE_CANCER | 0.02 | 19 | 0.004 | 5 | 0.036963 | 29 |
| REACTOME_SYNTHESIS_OF_BILE_ACIDS_AND_BILE_SALTS_VIA_24_HYDROXYCHOLESTEROL | 0.025 | 22 | 0.007 | 8 | 0.048951 | 36 |
| BIOCARTA_DC_PATHWAY | 0.044 | 34 | 0.0138 | 17 | 0.005994 | 5 |
The pathways discussed in detail were in bold
Genes contained in significant pathways
| REACTOME_SIGNALING_BY_PDGF | HRAS | PDGFB | PDGFA | BCAR1 | STAT5A | STAT5B | PIK3CA | PDGFC | PDGFD | RAPGEF1 | ||||||||||||||||||||
| PIK3CB | MAPK1 | NCK2 | CRKL | NCK1 | MAPK3 | COL1A2 | PDGFRA | PDGFRB | COL1A1 | |||||||||||||||||||||
| 64 genes | GRB2 | COL3A1 | COL2A1 | SRC | STAT6 | COL9A1 | COL9A2 | COL9A3 | KRAS | COL6A6 | ||||||||||||||||||||
| SOS1 | COL6A3 | COL6A2 | COL6A1 | THBS1 | THBS2 | PIK3R1 | THBS3 | RASA1 | PIK3R2 | |||||||||||||||||||||
| THBS4 | SPP1 | COL4A4 | PLAT | COL4A3 | COL4A2 | COL4A1 | MAP2K1 | MAP2K2 | YWHAB | |||||||||||||||||||||
| RAF1 | STAT1 | COL5A2 | FURIN | PLG | STAT3 | COL5A1 | COL4A5 | PTPN11 | NRAS | |||||||||||||||||||||
| PLCG1 | COL29A1 | CRK | GRB7 | |||||||||||||||||||||||||||
| REACTOME_NCAM_SIGNALING_FOR_ | NRTN | HRAS | GDNF | ARTN | NCAM1 | MAPK1 | MAPK3 | CNTN2 | COL1A2 | COL1A1 | ||||||||||||||||||||
| PRNP | NCAN | SPTB | FGFR1 | GRB2 | COL3A1 | CACNB1 | CACNB2 | COL2A1 | CACNB3 | |||||||||||||||||||||
| 69 genes | CACNB4 | ST8SIA2 | SRC | COL9A1 | COL9A2 | PTK2 | COL9A3 | KRAS | COL6A6 | SOS1 | ||||||||||||||||||||
| COL6A3 | COL6A2 | COL6A1 | AGRN | COL4A4 | COL4A3 | COL4A2 | COL4A1 | MAP2K1 | SPTBN5 | |||||||||||||||||||||
| MAP2K2 | CREB1 | CACNA1I | SPTBN4 | PTPRA | YWHAB | RAF1 | COL5A2 | CACNA1S | COL5A1 | |||||||||||||||||||||
| COL4A5 | RPS6KA5 | PSPN | NRAS | FYN | ST8SIA4 | CACNA1G | SPTBN2 | CACNA1H | SPTBN1 | |||||||||||||||||||||
| GFRA1 | COL29A1 | SPTA1 | CACNA1F | GFRA4 | CACNA1C | CACNA1D | GFRA2 | SPTAN1 | ||||||||||||||||||||||
| BIOCARTA_RACCYCD_PATHWAY | E2F1 | HRAS | NFKBIA | NFKB1 | AKT1 | CCNE1 | RAC1 | RHOA | PIK3CA | PAK1 | ||||||||||||||||||||
| CHUK | PIK3R1 | TFDP1 | RELA | RAF1 | CDK6 | RB1 | CDK4 | CDK2 | MAPK1 | |||||||||||||||||||||
| 26 genes | CCND1 | CDKN1A | CDKN1B | IKBKG | MAPK3 | IKBKB | ||||||||||||||||||||||||
| CDC34 | CCNA1 | E2F1 | CUL1 | TFDP1 | CDK2 | RB1 | CCNE1 | SKP2 | SKP1 | |||||||||||||||||||||
| REACTOME_UNFOLDED_PROTEIN_RESPONSE | HERPUD1 | MBTPS2 | PDIA6 | NFYA | EDEM1 | DDIT3 | ATF6 | ATF4 | ATF3 | DNAJB9 | ||||||||||||||||||||
| 19 genes | DNAJB11 | XBP1 | EIF2S1 | ERN1 | HSPA5 | DNAJC3 | MBTPS1 | EIF2AK3 | SERP1 | |||||||||||||||||||||
The overlapping genes in “REACTOME_SIGNALING_BY_PDGF” and “REACTOME_NCAM_SIGNALING_FOR_NEURITE_OUT_GROWTH”, and “BIOCARTA_RACCYCD_PATHWAY” and “BIOCARTA_SKP2E2F_PATHWAY” were indicated by asterisk.
Validated SNPs from significant pathways in independent populations
| Texas | NCI | Meta-Analysis | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Case | Control | Case | Control | Case | Control | ||||||||||||
| SNP | Gene | MAF | MAF | MAF | MAF | MAF | MAF | ||||||||||
| REACTOME_NCAM_SIGNALING_FOR_NEURITE_OUT_GROWTH | |||||||||||||||||
| rs12416052 | 0.43 | 0.4 | 0.024 | 0.43 | 0.41 | 0.0488 | 0.43 | 0.41 | 0.00638 | ||||||||
| rs17611556 | 0.06 | 0.09 | 0.0148 | 0.06 | 0.07 | 0.0185 | 0.06 | 0.08 | 0.00194 | ||||||||
| rs1990240 | 0.28 | 0.25 | 0.0163 | 0.28 | 0.26 | 0.0206 | 0.28 | 0.26 | 0.00229 | ||||||||
| rs2239062 | 0.44 | 0.41 | 0.0194 | 0.46 | 0.44 | 0.0188 | 0.46 | 0.44 | 0.00353 | ||||||||
| rs2239117 | 0.28 | 0.25 | 0.0097 | 0.28 | 0.27 | 0.0452 | 0.28 | 0.26 | 0.00491 | ||||||||
| rs2239118 | 0.25 | 0.21 | 0.0024 | 0.25 | 0.23 | 0.0447 | 0.25 | 0.23 | 0.0024 | ||||||||
| rs3767499 | 0.5 | 0.47 | 0.0325 | 0.49 | 0.47 | 0.0198 | 0.49 | 0.47 | 0.00224 | ||||||||
| rs418543 | 0.37 | 0.33 | 0.0098 | 0.36 | 0.34 | 0.0049 | 0.36 | 0.34 | 0.00026 | ||||||||
| rs6011959 | 0.31 | 0.28 | 0.032 | 0.28 | 0.26 | 0.0051 | 0.29 | 0.26 | 0.00019 | ||||||||
| rs7132154 | 0.24 | 0.27 | 0.0313 | 0.23 | 0.25 | 0.0008 | 0.23 | 0.26 | 0.00011 | ||||||||
| rs7963955 | 0.28 | 0.25 | 0.0149 | 0.28 | 0.26 | 0.0219 | 0.28 | 0.26 | 0.00231 | ||||||||
| REACTOME_SIGNALING_BY_PDGF | |||||||||||||||||
| rs418543 | 0.37 | 0.33 | 0.0098 | 0.36 | 0.34 | 0.0049 | 0.36 | 0.34 | 0.00026 | ||||||||
| rs6011959 | 0.31 | 0.28 | 0.032 | 0.28 | 0.26 | 0.0051 | 0.29 | 0.26 | 0.00019 | ||||||||
| rs7040470 | 0.42 | 0.46 | 0.0089 | 0.43 | 0.46 | 0.0004 | 0.43 | 0.46 | 1.20E-05 | ||||||||
| BIOCARTA_RACCYCD_PATHWAY | |||||||||||||||||
| rs4804903 | 0.3 | 0.34 | 0.0019 | 0.33 | 0.34 | 0.0458 | 0.32 | 0.34 | 0.00106 | ||||||||
| rs8102137 | 0.38 | 0.32 | 0.0006 | 0.35 | 0.33 | 0.0003 | 0.36 | 0.33 | 1.43E-06 | ||||||||
| rs997669 | 0.44 | 0.39 | 0.0019 | 0.42 | 0.39 | 0.0031 | 0.42 | 0.39 | 3.61E-05 | ||||||||
| BIOCARTA_SKP2E2F_PATHWAY | |||||||||||||||||
| rs10491321 | 0.16 | 0.2 | 0.0035 | 0.19 | 0.2 | 0.0206 | 0.18 | 0.2 | 0.00045 | ||||||||
| rs4804903 | 0.3 | 0.34 | 0.0019 | 0.33 | 0.34 | 0.0458 | 0.32 | 0.34 | 0.00106 | ||||||||
| rs8102137 | 0.38 | 0.32 | 0.0006 | 0.35 | 0.33 | 0.0003 | 0.36 | 0.33 | 1.43E-06 | ||||||||
| rs997669 | 0.44 | 0.39 | 0.0019 | 0.42 | 0.39 | 0.0031 | 0.42 | 0.39 | 3.61E-05 | ||||||||
| REACTOME_UNFOLDED_PROTEIN_RESPONSE | |||||||||||||||||
| rs2518054 | 0.13 | 0.11 | 0.0312 | 0.11 | 0.1 | 0.0193 | 0.12 | 0.1 | 0.00168 | ||||||||
“REACTOME_SIGNALING_BY_PDGF” and “REACTOME_NCAM_SIGNALING_FOR_NEURITE_OUT_GROWTH” had overlapping SNPs. The shared SNPs and Genes were indicated by asterisk.
“BIOCARTA_RACCYCD_PATHWAY” and “BIOCARTA_SKP2E2F_PATHWAY” had overlapping SNPs. The shared SNPs and Genes were indicated by asterisk.
MAF- minor allele frequency
Figure 2The imputation of gene regions of interest using 1000 genomes data (black dot) along with Texas Bladder Cancer GWAS data (gray dot)
The SNPs indicated by triangles were those validated in NCI population. Top three gene regions in which top signals from imputation are in strong linkage disequilibrium (R2 > 0.7) with the SNPs validated in both Texas and NCI populations are displayed. A. CCNE1; B. SKP1; C. CACNA1C.