| Literature DB >> 25551447 |
Stefan J Barfeld1, Philip East2, Verena Zuber3, Ian G Mills4,5.
Abstract
BACKGROUND: Tumorigenesis is characterised by changes in transcriptional control. Extensive transcript expression data have been acquired over the last decade and used to classify prostate cancers. Prostate cancer is, however, a heterogeneous multifocal cancer and this poses challenges in identifying robust transcript biomarkers.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25551447 PMCID: PMC4351903 DOI: 10.1186/s12920-014-0074-9
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
KEGG pathway enrichment analysis for the genes comprising signature 1 (101.6.1)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| KEGG_PATHWAY | hsa04510:Focal adhesion | 37 | 2.86 | 2.01E-06 | CAV2, CAV1, MYL5, MYL2, TNC, PTEN, MYL9, VCL, IGF1R, LAMB3, LAMB2, ITGB8, ILK, ITGB6, PDGFC, PAK1, THBS1, THBS4, COL4A4, PRKCA, ACTB, MET, ITGA1, ACTN1, IGF1, HGF, COL4A6, FLNA, LAMA4, ITGA6, CCND2, ITGA5, JUN, ITGA8, COL1A2, MYLK, PTENP1, PARVA | 404 | 201 | 5085 | 2.32 | 3.42E-04 | 3.42E-04 | 0 |
| KEGG_PATHWAY | hsa05414:Dilated cardiomyopathy | 21 | 1.62 | 2.37E-05 | ACTB, SLC8A1, ACTC1, MYL2, LMNA, ITGA1, IGF1, CACNB2, TPM2, TPM1, TPM4, TGFB2, DES, ITGA6, ITGA5, ITGB8, PLN, ITGA8, ITGB6, PRKACB, SGCB | 404 | 92 | 5085 | 2.87 | 0 | 0 | 0.03 |
| KEGG_PATHWAY | hsa05410:Hypertrophic cardiomyopathy (HCM) | 20 | 1.55 | 2.50E-05 | ACTB, SLC8A1, ACTC1, IL6, MYL2, LMNA, ITGA1, IGF1, CACNB2, TPM2, TPM1, TPM4, TGFB2, DES, ITGA6, ITGA5, ITGB8, ITGA8, ITGB6, SGCB | 404 | 85 | 5085 | 2.96 | 0 | 0 | 0.03 |
| KEGG_PATHWAY | hsa00280:Valine, leucine and isoleucine degradation | 12 | 0.93 | 4.71E-04 | MCCC2, ALDH7A1, ALDH1B1, MCEE, AOX1, BCKDHB, DLD, ACAD8, ACAT1, HIBADH, ALDH3A2, AUH | 404 | 44 | 5085 | 3.43 | 0.08 | 0.02 | 0.57 |
| KEGG_PATHWAY | hsa04512:ECM-receptor interaction | 17 | 1.31 | 7.54E-04 | COL4A4, TNC, ITGA1, COL4A6, CD47, LAMA4, LAMB3, LAMB2, CD44, ITGA6, ITGB8, ITGA5, ITGA8, ITGB6, COL1A2, THBS1, THBS4 | 404 | 84 | 5085 | 2.55 | 0.12 | 0.03 | 0.92 |
| KEGG_PATHWAY | hsa04270:Vascular smooth muscle contraction | 20 | 1.55 | 0 | PRKCA, ACTA2, PPP1R12B, CALD1, MRVI1, KCNMB1, ITPR1, MYL9, ITPR2, EDNRA, AGTR1, ACTG2, PLCB4, GNAQ, PLA2G12A, MYH11, PRKACB, PLCB1, PPP1R14A, MYLK | 404 | 112 | 5085 | 2.25 | 0.17 | 0.03 | 1.35 |
| KEGG_PATHWAY | hsa05412:Arrhythmogenic right ventricular cardiomyopathy (ARVC) | 14 | 1.08 | 0.01 | ACTB, SLC8A1, LMNA, ITGA1, ACTN1, CACNB2, DES, ITGA6, ITGB8, ITGA5, PKP2, ITGA8, ITGB6, SGCB | 404 | 76 | 5085 | 2.32 | 0.65 | 0.14 | 7.2 |
| KEGG_PATHWAY | hsa04610:Complement and coagulation cascades | 13 | 1.01 | 0.01 | C4A, MASP1, C4B, CFB, C1S, CD59, KLKB1, F3, SERPINE1, SERPINA1, CFI, C2, PROS1 | 404 | 69 | 5085 | 2.37 | 0.71 | 0.14 | 8.49 |
| KEGG_PATHWAY | hsa04310:Wnt signaling pathway | 22 | 1.7 | 0.01 | PRKCA, CSNK1A1, WNT5B, CAMK2G, MMP7, FZD1, DKK1, PLCB4, SFRP1, CCND2, SFRP2, JUN, SFRP4, PRICKLE2, PPP3CB, CAMK2D, WIF1, PRKACB, AXIN2, PLCB1, MYC, APC | 404 | 151 | 5085 | 1.83 | 0.72 | 0.13 | 8.79 |
| KEGG_PATHWAY | hsa05332:Graft-versus-host disease | 9 | 0.7 | 0.01 | HLA-DQB1, IL6, HLA-DRB1, HLA-DRB4, HLA-C, HLA-DPA1, HLA-B, FAS, HLA-DMB, HLA-G, HLA-DQA1 | 404 | 39 | 5085 | 2.9 | 0.82 | 0.16 | 11.58 |
| KEGG_PATHWAY | hsa00590:Arachidonic acid metabolism | 11 | 0.85 | 0.01 | CYP2U1, GPX2, PTGIS, PTGS2, PTGDS, ALOX15B, PLA2G12A, PTGS1, GGTLC3, GGT1, ALOX5, CBR3 | 404 | 56 | 5085 | 2.47 | 0.86 | 0.16 | 13.08 |
| KEGG_PATHWAY | hsa00480:Glutathione metabolism | 10 | 0.77 | 0.02 | GSTM1, GPX2, GSTM2, GSTA4, GGTLC3, GSTZ1, GSTO2, ANPEP, GSTT2, GGT1, GSTO1 | 404 | 50 | 5085 | 2.52 | 0.93 | 0.2 | 17.11 |
| KEGG_PATHWAY | hsa04514:Cell adhesion molecules (CAMs) | 19 | 1.47 | 0.02 | HLA-DQB1, HLA-DRB1, CDH1, ITGB2, HLA-C, NEO1, HLA-B, HLA-DMB, CDH3, HLA-DQA1, HLA-G, ITGA6, ITGB8, ITGA8, PVRL3, CD2, HLA-DRB4, HLA-DPA1, JAM2, NEGR1, SELE | 404 | 132 | 5085 | 1.81 | 0.93 | 0.19 | 17.38 |
| KEGG_PATHWAY | hsa04940:Type I diabetes mellitus | 9 | 0.7 | 0.02 | HLA-DQB1, HLA-DRB1, PTPRN2, HLA-DRB4, HLA-C, HLA-DPA1, HLA-B, FAS, HLA-DMB, HLA-G, HLA-DQA1 | 404 | 42 | 5085 | 2.7 | 0.93 | 0.17 | 17.49 |
| KEGG_PATHWAY | hsa04350:TGF-beta signaling pathway | 14 | 1.08 | 0.02 | SMAD6, FST, DCN, TGFB2, ACVR2A, ID1, ZFYVE16, ID4, ID3, THBS1, MYC, BMPR1A, ACVR1, THBS4 | 404 | 87 | 5085 | 2.03 | 0.96 | 0.19 | 20.39 |
| KEGG_PATHWAY | hsa04810:Regulation of actin cytoskeleton | 27 | 2.09 | 0.02 | FGFR2, FGF7, MYL5, MYL2, DIAPH2, FGF13, ITGB2, MYL9, VCL, GSN, ITGB8, ITGB6, RRAS, PDGFC, PAK1, FGF2, APC, ACTB, LIMK2, ITGA1, ACTN1, ITGA6, CHRM3, ITGA5, CFL2, ITGA8, MYLK | 404 | 215 | 5085 | 1.58 | 0.96 | 0.18 | 20.54 |
| KEGG_PATHWAY | hsa05330:Allograft rejection | 8 | 0.62 | 0.02 | HLA-DQB1, HLA-DRB1, HLA-DRB4, HLA-C, HLA-DPA1, HLA-B, FAS, HLA-DMB, HLA-G, HLA-DQA1 | 404 | 36 | 5085 | 2.8 | 0.97 | 0.19 | 22.63 |
| KEGG_PATHWAY | hsa00330:Arginine and proline metabolism | 10 | 0.77 | 0.02 | ALDH7A1, ALDH18A1, ACY1, GATM, GLUD2, ALDH1B1, MAOB, OAT, ALDH3A2, CKB | 404 | 53 | 5085 | 2.37 | 0.98 | 0.19 | 23.67 |
| KEGG_PATHWAY | hsa00982:Drug metabolism | 11 | 0.85 | 0.02 | GSTM1, GSTM2, CYP3A5, GSTA4, AOX1, MAOB, ADH5, GSTZ1, GSTO2, GSTT2, GSTO1 | 404 | 62 | 5085 | 2.23 | 0.98 | 0.18 | 24.32 |
| KEGG_PATHWAY | hsa05218:Melanoma | 12 | 0.93 | 0.02 | FGF7, MET, IGF1, CDH1, CDK6, FGF13, HGF, RB1, PTEN, IGF1R, PDGFC, FGF2, PTENP1 | 404 | 71 | 5085 | 2.13 | 0.98 | 0.18 | 24.47 |
| KEGG_PATHWAY | hsa05416:Viral myocarditis | 12 | 0.93 | 0.02 | ACTB, HLA-DQB1, CAV1, HLA-DRB1, ITGB2, HLA-C, HLA-B, HLA-DMB, HLA-DQA1, HLA-G, MYH11, HLA-DRB4, HLA-DPA1, SGCB | 404 | 71 | 5085 | 2.13 | 0.98 | 0.18 | 24.47 |
| KEGG_PATHWAY | hsa05310:Asthma | 7 | 0.54 | 0.02 | FCER1A, HLA-DQB1, HLA-DRB1, HLA-DRB4, FCER1G, HLA-DPA1, HLA-DMB, HLA-DQA1 | 404 | 29 | 5085 | 3.04 | 0.98 | 0.18 | 25.37 |
| KEGG_PATHWAY | hsa00640:Propanoate metabolism | 7 | 0.54 | 0.04 | ALDH7A1, ALDH1B1, MCEE, SUCLA2, ACAT1, ACSS3, ALDH3A2 | 404 | 32 | 5085 | 2.75 | 1 | 0.25 | 36.89 |
| KEGG_PATHWAY | hsa04916:Melanogenesis | 14 | 1.08 | 0.05 | PRKCA, WNT5B, GNAI1, CAMK2G, CREB1, EDN1, FZD1, EDNRB, PLCB4, GNAQ, CAMK2D, CREB3L4, PRKACB, PLCB1 | 404 | 99 | 5085 | 1.78 | 1 | 0.3 | 44.76 |
| KEGG_PATHWAY | hsa04020:Calcium signaling pathway | 21 | 1.62 | 0.06 | PRKCA, SLC8A1, CAMK2G, PHKA1, PTGFR, ITPR1, ITPR2, EDNRA, AGTR1, EDNRB, GNAL, CD38, PLCB4, GNAQ, CHRM3, PLN, CAMK2D, PPP3CB, PRKACB, PLCB1, MYLK | 404 | 176 | 5085 | 1.5 | 1 | 0.37 | 54.64 |
| KEGG_PATHWAY | hsa04530:Tight junction | 17 | 1.31 | 0.06 | PRKCA, ACTB, RAB3B, MYL5, MAGI2, ZAK, MYL2, MPDZ, GNAI1, ACTN1, AMOTL1, PTEN, MYL9, EPB41L2, MYH11, RRAS, JAM2, PTENP1 | 404 | 134 | 5085 | 1.6 | 1 | 0.36 | 54.73 |
| KEGG_PATHWAY | hsa05222:Small cell lung cancer | 12 | 0.93 | 0.07 | COL4A4, PTGS2, CDK6, RB1, PTEN, COL4A6, LAMB3, LAMA4, LAMB2, ITGA6, PIAS1, MYC, PTENP1 | 404 | 84 | 5085 | 1.8 | 1 | 0.36 | 56.55 |
| KEGG_PATHWAY | hsa04360:Axon guidance | 16 | 1.24 | 0.08 | LIMK2, GNAI1, MET, NTN4, SLIT2, EPHA3, SEMA5A, EPHA4, EPHB6, CFL2, PPP3CB, SEMA3C, EFNA5, UNC5D, PAK1, RASA1 | 404 | 129 | 5085 | 1.56 | 1 | 0.42 | 65.75 |
| KEGG_PATHWAY | hsa04115:p53 signaling pathway | 10 | 0.77 | 0.09 | SERPINB5, CCND2, SERPINE1, IGF1, CDK6, FAS, GADD45B, THBS1, CCNG2, PTEN, PTENP1 | 404 | 68 | 5085 | 1.85 | 1 | 0.42 | 66.77 |
| KEGG_PATHWAY | hsa04720:Long-term potentiation | 10 | 0.77 | 0.09 | PRKCA, PLCB4, GNAQ, CAMK2G, CAMK2D, PPP3CB, PRKACB, PLCB1, ITPR1, ITPR2 | 404 | 68 | 5085 | 1.85 | 1 | 0.42 | 66.77 |
| KEGG_PATHWAY | hsa04672:Intestinal immune network for IgA production | 8 | 0.62 | 0.09 | HLA-DQB1, IL6, TNFSF13B, HLA-DRB1, HLA-DRB4, HLA-DPA1, HLA-DMB, HLA-DQA1, TGFB2 | 404 | 49 | 5085 | 2.05 | 1 | 0.42 | 68.02 |
| KEGG_PATHWAY | hsa05322:Systemic lupus erythematosus | 13 | 1.01 | 0.09 | HLA-DQB1, HLA-DRB1, C4A, C4B, ACTN1, SSB, C1S, H2AFJ, HLA-DMB, HLA-DQA1, HLA-DRB4, HLA-DPA1, H3F3B, C2 | 404 | 99 | 5085 | 1.65 | 1 | 0.41 | 68.42 |
| KEGG_PATHWAY | hsa00620:Pyruvate metabolism | 7 | 0.54 | 0.09 | ALDH7A1, ALDH1B1, DLD, ACYP2, DLAT, ACAT1, ALDH3A2 | 404 | 40 | 5085 | 2.2 | 1 | 0.41 | 69.45 |
| KEGG_PATHWAY | hsa04730:Long-term depression | 10 | 0.77 | 0.09 | PRKCA, IGF1R, PLCB4, GNAQ, GNAI1, PLA2G12A, IGF1, PLCB1, ITPR1, ITPR2 | 404 | 69 | 5085 | 1.82 | 1 | 0.4 | 69.5 |
| KEGG_PATHWAY | hsa00980:Metabolism of xenobiotics by cytochrome P450 | 9 | 0.7 | 0.1 | GSTM1, GSTM2, CYP3A5, GSTA4, ADH5, GSTZ1, GSTO2, GSTT2, GSTO1 | 404 | 60 | 5085 | 1.89 | 1 | 0.42 | 71.98 |
Genes comprising signature 1 (Additional file 5: Table S3) were uploaded into the DAVID gene ontology search engine (http://david.abcc.ncifcrf.gov/). KEGG pathway enrichment was generated and the table represents the output file ranked based on significance and annotated by column header.
KEGG pathway enrichment analysis for the genes comprising signature 2 (101.6.2)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| KEGG_PATHWAY | hsa04610:Complement and coagulation cascades | 10 | 1.56 | 0 | C1QA, FGG, A2M, C3, KLKB1, CD46, C1R, SERPING1, C1S, CFD | 219 | 69 | 5085 | 3.37 | 0.33 | 0.33 | 2.97 |
| KEGG_PATHWAY | hsa04540:Gap junction | 10 | 1.56 | 0.01 | TJP1, ADCY2, GNAI1, PDGFA, TUBB6, GUCY1A3, GJA1, LPAR1, PRKACB, ITPR2 | 219 | 89 | 5085 | 2.61 | 0.88 | 0.65 | 14.98 |
| KEGG_PATHWAY | hsa04142:Lysosome | 11 | 1.72 | 0.03 | AGA, HGSNAT, LAMP2, CTSK, GM2A, PSAP, LGMN, CTSB, SCARB2, FUCA1, CLN5 | 219 | 117 | 5085 | 2.18 | 0.99 | 0.77 | 28.69 |
| KEGG_PATHWAY | hsa04270:Vascular smooth muscle contraction | 10 | 1.56 | 0.05 | PLA2G4A, ADCY2, CALD1, MRVI1, GUCY1A3, PRKCH, PRKACB, PPP1CB, MYLK, ITPR2 | 219 | 112 | 5085 | 2.07 | 1 | 0.87 | 46.04 |
| KEGG_PATHWAY | hsa04310:Wnt signaling pathway | 12 | 1.88 | 0.06 | CCND1, PRICKLE1, CCND2, BTRC, NFAT5, CAMK2D, TP53, MAPK10, PRKACB, FZD5, FZD4, FZD7 | 219 | 151 | 5085 | 1.85 | 1 | 0.85 | 51.51 |
| KEGG_PATHWAY | hsa05330:Allograft rejection | 5 | 0.78 | 0.07 | HLA-DRB5, HLA-DPB1, HLA-E, HLA-DOA, HLA-DRA | 219 | 36 | 5085 | 3.22 | 1 | 0.84 | 56.25 |
| KEGG_PATHWAY | hsa05416:Viral myocarditis | 7 | 1.1 | 0.08 | CAV1, CCND1, HLA-DRB5, HLA-DPB1, HLA-E, HLA-DOA, HLA-DRA | 219 | 71 | 5085 | 2.29 | 1 | 0.86 | 64.57 |
| KEGG_PATHWAY | hsa05332:Graft-versus-host disease | 5 | 0.78 | 0.08 | HLA-DRB5, HLA-DPB1, HLA-E, HLA-DOA, HLA-DRA | 219 | 39 | 5085 | 2.98 | 1 | 0.82 | 65.24 |
| KEGG_PATHWAY | hsa04510:Focal adhesion | 14 | 2.19 | 0.09 | CAV1, PDGFA, MAPK10, FLNC, PPP1CB, VCL, CCND1, CCND2, ITGAV, COL6A2, RAP1A, THBS1, PIK3R1, MYLK | 219 | 201 | 5085 | 1.62 | 1 | 0.81 | 67.5 |
Genes comprising signature 2 (Additional file 6: Table S4) were uploaded into the DAVID gene ontology search engine (http://david.abcc.ncifcrf.gov/). KEGG pathway enrichment was generated and the table represents the output file ranked based on significance and annotated by column header.
KEGG pathway enrichment analysis for the genes comprising signature 3 (101.6.3)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| KEGG_PATHWAY | hsa04110:Cell cycle | 36 | 0.47 | 9.03E-20 | E2F1, E2F2, E2F3, TTK, CHEK1, PTTG1, CCNE2, CCNE1, CDKN2A, MCM7, CDKN2C, CDKN2D, ORC6L, TFDP2, BUB1, CCNA2, STAG1, CDC7, CDC6, RBL1, SKP2, ESPL1, CDC20, MCM2, CDC25C, MCM4, CDC25A, CDC25B, CDKN1C, CCNB1, CCNB2, MAD2L1, PLK1, GSK3B, BUB1B, MAD2L2 | 225 | 125 | 5085 | 6.51 | 1.29E-17 | 1.29E-17 | 1.07E-16 |
| KEGG_PATHWAY | hsa03030:DNA replication | 12 | 0.16 | 2.15E-07 | RFC5, PRIM1, MCM7, RFC4, POLE2, LIG1, POLA1, POLA2, MCM2, RNASEH2A, MCM4, FEN1 | 225 | 36 | 5085 | 7.53 | 3.08E-05 | 1.54E-05 | 2.55E-04 |
| KEGG_PATHWAY | hsa04114:Oocyte meiosis | 18 | 0.23 | 4.82E-06 | SGOL1, AURKA, CDC20, ESPL1, PTTG1, CDC25C, CCNE2, CCNB1, CCNE1, CCNB2, MAD2L1, ADCY9, CALML3, PLK1, BUB1, FBXO5, CAMK2B, MAD2L2 | 225 | 110 | 5085 | 3.7 | 6.89E-04 | 2.30E-04 | 0.01 |
| KEGG_PATHWAY | hsa04914:Progesterone-mediated oocyte maturation | 14 | 0.18 | 8.26E-05 | HSP90AA1, CDC25C, CDC25A, CDC25B, CCNB1, CCNB2, MAD2L1, KRAS, ADCY9, PLK1, BUB1, MAD2L2, PIK3R3, CCNA2 | 225 | 86 | 5085 | 3.68 | 0.01 | 0 | 0.1 |
| KEGG_PATHWAY | hsa04115:p53 signaling pathway | 10 | 0.13 | 0 | CCNE2, CCNB1, CCNE1, CDKN2A, CCNB2, RRM2, TSC2, CHEK1, PMAIP1, GTSE1 | 225 | 68 | 5085 | 3.32 | 0.32 | 0.07 | 3.16 |
| KEGG_PATHWAY | hsa05222:Small cell lung cancer | 11 | 0.14 | 0 | E2F1, CCNE2, E2F2, CCNE1, CKS1B, E2F3, PTK2, SKP2, PIAS2, PIK3R3, ITGA2B | 225 | 84 | 5085 | 2.96 | 0.4 | 0.08 | 4.12 |
| KEGG_PATHWAY | hsa04360:Axon guidance | 14 | 0.18 | 0 | PLXNA1, EFNB3, PLXNA2, DPYSL5, EPHB1, PTK2, KRAS, UNC5B, PAK2, UNC5A, FYN, GSK3B, SRGAP1, SRGAP2 | 225 | 129 | 5085 | 2.45 | 0.45 | 0.08 | 4.83 |
| KEGG_PATHWAY | hsa00240:Pyrimidine metabolism | 11 | 0.14 | 0.01 | PRIM1, TYMS, POLR3K, POLE2, RRM2, RRM1, DCK, POLA1, POLA2, NME7, TK1 | 225 | 95 | 5085 | 2.62 | 0.71 | 0.14 | 9.63 |
| KEGG_PATHWAY | hsa05219:Bladder cancer | 7 | 0.09 | 0.01 | E2F1, RPS6KA5, E2F2, E2F3, CDKN2A, KRAS, PGF | 225 | 42 | 5085 | 3.77 | 0.74 | 0.14 | 10.67 |
| KEGG_PATHWAY | hsa05215:Prostate cancer | 10 | 0.13 | 0.02 | E2F1, CCNE2, E2F2, CCNE1, E2F3, HSP90AA1, KRAS, GSK3B, PIK3R3, CTNNB1 | 225 | 89 | 5085 | 2.54 | 0.9 | 0.2 | 17.14 |
| KEGG_PATHWAY | hsa00230:Purine metabolism | 14 | 0.18 | 0.02 | POLR3K, POLA1, DCK, POLA2, HPRT1, GMPS, NME7, GART, PRIM1, ADCY9, POLE2, RRM2, PKLR, RRM1 | 225 | 153 | 5085 | 2.07 | 0.91 | 0.2 | 18.08 |
| KEGG_PATHWAY | hsa03410:Base excision repair | 6 | 0.08 | 0.02 | POLE2, UNG, LIG1, MBD4, NTHL1, FEN1 | 225 | 35 | 5085 | 3.87 | 0.92 | 0.19 | 18.86 |
| KEGG_PATHWAY | hsa05214:Glioma | 8 | 0.1 | 0.02 | E2F1, E2F2, E2F3, CDKN2A, KRAS, CALML3, CAMK2B, PIK3R3 | 225 | 63 | 5085 | 2.87 | 0.94 | 0.2 | 21.16 |
| KEGG_PATHWAY | hsa05200:Pathways in cancer | 23 | 0.3 | 0.03 | E2F1, E2F2, FZD8, CKS1B, MSH6, E2F3, HSP90AA1, PGF, FGF9, SKP2, BIRC5, FZD2, CTNNB1, CTNNA2, CCNE2, CCNE1, PTK2, CDKN2A, KRAS, GSK3B, PIAS2, PIK3R3, ITGA2B | 225 | 328 | 5085 | 1.58 | 0.99 | 0.27 | 30.48 |
| KEGG_PATHWAY | hsa00670:One carbon pool by folate | 4 | 0.05 | 0.03 | TYMS, MTHFD2, SHMT2, GART | 225 | 16 | 5085 | 5.65 | 0.99 | 0.26 | 31.03 |
| KEGG_PATHWAY | hsa04916:Melanogenesis | 9 | 0.12 | 0.07 | FZD8, KRAS, ADCY9, CALML3, GSK3B, GNAS, CAMK2B, FZD2, CTNNB1 | 225 | 99 | 5085 | 2.05 | 1 | 0.47 | 57.19 |
| KEGG_PATHWAY | hsa05210:Colorectal cancer | 8 | 0.1 | 0.08 | FZD8, MSH6, KRAS, GSK3B, BIRC5, FZD2, PIK3R3, CTNNB1 | 225 | 84 | 5085 | 2.15 | 1 | 0.48 | 60.57 |
| KEGG_PATHWAY | hsa03430:Mismatch repair | 4 | 0.05 | 0.08 | RFC5, MSH6, RFC4, LIG1 | 225 | 23 | 5085 | 3.93 | 1 | 0.48 | 61.82 |
| KEGG_PATHWAY | hsa05223:Non-small cell lung cancer | 6 | 0.08 | 0.09 | E2F1, E2F2, E2F3, CDKN2A, KRAS, PIK3R3 | 225 | 54 | 5085 | 2.51 | 1 | 0.5 | 66.23 |
| KEGG_PATHWAY | hsa05218:Melanoma | 7 | 0.09 | 0.09 | E2F1, E2F2, E2F3, CDKN2A, KRAS, FGF9, PIK3R3 | 225 | 71 | 5085 | 2.23 | 1 | 0.5 | 67.79 |
| KEGG_PATHWAY | hsa05212:Pancreatic cancer | 7 | 0.09 | 0.1 | E2F1, E2F2, E2F3, CDKN2A, KRAS, PGF, PIK3R3 | 225 | 72 | 5085 | 2.2 | 1 | 0.5 | 69.77 |
Genes comprising signature 3 (Additional file 7: Table S5) were uploaded into the DAVID gene ontology search engine (http://david.abcc.ncifcrf.gov/). KEGG pathway enrichment was generated and the table represents the output file ranked based on significance and annotated by column header.
KEGG pathway enrichment analysis for the genes comprising signature 3 (101.6.4)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| KEGG_PATHWAY | hsa00100:Steroid biosynthesis | 3 | 0.1 | 0.03 | SQLE, FDFT1, SC4MOL | 86 | 17 | 5085 | 10.43 | 0.97 | 0.97 | 30.81 |
| KEGG_PATHWAY | hsa05200:Pathways in cancer | 11 | 0.38 | 0.05 | LAMA1, HRAS, PTK2, SOS1, CBL, VEGFA, PPARG, RALA, LEF1, MDM2, LAMB1 | 86 | 328 | 5085 | 1.98 | 0.99 | 0.93 | 41.08 |
| KEGG_PATHWAY | hsa04510:Focal adhesion | 8 | 0.27 | 0.05 | LAMA1, HRAS, PTK2, FLT1, DIAPH1, SOS1, VEGFA, LAMB1 | 86 | 201 | 5085 | 2.35 | 1 | 0.85 | 44.04 |
| KEGG_PATHWAY | hsa00330:Arginine and proline metabolism | 4 | 0.14 | 0.06 | ARG1, P4HA2, P4HA1, CPS1 | 86 | 53 | 5085 | 4.46 | 1 | 0.82 | 49.31 |
| KEGG_PATHWAY | hsa05216:Thyroid cancer | 3 | 0.1 | 0.08 | HRAS, PPARG, LEF1 | 86 | 29 | 5085 | 6.12 | 1 | 0.86 | 63 |
Genes comprising signature 4 (Additional file 8: Table S6) were uploaded into the DAVID gene ontology search engine (http://david.abcc.ncifcrf.gov/). KEGG pathway enrichment was generated and the table represents the output file ranked based on significance and annotated by column header.
Figure 1Gene signatures capable of discriminating between prostate cancer subgroups and classify metastatic disease. Gene signatures generated using the Varambally dataset and found to be significant discriminators of metastatic disease and primary/localised cancers (Additional file 10: Table S8) when applied to the Tomlins and Rawaswamy datasets were used to cluster samples in these datasets in a heatmap. The gene signatures represented are those capable of characterising samples from at least one progression stage (Fischer’s exact < = 0.05). Gene signatures are rows and samples are columns. The colour coded bar at the base of the heatmap indicates the clinical grouping for each sample as also defined in the key. Metastatic hormone refractory, metastatic hormone naïve and hormone refractory vs. naïve represent prostate cancer cases from the Tomlins dataset, as do PIN (prostatic intraepithelial neoplasia) and primary carcinoma. The other categories (metastatic and primary) are samples from the Rawaswamy dataset and are metastatic and primary cancers from multiple organ sites, not simply the prostate gland. The blue bar graph on the right-hand side of the heatmap depicts the number of genes in each signature which are differentially expressed and contribute to the sample clustering in this analysis. For signature 1 (dist 101.6.1 and Additional file 5: Table S3) this is 1748 genes in total as highlighted and other bars are numbers of genes relative to this. The colour scale represents the mean log2 fold change for differential gene signatures (> = abs log2(2)). Red indicates module induction, green repression. Gene signatures significant in both directions are indicated in yellow. Using the mean module log2 fold change we clustered the samples and modules using hierarchical clustering with euclidean distance as a measure of dissimilarity. Data points that contained both induced and repressed values have been excluded from the clustering.
KEGG pathway enrichment for the 71-gene signature capable of subclustering localised prostate cancer cases across multiple datasets
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| KEGG_PATHWAY | hsa04270:Vascular smooth muscle contraction | 5 | 6.94 | 0 | ACTG2, MYH11, KCNMB1, MYLK, MYL9 | 26 | 112 | 5085 | 8.73 | 0.12 | 0.12 | 1.99 |
| KEGG_PATHWAY | hsa05414:Dilated cardiomyopathy | 4 | 5.56 | 0.01 | DES, PLN, IGF1, TPM2 | 26 | 92 | 5085 | 8.5 | 0.47 | 0.27 | 9.6 |
| KEGG_PATHWAY | hsa04960:Aldosterone-regulated sodium reabsorption | 3 | 4.17 | 0.02 | IGF1, ATP1A2, IRS1 | 26 | 41 | 5085 | 14.31 | 0.66 | 0.31 | 15.92 |
| KEGG_PATHWAY | hsa04310:Wnt signaling pathway | 4 | 5.56 | 0.04 | SFRP1, CAMK2G, PRICKLE2, MYC | 26 | 151 | 5085 | 5.18 | 0.91 | 0.45 | 31.53 |
| KEGG_PATHWAY | hsa05410:Hypertrophic cardiomyopathy (HCM) | 3 | 4.17 | 0.06 | DES, IGF1, TPM2 | 26 | 85 | 5085 | 6.9 | 0.99 | 0.57 | 49.28 |
Genes were uploaded into the DAVID gene ontology search engine (http://david.abcc.ncifcrf.gov/). KEGG pathway enrichment was generated and the table represents the output file ranked based on significance and annotated by column header.
Figure 2Differential expression of a 71-gene signature classifier in a prostate cancer exon-array dataset (Taylor ) and the TCGA RNA-seq dataset for prostate cancer (TCGA-PRAD). The expression values of the 71-gene signature (dist.0.6.34) capable of subclustering localised prostate cancer from other samples in all three interrogated datasets are shown in two independent datasets, A. a prostate cancer exon-array dataset (Taylor et al.) and B. TCGA RNA-seq dataset for prostate cancer (TCGA-PRAD) were used. Values were log2 normalized and the mean of the sample groups (PRIMARY TUMOUR/SOLID TISSUE NORMAL) is shown.
KEGG pathway enrichment analysis for the entire set of overexpressed genes in localised prostate cancer versus benign tissue in the Varambally dataset (GSE3325)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| KEGG_PATHWAY | hsa00512:O-Glycan biosynthesis | 3 | 0.34 | 0.01 | GALNTL4, GCNT1, ST6GALNAC1 | 26 | 30 | 5085 | 19.56 | 0.43 | 0.43 | 8.95 |
| KEGG_PATHWAY | hsa04610:Complement and coagulation cascades | 3 | 0.34 | 0.04 | C4A, C4B, SERPINA1 | 26 | 69 | 5085 | 8.5 | 0.94 | 0.75 | 36.76 |
| KEGG_PATHWAY | hsa05322:Systemic lupus erythematosus | 3 | 0.34 | 0.08 | C4A, C4B, HLA-DMB | 26 | 99 | 5085 | 5.93 | 1 | 0.83 | 58.75 |
Genes were uploaded into the DAVID gene ontology search engine (http://david.abcc.ncifcrf.gov/). KEGG pathway enrichment was generated and the table represents the output file ranked based on significance and annotated by column header.
Figure 3Heatmaps confirming the clustering ability of the 33-gene signature in a prostate cancer exon-array dataset (Taylor ) and the TCGA RNA-seq dataset for prostate cancer (TCGA-PRAD). The 33-gene signature was applied to two independent datasets, A. a prostate cancer exon-array dataset (Taylor et al.), and B. TCGA RNA-seq dataset for prostate cancer (TCGA-PRAD). Expression values were log2 transformed, normalized for high level mean and variance and hierarchically clustered using Euclidian distance. Genes are rows and samples are columns. The colour coded bars indicate expression values and the clinical grouping for each sample as defined in the keys.
Comparison of the performance of a 31-gene signature with ERG, AR and KLK3 in discriminating between benign tissue, localised prostate cancer and metastatic disease
|
|
|
|
|
|---|---|---|---|
| KLK3 | 0.5204082 | 0.9104938 | 0.8707483 |
| ERG | 0.812616 | 0.9326599 | 0.6099773 |
| AR | 0.6581633 | 0.8395062 | 0.8435374 |
| 31 Gene signature | 0.994898 | 0.9938272 | 0.957672 |
| Derived from the Grasso Data |
Data were downloaded from Grasso et al.,. ROC statistics were computed in an evaluation sample set having established the weighting for genes in the signature using logistic regression in a test sample set. We report the area under the curve (AUC) for each transcript and for the signature for each of three pairwise comparisons as generated using the R package ROCR.
Figure 4Receiver operating characteristic (ROC) curves for discrimination between localised prostate cancer and benign cases, metastatic and benign cases and metastatic and prostate cancers using a 31-gene signature (row 1), AR (row 2), ERG (row 3) and KLK3 (row 4).
Figure 5Workflow for the identification of robust gene signatures and gene sets for clustering prostate cancer cases. In step 1, we identified all statistically significant differentially expressed Affymetrix array probes in a small dataset consisting of 13 macrodissected clinical samples encompassing localised benign prostatic hyperplasia, localised prostate cancer and metastatic disease (GSE3325). We then generated gene signatures from these based on gene coexpression at varying stringency thresholds. These gene signatures were then applied to two additional datasets, a microdissected dataset (Tomlins et al.) and a multi-tissue site cancer and metastatic dataset (Ramaswamy et al.). A large number of the coexpression gene signatures clustered localised prostate cancers from metastatic disease and prostate metastases from other sample sets. The most compact gene signature able to do so consisted of 71 genes (A) and we assessed its expression pattern in two additional datasets, an exon-array dataset (Taylor et al.) and in a RNA-sequenced dataset (TCGA-PRAD). Few of the genes in the significant coexpression gene signatures were overexpressed genes in localised prostate cancers. In the second phase of the study, we abstracted all of the overexpressed genes and refined this down to a set of 33 genes based on significant overexpression in additional publicly available prostate cancer microarray datasets housed within the Oncomine database (B). These genes also effectively clustered benign versus cancer cases in an exon-array dataset (Taylor et al.) an expression microarray dataset (Grasso et al.) and a RNA-sequenced dataset (TCGA-PRAD) (C and D). In conclusion, it is possible to generate gene classifiers of clinical prostate cancer from a small dataset of macrodissected samples with the capacity to classify larger sequenced and microdissected datasets based on clinical characteristics.