| Literature DB >> 25611546 |
Abstract
MicroRNAs (miRNAs) play key roles in the initiation and progression of various cancers by regulating genes. Regulatory interactions between genes and miRNAs are complex, as multiple miRNAs can regulate multiple genes. In addtion, these interactions vary from patient to patient and even among patients with the same cancer type, as cancer development is a heterogeneous process. These relationships are more complicated because transcription factors and other regulatory molecules can also regulate miRNAs and genes. Hence, it is important to identify the complex relationships between genes and miRNAs in cancer. In this study, we propose a computational approach to constructing modules that represent these relationships by integrating the expression data of genes and miRNAs with gene-gene interaction data. First, we used a biclustering algorithm to construct modules consisting of a subset of genes and a subset of samples to incorporate the heterogeneity of cancer cells. Second, we combined gene-gene interactions to include genes that play important roles in cancer-related pathways. Then, we selected miRNAs that are closely associated with genes in the modules based on a Gaussian Bayesian network and Bayesian Information Criteria. When we applied our approach to ovarian cancer and glioblastoma (GBM) data sets, 33 and 54 modules were constructed, respectively. In these modules, 91% and 94% of ovarian cancer and GBM modules, respectively, were explained either by direct regulation between genes and miRNAs or by indirect relationships via transcription factors. In addition, 48.4% and 74.0% of modules from ovarian cancer and GBM, respectively, were enriched with cancer-related pathways, and 51.7% and 71.7% of miRNAs in modules were ovarian cancer-related miRNAs and GBM-related miRNAs, respectively. Finally, we extensively analyzed significant modules and showed that most genes in these modules were related to ovarian cancer and GBM.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25611546 PMCID: PMC4303261 DOI: 10.1371/journal.pcbi.1004042
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1Overview of the proposed approach.
(A) Collect gene expression and miRNA expression data sets from paired tumor samples, and calculate log2 ratios between tumor samples and normal samples. (B) Construct gene-sample modules (GSM) from a differentially expressed gene expression matrix using a biclustering algorithm, which allows duplications of genes and samples in multiple modules. (C) Add genes to GSM using gene-gene interactions, if the included genes increase the average PCC values among genes in the module. (D) Construct gene-miRNA modules (GMM) by selecting gene-regulating miRNAs in GSM. Use a Gaussian Bayesian network and the BIC score to evaluate the relationship between genes and miRNAs. (E) To determine the functional relevance of the modules, test whether the genes from the modules are enriched for specific biological functions or signaling pathways. To validate that modules are related to a specific cancer, check that the genes and miRNAs are related to the specific cancer.
Figure 2Performance comparison of gene-miRNA modules for ovarian cancer.
For ovarian cancer, we compared the performance of gene-miRNA modules generated from four cases: SCC with GGI information, SCC without GGI information, PCC with GGI information, and PCC without GGI information. For all cases, the x-axis presents different percentages of candidate miRNAs (T%) among all miRNAs when constructing gene-miRNA modules. For each case, the number of modules (A), the ratios of cancer genes (B), the ratios of ovarian cancer genes (C), the ratios of ovarian cancer miRNAs (D), the average number of enriched pathways (E), and the ratios of modules enriched with at least one pathway (F) are shown.
Cancer genes, ovarian cancer genes and ovarian cancer miRNAs for selected modules.
| Module ID | Cancer Genes | Num1 | Ovarian Cancer Genes | Num2 | Ovarian Cancer miRNAs | Num3 |
|---|---|---|---|---|---|---|
| 2 | CD44, MMP9, PLAUR, LTB, GBP1, CTSH, EPB41L3, POU2AF1, VAV1, CXCL10, MEF2C, HCK, BTK, CASP1, CD74, LCK, LYN, FGR, SPP1 | 19/60 | CD44, DPYD, IL18, MMP9, PLAUR | 5/60 | miR-125b, miR-146a, miR-155, miR-17, miR-20a, miR-21, miR-218, miR-22, miR-223, miR-224, miR-335 | 11/24 |
| 3 | CDK2, E2F1, PLK1, MCM2, CDC6, EZH2, ASPM, BUB1 | 8/35 | CDK2, E2F1 | 2/35 | miR-106b, miR-130b, miR-18a, miR-19a, miR-25, miR-29a, miR-93 | 7/14 |
| 6 | BARD1, CDC25A, CDK2, MSH6, MCM2, BUB1, FEN1, PCNA, CDKN3 | 9/34 | BARD1, CDC25A, CDK2, MKI67, MSH6 | 5/34 | miR-101, miR-106b, miR-130b, miR-17, miR-18a, miR-19a, miR-20b, miR-25, miR-29a, miR-93 | 10/20 |
| 8 | PLAUR, MMP11, BGN, COL16A1, THBS2, THBS1, VCAN, COL1A1, TIMP3, PDGFRB, COL1A2 | 11/39 | FN1, LGALS1, PLAU, PLAUR, SERPINE1 | 5/39 | miR-152, miR-199a, miR-214, miR-22 | 4/8 |
| 12 | E2F3, MCM2, FEN1, DEK, PALB2, PSMA5 | 6/33 | E2F3, NBN | 2/33 | miR-93 | 1/2 |
| 13 | CDC42, PLK1, CDC6, BUB1, PCNA, UCHL5, FANCE, SMARCB1, FANCG, EIF4EBP1, ECT2 | 11/78 | CDC42 | 1/78 | miR-18a, miR-25, miR-29a, miR-93 | 4/8 |
| 18 | MCM2, FEN1, FOXM1, DEK, FANCG, WHSC1 | 6/31 | MKI67 | 1/31 | miR-18a, miR-25, miR-29a, miR-93 | 4/7 |
| 20 | AURKA, CDC20, MAD2L1, TOP2A, PLK1, ASPM, BUB1, FOXM1, MYBL2, KIF14, CCNA2, CCNB1, BUB1B | 13/44 | AURKA, CDC20, MAD2L1, TOP2A | 4/44 | miR-101, miR-17, miR-18a, miR-19a, miR-29a, miR-93 | 6/13 |
| 21 | HCK, BTK, LCK, IL2RG, IL2RB, ITK, CCR1, LAPTM5 | 8/30 | 0/30 | miR-146a, miR-155, miR-21, miR-218, miR-22, miR-223, miR-224 | 7/17 | |
| 22 | MMP2, MMP11, THBS2, VCAN, COL1A1, LOXL2, ADAM12, DPT, ECM1 | 9/27 | FN1, MMP1, MMP2, PLAU, SPARC | 5/27 | miR-152, miR-214, miR-22 | 3/6 |
| 25 | MCM2, FEN1, PCNA, MYBL2, FBXO5 | 5/29 | 0/29 | miR-18a, miR-25, miR-29a, miR-93 | 4/8 | |
| 26 | MAD2L1, PLK1, FEN1, PCNA, UCHL5, CCNA2, CCNB1, FBXO5, RAP1GDS1, RAN | 10/44 | MAD2L1 | 1/44 | let-7b, miR-101, miR-17, miR-18a, miR-19a, miR-25, miR-29a, miR-93 | 8/17 |
| 27 | MMP14, MMP2, MMP11, BGN, COL16A1, THBS2, THBS1, VCAN, COL1A1, PDGFRB, COL1A2, LOXL2, ADAM12, ECM1, COL11A1, TWIST1, SFRP4, LOX, TAGLN, LHFP | 20/55 | FN1, MMP14, MMP2, PLAU, SERPINF1, SPARC | 6/55 | miR-127, miR-145, miR-152, miR-199a, miR-214, miR-22 | 6/12 |
| 31 | CD82, CTSB, STAT3, TNFSF10, GBP1, EPB41L3, CXCL10, CASP1, LYN, SPP1, LAPTM5, IRF1, CTSL1, TACC1, S100A13, CAPG | 16/65 | ACVR2B, CD82, CTSB, CTSD, DPYD, RAB25, SERPINF1, STAT3, TNFSF10 | 9/65 | miR-125b, miR-130a, miR-146a, miR-155, miR-17, miR-183, miR-20a, miR-20b, miR-21, miR-218, miR-22, miR-223, miR-224, miR-335 | 14/23 |
| 33 | AURKA, CDC20, TOP2A, PLK1, ASPM, BUB1, FOXM1, EfCT2, KIF14, CCNA2, BUB1B, FBXO5, UBE2C, TK1, CENPF, TACC3, CKS2 | 17/57 | AURKA, CDC20, MKI67, TOP2A | 4/57 | let-7b, miR-101, miR-106b, miR-130b, miR-146b, miR-16, miR-17, miR-18a, miR-19a, miR-20b, miR-25, miR-29a, miR-93 | 13/31 |
Num1 represents the number of cancer genes / the number of all genes in a module,
Num2 the number of ovarian cancer genes / the number of all genes in a module, and
Num3 the number of ovarian cancer miRNAs / the number of all miRNAs in a module.
miRNAs regulate genes in ovarian cancer modules.
| Module ID | miRNA | m | k | x |
| Genes |
|---|---|---|---|---|---|---|
| 2 | miR-185 | 757 | 60 | 9 | 1.21E-02 | RASSF4,CTSH,POU2AF1,PSCD4,AIM2,LCK |
| 3 | miR-7 | 997 | 35 | 7 | 2.26E-02 | BUB1,ASPM,SEC61A2,CDK2,COQ7,SYT17 |
| 6 | miR-7 | 997 | 34 | 7 | 1.94E-02 | BUB1,POLE2,KIF23,CDK2,MCM6 |
| 7 | miR-331 | 892 | 25 | 5 | 3.38E-02 | PYCRL,SHARPIN,PLEC1 |
| 13 | miR-7 | 997 | 78 | 12 | 2.60E-02 | BUB1,POLE2,MCM6,BXDC2,RBBP9,SMARCB1,GAD1 |
| 15 | miR-9 | 863 | 35 | 6 | 3.62E-02 | MXD3,C6orf134,DDX25 |
| 15 | miR-29b | 1266 | 35 | 9 | 8.60E-03 | DNAH7,COL4A6,DDX25 |
| 17 | miR-29a | 1038 | 29 | 7 | 1.00E-02 | MYBL2,TDG,PPIE,MSH2 |
| 17 | miR-29b | 1266 | 29 | 7 | 2.75E-02 | TIMELESS,TDG,FAF1 |
| 25 | miR-7 | 997 | 29 | 8 | 1.91E-03 | FBXO5,POLE2,KIF23,MCM6 |
| 26 | miR-29b | 1266 | 44 | 10 | 1.41E-02 | CHEK1,TIMELESS,RIT1,DYNLT1 |
| 29 | miR-93 | 946 | 23 | 5 | 3.03E-02 | CDCA8,MED8,RLF |
| 30 | let-7b | 1050 | 26 | 6 | 2.19E-02 | EHMT2,RNF5,RGL2 |
| 33 | let-7b | 1050 | 57 | 11 | 9.28E-03 | ESPL1,DEPDC1,BUB1B,UBE2C,AURKB |
m,
k, and
x represent the number of genes regulated by the miRNA collected from MicroCosm, the number of genes in the module, and the number of genes regulated by the miRNA in the module, respectively. The significant numbers of genes in each module are regulated by the miRNA, and the significances are shown in
p-value.
Experimentally validated gene-miRNA interactions with strong evidence from miRTarbase in ovarian cancer modules.
| Module ID | Gene | miRNA | Validation Method | PubMed ID |
|---|---|---|---|---|
| 2 | EPB41L3 | miR-223 | Luciferase reporter assay, Western blot | 21628394 |
| 2 | MEF2C | miR-223 | Luciferase reporter assay | 18278031 |
| 2 | MEF2C | miR-21 | Immunofluorescence, In situ hybridization, Luciferase reporter assay | 21170291 |
| 3 | E2F1 | miR-93 | Luciferase reporter assay, Western blot | 19486339 |
| 3 | E2F1 | miR-106b | Luciferase reporter assay, Western blot | 19486339 |
| 3 | EZH2 | miR-25 | Luciferase reporter assay, qRT-PCR, Western blot | 22399519 |
| 5 | CREBZF | miR-221 | Reporter assay, Microarray | 20018759 |
| 6 | CCNE2 | miR-26a | Luciferase reporter assay, Western blot | 19524505 |
| 13 | CDC42 | miR-29a | Luciferase reporter assay, Western blot | 19079265 |
| 14 | BCL3 | miR-125b | Luciferase reporter assay | 20658525 |
| 14 | HK2 | miR-125b | Luciferase reporter assay, qRT-PCR | 22593586 |
| 18 | NASP | miR-29a | Luciferase reporter assay, Western blot | 22080513 |
| 26 | CCNA2 | let-7b | Immunoblot, Immunofluorescence, Luciferase reporter assay, qRT-PCR | 18379589 |
| 27 | TWIST1 | miR-214 | Luciferase reporter assay, qRT-PCR, Western blot | 22540680 |
| 27 | MMP14 | miR-145 | Reporter assay, Microarray | 21351259 |
| 31 | STAT3 | miR-21 | Western blot, Other | 20048743 |
| 31 | STAT3 | miR-20b | qRT-PCR, ELISA, ChIP, Western blot | 20232316 |
| 31 | EPB41L3 | miR-223 | Luciferase reporter assay, Western blot | 21628394 |
| 31 | TNFSF10 | miR-222 | Western blot | 18246122 |
| 32 | TOB1 | miR-218 | Luciferase reporter assay | 23060446 |
| 33 | CCNA2 | let-7b | Immunoblot, Immunofluorescence, Luciferase reporter assay, qRT-PCR | 18379589 |
Figure 3Regulations among genes, miRNAs, and TFs in ovarian cancer modules.
For three ovarian cancer modules-22 (A), 8 (B), and 33 (C)-the expression values of genes, miRNAs, and TFs are shown. Arrows represent genes and miRNAs regulated by TFs or other miRNAs. Genes and miRNAs are members of each module, but TFs do not belong to the modules.
Figure 4Regulations among genes, miRNAs, and TFs in GBM modules.
For two GBM modules, 11 (A) and 5 (B), the expression values of genes, miRNAs, and TFs are shown. Arrows represent genes and miRNAs regulated by TFs or other miRNAs. Genes and miRNAs are members of each module, but TFs do not belong to the modules.
Ovarian cancer modules with enriched pathways.
| Module ID | Pathways | Related genes | # of genes |
|
|---|---|---|---|---|
| 2 | Cytokine-Cytokine Receptor Interaction | CXCL13, LTB, CXCL11, IL18, CXCL9, CD27, CXCL10, CCR5 | 8 | 7.16E-04 |
| 2 | Chemokine Signaling Pathway | CXCL13, CXCL11, CXCL9, VAV1, CXCL10, HCK, DOCK2, CCR5, LYN, FGR | 10 | 8.02E-07 |
| 2 | Cell Adhesion Molecules Cams | ICOS, SIGLEC1, ITGB2, CD4 | 4 | 4.45E-02 |
| 2 | Toll-Like Receptor Signaling Pathway | CXCL11, CXCL9, CXCL10, SPP1 | 4 | 1.92E-02 |
| 2 | Natural Killer Cell Mediated Cytotoxicity | VAV1, LCP2, ITGB2, TYROBP, LCK | 5 | 6.05E-03 |
| 2 | T-Cell Receptor Signaling Pathway | ICOS, VAV1, LCP2, CD4, LCK | 5 | 5.81E-03 |
| 2 | B-Cell Receptor Signaling Pathway | BLNK, VAV1, BTK, LYN | 4 | 9.50E-03 |
| 2 | Defense Response | CXCL11, CXCL9, BLNK, CXCL10, CLEC5A, LSP1, CCR5, TYROBP | 8 | 1.93E-03 |
| 2 | Immune Response | CXCL13, IL18, BLNK, CD96, POU2AF1, AIM2, PSMB10, LCP2, CCR5, ARHGDIB, CD74 | 11 | 1.41E-06 |
| 2 | T-Cell Activation | IL18, CD4, LCK | 3 | 3.49E-02 |
| 2 | Response to Wounding | CXCL11, CXCL9, BLNK, CXCL10, CCR5 | 5 | 4.65E-02 |
| 2 | Phosphorylation | HCK, ITGB2, BTK, LCK, LYN, FGR | 6 | 4.80E-02 |
| 2 | Cellular Defense Response | CXCL9, CLEC5A, LSP1, CCR5, TYROBP | 5 | 7.74E-04 |
| 6 | Cell Cycle | CHEK1, CDC7, CCNE2, MCM4, CDK2, MCM6, CDC25A, MCM2, PCNA, BUB1 | 10 | 2.97E-11 |
| 6 | p53-Signaling Pathway | CHEK1, CCNE2, CDK2 | 3 | 1.95E-02 |
| 6 | MCM Pathway | MCM4, CDK2, MCM6, MCM2 | 4 | 3.42E-05 |
| 6 | Cell Cycle Process | CHEK1, CDC7, TIMELESS, CDK2, KIF15, KNTC1, KIF23, BUB1, RACGAP1, CDKN3 | 10 | 9.74E-09 |
| 6 | Mitotic Cell Cycle | CDC7, CDK2, KIF15, KNTC1, KIF23, BUB1, CDKN3 | 7 | 1.04E-05 |
| 6 | Response to DNA Damage Stimulus | CHEK1, POLE2, FEN1, MSH6 | 4 | 3.44E-02 |
| 6 | Regulation of Cell Cycle | CHEK1, CDC7, CCNE2, TIMELESS, CDK2, KNTC1, CDC25A, BUB1, CDKN3 | 9 | 9.44E-08 |
| 6 | Regulation of Cell Proliferation | CHEK1, CDC7, TIMELESS, CDK2, CDKN3 | 5 | 3.44E-02 |
| 8 | TGF-Beta Signaling Pathway | INHBA, COMP, THBS2, THBS1 | 4 | 6.99E-03 |
| 8 | Focal Adhesion | MYLK, COMP, ITGB1, THBS2, THBS1, COL3A1, COL1A1, FN1, PDGFRB, COL1A2, ITGA5 | 11 | 8.00E-10 |
| 8 | ECM-Receptor Interaction | COMP, ITGB1, THBS2, THBS1, COL3A1, COL1A1, FN1, COL1A2, ITGA5 | 9 | 4.33E-10 |
| 8 | Complement and Coagulation Cascades | SERPINE1, PLAU, PLAUR | 3 | 4.44E-02 |
| 22 | Focal Adhesion | COL5A3, COL1A1, COL6A1, COL5A1, THBS2, FN1, ITGA5, COL3A1 | 8 | 3.94E-07 |
| 22 | ECM-Receptor Interaction | COL5A3, COL1A1, COL6A1, COL5A1, THBS2, FN1, ITGA5, COL3A1 | 8 | 8.02E-10 |
| 22 | Proteolysis | MMP11, MMP1, CTSK, PLAU, MMP2 | 5 | 2.24E-02 |
| 26 | G2 Pathway | PLK1, CCNB1, CHEK1 | 3 | 8.31E-03 |
| 26 | Regulation of Cell Cycle | FBXO5, BIRC5, CCNA2, MAD2L1, TIMELESS, CHEK1, GMNN, CDC7 | 8 | 1.14E-05 |
| 33 | Cell Cycle | BUB1, TTK, ESPL1, PLK1, BUB1B, CCNA2, CDC20, CCNB2 | 8 | 8.53E-06 |
| 33 | Microtubule Based Process | TTK, KIF11, KIF23, PRC1, NUSAP1, KIF4A, KPNA2 | 7 | 2.64E-06 |
| 33 | Regulation of Cell Cycle | BUB1, FBXO5, TTK, BUB1B, UBE2C, NUSAP1, CCNA2, CKS2, BIRC5 | 9 | 2.51E-06 |
Several ovarian cancer modules are shown with enriched
pathways and
cancer genes. We selected these modules based on the importance of terms and the ratios of cancer genes and ovarian cancer genes.
Figure 5Network presentation of module 22 in ovarian cancer.
In this network, diamonds represent miRNAs: sky-blue nodes for ovarian cancer miRNAs from the HMDD database, pink nodes for ovarian cancer miRNAs supported by the literature, and yellow nodes for the remaining miRNAs. Genes are represented by circles: pink nodes for ovarian cancer genes validated by the literature, green nodes for ovarian cancer genes validated by the DDOC database, orange nodes for cancer genes, and white nodes for the remaining genes. A blue solid line indicates that the MCC value between a gene and a miRNA is larger than 0.3. A purple line indicates that the linked genes are enriched together with at least one function. For example, COL6A1, COL5A3, THBS2, FN1, COL1A1, COL5A1, COLA1A, and COL3A1 are enriched with at least one function together (ECM receptor pathway or Focal adhesion pathway). Table S17 presents PubMed identifiers for ovarian cancer genes in pink nodes.
Figure 6Network presentation of module 8 in ovarian cancer.
The description of this network is the same as in Fig. 5 except that red lines are used to represent two enriched pathways (complement and coagulation cascades pathway, and TGF signaling pathway).
Figure 7Module 11 in GBM.
The description of this network is the same as in Fig. 5, except that green nodes indicate GBM genes validated by two articles [34, 35], and pink nodes indicate GBM genes validated by the literature in PubMed. Table S17 presents PubMed identifiers for GBM genes.
Figure 8Expression levels of ovarian cancer subtype marker genes.
(A) Heat map of the means of marker gene expression levels for 32 ovarian cancer modules. Red indicates overexpression of genes, and green indicates underexpression of genes. (B) Expression levels of marker genes of selected modules. Blue bars represent marker genes that determine the subtype and red bars represent other subtype marker genes.
Figure 9Expression levels of GBM subtype marker genes.
(A) Heat map of the means of marker gene expression levels for 54 GBM modules. Red indicates overexpression of genes, and green indicates underexpression of genes. (B) Expression levels of marker genes of selected modules. Blue bars represent marker genes that determine the subtype, and red bars represent other subtype marker genes.
Figure 10Performance comparisons.
Comparison of modules identified using our approach and the NMF approach using ovarian cancer data. (A) The ratio of modules with at least one enriched function or pathway. (B) The average number of enriched functions in the identified modules. (C) The average ratios of cancer genes, ovarian cancer genes, and ovarian cancer miRNAs in the modules.