| Literature DB >> 30732558 |
Qiu Xiao1,2, Jiawei Luo3, Cheng Liang4, Jie Cai1, Guanghui Li1, Buwen Cao1.
Abstract
BACKGROUND: Non-coding RNAs (ncRNAs) are emerging as key regulators and play critical roles in a wide range of tumorigenesis. Recent studies have suggested that long non-coding RNAs (lncRNAs) could interact with microRNAs (miRNAs) and indirectly regulate miRNA targets through competing interactions. Therefore, uncovering the competing endogenous RNA (ceRNA) regulatory mechanism of lncRNAs, miRNAs and mRNAs in post-transcriptional level will aid in deciphering the underlying pathogenesis of human polygenic diseases and may unveil new diagnostic and therapeutic opportunities. However, the functional roles of vast majority of cancer specific ncRNAs and their combinational regulation patterns are still insufficiently understood.Entities:
Keywords: Cancer; Machine learning; Module discovery; Regulatory pattern; ceRNA; lncRNA function; microRNA
Mesh:
Substances:
Year: 2019 PMID: 30732558 PMCID: PMC6367773 DOI: 10.1186/s12859-019-2654-3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Overall workflow of CeModule for detecting lncRNA, miRNA, and mRNA-associated regulatory patterns
Fig. 2Topological features of the identified modules and the ceRNA regulatory network for ovarian cancer. a View of the ceRNA module network in OV. If two nodes are members of a module and their interactions exist in the databases as mentioned in the aforementioned interaction databases, then an edge between the two nodes is displayed. Three colors (black, purple and green) correspond to three types of interactions (lncRNA-miRNA, miRNA-gene and gene-gene). Nodes with no edges are omitted to improve visualization. b Overlap of the top 10 lncRNAs across three dimensions for OV. c The distributions of number of modules identified by CeModule for the top 10 lncRNAs, miRNAs, and mRNAs with the highest degree in OV dataset
The top 10 lncRNAs, miRNAs and mRNAs with the highest degree, closeness centrality, and betweenness centrality in OV
| Rank | Degree | Betweenness | Closeness | ||||||
|---|---|---|---|---|---|---|---|---|---|
| lncRNAs | miRNAs | mRNAs | lncRNAs | miRNAs | mRNAs | lncRNAs | miRNAs | mRNAs | |
| 1 | MALAT1 | let-7b | RPS16 | MALAT1 | mir-10a | TCF7L1 | LINC00240 | mir-155 | NME5 |
| 2 | NEAT1 | mir-10a | RPS11 | NEAT1 | let-7b | SNRPF | RP11-403I13.8 | mir-506 | HIF3A |
| 3 | GAS5 | mir-99b | RPS5 | H19 | mir-30a | PTP4A3 | MALAT1 | mir-206 | TCF7L1 |
| 4 | H19 | mir-10b | RPS18 | GAS5 | mir-146a | PRRX2 | NEAT1 | mir-223 | LRRC6 |
| 5 | SNHG1 | mir-30a | RPS8 | TUG1 | mir-375 | PNISR | FGD5-AS1 | mir-10a | ACTG1 |
| 6 | TUG1 | mir-143 | SRGN | FGD5-AS1 | mir-149 | NR5A1 | H19 | mir-30a | PNISR |
| 7 | FGD5-AS1 | mir-182 | TYROBP | SNHG5 | mir-99b | LRRC6 | TUG1 | let-7b | PRRX2 |
| 8 | SNHG5 | mir-183 | RPL11 | XIST | mir-183 | HIF3A | XIST | mir-197 | PTP4A3 |
| 9 | XIST | mir-200c | ALOX5AP | SNHG1 | mir-143 | CTSD | SNHG1 | mir-146a | CTSD |
| 10 | MEG3 | mir-25 | RPL3 | SNHG3 | mir-320a | ACTG1 | SNHG14 | mir-25 | SNRPF |
Fig. 3a Functional enrichment analysis for the 10 miRNAs with the highest degree using TAM in OV. b Pathway enrichment analysis of the module 15 in OV dataset. c Pathway enrichment analysis of the module 17 in OV dataset. The area proportion of each pathway presents the number of genes enriched in this pathway
Representative enriched GO terms of the selected modules for OV dataset
| Module | GO term | Description | q-value | Cancer lncRNAs | Cancer miRNAs | Cancer mRNAs |
|---|---|---|---|---|---|---|
| 2 | GO:0002376 | immune system process | 1.04E-12 | mir-10a | APOC1, APOE, BTG3, C1QA, C1QB, CBS, CCL2, etc | |
| GO:0009605 | response to external stimulus | 2.31E-07 | ||||
| GO:0006954 | inflammatory response | 2.76E-04 | ||||
| GO:0050865 | regulation of cell activation | 2.25E-03 | ||||
| GO:0007154 | cell communication | 2.25E-03 | ||||
| 7 | GO:0032502 | developmental process | 1.32E-06 | DLEU2, | mir-196b, mir-199b | CHST2,CLDN11,COX6B1, MGP, DACT3, DCHS1, DLK1, etc |
| GO:0030154 | cell differentiation | 1.62E-05 | ||||
| GO:0060284 | regulation of cell development | 1.06E-04 | ||||
| GO:0010942 | positive regulation of cell death | 2.89E-04 | ||||
| GO:0007275 | multicellular organismal development | 7.77E-07 | ||||
| 15 | GO:0007155 | cell adhesion | 2.57E-06 | mir-202, mir-506, mir-508, mir-513c | FSTL1, LHX1, MEST, MFAP2, CDH3, | |
| GO:0022610 | biological adhesion | 2.64E-06 | ||||
| GO:0009968 | negative regulation of signal transduction | 1.38E-03 | ||||
| GO:0042698 | ovulation cycle | 3.10E-04 | ||||
| GO:0050896 | response to stimulus | 2.54E-05 | ||||
| 17 | GO:0022411 | cellular component disassembly | 1.43E-20 | GPC3, | ||
| GO:0009968 | negative regulation of signal transduction | 7.65E-04 | ||||
| GO:0060284 | regulation of cell development | 8.80E-04 | ||||
| GO:0045595 | regulation of cell differentiation | 5.91E-04 | ||||
| GO:0006413 | translational initiation | 8.31E-21 |
Note: The bold letters represent the lncRNAs/miRNAs/mRNAs related to ovarian cancer; q-value represents the corrected p-value using the Benjamini-Hochberg method
Overlapping miRNAs for the identified modules and clusters/families in OV
| Module | Overlap miRsa | p-value | Overlap miRsb | p-value |
|---|---|---|---|---|
| 1 | mir-362,mir-532, mir-500, mir-501 | 1.22e-06 | mir-200b,mir-200c | 7.33e-04 |
| – | – | mir-500,mir-501 | 2.40e-03 | |
| 18 | mir-99b,mir-152a | 9.15e-04 | mir-100,mir-99b | 9.15e-04 |
| 20 | let-7c, mir-99a | 1.03e-04 | mir-200a,mir-200b | 1.01e-03 |
| mir-200a, mir-200b | 3.07e-04 | – | – | |
| 30 | – | – | let-7b,let-7c | 4.96e-03 |
| 70 | mir-516a,mir-519a, mir-522,mir-518e | 1.45e-03 | mir-516a,mir-519a, mir-522,mir-518e | 7.66e-04 |
Note: a/b represent the miRNAs that overlap between modules and miRNA clusters as well as families
Fig. 4a Comparison of the correlation evaluation scores between all the identified modules by CeModule and the randomly generated modules for ovarian cancer dataset. b Distribution of the correlation evaluation scores of the 1000 random modules with the same size for modules 1 and 2 in ovarian cancer dataset
Statistics of the correlation coefficients in OV and UCEC datasets
| Dataset | Ave (lnc-miR) | Ave (miR-mR) | Ave (lnc-mR) | Ave-mod |
|---|---|---|---|---|
| OV | 0.0546 | 0.0659 | 0.0678 | 0.119 |
| UCEC | 0.0639 | 0.0772 | 0.0854 | 0.173 |
Note: Ave (lnc-miR), Ave (miR-mR) and Ave (lnc-mR) are the average absolute Pearson correlation coefficients of all lncRNA-miRNA, miRNA-mRNA and lncRNA-mRNA pairs, respectively; Ave-mod is the correlation evaluation score across all modules
Fig. 5a Percentage of modules with at least two known cancer-related (ovarian cancer-related)lncRNAs/miRNAs/mRNAs in ovarian cancer dataset. b Overlap of cancer lncRNAs, and ovarian cancer lncRNAs between the benchmark set and lncRNAs in the identified modules for ovarian cancer dataset
Known ovarian cancer-associated and cancer- associated lncRNAs for these representative modules in OV
| Module | Cancer lncRNAs | Numa | q-value | OV lncRNAs | Numb | q-value |
|---|---|---|---|---|---|---|
| 2 | MALAT1,MIR155HG,NEAT1,PVT1 | 4/74 | 1.05e-02 | MALAT1, NEAT1, PVT1 | 3/74 | 4.65e-04 |
| 7 | DLEU2,DNM3OS,GAS5,HOTAIRM1,MALAT1, | 9/86 | 2.30e-06 | DNM3OS, GAS5, MALAT1 | 3/86 | 6.55e-04 |
| 12 | MALAT1,RMRP,RP11-385 J1.2,XIST | 4/30 | 6.00e-04 | MALAT1, XIST | 2/30 | 2.34e-03 |
| 31 | GAS5,MALAT1,NEAT1,RP11-304 L19.5,SNHG3,SNHG5,TP53TG1,UCA1 | 8/75 | 6.59e-06 | GAS5,MALAT1,NEAT1, UCA1 | 4/75 | 3.95e-05 |
| 41 | DLEU2,GAS5,LINC00467,MALAT1,NEAT1, SNHG1,SNHG3 | 7/57 | 9.50e-06 | GAS5, MALAT1, NEAT1 | 3/57 | 3.91e-04 |
| 62 | DNM3OS,H19,HOTAIRM1,LINC00152,MALAT1,MEG3,NEAT1,PVT1,RMRP,RP11-304 L19.5,RP11-401P9.4,SNHG5,UCA1,XIST | 14/79 | 2.35e-12 | DNM3OS, MALAT1, PVT1, NEAT1, UCA1, XIST, H19 | 7/79 | 1.59e-10 |
| 66 | H19,MALAT1,MEG3,NEAT1,PVT1,SNHG1, SNHG3,UCA1,XIST | 9/58 | 2.02e-07 | H19, MALAT1, NEAT1, PVT1, UCA1, XIST | 6/58 | 1.78e-09 |
Note: Numa and Numb are the ratios of lncRNAs that associated with cancer and OV in these modules. q-value is the FDR-corrected p-value after multiple testing correction