| Literature DB >> 30011266 |
Duc Do1, Serdar Bozdag1.
Abstract
MicroRNAs (miRNAs) inhibit expression of target genes by binding to their RNA transcripts. It has been recently shown that RNA transcripts targeted by the same miRNA could "compete" for the miRNA molecules and thereby indirectly regulate each other. Experimental evidence has suggested that the aberration of such miRNA-mediated interaction between RNAs-called competing endogenous RNA (ceRNA) interaction-can play important roles in tumorigenesis. Given the difficulty of deciphering context-specific miRNA binding, and the existence of various gene regulatory factors such as DNA methylation and copy number alteration, inferring context-specific ceRNA interactions accurately is a computationally challenging task. Here we propose a computational method called Cancerin to identify cancer-associated ceRNA interactions. Cancerin incorporates DNA methylation, copy number alteration, gene and miRNA expression datasets to construct cancer-specific ceRNA networks. We applied Cancerin to three cancer datasets from the Cancer Genome Atlas (TCGA) project. Our results indicated that ceRNAs were enriched with cancer-related genes, and ceRNA modules in the inferred ceRNA networks were involved in cancer-associated biological processes. Using LINCS-L1000 shRNA-mediated gene knockdown experiment in breast cancer cell line to assess accuracy, Cancerin was able to predict expression outcome of ceRNA genes with high accuracy.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30011266 PMCID: PMC6072113 DOI: 10.1371/journal.pcbi.1006318
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Cancerin pipeline to infer cancer-associated ceRNA interaction networks.
Cancerin consists of three main steps. In step 1, for each DE RNA, Cancerin selects its candidate DE miRNA regulators based on sequence binding results. In step 2, Cancerin applies a LASSO-based variable selection procedure to select a subset of miRNA regulators that contribute to the expression variation of the DE RNA. In step 3, Cancerin applies multiple filtering conditions to infer ceRNA interactions between the RNAs that are regulated by common miRNAs.
Number of putative DE miRNA-DE RNA interactions and number of DE miRNAs and DE RNAs included in those interactions (output for Cancerin—Step 1).
| BRCA | KIRC | HNSC | |
|---|---|---|---|
| No. of putative DE miRNA—DE mRNA interactions | 153,465 | 107,348 | 94,980 |
| No. of DE miRNAs | 215 | 164 | 201 |
| No. of DE mRNAs | 7,502 | 6,690 | 5,005 |
| No. of putative DE miRNA—DE lncRNA interactions | 60,935 | 18,589 | 17,350 |
| No. of DE miRNAs | 215 | 164 | 201 |
| No. of DE lncRNAs | 3,111 | 1,335 | 896 |
1: included in putative DE miRNA—DE mRNA interactions.
2: included in putative DE miRNA—DE lncRNA interactions.
Number of selected miRNA-RNA interactions obtained after applying the variable selection procedure (output of Cancerin—Step 2).
| BRCA | KIRC | HNSC | |
|---|---|---|---|
| No. of miRNA-mRNA interactions | 6,616 | 8,408 | 9,893 |
| No. of miRNAs | 196 | 154 | 190 |
| No. of mRNAs | 2,814 | 2,971 | 3,020 |
| No. of miRNA-lncRNA interactions | 502 | 217 | 467 |
| No. of miRNAs | 134 | 93 | 141 |
| No. of lncRNAs | 210 | 91 | 175 |
1: included in the selected miRNA—mRNA interactions.
2: included in the selected miRNA—lncRNA interactions.
Percentage of RNA targets regulated by miRNAs and also by at least one additional type of regulators.
| BRCA | KIRC | HNSC | |
|---|---|---|---|
| Percentage of RNA targets under CNA regulation | 76.2% | 69.2% | 77.2% |
| Percentage of RNA targets under DNA Methylation regulation | 30.4% | 26.3% | 35.0% |
| Percentage of RNA targets under TF regulation | 54.1% | 59.3% | 48.0% |
Number of miRNA-RNA interactions and their constituent miRNAs and RNAs selected in “Cancerin (original)” and “Cancerin (only_miRNA)”.
The first, second, and third value in each cell refers to the results from “Cancerin (original)”, “Cancerin (only_miRNA)”, and the common results between the two cases, respectively.
| BRCA | KIRC | HNSC | |
|---|---|---|---|
| No. of miRNA-RNA interactions | 7,118/4,071/3,242 | 8,625/6,524/5,085 | 10,360/8,648/6,619 |
| No. of miRNAs | 204/201/198 | 155/153/153 | 195/196/195 |
| No. of RNAs | 3,024/1,763/1,523 | 3,062/2,219/2,068 | 3,195/2,520/2,404 |
Number of inferred ceRNA interactions and number of ceRNAs in those interactions (output of Cancerin—Step 3).
| BRCA | KIRC | HNSC | |
|---|---|---|---|
| No. of all ceRNA interactions | 4,115 | 4,639 | 2,725 |
| No. of mRNA-mRNA ceRNA interactions | 3,674 | 4,614 | 2,589 |
| No. of mRNA-lncRNA ceRNA interactions | 394 | 25 | 121 |
| No. of lncRNA-lncRNA ceRNA interactions | 47 | 0 | 15 |
| No. of all ceRNAs | 1,593 | 1,081 | 1,110 |
| No. of mRNAs as ceRNAs | 1,491 | 1,071 | 1,063 |
| No. of lncRNAs ceRNAs | 102 | 10 | 47 |
1: subset of all ceRNA interactions (Row 1)
2: subset of all ceRNAs (Row 5)
Fig 2Degree distribution and power-law statistics.
(A) Degree distribution of ceRNAs for each cancer type. Linear regression statistics between log(ceRNA’s degree) and log(ceRNA’s degree probability) in (B) BRCA, (C) KIRC, and (D) HNSC cancer types.
Fig 3Hazard ratio distribution of prognostic ceRNAs and non-ceRNAs in each cancer type.
A prognostic RNA was defined as a DE RNA whose p-value from the univariate Cox regression was smaller than 0.05. For each cancer type, the prognostic RNAs were categorized into ceRNAs and non-ceRNAs. The p-values shown in the plot were from the Wilcoxon rank-sum test between hazard ratios of prognostic ceRNAs and non-ceRNAs.
Cancer hallmark terms that were enriched in the ceRNA modules.
| Cancer type | Cancer hallmark geneset | Description | Enriched Module |
|---|---|---|---|
| BRCA | Epithelial Mesenchymal Transition | Genes defining epithelial-mesenchymal transition, as in wound healing, fibrosis and metastasis | 2, 4, 14 |
| E2F Targets | Genes encoding cell cycle related targets of E2F transcription factors | 3, 7, 13 | |
| Estrogen Response Early | Genes defining late response to estrogen | 1, 11 | |
| G2M Checkpoint | Genes involved in the G2/M checkpoint, as in progression through the cell division cycle | 3, 7 | |
| TGF Beta Signaling | TGF-beta signaling pathway | 6 | |
| Spermatogenesis | Genes up-regulated during production of male gametes (sperm), as in spermatogenesis | 7 | |
| IL-6/JAK/STAT3 Signaling | Genes up-regulated by IL6 via STAT3, e.g., during acute phase response | 12 | |
| Interferon Gammaresponse | Genes up-regulated in response to IFNG | 12 | |
| UV Response Up | Genes up-regulated in response to ultraviolet (UV) radiation | 17 | |
| KIRC | Epithelial Mesenchymal Transition | Genes defining epithelial-mesenchymal transition, as in wound healing, fibrosis and metastasis | 4 |
| UV Response DN | Genes down-regulated in response to ultraviolet (UV) radiation | 4 | |
| Oxidative Phosphorylation | Genes encoding proteins involved in oxidative phosphorylation | 11 | |
| MYC Targets V1 | A subgroup of genes regulated by MYC—version 1 (v1) | 11 | |
| Adipogenesis | Genes up-regulated during adipocyte differentiation (adipogenesis) | 11 | |
| HNSC | Epithelial Mesenchymal Transition | Genes defining epithelial-mesenchymal transition, as in wound healing, fibrosis and metastasis | 4, 5 |
| TGF Beta Signaling | TGF-beta signaling pathway (UV) radiation | 4 | |
| MYC Targets V1 | A subgroup of genes regulated by MYC—version 1 (v1) | 6 | |
| G2M Checkpoint | Genes involved in the G2/M checkpoint, as in progression through the cell division cycle | 7 | |
| E2F Targets | Genes encoding cell cycle related targets of E2F transcription factors | 7 |
Number of selected ceRNA interactions by applying different methods.
| BRCA | KIRC | HNSC | |
|---|---|---|---|
| Cancerin (original) | 4,115 | 4,639 | 2,725 |
| Cancerin (OLS regression) | 6,039 | 19,202 | 6,262 |
| Cancerin (sensitivity correlation filtering step deactivated) | 7,018 | 18,976 | 8,179 |
| Correlation-based method | 25,853 | 46,518 | 16,908 |
Accuracy of the ceRNA networks inferred by different methods based on LINCS-L1000 (MCF7) dataset.
| Accuracy (96h) | Accuracy (144h) | Overall Accuracy (96h + 144h) | |
|---|---|---|---|
| Cancerin (original) | 71.4% | 69.6% | |
| Hermes | 60.0% | 70.2% | |
| Cancerin (only_miRNA) | 67.1% | 69.6% | |
| Cancerin (OLS regression) | 66.1% | 58.1% | 62.9% |
| Cancerin (sensitivity correlation filtering step deactivated) | 66.3% | 66.1% | 66.2% |
| Correlation-based method | 62.8% | 68.2% | 65.0% |