| Literature DB >> 28105958 |
Abstract
BACKGROUND: MicroRNAs (miRNAs) play important regulatory roles in the wide range of biological processes by inducing target mRNA degradation or translational repression. Based on the correlation between expression profiles of a miRNA and its target mRNA, various computational methods have previously been proposed to identify miRNA-mRNA association networks by incorporating the matched miRNA and mRNA expression profiles. However, there remain three major issues to be resolved in the conventional computation approaches for inferring miRNA-mRNA association networks from expression profiles. 1) Inferred correlations from the observed expression profiles using conventional correlation-based methods include numerous erroneous links or over-estimated edge weight due to the transitive information flow among direct associations. 2) Due to the high-dimension-low-sample-size problem on the microarray dataset, it is difficult to obtain an accurate and reliable estimate of the empirical correlations between all pairs of expression profiles. 3) Because the previously proposed computational methods usually suffer from varying performance across different datasets, a more reliable model that guarantees optimal or suboptimal performance across different datasets is highly needed.Entities:
Mesh:
Substances:
Year: 2016 PMID: 28105958 PMCID: PMC5249041 DOI: 10.1186/s12918-016-0373-1
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
24].
Fig. 1Workflow for inferring direct miRNA-mRNA association relationships
Number of experimentally confirmed miRNA-mRNA associations by the ground-truth data
| Single Method | Ensemble Method | |||||||
|---|---|---|---|---|---|---|---|---|
| Corpcor | Space | MIND | C&S | C&M | S&M | C&S&M | ||
| EMT | Whole | 35 | 45 | 24 | 45 | 34 | 35 | 41 |
| Bootstrap | 32 | 38 | 25 | 40 | 24 | 37 | 40 | |
| MCC | Whole | 200 | 183 | 210 | 204 | 206 | 201 | 209 |
| Bootstrap | 211 | 204 | 207 | 201 | 217 | 220 | 216 | |
| BR | Whole | 98 | 83 | 95 | 90 | 94 | 97 | 102 |
| Bootstrap | 107 | 95 | 99 | 99 | 102 | 100 | 105 | |
The Top 100 correlations for each miRNA were selected from each experiment for performance comparison. To evaluate the effect of three direct correlation inference methods, bootstrapping and Ensemble approach, we performed a comparative study using EMT, MCC and BR datasets. Corpcor (denoted as C) is the partial correlation estimation method, SPACE (denoted as S) is the sparse partial correlation estimation method, and MIND (denoted as M) is the mutual information-based network deconvolution method. ‘Whole’ means that the whole expression profiles were used to infer a direct correlation matrix, and ‘Bootstrap’ means that 100 direct correlation matrices were computed using 100 bootstrapped samples and then aggregated based on an inverse-rank-product method
Fig. 2Average number of experimentally confirmed miRNA-mRNA correlations on three datasets. This bar-chart represents the average number of the experimentally confirmed miRNA-mRNA correlations of each method on EMT, MCC and BR datasets. It also shows the statistical significances of differences on performance between two methods in terms of the p-value computed using the paired t-test. ‘Single’ means the average performance of three models from the three direct correlation inference methods without bootstrapping and Ensemble aggregation steps. ‘Bootstrap’ means the average performance of the bootstrapping aggregation results for each three direct correlation inference method. ‘Ensemble’ means the average performance of inferred models using Ensemble aggregation of single experiments. Additionally, Bootstrap&Ensemble means the average performance of proposed DMirNet that uses both bootstrapping and Ensemble aggregation. E-P&I&L means a comparable control that is an ensemble model aggregating Pearson, IDA and Lasso [1]
Performance comparison of DMirNet with the state-of-the-art Ensemble model
| Dataset | Direct correlation inference methods | the state-of-the-art method | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Corpcor | Space | MIND | E-C&S&M | B&E-C&S&M | Pearson | IDA | Lasso | E-P&I&L | |
| EMT | 35 | 45 | 24 | 41 | 40 | 30 | 29 | 29 | 31 |
| MCC | 200 | 183 | 210 | 209 | 216 | 205 | 198 | 187 | 203 |
| BR | 98 | 83 | 95 | 102 | 105 | 114 | 124 | 120 | 101 |
To compare the performance of our method with a related work, we investigate the number of experimentally confirmed miRNA-mRNA associations of the state-of-the-art Ensemble model. It combines Pearson’s correlation (denoted as P), IDA (denoted as I), and Lasso (denoted as L) using the Borda count election and was reported as the best-performed Ensemble model on the three datasets [1]. ‘E’ denotes the Ensemble approach, and ‘B&E’ denotes the DMirNet with both bootstrapping and Ensemble aggregation
Fig. 3The key network structure of top 500 miRNA-mRNA association network using the MCC dataset. In this key network structure, a node represents a network module, a label of a module represents community centrality, and an edge stands for the connectivity among modules. The network modules identified using ModuLand
Fig. 4KEGG biological pathways related to Top 1000 pairs of the MCC dataset
Fig. 5Cancer-related miRNA-mRNA association networks among Top 1000 pairs of the MCC dataset. The red rectangle nodes are mRNAs and the blue circle nodes are mRNAs
Conflict between experimental results on hsa-miR-19a-3p and RAB14
| Publication | Method | Tissue | Cell line | Tested cell line | Result | Regulation |
|---|---|---|---|---|---|---|
| Hafner M. et al. 2010 [ | PAR-CLIP | Kidney | HEK293 | N/A | Positive | Down |
| Kanzaki H et al. 2011 [ | Luciferase Reporter Assay | Lung | SBC3 | HEK293 | Negative | ? |
A manually curated miRNA-target database includes conflict experimental results for some miRNA-mRNA pairs. As an example, this table shows a conflict experimental result on hsa-miR-19a-3p and RAB14(hsa) from TarBase 6.0 [37]