| Literature DB >> 30499204 |
Sheng-Peng Yu1, Cheng Liang1, Qiu Xiao2, Guang-Hui Li3, Ping-Jian Ding4, Jia-Wei Luo4.
Abstract
MiRNAs are a class of small non-coding RNAs that are involved in the development and progression of various complex diseases. Great efforts have been made to discover potential associations between miRNAs and diseases recently. As experimental methods are in general expensive and time-consuming, a large number of computational models have been developed to effectively predict reliable disease-related miRNAs. However, the inherent noise and incompleteness in the existing biological datasets have inevitably limited the prediction accuracy of current computational models. To solve this issue, in this paper, we propose a novel method for miRNA-disease association prediction based on matrix completion and label propagation. Specifically, our method first reconstructs a new miRNA/disease similarity matrix by matrix completion algorithm based on known experimentally verified miRNA-disease associations and then utilizes the label propagation algorithm to reliably predict disease-related miRNAs. As a result, MCLPMDA achieved comparable performance under different evaluation metrics and was capable of discovering greater number of true miRNA-disease associations. Moreover, case study conducted on Breast Neoplasms further confirmed the prediction reliability of the proposed method. Taken together, the experimental results clearly demonstrated that MCLPMDA can serve as an effective and reliable tool for miRNA-disease association prediction.Entities:
Keywords: label propagation; matrix completion; miRNA-disease association prediction
Mesh:
Substances:
Year: 2018 PMID: 30499204 PMCID: PMC6349206 DOI: 10.1111/jcmm.14048
Source DB: PubMed Journal: J Cell Mol Med ISSN: 1582-1838 Impact factor: 5.310
Figure 1Flowchart of potential disease‐miRNA association prediction based on the computational model of MCLPMDA. Our algorithm mainly consists of three steps: (1) we construct a new miRNA similarity matrix as well as a disease similarity matrix based on matrix completion algorithm; (2) the two reconstructed similarity matrices are combined with Gaussian interaction profile kernel similarity for miRNAs and diseases respectively; (3) label propagation algorithm is conducted in both miRNA space and disease space to obtain the final prediction results
Figure 2The comparison results between MCLPMDA and the other five methods in the framework of global LOOCV
Figure 3The comparison results between MCLPMDA and the other five methods in the framework of 5‐fold cross validation
Figure 4The comparison results between MCLPMDA and the other three computational models in terms of LODOCV
Statistical significance of differences in performance between the proposed method and the other five methods in LODOCV. P‐values were calculated by Wilcoxon signed rank test
| SNMDA | HGIMDA | EGBMMDA | MKRMDA | Without MC | |
|---|---|---|---|---|---|
|
| 3.67e‐02 | 4.24e‐04 | 4.54e‐98 | 2.12e‐41 | 1.95e‐03 |
Figure 5The number of true miRNA‐disease associations identified by each method
Top 50 predicted miRNAs associated with Breast Neoplasms based on known associations in HMDD. The first column records the top 1‐50 predicted miRNAs; the second column records the corresponding evidence in two databases; the third column records log2 fold change and the fourth column records the adjusted P‐values of the significance of differential expression for each miRNA
| miRNA | Evidence | logFC | FDR |
|---|---|---|---|
| hsa‐mir‐125a | dbDEMC;miR2Disease | −0.50551 | 6.02e‐12 |
| hsa‐mir‐196a | dbDEMC;miR2Disease | 3.210542 | 1.04e‐37 |
| hsa‐mir‐499a | dbDEMC | −1.8022 | 4.51e‐26 |
| hsa‐mir‐198 | dbDEMC;miR2Disease | −0.60211 | 3.93e‐02 |
| hsa‐let‐7a | dbDEMC;miR2Disease | −0.17426 | 9.69e‐02 |
| hsa‐mir‐141 | dbDEMC | 2.216414 | 4.65e‐74 |
| hsa‐mir‐143 | dbDEMC;miR2Disease | −1.17215 | 1.97e‐28 |
| hsa‐mir‐145 | dbDEMC;miR2Disease | −2.37613 | 3.00e‐224 |
| hsa‐mir‐150 | dbDEMC | −0.05109 | 1.00e+00 |
| hsa‐mir‐16 | dbDEMC | 0.394382 | 4.28e‐05 |
| hsa‐mir‐21 | dbDEMC | 2.143077 | 1.52e‐110 |
| hsa‐mir‐1 | dbDEMC | −5.68257 | 8.66e‐254 |
| hsa‐mir‐133a | dbDEMC;miR2Disease | −6.50194 | 0.00e+00 |
| hsa‐mir‐133b | dbDEMC;miR2Disease | −6.68341 | 2.74e‐190 |
| hsa‐mir‐146a | dbDEMC | 0.501373 | 1.37e‐04 |
| hsa‐mir‐208b | dbDEMC;miR2Disease | −4.35801 | 2.98e‐62 |
| hsa‐mir‐103a | dbDEMC | 0.809716 | 1.35e‐15 |
| hsa‐mir‐106a | dbDEMC;miR2Disease | 0.999651 | 3.51e‐12 |
| hsa‐mir‐10b | dbDEMC;miR2Disease | −1.88876 | 1.34e‐94 |
| hsa‐mir‐126 | dbDEMC;miR2Disease | −0.98217 | 9.66e‐36 |
| hsa‐mir‐135a | dbDEMC;miR2Disease | 1.217938 | 1.75e‐03 |
| hsa‐mir‐151a | dbDEMC;miR2Disease | 0.417736 | 2.23e‐07 |
| hsa‐mir‐152 | dbDEMC | −0.15395 | 1.38e‐01 |
| hsa‐mir‐181b | dbDEMC;miR2Disease | 1.397101 | 8.49e‐30 |
| hsa‐mir‐182 | dbDEMC;miR2Disease | 2.364107 | 2.39e‐63 |
| hsa‐mir‐183 | dbDEMC | 2.946886 | 1.06e‐95 |
| hsa‐mir‐191 | dbDEMC;miR2Disease | 1.217488 | 2.41e‐29 |
| hsa‐mir‐192 | dbDEMC | 1.468736 | 2.60e‐37 |
| hsa‐mir‐193b | dbDEMC | −0.02624 | 1.00e+00 |
| hsa‐mir‐194 | dbDEMC;miR2Disease | 0.496013 | 2.49e‐07 |
| hsa‐mir‐200a | dbDEMC;miR2Disease | 2.10741 | 1.56e‐64 |
| hsa‐mir‐200b | dbDEMC;miR2Disease | 1.698791 | 6.59e‐41 |
| hsa‐mir‐200c | dbDEMC | 1.53758 | 2.83e‐44 |
| hsa‐mir‐203 | dbDEMC | 2.262136 | 6.25e‐23 |
| hsa‐mir‐204 | dbDEMC;miR2Disease | −2.62831 | 2.42e‐62 |
| hsa‐mir‐205 | miR2Disease | −1.46212 | 2.66e‐19 |
| hsa‐mir‐20a | dbDEMC | 0.784424 | 1.26e‐09 |
| hsa‐mir‐210 | dbDEMC | 3.06042 | 6.75e‐48 |
| hsa‐mir‐215 | dbDEMC | −1.27642 | 5.39e‐28 |
| hsa‐mir‐221 | dbDEMC | −0.07311 | 7.72e‐01 |
| hsa‐mir‐223 | dbDEMC | −0.8271 | 7.04e‐13 |
| hsa‐mir‐25 | dbDEMC;miR2Disease | −0.0555 | 7.34e‐01 |
| hsa‐mir‐26b | dbDEMC | −0.22093 | 7.48e‐03 |
| hsa‐mir‐31 | dbDEMC;miR2Disease | 0.25524 | 3.00e‐01 |
| hsa‐mir‐34b | dbDEMC | 0.253323 | 2.36e‐01 |
| hsa‐mir‐429 | dbDEMC | 2.689754 | 5.04e‐72 |
| hsa‐mir‐449a | Unconfirmed | 5.627081 | 2.54e‐25 |
| hsa‐mir‐449b | dbDEMC | 4.278504 | 1.73e‐17 |
| hsa‐mir‐92a | dbDEMC | −0.27138 | 5.93e‐03 |
| hsa‐mir‐93 | dbDEMC | 1.137218 | 1.98e‐29 |
Figure 6The prediction accuracy of 5‐fold cross validation classification of tumour samples based on the top 6 prioritized miRNAs
Figure 7(A) The expression levels of hsa‐mir‐125a at different pathologic stages; (B) Kaplan–Meier survival analysis for hsa‐mir‐125a. As observed, patients with higher expression level are at lower risk level
|
|
|---|
|
|
|
|
|
|
|
Fix the others and update Fix the others and update Fix the others and update Update the multiplier Update parameter μ by: Check the convergence condition by: |
|
|
|
|
|
|
|---|
|
|
|
|
| 1. Input |
| 2. Input |
| 3. Integrate similarity information to get |
| 4. Predict from miRNA space and disease space: |
|
|
|
|
| 5. Integrate the results |
| 6. |