| Literature DB >> 30142158 |
Xing Chen1, Jun Yin1, Jia Qu1, Li Huang2.
Abstract
Recently, a growing number of biological research and scientific experiments have demonstrated that microRNA (miRNA) affects the development of human complex diseases. Discovering miRNA-disease associations plays an increasingly vital role in devising diagnostic and therapeutic tools for diseases. However, since uncovering associations via experimental methods is expensive and time-consuming, novel and effective computational methods for association prediction are in demand. In this study, we developed a computational model of Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction (MDHGI) to discover new miRNA-disease associations by integrating the predicted association probability obtained from matrix decomposition through sparse learning method, the miRNA functional similarity, the disease semantic similarity, and the Gaussian interaction profile kernel similarity for diseases and miRNAs into a heterogeneous network. Compared with previous computational models based on heterogeneous networks, our model took full advantage of matrix decomposition before the construction of heterogeneous network, thereby improving the prediction accuracy. MDHGI obtained AUCs of 0.8945 and 0.8240 in the global and the local leave-one-out cross validation, respectively. Moreover, the AUC of 0.8794+/-0.0021 in 5-fold cross validation confirmed its stability of predictive performance. In addition, to further evaluate the model's accuracy, we applied MDHGI to four important human cancers in three different kinds of case studies. In the first type, 98% (Esophageal Neoplasms) and 98% (Lymphoma) of top 50 predicted miRNAs have been confirmed by at least one of the two databases (dbDEMC and miR2Disease) or at least one experimental literature in PubMed. In the second type of case study, what made a difference was that we removed all known associations between the miRNAs and Lung Neoplasms before implementing MDHGI on Lung Neoplasms. As a result, 100% (Lung Neoplasms) of top 50 related miRNAs have been indexed by at least one of the three databases (dbDEMC, miR2Disease and HMDD V2.0) or at least one experimental literature in PubMed. Furthermore, we also tested our prediction method on the HMDD V1.0 database to prove the applicability of MDHGI to different datasets. The results showed that 50 out of top 50 miRNAs related with the breast neoplasms were validated by at least one of the three databases (HMDD V2.0, dbDEMC, and miR2Disease) or at least one experimental literature.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30142158 PMCID: PMC6126877 DOI: 10.1371/journal.pcbi.1006418
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1The disease DAG of lung neoplasms.
The addresses of its ancestors are shown in a DAG structure.
Fig 2Flowchart of MDHGI model to predict the potential miRNA-disease associations based on the known associations in HMDD V2.0 database.
Fig 3The illustration of the inexact ALM algorithm.
Fig 4The potential association probability between the miRNA m and the disease d which can be calculated by summarizing all paths with the length equal to three (For example, m–m1–d1–d).
Fig 5The left graph shows the AUC of global LOOCV compared with HGIMDA, RLSMDA, HDMP, WBSMDA, and MCMDA. The right graph shows the AUC of local LOOCV compared with HGIMDA, RLSMDA, HDMP, WBSMDA, MCMDA, RWRMDA, MIDP, and MiRAI. As a result, MDHGI achieved AUCs of 0.8945 and 0.8240 in the global and local LOOCV, which exceed all the previous classical models.
Supplementary experiments with different weight parameters to miRNA-miRNA edges and disease-disease edges (bold fonts are original weights and results).
| The weight for miRNA-miRNA edge | The weight for disease-disease edge | The weight for miRNA-disease edge | The AUC for Global LOOCV | The AUC for Local LOOCV | The AUC for 5-fold cross validation |
|---|---|---|---|---|---|
| 0.9 | 0.9 | 1 | 0.8925 | 0.8226 | 0.8774+/-0.0019 |
| 0.8 | 0.8 | 1 | 0.8903 | 0.8214 | 0.8751+/-0.0021 |
| 0.7 | 0.7 | 1 | 0.8881 | 0.8201 | 0.8724+/-0.0021 |
| 0.6 | 0.6 | 1 | 0.8859 | 0.8190 | 0.8708+/-0.0021 |
| 0.5 | 0.5 | 1 | 0.8839 | 0.8180 | 0.8681+/-0.0019 |
| 0.4 | 0.4 | 1 | 0.8822 | 0.8172 | 0.8666+/-0.0017 |
| 0.3 | 0.3 | 1 | 0.8807 | 0.8165 | 0.8650+/-0.0022 |
Prediction of the top 50 predicted miRNAs associated with Esophageal Neoplasms based on known associations in HMDD V2.0 database.
The prediction result was examined in dbDEMC and miR2Disease. The first column records top 1–25 related miRNAs. The third column records the top 26–50 related miRNAs.
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-mir-200b | dbDEMC | hsa-mir-10a | dbDEMC |
| hsa-mir-133b | dbDEMC | hsa-mir-182 | dbDEMC |
| hsa-mir-16 | dbDEMC | hsa-mir-127 | dbDEMC |
| hsa-mir-19b | dbDEMC | hsa-mir-320a | unconfirmed |
| hsa-mir-429 | dbDEMC | hsa-mir-193b | dbDEMC |
| hsa-mir-17 | dbDEMC | hsa-mir-27b | dbDEMC |
| hsa-mir-125b | dbDEMC | hsa-mir-181b | dbDEMC |
| hsa-mir-142 | dbDEMC | hsa-mir-29a | dbDEMC |
| hsa-mir-1 | dbDEMC | hsa-mir-7 | dbDEMC |
| hsa-mir-199b | dbDEMC | hsa-mir-191 | dbDEMC |
| hsa-let-7d | dbDEMC | hsa-let-7f | unconfirmed |
| hsa-mir-218 | unconfirmed | hsa-mir-124 | dbDEMC |
| hsa-mir-195 | dbDEMC | hsa-mir-378a | unconfirmed |
| hsa-mir-708 | unconfirmed | hsa-mir-125a | dbDEMC |
| hsa-mir-10b | dbDEMC | hsa-mir-222 | dbDEMC |
| hsa-mir-30c | dbDEMC | hsa-mir-15b | dbDEMC |
| hsa-mir-194 | dbDEMC;miR2Disease | hsa-mir-197 | dbDEMC |
| hsa-mir-18a | dbDEMC | hsa-mir-30a | dbDEMC |
| hsa-mir-146b | dbDEMC | hsa-mir-23b | dbDEMC |
| hsa-let-7e | dbDEMC | hsa-mir-221 | dbDEMC |
| hsa-mir-151a | unconfirmed | hsa-mir-625 | dbDEMC |
| hsa-mir-29b | dbDEMC | hsa-mir-122 | Unconfirmed |
| hsa-mir-181a | dbDEMC | hsa-mir-95 | dbDEMC |
| hsa-let-7i | dbDEMC | hsa-mir-424 | dbDEMC |
| hsa-mir-224 | dbDEMC | hsa-mir-30d | dbDEMC |
Prediction of the top 50 predicted miRNAs associated with lymphoma based on known associations in HMDD V2.0 database.
The prediction result was examined in dbDEMC and miR2Disease. The first column records top 1–25 related miRNAs. The third column records the top 26–50 related miRNAs.
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-mir-223 | dbDEMC | hsa-mir-182 | dbDEMC |
| hsa-mir-205 | dbDEMC | hsa-mir-129 | dbDEMC |
| hsa-mir-10b | dbDEMC | hsa-mir-10a | dbDEMC;miR2Disease |
| hsa-mir-9 | dbDEMC | hsa-mir-106b | dbDEMC |
| hsa-mir-145 | dbDEMC;miR2Disease | hsa-mir-30a | dbDEMC |
| hsa-mir-141 | dbDEMC | hsa-mir-1 | dbDEMC |
| hsa-mir-143 | dbDEMC;miR2Disease | hsa-mir-192 | dbDEMC |
| hsa-mir-451a | unconfirmed | hsa-mir-22 | dbDEMC |
| hsa-mir-196a | dbDEMC | hsa-mir-199b | dbDEMC |
| hsa-mir-27a | dbDEMC | hsa-mir-183 | dbDEMC |
| hsa-mir-106a | dbDEMC;miR2Disease | hsa-mir-497 | dbDEMC |
| hsa-mir-142 | unconfirmed | hsa-mir-99a | dbDEMC;miR2Disease |
| hsa-mir-34c | unconfirmed | hsa-mir-199a | dbDEMC |
| hsa-mir-34a | dbDEMC | hsa-mir-127 | dbDEMC;miR2Disease |
| hsa-mir-31 | dbDEMC | hsa-mir-27b | dbDEMC |
| hsa-mir-195 | dbDEMC | hsa-mir-193a | unconfirmed |
| hsa-mir-181b | dbDEMC | hsa-mir-148a | dbDEMC |
| hsa-mir-34b | dbDEMC | hsa-mir-130a | dbDEMC |
| hsa-mir-125b | unconfirmed | hsa-mir-224 | dbDEMC |
| hsa-mir-429 | unconfirmed | hsa-let-7a | dbDEMC |
| hsa-mir-7 | dbDEMC | hsa-mir-197 | dbDEMC |
| hsa-mir-214 | dbDEMC | hsa-mir-137 | dbDEMC |
| hsa-mir-29a | dbDEMC | hsa-mir-30d | dbDEMC |
| hsa-mir-25 | dbDEMC | hsa-mir-134 | dbDEMC |
| hsa-mir-93 | dbDEMC | hsa-mir-296 | dbDEMC |
Prediction of the top 50 predicted miRNAs associated with lung neoplasms based on known associations in HMDD V2.0 database.
All known associations between the miRNAs and Lung Neoplasms were removed before the prediction process. The prediction result was examined in dbDEMC, miR2Disease and HMDD V2.0. The first column records top 1–25 related miRNAs. The third column records the top 26–50 related miRNAs.
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-mir-20a | dbDEMC;miR2Disease;HMDD | hsa-mir-34c | dbDEMC;HMDD |
| hsa-mir-17 | miR2Disease;HMDD | hsa-mir-200a | dbDEMC;miR2Disease;HMDD |
| hsa-mir-18a | dbDEMC;miR2Disease;HMDD | hsa-mir-146a | dbDEMC;miR2Disease;HMDD |
| hsa-mir-19b | dbDEMC;HMDD | hsa-mir-223 | HMDD |
| hsa-mir-19a | dbDEMC;miR2Disease;HMDD | hsa-mir-143 | dbDEMC;miR2Disease;HMDD |
| hsa-mir-145 | dbDEMC;miR2Disease;HMDD | hsa-mir-29a | dbDEMC;miR2Disease;HMDD |
| hsa-mir-155 | dbDEMC;miR2Disease;HMDD | hsa-let-7g | dbDEMC;miR2Disease;HMDD |
| hsa-let-7a | dbDEMC;miR2Disease;HMDD | hsa-mir-146b | miR2Disease;HMDD |
| hsa-mir-21 | dbDEMC;miR2Disease;HMDD | hsa-mir-9 | miR2Disease;HMDD |
| hsa-mir-34a | dbDEMC;HMDD | hsa-mir-218 | dbDEMC;miR2Disease;HMDD |
| hsa-let-7b | miR2Disease;HMDD | hsa-mir-141 | dbDEMC;miR2Disease |
| hsa-mir-92a | HMDD | hsa-mir-200c | dbDEMC;miR2Disease;HMDD |
| hsa-mir-126 | dbDEMC;miR2Disease;HMDD | hsa-mir-106b | dbDEMC |
| hsa-let-7d | dbDEMC;miR2Disease;HMDD | hsa-mir-34b | dbDEMC;HMDD |
| hsa-let-7c | dbDEMC;miR2Disease;HMDD | hsa-mir-101 | dbDEMC;miR2Disease;HMDD |
| hsa-mir-200b | dbDEMC;miR2Disease;HMDD | hsa-mir-15a | dbDEMC |
| hsa-mir-221 | dbDEMC;HMDD | hsa-mir-214 | dbDEMC;miR2Disease;HMDD |
| hsa-let-7e | miR2Disease;HMDD | hsa-mir-205 | dbDEMC;miR2Disease;HMDD |
| hsa-mir-125b | miR2Disease;HMDD | hsa-mir-1 | dbDEMC;miR2Disease;HMDD |
| hsa-let-7f | miR2Disease;HMDD | hsa-mir-125a | dbDEMC;miR2Disease;HMDD |
| hsa-mir-199a | dbDEMC;miR2Disease;HMDD | hsa-mir-10b | dbDEMC;HMDD |
| hsa-mir-16 | dbDEMC;miR2Disease | hsa-mir-25 | dbDEMC;HMDD |
| hsa-let-7i | dbDEMC;HMDD | hsa-mir-181b | dbDEMC;HMDD |
| hsa-mir-29b | dbDEMC;miR2Disease;HMDD | hsa-mir-210 | dbDEMC;miR2Disease;HMDD |
| hsa-mir-222 | dbDEMC;HMDD | hsa-mir-93 | dbDEMC;miR2Disease;HMDD |
Prediction of the top 50 predicted miRNAs associated with breast neoplasms based on known associations in HMDD V1.0 database.
The prediction result was examined in dbDEMC, miR2Disease and HMDD V2.0. The first column records top 1–25 related miRNAs. The third column records the top 26–50 related miRNAs.
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-let-7e | dbDEMC;HMDD | hsa-mir-23b | dbDEMC;HMDD |
| hsa-let-7b | dbDEMC;HMDD | hsa-mir-203 | dbDEMC;miR2Disease; HMDD |
| hsa-mir-223 | dbDEMC;HMDD | hsa-mir-30e | Unconfirmed |
| hsa-mir-126 | dbDEMC;miR2Disease; HMDD | hsa-mir-29c | dbDEMC;miR2Disease; HMDD |
| hsa-mir-16 | dbDEMC;HMDD | hsa-mir-107 | dbDEMC;HMDD |
| hsa-let-7i | dbDEMC;miR2Disease; HMDD | hsa-mir-199b | dbDEMC;HMDD |
| hsa-let-7c | dbDEMC;HMDD | hsa-mir-18b | dbDEMC;HMDD |
| hsa-mir-92b | dbDEMC | hsa-mir-181a | dbDEMC;miR2Disease; HMDD |
| hsa-mir-99b | dbDEMC | hsa-mir-532 | dbDEMC |
| hsa-mir-100 | dbDEMC;HMDD | hsa-mir-27a | dbDEMC;miR2Disease; HMDD |
| hsa-mir-130a | dbDEMC | hsa-mir-22 | dbDEMC;miR2Disease; HMDD |
| hsa-mir-182 | dbDEMC;miR2Disease; HMDD | hsa-mir-148a | dbDEMC;miR2Disease; HMDD |
| hsa-mir-92a | HMDD | hsa-mir-192 | dbDEMC |
| hsa-let-7g | dbDEMC;HMDD | hsa-mir-196b | dbDEMC |
| hsa-mir-106a | dbDEMC | hsa-mir-142 | Unconfirmed |
| hsa-mir-335 | dbDEMC;miR2Disease; HMDD | hsa-mir-372 | dbDEMC |
| hsa-mir-195 | dbDEMC;miR2Disease; HMDD | hsa-mir-135a | dbDEMC;HMDD |
| hsa-mir-150 | dbDEMC | hsa-mir-224 | dbDEMC;HMDD |
| hsa-mir-101 | dbDEMC;miR2Disease; HMDD | hsa-mir-424 | dbDEMC |
| hsa-mir-191 | dbDEMC;miR2Disease; HMDD | hsa-mir-198 | dbDEMC |
| hsa-mir-24 | dbDEMC;HMDD | hsa-mir-28 | dbDEMC |
| hsa-mir-99a | dbDEMC | hsa-mir-212 | dbDEMC |
| hsa-mir-30a | miR2Disease;HMDD | hsa-mir-497 | dbDEMC;miR2Disease; HMDD |
| hsa-mir-32 | dbDEMC | hsa-mir-520c | miR2Disease;HMDD |
| hsa-mir-373 | dbDEMC;miR2Disease; HMDD | hsa-mir-520b | dbDEMC;HMDD |