Bin-Sheng He1, Jia Qu2, Qi Zhao3,4.
Abstract
With the rapid development of biological research, microRNAs (miRNAs) have become an attractive topic because many experimental studies have revealed significant associations between miRNAs and diseases. However, because experiments are expensive and time-consuming, computational methods for predicting miRNA-disease associations have become increasingly important. In this study, we proposed a neighborhood regularized logistic matrix factorization method for miRNA-disease association prediction (NRLMFMDA) that integrates miRNA functional similarity, disease semantic similarity, Gaussian interaction profile kernel similarity, and experimentally validated disease-miRNA associations. We used Gaussian interaction profile kernel similarity to compensate for the gaps in the traditional similarity measures, making the combined similarity more reasonable and complete. Furthermore, NRLMFMDA took full advantage of neighborhood information to improve the accuracy of miRNA-disease association prediction. We also improved accuracy by giving higher weights to the known association data when calculating the potential association probabilities. In global and local leave-one-out cross validation, NRLMFMDA achieved AUCs of 0.9068 and 0.8239, respectively. Moreover, the average AUC of NRLMFMDA in 5-fold cross validation was 0.8976 ± 0.0034. All three kinds of cross validation showed significant advantages over a number of previous models. In the case studies of breast neoplasms, esophageal neoplasms, and lymphoma based on known miRNA-disease associations in the recent version of the HMDD database, 78, 80, and 74% of the top 50 predicted miRNAs were verified to be associated with these three diseases, respectively. In further case studies for a new disease without any known related miRNAs and for the previous version of the HMDD database, high proportions of the predicted miRNAs were also verified by experimental reports. All the validation results demonstrate the effectiveness and practicability of NRLMFMDA in predicting potential miRNA-disease associations.
Keywords: association prediction; disease; matrix factorization; microRNA; neighborhood regularized
Year: 2018 PMID: 30131824 PMCID: PMC6090164 DOI: 10.3389/fgene.2018.00303
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
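The abstract names Gaussian interaction profile kernel similarity as one of the inputs but does not give its formula. The sketch below uses the commonly cited formulation, in which the kernel bandwidth is normalized by the mean squared norm of the interaction profiles; this is an assumption, not the authors' confirmed implementation. The function name `gip_kernel`, the parameter `gamma_prime`, and the toy adjacency matrix are all hypothetical.

```python
import numpy as np


def gip_kernel(profiles, gamma_prime=1.0):
    """Gaussian interaction profile kernel between the rows of `profiles`.

    Each row is the binary association profile of one entity (for example,
    one miRNA against all diseases). Assumed formulation:
        K[i, j] = exp(-gamma * ||p_i - p_j||^2)
        gamma   = gamma_prime / mean_i(||p_i||^2)
    """
    profiles = np.asarray(profiles, dtype=float)
    sq_norms = np.sum(profiles ** 2, axis=1)
    mean_sq_norm = sq_norms.mean()
    if mean_sq_norm == 0:
        raise ValueError("profiles contain no known associations")
    gamma = gamma_prime / mean_sq_norm
    # Pairwise squared Euclidean distances via the expansion
    # ||a - b||^2 = ||a||^2 + ||b||^2 - 2 a.b
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * profiles @ profiles.T
    sq_dists = np.maximum(sq_dists, 0.0)  # guard against tiny negative round-off
    return np.exp(-gamma * sq_dists)


# Toy association matrix (hypothetical): 4 miRNAs (rows) x 3 diseases (columns).
A = np.array([
    [1, 0, 1],
    [1, 0, 0],
    [0, 1, 0],
    [1, 0, 1],
])

K_mirna = gip_kernel(A)      # miRNA-miRNA similarity from row profiles
K_disease = gip_kernel(A.T)  # disease-disease similarity from column profiles
```

In this toy example the first and fourth miRNAs have identical profiles, so their similarity is exactly 1, while miRNAs that share no associated disease score lower.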
Figure 1. Flowchart of the NRLMFMDA model for predicting potential miRNA-disease associations based on the known associations in the HMDD database.
Figure 2. AUC of global LOOCV (left) compared with HGIMDA, RLSMDA, HDMP, and WBSMDA; AUC of local LOOCV (right) compared with HGIMDA, RLSMDA, HDMP, WBSMDA, and RWRMDA. NRLMFMDA achieved AUCs of 0.9068 and 0.8239 in the global and local LOOCV, respectively, exceeding all the compared models.
Performance comparison (AUC) between NRLMFMDA and several other typical models in global LOOCV, local LOOCV, and 5-fold cross validation based on known miRNA-disease associations.
| Model | Global LOOCV | Local LOOCV | 5-fold cross validation |
| --- | --- | --- | --- |
| NRLMFMDA | 0.9068 | 0.8239 | 0.8976 ± 0.0034 |
| HGIMDA | 0.8781 | 0.8077 | N/A |
| RLSMDA | 0.8426 | 0.6953 | 0.8569 ± 0.0020 |
| HDMP | 0.8366 | 0.7702 | 0.8342 ± 0.0010 |
| WBSMDA | 0.8030 | 0.8031 | 0.8185 ± 0.0009 |
| RWRMDA | N/A | 0.7891 | N/A |
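The AUC values in this comparison summarize how highly held-out known associations rank among candidate pairs. As a minimal sketch of the metric only (not the authors' evaluation script), the AUC can be computed as the probability that a randomly chosen positive pair scores above a randomly chosen candidate pair, with ties counted as one half. The function name `auc_from_scores` and the toy scores are hypothetical.

```python
import numpy as np


def auc_from_scores(scores, labels):
    """AUC as P(score of a positive > score of a negative), ties counted as 0.5."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    pos = scores[labels]
    neg = scores[~labels]
    if pos.size == 0 or neg.size == 0:
        raise ValueError("need at least one positive and one negative sample")
    # Compare every positive score with every negative score.
    diff = pos[:, None] - neg[None, :]
    wins = np.sum(diff > 0) + 0.5 * np.sum(diff == 0)
    return float(wins / (pos.size * neg.size))


# Toy example (hypothetical): predicted scores for five miRNA-disease pairs,
# where label 1 marks a held-out known association.
scores = [0.9, 0.8, 0.7, 0.4, 0.2]
labels = [1, 0, 1, 0, 0]
toy_auc = auc_from_scores(scores, labels)  # 5 of 6 positive-negative pairs ordered correctly
```

The pairwise comparison is quadratic in the number of samples, which is fine for illustration; a rank-based computation would be preferable for full association matrices.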
Top 50 predicted miRNAs associated with breast neoplasms based on known associations in the HMDD database.
| miRNA (top 1–25) | Evidence | miRNA (top 26–50) | Evidence |
| --- | --- | --- | --- |
| hsa-mir-200c | dbdemc;miR2Disease | hsa-mir-1302 | unconfirmed |
| hsa-let-7e | dbdemc | hsa-let-7i | dbdemc;miR2Disease |
| hsa-let-7d | dbdemc;miR2Disease | hsa-mir-133a | dbdemc |
| hsa-mir-655 | unconfirmed | hsa-mir-9 | dbdemc;miR2Disease |
| hsa-mir-590 | dbdemc | hsa-mir-103a | unconfirmed |
| hsa-mir-221 | dbdemc;miR2Disease | hsa-mir-450b | unconfirmed |
| hsa-mir-181 | unconfirmed | hsa-mir-19b | dbdemc |
| hsa-mir-10b | dbdemc;miR2Disease | hsa-mir-18a | dbdemc;miR2Disease |
| hsa-mir-15a | dbdemc | hsa-mir-23a | dbdemc |
| hsa-mir-182 | dbdemc;miR2Disease | hsa-mir-20b | unconfirmed |
| hsa-mir-150 | dbdemc | hsa-mir-345 | dbdemc |
| hsa-mir-16 | dbdemc | hsa-mir-106a | dbdemc |
| hsa-mir-219 | dbdemc | hsa-mir-33a | unconfirmed |
| hsa-mir-15b | dbdemc | hsa-mir-195 | dbdemc;miR2Disease |
| hsa-mir-17 | miR2Disease | hsa-mir-200a | dbdemc;miR2Disease |
| hsa-mir-422a | dbdemc | hsa-mir-455 | dbdemc |
| hsa-mir-215 | dbdemc | hsa-mir-132 | dbdemc |
| hsa-mir-1247 | unconfirmed | hsa-mir-652 | dbdemc |
| hsa-mir-151 | unconfirmed | hsa-mir-96 | dbdemc;miR2Disease |
| hsa-mir-22 | dbdemc;miR2Disease | hsa-mir-1323 | unconfirmed |
| hsa-mir-107 | dbdemc | hsa-mir-137 | dbdemc |
| hsa-mir-143 | dbdemc;miR2Disease | hsa-mir-202 | dbdemc;miR2Disease |
| hsa-mir-346 | dbdemc | hsa-mir-2355 | unconfirmed |
| hsa-mir-191 | dbdemc;miR2Disease | hsa-mir-204 | dbdemc;miR2Disease |
| hsa-mir-223 | dbdemc | hsa-mir-126 | dbdemc;miR2Disease |
The first miRNA column lists the top 1–25 related miRNAs; the second miRNA column lists the top 26–50 related miRNAs.
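The verified percentages quoted in the abstract can be checked directly against a table like the one above by counting evidence cells that are not marked `unconfirmed`. The sketch below assumes the four-cell row layout used here (miRNA, evidence, miRNA, evidence); the helper name `confirmed_fraction` is hypothetical, and only the first five rows of the breast neoplasms table are used as sample input.

```python
def confirmed_fraction(table_rows):
    """Count confirmed predictions in pipe-delimited rows.

    Each row is assumed to hold: miRNA | evidence | miRNA | evidence.
    Returns (confirmed, total).
    """
    confirmed = 0
    total = 0
    for row in table_rows:
        cells = [cell.strip() for cell in row.strip().strip("|").split("|")]
        # Evidence cells sit at odd positions (1 and 3).
        for evidence in cells[1::2]:
            total += 1
            if evidence.lower() != "unconfirmed":
                confirmed += 1
    return confirmed, total


# First five rows of the breast neoplasms table.
rows = [
    "| hsa-mir-200c | dbdemc;miR2Disease | hsa-mir-1302 | unconfirmed |",
    "| hsa-let-7e | dbdemc | hsa-let-7i | dbdemc;miR2Disease |",
    "| hsa-let-7d | dbdemc;miR2Disease | hsa-mir-133a | dbdemc |",
    "| hsa-mir-655 | unconfirmed | hsa-mir-9 | dbdemc;miR2Disease |",
    "| hsa-mir-590 | dbdemc | hsa-mir-103a | unconfirmed |",
]
confirmed, total = confirmed_fraction(rows)
print(f"{confirmed}/{total} confirmed")
```

Applied to all 25 rows of the breast neoplasms table, the same count gives 39 confirmed out of 50 (11 marked unconfirmed), which is the 78% reported in the abstract.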
Top 50 predicted miRNAs associated with esophageal neoplasms based on known associations in the HMDD database.
| miRNA (top 1–25) | Evidence | miRNA (top 26–50) | Evidence |
| --- | --- | --- | --- |
| hsa-mir-146a | dbdemc | hsa-mir-1972 | unconfirmed |
| hsa-mir-26b | dbdemc | hsa-mir-200b | dbdemc |
| hsa-mir-675 | unconfirmed | hsa-mir-20b | dbdemc |
| hsa-mir-10b | dbdemc | hsa-mir-1247 | unconfirmed |
| hsa-mir-191 | dbdemc | hsa-mir-31 | dbdemc |
| hsa-mir-15b | dbdemc | hsa-mir-198 | dbdemc |
| hsa-mir-143 | dbdemc | hsa-mir-103a | unconfirmed |
| hsa-mir-20a | dbdemc | hsa-mir-152 | dbdemc |
| hsa-mir-34b | dbdemc | hsa-mir-1915 | unconfirmed |
| hsa-mir-27a | dbdemc | hsa-mir-195 | dbdemc |
| hsa-mir-9 | dbdemc | hsa-mir-320e | unconfirmed |
| hsa-mir-221 | dbdemc | hsa-mir-335 | dbdemc |
| hsa-mir-590 | dbdemc | hsa-mir-106b | dbdemc |
| hsa-mir-200c | dbdemc | hsa-mir-181d | dbdemc |
| hsa-mir-25 | dbdemc | hsa-mir-422a | dbdemc |
| hsa-let-7a | dbdemc | hsa-mir-372 | dbdemc |
| hsa-mir-203 | dbdemc;miR2Disease | hsa-mir-15a | dbdemc |
| hsa-mir-376c | unconfirmed | hsa-mir-1 | dbdemc |
| hsa-let-7e | dbdemc | hsa-mir-181 | unconfirmed |
| hsa-mir-100 | dbdemc | hsa-mir-29a | dbdemc |
| hsa-mir-29b | dbdemc | hsa-mir-30d | dbdemc |
| hsa-mir-2355 | unconfirmed | hsa-mir-106a | dbdemc |
| hsa-mir-205 | dbdemc;miR2Disease | hsa-mir-92 | dbdemc |
| hsa-mir-30a | dbdemc | hsa-mir-371a | unconfirmed |
| hsa-mir-125a | dbdemc | hsa-mir-141 | dbdemc |
The first miRNA column lists the top 1–25 related miRNAs; the second miRNA column lists the top 26–50 related miRNAs.
Top 50 predicted miRNAs associated with lymphoma based on known associations in the HMDD database.
| miRNA (top 1–25) | Evidence | miRNA (top 26–50) | Evidence |
| --- | --- | --- | --- |
| hsa-mir-10b | dbdemc | hsa-mir-106b | dbdemc |
| hsa-mir-1247 | unconfirmed | hsa-let-7a | dbdemc |
| hsa-mir-221 | dbdemc;miR2Disease | hsa-mir-326 | dbdemc |
| hsa-mir-1302 | unconfirmed | hsa-mir-99b | dbdemc |
| hsa-mir-30a | dbdemc | hsa-mir-103a | unconfirmed |
| hsa-mir-31 | dbdemc | hsa-mir-30b | dbdemc |
| hsa-mir-9 | dbdemc | hsa-mir-124 | dbdemc |
| hsa-mir-27b | dbdemc | hsa-mir-204 | dbdemc |
| hsa-mir-181c | dbdemc | hsa-mir-1915 | unconfirmed |
| hsa-let-7d | dbdemc | hsa-mir-410 | unconfirmed |
| hsa-mir-15b | dbdemc | hsa-mir-19b | dbdemc;miR2Disease |
| hsa-mir-202 | unconfirmed | hsa-mir-301b | unconfirmed |
| hsa-let-7e | dbdemc;miR2Disease | hsa-mir-518a | unconfirmed |
| hsa-mir-2355 | unconfirmed | hsa-mir-125a | dbdemc |
| hsa-mir-27a | dbdemc | hsa-mir-191 | dbdemc |
| hsa-mir-139 | dbdemc;miR2Disease | hsa-mir-23a | dbdemc |
| hsa-mir-17 | dbdemc;miR2Disease | hsa-mir-200c | dbdemc |
| hsa-mir-215 | dbdemc | hsa-mir-33a | dbdemc |
| hsa-mir-20a | dbdemc;miR2Disease | hsa-mir-1 | dbdemc |
| hsa-mir-29b | dbdemc | hsa-mir-127 | dbdemc;miR2Disease |
| hsa-mir-29c | dbdemc | hsa-mir-132 | dbdemc |
| hsa-mir-208b | unconfirmed | hsa-mir-146b | unconfirmed |
| hsa-mir-655 | unconfirmed | hsa-mir-200b | dbdemc |
| hsa-mir-99a | dbdemc;miR2Disease | hsa-mir-942 | unconfirmed |
| hsa-mir-219 | dbdemc | hsa-let-7b | dbdemc |
The first miRNA column lists the top 1–25 related miRNAs; the second miRNA column lists the top 26–50 related miRNAs.
Top 50 predicted miRNAs associated with carcinoma, hepatocellular based on known associations in the HMDD database.
| miRNA (top 1–25) | Evidence | miRNA (top 26–50) | Evidence |
| --- | --- | --- | --- |
| hsa-mir-146a | dbdemc;miR2Disease;HMDD | hsa-mir-1247 | unconfirmed |
| hsa-mir-16 | dbdemc;miR2Disease;HMDD | hsa-mir-150 | dbdemc;miR2Disease;HMDD |
| hsa-mir-215 | miR2Disease | hsa-mir-483 | HMDD |
| hsa-mir-133b | HMDD | hsa-let-7e | dbdemc;miR2Disease;HMDD |
| hsa-mir-15a | dbdemc;miR2Disease;HMDD | hsa-mir-205 | miR2Disease;HMDD |
| hsa-mir-15b | dbdemc;HMDD | hsa-mir-139 | miR2Disease;HMDD |
| hsa-mir-103b | unconfirmed | hsa-mir-92a | miR2Disease;HMDD |
| hsa-mir-345 | HMDD | hsa-mir-145 | dbdemc;miR2Disease;HMDD |
| hsa-mir-9 | miR2Disease | hsa-mir-204 | unconfirmed |
| hsa-mir-20a | dbdemc;miR2Disease;HMDD | hsa-let-7g | miR2Disease;HMDD |
| hsa-mir-219 | miR2Disease;HMDD | hsa-mir-1302 | unconfirmed |
| hsa-mir-143 | dbdemc;miR2Disease | hsa-mir-1972 | unconfirmed |
| hsa-mir-125a | dbdemc;miR2Disease;HMDD | hsa-mir-191 | dbdemc;HMDD |
| hsa-mir-29b | dbdemc;HMDD | hsa-mir-450b | HMDD |
| hsa-mir-106b | dbdemc;miR2Disease;HMDD | hsa-mir-181d | dbdemc;HMDD |
| hsa-mir-22 | dbdemc;HMDD | hsa-mir-30b | HMDD |
| hsa-mir-152 | miR2Disease;HMDD | hsa-mir-10b | HMDD |
| hsa-mir-675 | unconfirmed | hsa-mir-941 | unconfirmed |
| hsa-mir-27b | dbdemc | hsa-mir-30a | miR2Disease;HMDD |
| hsa-mir-221 | dbdemc;miR2Disease;HMDD | hsa-mir-30d | dbdemc;HMDD |
| hsa-let-7d | miR2Disease;HMDD | hsa-mir-200a | dbdemc;miR2Disease;HMDD |
| hsa-mir-100 | dbdemc;HMDD | hsa-mir-194 | dbdemc;miR2Disease |
| hsa-mir-26a | dbdemc;miR2Disease;HMDD | hsa-mir-2355 | unconfirmed |
| hsa-mir-198 | HMDD | hsa-mir-146b | HMDD |
| hsa-mir-29a | dbdemc;HMDD | hsa-let-7c | dbdemc;miR2Disease;HMDD |
The first miRNA column lists the top 1–25 related miRNAs; the second miRNA column lists the top 26–50 related miRNAs.
Top 50 predicted miRNAs associated with lung neoplasms based on known associations in an older version of the HMDD database.
| miRNA (top 1–25) | Evidence | miRNA (top 26–50) | Evidence |
| --- | --- | --- | --- |
| hsa-mir-96 | dbdemc;HMDD | hsa-mir-139 | dbdemc;miR2Disease |
| hsa-mir-498 | dbdemc | hsa-mir-323 | unconfirmed |
| hsa-mir-491 | unconfirmed | hsa-mir-181d | dbdemc |
| hsa-mir-335 | miR2Disease;HMDD | hsa-mir-379 | unconfirmed |
| hsa-mir-378 | unconfirmed | hsa-mir-448 | unconfirmed |
| hsa-mir-596 | unconfirmed | hsa-mir-302d | dbdemc |
| hsa-mir-409 | unconfirmed | hsa-mir-301b | unconfirmed |
| hsa-mir-523 | unconfirmed | hsa-mir-1 | dbdemc;miR2Disease;HMDD |
| hsa-mir-526b | dbdemc | hsa-mir-154 | dbdemc |
| hsa-mir-220 | miR2Disease | hsa-mir-510 | unconfirmed |
| hsa-mir-15a | dbdemc | hsa-mir-17 | miR2Disease;HMDD |
| hsa-mir-520f | dbdemc | hsa-mir-133a | dbdemc;HMDD |
| hsa-mir-136 | dbdemc;HMDD | hsa-mir-376a | HMDD |
| hsa-mir-520c | unconfirmed | hsa-mir-219 | miR2Disease;HMDD |
| hsa-mir-657 | unconfirmed | hsa-mir-181a | dbdemc;HMDD |
| hsa-mir-185 | dbdemc;HMDD | hsa-mir-25 | dbdemc;HMDD |
| hsa-mir-34a | dbdemc;HMDD | hsa-mir-194 | unconfirmed |
| hsa-mir-514 | unconfirmed | hsa-mir-130b | dbdemc |
| hsa-mir-383 | dbdemc | hsa-mir-15b | dbdemc |
| hsa-mir-642 | unconfirmed | hsa-mir-532 | unconfirmed |
| hsa-mir-29a | dbdemc;miR2Disease;HMDD | hsa-mir-598 | unconfirmed |
| hsa-mir-181b | dbdemc;HMDD | hsa-mir-512 | unconfirmed |
| hsa-mir-338 | dbdemc;miR2Disease;HMDD | hsa-mir-526a | unconfirmed |
| hsa-mir-224 | dbdemc;miR2Disease;HMDD | hsa-let-7b | miR2Disease;HMDD |
| hsa-mir-210 | dbdemc;miR2Disease;HMDD | hsa-mir-134 | HMDD |
The first miRNA column lists the top 1–25 related miRNAs; the second miRNA column lists the top 26–50 related miRNAs.