| Literature DB >> 30439645 |
Xing Chen1, Chun-Chun Wang2, Jun Yin2, Zhu-Hong You3.
Abstract
Since the first microRNA (miRNA) was discovered, a lot of studies have confirmed the associations between miRNAs and human complex diseases. Besides, obtaining and taking advantage of association information between miRNAs and diseases play an increasingly important role in improving the treatment level for complex diseases. However, due to the high cost of traditional experimental methods, many researchers have proposed different computational methods to predict potential associations between miRNAs and diseases. In this work, we developed a computational model of Random Forest for miRNA-disease association (RFMDA) prediction based on machine learning. The training sample set for RFMDA was constructed according to the human microRNA disease database (HMDD) version (v.)2.0, and the feature vectors to represent miRNA-disease samples were defined by integrating miRNA functional similarity, disease semantic similarity, and Gaussian interaction profile kernel similarity. The Random Forest algorithm was first employed to infer miRNA-disease associations. In addition, a filter-based method was implemented to select robust features from the miRNA-disease feature set, which could efficiently distinguish related miRNA-disease pairs from unrelated miRNA-disease pairs. RFMDA achieved areas under the curve (AUCs) of 0.8891, 0.8323, and 0.8818 ± 0.0014 under global leave-one-out cross-validation, local leave-one-out cross-validation, and 5-fold cross-validation, respectively, which were higher than many previous computational models. To further evaluate the accuracy of RFMDA, we carried out three types of case studies for four human complex diseases. As a result, 43 (esophageal neoplasms), 46 (lymphoma), 47 (lung neoplasms), and 48 (breast neoplasms) of the top 50 predicted disease-related miRNAs were verified by experiments in different kinds of case studies. The results of cross-validation and case studies indicated that RFMDA is a reliable model for predicting miRNA-disease associations.Entities:
Keywords: Random Forest; association prediction; disease; microRNA
Year: 2018 PMID: 30439645 PMCID: PMC6234518 DOI: 10.1016/j.omtn.2018.10.005
Source DB: PubMed Journal: Mol Ther Nucleic Acids ISSN: 2162-2531 Impact factor: 8.886
Figure 1AUCs of RFMDA and HGIMDA, RLSMDA, HDMP, WBSMDA, MaxFlow, and MCMDA under Global LOOCV
As one can see, RFMDA achieved AUCs of 0.8891 under global LOOCV, which were higher than those of previous models.
Figure 2AUCs of RFMDA and HGIMDA, RLSMDA, HDMP, WBSMDA, RWRMDA, MaxFlow, MCMDA, MIDP, and MiRAI under Local LOOCV
As one can see, RFMDA achieved AUCs of 0.8323 under local LOOCV, which were higher than those of previous models.
Top 50 miRNAs Associated with Esophageal Neoplasms Were Predicted by RFMDA Based on Known Associations in the HMDD v.2.0
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-mir-127 | dbDEMC | hsa-mir-30a | dbDEMC |
| hsa-let-7g | dbDEMC | hsa-mir-125b | dbDEMC |
| hsa-mir-222 | dbDEMC | hsa-mir-7 | dbDEMC |
| hsa-mir-221 | dbDEMC | hsa-mir-18a | dbDEMC |
| hsa-mir-30c | dbDEMC | hsa-mir-95 | dbDEMC |
| hsa-mir-146b | dbDEMC | hsa-let-7i | dbDEMC |
| hsa-mir-372 | dbDEMC | hsa-mir-181a | dbDEMC |
| hsa-mir-181b | dbDEMC | hsa-mir-204 | dbDEMC |
| hsa-mir-10b | dbDEMC | hsa-mir-107 | unconfirmed |
| hsa-mir-93 | dbDEMC | hsa-mir-451 | dbDEMC and miR2Disease |
| hsa-mir-16 | dbDEMC | hsa-mir-122 | dbDEMC |
| hsa-mir-200b | dbDEMC | hsa-mir-335 | unconfirmed |
| hsa-mir-142 | dbDEMC | hsa-let-7f | dbDEMC |
| hsa-mir-191 | dbDEMC | hsa-mir-29a | unconfirmed |
| hsa-mir-9 | dbDEMC | hsa-mir-139 | dbDEMC |
| hsa-mir-199b | dbDEMC | hsa-mir-140 | dbDEMC |
| hsa-mir-137 | dbDEMC | hsa-mir-218 | dbDEMC |
| hsa-mir-20b | dbDEMC | hsa-mir-135a | unconfirmed |
| hsa-mir-132 | dbDEMC | hsa-mir-125a | dbDEMC |
| hsa-mir-18b | dbDEMC | hsa-mir-194 | dbDEMC |
| hsa-mir-449a | unconfirmed | hsa-mir-29b | dbDEMC and miR2Disease |
| hsa-mir-449b | unconfirmed | hsa-mir-30e | dbDEMC |
| hsa-mir-106a | dbDEMC | hsa-mir-27b | unconfirmed |
| hsa-mir-373 | dbDEMC and miR2Disease | hsa-mir-193b | dbDEMC |
| hsa-mir-224 | dbDEMC | hsa-mir-195 | dbDEMC |
The top 1–25 related miRNAs are recorded in the first column, and the top 26–50 related miRNAs are recorded in the third column. As we can see 10, 19, and 43 of the top 10, top 20, and top 50 were verified by databases.
Top 50 miRNAs Associated with Lymphoma Were Predicted by RFMDA Based on Known Associations in the HMDD v.2.0
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-let-7b | dbDEMC | hsa-mir-199a | dbDEMC |
| hsa-mir-125b | unconfirmed | hsa-mir-106a | dbDEMC and miR2Disease |
| hsa-let-7a | dbDEMC | hsa-mir-29a | dbDEMC |
| hsa-let-7c | dbDEMC | hsa-mir-182 | dbDEMC |
| hsa-mir-34a | dbDEMC | hsa-mir-23b | dbDEMC |
| hsa-let-7e | dbDEMC and miR2Disease | hsa-mir-15b | dbDEMC |
| hsa-mir-106b | dbDEMC | hsa-mir-29b | dbDEMC |
| hsa-let-7d | dbDEMC | hsa-mir-27a | dbDEMC |
| hsa-mir-145 | dbDEMC and miR2Disease | hsa-mir-141 | dbDEMC |
| hsa-let-7i | dbDEMC | hsa-mir-22 | dbDEMC |
| hsa-mir-143 | dbDEMC and miR2Disease | hsa-mir-195 | dbDEMC |
| hsa-mir-222 | dbDEMC | hsa-mir-30a | dbDEMC |
| hsa-mir-221 | dbDEMC and miR2Disease | hsa-mir-196a | dbDEMC |
| hsa-mir-223 | dbDEMC | hsa-mir-1 | dbDEMC |
| hsa-mir-127 | dbDEMC and miR2Disease | hsa-mir-32 | dbDEMC |
| hsa-mir-25 | dbDEMC | hsa-mir-95 | dbDEMC and miR2Disease |
| hsa-mir-30c | dbDEMC | hsa-mir-34b | dbDEMC |
| hsa-mir-146b | unconfirmed | hsa-mir-148a | dbDEMC |
| hsa-let-7f | dbDEMC | hsa-mir-183 | dbDEMC |
| hsa-mir-181b | dbDEMC | hsa-mir-10b | dbDEMC |
| hsa-mir-214 | dbDEMC | hsa-mir-132 | dbDEMC |
| hsa-mir-191 | dbDEMC | hsa-mir-133a | dbDEMC |
| hsa-let-7g | dbDEMC | hsa-mir-199b | dbDEMC |
| hsa-mir-34c | unconfirmed | hsa-mir-335 | dbDEMC |
| hsa-mir-100 | dbDEMC | hsa-mir-372 | unconfirmed |
The top 1–25 related miRNAs are recorded in the first column, and the top 26–50 related miRNAs are recorded in the third column. As we can see 9, 18, and 46 of the top 10, top 20, and top 50 were verified by databases.
Top 50 miRNAs Associated with Lung Neoplasms Were Predicted by RFMDA after Hiding All Known Associations about Lung Neoplasms Based in the HMDD v.2.0
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-mir-133a | dbDEMC and HMDD | hsa-mir-192 | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-150 | dbDEMC, miR2Disease, and HMDD | hsa-mir-130a | dbDEMC and miR2Disease |
| hsa-mir-196a | dbDEMC and HMDD | hsa-mir-10a | dbDEMC |
| hsa-mir-210 | dbDEMC, miR2Disease, and HMDD | hsa-mir-200c | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-182 | dbDEMC, miR2Disease, and HMDD | hsa-mir-148a | dbDEMC and HMDD |
| hsa-mir-204 | miR2Disease | hsa-mir-17 | miR2Disease and HMDD |
| hsa-mir-100 | dbDEMC and HMDD | hsa-mir-146a | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-199b | dbDEMC, miR2Disease, and HMDD | hsa-mir-206 | HMDD |
| hsa-mir-196b | dbDEMC | hsa-mir-203 | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-31 | dbDEMC, miR2Disease, and HMDD | hsa-mir-20a | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-335 | miR2Disease and HMDD | hsa-mir-26a | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-30a | miR2Disease and HMDD | hsa-mir-302b | dbDEMC |
| hsa-mir-1 | dbDEMC, miR2Disease, and HMDD | hsa-mir-224 | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-296 | dbDEMC | hsa-mir-302c | dbDEMC |
| hsa-mir-205 | dbDEMC, miR2Disease, and HMDD | hsa-mir-181a | dbDEMC and HMDD |
| hsa-mir-27a | dbDEMC and HMDD | hsa-mir-221 | dbDEMC and HMDD |
| hsa-mir-21 | dbDEMC, miR2Disease, and HMDD | hsa-mir-95 | miR2Disease and HMDD |
| hsa-mir-183 | dbDEMC, miR2Disease, and HMDD | hsa-mir-143 | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-22 | miR2Disease and HMDD | hsa-mir-32 | miR2Disease and HMDD |
| hsa-mir-200a | dbDEMC, miR2Disease, and HMDD | hsa-mir-135b | dbDEMC and HMDD |
| hsa-mir-181b | dbDEMC and HMDD | hsa-mir-135a | dbDEMC and HMDD |
| hsa-mir-146b | miR2Disease and HMDD | hsa-mir-302a | unconfirmed |
| hsa-mir-107 | dbDEMC and HMDD | hsa-mir-7 | miR2Disease and HMDD |
| hsa-mir-34c | dbDEMC and HMDD | hsa-mir-218 | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-372 | unconfirmed | hsa-mir-491 | unconfirmed |
The top 1–25 related miRNAs are recorded in the first column, and the top 26–50 related miRNAs are recorded in the third column. As we can see 10, 20, and 47 of the top 10, top 20, and top 50 were verified by databases.
Top 50 miRNAs Associated with Breast Neoplasms Were Predicted by RFMDA Based on Known Associations in the HMDD v.1.0
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-mir-223 | dbDEMC and HMDD | hsa-mir-520b | dbDEMC and HMDD |
| hsa-mir-24 | dbDEMC and HMDD | hsa-mir-23b | dbDEMC and HMDD |
| hsa-let-7b | dbDEMC and HMDD | hsa-mir-148a | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-126 | dbDEMC, miR2Disease, and HMDD | hsa-mir-135a | dbDEMC and HMDD |
| hsa-mir-373 | dbDEMC, miR2Disease, and HMDD | hsa-mir-182 | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-32 | dbDEMC | hsa-mir-142 | unconfirmed |
| hsa-mir-16 | dbDEMC and HMDD | hsa-let-7i | dbDEMC, miR2Disease, and HMDD |
| hsa-let-7c | dbDEMC and HMDD | hsa-mir-128b | miR2Disease |
| hsa-mir-150 | dbDEMC | hsa-mir-335 | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-29c | dbDEMC, miR2Disease, and HMDD | hsa-mir-15b | dbDEMC |
| hsa-mir-372 | dbDEMC | hsa-mir-98 | dbDEMC and miR2Disease |
| hsa-mir-101 | dbDEMC, miR2Disease, and HMDD | hsa-mir-181a | dbDEMC, miR2Disease, and HMDD |
| hsa-let-7e | dbDEMC and HMDD | hsa-mir-183 | dbDEMC and HMDD |
| hsa-mir-106a | dbDEMC | hsa-mir-26a | dbDEMC, miR2Disease, and HMDD |
| hsa-let-7g | dbDEMC and HMDD | hsa-mir-100 | dbDEMC and HMDD |
| hsa-mir-99b | dbDEMC | hsa-mir-107 | dbDEMC and HMDD |
| hsa-mir-192 | dbDEMC | hsa-mir-224 | dbDEMC and HMDD |
| hsa-mir-30e | unconfirmed | hsa-mir-92b | dbDEMC |
| hsa-mir-199b | dbDEMC and HMDD | hsa-mir-95 | dbDEMC |
| hsa-mir-27a | dbDEMC, miR2Disease, and HMDD | hsa-mir-22 | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-130a | dbDEMC | hsa-mir-196b | dbDEMC |
| hsa-mir-195 | dbDEMC, miR2Disease, and HMDD | hsa-mir-191 | dbDEMC, miR2Disease, and HMDD |
| hsa-mir-30a | miR2Disease and HMDD | hsa-mir-18b | dbDEMC and HMDD |
| hsa-mir-203 | dbDEMC, miR2Disease, and HMDD | hsa-mir-186 | dbDEMC |
| hsa-mir-92a | HMDD | hsa-mir-424 | dbDEMC |
The top 1–25 related miRNAs are recorded in the first column, and the top 26–50 related miRNAs are recorded in the third column. As we can see 10, 19, and 48 of the top 10, top 20, and top 50 were verified by databases.
Figure 3Flowchart of RFMDA Model to Predict Potential Associations between miRNAs and Diseases