| Literature DB >> 24975600 |
Abstract
MicroRNAs play critical role in the development and progression of various diseases. Predicting potential miRNA-disease associations from vast amount of biological data is an important problem in the biomedical research. Considering the limitations in previous methods, we developed Regularized Least Squares for MiRNA-Disease Association (RLSMDA) to uncover the relationship between diseases and miRNAs. RLSMDA can work for diseases without known related miRNAs. Furthermore, it is a semi-supervised (does not need negative samples) and global method (prioritize associations for all the diseases simultaneously). Based on leave-one-out cross validation, reliable AUC have demonstrated the reliable performance of RLSMDA. We also applied RLSMDA to Hepatocellular cancer and Lung cancer and implemented global prediction for all the diseases simultaneously. As a result, 80% (Hepatocellular cancer) and 84% (Lung cancer) of top 50 predicted miRNAs and 75% of top 20 potential associations based on global prediction have been confirmed by biological experiments. We also applied RLSMDA to diseases without known related miRNAs in golden standard dataset. As a result, in the top 3 potential related miRNA list predicted by RLSMDA for 32 diseases, 34 disease-miRNA associations were successfully confirmed by experiments. It is anticipated that RLSMDA would be a useful bioinformatics resource for biomedical researches.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24975600 PMCID: PMC4074792 DOI: 10.1038/srep05501
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Method comparison: (left) Comparison between RLSMDA and RWRMDA proposed by Chen, et al.47 in terms of ROC curve and AUC based on local leave-one-out cross validation on 1394 known experimentally verified miRNA–disease associations.
RLSMDA obtained comparable performance in the local LOOCV as RWRMDA, while RWRMDA cannot predict disease-related miRNAs for diseases without known related miRNAs and all the diseases simultaneously. RLSMDA can successfully solve these two critical shortcomings of RWRMDA. (right) Comparison between RLSMDA and HDMP in the term of global LOOCV. RLSMDA and HDMP obtained the AUC of 0.9511 and 0.9431, respectively. Although only slight improvement has been obtained here, RLSMDA can predict the potential miRNAs for diseases which do not have known related miRNAs, which has solved the most critical limitation of HDMP. The performance of RLSMDA could be further improved by introducing the information of miRNA family and cluster as what has been done in the method of HDMP.
The top 50 potential Hepatocellular cancer (HCC) related miRNAs predicted by RLSMDA and the confirmation for their associations by various databases are listed here (1st column: top 1–25; 2nd column: top 26–50). Forty of top 50 miRNAs have been confirmed to be related with HCC
| Name | Evidence | Name | Evidence |
|---|---|---|---|
| hsa-mir-155 | HMDD,dbDEMC,miR2Disease | hsa-mir-29c | HMDD,DbDEMC |
| hsa-mir-24 | HMDD,miR2Disease | hsa-mir-146b | HMDD |
| hsa-mir-107 | HMDD,dbDEMC,miR2Disease | hsa-mir-194 | dbDEMC,miR2Disease |
| hsa-mir-29b | HMDD,DbDEMC | hsa-let-7d | HMDD,miR2Disease |
| hsa-mir-126 | HMDD,dbDEMC,miR2Disease | hsa-mir-135b | Unconfirmed |
| hsa-let-7i | HMDD,DbDEMC | hsa-mir-497 | HMDD,DbDEMC |
| hsa-mir-183 | HMDD,miR2Disease | hsa-mir-204 | Unconfirmed |
| hsa-mir-214 | HMDD,dbDEMC,miR2Disease | hsa-let-7b | HMDD,miR2Disease |
| hsa-mir-34c | HMDD | hsa-mir-25 | HMDD,dbDEMC,miR2Disease |
| hsa-mir-31 | HMDD,miR2Disease | hsa-mir-32 | Unconfirmed |
| hsa-mir-191 | HMDD,DbDEMC | hsa-mir-196b | Unconfirmed |
| hsa-mir-181b | HMDD,dbDEMC,miR2Disease | hsa-mir-378 | Unconfirmed |
| hsa-let-7f | HMDD,miR2Disease | hsa-mir-142 | HMDD,miR2Disease |
| hsa-mir-103 | dbDEMC,miR2Disease | hsa-mir-95 | Unconfirmed |
| hsa-let-7g | HMDD,miR2Disease | hsa-mir-148b | HMDD,dbDEMC,miR2Disease |
| hsa-mir-132 | miR2Disease | hsa-mir-210 | HMDD,DbDEMC |
| hsa-mir-128b | miR2Disease | hsa-mir-205 | HMDD,miR2Disease |
| hsa-mir-151 | miR2Disease | hsa-mir-199b | HMDD,miR2Disease |
| hsa-mir-451 | dbDEMC | hsa-mir-498 | HMDD |
| hsa-mir-150 | HMDD,dbDEMC,miR2Disease | hsa-mir-182 | HMDD,miR2Disease |
| hsa-let-7c | HMDD,dbDEMC,miR2Disease | hsa-mir-421 | HMDD |
| hsa-mir-34b | Unconfirmed | hsa-mir-93 | HMDD,dbDEMC,miR2Disease |
| hsa-mir-141 | HMDD,miR2Disease | hsa-mir-340 | Unconfirmed |
| hsa-mir-29a | HMDD,DbDEMC | hsa-mir-193b | Unconfirmed |
| hsa-mir-658 | Unconfirmed | hsa-mir-30c | HMDD,miR2Disease |
The top 20 potential disease related miRNAs predicted by RLSMDA in the global ranking and the confirmation for their associations by various databases are listed here. Fifteen of top 20 disease-miRNA associations have been confirmed
| Ranking | Diseases | miRNAs | Evidence |
|---|---|---|---|
| 1 | Colonic Neoplasms | hsa-mir-222 | dbDEMC |
| 2 | Stomach Neoplasms | hsa-mir-451 | miR2Disease |
| 3 | Ovarian Neoplasms | hsa-mir-15a | |
| 4 | Colorectal Neoplasms | hsa-mir-19a | HMDD,miR2Disease |
| 5 | Muscular Disorders, Atrophic | hsa-mir-206 | |
| 6 | Colonic Neoplasms | hsa-mir-203 | dbDEMC,miR2Disease |
| 7 | Stomach Neoplasms | hsa-mir-19b | |
| 8 | Breast Neoplasms | hsa-let-7e | HMDD,dbDEMC |
| 9 | Colonic Neoplasms | hsa-mir-92b | |
| 10 | Carcinoma, Hepatocellular | hsa-mir-155 | HMDD,dbDEMC,miR2Disease |
| 11 | Colorectal Neoplasms | hsa-mir-125b | HMDD |
| 12 | Breast Neoplasms | hsa-let-7b | HMDD,dbDEMC |
| 13 | Adenocarcinoma | hsa-mir-200b | HMDD |
| 14 | Colonic Neoplasms | hsa-mir-183 | dbDEMC,miR2Disease |
| 15 | Breast Neoplasms | hsa-mir-92a | HMDD |
| 16 | Ovarian Neoplasms | hsa-mir-143 | miR2Disease |
| 17 | Breast Neoplasms | hsa-mir-223 | HMDD,dbDEMC |
| 18 | Neoplasms | hsa-mir-15a | HMDD |
| 19 | Breast Neoplasms | hsa-mir-16 | HMDD,dbDEMC |
| 20 | Stomach Neoplasms | hsa-mir-92a |
Confirmed disease-miRNA associations predicted by RLSMDA for diseases without known related miRNAs in our golden standard dataset
| Ranking | Diseases | miRNAs | PMID |
|---|---|---|---|
| 1 | Acute Coronary Syndrome | hsa-mir-1 | 21806992 |
| 1 | Aortic Aneurysm, Abdominal | hsa-mir-21 | 22357537 |
| 1 | Aortic Aneurysm, Thoracic | hsa-mir-21 | 22010139 |
| 1 | Arthritis, Psoriatic | hsa-mir-146a | 20500689 |
| 1 | Crohn Disease | hsa-mir-16 | 22386737 |
| 1 | Laryngeal Neoplasms | hsa-mir-205 | 22605671 |
| 1 | Leukemia, Myelogenous, Chronic, BCR-ABL Positive | hsa-mir-181a | 22442671 |
| 1 | Liver Failure | hsa-mir-221 | 21400558 |
| 1 | Lupus Erythematosus, Systemic | hsa-mir-146a | 21529448 |
| 1 | Mesothelioma | hsa-mir-18a | 21358347 |
| 1 | Osteosarcoma | hsa-mir-15a | 22922827 |
| 1 | Retinoblastoma | hsa-mir-181b | 21373755 |
| 1 | Sezary Syndrome | hsa-mir-21 | 21525938 |
| 1 | Vascular Diseases | hsa-mir-21 | 20560046 |
| 2 | Amyloidosis | hsa-mir-16 | 21834602 |
| 2 | Antiphospholipid Syndrome | hsa-mir-20a | 21794077 |
| 2 | Aortic Valve Stenosis | hsa-mir-21 | 22882958 |
| 2 | Atrial Fibrillation | hsa-mir-223 | 22944230 |
| 2 | Creutzfeldt-Jakob Syndrome | hsa-mir-146a | 22043907 |
| 2 | Endometrial Neoplasms | hsa-mir-194 | 21851624 |
| 2 | Huntington Disease | hsa-mir-200c | 22906125 |
| 2 | Lichen Planus, Oral | hsa-mir-21 | 21943223 |
| 2 | Mesothelioma | hsa-mir-20a | 21358347 |
| 2 | Lymphoma, Non-Hodgkin | hsa-mir-21 | 22487708 |
| 2 | Osteosarcoma | hsa-mir-16 | 22922827 |
| 3 | Colitis, Ulcerative | hsa-mir-143 | 21557394 |
| 3 | Cystic Fibrosis | hsa-mir-155 | 21282106 |
| 3 | Endometrial Neoplasms | hsa-mir-155 | 21176560 |
| 3 | Fibrosis | hsa-mir-29c | 21784902 |
| 3 | Hyperlipidemias | hsa-mir-122 | 22587332 |
| 3 | Keratoconus | hsa-mir-184 | 21996275 |
| 3 | Mycosis Fungoides | hsa-let-7a | 21966986 |
| 3 | Neoplasms, Squamous Cell | hsa-mir-181a | 21244495 |
| 3 | Osteoporosis | hsa-mir-133a | 22506038 |
Figure 2The basic idea of disease semantic similarity calculation.
Figure 3The flowchart of RLSMDA includes three steps: solving optimization problem; obtaining the optimal classifier in the disease and miRNA space, respectively; combining classifiers in the disease and miRNA space to obtain final predictive result.