Quan Zou, Jinjin Li, Qingqi Hong, Ziyu Lin, Yun Wu, Hua Shi, Ying Ju.
Abstract
MicroRNAs constitute an important class of noncoding, single-stranded RNA molecules, about 22 nucleotides long, encoded by endogenous genes. They play an important role in regulating gene transcription and normal development. MicroRNAs can be associated with disease; however, only a few microRNA-disease associations have been confirmed by traditional experimental approaches. We introduce two methods to predict microRNA-disease associations. The first method, KATZ, integrates a social network analysis method with machine learning and is based on networks derived from known microRNA-disease associations, disease-disease associations, and microRNA-microRNA associations. The other method, CATAPULT, is a supervised machine learning method. We applied the two methods to 242 known microRNA-disease associations and evaluated their performance using leave-one-out cross-validation and 3-fold cross-validation. In these experiments, both methods outperformed the state-of-the-art methods.
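To make the network-based idea concrete, the following is a minimal sketch of a truncated Katz measure on a combined microRNA-disease network. It is not the authors' implementation: the function name `katz_scores`, the damping factor `beta`, the maximum walk length `max_len`, and the toy matrices are all assumptions chosen for illustration.

```python
import numpy as np

def katz_scores(mirna_sim, disease_sim, assoc, beta=0.01, max_len=3):
    """Truncated Katz measure: sum over walk lengths l of beta**l * C**l.

    C is the combined adjacency matrix built from the three networks.
    beta and max_len are illustrative values, not taken from the paper.
    """
    n_m, _ = assoc.shape
    # Combined heterogeneous network: microRNA nodes first, then disease nodes.
    combined = np.block([[mirna_sim, assoc], [assoc.T, disease_sim]])
    power = np.eye(combined.shape[0])
    total = np.zeros_like(combined, dtype=float)
    for length in range(1, max_len + 1):
        power = power @ combined          # C**length
        total += (beta ** length) * power  # longer walks are damped more
    # The upper-right block holds the microRNA-to-disease scores.
    return total[:n_m, n_m:]

# Toy data (hypothetical): 3 microRNAs, 2 diseases, 2 known associations.
mirna_sim = np.array([[1.0, 0.6, 0.0],
                      [0.6, 1.0, 0.2],
                      [0.0, 0.2, 1.0]])
disease_sim = np.array([[1.0, 0.4],
                        [0.4, 1.0]])
assoc = np.array([[1, 0],
                  [0, 0],
                  [0, 1]])

scores = katz_scores(mirna_sim, disease_sim, assoc)
print(scores.shape)  # (3, 2)
```

In this toy example the second microRNA has no known association, yet it receives nonzero scores because walks through similar microRNAs reach both diseases. Candidate pairs would then be ranked by score.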
Year: 2015 PMID: 26273645 PMCID: PMC4529919 DOI: 10.1155/2015/810514
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Distribution of the three datasets.
| Dataset | Matrix size | Entries with score > 0 |
|---|---|---|
| MicroRNA-microRNA association dataset | 271 × 271 | 56289 |
| Disease-disease association dataset | 5080 × 5080 | 20285172 |
| MicroRNA-disease association dataset | 271 × 5080 | 242 |
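As a rough illustration of how sparse the association data is compared with the two similarity networks, the sketch below computes the fraction of nonzero entries per matrix from the figures in the table. It assumes the third column counts matrix entries with a score above zero; the variable names are hypothetical.

```python
# (rows, columns, nonzero entries) as listed in the dataset table.
datasets = {
    "microRNA-microRNA": (271, 271, 56289),
    "disease-disease": (5080, 5080, 20285172),
    "microRNA-disease": (271, 5080, 242),
}

# Density = nonzero entries divided by the total number of matrix cells.
density = {
    name: nonzero / (rows * cols)
    for name, (rows, cols, nonzero) in datasets.items()
}

for name, value in density.items():
    print(f"{name}: {value:.6f}")
```

Under that assumption, the two similarity matrices are mostly filled, while fewer than 0.02% of the microRNA-disease cells hold a known association.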
Figure 1. Bipartite graph of the microRNA-disease association network.
Figure 2. Degree distributions of microRNAs and diseases in the bipartite graph of the microRNA-disease association network.
Statistical data for the bipartite graph of the microRNA-disease association network.
| Title | Number |
|---|---|
| MicroRNAs | 271 |
| Diseases | 5080 |
| Known-associating microRNAs | 99 |
| Known-associating diseases | 51 |
| Known-associations | 242 |
| Average number of microRNA degrees | 2.44 |
| Average number of disease degrees | 4.75 |
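The two averages in the table appear to be consistent with dividing the number of known associations by the number of nodes that have at least one association, rather than by all 271 microRNAs or 5080 diseases. The short check below works under that assumption; the variable names are illustrative.

```python
known_associations = 242
associated_mirnas = 99     # microRNAs with at least one known association
associated_diseases = 51   # diseases with at least one known association

# Each association contributes one edge endpoint on each side of the bipartite graph.
avg_mirna_degree = known_associations / associated_mirnas
avg_disease_degree = known_associations / associated_diseases

print(round(avg_mirna_degree, 2), round(avg_disease_degree, 2))  # 2.44 4.75
```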
Figure 3. Unweighted, undirected graph.
Figure 4. ROC curves of KATZ and CATAPULT methods by leave-one-out cross-validation.
Distribution of diseases on the basis of microRNAs.
| Number of microRNAs | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 8 | 9 | 10 | 12 | 15 | 20 | 24 | 27 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Number of diseases | 5029 | 16 | 10 | 6 | 3 | 3 | 2 | 4 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
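The distribution can be cross-checked against the summary statistics reported for the bipartite graph. The sketch below simply re-adds the table's columns; the dictionary name is an assumption for illustration.

```python
# Number of associated microRNAs -> number of diseases with that count.
distribution = {
    0: 5029, 1: 16, 2: 10, 3: 6, 4: 3, 5: 3, 6: 2, 8: 4,
    9: 1, 10: 1, 12: 1, 15: 1, 20: 1, 24: 1, 27: 1,
}

total_diseases = sum(distribution.values())
associated_diseases = sum(n for k, n in distribution.items() if k > 0)
total_associations = sum(k * n for k, n in distribution.items())

print(total_diseases, associated_diseases, total_associations)  # 5080 51 242
```

The totals match the 5080 diseases, 51 known-associating diseases, and 242 known associations listed earlier.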
Figure 6. Recovery of microRNA-disease associations with respect to disease rank under leave-one-out cross-validation.
Figure 5. ROC curves of KATZ and CATAPULT methods by 3-fold cross-validation.
Comparison of different prediction methods based on AUC values.
| Method | MBSI | PBSI | NetCBI | KATZ | CATAPULT |
|---|---|---|---|---|---|
| AUC | 74.83% | 54.02% | 80.66% | 98.9% | 98.8% |
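For readers unfamiliar with the metric, AUC can be read as the probability that a randomly chosen known association is scored above a randomly chosen unknown pair. The sketch below computes it by direct pairwise comparison on toy data; it is a generic illustration, not the evaluation code behind the table, and the function name `auc` and the sample scores are assumptions.

```python
def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison (ties count as half)."""
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))

# Toy example: one positive is ranked below one negative.
print(auc([0.9, 0.2, 0.8, 0.1], [1, 1, 0, 0]))  # 0.75
```

The pairwise form is quadratic in the number of examples, so it suits small illustrations; a rank-based formulation would be preferable for matrices of the size described here.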
Top 10 newly predicted microRNA-disease associations by KATZ.
| Rank | MicroRNA | OMIM disease ID | Disease | Source |
|---|---|---|---|---|
| 1 | hsa-let-7i | 211980 | Lung cancer | HMDD |
| 2 | hsa-let-7d | 114480 | Breast cancer | HMDD |
| 3 | hsa-mir-145 | 211980 | Lung cancer | HMDD |
| 4 | hsa-mir-18a | 114480 | Breast cancer | HMDD |
| 5 | hsa-mir-145 | 114480 | Breast cancer | HMDD |
| 6 | hsa-mir-106b | 114480 | Breast cancer | HMDD |
| 7 | hsa-let-7e | 114480 | Breast cancer | HMDD |
| 8 | hsa-let-7b | 114480 | Breast cancer | HMDD |
| 9 | hsa-mir-19a | 114480 | Breast cancer | HMDD |
| 10 | hsa-mir-125a | 114480 | Breast cancer | HMDD |
Top 10 newly predicted microRNA-disease associations by CATAPULT.
| Rank | MicroRNA | OMIM disease ID | Disease | Source |
|---|---|---|---|---|
| 1 | hsa-let-7a | 176807 | Prostate cancer | miR2Disease |
| 2 | hsa-mir-34a | 114480 | Breast cancer | HMDD |
| 3 | hsa-mir-21 | 211980 | Lung cancer | HMDD |
| 4 | hsa-let-7c | 114480 | Breast cancer | HMDD |
| 5 | hsa-mir-19a | 114480 | Breast cancer | HMDD |
| 6 | hsa-let-7a | 151400 | Chronic lymphocytic leukemia | miR2Disease |
| 7 | hsa-mir-29b | 114480 | Breast cancer | miR2Disease |
| 8 | hsa-mir-146a | 211980 | Lung cancer | HMDD |
| 9 | hsa-mir-155 | 211980 | Lung cancer | HMDD |
| 10 | hsa-let-7c | 114550 | Hepatocellular carcinoma | miR2Disease |