| Literature DB >> 31455026 |
Ping Xuan1, Lingling Li1, Tiangang Zhang2, Yan Zhang1, Yingying Song1.
Abstract
Identifying disease-associated microRNAs (disease miRNAs) contributes to the understanding of disease pathogenesis. Most previous computational biology studies focused on multiple kinds of connecting edges of miRNAs and diseases, including miRNA-miRNA similarities, disease-disease similarities, and miRNA-disease associations. Few methods exploited the node attribute information related to miRNA family and cluster. The previous methods do not completely consider the sparsity of node attributes. Additionally, it is challenging to deeply integrate the node attributes of miRNAs and the similarities and associations related to miRNAs and diseases. In the present study, we propose a novel method, known as MDAPred, based on nonnegative matrix factorization to predict candidate disease miRNAs. MDAPred integrates the node attributes of miRNAs and the related similarities and associations of miRNAs and diseases. Since a miRNA is typically subordinate to a family or a cluster, the node attributes of miRNAs are sparse. Similarly, the data for miRNA and disease similarities are sparse. Projecting the miRNA and disease similarities and miRNA node attributes into a common low-dimensional space contributes to estimating miRNA-disease associations. Simultaneously, the possibility that a miRNA is associated with a disease depends on the miRNA's neighbour information. Therefore, MDAPred deeply integrates projections of multiple kinds of connecting edges, projections of miRNAs node attributes, and neighbour information of miRNAs. The cross-validation results showed that MDAPred achieved superior performance compared to other state-of-the-art methods for predicting disease-miRNA associations. MDAPred can also retrieve more actual miRNA-disease associations at the top of prediction results, which is very important for biologists. Additionally, case studies of breast, lung, and pancreatic cancers further confirmed the ability of MDAPred to discover potential miRNA-disease associations.Entities:
Keywords: low-dimensional feature vector; miRNA–disease associations; nonnegative matrix factorization; projection of connecting edges; projection of node attributes
Year: 2019 PMID: 31455026 PMCID: PMC6749327 DOI: 10.3390/molecules24173099
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Receiver operating characteristic (ROC) and precision-recall (PR) curves of MDAPred and the other five methods. (A) ROC curves (B) PR curves.
Areas under the ROC curves (AUCs) of MDAPred and other methods on 15 diseases.
| Disease Name | AUC | |||||
|---|---|---|---|---|---|---|
| MDAPred | DMPred | PBMDA | GSTRW | Liu’s Method | BNPMDA | |
| Breast neoplasms |
| 0.974 | 0.906 | 0.837 | 0.920 | 0.902 |
| Hepatocellular carcinoma |
| 0.931 | 0.910 | 0.791 | 0.929 | 0.900 |
| Glioma |
| 0.855 | 0.882 | 0.786 | 0.914 | 0.843 |
| Acute myeloid leukemia |
| 0.963 | 0.885 | 0.796 | 0.910 | 0.865 |
| Lung neoplasms |
| 0.944 | 0.862 | 0.813 | 0.906 | 0.855 |
| Melanoma |
| 0.910 | 0.849 | 0.758 | 0.893 | 0.839 |
| Osteosarcoma | 0.968 |
| 0.860 | 0.771 | 0.897 | 0.859 |
| Ovarian neoplasms |
| 0.967 | 0.888 | 0.844 | 0.918 | 0.877 |
| Pancreatic neoplasms |
| 0.821 | 0.879 | 0.833 | 0.902 | 0.870 |
| Alzheimer Disease |
| 0.958 | 0.833 | 0.816 | 0.875 | 0.830 |
| Carcinoma, Renal Cell |
| 0.894 | 0.856 | 0.784 | 0.900 | 0.854 |
| Diabetes Mellitus, Type 2 |
| 0.936 | 0.870 | 0.870 | 0.905 | 0.869 |
| Glioblastoma | 0.938 |
| 0.849 | 0.759 | 0.889 | 0.843 |
| Heart failure |
| 0.959 | 0.884 | 0.814 | 0.909 | 0.882 |
| Atherosclerosis |
| 0.955 | 0.891 | 0.822 | 0.910 | 0.876 |
| Average AUC |
| 0.933 | 0.873 | 0.806 | 0.904 | 0.839 |
The bold values indicate the higher AUCs.
AUPRs of MDAPred and other methods on 15 diseases.
| Disease Name | AUPR | |||||
|---|---|---|---|---|---|---|
| MDAPred | DMPred | PBMDA | GSTRW | Liu’s Method | BNPMDA | |
| Breast neoplasms |
| 0.800 | 0.718 | 0.389 | 0.725 | 0.566 |
| Hepatocellular carcinoma |
| 0.715 | 0.767 | 0.483 | 0.749 | 0.676 |
| Glioma |
| 0.175 | 0.390 | 0.224 | 0.436 | 0.386 |
| Acute myeloid leukemia |
| 0.466 | 0.386 | 0.122 | 0.408 | 0.324 |
| Lung neoplasms |
| 0.620 | 0.561 | 0.370 | 0.596 | 0.542 |
| Melanoma |
| 0.366 | 0.482 | 0.205 | 0.524 | 0.491 |
| Osteosarcoma | 0.601 |
| 0.356 | 0.181 | 0.373 | 0.327 |
| Ovarian neoplasms |
| 0.366 | 0.529 | 0. 400 | 0.236 | 0.496 |
| Pancreatic neoplasms |
| 0.569 | 0.457 | 0.333 | 0.556 | 0.478 |
| Alzheimer Disease |
| 0.351 | 0.136 | 0.086 | 0.485 | 0.220 |
| Carcinoma, Renal Cell |
| 0.206 | 0.314 | 0.135 | 0.143 | 0.299 |
| Diabetes Mellitus, Type 2 |
| 0.398 | 0.259 | 0.132 | 0.356 | 0.268 |
| Glioblastoma |
| 0.284 | 0.346 | 0.161 | 0.303 | 0.336 |
| Heart failure |
| 0.393 | 0.301 | 0.134 | 0.348 | 0.300 |
| Atherosclerosis |
| 0.309 | 0.304 | 0.084 | 0.297 | 0.218 |
| Average PR |
| 0.500 | 0.436 | 0.233 | 0.463 | 0.359 |
The bold values indicate the higher AUPRs.
Figure 2Recall rates of 15 diseases under different top k.
Comparison of different methods based on AUCs with a paired t-test.
| DMPred | PBMDA | GSTRW | Liu’s Method | BNPMDA | |
|---|---|---|---|---|---|
| 2.4983 × 10−41 | 3.2311 × 10−5 | 6.3212 × 10−16 | 6.9812 × 10−8 | 2.9742 × 10−6 | |
| 2.2341 × 10−35 | 1.8643 × 10−6 | 1.6542 × 10−6 | 3.4521 × 10−5 | 8.8432 × 10−4 |
The top 50 breast cancer-related candidates.
| Rank | MiRNA name | Evidence | Rank | MiRNA name | Description |
|---|---|---|---|---|---|
| 1 | hsa-mir-186 | dbDEMC, PhenomiR | 26 | hsa-mir-885 | literature [ |
| 2 | hsa-mir-99b | dbDEMC, PhenomiR | 27 | hsa-mir-6838 | Unconfirmed |
| 3 | hsa-mir-483 | PhenomiR | 28 | hsa-mir-323a | dbDEMC, PhenomiR |
| 4 | hsa-mir-4480 | literature [ | 29 | hsa-mir-1244 | dbDEMC |
| 5 | hsa-mir-181d | dbDEMC, PhenomiR, miRCancer | 30 | hsa-mir-361 | PhenomiR, miRCancer |
| 6 | hsa-mir-28 | dbDEMC, PhenomiR | 31 | hsa-mir-216a | dbDEMC, PhenomiR, miRCancer |
| 7 | hsa-mir-455 | PhenomiR, miRCancer | 32 | hsa-mir-136 | dbDEMC, PhenomiR |
| 8 | hsa-mir-154 | dbDEMC, PhenomiR, miRCancer | 33 | hsa-mir-569 | literature [ |
| 9 | hsa-mir-330 | dbDEMC, PhenomiR, miRCancer | 34 | hsa-mir-336 | dbDEMC |
| 10 | hsa-mir-454 | dbDEMC, PhenomiR | 35 | hsa-mir-325 | dbDEMC, PhenomiR |
| 11 | hsa-mir-181 | dbDEMC, PhenomiR, miRCancer | 36 | hsa-mir-571 | dbDEMC |
| 12 | hsa-mir-208b | dbDEMC, PhenomiR | 37 | hsa-mir-95 | dbDEMC, PhenomiR |
| 13 | hsa-mir-663 | dbDEMC, PhenomiR | 38 | hsa-mir-517b | dbDEMC, PhenomiR, miRCancer |
| 14 | hsa-mir-133 | dbDEMC, PhenomiR, miRCancer | 39 | hsa-mir-323 | dbDEMC, PhenpmiR |
| 15 | hsa-mir-30 | dbDEMC, PhenomiR, miRCancer | 40 | hsa-mir-633 | dbDEMC |
| 16 | hsa-mir-504 | dbDEMC | 41 | hsa-mir-1183 | dbDEMC |
| 17 | hsa-mir-543 | dbDEMC | 42 | hsa-mir-4454 | literature [ |
| 18 | hsa-mir-217 | dbDEMC, PhenomiR, miRCancer | 43 | hsa-mir-705 | dbDEMC |
| 19 | hsa-mir-33 | dbDEMC, PhenomiR, miRCancer | 44 | hsa-mir-532 | dbDEMC, PhenomiR |
| 20 | hsa-mir-211 | dbDEMC, PhenomiR, miRCancer | 45 | hsa-mir-126a | dbDEMC, miRCancer |
| 21 | hsa-mir-449b | dbDEMC, PhenomiR, miRCancer | 46 | hsa-mir-1909 | dbDEMC |
| 22 | hsa-mir-362 | miRCancer | 47 | hsa-mir-539 | dbDEMC, PhenomiR, miRCancer |
| 23 | hsa-mir-208 | dbDEMC, PhenomiR | 48 | hsa-mir-520f | PhenomiR, miRCancer |
| 24 | hsa-mir-433 | dbDEMC, PhenomiR, miRCancer | 49 | hsa-mir-498 | miRCancer |
| 25 | hsa-mir-520e | dbDEMC, PhenomiR, miRCancer | 50 | hsa-mir-3135b | literature [ |
Figure 3Multiple data representations of miRNAs and diseases: (a) calculate miRNA similarities through miRNA–associated diseases, (b) calculate the similarities of disease by combining disease semantic similarities and disease phenotypic similarities, (c) establish association matrix A based on known associations between miRNAs and diseases, and (d) create a representation matrix of miRNA families and clusters.
Figure 4Iterative algorithm for estimation of the miRNA–disease association scores.