| Literature DB >> 31510919 |
Yuchong Gong1, Yanqing Niu2, Wen Zhang3, Xiaohong Li4.
Abstract
BACKGROUND: MiRNAs play significant roles in many fundamental and important biological processes, and predicting potential miRNA-disease associations makes contributions to understanding the molecular mechanism of human diseases. Existing state-of-the-art methods make use of miRNA-target associations, miRNA-family associations, miRNA functional similarity, disease semantic similarity and known miRNA-disease associations, but the known miRNA-disease associations are not well exploited.Entities:
Keywords: Network embedding; Random forest; miRNA-disease associations
Mesh:
Substances:
Year: 2019 PMID: 31510919 PMCID: PMC6740005 DOI: 10.1186/s12859-019-3063-3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Performance of NEMII based on different feature combinations
| AUPR | AUC | F1 | ACC | REC | SPEC | PRE | |
|---|---|---|---|---|---|---|---|
| combination 1 | 0.6036 ± 0.0018 | 0.9252 ± 0.0014 | 0.6072 ± 0.0020 | 0.9955 ± 0.0001 | 0.4860 ± 0.0052 | 0.9992 ± 0.0001 | 0.8128 ± 0.0158 |
| combination 2 | 0.2630 ± 0.0032 | 0.7890 ± 0.0056 | 0.3338 ± 0.0032 | 0.9933 ± 0.0000 | 0.2360 ± 0.0025 | 0.9987 ± 0.0000 | 0.5681 ± 0.0058 |
| combination 3 | 0.6086 ± 0.0015 | 0.9284 ± 0.0012 | 0.6129 ± 0.0031 | 0.9956 ± 0.0001 | 0.4887 ± 0.0069 | 0.9992 ± 0.0001 | 0.8247 ± 0.0160 |
| combination 4 | 0.6085 ± 0.0024 | 0.9262 ± 0.0018 | 0.6115 ± 0.0026 | 0.9956 ± 0.0000 | 0.4836 ± 0.0055 | 0.9993 ± 0.0001 | 0.8366 ± 0.0105 |
| combination 5 | 0.6104 ± 0.0012 | 0.9293 ± 0.0017 | 0.6147 ± 0.0025 | 0.9956 ± 0.0001 | 0.4893 ± 0.0060 | 0.9993 ± 0.0001 | 0.8289 ± 0.0164 |
* combination 1: SDNE feature alone
* combination 2: miRNA-family feature and disease similarity feature
* combination 3: SDNE feature and miRNA-family feature
* combination 4: SDNE feature and disease similarity feature
* combination 5: SDNE feature, miRNA-family feature and disease similarity feature
Performances of NEMII on datasets with fewer associations
| Ratio | AUPR | AUC | F1 | ACC | REC | SPEC | PRE |
|---|---|---|---|---|---|---|---|
| 0% | 0.6104 ± 0.0012 | 0.9293 ± 0.0017 | 0.6147 ± 0.0025 | 0.9956 ± 0.0001 | 0.4893 ± 0.0060 | 0.9993 ± 0.0001 | 0.8289 ± 0.0164 |
| 10% | 0.6001 ± 0.0018 | 0.9276 ± 0.0011 | 0.6045 ± 0.0037 | 0.9969 ± 0.0001 | 0.4811 ± 0.0051 | 0.9993 ± 0.0001 | 0.8176 ± 0.0168 |
| 20% | 0.5956 ± 0.0030 | 0.9266 ± 0.0014 | 0.6036 ± 0.0040 | 0.9965 ± 0.0000 | 0.4738 ± 0.0091 | 0.9995 ± 0.0001 | 0.8354 ± 0.0169 |
| 30% | 0.5863 ± 0.0026 | 0.9255 ± 0.0010 | 0.5946 ± 0.0036 | 0.9960 ± 0.0001 | 0.4620 ± 0.0074 | 0.9995 ± 0.0001 | 0.8390 ± 0.0290 |
Fig. 1AUPR and AUC of embedding methods based on different dimensions
Performance of models based on different classifiers
| Classifiers | AUPR | AUC | F1 | ACC | REC | SPEC | PRE |
|---|---|---|---|---|---|---|---|
| RF | 0.6104 ± 0.0012 | 0.9293 ± 0.0017 | 0.6147 ± 0.0025 | 0.9956 ± 0.0001 | 0.4893 ± 0.006 | 0.9993 ± 0.0001 | 0.8289 ± 0.0164 |
| NB | 0.1846 ± 0.0008 | 0.9103 ± 0.0089 | 0.2528 ± 0.0028 | 0.9892 ± 0.0004 | 0.2572 ± 0.0124 | 0.9944 ± 0.0005 | 0.2532 ± 0.0056 |
| LR | 0.2129 ± 0.0008 | 0.9023 ± 0.0008 | 0.2734 ± 0.0017 | 0.9884 ± 0.0004 | 0.3078 ± 0.0094 | 0.9933 ± 0.0005 | 0.2480 ± 0.0096 |
| SVM | 0.0968 ± 0.0034 | 0.9021 ± 0.0010 | 0.1718 ± 0.0036 | 0.9740 ± 0.0012 | 0.3761 ± 0.0144 | 0.9783 ± 0.0013 | 0.1121 ± 0.0037 |
| weighted RF | 0.5944 ± 0.0014 | 0.9336 ± 0.0014 | 0.5920 ± 0.0025 | 0.9953 ± 0.0001 | 0.4741 ± 0.0085 | 0.9991 ± 0.0001 | 0.7913 ± 0.0233 |
Performances of NEMII, PBMDA, NTSMDA and GRNMF
| Methods | AUPR | AUC | F1 | ACC | REC | SPEC | PRE |
|---|---|---|---|---|---|---|---|
| NEMII | 0.6104 ± 0.0012 | 0.9293 ± 0.0017 | 0.6147 ± 0.0025 | 0.9956 ± 0.0001 | 0.4893 ± 0.0060 | 0.9993 ± 0.0001 | 0.8289 ± 0.0164 |
| PBMDA | 0.2095 ± 0.0015 | 0.9164 ± 0.0005 | 0.2676 ± 0.0021 | 0.9892 ± 0.0005 | 0.2759 ± 0.0139 | 0.9944 ± 0.0006 | 0.2642 ± 0.0103 |
| NTSMDA | 0.0916 ± 0.0012 | 0.8857 ± 0.0009 | 0.1410 ± 0.0013 | 0.9740 ± 0.0015 | 0.2988 ± 0.0171 | 0.9788 ± 0.0017 | 0.0931 ± 0.0020 |
| GRNMF | 0.2446 ± 0.0024 | 0.9128 ± 0.0008 | 0.3192 ± 0.0137 | 0.9945 ± 0.0005 | 0.2989 ± 0.0127 | 0.9897 ± 0.0004 | 0.3066 ± 0.0016 |
Fig. 2Performances of different methods on predicting miRNAs associated with three diseases
Fig. 3Performances of recovering associations of NEMII and other approaches
The top 10 miRNA-disease associations predicted by our method
| miRNA | Disease | Rank | Evidence |
|---|---|---|---|
| hsa-let-7c | Crohn Disease | 1 | N.A. |
| hsa-let-7c | Gastritis, Atrophic | 2 | [ |
| hsa-let-7e | Lymphoproliferative Disorders | 3 | N.A. |
| hsa-let-7e | Giant Cell Tumors | 4 | N.A. |
| hsa-mir-103a-2 | Myelodysplastic Syndromes | 5 | [ |
| hsa-let-7e | Biliary Atresia | 6 | [ |
| hsa-mir-10a | Carotid Artery Diseases | 7 | N.A. |
| hsa-mir-10b | Eczema | 8 | N.A. |
| hsa-mir-1179 | Breast Neoplasms | 9 | [ |
| hsa-mir-1179 | Carcinoma, Hepatocellular | 10 |
|
Predicted miRNAs associated with three diseases
| Disease | miRNA | Rank | Evidence |
|---|---|---|---|
| breast neoplasms | hsa-mir-1179 | 1 | [ |
| hsa-mir-1180 | 2 | [ | |
| hsa-mir-106a | 3 | [ | |
| hsa-mir-377 | 4 | N.A. | |
| hsa-mir-1909 | 5 | N.A. | |
| hsa-mir-181c | 6 | N.A. | |
| hsa-mir-1202 | 7 | N.A. | |
| hsa-mir-1296 | 8 | [ | |
| hsa-mir-2110 | 9 | N.A. | |
| hsa-mir-711 | 10 | [ | |
| lung neoplasms | hsa-mir-1180 | 1 | N.A. |
| hsa-mir-1179 | 2 | [ | |
| hsa-mir-376c | 3 | [ | |
| hsa-mir-500b | 4 | N.A. | |
| hsa-mir-1293 | 5 | [ | |
| hsa-mir-296 | 6 | [ | |
| hsa-mir-1183 | 7 | N.A. | |
| hsa-mir-99b | 8 | [ | |
| hsa-mir-298 | 9 | N.A. | |
| hsa-mir-2110 | 10 | N.A. | |
| prostatic neoplasms | hsa-mir-103a-2 | 1 | N.A. |
| hsa-mir-1179 | 2 | [ | |
| hsa-mir-10b | 3 | N.A. | |
| hsa-mir-10a | 4 | [ | |
| hsa-mir-1180 | 5 | [ | |
| hsa-mir-147a | 6 | N.A. | |
| hsa-mir-217 | 7 | N.A. | |
| hsa-mir-125a | 8 | [ | |
| hsa-mir-624 | 9 | N.A. | |
| hsa-mir-630 | 10 | N.A. |
Fig. 4Pipeline of NEMII (DSS: Disease Semantic Similarity, SDNE: Structural Deep Network Embedding, DAG: Directed Acyclic Graph)