| Literature DB >> 34041455 |
Bo-Ya Ji1,2,3, Zhu-Hong You1,2,3, Yi Wang1,3, Zheng-Wei Li4, Leon Wong1,2,3.
Abstract
Predicting the microRNA-disease associations by using computational methods is conductive to the efficiency of costly and laborious traditional bio-experiments. In this study, we propose a computational machine learning-based method (DANE-MDA) that preserves integrated structure and attribute features via deep attributed network embedding to predict potential miRNA-disease associations. Specifically, the integrated features are extracted by using deep stacked auto-encoder on the diverse orders of matrixes containing structure and attribute information and are then trained by using random forest classifier. Under 5-fold cross-validation experiments, DANE-MDA yielded average accuracy, sensitivity, and AUC at 85.59%, 84.23%, and 0.9264 in term of HMDD v3.0 dataset, and 83.21%, 80.39%, and 0.9113 in term of HMDD v2.0 dataset, respectively. Additionally, case studies on breast, colon, and lung neoplasms related disease show that 47, 47, and 46 of the top 50 miRNAs can be predicted and retrieved in the other database.Entities:
Keywords: Cancer; Computational bioinformatics; Systems biology
Year: 2021 PMID: 34041455 PMCID: PMC8141887 DOI: 10.1016/j.isci.2021.102455
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1Illustration of the overall framework of DANE-MDA (DAG: directed acyclic graph; DSS: disease semantic similarity)
The results of DANE-MDA under 5-fold cross-validation based on the HMDD v3.0 dataset
| Fold | ACC.(%) | AUC(%) | Sen.(%) | Prec.(%) | Spec.(%) | MCC(%) |
|---|---|---|---|---|---|---|
| 0 | 85.10 | 92.56 | 83.32 | 86.40 | 86.88 | 70.25 |
| 1 | 85.94 | 92.89 | 84.57 | 86.95 | 87.31 | 71.91 |
| 2 | 85.38 | 92.32 | 83.48 | 86.78 | 87.28 | 70.81 |
| 3 | 85.59 | 92.80 | 84.88 | 86.11 | 86.31 | 71.19 |
| 4 | 85.96 | 92.66 | 84.89 | 86.74 | 87.02 | 71.93 |
The last line represents the average and standard deviation of each indicator.
The results of DANE-MDA under 5-fold cross-validation based on the HMDD v2.0 dataset
| Fold | ACC.(%) | AUC(%) | Sen.(%) | Prec.(%) | Spec.(%) | MCC(%) |
|---|---|---|---|---|---|---|
| 0 | 84.53 | 92.22 | 79.65 | 88.27 | 89.41 | 69.39 |
| 1 | 81.86 | 90.17 | 79.56 | 83.40 | 84.16 | 63.79 |
| 2 | 83.89 | 91.48 | 80.02 | 86.73 | 87.75 | 67.98 |
| 3 | 83.93 | 91.17 | 81.49 | 85.67 | 86.37 | 67.94 |
| 4 | 81.86 | 90.61 | 81.22 | 82.28 | 82.50 | 63.73 |
The last line represents the average and standard deviation of each indicator.
Figure 2The ROC curves of DANE-MDA under 5-fold cross validation based on HMDD v3.0 dataset
Figure 3The ROC curves of DANE-MDA under 5-fold cross validation based on HMDD v2.0 dataset
Figure 4The PR curves of DANE-MDA under 5-fold cross validation based on HMDD v3.0 dataset
Figure 5The PR curves of DANE-MDA under 5-fold cross validation based on HMDD v2.0 dataset
The AUC values of parameter α under each fold cross-validation (β = 0.94, t = 5)
| Fold | 0 | 1 | 2 | 3 | 4 | Average |
|---|---|---|---|---|---|---|
| 1 | 0.9169 | 0.9224 | 0.9149 | 0.9223 | 0.9171 | 0.9187 ± 0.34 |
| 0.95 | 0.9242 | 0.9263 | 0.9206 | 0.9269 | 0.9252 | 0.9246 ± 0.25 |
| 0.90 | 0.9211 | 0.9272 | 0.9230 | 0.9286 | 0.9215 | 0.9243 ± 0.34 |
| 0.80 | 0.9271 | 0.9277 | 0.9243 | 0.9270 | 0.9241 | 0.9261 ± 0.17 |
| 0.75 | 0.9262 | 0.9299 | 0.9224 | 0.9250 | 0.9261 | 0.9259 ± 0.27 |
| 0 | 0.8774 | 0.8849 | 0.8776 | 0.8791 | 0.8746 | 0.8787 ± 0.38 |
The AUC values of parameter β under each fold cross-validation (α = 0.85, t = 5)
| Fold | 0 | 1 | 2 | 3 | 4 | Average |
|---|---|---|---|---|---|---|
| 0.98 | 0.9274 | 0.9253 | 0.9208 | 0.9275 | 0.9222 | 0.9246 ± 0.30 |
| 0.96 | 0.9249 | 0.9312 | 0.9252 | 0.9279 | 0.9222 | 0.9263 ± 0.34 |
| 0.92 | 0.9249 | 0.9252 | 0.9221 | 0.9291 | 0.9243 | 0.9251 ± 0.25 |
| 0.90 | 0.9234 | 0.9268 | 0.9238 | 0.9279 | 0.9224 | 0.9249 ± 0.24 |
The AUC values of parameter t under each fold cross-validation ()
| Fold | 0 | 1 | 2 | 3 | 4 | Average |
|---|---|---|---|---|---|---|
| 1 | 0.9247 | 0.9260 | 0.9210 | 0.9290 | 0.9193 | 0.9240 ± 0.39 |
| 3 | 0.9255 | 0.9286 | 0.9236 | 0.9250 | 0.9249 | 0.9255 ± 0.19 |
| 7 | 0.9234 | 0.9282 | 0.9213 | 0.9307 | 0.9223 | 0.9252 ± 0.41 |
| 9 | 0.9264 | 0.9277 | 0.9202 | 0.9292 | 0.9234 | 0.9254 ± 0.36 |
Figure 6The line graph of average AUC results at different α values of DANE-MDA
Figure 7The line graph of average AUC results at different β values of DANE-MDA
Figure 8The line graph of average AUC results at different t values of DANE-MDA
The average results and standard deviations of DANE-MDA with different feature combinations under 5-fold cross-validation
| Feature | Acc.(%) | AUC(%) | Sen.(%) | Prec.(%) | Spec.(%) | MCC(%) |
|---|---|---|---|---|---|---|
| Only attribute | 81.01 ± 0.28 | 87.87 ± 0.38 | 81.86 ± 0.91 | 80.49 ± 0.37 | 80.15 ± 0.63 | 62.03 ± 0.58 |
| Only structure | 84.76 ± 0.21 | 91.87 ± 0.34 | 83.39 ± 0.39 | 85.75 ± 0.31 | 86.14 ± 0.38 | 69.55 ± 0.42 |
Figure 9The average ROC and PR curves of DANE-MDA with different feature combinations under 5-fold cross-validation
The average results and standard deviations of DANE-MDA with different classifiers under 5-fold cross-validation
| Classifier | ACC.(%) | AUC(%) | Sen.(%) | Prec.(%) | Spec.(%) | MCC(%) |
|---|---|---|---|---|---|---|
| KNN | 82.69 ± 0.30 | 89.68 ± 0.39 | 91.39 ± 0.39 | 77.85 ± 0.27 | 74.00 ± 0.35 | 66.39 ± 0.61 |
| Naive Bayes | 78.02 ± 0.44 | 79.57 ± 0.33 | 91.77 ± 0.43 | 71.97 ± 0.35 | 64.27 ± 0.46 | 58.28 ± 0.90 |
| AdaBoost | 83.56 ± 0.58 | 91.47 ± 0.22 | 85.41 ± 0.75 | 82.36 ± 0.68 | 81.70 ± 0.83 | 67.16 ± 1.16 |
Figure 10The average ROC and PR curves of DANE-MDA with different classifiers under 5-fold cross-validation
Comparison of the average AUC value of DANE-MDA and different models based on HMDD v3.0 dataset
| Models | Average AUC (%) |
|---|---|
| DBMDA | 91.29 |
| WBSMDA | 81.85 |
| PBMDA | 91.72 |
| HDMP | 83.42 |
| RLSMDA | 85.69 |
Comparison of the average AUC value of DANE-MDA and different models based on HMDD v2.0 dataset
| Models | Average AUC (%) |
|---|---|
| TLHNMDA | 87.95 |
| NCMCMDA | 89.42 |
| RFMDA | 88.18 |
| MDHGI | 87.94 |
The top 50 miRNA-colon neoplasm associations predicted by DANE-MDA
| Rank | miRNA | Evidence | Rank | miRNA | Evidence |
|---|---|---|---|---|---|
| 1 | hsa-miR-29c-5p | dbDemc | 26 | hsa-miR-199a-5p | dbDemc |
| 2 | hsa-miR-99b-5p | dbDemc | 27 | hsa-miR-19b-3p | dbDemc |
| 3 | hsa-miR-144-5p | dbDemc | 28 | hsa-miR-497-5p | dbDemc |
| 4 | hsa-miR-182-5p | dbDemc | 29 | hsa-miR-30e-5p | dbDemc |
| 5 | hsa-miR-92a-2-5p | dbDemce | 30 | hsa-miR-27b-5p | dbDemc |
| 6 | hsa-miR-338-5p | dbDemc | 31 | hsa-miR-206 | dbDemc |
| 7 | hsa-miR-422a | dbDemc; miR2Disease | 32 | hsa-miR-185-5p | dbDemc |
| 8 | hsa-miR-199b-5p | dbDemc | 33 | hsa-miR-425-5p | dbDemc |
| 9 | hsa-miR-378a-5p | dbDemc | 34 | hsa-miR-135a-5p | dbDemc |
| 10 | hsa-miR-373-5p | Unconfirmed | 35 | hsa-miR-491-5p | dbDemc |
| 11 | hsa-miR-451a | dbDemc | 36 | hsa-miR-340-5p | dbDemc |
| 12 | hsa-miR-29b-2-5p | dbDemc | 37 | hsa-miR-149-5p | dbDemc |
| 13 | hsa-miR-214-5p | dbDemc | 38 | hsa-miR-187-5p | dbDemc |
| 14 | hsa-miR-503-5p | dbDemc | 39 | hsa-miR-129-5p | dbDemc |
| 15 | hsa-miR-28-5p | dbDemc | 40 | hsa-miR-184 | dbDemc |
| 16 | hsa-miR-146b-5p | dbDemc | 41 | hsa-miR-95-5p | Unconfirmed |
| 17 | hsa-miR-590-5p | dbDemc | 42 | hsa-miR-7-2-3p | Unconfirmed |
| 18 | hsa-miR-342-5p | dbDemc | 43 | hsa-miR-7-1-3p | dbDemc |
| 19 | hsa-miR-193a-5p | dbDemc | 44 | hsa-miR-582-5p | dbDemc |
| 20 | hsa-miR-421 | dbDemc | 45 | hsa-miR-16-5p | dbDemc |
| 21 | hsa-miR-186-5p | dbDemc | 46 | hsa-miR-10a-5p | dbDemc |
| 22 | hsa-miR-26a-5p | dbDemc | 47 | hsa-miR-181a-2-3p | dbDemc |
| 23 | hsa-miR-26b-5p | dbDemc | 48 | hsa-miR-423-5p | dbDemc |
| 24 | hsa-miR-124-5p | dbDemc | 49 | hsa-miR-181c-5p | dbDemc |
| 25 | hsa-miR-122-5p | dbDemc | 50 | hsa-miR-20b-5p | dbDemc |
The top 50 miRNA-breast neoplasm associations predicted by DANE-MDA
| Rank | miRNA | Evidence | Rank | miRNA | Evidence |
|---|---|---|---|---|---|
| 1 | hsa-miR-15a-5p | dbDemc | 26 | hsa-miR-582-5p | dbDemc |
| 2 | hsa-miR-181d-5p | dbDemc | 27 | hsa-miR-1271-5p | dbDemc |
| 3 | hsa-miR-99b-5p | dbDemc | 28 | hsa-miR-1231 | dbDemc |
| 4 | hsa-miR-500a-5p | dbDemc | 29 | hsa-miR-589-5p | dbDemc |
| 5 | hsa-miR-637 | dbDemce | 30 | hsa-miR-650 | dbDemc |
| 6 | hsa-miR-454-5p | dbDemc | 31 | hsa-miR-376a-2-5p | Unconfirmed |
| 7 | hsa-miR-646 | dbDemc | 32 | hsa-miR-323b-5p | dbDemc |
| 8 | hsa-miR-767-5p | dbDemc | 33 | hsa-miR-384 | dbDemc |
| 9 | hsa-miR-28-5p | dbDemc | 34 | hsa-miR-543 | dbDemc |
| 10 | hsa-miR-382-5p | dbDemc | 35 | hsa-miR-302e | dbDemc |
| 11 | hsa-miR-508-5p | dbDemc | 36 | hsa-miR-19b-2-5p | dbDemc |
| 12 | hsa-miR-211-5p | dbDemc | 37 | hsa-miR-337-5p | dbDemc |
| 13 | hsa-miR-431-5p | dbDemc | 38 | hsa-miR-557 | dbDemc |
| 14 | hsa-miR-532-5p | dbDemc | 39 | hsa-miR-602 | dbDemc |
| 15 | hsa-miR-483-5p | dbDemc | 40 | hsa-miR-154-5p | dbDemc |
| 16 | hsa-miR-1297 | dbDemc | 41 | hsa-miR-361-5p | dbDemc |
| 17 | hsa-miR-519a-5p | Unconfirmed | 42 | hsa-miR-4732-5p | dbDemc |
| 18 | hsa-miR-501-5p | dbDemc | 43 | hsa-miR-941 | dbDemc |
| 19 | hsa-miR-628-5p | dbDemc | 44 | hsa-miR-362-5p | dbDemc |
| 20 | hsa-miR-455-5p | dbDemc | 45 | hsa-miR-297 | dbDemc |
| 21 | hsa-miR-601 | dbDemc | 46 | hsa-miR-513c-5p | Unconfirmed |
| 22 | hsa-miR-622 | dbDemc | 47 | hsa-miR-571 | dbDemc |
| 23 | hsa-miR-422a | dbDemc | 48 | hsa-miR-544a | dbDemc |
| 24 | hsa-miR-300 | dbDemc | 49 | hsa-miR-636 | dbDemc |
| 25 | hsa-miR-325 | dbDemc | 50 | hsa-miR-3651 | dbDemc |
The top 50 miRNA-lung neoplasm associations predicted by DANE-MDA
| Rank | miRNA | Evidence | Rank | miRNA | Evidence |
|---|---|---|---|---|---|
| 1 | hsa-miR-15b-5p | dbDemc | 26 | hsa-miR-16-2-3p | dbDemc |
| 2 | hsa-miR-16-1-3p | dbDemc | 27 | hsa-miR-425-5p | dbDemc; miR2Disease |
| 3 | hsa-miR-518b | dbDemc | 28 | hsa-miR-484 | dbDemc |
| 4 | hsa-miR-642a-5p | dbDemc | 29 | hsa-miR-575 | dbDemc |
| 5 | hsa-miR-429 | dbDemc; miR2Disease | 30 | hsa-miR-452-5p | dbDemc |
| 6 | hsa-miR-106b-5p | dbDemc | 31 | hsa-miR-590-5p | dbDemc |
| 7 | hsa-miR-424-5p | dbDemc | 32 | hsa-miR-625-5p | dbDemc |
| 8 | hsa-miR-28-5p | dbDemc | 33 | hsa-miR-193b-5p | dbDemc |
| 9 | hsa-miR-382-5p | dbDemc | 34 | hsa-miR-302c-5p | Unconfirmed |
| 10 | hsa-miR-409-5p | dbDemc | 35 | hsa-miR-505-5p | dbDemc |
| 11 | hsa-miR-421 | dbDemc | 36 | hsa-miR-181b-5p | dbDemc |
| 12 | hsa-miR-532-5p | dbDemc | 37 | hsa-miR-708-5p | dbDemc |
| 13 | hsa-miR-483-5p | dbDemc | 38 | hsa-miR-1246 | dbDemc |
| 14 | hsa-miR-128-3p | dbDemc | 39 | hsa-miR-151a-5p | dbDemc |
| 15 | hsa-miR-491-5p | dbDemc | 40 | hsa-miR-376c-5p | dbDemc |
| 16 | hsa-miR-885-5p | dbDemc | 41 | hsa-miR-370-5p | dbDemc |
| 17 | hsa-miR-92b-5p | Unconfirmed | 42 | hsa-miR-298 | dbDemc |
| 18 | hsa-miR-509-5p | dbDemc | 43 | hsa-miR-23b-5p | dbDemc |
| 19 | hsa-miR-1307-5p | dbDemc | 44 | hsa-miR-628-5p | dbDemc |
| 20 | hsa-miR-455-5p | dbDemc | 45 | hsa-miR-539-5p | dbDemc |
| 21 | hsa-miR-489-5p | Unconfirmed | 46 | hsa-miR-711 | Unconfirmed |
| 22 | hsa-miR-422a | dbDemc | 47 | hsa-miR-1179 | dbDemc |
| 23 | hsa-miR-1271-5p | dbDemc | 48 | hsa-miR-1244 | dbDemc |
| 24 | hsa-miR-125b-2-3p | dbDemc | 49 | hsa-miR-339-5p | dbDemc |
| 25 | hsa-miR-181d-5p | dbDemc | 50 | hsa-miR-3613-5p | dbDemc |