| Literature DB >> 29101378 |
Laiyi Fu1, Qinke Peng2.
Abstract
Cumulative evidence from biological experiments has confirmed that microRNAs (miRNAs) are related to many types of human diseases through different biological processes. It is anticipated that precise miRNA-disease association prediction could not only help infer potential disease-related miRNA but also boost human diagnosis and disease prevention. Considering the limitations of previous computational models, a more effective computational model needs to be implemented to predict miRNA-disease associations. In this work, we first constructed a human miRNA-miRNA similarity network utilizing miRNA-miRNA functional similarity data and heterogeneous miRNA Gaussian interaction profile kernel similarities based on the assumption that similar miRNAs with similar functions tend to be associated with similar diseases, and vice versa. Then, we constructed disease-disease similarity using disease semantic information and heterogeneous disease-related interaction data. We proposed a deep ensemble model called DeepMDA that extracts high-level features from similarity information using stacked autoencoders and then predicts miRNA-disease associations by adopting a 3-layer neural network. In addition to five-fold cross-validation, we also proposed another cross-validation method to evaluate the performance of the model. The results show that the proposed model is superior to previous methods with high robustness.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29101378 PMCID: PMC5670180 DOI: 10.1038/s41598-017-15235-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Results on the SM miRNA datasets.
| Method | AUC | AUPR |
|---|---|---|
| RLSMDA | 0.7715 ± 0.027 | 0.0321 ± 0.004 |
| HGIMDA | 0.7040 ± 0.025 | 0.0402 ± 0.006 |
| NCPMDA | 0.7760 ± 0.012 | 0.0578 ± 0.006 |
| PBMDA | 0.8194 ± 0.032 | 0.2106 ± 0.017 |
| RKNNMDA | 0.5678 ± 0.016 | 0.1581 ± 0.038 |
| DeepMDA | 0.9126* ± 0.004 | 0.2297* ± 0.040 |
| SAE + ADA | 0.8996 ± 0.010 | 0.1859 ± 0.026 |
| RAW + DNN | 0.9102 ± 0.007 | 0.1991 ± 0.028 |
The AUC and AUPR scores are listed above. The * indicates the highest AUC/AUPR score. Generally, the three deep learning models performed better than other five models.
Results on the SM miRNA datasets.
| Method | AUC | AUPR |
|---|---|---|
| RLSMDA | 0.8325 ± 0.007 | 0.2457 ± 0.008 |
| HGIMDA | 0.7169 ± 0.005 | 0.1182 ± 0.004 |
| NCPMDA | 0.8849 ± 0.006 | 0.3473 ± 0.010 |
| PBMDA | 0.8925 ± 0.005 | 0.4867 ± 0.014 |
| RKNNMDA | 0.7044 ± 0.004 | 0.3365 ± 0.013 |
| DeepMDA | 0.9270* ± 0.005 | 0.5853* ± 0.031 |
| SAE + ADA | 0.8982 ± 0.005 | 0.4376 ± 0.024 |
| RAW + DNN | 0.9153 ± 0.004 | 0.5293 ± 0.026 |
The AUC and AUPR scores are listed above. The * indicates the highest AUC/AUPR score. Generally, the three deep learning models performed better than the other five models.
Results on the full miRNA-disease datasets.
| Method | AUC | AUPR |
|---|---|---|
| RLSMDA | 0.8475 ± 0.005 | 0.1157 ± 0.004 |
| HGIMDA | 0.7689 ± 0.011 | 0.1120 ± 0.007 |
| NCPMDA | 0.8731 ± 0.007 | 0.2801 ± 0.011 |
| PBMDA | 0.9086 ± 0.004 | 0.4378 ± 0.016 |
| RKNNMDA | 0.7076 ± 0.005 | 0.3534 ± 0.011 |
| DeepMDA | 0.9486* ± 0.002 | 0.5917* ± 0.014 |
| SAE + ADA | 0.9211 ± 0.002 | 0.4075 ± 0.011 |
| RAW + DNN | 0.9368 ± 0.001 | 0.4933 ± 0.014 |
The AUC and AUPR scores are listed above. The * indicates the highest AUC/AUPR score. Generally, the three deep learning models performed better than the other five models.
Comparison between DRMDA and DeepMDA on the full miRNA-disease datasets in five-fold cross validation.
| Method | AUC | AUPR |
|---|---|---|
| Single SAE + SVM(DRMDA) | 0.8812 ± 0.006 | 0.4614 ± 0.004 |
| Single SAE + DNN | 0.9394 ± 0.002 | 0.5003 ± 0.010 |
| DeepMDA | 0.9486* ± 0.002 | 0.5917* ± 0.014 |
The AUC and AUPR scores are listed above. Generally, DeepMDA performed better than the other two models.
Figure 1ROC curves and AUC values of eight different methods based on 5-fold cross-validation. The three deep learning models performed better than other 4 network-based methods in general. DeepMDA had the best performance in all the eight models.
Results on the full miRNA-disease datasets in LODOCV.
| Method | AUC | AUPR |
|---|---|---|
| RLSMDA | 0.8530 ± 0.133 | 0.2066 ± 0.240 |
| HGIMDA | 0.7616 ± 0.164 | 0.1025 ± 0.142 |
| NCPMDA | 0.6374 ± 0.220 | 0.0596 ± 0.104 |
| PBMDA | 0.6902 ± 0.223 | 0.2918* ± 0.289 |
| RKNNMDA | 0.5680 ± 0.131 | 0.2085 ± 0.273 |
| DeepMDA | 0.8729* ± 0.118 | 0.2556 ± 0.271 |
| SAE + ADA | 0.8552 ± 0.124 | 0.1914 ± 0.220 |
| RAW + DNN | 0.8633 ± 0.121 | 0.2180 ± 0.248 |
The AUC and AUPR scores are listed above. Generally, DeepMDA performed better than the other seven models in LODOCV.
Figure 2ROC curves and AUC values of eight different methods based on LODOCV. The three deep learning models performed better than other five network-based methods in general. DeepMDA had the best performance in all the eight models.
Results on the noisy miRNA-disease datasets.
| Method | AUC | AUPR |
|---|---|---|
| Noised DeepMDA | 0.9334 ± 0.005 | 0.4558 ± 0.019 |
| Noised SAE + ADA | 0.8235 ± 0.012 | 0.1613 ± 0.015 |
| Noised SAE + RF | 0.8122 ± 0.015 | 0.1320 ± 0.014 |
| Noised DRMDA | 0.7757 ± 0.044 | 0.1290 ± 0.034 |
| DeepMDA | 0.9486* ± 0.002 | 0.5917* ± 0.014 |
| SAE + ADA | 0.9211 ± 0.002 | 0.4075 ± 0.011 |
| SAE + RF | 0.9249 ± 0.003 | 0.5674 ± 0.012 |
| DRMDA | 0.8812 ± 0.006 | 0.4614 ± 0.004 |
The AUC and AUPR scores are listed above. Generally, DeepMDA performed better than other three models when adding noise.
Figure 3The flowchart of proposed DeepMDA. The miRNA similarity was integrated using miRNA-functional similarity and miRNA-disease association. As for disease similarity, we adopted DAG information and Gaussian interaction profile similarity information. The two input data was fed into two stacked autoencoders to learn high-level features, then merged and finally utilized a 3-layer network to infer the association between miRNAs and diseases.