| Literature DB >> 33087118 |
Minghui Liu1, Jingyi Yang1, Jiacheng Wang1, Lei Deng2,3.
Abstract
BACKGROUND: Studies have found that miRNAs play an important role in many biological activities involved in human diseases. Revealing the associations between miRNA and disease by biological experiments is time-consuming and expensive. The computational approaches provide a new alternative. However, because of the limited knowledge of the associations between miRNAs and diseases, it is difficult to support the prediction model effectively.Entities:
Keywords: Diffusion feature; HeteSim measure; eXtreme gradient boosting; miRNA-disease association
Year: 2020 PMID: 33087118 PMCID: PMC7579981 DOI: 10.1186/s12920-020-00783-0
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Performance comparison of different feature groups (Diffusion, HeteSim and combined feature)
Fig. 2Comparison result between XGBoost and other machine learning algorithms including PF, SVM and GTB
Fig. 3The ROC curves of MDAPCOM and previous approaches containing PRINCE and RWRMDA on the data sets. The ratio of positive and negative sample 1:1
Fig. 4The ROC curves of MDAPCOM and previous approaches containing PRINCE and RWRMDA on the data sets. The ratio of positive and negative sample 1:2
Fig. 5The ROC curves of MDAPCOM and previous approaches containing PRINCE and RWRMDA on the data sets. The ratio of positive and negative sample 1:5
Fig. 6The ROC curves of MDAPCOM and previous approaches containing PRINCE and RWRMDA on the data sets. The ratio of positive and negative sample 1:10
Fig. 7The flowchart of MDAPCOM: a Obtain six kinds of data from online databases. b Merge these data to build a global heterogeneous network c Utilize RWR algorithm to get the diffusion feature. d Apply SVD to reduce dimension of the diffusion feature. e Use HeteSim measure to obtain HeteSim feature. f Integrate reduced diffusion feature and HeteSim feature and then apply XGBoost algorithm to train the model using the combined feature
All paths less than 5 in length starting at miRNA and ending at disease. M is miRNA, P is protein and D is disease, for example, path1 MMD is the path miRNA-miRNA-disease
| id | path | id | path | id | path |
|---|---|---|---|---|---|
| 1 | MMD | 14 | MMMPD | 27 | MPPDD |
| 2 | MPD | 15 | MMMDD | 28 | MPDMD |
| 3 | MDD | 16 | MMPMD | 29 | MPDPD |
| 4 | MMMD | 17 | MMPPD | 30 | MPDDD |
| 5 | MMPD | 18 | MMPDD | 31 | MDMMD |
| 6 | MMDD | 19 | MMDMD | 32 | MDMPD |
| 7 | MPMD | 20 | MMDPD | 33 | MDMDD |
| 8 | MPPD | 21 | MMDDD | 34 | MDPMD |
| 9 | MPDD | 22 | MPMMD | 35 | MDPPD |
| 10 | MDMD | 23 | MPMPD | 36 | MDPDD |
| 11 | MDPD | 24 | MPMDD | 37 | MDDMD |
| 12 | MDDD | 25 | MPPMD | 38 | MDDPD |
| 13 | MMMMD | 26 | MPPPD | 39 | MDDDD |