| Literature DB >> 34215183 |
Xue-Jun Chen1, Xin-Yun Hua1, Zhen-Ran Jiang2.
Abstract
BACKGROUND: A growing proportion of research has proved that microRNAs (miRNAs) can regulate the function of target genes and have close relations with various diseases. Developing computational methods to exploit more potential miRNA-disease associations can provide clues for further functional research.Entities:
Keywords: Light gradient boosting machine; Noise smoothing; k-means; miRNA-disease association
Mesh:
Substances:
Year: 2021 PMID: 34215183 PMCID: PMC8254275 DOI: 10.1186/s12859-021-04266-6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The main ideas of ANMDA and 6 published methods
| Method | Main idea |
|---|---|
| ANMDA | Adopts subsampling for noise smoothing and light gradient boosting machine for prediction |
| BHCMDA | Uses biased heat conduction-based method to pay attention to specific nodes for prediction |
| DBNMDA | Constructs deep-belief network for prediction |
| EKRRMDA | Applies ensemble learning and kernel ridge regression on various data subset created by random selection of features for prediction |
| FCGCNMDA | Applies fully connected graph convolutional networks for prediction |
| MDACNN | Uses auto-encoders for dimensionality reduction and then applies convolutional neural networks for prediction |
| WBSMDA | Calculates within-scores and between scores for prediction |
Fig. 1The AUROCs of ANMDA and other 6 published methods
Fig. 2The performance of ANMDA, ABMDA, GBDT-LR and IRFMDA-100 tested on the same data
The performance of ANMDA, ABMDA, GBDT-LR and IRFMDA-100 in 100 times five-fold cross validation
| Metrics | ANMDA | ABMDA | GBDT-LR | IRFMDA-100 |
|---|---|---|---|---|
| AUROC | 0.9373 ± 0.0005 | 0.9023 ± 0.0021 | 0.9246 ± 0.0010 | 0.9267 ± 0.0009 |
| AUPR | 0.9328 ± 0.0008 | 0.8879 ± 0.0032 | 0.9177 ± 0.0015 | 0.9222 ± 0.0012 |
| Precision | 0.8561 ± 0.0017 | 0.8213 ± 0.0033 | 0.8403 ± 0.0026 | 0.8447 ± 0.0021 |
| Recall | 0.8728 ± 0.0020 | 0.8371 ± 0.0044 | 0.8567 ± 0.0031 | 0.8598 ± 0.0025 |
| F1-score | 0.8643 ± 0.0014 | 0.8290 ± 0.0030 | 0.8484 ± 0.0021 | 0.8521 ± 0.0016 |
Fig. 3The ROC and PR curves of different algorithms with and without subsampling for noise smoothing
Fig. 4The ROC and PR curves of kNN MLP and LightGBM without subsampling for noise smoothing
Fig. 5The framework of ANMDA contains three steps: construct features; construct data (construct positive samples and using k-means on undetected pairs to select negative samples based on HMDD v2.0); apply the algorithm to predict the associations
Fig. 6The interference of the noise hiding in the dataset
The performance of logistic regression algorithm on noise and noise-free data
| Data | AUROC | AUPR | Precision | Recall | F1-score |
|---|---|---|---|---|---|
| Noise-free | 0.9897 | 0.9892 | 0.9273 | 0.9450 | 0.9344 |
| Noise | 0.9647 | 0.9519 | 0.9039 | 0.9400 | 0.9204 |
Fig. 7The pseudocode of ANMDA