| Literature DB >> 31931344 |
Kai Zheng1, Zhu-Hong You2, Lei Wang3, Yong Zhou4, Li-Ping Li5, Zheng-Wei Li4.
Abstract
MicroRNAs (miRNAs) play a critical role in human diseases. Determining the association between miRNAs and disease contributes to elucidating the pathogenesis of liver diseases and seeking the effective treatment method. Despite great recent advances in the field of the associations between miRNAs and diseases, implementing association verification and recognition efficiently at scale presents serious challenges to biological experimental approaches. Thus, computational methods for predicting miRNA-disease association have become a research hotspot. In this paper, we present a new computational method, named distance-based sequence similarity for miRNA-disease association prediction (DBMDA), that directly learns a mapping from miRNA sequence to a Euclidean space. The notable feature of our approach consists of inferring global similarity from region distances that can be figured by chaos game representation algorithm based on the miRNA sequences. In the 5-fold cross-validation experiment, the area under the curve (AUC) obtained by DBMDA in predicting potential miRNA-disease associations reached 0.9129. To assess the effectiveness of DBMDA more effectively, we compared it with different classifiers and former prediction models. Besides, we constructed two case studies for prostate neoplasms and colon neoplasms. Results show that 39 and 39 out of the top 40 predicted miRNAs were confirmed by other databases, respectively. BDMDA has made new attempts in sequence similarity and achieved excellent results, while at the same time providing a new perspective for predicting the relationship between diseases and miRNAs. The source code and datasets explored in this work are available online from the University of Chinese Academy of Sciences (http://220.171.34.3:81/).Entities:
Keywords: chaos game representation; disease; heterogenous information; miRNAs; rotation forest
Year: 2019 PMID: 31931344 PMCID: PMC6957846 DOI: 10.1016/j.omtn.2019.12.010
Source DB: PubMed Journal: Mol Ther Nucleic Acids ISSN: 2162-2531 Impact factor: 8.886
Figure 1The ROCs of DBMDA and AUCs Based on 5-Fold Cross-Validation
The Comparison Results of DBMDA Based on 5-Fold Cross-Validation
| Testing Set | Accuracy | Sensitivity | Precision | F1-Score |
|---|---|---|---|---|
| 1 | 83.14% | 81.55% | 84.23% | 82.87% |
| 2 | 86.21% | 86.83% | 85.77% | 86.30% |
| 3 | 85.57% | 86.42% | 84.99% | 85.70% |
| 4 | 86.22% | 87.07% | 85.63% | 86.34% |
| 5 | 85.66% | 86.83% | 84.85% | 85.83% |
| Average | 85.36% ± 1.27% | 85.74% ± 2.35% | 85.09% ± 0.62% | 85.40% ± 1.44% |
Figure 2The ROCs of Four Different Classifiers, which Are RoF, SVM, Random Forest, and Decision Tree
Performance Comparison among Four Different Classifiers, which are Rotation Forest, SVM, Random Forest, and Decision Tree
| Method | Accuracy | Sensitivity | Precision | F1-Score |
|---|---|---|---|---|
| SVM | 83.73% | 83.56% | 83.33% | 83.45% |
| RF | 82.06% | 76.49% | 85.43% | 80.72% |
| DT | 80.33% | 78.12% | 81.10% | 79.58% |
| RoF | 85.00% | 85.60% | 84.11% | 84.85% |
The Comparison with Related Models
| Methods | AUC Scores |
|---|---|
| RLSMDA | 86.17% |
| PBSI | 54.02% |
| MBSI | 74.83% |
| NetCBI | 80.66% |
| MaxFlow | 86.93% |
| miRGOFS | 87.70% |
| HGIMDA | 87.81% |
| MDHGI | 87.94% |
| LMTRDA | 90.54% |
| DBMDA | 91.29% |
The results of the method are reported in Chen and Yan.
The results of the method are reported in Chen and Zhang.
The results of the method are reported in Yu et al.
The results of the method are reported in Yang et al.
The results of the method are reported in Chen et al.
The results of the method are reported in Chen et al.
The results of the method are reported in Wang et al.
Prediction of the Top 40 Predicted miRNAs Associated with Prostate Neoplasms Based on Known Associations in dbDEMC v.2.0 and miR2Database
| miRNA | dbDEMC | miR2D | miRNA | dbDEMC | miR2D |
|---|---|---|---|---|---|
| hsa-mir-192 | confirmed | unconfirmed | hsa-mir-181a-2 | confirmed | unconfirmed |
| hsa-let-7i | confirmed | unconfirmed | hsa-mir-196a | confirmed | unconfirmed |
| hsa-mir-140 | confirmed | unconfirmed | hsa-mir-208a | confirmed | unconfirmed |
| hsa-mir-199b | confirmed | confirmed | hsa-mir-337 | confirmed | unconfirmed |
| hsa-mir-144 | confirmed | unconfirmed | hsa-mir-1246 | confirmed | unconfirmed |
| hsa-mir-372 | confirmed | unconfirmed | hsa-mir-30 | confirmed | unconfirmed |
| hsa-let-7e | confirmed | confirmed | hsa-mir-184 | confirmed | confirmed |
| hsa-let-7f | confirmed | confirmed | hsa-mir-509 | unconfirmed | unconfirmed |
| hsa-mir-10b | confirmed | confirmed | hsa-mir-9-3 | confirmed | unconfirmed |
| hsa-mir-129 | confirmed | unconfirmed | hsa-let-7f-2 | confirmed | unconfirmed |
| hsa-mir-9-1 | confirmed | unconfirmed | hsa-mir-202 | confirmed | confirmed |
| hsa-mir-206 | confirmed | unconfirmed | hsa-mir-33a | confirmed | unconfirmed |
| hsa-mir-125a | confirmed | confirmed | hsa-mir-451a | confirmed | unconfirmed |
| hsa-mir-30b | confirmed | confirmed | hsa-let-7f-1 | confirmed | unconfirmed |
| hsa-mir-362 | confirmed | unconfirmed | hsa-mir-186 | confirmed | unconfirmed |
| hsa-mir-133 | confirmed | unconfirmed | hsa-mir-302b | confirmed | unconfirmed |
| hsa-mir-139 | confirmed | unconfirmed | hsa-mir-328 | confirmed | unconfirmed |
| hsa-mir-137 | confirmed | unconfirmed | hsa-mir-383 | confirmed | unconfirmed |
| hsa-mir-181b-2 | confirmed | unconfirmed | hsa-mir-431 | confirmed | unconfirmed |
| hsa-mir-338 | confirmed | unconfirmed | hsa-mir-103a-2 | confirmed | unconfirmed |
Prediction of the Top 40 Predicted miRNAs Associated with Colon Neoplasms Based on Known Associations in dbDEMC v.2.0 and miR2Database
| miRNA | dbDEMC | miR2D | miRNA | dbDEMC | miR2D |
|---|---|---|---|---|---|
| hsa-mir-26a | confirmed | confirmed | hsa-mir-497 | confirmed | confirmed |
| hsa-mir-182 | confirmed | confirmed | hsa-mir-92a-2 | confirmed | unconfirmed |
| hsa-mir-342 | confirmed | confirmed | hsa-mir-124 | confirmed | confirmed |
| hsa-mir-483 | confirmed | unconfirmed | hsa-mir-129 | confirmed | confirmed |
| hsa-mir-139 | confirmed | unconfirmed | hsa-mir-133a-1 | confirmed | confirmed |
| hsa-mir-372 | confirmed | unconfirmed | hsa-mir-181b-1 | confirmed | confirmed |
| hsa-mir-181b-2 | confirmed | confirmed | hsa-mir-26a-1 | confirmed | confirmed |
| hsa-mir-181a-2 | confirmed | confirmed | hsa-mir-373 | confirmed | unconfirmed |
| hsa-mir-124-1 | confirmed | confirmed | hsa-mir-423 | confirmed | unconfirmed |
| hsa-mir-193a | confirmed | unconfirmed | hsa-mir-499 | unconfirmed | unconfirmed |
| hsa-mir-193b | confirmed | unconfirmed | hsa-mir-128 | confirmed | confirmed |
| hsa-mir-26b | confirmed | unconfirmed | hsa-mir-16 | confirmed | unconfirmed |
| hsa-mir-34b | confirmed | unconfirmed | hsa-mir-212 | confirmed | unconfirmed |
| hsa-mir-1 | confirmed | confirmed | hsa-mir-340 | confirmed | unconfirmed |
| hsa-mir-133a-2 | confirmed | confirmed | hsa-mir-98 | confirmed | unconfirmed |
| hsa-mir-199b | confirmed | unconfirmed | hsa-mir-100 | confirmed | unconfirmed |
| hsa-mir-27b | confirmed | confirmed | hsa-mir-124-3 | confirmed | confirmed |
| hsa-mir-29c | confirmed | unconfirmed | hsa-mir-133 | confirmed | confirmed |
| hsa-mir-451a | confirmed | unconfirmed | hsa-mir-183 | confirmed | confirmed |
| hsa-mir-144 | confirmed | unconfirmed | hsa-mir-370 | confirmed | unconfirmed |
Figure 3CGR of the miRNA Named hsa-mir-135
Figure 4The Flowchart of Quantify the Sequence Similarity Utilizing its Regional Distance
(A) The CGRs of hsa-mir-27a are plotted with the average coordinates for each 8 × 8 quadrant represented. (B) The CGRs of hsa-mir-651 are plotted with the average coordinates for each 8 × 8 quadrant represented. (C) Figuring the region distances of hsa-mir-27a and hsa-mir-651.
Figure 5The Workflow of DBMDA Model to Predict Potential miRNA-Disease Associations