| Literature DB >> 35953785 |
Morteza Kouhsar1, Esra Kashaninia1, Behnam Mardani2, Hamid R Rabiee3.
Abstract
BACKGROUND: Several types of RNA in the cell are usually involved in biological processes with multiple functions. Coding RNAs code for proteins while non-coding RNAs regulate gene expression. Some single-strand RNAs can create a circular shape via the back splicing process and convert into a new type called circular RNA (circRNA). circRNAs are among the essential non-coding RNAs in the cell that involve multiple disorders. One of the critical functions of circRNAs is to regulate the expression of other genes through sponging micro RNAs (miRNAs) in diseases. This mechanism, known as the competing endogenous RNA (ceRNA) hypothesis, and additional information obtained from biological datasets can be used by computational approaches to predict novel associations between disease and circRNAs.Entities:
Keywords: Biological network; Disease; Representation learning; ceRNA; circRNA
Mesh:
Substances:
Year: 2022 PMID: 35953785 PMCID: PMC9367077 DOI: 10.1186/s12859-022-04883-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Fig. 1The overall workflow of the proposed algorithm
Fig. 2The average AUC of each classifier is based on the size of feature vectors extracted by DeepWalk
The average values of the evaluation metrics in 5 folds for different classifiers based on their optimal number of features
| Classifier (optimal feature vector size) | Acc (%) | F1 (%) | Pre (%) | Sen (%) | Spe (%) | AUC (%) |
|---|---|---|---|---|---|---|
| ABRF (10) | 89.74 | 89.37 | 92.6 | 86.44 | 93.04 | 96.58 |
| LR (80) | 71.3 | 71.32 | 71.25 | 71.48 | 71.13 | 77.48 |
| MP (10) | 89.56 | 89.59 | 89.37 | 89.91 | 89.22 | 95.54 |
| RF (10) | 90.09 | 89.78 | 92.61 | 87.13 | 93.04 | 96.44 |
| XGB (20) | 92.09 | 92.078 | 92.36 | 91.82 | 92.35 | 97.77 |
| SVM (90) | 72.09 | 73.26 | 70.16 | 76.69 | 67.48 | 76.41 |
Fig. 3ROC curve and AUC based on the average values of 5 folds (the size of extracted feature vector with Deepwalk set to the optimum value for each classifier)
The average values of the evaluation metrics in 5 folds for different state-of-the-art algorithms based on the benchmark dataset
| Algorithm | Acc (%) | F1 (%) | Pr (%) | Se (%) | Sp (%) | AUC (%) |
|---|---|---|---|---|---|---|
| CircWalk | 92.09 | 92.08 | 92.36 | 91.83 | 92.35 | 97.77 |
| DMFCDA | 83.69 | 83.69 | 81.55 | 87.79 | 79.6 | 83.69 |
| GCNCDA | 74.52 | 74.9 | 73.79 | 76.17 | 72.87 | 82.72 |
| SIMCCDA | 83.36 | 16.4 | 9.1 | 84.54 | 83.34 | 73.3 |
| GMNN2CD | 99.09 | 85.52 | 72.63 | 63.36 | 99.78 | 96.69 |
Fig. 4ROC curve and AUC based on the average values of 5 folds for different algorithms compared with our method
Predicted CircRNA-Disease relations with the highest probability for some selected diseases
| Disease | circRNA | Probability | Related article (PMID) |
|---|---|---|---|
| Lung cancer | hsa_circ_0007534 | 0.996 | 30017736 |
| hsa_circ_0001946 | 0.995 | 31249811 | |
| hsa_circ_0002874 | 0.992 | 33612481 | |
| hsa_circ_0014130 | 0.991 | 29440731, 31241217, 31818066, 32060230, 32616621, 34349347 | |
| hsa_circ_0002702 | 0.990 | 32962802 | |
| hsa_circ_0007874 | 0.988 | 30975029 | |
| hsa_circ_0074930 | 0.985 | 32962802 | |
| hsa_circ_0086414 | 0.983 | 30777071 | |
| hsa_circ_0079530 | 0.972 | 29689350 | |
| hsa_circ_0007385 | 0.972 | 29372377, 32602212, 32666646 | |
| hsa_circ_0016760 | 0.968 | 29440731 | |
| hsa_circ_0012673 | 0.960 | 29366790, 32141553 | |
| hsa_circ_0067934 | 0.954 | 33832139 | |
| hsa_circ_0000567 | 0.950 | 32328186, 33768996, 34435479 | |
| hsa_circ_0072088 | 0.941 | 32308427, 34135596 | |
| hsa_circ_0001727 | 0.934 | 32010565 | |
| hsa_circ_0008305 | 0.901 | 30261900 | |
| Gastric cancer | hsa_circ_0001313 | 0.999 | 32253030 |
| hsa_circ_0004771 | 0.998 | 29098316 | |
| hsa_circ_0002874 | 0.998 | 34388244 | |
| hsa_circ_0000615 | 0.998 | 34049561 | |
| hsa_circ_0006404 | 0.977 | 32445925 | |
| hsa_circ_0001982 | 0.977 | 33000178 | |
| hsa_circ_0032683 | 0.910 | 33449227 | |
| hsa_circ_0014130 | 0.819 | 32190005 | |
| Colorectal cancer | hsa_circ_0006054 | 0.995 | 30585259 |
| hsa_circ_0000745 | 0.990 | 28974900 | |
| hsa_circ_0044556 | 0.989 | 32884449 | |
| hsa_circ_0005075 | 0.964 | 31081084, 31476947, 34015582 | |
| hsa_circ_0040809 | 0.958 | 34438465 | |
| hsa_circ_0004771 | 0.945 | 31737058, 32419229 | |
| hsa_circ_0007874 | 0.924 | 32419229 | |
| hsa_circ_0080210 | 0.914 | 34222420 |