| Literature DB >> 35508967 |
Lei Deng1,2, Zixuan Liu1, Yurong Qian1, Jingpu Zhang3.
Abstract
BACKGROUND: Circular RNAs (circRNAs) play essential roles in cancer development and therapy resistance. Many studies have shown that circRNA is closely related to human health. The expression of circRNAs also affects the sensitivity of cells to drugs, thereby significantly affecting the efficacy of drugs. However, traditional biological experiments are time-consuming and expensive to validate drug-related circRNAs. Therefore, it is an important and urgent task to develop an effective computational method for predicting unknown circRNA-drug associations.Entities:
Keywords: Graph attention auto-encoder; Neural network; Similarity network; circRNA-drug associations
Mesh:
Substances:
Year: 2022 PMID: 35508967 PMCID: PMC9066932 DOI: 10.1186/s12859-022-04694-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Fig. 1The flowchart of GATECDA. (1) We build a comprehensive similarity matrix CS for circRNAs and a comprehensive similarity matrix DS for drugs, respectively. (2) Two thresholds are utilized to binarize the corresponding comprehensive similarity matrices CS and DS. (3) GATE is employed to extract the representations of circRNAs and drugs respectively. (4). The representations of circRNAs and drugs are combined and fed into a fully connected neural network to predict the associations of each pair of circRNA and drug sensitivity
Fig. 2The process of using two-layer GATE to reconstruct the features of node 3. the neighbors of the node 3 are (1, 2, 3, 4, 5). we note that , and ,
Fig. 3Performance with different combinations of the two hyperparameters
Fig. 4Comparison on different network configurations in terms of AUC and AUPR
Comparison on different network configurations in terms of score, accuracy, recall, specificity and precision using 10-fold cross-validation
| Methods | Precision | Recall | F1 score | Accuracy | Specificity |
|---|---|---|---|---|---|
| GATECDA-S | 0.7576 | 0.8468 | 0.7997 | 0.7879 | 0.7290 |
| GATECDA-G | 0.7896 | 0.8187 | 0.8118 | 0.7735 | |
| GATECDA-SNF | 0.8335 | ||||
| GATECDA | 0.8128 | 0.8343 | 0.8234 | 0.8211 | 0.8079 |
The values with bold indicate the best results in terms of different metrics
Fig. 5Comparison results of GATECDA with the four state-of-the-art methods using 10-fold CV and 5-fold CV
Comparison with the four state-of-the-art methods in terms of aupr, score, accuracy, recall, specificity and precision using 10-fold cross-validation
| Methods | AUPR | Precision | Recall | F1 score | Accuracy | Specificity |
|---|---|---|---|---|---|---|
| KATZ | 0.8269 | 0.7176 | 0.7906 | 0.7669 | 0.6538 | |
| VGAE | 0.8725 | 0.7683 | 0.8313 | 0.7986 | 0.7864 | 0.7398 |
| VGAMF | 0.8681 | 0.7783 | 0.8471 | 0.8113 | 0.8029 | 0.7588 |
| GCNMDA | 0.8864 | 0.8039 | 0.8420 | 0.8225 | 0.8183 | 0.7946 |
| GATECDA | 0.8343 |
The values with bold indicate the best results in terms of different metrics
Comparison with the four state-of-the-art methods in terms of aupr, score, accuracy, recall, specificity and precision using 5-fold cross-validation
| Methods | AUPR | Precision | Recall | F1 score | Accuracy | Specificity |
|---|---|---|---|---|---|---|
| KATZ | 0.8223 | 0.7141 | 0.7866 | 0.7625 | 0.6494 | |
| VGAE | 0.8730 | 0.7763 | 0.8226 | 0.7988 | 0.7892 | 0.7546 |
| VGAMF | 0.8661 | 0.7911 | 0.8437 | 0.8165 | 0.8104 | 0.7772 |
| GCNMDA | 0.8761 | 0.7938 | 0.8427 | 0.8175 | 0.8119 | 0.7810 |
| GATECDA | 0.8316 |
The values with bold indicate the best results in terms of different metrics
The top 20 circRNAs associated with drug PAC-1. circRic(CTRP) indicates that the drug sensitivity in one circRNA-drug association is derived from the CTRP database
| Rank | circRNA | Evidence | Rank | circRNA | Evidence |
|---|---|---|---|---|---|
| 1 | VIM* | circRic(CTRP) | 11 | MEF2D* | circRic(CTRP) |
| 2 | CTTN* | circRic(CTRP) | 12 | PEA15* | circRic(CTRP) |
| 3 | POLR2A* | circRic(CTRP) | 13 | FBLN1* | circRic(CTRP) |
| 4 | CRIM1* | circRic(CTRP) | 14 | NCL | Nonsignificant |
| 5 | THBS1* | circRic(CTRP) | 15 | COL1A2* | circRic(CTRP) |
| 6 | ANP32B* | circRic(CTRP) | 16 | DCBLD2* | circRic(CTRP) |
| 7 | COL1A1* | circRic(CTRP) | 17 | COL6A2* | circRic(CTRP) |
| 8 | PTMS* | circRic(CTRP) | 18 | EHBP1L1 | Nonsignificant |
| 9 | SPINT2 | Nonsignificant | 19 | PSAP* | circRic(CTRP) |
| 10 | ASPH* | circRic(CTRP) | 20 | ANKRD36C* | circRic(CTRP) |
Nonsignificant means non-significant association. circRNAs marked with ‘*’ are verified
The top 20 circRNAs associated with drug Foretinib
| Rank | circRNA | Evidence | Rank | circRNA | Evidence |
|---|---|---|---|---|---|
| 1 | MUC16* | circRic(CTRP) | 11 | THBS1* | circRic(CTRP) |
| 2 | EVPL | Nonsignificant | 12 | PSAP* | circRic(CTRP) |
| 3 | ANP32B* | circRic(CTRP) | 13 | ARID1B* | circRic(CTRP) |
| 4 | ASPH* | circRic(CTRP) | 14 | WASF1* | circRic(CTRP) |
| 5 | GJB3* | circRic(CTRP) | 15 | LTBP3* | circRic(CTRP) |
| 6 | PTMS* | circRic(CTRP) | 16 | CRIM1* | circRic(CTRP) |
| 7 | CNKSR1* | circRic(CTRP) | 17 | MYC | Nonsignificant |
| 8 | LCN2* | circRic(CTRP) | 18 | ANKRD36C* | circRic(CTRP) |
| 9 | FBLN1 | Nonsignificant | 19 | PLEKHG2* | circRic(CTRP) |
| 10 | PHF21A | Nonsignificant | 20 | ANXA2* | circRic(CTRP) |
circRNAs marked with ‘*’ are verified
The top 10 predicted circRNAs related to two new drugs
| Erlotinib | MG-132 | ||||
|---|---|---|---|---|---|
| Rank | circRNA | Evidence | Rank | circRNA | Evidence |
| 1 | SPINT2* | circRic(CTRP) | 1 | CRIM1 | Nonsignificant |
| 2 | KRT19* | circRic(CTRP) | 2 | THBS1* | circRic(CTRP) |
| 3 | POLR2A | Nonsignificant | 3 | SPINT2 | Nonsignificant |
| 4 | LTBP3 | Nonsignificant | 4 | AHNAK | Nonsignificant |
| 5 | KRT7* | circRic(CTRP) | 5 | KRT19* | circRic(CTRP) |
| 6 | FN1 | Nonsignificant | 6 | EFEMP1* | circRic(CTRP) |
| 7 | THBS1 | Nonsignificant | 7 | COL1A2 | Nonsignificant |
| 8 | MAL2* | circRic(CTRP) | 8 | ANXA2* | circRic(CTRP) |
| 9 | CRIM1 | Nonsignificant | 9 | COL8A1 | unconfirmed |
| 10 | LCN2* | circRic(CTRP) | 10 | COL6A2 | Nonsignificant |
circRNAs marked with ‘*’ are verified