| Literature DB >> 35198012 |
Guanghui Li1, Diancheng Wang1, Yuejin Zhang1, Cheng Liang2, Qiu Xiao3, Jiawei Luo4.
Abstract
Cumulative research studies have verified that multiple circRNAs are closely associated with the pathogenic mechanism and cellular level. Exploring human circRNA-disease relationships is significant to decipher pathogenic mechanisms and provide treatment plans. At present, several computational models are designed to infer potential relationships between diseases and circRNAs. However, the majority of existing approaches could not effectively utilize the multisource data and achieve poor performance in sparse networks. In this study, we develop an advanced method, GATGCN, using graph attention network (GAT) and graph convolutional network (GCN) to detect potential circRNA-disease relationships. First, several sources of biomedical information are fused via the centered kernel alignment model (CKA), which calculates the corresponding weight of different kernels. Second, we adopt the graph attention network to learn latent representation of diseases and circRNAs. Third, the graph convolutional network is deployed to effectively extract features of associations by aggregating feature vectors of neighbors. Meanwhile, GATGCN achieves the prominent AUC of 0.951 under leave-one-out cross-validation and AUC of 0.932 under 5-fold cross-validation. Furthermore, case studies on lung cancer, diabetes retinopathy, and prostate cancer verify the reliability of GATGCN for detecting latent circRNA-disease pairs.Entities:
Keywords: centered kernel alignment; circRNA-disease associations; deep learning; graph attention network; graph convolutional network
Year: 2022 PMID: 35198012 PMCID: PMC8859418 DOI: 10.3389/fgene.2022.829937
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Overall workflow of the GATGCN.
FIGURE 2Outcome of comparing various learning rates.
FIGURE 3Outcome of comparing various embedding dimensions.
FIGURE 4Outcome of comparing various GCN layers.
FIGURE 5Outcome of comparing various dropout rates and penalty factors.
Results generated by the GATGCN under five-fold CV and LOOCV.
| Test set | Accu | Rec | Spe | F1 | AUC |
|---|---|---|---|---|---|
| 5-fold CV_1 | 0.988 | 0.682 | 0.989 | 0.437 | 0.956 |
| 5-fold CV_2 | 0.987 | 0.568 | 0.991 | 0.361 | 0.918 |
| 5-fold CV_3 | 0.987 | 0.644 | 0.988 | 0.373 | 0.922 |
| 5-fold CV_4 | 0.990 | 0.627 | 0.991 | 0.414 | 0.931 |
| 5-fold CV_5 | 0.991 | 0.647 | 0.990 | 0.402 | 0.934 |
| Average | 0.9886 ± 0.0024 | 0.6336 ± 0.0656 | 0.9898 ± 0.0012 | 0.3974 ± 0.0396 | 0.9322 ± 0.0238 |
| LOOCV | 0.987 | 0.782 | 0.992 | 0.542 | 0.951 |
FIGURE 6Performance of the GATGCN based on various model combinations.
FIGURE 7Comparison results of various prediction models under five-fold cross-validation.
FIGURE 8Performance of methods based on various percentages of known relationships.
Top 10 candidate circRNAs related to lung cancer.
| Rank | circRNA | Evidence (PMID) |
|---|---|---|
| 1 | hsa_circ_0007385 | 29372377 |
| 2 | hsa_circ_0014130 | 29440731 |
| 3 | hsa_circ_0016760 | 33416186 |
| 4 | hsa_circ_0043256 | 28958934 |
| 5 | hsa_circ_0012673 | 32141533 |
| 6 | hsa_circRNA_404833 | unconfirmed |
| 7 | hsa_circRNA_006411 | unconfirmed |
| 8 | hsa_circRNA_401977 | unconfirmed |
| 9 | hsa_circ_0013958 | 28685964 |
| 10 | hsa_circ_0006404 | unconfirmed |
Top 10 candidate circRNAs related to diabetes retinopathy.
| Rank | circRNA | Evidence (PMID) |
|---|---|---|
| 1 | hsa_circRNA_063981 | 28817829 |
| 2 | hsa_circRNA_404457 | 28817829 |
| 3 | hsa_circRNA_100750 | 28817829 |
| 4 | hsa_circRNA_406918 | 28817829 |
| 5 | hsa_circRNA_104387 | 28817829 |
| 6 | hsa_circRNA_103410 | 28817829 |
| 7 | hsa_circRNA_100192 | 28817829 |
| 8 | hsa_circ_0013509 | unconfirmed |
| 9 | circSLC8A1-1 | unconfirmed |
| 10 | hsa_circ_101396 | unconfirmed |
Top 10 candidate circRNAs related to prostate cancer.
| Rank | circRNA | Evidence (PMID) |
|---|---|---|
| 1 | circHIPK3 | 32547085 |
| 2 | hsa_circ_0004383 | unconfirmed |
| 3 | circ-Foxo3 | 31733095 |
| 4 | hsa-circRNA 2149 | unconfirmed |
| 5 | circR-284 | unconfirmed |
| 6 | circDLGAP4 | unconfirmed |
| 7 | hsa_circ_0008887 | unconfirmed |
| 8 | hsa_circ_0044516 | 31625175 |
| 9 | CDR1as | 23900077 |
| 10 | Cir-ITCH | 32904490 |
Top 10 candidate circRNAs related to cholangiocarcinoma.
| Rank | circRNA | Evidence (PMID) |
|---|---|---|
| 1 | hsa_circ_000438 | unconfirmed |
| 2 | circHIPK3 | 31654054 |
| 3 | ciRS-7 | 33390857 |
| 4 | circR-284 | unconfirmed |
| 5 | circDLGAP4 | unconfirmed |
| 6 | circSMARCA5 | 31880360 |
| 7 | hsa_circ_0008887 | unconfirmed |
| 8 | hsa_circ_0006404 | unconfirmed |
| 9 | hsa_circRNA_000585 | 34182814 |
| 10 | hsa_circ_0000673 | 33221765 |
Top 10 candidate circRNAs related to clear cell renal cell carcinoma.
| Rank | circRNA | Evidence (PMID) |
|---|---|---|
| 1 | circHIPK3 | 32409849 |
| 2 | circR-284 | unconfirmed |
| 3 | circDLGAP4 | unconfirmed |
| 4 | hsa_circ_0004383 | unconfirmed |
| 5 | Cir-ITCH | unconfirmed |
| 6 | hsa_circRNA_003251 | unconfirmed |
| 7 | circPVT1 | 33453148 |
| 8 | hsa_circ_0001451 | 30271486 |
| 9 | ciRS-7 | 32496306 |
| 10 | circZFR | 31571906 |