| Literature DB >> 35529153 |
Guanghui Li1, Yingjie Yue2, Cheng Liang3, Qiu Xiao4, Pingjian Ding5, Jiawei Luo6.
Abstract
A growing body of evidence indicates that circular RNAs (circRNAs) play a pivotal role in various biological processes and have a close association with the initiation and progression of diseases. Moreover, circRNAs are considered as promising biomarkers for disease diagnosis owing to their characteristics of conservation, stability and universality. Inferring disease-circRNA relationships will contribute to the understanding of disease pathology. However, it is costly and laborious to discover novel disease-circRNA interactions by wet-lab experiments, and few computational methods have been devoted to predicting potential circRNAs for diseases. Here, we advance a computational method (NCPCDA) to identify novel circRNA-disease associations based on network consistency projection. For starters, we make use of multi-view similarity data, including circRNA functional similarity, disease semantic similarity, and association profile similarity, to construct the integrated circRNA similarity and disease similarity. Then, we project circRNA space and disease space on the circRNA-disease interaction network, respectively. Finally, we can obtain the predicted circRNA-disease association score matrix by combining the above two space projection scores. Simulation results show that NCPCDA can efficiently infer disease-circRNA relationships with high accuracy, obtaining AUCs of 0.9541 and 0.9201 in leave-one-out cross validation and five-fold cross validation, respectively. Furthermore, case studies also suggest that NCPCDA is promising for discovering new disease-circRNA interactions. The NCPCDA dataset and code, as well as the detailed readme file for our code, can be downloaded from Github (https://github.com/ghli16/NNCPCD). This journal is © The Royal Society of Chemistry.Entities:
Year: 2019 PMID: 35529153 PMCID: PMC9073279 DOI: 10.1039/c9ra06133a
Source DB: PubMed Journal: RSC Adv ISSN: 2046-2069 Impact factor: 4.036
Fig. 1The overall workflow of the NCPCDA method.
Fig. 2The ROC curves of different models under leave-one-out cross validation.
Fig. 3The ROC curves of different models under five-fold cross validation.
The top-20 newly discovered circRNAs for lung cancer predicted by NCPCDA
| Rank | circRNAs | Evidence |
|---|---|---|
| 1 | hsa_circ_0007385 | PMID: 29372377 |
| 2 | hsa_circ_0014130 | PMID: 29440731 |
| 3 | hsa_circ_0016760 | Unconfirmed |
| 4 | hsa_circ_0043256 | circRNADisease |
| 5 | hsa_circ_0012673 | PMID: 29366790 |
| 6 | hsa_circRNA_404833 | PMID: 29241190 |
| 7 | hsa_circRNA_006411 | PMID: 29241190 |
| 8 | hsa_circRNA_401977 | PMID: 29241190 |
| 9 | hsa_circ_0013958 | circRNADisease, Circ2Disease |
| 10 | circ-Foxo3/hsa_circ_0006404 | PMID: 29620202 |
| 11 | hsa_circRNA_100782/circHIPK3/hsa_circ_0000284 | circRNADisease, Circ2Disease |
| 12 | hsa_circ_0023404/circRNA_100876/circ-CER | circRNADisease, Circ2Disease |
| 13 | circPRKCI/hsa_circ_0067934 | PMID: 29588350 |
| 14 | hsa_circRNA_100855/hsa_circ_0023028 | Unconfirmed |
| 15 | hsa_circRNA_104912/hsa_circ_0088442 | Unconfirmed |
| 16 | hsa_circRNA_103110/hsa_circ_103110/hsa_circ_0004771 | Unconfirmed |
| 17 | hsa_circ_0001313/circCCDC66 | Unconfirmed |
| 18 | hsa_circRNA_102049 | Unconfirmed |
| 19 | hsa_circ_0001649 | Unconfirmed |
| 20 | CDR1as/ciRS-7/hsa_circ_0001946 | PMID: 30841451 |
The top-20 candidate circRNAs for lung cancer predicted by NCPCDA by eliminating all known associated pairs of this disease
| Rank | circRNAs | Evidence |
|---|---|---|
| 1 | circMAN2B2/hsa_circRNA_103595 | CircR2Disease |
| 2 | circRNA_102231 | CircR2Disease |
| 3 | hsa_circ_0000064 | CircR2Disease |
| 4 | hsa_circRNA_100782/circHIPK3/hsa_circ_0000284 | circRNADisease, Circ2Disease |
| 5 | hsa-circRNA 2149 | Unconfirmed |
| 6 | circular RNA100783/hsa_circ_0008887 | Unconfirmed |
| 7 | circDLGAP4 | Unconfirmed |
| 8 | circR-284 | Unconfirmed |
| 9 | circRNA_104983/hsa_circ_0089974 | Unconfirmed |
| 10 | circRNA_001059/hsa_circ_0000554 | Unconfirmed |
| 11 | circRNA_100984/hsa_circ_0002019 | Unconfirmed |
| 12 | circRNA_100367/hsa_circ_0014879 | Unconfirmed |
| 13 | circRNA_101877/hsa_circ_0004519 | Unconfirmed |
| 14 | circRNA_000695/hsa_circ_0001336 | Unconfirmed |
| 15 | circRNA_101419/hsa_circ_0032832 | Unconfirmed |
| 16 | circFUT8/hsa_circRNA_101368/hsa_circ_0003028 | Unconfirmed |
| 17 | circIPO11/hsa_circRNA_103847/hsa_circ_0007915 | Unconfirmed |
| 18 | hsa_circ_0001313/circCCDC66 | Unconfirmed |
| 19 | circPVT1/hsa_circ_0001821 | CircR2Disease |
| 20 | circZFR/hsa_circRNA_103809/hsa_circ_0072088 | PMID: 29698681 |
Fig. 4The percentage of predicted true positives by NCPCDA under different rankings based on the CircR2Disease dataset.
Fig. 5The percentage of predicted true positives by NCPCDA under different rankings based on the circRNADisease dataset.