| Literature DB >> 30585259 |
Chunyan Fan1, Xiujuan Lei1, Fang-Xiang Wu1,2.
Abstract
Circular RNAs (circRNAs) are a large group of endogenous non-coding RNAs which are key members of gene regulatory processes. Those circRNAs in human paly significant roles in health and diseases. Owing to the characteristics of their universality, specificity and stability, circRNAs are becoming an ideal class of biomarkers for disease diagnosis, treatment and prognosis. Identification of the relationships between circRNAs and diseases can help understand the complex disease mechanism. However, traditional experiments are costly and time-consuming, and little computational models have been developed to predict novel circRNA-disease associations. In this study, a heterogeneous network was constructed by employing the circRNA expression profiles, disease phenotype similarity and Gaussian interaction profile kernel similarity. Then, we developed a computational model of KATZ measures for human circRNA-disease association prediction (KATZHCDA). The leave-one-out cross validation (LOOCV) and 5-fold cross validation were implemented to investigate the effects of these four types of similarity measures. As a result, KATZHCDA model yields the AUCs of 0.8469 and 0.7936+/-0.0065 in LOOCV and 5-fold cross validation, respectively. Furthermore, we analyze the candidate association between hsa_circ_0006054 and colorectal cancer, and results showed that hsa_circ_0006054 may function as miRNA sponge in the carcinogenesis of colorectal cancer. Overall, it is anticipated that our proposed model could become an effective resource for clinical experimental guidance.Entities:
Keywords: CircRNA-disease association; KATZ model; similarity measure
Mesh:
Substances:
Year: 2018 PMID: 30585259 PMCID: PMC6299360 DOI: 10.7150/ijbs.28260
Source DB: PubMed Journal: Int J Biol Sci ISSN: 1449-2288 Impact factor: 6.580
Figure 1Bipartite graph of the circRNA-disease associations. The rectangles represent the diseases, and the circles represent the circRNAs. An edge corresponds to the gold standard circRNA-disease associations.
Figure 2Pie graph of degree distribution for circRNAs or diseases. (A) Degree proportion of circRNAs. (B) Degree proportion of diseases.
Figure 3Heatmap of circRNAs in different samples.
Figure 4The flowchart of the KATZHCDA model.
Figure 5Effect of parameter α and β when k was set as 2, 3 and 4.
5-fold CV experimental results of setting parameter k.
| 5-fold CV (Average AUC) | |||
|---|---|---|---|
| 0.7616+/-0.0062 | 0.7764+/-0.0066 | 0.7833+/-0.0075 | |
| 0.7561+/-0.0067 | 0.7710+/-0.0070 | 0.7763+/-0.0065 |
5-fold CV experimental results with different similarity measures.
| Similarity measures | 5-fold CV (k=2) | 5-fold CV (k=3) | 5-fold CV (k=4) |
|---|---|---|---|
| Similarity measure1 ( | 0.3256+/-0.0092 | 0.3184+/-0.0111 | 0.3922+/-0.0110 |
| Similarity measure2 ( | 0.4258+/-0.0120 | 0.4106+/-0.0097 | 0.3914+/-0.0099 |
| Similarity measure3 ( | 0.7405+/-0.0077 | 0.7659+/-0.0079 | 0.7786+/-0.0070 |
| Similarity measure4 ( | 0.7647+/-0.0060 | 0.7799+/-0.0067 | 0.7936+/-0.0065 |
| Similarity measure5 ( | 0.7616+/-0.0062 | 0.7764+/-0.0066 | 0.7833+/-0.0075 |
Figure 6Prediction performance of KATZHCDA model with different similarity measures in the framework of LOOCV. (A) Performance comparison among five similarity measures when k was set as 2. (B) Performance comparison among five similarity measures when k was set as 3. (C) Performance comparison among five similarity measures when k was set as 4.
Figure 7The hsa_circ_0006054-miRNA-mRNA network in colorectal cancer.