| Literature DB >> 31608124 |
Xiujuan Lei1, Zengqiang Fang1, Ling Guo2.
Abstract
With the development of high-throughput techniques, various biological molecules are discovered, which includes the circular RNAs (circRNAs). Circular RNA is a novel endogenous noncoding RNA that plays significant roles in regulating gene expression, moderating the microRNAs transcription as sponges, diagnosing diseases, and so on. Based on the circRNA particular molecular structures that are closed-loop structures with neither 5'-3' polarities nor polyadenylated tails, circRNAs are more stable and conservative than the normal linear coding or noncoding RNAs, which makes circRNAs a biomarker of various diseases. Although some conventional experiments are used to identify the associations between circRNAs and diseases, almost the techniques and experiments are time-consuming and expensive. In this study, we propose a collaboration filtering recommendation system-based computational method, which handles the "cold start" problem to predict the potential circRNA-disease associations, which is named ICFCDA. All the known circRNA-disease associations data are downloaded from circR2Disease database (http://bioinfo.snnu.edu.cn/CircR2Disease/). Based on these data, multiple data are extracted from different databases to calculate the circRNA similarity networks and the disease similarity networks. The collaboration filtering recommendation system algorithm is first employed to predict circRNA-disease associations. Then, the leave-one-out cross validation mechanism is adopted to measure the performance of our proposed computational method. ICFCDA achieves the areas under the curve of 0.946, which is better than other existing methods. In order to further illustrate the performance of ICFCDA, case studies of some common diseases are made, and the results are confirmed by other databases. The experimental results show that ICFCDA is competent in predicting the circRNA-disease associations.Entities:
Keywords: circRNA–disease association; collaboration filtering; multiple biological data; neighbor information; recommendation system
Year: 2019 PMID: 31608124 PMCID: PMC6773885 DOI: 10.3389/fgene.2019.00897
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1The flowchart of the computational method ICFCDA.
Figure 2The AUC value of ICFCDA compared with other computational methods.
Figure 3The AUC value of ICFCDA compared with CFCDA without solving the “cold start” problem.
Figure 4The AUC values of nine kinds of specific diseases.
The average AUC values of 42 diseases.
| KATZCDA | RWRCDA | NMFCDA | KNNR | SVRrbf | SVRpoly | ICFCDA | |
|---|---|---|---|---|---|---|---|
| Average | 0.719 | 0.478 | 0.616 | 0.536 | 0.441 | 0.415 | 0.885 |
Figure 5Comparison of the precision, recall, accuracy, and f-measure with different methods.
Figure 6The number of correct circRNA–disease association in top k predicting results.
AUC with different values for parameter k.
| k | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| AUC | 0.930 | 0.932 | 0.940 |
| 0.923 | 0.921 | 0.921 | 0.906 | 0.906 | 0.902 |
Figure 7The AUC of the parameter α and β based on the fixed parameter γ and k.
The top 10 bladder cancer related candidates’ circRNAs.
| Rank | CirRNA name/id | Evidences | Rank | CircRNA name/id | Evidences |
|---|---|---|---|---|---|
|
| hsa_circ_0000172 | + |
| hsa_circ_0002024 | + |
|
| hsa_circ_0002495 | + |
| circMylk/ | *, # |
|
| circRNABCRC4/ | PMID: 29270748 |
| circTCF25/ | # |
|
| hsa_circ_0003221/ | #, + |
| circFAM169A/ | # |
|
| hsa_circ_0091017 | #, + |
| circTRIM24/ | # |
The top 10 breast cancer–related candidates’ circRNAs.
| Rank | CirRNA name/id | Evidences | Rank | CircRNA name/id | Evidences |
|---|---|---|---|---|---|
|
| hsa_circ_0011946 | + |
| circAmotl1/ | *, # |
|
| hsa_circ_0093859 | + |
| hsa_circ_0006528 | *, #, + |
|
| hsa_circ_0001982 | #, + |
| hsa_circ_0002874 | #, + |
|
| hsa_circ_0001785 | #, + |
| hsa_circ_0085495 | #, + |
|
| hsa_circ_0108942 | #, + |
| hsa_circ_0086241 | #, + |