| Literature DB >> 30598076 |
Cheng Yan1,2, Jianxin Wang3, Fang-Xiang Wu4.
Abstract
BACKGROUND: Many evidences have demonstrated that circRNAs (circular RNA) play important roles in controlling gene expression of human, mouse and nematode. More importantly, circRNAs are also involved in many diseases through fine tuning of post-transcriptional gene expression by sequestering the miRNAs which associate with diseases. Therefore, identifying the circRNA-disease associations is very appealing to comprehensively understand the mechanism, treatment and diagnose of diseases, yet challenging. As the complex mechanism between circRNAs and diseases, wet-lab experiments are expensive and time-consuming to discover novel circRNA-disease associations. Therefore, it is of dire need to employ the computational methods to discover novel circRNA-disease associations. RESULT: In this study, we develop a method (DWNN-RLS) to predict circRNA-disease associations based on Regularized Least Squares of Kronecker product kernel. The similarity of circRNAs is computed from the Gaussian Interaction Profile(GIP) based on known circRNA-disease associations. In addition, the similarity of diseases is integrated by the mean of GIP similarity and sematic similarity which is computed by the direct acyclic graph (DAG) representation of diseases. The kernels of circRNA-disease pairs are constructed from the Kronecker product of the kernels of circRNAs and diseases. DWNN (decreasing weight k-nearest neighbor) method is adopted to calculate the initial relational score for new circRNAs and diseases. The Kronecker product kernel based regularised least squares approach is used to predict new circRNA-disease associations. We adopt 5-fold cross validation (5CV), 10-fold cross validation (10CV) and leave one out cross validation (LOOCV) to assess the prediction performance of our method, and compare it with other six competing methods (RLS-avg, RLS-Kron, NetLapRLS, KATZ, NBI, WP). CONLUSION: The experiment results show that DWNN-RLS reaches the AUC values of 0.8854, 0.9205 and 0.9701 in 5CV, 10CV and LOOCV, respectively, which illustrates that DWNN-RLS is superior to the competing methods RLS-avg, RLS-Kron, NetLapRLS, KATZ, NBI, WP. In addition, case studies also show that DWNN-RLS is an effective method to predict new circRNA-disease associations.Entities:
Keywords: CircRNA; CircRNA-disease association; Gaussian interaction profile; Kron-RLS
Mesh:
Substances:
Year: 2018 PMID: 30598076 PMCID: PMC6311892 DOI: 10.1186/s12859-018-2522-6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The AUC curves of seven methods in the 5CV
Fig. 2The AUC curves of seven methods in the 10CV
Fig. 3The AUC curves of seven methods in the LOOCV
The 10CV prediction performance of various parameter values of ε ranging from 0.1 to 1.0 with 0.1 increments, the best result is in bold face
|
| 0.1 | 0.2 | 0.3 | 0.4 | 0.5 |
|---|---|---|---|---|---|
| AUC | 0.7927 ±0.0048 | 0.7927 ±0.0035 | 0.7922 ±0.0042 | 0.7902 ±0.0034 | 0.7920 ±0.0035 |
|
| 0.6 | 0.7 | 0.8 | 0.9 | 1.0 |
| AUC | 0.7922 ±0.0042 | 0.7889 ±0.0032 | 0.7903 ±0.0047 | 0.7897 ±0.0044 |
|
The 10CV prediction performance of various parameter values of σ ranging from 0.1 to 1.0 with 0.1 increments, the best result is in bold face
|
| 0.1 | 0.2 | 0.3 | 0.4 | 0.5 |
|---|---|---|---|---|---|
| AUC | 0.9200 ±0.0024 |
| 0.9182 ±0.0023 | 0.9154 ±0.0018 | 0.9136 ±0.0021 |
|
| 0.6 | 0.7 | 0.8 | 0.9 | 1.0 |
| AUC | 0.9110 ±0.0025 | 0.9078 ±0.0033 | 0.9042 ±0.0020 | 0.9041 ±0.0020 | 0.9010 ±0.0025 |
The validation results of predicted top 10 new circRNA-disease associations of Atherosclerotic vascular disease
| Disease | CircRNA | Rank | Source |
|---|---|---|---|
| Atherosclerotic vascular disease | cANRIL | 1 | PMID:28683453, |
| PMID:28946214 | |||
| hsa_circ_0003575 | 2 | Unknown | |
| circSMARCA5/hsa_circ_0001445 | 3 | Unknown | |
| hsa_circ_0000284/circHIPK3 | 4 | Unknown | |
| hsa_circ_0004383/cZNF292 | 5 | PMID:27836747 | |
| circRNA-chr19 | 6 | Unknown | |
| CircDOCK1/hsa_circ_100721 | 7 | Unknown | |
| mmu-circRNA-015947 | 8 | Unknown | |
| hsa-circRNA 2149 | 9 | Unknown | |
| circRar1 | 10 | Unknown |
The validation results of predicted top 10 new circRNA-disease associations of Breast cancer
| Disease | CircRNA | Rank | Source |
|---|---|---|---|
| Breast cancer | circGFRA1/hsa_circ_005239 | 1 | PMID:29037220 |
| circUBAP2 | 2 | Unknown | |
| circ-Foxo3/hsa_circ_0006404 | 3 | PMID:26657152 | |
| Cir-ITCH/hsa_circ_0001141/hsa_circ_001763 | 4 | Unknown | |
| hsa_circ_0001649 | 5 | Unknown | |
| CDR1as/ciRS-7/hsa_circ_0001946 | 6 | PMID:28049499 | |
| hsa_circ_0043256 | 7 | Unknown | |
| hsa_circ_0016760 | 8 | Unknown | |
| hsa_circ_0007385 | 9 | Unknown | |
| hsa_circ_0014130 | 10 | Unknown |