| Literature DB >> 31514111 |
Guobo Xie1, Tengfei Meng1, Yu Luo2, Zhenguo Liu3.
Abstract
Recently, prediction of lncRNA-disease associations has attracted more and more attentions. Various computational models have been proposed; however, there is still room to improve the prediction accuracy. In this paper, we propose a kernel fusion method with different types of similarities for the lncRNAs and diseases. The expression similarity and cosine similarity are used for lncRNAs, and the semantic similarity and cosine similarity are used for the diseases. To eliminate the noise effect, a neighbor constraint is enforced to refine all the similarity matrices before fusion. Experimental results show that the proposed similarity kernel fusion (SKF)-LDA method has the superiority performance in terms of AUC values and other measurements. In the schemes of LOOCV and 5-fold CV, AUC values of SKF-LDA achieve 0.9049 and 0.8743±0.0050 respectively. In addition, the conducted case studies of three diseases (hepatocellular carcinoma, lung cancer, and prostate cancer) show that SKF-LDA can predict related lncRNAs accurately.Entities:
Keywords: Laplacian regularized least squares; disease similarity; lncRNA similarity; lncRNA-disease association; similarity kernel fusion
Year: 2019 PMID: 31514111 PMCID: PMC6742806 DOI: 10.1016/j.omtn.2019.07.022
Source DB: PubMed Journal: Mol Ther Nucleic Acids ISSN: 2162-2531 Impact factor: 8.886
Figure 1Flow Chart of SKF-LDA Applied to lncRNA-Disease Association Prediction
(A–D) SKF-LDA consists of four steps: (A) constructing the lncRNA-disease correlation matrix; (B) calculating the two similarities of lncRNA and disease similarity, respectively; (C) using SKF integration similarity; and (D) obtaining the prediction matrix by Laplacian regularized least squares.
Figure 2The AUC Values with Different αs
Figure 3The AUC Values with Different Values of the Number of Neighbors (K)
Figure 4The AUC Values with Different βs
Figure 5The ROC Curve and the PR Curve of Three Integration Methods
The AUC Values of SKF-LDA and Other Single Similarity in LOOCV and 5-fold CV Scheme
| lncRNA Similarity | Disease Similarity | LOOCV | 5-fold CV |
|---|---|---|---|
| Expression | semantic | 0.8512 | 0.8476 ± 0.0034 |
| Expression | cosine | 0.8630 | 0.8375 ± 0.0046 |
| Cosine | semantic | 0.8835 | 0.8502 ± 0.0073 |
| Expression | cosine | 0.8754 | 0.8519 ± 0.0070 |
| Expression + cosine | semantic + cosine | 0.9049 | 0.8743 ± 0.0050 |
Figure 6The ROC Curve and the PR Curve When Using Neighbor and without Neighbor Constraint
Figure 7The ROC Curve and AUC Values of Different Methods in LOOCV and 5-fold CV Scheme: SKF-LDA, RWRlncD, LRLSLDA, SIMCLDA, and BRWLDA
Results of Different Methods
| Measurement | SKF-LDA | RWRlncD | LRLSLDA | SIMCLDA | BRWLDA |
|---|---|---|---|---|---|
| AUC | 0.9049 | 0.6448 | 0.8349 | 0.8298 | 0.8024 |
| AUPR | 0.4082 | 0.0808 | 0.3343 | 0.2555 | 0.3068 |
| PRE | 0.4884 | 0.1076 | 0.4472 | 0.3539 | 0.4413 |
| Sensitivity | 0.3519 | 0.0444 | 0.2982 | 0.2019 | 0.2926 |
| Accuracy | 0.9732 | 0.9651 | 0.9718 | 0.9692 | 0.9716 |
| F1-score | 0.5206 | 0.0851 | 0.4593 | 0.3359 | 0.4527 |
| MCC | 0.4013 | 0.0532 | 0.3513 | 0.2526 | 0.3455 |
| PRE | 0.2407 | 0.1293 | 0.2283 | 0.2037 | 0.2100 |
| Sensitivity | 0.5852 | 0.2741 | 0.5463 | 0.4722 | 0.4907 |
| Accuracy | 0.9404 | 0.9321 | 0.9393 | 0.9374 | 0.9379 |
| F1-score | 0.7383 | 0.4302 | 0.7066 | 0.6415 | 0.6584 |
| MCC | 0.3501 | 0.1563 | 0.3271 | 0.2823 | 0.2937 |
The Top 10 lncRNA Candidates Predicted for Lung Cancer
| Rank | lncRNA | Disease | Evidence |
|---|---|---|---|
| 1 | GAS5 | lung cancer | lnc2Cancer2, MNDR |
| 2 | CCAT2 | lung cancer | MNDR |
| 3 | UCA1 | lung cancer | lnc2Cancer2, MNDR |
| 4 | HULC | lung cancer | unconfirmed |
| 5 | SPRY4-IT1 | lung cancer | MNDR |
| 6 | CCAT1 | lung cancer | MNDR |
| 7 | PVT1 | lung cancer | lnc2Cancer2, MNDR |
| 8 | NEAT1 | lung cancer | lnc2Cancer2, MNDR |
| 9 | XIST | lung cancer | MNDR |
| 10 | HNF1A-AS1 | lung cancer | MNDR |
The Top 10 lncRNA Candidates Predicted for Hepatocelluar Carcinoma
| Rank | lncRNA | Disease | Evidence |
|---|---|---|---|
| 1 | GAS5 | hepatocelluar carcinoma | lnc2Cancer2, MNDR |
| 2 | UCA1 | hepatocelluar carcinoma | lnc2Cancer2, MNDR |
| 3 | PVT1 | hepatocelluar carcinoma | lnc2Cancer2, MNDR |
| 4 | CCAT2 | hepatocelluar carcinoma | lnc2Cancer2, MNDR |
| 5 | CDKN2B-AS1 | hepatocelluar carcinoma | MNDR |
| 6 | CCAT1 | hepatocelluar carcinoma | lnc2Cancer2, MNDR |
| 7 | BANCR | hepatocelluar carcinoma | lnc2Cancer2, MNDR |
| 8 | PTENP1 | hepatocelluar carcinoma | lnc2Cancer2, MNDR |
| 9 | SPRY4-IT1 | hepatocelluar carcinoma | lnc2Cancer2, MNDR |
| 10 | NEAT1 | hepatocelluar carcinoma | lnc2Cancer2, MNDR |
The Top 10 lncRNA Candidates Predicted for Prostate Cancer
| Rank | lncRNA | Disease | Evidence |
|---|---|---|---|
| 1 | CDKN2B-AS1 | prostate cancer | MNDR |
| 2 | CCAT2 | prostate cancer | lnc2Cancer2, MNDR |
| 3 | XIST | prostate cancer | lnc2Cancer2, MNDR |
| 4 | PTENP1 | prostate cancer | lnc2Cancer2, MNDR |
| 5 | LSINCT5 | prostate cancer | unconfirmed |
| 6 | IGF2-AS | prostate cancer | unconfirmed |
| 7 | SPRY4-IT1 | prostate cancer | lnc2Cancer2, MNDR |
| 8 | MINA | prostate cancer | unconfirmed |
| 9 | CCAT1 | prostate cancer | lnc2Cancer2 |
| 10 | BANCR | prostate cancer | MNDR |