| Literature DB >> 27517318 |
Xing Chen1, Zhu-Hong You2, Gui-Ying Yan3,4, Dun-Wei Gong1.
Abstract
In recent years, accumulating evidences have shown that the dysregulations of lncRNAs are associated with a wide range of human diseases. It is necessary and feasible to analyze known lncRNA-disease associations, predict potential lncRNA-disease associations, and provide the most possible lncRNA-disease pairs for experimental validation. Considering the limitations of traditional Random Walk with Restart (RWR), the model of Improved Random Walk with Restart for LncRNA-Disease Association prediction (IRWRLDA) was developed to predict novel lncRNA-disease associations by integrating known lncRNA-disease associations, disease semantic similarity, and various lncRNA similarity measures. The novelty of IRWRLDA lies in the incorporation of lncRNA expression similarity and disease semantic similarity to set the initial probability vector of the RWR. Therefore, IRWRLDA could be applied to diseases without any known related lncRNAs. IRWRLDA significantly improved previous classical models with reliable AUCs of 0.7242 and 0.7872 in two known lncRNA-disease association datasets downloaded from the lncRNADisease database, respectively. Further case studies of colon cancer and leukemia were implemented for IRWRLDA and 60% of lncRNAs in the top 10 prediction lists have been confirmed by recent experimental reports.Entities:
Keywords: cancer; disease; lncRNAs; random walk with restart
Mesh:
Substances:
Year: 2016 PMID: 27517318 PMCID: PMC5295400 DOI: 10.18632/oncotarget.11141
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Flowchart of IRWRLDA, demonstrating the basic ideas of uncovering potential disease-lncRNA associations by implementing random walk on the lncRNA similarity network constructed based on integrated lncRNA similarity
IRWRLDA consists of the following three steps: (1) decide the initial probability of all the lncRNAs, (2) implement random walk on the lncRNA similarity network constructed based on integrated lncRNA similarity, and (3) obtain stable probability of random walk and rank candidate lncRNAs. The essential difference between IRWRLDA and traditional RWR lies in the initial probability of lncRNAs.
Figure 2Comparison between IRWRLDA with other three the-state-of-art disease-lncRNA association prediction models in the framework of LOOCV was implemented
As a result, IRWRLDA achieved AUCs of 0.7242 and 0.7872 for the June-2012 Version and June-2014 Version datasets, respectively, significantly improving all the previous classical models.
Performance comparisons between IRWRLDA and LRLSLDA based on the rankings of newly updated disease associated lncRNAs in LncRNADisease database for the Colon cancer and Leukemia
| Disease | lncRNA | IRWRLDA | LRLSLDA |
|---|---|---|---|
| Colon cancer | CRNDE | 1 | 32 |
| Colon cancer | KCNQ1OT1 | 2 | 6 |
| Colon cancer | MALAT1 | 3 | 3 |
| Colon cancer | HOTAIR | 4 | 15 |
| Colon cancer | LSINCT5 | 52 | 115 |
| Leukemia | MEG3 | 1 | 4 |
| Leukemia | WT1-AS | 2 | 110 |
| Leukemia | DLEU2 | 4 | 10 |
| Leukemia | CDKN2B-AS1 | 6 | 2 |
| Leukemia | MIR155HG | 12 | 71 |
| Leukemia | DLEU1 | 18 | 63 |
| Average ranks | 9.55 | 39.18 | |
Performance comparisons between IRWRLDA and LRLSLDA based on the rankings of fourteen lncRNA-disease associations related with colon cancer and leukemia
| Disease | lncRNA | IRWRLDA | LRLSLDA |
|---|---|---|---|
| Colon cancer | CRNDE | 1 | 35 |
| Colon cancer | MALAT1 | 2 | 7 |
| Colon cancer | HOTAIR | 3 | 11 |
| Colon cancer | KCNQ1OT1 | 4 | 52 |
| Colon cancer | H19 | 6 | 1 |
| Colon cancer | MEG3 | 8 | 4 |
| Colon cancer | LSINCT5 | 21 | 92 |
| Leukemia | MEG3 | 1 | 4 |
| Leukemia | WT1-AS | 2 | 49 |
| Leukemia | DLEU2 | 4 | 13 |
| Leukemia | CDKN2B-AS1 | 6 | 2 |
| Leukemia | MIR155HG | 13 | 75 |
| Leukemia | DLEU1 | 18 | 74 |
| Leukemia | TCL6 | 20 | 102 |
| Average ranks | 7.79 | 37.21 | |
Here, case studies were implemented by removing all the known lncRNAs associated with investigated disease to fully demonstrate that IRWRLDA could be effectively applied to human diseases without any known related lncRNAs.
Performance comparisons between IRWRLDA and LRLSLDA based on the rankings of seven lncRNA-disease associations related with ALL, CLL, AML, and CML
| Disease | lncRNA | IRWRLDA | LRLSLDA |
|---|---|---|---|
| AML | MEG3 | 2 | 4 |
| AML | WT1-AS | 35 | 92 |
| CML | MEG3 | 2 | 6 |
| ALL | CDKN2B-AS1 | 13 | 2 |
| CLL | DLEU2 | 1 | 14 |
| CLL | DLEU1 | 71 | 117 |
| CLL | MIR155HG | 59 | 118 |
| Average ranks | 26.14 | 50.43 | |
Here, case studies were implemented by removing all the known lncRNAs associated with investigated disease to fully demonstrate that IRWRLDA could be effectively applied to human diseases without any known related lncRNAs.