| Literature DB >> 28963512 |
Changlong Gu1, Bo Liao2, Xiaoying Li1, Lijun Cai1, Zejun Li1,3, Keqin Li4, Jialiang Yang5.
Abstract
There is more and more evidence that the mutation and dysregulation of long non-coding RNA (lncRNA) are associated with numerous diseases, including cancers. However, experimental methods to identify associations between lncRNAs and diseases are expensive and time-consuming. Effective computational approaches to identify disease-related lncRNAs are in high demand; and would benefit the detection of lncRNA biomarkers for disease diagnosis, treatment, and prevention. In light of some limitations of existing computational methods, we developed a global network random walk model for predicting lncRNA-disease associations (GrwLDA) to reveal the potential associations between lncRNAs and diseases. GrwLDA is a universal network-based method and does not require negative samples. This method can be applied to a disease with no known associated lncRNA (isolated disease) and to lncRNA with no known associated disease (novel lncRNA). The leave-one-out cross validation (LOOCV) method was implemented to evaluate the predicted performance of GrwLDA. As a result, GrwLDA obtained reliable AUCs of 0.9449, 0.8562, and 0.8374 for overall, novel lncRNA and isolated disease prediction, respectively, significantly outperforming previous methods. Case studies of colon, gastric, and kidney cancers were also implemented, and the top 5 disease-lncRNA associations were reported for each disease. Interestingly, 13 (out of the 15) associations were confirmed by literature mining.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28963512 PMCID: PMC5622075 DOI: 10.1038/s41598-017-12763-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Performance comparisons of GrwLDA, LRLSLDA and KATZLDA in terms of ROC curves and AUCs based on LOOCV. (1) The overall predicted performance evaluation; (2) The predicted performance of novel lncRNA-associated diseases prediction; (3) The predicted performance of isolated disease-related lncRNAs prediction.
Figure 2Performance comparisons of GrwLDA, LRLSLDA and KATZLDA in terms of PR curves and AUPRs based on LOOCV. (1) The overall predicted performance evaluation; (2) The predicted performance of novel lncRNA-associated diseases prediction; (3) The predicted performance of isolated disease-related lncRNAs prediction.
The top five predicted results for colon cancer, kidney cancer and gastric cancer. Only two associations are not confirmed by the latest research literature.
| rank | disease | lncRNA | evidence |
|---|---|---|---|
| 1 | Colon cancer | HOTAIR | LncRNADisease |
| 2 | Colon cancer | MALAT1 | LncRNADisease |
| 3 | Colon cancer | CRNDE | LncRNADisease |
| 4 | Colon cancer | PVT1 | literature[ |
| 5 | Colon cancer | KCNQ1OT1 | unconfirmed |
| 1 | Kidney cancer | H19 | LncRNADisease |
| 2 | Kidney cancer | GNAS-AS1 | unconfirmed |
| 3 | Kidney cancer | PVT1 | LncRNADisease |
| 4 | Kidney cancer | WT1-AS | literature[ |
| 5 | Kidney cancer | KCNQ1DN | literature[ |
| 1 | Gastric cancer | H19 | LncRNADisease |
| 2 | Gastric cancer | HOTAIR | LncRNADisease |
| 3 | Gastric cancer | MEG3 | LncRNADisease |
| 4 | Gastric cancer | PVT1 | LncRNADisease |
| 5 | Gastric cancer | MALAT1 | literature[ |
Performance comparisons of GrwLDA, LRLSLDA and KATZLDA methods based on the newly collected lncRNAs associated with colon, gastric and kidney cancer by the updates of LncRNADisease database and their ranking of the three methods.
| disease | lncRNA | LRLSLDA | KATZLDA | GrwLDA |
|---|---|---|---|---|
| Colon cancer | HOTAIR | 2 | 2 | 1 |
| Colon cancer | CRNDE | 19 | 31 | 3 |
| Colon cancer | MALAT1 | 1 | 1 | 2 |
| Colon cancer | KCNQ1OT1 | 8 | 23 | 5 |
| Colon cancer | LSINCT5 | 21 | 11 | 15 |
| Gastric cancer | H19 | 2 | 1 | 1 |
| Gastric cancer | HOTAIR | 1 | 5 | 2 |
| Gastric cancer | MEG3 | 3 | 2 | 3 |
| Gastric cancer | PVT1 | 4 | 3 | 4 |
| Gastric cancer | CDKN2B-AS1 | 7 | 6 | 7 |
| Gastric cancer | LSINCT5 | 14 | 15 | 23 |
| Gastric cancer | UCA1 | 70 | 19 | 16 |
| Gastric cancer | SPRY4-IT1 | 72 | 44 | 45 |
| Kidney cancer | H19 | 6 | 1 | 1 |
| Kidney cancer | PVT1 | 12 | 3 | 3 |
| Kidney cancer | MEG3 | 15 | 2 | 7 |
| Kidney cancer | MALAT1 | 26 | 4 | 10 |
| Kidney cancer | GAS5 | 45 | 15 | 29 |
| Kidney cancer | KCNQ1OT1 | 66 | 36 | 37 |
| Average ranking of the three diseases | 20.74 | 11.79 | 11.26 | |
Isolated disease-related lncRNA prediction was implemented for colon, kidney and gastric cancers; the top five predicted results of each disease are listed. A total of 13 of the 15 predicted results are confirmed by the updates of the LncRNADisease database and by the latest research literature.
| rank | disease | lncRNA | evidence |
|---|---|---|---|
| 1 | Colon cancer | HOTAIR | LncRNADisease |
| 2 | Colon cancer | PVT1 | literature[ |
| 3 | Colon cancer | MALAT1 | LncRNADisease |
| 4 | Colon cancer | CRNDE | LncRNADisease |
| 5 | Colon cancer | KCNQ1OT1 | unconfirmed |
| 1 | Kidney cancer | H19 | LncRNADisease |
| 2 | Kidney cancer | PVT1 | LncRNADisease |
| 3 | Kidney cancer | MEG3 | LncRNADisease |
| 4 | Kidney cancer | MALAT1 | LncRNADisease |
| 5 | Kidney cancer | GNAS-AS1 | unconfirmed |
| 1 | Gastric cancer | H19 | LncRNADisease |
| 2 | Gastric cancer | HOTAIR | LncRNADisease |
| 3 | Gastric cancer | MEG3 | LncRNADisease |
| 4 | Gastric cancer | PVT1 | LncRNADisease |
| 5 | Gastric cancer | MALAT1 | literature[ |
Novel lncRNA-associated diseases predicting H19, HOTAIR and MALAT1 and the top five of each lncRNA-predicted results are listed. As a result, 14 of the 15 predicted results are confirmed by the updates of the LncRNADisease database and by the latest research literature.
| rank | lncRNA | disease | evidence |
|---|---|---|---|
| 1 | H19 | Prostatic Neoplasms | LncRNADisease |
| 2 | H19 | Lymphoma | literature[ |
| 3 | H19 | Colorectal Neoplasms | LncRNADisease |
| 4 | H19 | Testicular Neoplasms | literature[ |
| 5 | H19 | Neuroblastoma | LncRNADisease |
| 1 | HOTAIR | Prostatic Neoplasms | literature[ |
| 2 | HOTAIR | Lymphoma | literature[ |
| 3 | HOTAIR | Ovarian Neoplasms | literature[ |
| 4 | HOTAIR | Testicular Neoplasms | unconfirmed |
| 5 | HOTAIR | Melanoma | literature[ |
| 1 | MALAT1 | Breast Neoplasms | LncRNADisease |
| 2 | MALAT1 | Prostatic Neoplasms | literature[ |
| 3 | MALAT1 | Lymphoma | literature[ |
| 4 | MALAT1 | Ovarian Neoplasms | literature[ |
| 5 | MALAT1 | Melanoma | literature[ |
Figure 3Flowchart of GrwLDA. The GrwLDA method is implemented in three steps as follows: (1) RWR is restarted from lncRNA seed nodes associated with query disease; (2) RWR is restarted from disease seed nodes associated with query lncRNA; and (3) the potential lncRNA-disease associations are predicted by integrating the results of step (1) and step (2).