| Literature DB >> 27028993 |
Yu-An Huang1, Xing Chen2,3, Zhu-Hong You4, De-Shuang Huang5, Keith C C Chan6.
Abstract
Increasing observations have indicated that lncRNAs play a significant role in various critical biological processes and the development and progression of various human diseases. Constructing lncRNA functional similarity networks could benefit the development of computational models for inferring lncRNA functions and identifying lncRNA-disease associations. However, little effort has been devoted to quantifying lncRNA functional similarity. In this study, we developed an Improved LNCRNA functional SIMilarity calculation model (ILNCSIM) based on the assumption that lncRNAs with similar biological functions tend to be involved in similar diseases. The main improvement comes from the combination of the concept of information content and the hierarchical structure of disease directed acyclic graphs for disease similarity calculation. ILNCSIM was combined with the previously proposed model of Laplacian Regularized Least Squares for lncRNA-Disease Association to further evaluate its performance. As a result, new model obtained reliable performance in the leave-one-out cross validation (AUCs of 0.9316 and 0.9074 based on MNDR and Lnc2cancer databases, respectively), and 5-fold cross validation (AUCs of 0.9221 and 0.9033 for MNDR and Lnc2cancer databases), which significantly improved the prediction performance of previous models. It is anticipated that ILNCSIM could serve as an effective lncRNA function prediction model for future biomedical researches.Entities:
Keywords: cancer; directed acyclic graph; disease; functional similarity; lncRNAs
Mesh:
Substances:
Year: 2016 PMID: 27028993 PMCID: PMC5041953 DOI: 10.18632/oncotarget.8296
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Flowchart of disease semantic similarity function in ILNCSIM based on disease DAGs
Figure 2Flowchart of lncRNA functional similarity calculation model based on disease semantic similarity
Figure 3Performance comparisons between ILNCSIM and three the-state-of-art disease-lncRNA association prediction models (LRLSLDA, LRLSLDA-LNCSIM1, and LRLSLDA-LNCSIM2) in terms of ROC curve and AUC based on global LOOCV
As a result, ILNCSIM achieved AUCs of 0.9316 and 0.9074 based on the MNDR and Lnc2Cancer databases, which significantly improved all the previous classical models and effectively demonstrated its reliable predictive ability.
Prediction results of lncRNA associated with colon cancer, lung cancer and prostate cancer in top 20 ranking lists
| Disease | lncRNA | Evidence(PMID/Database) | Rank |
|---|---|---|---|
| Colon cancer | H19 | Lnc2cancer | 1 |
| Colon cancer | UCA1 | Lnc2cancer | 3 |
| Colon cancer | HOTAIR | LncRNADisease | 13 |
| Colon cancer | XIST | Lnc2cancer | 14 |
| Colon cancer | MEG3 | Lnc2cancer | 16 |
| Colon cancer | HULC | Lnc2cancer | 19 |
| Lung cancer | BC200 | Lnc2cancer | 1 |
| Lung cancer | UCA1 | 26380024 | 3 |
| Lung cancer | HOTAIR | Lnc2cancer | 4 |
| Lung cancer | XIST | Lnc2cancer | 8 |
| Lung cancer | GAS5 | Lnc2cancer | 10 |
| Lung cancer | MEG3 | Lnc2cancer | 17 |
| Lung cancer | LSINCT5 | Lnc2cancer | 20 |
| Prostate cancer | H19 | LncRNADisease | 1 |
| Prostate cancer | CBR3-AS1 | LncRNADisease | 2 |
| Prostate cancer | UCA1 | Lnc2cancer | 3 |
| Prostate cancer | KCNQ1OT1 | Lnc2cancer | 13 |
| Prostate cancer | LINCRNA-P21 | Lnc2cancer | 14 |
| Prostate cancer | MEG3 | LncRNADisease | 15 |
Performance comparison between LRLSLDA-ILNCSIM and three other previously proposed models based on the rankings of newly discovered lncRNAs associated with colon cancer, which were recorded in Lnc2Cancer and LncRNADisease databases
| LncRNA | LRLSLDA-ILNCSIM | LRLSLDA-LNCSIM1 | LRLSLDA-LNCSIM2 | LRLSLDA |
|---|---|---|---|---|
| BACE1AS | 192 | 181 | 192 | 38 |
| GAS5 | 23 | 54 | 57 | 35 |
| H19 | 1 | 1 | 1 | 2 |
| HOTAIR | 13 | 12 | 9 | 6 |
| HULC | 19 | 37 | 34 | 31 |
| KCNQ1OT1 | 24 | 21 | 13 | 93 |
| lincRNA-p21 | 36 | 183 | 73 | 94 |
| LSINCT5 | 39 | 71 | 78 | 195 |
| MEG3 | 16 | 16 | 23 | 10 |
| PRNCR1 | 67 | 50 | 49 | 83 |
| PVT1 | 68 | 190 | 94 | 84 |
| uc.338 | 37 | 59 | 52 | 57 |
| UCA1 | 3 | 3 | 3 | 4 |
| XIST | 14 | 33 | 35 | 33 |
| ZFAS1 | 59 | 73 | 93 | 88 |
| Percentage in the top 20 | 40% | 26.67% | 26.67% | 26.67% |
| Average rank | 40.73 | 65.60 | 53.73 | 56.87 |
Performance comparison between LRLSLDA-ILNCSIM and three other previously proposed models based on the rankings of newly discovered lncRNAs associated with lung cancer, which were recorded in Lnc2Cancer and LncRNADisease databases
| LncRNA | LRLSLDA-ILNCSIM | LRLSLDA-LNCSIM1 | LRLSLDA-LNCSIM2 | LRLSLDA |
|---|---|---|---|---|
| BC200 | 1 | 1 | 1 | 192 |
| GAS5 | 10 | 13 | 21 | 21 |
| HOTAIR | 4 | 3 | 4 | 190 |
| LSINCT5 | 20 | 26 | 39 | 18 |
| MEG3 | 17 | 4 | 5 | 74 |
| PVT1 | 34 | 63 | 65 | 15 |
| UCA1 | 3 | 18 | 11 | 1 |
| XIST | 11 | 17 | 18 | 17 |
| Percentage in the top 20 | 87.5% | 75% | 62.5% | 50% |
| Average rank | 12.50 | 18.13 | 20.50 | 66.00 |
Performance comparison between LRLSLDA-ILNCSIM and three other previously proposed models based on the rankings of newly discovered lncRNAs associated with prostate cancer, which were recorded in Lnc2Cancer and LncRNADisease databases
| LncRNA | LRLSLDA-ILNCSIM | LRLSLDA-LNCSIM1 | LRLSLDA-LNCSIM2 | LRLSLDA |
|---|---|---|---|---|
| CBR3-AS1 | 2 | 11 | 13 | 47 |
| H19 | 1 | 8 | 8 | 225 |
| HULC | 26 | 221 | 222 | 211 |
| IGF2-AS | 217 | 216 | 214 | 35 |
| KCNQ1OT1 | 13 | 38 | 28 | 213 |
| lincRNA-p21 | 14 | 9 | 10 | 54 |
| MEG3 | 15 | 7 | 3 | 215 |
| NEAT1 | 21 | 40 | 40 | 7 |
| UCA1 | 3 | 1 | 1 | 224 |
| Percentage in the top 20 | 66.67% | 55.56% | 55.56% | 11.11% |
| Average rank | 34.67 | 61.22 | 59.89 | 136.78 |