| Literature DB >> 31191605 |
Le Ou-Yang1,2, Jiang Huang3, Xiao-Fei Zhang4, Yan-Ran Li3, Yiwen Sun5, Shan He6, Zexuan Zhu3.
Abstract
Evidences increasingly indicate the involvement of long non-coding RNAs (lncRNAs) in various biological processes. As the mutations and abnormalities of lncRNAs are closely related to the progression of complex diseases, the identification of lncRNA-disease associations has become an important step toward the understanding and treatment of diseases. Since only a limited number of lncRNA-disease associations have been validated, an increasing number of computational approaches have been developed for predicting potential lncRNA-disease associations. However, how to predict potential associations precisely through computational approaches remains challenging. In this study, we propose a novel two-side sparse self-representation (TSSR) algorithm for lncRNA-disease association prediction. By learning the self-representations of lncRNAs and diseases from known lncRNA-disease associations adaptively, and leveraging the information provided by known lncRNA-disease associations and the intra-associations among lncRNAs and diseases derived from other existing databases, our model could effectively utilize the estimated representations of lncRNAs and diseases to predict potential lncRNA-disease associations. The experiment results on three real data sets demonstrate that our TSSR outperforms other competing methods significantly. Moreover, to further evaluate the effectiveness of TSSR in predicting potential lncRNAs-disease associations, case studies of Melanoma, Glioblastoma, and Glioma are carried out in this paper. The results demonstrate that TSSR can effectively identify some candidate lncRNAs associated with these three diseases.Entities:
Keywords: computational approaches; disease similarity; lncRNA similarity; lncRNAs-disease associations prediction; sparse representation
Year: 2019 PMID: 31191605 PMCID: PMC6546878 DOI: 10.3389/fgene.2019.00476
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
The statistics of three datasets.
| LncRNA Disease | 285 | 226 | 621 | 0.01 |
| MNDR | 95 | 81 | 260 | 0.03 |
| Lnc2Cancer | 436 | 54 | 677 | 0.03 |
Figure 1AUC scores of various algorithms in LncRNADisease dataset (* indicates TSSR significantly outperforms the competitor with p < 0.05 using t-test, error bars denote 95% confidence intervals).
Figure 3AUC scores of various algorithms in Lnc2Cancer dataset (* indicates TSSR significantly outperforms the competitor with p < 0.05 using t-test, error bars denote 95% confidence intervals).
Figure 2AUC scores of various algorithms in MNDR dataset (* indicates TSSR significantly outperforms the competitor with p < 0.05 using t-test, error bars denote 95% confidence intervals).
Figure 4Performance of TSSR with and without external information (denoted by TSSR and TSSR_original, respectively) on LncRNADisease, MNDR, and Lnc2Cancer datasets, measured by AUC (error bars denote 95% confidence intervals).
Figure 5Performance of TSSR on LncRNADisease, MNDR, and Lnc2Cancer datasets, measured by AUC with different values of λ and λ (error bars denote 95% confidence intervals).
Figure 6Performance of TSSR on LncRNADisease, MNDR, and Lnc2Cancer datasets, measured by AUC with different values of β (error bars denote 95% confidence intervals).
The identified novel lncRNAs that have been verified to be associated with Melanoma.
| 1 | CCAT2 | MNDR | Prediction evidence | |
| 2 | TUSC7 | MNDR | Prediction evidence | |
| 9 | GHET1 | MNDR | Prediction evidence | |
| 12 | MEG3 | MNDR/lnc2Cancer | 29781534,29808164 | Up-regulated, differential expression |
| 13 | HOTAIR | MNDR/lnc2Cancer | 28067428,23862139 | up-regulated |
| 14 | SOX2-OT | MNDR | Prediction evidence | |
| 15 | MALAT1 | MNDR/lnc2Cancer | 27725873, 27564100,27966454,24892958,19625619 | Up-regulated,differential expression |
| 17 | SNHG5 | MNDR/lnc2Cancer | 26440365 | Up-regulated |
| 18 | BCAR4 | MNDR | Prediction evidence | |
| 19 | CCAT1 | lnc2Cancer | 28409554 | Up-regulated |
Prediction evidence denotes the prediction associations in MNDR database.
The identified novel lncRNAs that have been verified to be associated with Glioma.
| 2 | HOTAIR | MNDR/lnc2Cancer | 29323737,28083786 ,29218099, 27277755,24203894 | Up-regulated, down-regulated |
| 3 | MALAT1 | MNDR/lnc2Cancer | 28551849,27134488,26649728,25613066,26619802 | Up-regulated, down-regulated |
| 4 | GAS5 | MNDR/lnc2Cancer | 26370254,28666797 | Up-regulated, down-regulated |
| 7 | PVT1 | lnc2Cancer | 28351322,29108264,29620147,29501773,29046366 | Up-regulated, differential expression |
| 11 | SPRY4-IT1 | MNDR/lnc2Cancer | 29467908,27460732,26464658 | Up-regulated |
| 12 | GHET1 | MNDR | Prediction evidence | |
| 15 | IGF2-AS | MNDR | Prediction evidence | |
| 18 | LincRNA-p21 | lnc2Cancer | 28689810 | Down-regulated |
| 19 | SNHG4 | MNDR | Prediction evidence |
Prediction evidence denotes the prediction associations in MNDR database.
The identified novel lncRNAs that have been verified to be associated with Glioblastoma.
| 1 | MEG3 | MNDR/lnc2Cancer | 27306825,28187000,22234798,25378224,26111795 | Up-regulated |
| 2 | HOTAIR | MNDR/lnc2Cancer | 27306825,25428914,25823657,26111795,26943771 | Up-regulated |
| 6 | BCYRN1 | MNDR | 25561975 | Differentially expressed |
| 8 | GAS5 | MNDR/lnc2Cancer | 27784795,23726844 | Up-regulated, differentially expressed |
| 10 | NEAT1 | lnc2Cancer | 23046790 | Up-regulated |
| 11 | HIF1A-AS2 | MNDR/lnc2Cancer | 27264189 | Up-regulated |
| 15 | NBAT1 | lnc2Cancer | 29771423 | Up-regulated |
| 17 | NDM29 | MNDR | 25561975 | Differentially expressed |
Prediction evidence denotes the prediction associations in MNDR database.
Algorithm for the TSSR model
|
Initialize Update Update Check the convergence conditions. |