| Literature DB >> 24392133 |
Ming-Xi Liu1, Xing Chen2, Geng Chen3, Qing-Hua Cui3, Gui-Ying Yan2.
Abstract
As a major class of noncoding RNAs, long noncoding RNAs (lncRNAs) have been implicated in various critical biological processes. Accumulating researches have linked dysregulations and mutations of lncRNAs to a variety of human disorders and diseases. However, to date, only a few human lncRNAs have been associated with diseases. Therefore, it is very important to develop a computational method to globally predict potential associated diseases for human lncRNAs. In this paper, we developed a computational framework to accomplish this by combining human lncRNA expression profiles, gene expression profiles, and human disease-associated gene data. Applying this framework to available human long intergenic noncoding RNAs (lincRNAs) expression data, we showed that the framework has reliable accuracy. As a result, for non-tissue-specific lincRNAs, the AUC of our algorithm is 0.7645, and the prediction accuracy is about 89%. This study will be helpful for identifying novel lncRNAs for human diseases, which will help in understanding the roles of lncRNAs in human diseases and facilitate treatment. The corresponding codes for our method and the predicted results are all available at http://asdcd.amss.ac.cn/MingXiLiu/lncRNA-disease.html.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24392133 PMCID: PMC3879311 DOI: 10.1371/journal.pone.0084408
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Number of genes, diseases and gene-associated diseases from five different data sources.
| Number of genes | Number of diseases | Number of gene-disease associations | |
|
| 1754 | 2243 | 2525 |
|
| 6065 | 4403 | 16382 |
|
| 2461 | 1395 | 12798 |
|
| 1253 | 1016 | 1749 |
|
| 6140 | 1847 | 59274 |
Figure 1Working principles of the computational framework.
Briefly, if a given lncRNA could be specifically linked with some human tissues, then we could, in turn, link it to diseases known to be related to these human tissues. Moreover, if we could find the effective associations of other lncRNAs with human genes, then we could construct a human lncRNA-gene co-expressed network and human gene-disease network and then infer the associations between lncRNAs and disease through incorporating the information provided by these two networks.
Figure 2The flowchart of our method.
There are four steps: (1) Calculation of tissue specificity score and partitioning all the lncRNAs to those that are tissue-specific and non-tissue-specific. (2) Prediction of potential lncRNA-associated diseases for tissue-specific lncRNAs. (3) For each non-tissue-specific lncRNA, find the corresponding genes co-expressed with this certain lncRNA through computing Spearman’s correlation coefficients. (4) Perform disease enrichment analysis for the set of genes co-expressed with each lncRNA and predict potential lncRNA-associated diseases for non-tissue-specific lncRNAs.
Figure 3Disease enrichment analysis for the set of genes co-expressed with each non-tissue-specific lncRNA.
The blue rectangle represents the whole human gene set, and the corresponding number is 17080. The red circle represents the set of genes co-expressed with a certain lncRNA, and the corresponding number is n. The green circle represents the set of genes related to a certain disease, and the corresponding number is x. The intersection of these two circles stands for the genes co-expressed with a certain lncRNA and related to a certain disease, and the corresponding number is y.
Case studies to evaluate the performance of our algorithm for tissue-specific lincRNAs. Four of the top ten associations were verified.
| LincRNA | Disease | Evidence |
| TCONS_00000720 | Ovary-related diseases | Yoshihara et al., 2011 |
| TCONS_00000721 | Ovary-related diseases | Yoshihara et al., 2011 |
| TCONS_l2_00001979 | Liver-related diseases | |
| TCONS_00000360 | White blood cell-related diseases | |
| TCONS_00001767 | HLF_r1-related diseases | |
| TCONS_l2_00002779 | Testes-related diseases | |
| TCONS_00000822 | Testes-related diseases | |
| TCONS_00000077 | Brain-related diseases | |
| TCONS_00000895 | Testes-related diseases | Gene Expression Atlas Database |
| TCONS_00001488 | Testes-related diseases | Gene Expression Atlas Database |
Figure 4The ROC curve for leave-one-out cross validation and the AUC of our algorithm is 0.7645.
Validation of predicted lincRNA-associated diseases for non-tissue-specific lincRNAs, in which 32 of 36 associations have been confirmed by known experimentally verified data in the LncRNADisease database.
| LincRNA | Disease | Evidence |
| TCONS_00017432 | Lymphoa, T-Cell, Cutaneous | |
| TCONS_00015353 | Lymphoa, T-Cell, Cutaneous | LncRNADisease verified |
| TCONS_00015354 | Lymphoa, T-Cell, Cutaneous | LncRNADisease verified |
| TCONS_00014856 | Lymphoa, T-Cell, Cutaneous | LncRNADisease verified |
| TCONS_00015366 | Lymphoa, T-Cell, Cutaneous | LncRNADisease verified |
| TCONS_00015365 | Lymphoa, T-Cell, Cutaneous | LncRNADisease verified |
| TCONS_00015363 | Lymphoa, T-Cell, Cutaneous | LncRNADisease verified |
| TCONS_00015364 | Lymphoa, T-Cell, Cutaneous | LncRNADisease verified |
| TCONS_00015361 | Lymphoa, T-Cell, Cutaneous | LncRNADisease verified |
| TCONS_00015362 | Lymphoa, T-Cell, Cutaneous | LncRNADisease verified |
| TCONS_00015360 | Lymphoa, T-Cell, Cutaneous | LncRNADisease verified |
| TCONS_00015359 | Lymphoa, T-Cell, Cutaneous | LncRNADisease verified |
| TCONS_00014855 | Lymphoa, T-Cell, Cutaneous | LncRNADisease verified |
| TCONS_00014854 | Lymphoa, T-Cell, Cutaneous | LncRNADisease verified |
| TCONS_00017432 | Leukemia | |
| TCONS_00015353 | Leukemia | LncRNADisease verified |
| TCONS_00015354 | Leukemia | LncRNADisease verified |
| TCONS_00014856 | Leukemia | LncRNADisease verified |
| TCONS_00015366 | Leukemia | LncRNADisease verified |
| TCONS_00015365 | Leukemia | LncRNADisease verified |
| TCONS_00015363 | Leukemia | LncRNADisease verified |
| TCONS_00015364 | Leukemia | LncRNADisease verified |
| TCONS_00015361 | Leukemia | LncRNADisease verified |
| TCONS_00015362 | Leukemia | LncRNADisease verified |
| TCONS_00015360 | Leukemia | LncRNADisease verified |
| TCONS_00015359 | Leukemia | LncRNADisease verified |
| TCONS_00014855 | Leukemia | LncRNADisease verified |
| TCONS_00014854 | Leukemia | LncRNADisease verified |
| TCONS_00063838_KCNQ1OT1 | Arthritis, Rheumatoid | |
| TCONS_00017432 | Breast Neoplasms | LncRNADisease verified |
| TCONS_00063838_KCNQ1OT1 | Lupus Erythematosus, Systemic | |
| TCONS_00017432 | Carcinoma | LncRNADisease verified |
| TCONS_00015353 | Breast Neoplasms | LncRNADisease verified |
| TCONS_00015354 | Breast Neoplasms | LncRNADisease verified |
| TCONS_00015353 | Carcinoma | LncRNADisease verified |
| TCONS_00015354 | Carcinoma | LncRNADisease verified |
The lincRNA-associated diseases in this table are sorted in ascending order of the corresponding adjusted p-value of the hypergeometric distribution test.