| Literature DB >> 34807960 |
Yi Zhang1,2, Min Chen3, Li Huang4,5, Xiaolan Xie1, Xin Li1, Hong Jin1, Xiaohua Wang6, Hanyan Wei6.
Abstract
It is well known that numerous long noncoding RNAs (lncRNAs) closely relate to the physiological and pathological processes of human diseases and can serves as potential biomarkers. Therefore, lncRNA-disease associations that are identified by computational methods as the targeted candidates reduce the cost of biological experiments focusing on deep study furtherly. However, inaccurate construction of similarity networks and inadequate numbers of observed known lncRNA-disease associations, such inherent problems make many mature computational methods that have been developed for many years still exit some limitations. It motivates us to explore a new computational method that was fused with KATZ measure and space projection to fast probing potential lncRNA-disease associations (namely KATZSP). KATZSP is comprised of following key steps: combining all the global information with which to change Boolean network of known lncRNA-disease associations into the weighted networks; changing the similarities calculation into counting the number of walks that connect lncRNA nodes and disease nodes in bipartite graphs; obtaining the space projection scores to refine the primary prediction scores. The process to fuse KATZ measure and space projection was simplified and uncomplicated with needing only one attenuation factor. The leave-one-out cross validation (LOOCV) experimental results showed that, compared with other state-of-the-art methods (NCPLDA, LDAI-ISPS and IIRWR), KATZSP had a higher predictive accuracy shown with area-under-the-curve (AUC) value on the three datasets built, while KATZSP well worked on inferring potential associations related to new lncRNAs (or isolated diseases). The results from real cases study (such as pancreas cancer, lung cancer and colorectal cancer) further confirmed that KATZSP is capable of superior predictive ability to be applied as a guide for traditional biological experiments.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34807960 PMCID: PMC8608294 DOI: 10.1371/journal.pone.0260329
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Impact with parameter variation on model prediction accuracy.
Fig 2Predictive abilities with different technical solutions on dataset 1.
Fig 4Predictive abilities with different technical solutions on dataset 3.
Fig 5Predictive abilities of KATZSP and other models on dataset 1.
Fig 7Predictive abilities of KATZSP and other models on dataset 3.
AUCs of KATZSP and other models on all three datasets.
| Model | NCPLDA | LDAI-ISPS | IIRWR | KATZSP |
|---|---|---|---|---|
| AUC on dataset 1 | 0.9107 (2.3%) | 0.9154 (1.9%) | 0.7883 (18.3%) | 0.9324 |
| AUC on dataset 2 | 0.9012 (4.3%) | 0.8341 (11.8%) | 0.8230 (14.3%) | 0.9403 |
| AUC on dataset 3 | 0.9307 (1.7%) | 0.8455 (12%) | 0.8745 (8.3%) | 0.9472 |
From data of “AUC on dataset 1” in Table 1, our KATZSP was demonstrated with higher AUC values which were 2.3%, 1.9% and 18.3% higher than that of NCPLDA, LDAI-ISPS and IIRWR, respectively. Similarly, the comparison results on dataset 2 demonstrated the AUC values of our KATZSP were 4.3%, 11.8% and 14.3% higher than that of NCPLDA, LDAI-ISPS and IIRWR, respectively. In the last row of Table 1, the 1.7%, 12% and 8.3% higher AUC values of our KATZSP were compared with that of NCPLDA, LDAI-ISPS and IIRWR, respectively. Therefore, our KATZSP was demonstrated with more reliable predictive ability over other previous models on all the three datasets under the evaluation framework of LOOCV.
Fig 8Predictive ability of KATZSP for new lncRNAs and isolated diseases.
Top 5 specific diseases-related candidate lncRNAs.
| Case | LncRNA | Evidences | Rank |
|---|---|---|---|
| Pancreas cancer | H19 | LncRNADisease | 1 |
| Pancreas cancer | MEG3 | LncRNADisease | 2 |
| Pancreas cancer | CDKN2B-AS1 | LncRNADisease | 3 |
| Pancreas cancer | GAS5 | LncRNADisease | 4 |
| Pancreas cancer | UCA1 | LncRNADisease | 5 |
| Lung cancer | PVT1 | LncRNADisease | 1 |
| Lung cancer | GAS5 | LncRNADisease | 2 |
| Lung cancer | CDKN2B-AS1 | LncRNADisease | 3 |
| Lung cancer | UCA1 | LncRNADisease | 4 |
| Lung cancer | NPTN-IT1 | Lnc2Cancer | 5 |
| Colorectal cancer | PVT1 | LncRNADisease | 1 |
| Colorectal cancer | CDKN2B-AS1 | Lnc2Cancer | 2 |
| Colorectal cancer | LSINCT5 | Lnc2Cancer | 3 |
| Colorectal cancer | GAS5 | Lnc2Cancer | 4 |
| Colorectal cancer | UCA1 | LncRNADisease | 5 |
The data in column “Evidences” of Table 2 showed that all the potential lncRNAs inferred relating to the three specific diseases have been found the evidence in LncRNADisease 2.0 or Lnc2Cancer 3.0. It validated the reliability of the inferred results coming from our KATZSP.
Top 5 specific isolated diseases-related candidate lncRNAs.
| Disease | lncRNA name | Evidences | Rank |
|---|---|---|---|
| pancreas cancer | HOTAIR | LncRNADisease | 1 |
| pancreas cancer | MALAT1 | LncRNADisease | 2 |
| pancreas cancer | H19 | LncRNADisease | 3 |
| pancreas cancer | MEG3 | LncRNADisease | 4 |
| pancreas cancer | TC0101441 | No evidence | 5 |
| lung cancer | HOTAIR | LncRNADisease | 1 |
| lung cancer | MALAT1 | LncRNADisease | 2 |
| lung cancer | H19 | LncRNADisease | 3 |
| lung cancer | MEG3 | LncRNADisease | 4 |
| lung cancer | PVT1 | LncRNADisease | 5 |
| colon cancer | HOTAIR | LncRNADisease | 1 |
| colon cancer | MALAT1 | LncRNADisease | 2 |
| colon cancer | H19 | LncRNADisease | 3 |
| colon cancer | EPB41L4A-AS1 | Literature [ | 4 |
| colon cancer | KRASP1 | No evidence | 5 |
Fig 9Workflow model of KATZSP.