| Literature DB >> 35168563 |
Xiaoquan Chu1, Tanlin Sun2, Qian Li3, Youjun Xu2, Zhuqing Zhang4, Luhua Lai5,6,7, Jianfeng Pei8.
Abstract
BACKGROUND: The liquid-liquid phase separation (LLPS) of biomolecules in cell underpins the formation of membraneless organelles, which are the condensates of protein, nucleic acid, or both, and play critical roles in cellular function. Dysregulation of LLPS is implicated in a number of diseases. Although the LLPS of biomolecules has been investigated intensively in recent years, the knowledge of the prevalence and distribution of phase separation proteins (PSPs) is still lag behind. Development of computational methods to predict PSPs is therefore of great importance for comprehensive understanding of the biological function of LLPS.Entities:
Keywords: Liquid–liquid phase separation (LLPS); Machine learning; Phase separation proteins (PSPs); Predictor
Mesh:
Substances:
Year: 2022 PMID: 35168563 PMCID: PMC8845408 DOI: 10.1186/s12859-022-04599-w
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The evaluation of the best model (model 1) for PSPs prediction
| Accuracya | F1a | Precisiona | Sensitivitya | Specificitya | MCCa |
|---|---|---|---|---|---|
| 0.95 ± 0.03 | 0.92 ± 0.01 | 0.95 ± 0.03 | 0.94 ± 0.04 | 0.96 ± 0.05 | 0.90 ± 0.05 |
aData are represented as mean ± SD
Fig. 1Relationship between percent recall and total percentage of human proteins accepted at given thresholds, for Model 0 and three best first generation prediction tools
Fig. 2Fraction of proteins in each category (scaffold, regulator or client) predicted as PSPs by PSPredictor or PScore
Fig. 3A Fraction of proteins in each tier group predicted as PSPs by first generation prediction tools and PSPredictor. B The number of predicted PSPs that overlapped between two prediction tools
Fig. 4The model architectures of CBOW (A) and Skip-gram (B)
The evaluation of the Com-Len model for PSPs prediction
| Accuracya | F1a | Precisiona | Sensitivitya | Specificitya | MCCa |
|---|---|---|---|---|---|
| 0.87 ± 0.04 | 0.87 ± 0.04 | 0.92 ± 0.04 | 0.81 ± 0.06 | 0.93 ± 0.04 | 0.76 ± 0.07 |
aData are represented as mean ± SD
Fig. 52D vector projection of PSPs and non-PSPs by PCA