| Literature DB >> 28575391 |
Max Hebditch1, M Alejandro Carballo-Amador2, Spyros Charonis1, Robin Curtis1, Jim Warwicker2.
Abstract
MOTIVATION: Protein solubility is an important property in industrial and therapeutic applications. Prediction is a challenge, despite a growing understanding of the relevant physicochemical properties.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28575391 PMCID: PMC5870856 DOI: 10.1093/bioinformatics/btx345
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
ROC plot and correlation analysis of predictions versus protein solubility or abundance
| Set | Proteins | 5% Tails | AUC | Acc at 58% | Corr |
|---|---|---|---|---|---|
| 2395 | 120 | 0.974 | 0.900 | 0.621 | |
| 2364 | 119 | 0.922 | 0.828 | 0.382 | |
| Yeast abundance Test | 4275 | 214 | 0.707 | 0.626 | 0.188 |
| Human abundance Test | 10662 | 534 | 0.708 | 0.659 | 0.190 |
Note: AUC is area under the curve, Acc is accuracy at 58% solubility prediction threshold.
Fig. 1Performance of the predictions across bacterial and eukaryotic proteomes. ROC plots are shown for prediction performance in the training set of measured solubilities, and 3 test sets of protein abundance
Fig. 2(A) The Protein–Sol calculation. Results are shown for the E.coli thioredoxin example. (B) E.coli thioredoxin (2trx chain A, Katti ) is shown colour-coded by subdomain (1–67 and 68–108), with salt-bridges E44-K96 and E48-K100 displayed between the subdomains. Drawn with PyMOL (http://pymol.org)