Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 SOLpro: accurate sequence-based prediction of protein solubility.

Literature DB >> 19549632

SOLpro: accurate sequence-based prediction of protein solubility.

Christophe N Magnan¹, Arlo Randall, Pierre Baldi.

Abstract

MOTIVATION: Protein insolubility is a major obstacle for many experimental studies. A sequence-based prediction method able to accurately predict the propensity of a protein to be soluble on overexpression could be used, for instance, to prioritize targets in large-scale proteomics projects and to identify mutations likely to increase the solubility of insoluble proteins.
RESULTS: Here, we first curate a large, non-redundant and balanced training set of more than 17 000 proteins. Next, we extract and study 23 groups of features computed directly or predicted (e.g. secondary structure) from the primary sequence. The data and the features are used to train a two-stage support vector machine (SVM) architecture. The resulting predictor, SOLpro, is compared directly with existing methods and shows significant improvement according to standard evaluation metrics, with an overall accuracy of over 74% estimated using multiple runs of 10-fold cross-validation.

Mesh：

Substances：
Proteins

Year: 2009 PMID： 19549632 DOI： 10.1093/bioinformatics/btp386

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

110 in total

1. High-throughput prediction of protein antigenicity using protein microarray data.

Authors: Christophe N Magnan; Michael Zeller; Matthew A Kayala; Adam Vigil; Arlo Randall; Philip L Felgner; Pierre Baldi
Journal: Bioinformatics Date: 2010-10-07 Impact factor: 6.937

Review 2. Stepwise optimization of recombinant protein production in Escherichia coli utilizing computational and experimental approaches.

Authors: Kulandai Arockia Rajesh Packiam; Ramakrishnan Nagasundara Ramanan; Chien Wei Ooi; Lakshminarasimhan Krishnaswamy; Beng Ti Tey
Journal: Appl Microbiol Biotechnol Date: 2020-02-19 Impact factor: 4.813

3. Structural introspection of a putative fluoride transporter in plants.

Authors: Aditya Banerjee; Aryadeep Roychoudhury
Journal: 3 Biotech Date: 2019-02-22 Impact factor: 2.406

4. In vitro and in silico assessment of the developability of a designed monoclonal antibody library.

Authors: Adriana-Michelle Wolf Pérez; Pietro Sormanni; Jonathan Sonne Andersen; Laila Ismail Sakhnini; Ileana Rodriguez-Leon; Jais Rose Bjelke; Annette Juhl Gajhede; Leonardo De Maria; Daniel E Otzen; Michele Vendruscolo; Nikolai Lorenzen
Journal: MAbs Date: 2019-01-18 Impact factor: 5.857

SOLpro: accurate sequence-based prediction of protein solubility.

1. High-throughput prediction of protein antigenicity using protein microarray data.

Review 2. Stepwise optimization of recombinant protein production in Escherichia coli utilizing computational and experimental approaches.

3. Structural introspection of a putative fluoride transporter in plants.

4. In vitro and in silico assessment of the developability of a designed monoclonal antibody library.

Review 5. Critical evaluation of bioinformatics tools for the prediction of protein crystallization propensity.

6. Correlation Between Protein Primary Structure and Soluble Expression Level of HSA dAb in Escherichia coli.

7. SODA: prediction of protein solubility from disorder and aggregation propensity.

8. PaRSnIP: sequence-based protein solubility prediction using gradient boosting machine.

9. High-throughput developability assays enable library-scale identification of producible protein scaffold variants.

10. Discrimination of soluble and aggregation-prone proteins based on sequence information.