PROSO II--a new method for protein solubility prediction.

Pawel Smialowski, Gero Doose, Phillipp Torkler, Stefanie Kaufmann, Dmitrij Frishman.

Abstract

Many fields of science and industry depend on efficient production of active protein using heterologous expression in Escherichia coli. The solubility of proteins upon expression depends on their amino acid sequence. Prediction of solubility from sequence is therefore highly valuable. We present a novel machine-learning-based model called PROSO II, which makes use of new classification methods and growth in experimental data to improve the coverage and accuracy of solubility predictions. The classification algorithm is organized as a two-layered structure in which the outputs of a primary Parzen window model for sequence similarity and a logistic regression classifier of amino acid k-mer composition serve as input for a second-level logistic regression classifier. Compared with previously published research, our model is trained on five times more data than any previous method (82 000 proteins). When tested on a separate holdout set not used at any point during method development, our server attained the best results among currently available methods: accuracy 75.4%, Matthews correlation coefficient 0.39, sensitivity 0.731, specificity 0.759, gain (soluble) 2.263. In summary, by combining state-of-the-art machine learning techniques with the largest currently available experimental data set, the PROSO II server constitutes a substantial improvement in protein solubility prediction. PROSO II is available at http://mips.helmholtz-muenchen.de/prosoII.
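The amino acid k-mer composition that feeds one of the first-level classifiers can be illustrated with a minimal sketch. The abstract does not state which k values, alphabet, or normalization PROSO II uses, so the function name `kmer_composition`, the 20-letter alphabet, the default `k=2`, and the relative-frequency normalization below are all assumptions made for illustration, not the authors' implementation.

```python
from collections import Counter
from itertools import product

# Assumption: the 20 standard amino acids; the abstract does not specify the alphabet.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def kmer_composition(sequence, k=2):
    """Return relative frequencies of all 20**k k-mers in a fixed order.

    Windows containing non-standard residues (e.g. 'X') are ignored.
    """
    sequence = sequence.upper()
    kmers = ["".join(p) for p in product(AMINO_ACIDS, repeat=k)]
    # Count every overlapping window of length k.
    counts = Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))
    total = sum(counts[m] for m in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[m] / total for m in kmers]


# Hypothetical toy sequence, not taken from the PROSO II data set.
features = kmer_composition("MKTAYIAKQR", k=2)
print(len(features))  # 400 dipeptide features
```

A fixed-length vector like this could be passed to any logistic regression implementation; how PROSO II combines it with the Parzen window output at the second level is described only at the level of detail given in the abstract.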
© 2012 The Authors. Journal compilation © 2012 FEBS.
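The evaluation measures quoted in the abstract (accuracy, Matthews correlation coefficient, sensitivity, specificity) follow from a binary confusion matrix, as sketched below. The counts are hypothetical and chosen only for easy arithmetic; they are not the PROSO II holdout results. The abstract does not define "gain"; the sketch assumes it is the precision for the soluble class divided by the fraction of soluble proteins in the test set, which should be checked against the full paper.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Compute standard binary classification measures from confusion-matrix counts."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    sensitivity = tp / (tp + fn)   # true positive rate (soluble recovered)
    specificity = tn / (tn + fp)   # true negative rate (insoluble recovered)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    # Assumed definition of gain: precision relative to the class prior.
    precision = tp / (tp + fp)
    prior = (tp + fn) / total
    gain = precision / prior
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "mcc": mcc,
        "gain": gain,
    }


# Hypothetical counts for illustration only.
m = binary_metrics(tp=40, fp=10, tn=40, fn=10)
for name, value in m.items():
    print(f"{name}: {value:.3f}")
```

Under the assumed definition, a gain above 1 means that proteins predicted as soluble are enriched for truly soluble proteins relative to random selection.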

Year:  2012        PMID: 22536855     DOI: 10.1111/j.1742-4658.2012.08603.x

Source DB:  PubMed          Journal:  FEBS J        ISSN: 1742-464X            Impact factor:   5.542


  54 in total

1.  Stepwise optimization of recombinant protein production in Escherichia coli utilizing computational and experimental approaches. (Review)

Authors:  Kulandai Arockia Rajesh Packiam; Ramakrishnan Nagasundara Ramanan; Chien Wei Ooi; Lakshminarasimhan Krishnaswamy; Beng Ti Tey
Journal:  Appl Microbiol Biotechnol       Date:  2020-02-19       Impact factor: 4.813

2.  Genome-wide characterization and expression analysis of common bean bHLH transcription factors in response to excess salt concentration.

Authors:  Musa Kavas; Mehmet Cengiz Baloğlu; Elif Seda Atabay; Ummugulsum Tanman Ziplar; Hayriye Yıldız Daşgan; Turgay Ünver
Journal:  Mol Genet Genomics       Date:  2015-07-21       Impact factor: 3.291

3.  Correlation Between Protein Primary Structure and Soluble Expression Level of HSA dAb in Escherichia coli.

Authors:  Yankun Yang; Guoqiang Liu; Meng Liu; Zhonghu Bai; Xiuxia Liu; Xiaofeng Dai; Wenwen Guo
Journal:  Food Technol Biotechnol       Date:  2018-03       Impact factor: 3.918

4.  Effect of C-Terminus Modification in Salmonella typhimurium FliC on Protein Purification Efficacy and Bioactivity.

Authors:  Mohammad-Hosein Khani; Masoumeh Bagheri; Ali Dehghanian; Azadeh Zahmatkesh; Soheila Moradi Bidhendi; Zahra Salehi Najafabadi; Reza Banihashemi
Journal:  Mol Biotechnol       Date:  2019-01       Impact factor: 2.695

5.  SODA: prediction of protein solubility from disorder and aggregation propensity.

Authors:  Lisanna Paladin; Damiano Piovesan; Silvio C E Tosatto
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

6.  PaRSnIP: sequence-based protein solubility prediction using gradient boosting machine.

Authors:  Reda Rawi; Raghvendra Mall; Khalid Kunji; Chen-Hsiang Shen; Peter D Kwong; Gwo-Yu Chuang
Journal:  Bioinformatics       Date:  2018-04-01       Impact factor: 6.937

7.  Prediction of Protein Solubility Based on Sequence Feature Fusion and DDcCNN.

Authors:  Xianfang Wang; Yifeng Liu; Zhiyong Du; Mingdong Zhu; Aman Chandra Kaushik; Xue Jiang; Dongqing Wei
Journal:  Interdiscip Sci       Date:  2021-07-08       Impact factor: 2.233

8.  Dynamic transcriptional response of Escherichia coli to inclusion body formation.

Authors:  Faraz Baig; Lawrence P Fernando; Mary Alice Salazar; Rhonda R Powell; Terri F Bruce; Sarah W Harcum
Journal:  Biotechnol Bioeng       Date:  2014-01-30       Impact factor: 4.530

9.  Phosphoglycolate phosphatase is a metabolic proofreading enzyme essential for cellular function in Plasmodium berghei.

Authors:  Lakshmeesha Kempaiah Nagappa; Pardhasaradhi Satha; Thimmaiah Govindaraju; Hemalatha Balaram
Journal:  J Biol Chem       Date:  2019-01-30       Impact factor: 5.157

10.  Discrimination of soluble and aggregation-prone proteins based on sequence information.

Authors:  Yaping Fang; Jianwen Fang
Journal:  Mol Biosyst       Date:  2013-02-25