Literature DB >> 16332713

A support vector machine-based method for predicting the propensity of a protein to be soluble or to form inclusion body on overexpression in Escherichia coli.

Susan Idicula-Thomas1, Abhijit J Kulkarni, Bhaskar D Kulkarni, Valadi K Jayaraman, Petety V Balaji.   

Abstract

MOTIVATION: Inclusion body formation has been a major deterrent for overexpression studies since a large number of proteins form insoluble inclusion bodies when overexpressed in Escherichia coli. The formation of inclusion bodies is known to be an outcome of improper protein folding; thus the composition and arrangement of amino acids in the proteins would be a major influencing factor in deciding its aggregation propensity. There is a significant need for a prediction algorithm that would enable the rational identification of both mutants and also the ideal protein candidates for mutations that would confer higher solubility-on-overexpression instead of the presently used trial-and-error procedures.
RESULTS: Six physicochemical properties together with residue and dipeptide-compositions have been used to develop a support vector machine-based classifier to predict the overexpression status in E.coli. The prediction accuracy is approximately 72% suggesting that it performs reasonably well in predicting the propensity of a protein to be soluble or to form inclusion bodies. The algorithm could also correctly predict the change in solubility for most of the point mutations reported in literature. This algorithm can be a useful tool in screening protein libraries to identify soluble variants of proteins.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16332713     DOI: 10.1093/bioinformatics/bti810

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  27 in total

1.  Multiple post-translational modifications affect heterologous protein synthesis.

Authors:  Alexander A Tokmakov; Atsushi Kurotani; Tetsuo Takagi; Mitsutoshi Toyama; Mikako Shirouzu; Yasuo Fukami; Shigeyuki Yokoyama
Journal:  J Biol Chem       Date:  2012-06-06       Impact factor: 5.157

2.  High-throughput prediction of protein antigenicity using protein microarray data.

Authors:  Christophe N Magnan; Michael Zeller; Matthew A Kayala; Adam Vigil; Arlo Randall; Philip L Felgner; Pierre Baldi
Journal:  Bioinformatics       Date:  2010-10-07       Impact factor: 6.937

3.  Correlation Between Protein Primary Structure and Soluble Expression Level of HSA dAb in Escherichia coli.

Authors:  Yankun Yang; Guoqiang Liu; Meng Liu; Zhonghu Bai; Xiuxia Liu; Xiaofeng Dai; Wenwen Guo
Journal:  Food Technol Biotechnol       Date:  2018-03       Impact factor: 3.918

Review 4.  Intrinsic disorder and functional proteomics.

Authors:  Predrag Radivojac; Lilia M Iakoucheva; Christopher J Oldfield; Zoran Obradovic; Vladimir N Uversky; A Keith Dunker
Journal:  Biophys J       Date:  2006-12-08       Impact factor: 4.033

5.  Learning to predict expression efficacy of vectors in recombinant protein production.

Authors:  Wen-Ching Chan; Po-Huang Liang; Yan-Ping Shih; Ueng-Cheng Yang; Wen-chang Lin; Chun-Nan Hsu
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

Review 6.  Current state and recent advances in biopharmaceutical production in Escherichia coli, yeasts and mammalian cells.

Authors:  Aleš Berlec; Borut Strukelj
Journal:  J Ind Microbiol Biotechnol       Date:  2013-02-06       Impact factor: 3.346

7.  Scoring function to predict solubility mutagenesis.

Authors:  Ye Tian; Christopher Deutsch; Bala Krishnamoorthy
Journal:  Algorithms Mol Biol       Date:  2010-10-07       Impact factor: 1.405

8.  Discrimination of soluble and aggregation-prone proteins based on sequence information.

Authors:  Yaping Fang; Jianwen Fang
Journal:  Mol Biosyst       Date:  2013-02-25

9.  Genetic classification of severe early childhood caries by use of subtracted DNA fragments from Streptococcus mutans.

Authors:  Deepak Saxena; Page W Caufield; Yihong Li; Stuart Brown; Jinmei Song; Robert Norman
Journal:  J Clin Microbiol       Date:  2008-07-02       Impact factor: 5.948

10.  Prediction of amyloid fibril-forming segments based on a support vector machine.

Authors:  Jian Tian; Ningfeng Wu; Jun Guo; Yunliu Fan
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.