
Predicting protein crystallization propensity from protein sequence.

György Babnigg, Andrzej Joachimiak.

Abstract

The high-throughput structure determination pipelines developed by structural genomics programs offer a unique opportunity for data mining. One important question is how protein properties derived from the primary sequence correlate with a protein's propensity to yield X-ray quality crystals (crystallizability) and 3D X-ray structures. A set of protein properties was computed for over 1,300 proteins that expressed well but were insoluble, and for approximately 720 unique proteins that resulted in X-ray structures. The correlation of a protein's isoelectric point and grand average of hydropathy (GRAVY) with crystallizability was analyzed for full-length and domain constructs of protein targets. In a second step, several additional properties that can be calculated from the protein sequence were added and evaluated. Using statistical analyses, we identified a set of attributes that correlate with a protein's propensity to crystallize and implemented a Support Vector Machine (SVM) classifier based on them. We have also created applications that analyze query sequences, provide optimal boundary information for them, and visualize the data. These tools are available on the web at http://bioinformatics.anl.gov/cgi-bin/tools/pdpredictor .
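The two sequence-derived properties named in the abstract, GRAVY and the isoelectric point, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The hydropathy values are the standard Kyte-Doolittle scale, which is the usual basis for GRAVY. The pKa values and the function names (`gravy`, `net_charge`, `isoelectric_point`) are assumptions chosen for illustration; published pKa sets differ slightly, so the estimated isoelectric point may differ from other tools by a few tenths of a pH unit.

```python
# Kyte-Doolittle hydropathy scale (standard values used for GRAVY).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Illustrative pKa set (an assumption; other published sets differ slightly).
PKA_POS = {"K": 10.8, "R": 12.5, "H": 6.5}
PKA_NEG = {"D": 3.9, "E": 4.1, "C": 8.5, "Y": 10.1}
PKA_NTERM, PKA_CTERM = 8.6, 3.6


def gravy(seq: str) -> float:
    """Grand average of hydropathy: mean Kyte-Doolittle value per residue."""
    seq = seq.upper()
    return sum(KD[aa] for aa in seq) / len(seq)


def net_charge(seq: str, ph: float) -> float:
    """Net charge at a given pH from the Henderson-Hasselbalch equation."""
    seq = seq.upper()
    positive = 1.0 / (1.0 + 10 ** (ph - PKA_NTERM))   # N-terminal amine
    negative = 1.0 / (1.0 + 10 ** (PKA_CTERM - ph))   # C-terminal carboxyl
    for aa in seq:
        if aa in PKA_POS:
            positive += 1.0 / (1.0 + 10 ** (ph - PKA_POS[aa]))
        elif aa in PKA_NEG:
            negative += 1.0 / (1.0 + 10 ** (PKA_NEG[aa] - ph))
    return positive - negative


def isoelectric_point(seq: str, tol: float = 1e-4) -> float:
    """Estimate the pI by bisection: the pH at which the net charge is zero."""
    lo, hi = 0.0, 14.0
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        # Net charge decreases monotonically with pH, so bisection applies.
        if net_charge(seq, mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


if __name__ == "__main__":
    example = "MKTAYIAKQRQISFVKSHFSRQ"  # arbitrary example sequence
    print(f"GRAVY = {gravy(example):.3f}")
    print(f"pI    = {isoelectric_point(example):.2f}")
```

Values such as these could serve as two of the input features for a classifier; the additional attributes and the SVM described in the abstract are not reproduced here.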

Year:  2010        PMID: 20177794      PMCID: PMC3366497          DOI: 10.1007/s10969-010-9080-0

Source DB:  PubMed          Journal:  J Struct Funct Genomics        ISSN: 1345-711X


