Literature DB >> 28334201

Critical evaluation of bioinformatics tools for the prediction of protein crystallization propensity.

Huilin Wang1, Liubin Feng2, Geoffrey I Webb3, Lukasz Kurgan4, Jiangning Song5, Donghai Lin1.   

Abstract

X-ray crystallography is the main tool for structural determination of proteins. Yet, the underlying crystallization process is costly, has a high attrition rate and involves a series of trial-and-error attempts to obtain diffraction-quality crystals. The Structural Genomics Consortium aims to systematically solve representative structures of major protein-fold classes using primarily high-throughput X-ray crystallography. The attrition rate of these efforts can be improved by selection of proteins that are potentially easier to be crystallized. In this context, bioinformatics approaches have been developed to predict crystallization propensities based on protein sequences. These approaches are used to facilitate prioritization of the most promising target proteins, search for alternative structural orthologues of the target proteins and suggest designs of constructs capable of potentially enhancing the likelihood of successful crystallization. We reviewed and compared nine predictors of protein crystallization propensity. Moreover, we demonstrated that integrating selected outputs from multiple predictors as candidate input features to build the predictive model results in a significantly higher predictive performance when compared to using these predictors individually. Furthermore, we also introduced a new and accurate predictor of protein crystallization propensity, Crysf, which uses functional features extracted from UniProt as inputs. This comprehensive review will assist structural biologists in selecting the most appropriate predictor, and is also beneficial for bioinformaticians to develop a new generation of predictive algorithms.

Mesh:

Substances:

Year:  2018        PMID: 28334201      PMCID: PMC6171492          DOI: 10.1093/bib/bbx018

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  68 in total

1.  The PSIPRED protein structure prediction server.

Authors:  L J McGuffin; K Bryson; D T Jones
Journal:  Bioinformatics       Date:  2000-04       Impact factor: 6.937

2.  Structural genomics in North America.

Authors:  T C Terwilliger
Journal:  Nat Struct Biol       Date:  2000-11

3.  SPINE: an integrated tracking database and data mining approach for identifying feasible targets in high-throughput structural proteomics.

Authors:  P Bertone; Y Kluger; N Lan; D Zheng; D Christendat; A Yee; A M Edwards; C H Arrowsmith; G T Montelione; M Gerstein
Journal:  Nucleic Acids Res       Date:  2001-07-01       Impact factor: 16.971

4.  Structural biology. Structural genomics, round 2.

Authors:  Robert Service
Journal:  Science       Date:  2005-03-11       Impact factor: 47.728

5.  Toward rational protein crystallization: A Web server for the design of crystallizable protein variants.

Authors:  Lukasz Goldschmidt; David R Cooper; Zygmunt S Derewenda; David Eisenberg
Journal:  Protein Sci       Date:  2007-08       Impact factor: 6.725

Review 6.  Lessons from structural genomics.

Authors:  Thomas C Terwilliger; David Stuart; Shigeyuki Yokoyama
Journal:  Annu Rev Biophys       Date:  2009       Impact factor: 12.981

7.  Update of PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence.

Authors:  H B Rao; F Zhu; G B Yang; Z R Li; Y Z Chen
Journal:  Nucleic Acids Res       Date:  2011-05-23       Impact factor: 16.971

8.  SCMCRYS: predicting protein crystallization using an ensemble scoring card method with estimating propensity scores of P-collocated amino acid pairs.

Authors:  Phasit Charoenkwan; Watshara Shoombuatong; Hua-Chin Lee; Jeerayut Chaijaruwanich; Hui-Ling Huang; Shinn-Ying Ho
Journal:  PLoS One       Date:  2013-09-03       Impact factor: 3.240

9.  CD-HIT: accelerated for clustering the next-generation sequencing data.

Authors:  Limin Fu; Beifang Niu; Zhengwei Zhu; Sitao Wu; Weizhong Li
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

10.  The SWISS-MODEL Repository and associated resources.

Authors:  Florian Kiefer; Konstantin Arnold; Michael Künzli; Lorenza Bordoli; Torsten Schwede
Journal:  Nucleic Acids Res       Date:  2008-10-18       Impact factor: 16.971

View more
  3 in total

1.  Sequence-Based Prediction of Transmembrane Protein Crystallization Propensity.

Authors:  Qizhi Zhu; Lihua Wang; Ruyu Dai; Wei Zhang; Wending Tang; Yannan Bin; Zeliang Wang; Junfeng Xia
Journal:  Interdiscip Sci       Date:  2021-06-18       Impact factor: 2.233

2.  TMCrys: predict propensity of success for transmembrane protein crystallization.

Authors:  Julia K Varga; Gábor E Tusnády
Journal:  Bioinformatics       Date:  2018-09-15       Impact factor: 6.937

3.  The Dundee Resource for Sequence Analysis and Structure Prediction.

Authors:  Stuart A MacGowan; Fábio Madeira; Thiago Britto-Borges; Mateusz Warowny; Alexey Drozdetskiy; James B Procter; Geoffrey J Barton
Journal:  Protein Sci       Date:  2019-11-28       Impact factor: 6.725

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.