Literature DB >> 22354707

Target selection for structural genomics based on combining fold recognition and crystallisation prediction methods: application to the human proteome.

James E Bray1.   

Abstract

The objective of this study is to automatically identify regions of the human proteome that are suitable for 3D structure determination by X-ray crystallography and to annotate them according to their likelihood to produce diffraction quality crystals. The results provide a powerful tool for structural genomics laboratories who wish to select human proteins based on the statistical likelihood of crystallisation success. Combining fold recognition and crystallisation prediction algorithms enables the efficient calculation of the crystallisability of the entire human proteome. This novel study estimates that there are approximately 40,000 crystallisable regions in the human proteome. Currently, only 15% of these regions (approx. 6,000 sequences) have been solved to at least 95% sequence identity. The remaining unsolved regions have been categorised into 5 crystallisation classes and an integral membrane protein (IMP) class, based on established structure prediction, crystallisation prediction and transmembrane (TM) helix prediction algorithms. Approximately 750 unsolved regions (2% of the proteome) have been identified as having a PDB fold representative (template) and an 'optimal' likelihood of crystallisation. At the other end of the spectrum, more than 10,500 non-IMP regions with a PDB template are classified as 'very difficult' to crystallise (26%) and almost 2,500 regions (6%) were predicted to contain at least 3 TM helices. The 3D-SPECS (3D Structural Proteomics Explorer with Crystallisation Scores) website contains crystallisation predictions for the entire human proteome and can be found at http://www.bioinformaticsplus.org/3dspecs.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22354707     DOI: 10.1007/s10969-012-9130-x

Source DB:  PubMed          Journal:  J Struct Funct Genomics        ISSN: 1345-711X


  45 in total

1.  Protein secondary structure prediction based on position-specific scoring matrices.

Authors:  D T Jones
Journal:  J Mol Biol       Date:  1999-09-17       Impact factor: 5.469

2.  Structural genomics in North America.

Authors:  T C Terwilliger
Journal:  Nat Struct Biol       Date:  2000-11

3.  Targeting novel folds for structural genomics.

Authors:  Liam J McGuffin; David T Jones
Journal:  Proteins       Date:  2002-07-01

4.  pGenTHREADER and pDomTHREADER: new methods for improved protein fold recognition and superfamily discrimination.

Authors:  Anna Lobley; Michael I Sadowski; David T Jones
Journal:  Bioinformatics       Date:  2009-05-07       Impact factor: 6.937

5.  A hidden Markov model for predicting transmembrane helices in protein sequences.

Authors:  E L Sonnhammer; G von Heijne; A Krogh
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1998

Review 6.  High-throughput crystallography for structural genomics.

Authors:  Andrzej Joachimiak
Journal:  Curr Opin Struct Biol       Date:  2009-09-16       Impact factor: 6.809

7.  Prediction and functional analysis of native disorder in proteins from the three kingdoms of life.

Authors:  J J Ward; J S Sodhi; L J McGuffin; B F Buxton; D T Jones
Journal:  J Mol Biol       Date:  2004-03-26       Impact factor: 5.469

8.  XANNpred: neural nets that predict the propensity of a protein to yield diffraction-quality crystals.

Authors:  Ian M Overton; C A Johannes van Niekerk; Geoffrey J Barton
Journal:  Proteins       Date:  2011-01-18

9.  High-throughput production of human proteins for crystallization: the SGC experience.

Authors:  Pavel Savitsky; James Bray; Christopher D O Cooper; Brian D Marsden; Pravin Mahajan; Nicola A Burgess-Brown; Opher Gileadi
Journal:  J Struct Biol       Date:  2010-06-10       Impact factor: 2.867

10.  The Structural Biology Knowledgebase: a portal to protein structures, sequences, functions, and methods.

Authors:  Margaret J Gabanyi; Paul D Adams; Konstantin Arnold; Lorenza Bordoli; Lester G Carter; Judith Flippen-Andersen; Lida Gifford; Juergen Haas; Andrei Kouranov; William A McLaughlin; David I Micallef; Wladek Minor; Raship Shah; Torsten Schwede; Yi-Ping Tao; John D Westbrook; Matthew Zimmerman; Helen M Berman
Journal:  J Struct Funct Genomics       Date:  2011-04-07
View more
  4 in total

Review 1.  Critical evaluation of bioinformatics tools for the prediction of protein crystallization propensity.

Authors:  Huilin Wang; Liubin Feng; Geoffrey I Webb; Lukasz Kurgan; Jiangning Song; Donghai Lin
Journal:  Brief Bioinform       Date:  2018-09-28       Impact factor: 11.622

2.  Use of a structural alphabet to find compatible folds for amino acid sequences.

Authors:  Swapnil Mahajan; Alexandre G de Brevern; Yves-Henri Sanejouand; Narayanaswamy Srinivasan; Bernard Offmann
Journal:  Protein Sci       Date:  2014-10-25       Impact factor: 6.725

3.  TSTMP: target selection for structural genomics of human transmembrane proteins.

Authors:  Julia Varga; László Dobson; István Reményi; Gábor E Tusnády
Journal:  Nucleic Acids Res       Date:  2016-10-18       Impact factor: 16.971

4.  An estimated 5% of new protein structures solved today represent a new Pfam family.

Authors:  Jaina Mistry; Edda Kloppmann; Burkhard Rost; Marco Punta
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2013-10-12
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.