Literature DB >> 19154742

Predicting protein function and binding profile via matching of local evolutionary and geometric surface patterns.

Yan Yuan Tseng1, Joseph Dundas, Jie Liang.   

Abstract

Inferring protein functions from structures is a challenging task, as a large number of orphan protein structures from structural genomics project are now solved without their biochemical functions characterized. For proteins binding to similar substrates or ligands and carrying out similar functions, their binding surfaces are under similar physicochemical constraints, and hence the sets of allowed and forbidden residue substitutions are similar. However, it is difficult to isolate such selection pressure due to protein function from selection pressure due to protein folding, and evolutionary relationship reflected by global sequence and structure similarities between proteins is often unreliable for inferring protein function. We have developed a method, called pevoSOAR (pocket-based evolutionary search of amino acid residues), for predicting protein functions by solving the problem of uncovering amino acids residue substitution pattern due to protein function and separating it from amino acids substitution pattern due to protein folding. We incorporate evolutionary information specific to an individual binding region and match local surfaces on a large scale with millions of precomputed protein surfaces to identify those with similar functions. Our pevoSOAR method also generates a probablistic model called the computed binding a profile that characterizes protein-binding activities that may involve multiple substrates or ligands. We show that our method can be used to predict enzyme functions with accuracy. Our method can also assess enzyme binding specificity and promiscuity. In an objective large-scale test of 100 enzyme families with thousands of structures, our predictions are found to be sensitive and specific: At the stringent specificity level of 99.98%, we can correctly predict enzyme functions for 80.55% of the proteins. The overall area under the receiver operating characteristic curve measuring the performance of our prediction is 0.955, close to the perfect value of 1.00. The best Matthews coefficient is 86.6%. Our method also works well in predicting the biochemical functions of orphan proteins from structural genomics projects.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19154742      PMCID: PMC2670802          DOI: 10.1016/j.jmb.2008.12.072

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  58 in total

1.  Evolution of function in protein superfamilies, from a structural perspective.

Authors:  A E Todd; C A Orengo; J M Thornton
Journal:  J Mol Biol       Date:  2001-04-06       Impact factor: 5.469

2.  Selective constraints, amino acid composition, and the rate of protein evolution.

Authors:  N J Tourasse; W H Li
Journal:  Mol Biol Evol       Date:  2000-04       Impact factor: 16.240

3.  Analysis and prediction of functional sub-types from protein sequence alignments.

Authors:  S S Hannenhalli; R B Russell
Journal:  J Mol Biol       Date:  2000-10-13       Impact factor: 5.469

4.  Are proteins well-packed?

Authors:  J Liang; K A Dill
Journal:  Biophys J       Date:  2001-08       Impact factor: 4.033

Review 5.  From genome to function.

Authors:  J M Thornton
Journal:  Science       Date:  2001-06-15       Impact factor: 47.728

6.  A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach.

Authors:  S Whelan; N Goldman
Journal:  Mol Biol Evol       Date:  2001-05       Impact factor: 16.240

7.  Enzyme function less conserved than anticipated.

Authors:  Burkhard Rost
Journal:  J Mol Biol       Date:  2002-04-26       Impact factor: 5.469

8.  ConSurf: identification of functional regions in proteins by surface-mapping of phylogenetic information.

Authors:  Fabian Glaser; Tal Pupko; Inbal Paz; Rachel E Bell; Dalit Bechor-Shental; Eric Martz; Nir Ben-Tal
Journal:  Bioinformatics       Date:  2003-01       Impact factor: 6.937

9.  Predicting enzyme functional surfaces and locating key residues automatically from structures.

Authors:  Yan Yuan Tseng; Jie Liang
Journal:  Ann Biomed Eng       Date:  2007-02-09       Impact factor: 3.934

10.  The cyclization mechanism of cyclodextrin glycosyltransferase (CGTase) as revealed by a gamma-cyclodextrin-CGTase complex at 1.8-A resolution.

Authors:  J C Uitdehaag; K H Kalk; B A van Der Veen; L Dijkhuizen; B W Dijkstra
Journal:  J Biol Chem       Date:  1999-12-03       Impact factor: 5.157

View more
  33 in total

1.  Real-time ligand binding pocket database search using local surface descriptors.

Authors:  Rayan Chikhi; Lee Sael; Daisuke Kihara
Journal:  Proteins       Date:  2010-07

2.  Classification of protein functional surfaces using structural characteristics.

Authors:  Yan Yuan Tseng; Wen-Hsiung Li
Journal:  Proc Natl Acad Sci U S A       Date:  2012-01-11       Impact factor: 11.205

3.  Prediction and experimental validation of enzyme substrate specificity in protein structures.

Authors:  Shivas R Amin; Serkan Erdin; R Matthew Ward; Rhonald C Lua; Olivier Lichtarge
Journal:  Proc Natl Acad Sci U S A       Date:  2013-10-21       Impact factor: 11.205

4.  Evolutionary approach to predicting the binding site residues of a protein from its primary sequence.

Authors:  Yan Yuan Tseng; Wen-Hsiung Li
Journal:  Proc Natl Acad Sci U S A       Date:  2011-03-14       Impact factor: 11.205

5.  Geometry of protein shape and its evolutionary pattern for function prediction and characterization.

Authors:  Jie Liang
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2009

6.  Analysis of substructural variation in families of enzymatic proteins with applications to protein function prediction.

Authors:  Drew H Bryant; Mark Moll; Brian Y Chen; Viacheslav Y Fofanov; Lydia E Kavraki
Journal:  BMC Bioinformatics       Date:  2010-05-11       Impact factor: 3.169

7.  SMAP-WS: a parallel web service for structural proteome-wide ligand-binding site comparison.

Authors:  Jingyuan Ren; Lei Xie; Wilfred W Li; Philip E Bourne
Journal:  Nucleic Acids Res       Date:  2010-05-19       Impact factor: 16.971

8.  The LabelHash algorithm for substructure matching.

Authors:  Mark Moll; Drew H Bryant; Lydia E Kavraki
Journal:  BMC Bioinformatics       Date:  2010-11-11       Impact factor: 3.169

9.  SplitPocket: identification of protein functional surfaces and characterization of their spatial patterns.

Authors:  Yan Yuan Tseng; Craig Dupree; Z Jeffrey Chen; Wen-Hsiung Li
Journal:  Nucleic Acids Res       Date:  2009-04-30       Impact factor: 16.971

10.  Protein-protein docking using region-based 3D Zernike descriptors.

Authors:  Vishwesh Venkatraman; Yifeng D Yang; Lee Sael; Daisuke Kihara
Journal:  BMC Bioinformatics       Date:  2009-12-09       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.