Literature DB >> 17887954

Predicting and annotating catalytic residues: an information theoretic approach.

Beckett Sterner1, Rohit Singh, Bonnie Berger.   

Abstract

We introduce a computational method to predict and annotate the catalytic residues of a protein using only its sequence information, so that we describe both the residues' sequence locations (prediction) and their specific biochemical roles in the catalyzed reaction (annotation). While knowing the chemistry of an enzyme's catalytic residues is essential to understanding its function, the challenges of prediction and annotation have remained difficult, especially when only the enzyme's sequence and no homologous structures are available. Our sequence-based approach follows the guiding principle that catalytic residues performing the same biochemical function should have similar chemical environments; it detects specific conservation patterns near in sequence to known catalytic residues and accordingly constrains what combination of amino acids can be present near a predicted catalytic residue. We associate with each catalytic residue a short sequence profile and define a Kullback-Leibler (KL) distance measure between these profiles, which, as we show, effectively captures even subtle biochemical variations. We apply the method to the class of glycohydrolase enzymes. This class includes proteins from 96 families with very different sequences and folds, many of which perform important functions. In a cross-validation test, our approach correctly predicts the location of the enzymes' catalytic residues with a sensitivity of 80% at a specificity of 99.4%, and in a separate cross-validation we also correctly annotate the biochemical role of 80% of the catalytic residues. Our results compare favorably to existing methods. Moreover, our method is more broadly applicable because it relies on sequence and not structure information; it may, furthermore, be used in conjunction with structure-based methods.

Mesh:

Substances:

Year:  2007        PMID: 17887954     DOI: 10.1089/cmb.2007.0042

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  11 in total

1.  Sequence conservation in the prediction of catalytic sites.

Authors:  Yongchao Dou; Xingbo Geng; Hongyun Gao; Jialiang Yang; Xiaoqi Zheng; Jun Wang
Journal:  Protein J       Date:  2011-04       Impact factor: 2.371

2.  Application of information theory to feature selection in protein docking.

Authors:  Olaf G Othersen; Arno G Stefani; Johannes B Huber; Heinrich Sticht
Journal:  J Mol Model       Date:  2011-07-12       Impact factor: 1.810

3.  A protein sequence meta-functional signature for calcium binding residue prediction.

Authors:  Jeremy A Horst; Ram Samudrala
Journal:  Pattern Recognit Lett       Date:  2010-10-15       Impact factor: 3.756

4.  Networks of high mutual information define the structural proximity of catalytic sites: implications for catalytic residue identification.

Authors:  Cristina Marino Buslje; Elin Teppa; Tomas Di Doménico; José María Delfino; Morten Nielsen
Journal:  PLoS Comput Biol       Date:  2010-11-04       Impact factor: 4.475

5.  L1pred: a sequence-based prediction tool for catalytic residues in enzymes with the L1-logreg classifier.

Authors:  Yongchao Dou; Jun Wang; Jialiang Yang; Chi Zhang
Journal:  PLoS One       Date:  2012-04-27       Impact factor: 3.240

6.  Novel feature for catalytic protein residues reflecting interactions with other residues.

Authors:  Yizhou Li; Gongbing Li; Zhining Wen; Hui Yin; Mei Hu; Jiamin Xiao; Menglong Li
Journal:  PLoS One       Date:  2011-03-29       Impact factor: 3.240

7.  Identification of family-determining residues in PHD fingers.

Authors:  Patrick Slama; Donald Geman
Journal:  Nucleic Acids Res       Date:  2010-11-08       Impact factor: 16.971

8.  On the structural context and identification of enzyme catalytic residues.

Authors:  Yu-Tung Chien; Shao-Wei Huang
Journal:  Biomed Res Int       Date:  2013-02-03       Impact factor: 3.411

9.  Accurate prediction of protein catalytic residues by side chain orientation and residue contact density.

Authors:  Yu-Tung Chien; Shao-Wei Huang
Journal:  PLoS One       Date:  2012-10-24       Impact factor: 3.240

10.  Fast dynamics perturbation analysis for prediction of protein functional sites.

Authors:  Dengming Ming; Judith D Cohn; Michael E Wall
Journal:  BMC Struct Biol       Date:  2008-01-30
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.