Literature DB >> 15033369

Prediction of functional sites in proteins using conserved functional group analysis.

C Axel Innis1, A Prem Anand, R Sowdhamini.   

Abstract

A detailed knowledge of a protein's functional site is an absolute prerequisite for understanding its mode of action at the molecular level. However, the rapid pace at which sequence and structural information is being accumulated for proteins greatly exceeds our ability to determine their biochemical roles experimentally. As a result, computational methods are required which allow for the efficient processing of the evolutionary information contained in this wealth of data, in particular that related to the nature and location of functionally important sites and residues. The method presented here, referred to as conserved functional group (CFG) analysis, relies on a simplified representation of the chemical groups found in amino acid side-chains to identify functional sites from a single protein structure and a number of its sequence homologues. We show that CFG analysis can fully or partially predict the location of functional sites in approximately 96% of the 470 cases tested and that, unlike other methods available, it is able to tolerate wide variations in sequence identity. In addition, we discuss its potential in a structural genomics context, where automation, scalability and efficiency are critical, and an increasing number of protein structures are determined with no prior knowledge of function. This is exemplified by our analysis of the hypothetical protein Ydde_Ecoli, whose structure was recently solved by members of the North East Structural Genomics consortium. Although the proposed active site for this protein needs to be validated experimentally, this example illustrates the scope of CFG analysis as a general tool for the identification of residues likely to play an important role in a protein's biochemical function. Thus, our method offers a convenient solution to rapidly and automatically process the vast amounts of data that are beginning to emerge from structural genomics projects.

Mesh:

Substances:

Year:  2004        PMID: 15033369     DOI: 10.1016/j.jmb.2004.01.053

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  23 in total

1.  Automated prediction of protein function and detection of functional sites from structure.

Authors:  Florencio Pazos; Michael J E Sternberg
Journal:  Proc Natl Acad Sci U S A       Date:  2004-09-29       Impact factor: 11.205

2.  Analysis and prediction of functionally important sites in proteins.

Authors:  Saikat Chakrabarti; Christopher J Lanczycki
Journal:  Protein Sci       Date:  2007-01       Impact factor: 6.725

3.  Enhanced performance in prediction of protein active sites with THEMATICS and support vector machines.

Authors:  Wenxu Tong; Ronald J Williams; Ying Wei; Leonel F Murga; Jaeju Ko; Mary Jo Ondrechen
Journal:  Protein Sci       Date:  2007-12-20       Impact factor: 6.725

4.  Sequence conservation in the prediction of catalytic sites.

Authors:  Yongchao Dou; Xingbo Geng; Hongyun Gao; Jialiang Yang; Xiaoqi Zheng; Jun Wang
Journal:  Protein J       Date:  2011-04       Impact factor: 2.371

5.  Networks of high mutual information define the structural proximity of catalytic sites: implications for catalytic residue identification.

Authors:  Cristina Marino Buslje; Elin Teppa; Tomas Di Doménico; José María Delfino; Morten Nielsen
Journal:  PLoS Comput Biol       Date:  2010-11-04       Impact factor: 4.475

6.  Identification and mutational analysis of Arabidopsis FLS2 leucine-rich repeat domain residues that contribute to flagellin perception.

Authors:  F Mark Dunning; Wenxian Sun; Kristin L Jansen; Laura Helft; Andrew F Bent
Journal:  Plant Cell       Date:  2007-10-12       Impact factor: 11.277

7.  Functional region prediction with a set of appropriate homologous sequences--an index for sequence selection by integrating structure and sequence information with spatial statistics.

Authors:  Wataru Nemoto; Hiroyuki Toh
Journal:  BMC Struct Biol       Date:  2012-05-29

8.  L1pred: a sequence-based prediction tool for catalytic residues in enzymes with the L1-logreg classifier.

Authors:  Yongchao Dou; Jun Wang; Jialiang Yang; Chi Zhang
Journal:  PLoS One       Date:  2012-04-27       Impact factor: 3.240

9.  LRR conservation mapping to predict functional sites within protein leucine-rich repeat domains.

Authors:  Laura Helft; Vignyan Reddy; Xiyang Chen; Teresa Koller; Luca Federici; Juan Fernández-Recio; Rishabh Gupta; Andrew Bent
Journal:  PLoS One       Date:  2011-07-18       Impact factor: 3.240

10.  Active site prediction using evolutionary and structural information.

Authors:  Sriram Sankararaman; Fei Sha; Jack F Kirsch; Michael I Jordan; Kimmen Sjölander
Journal:  Bioinformatics       Date:  2010-01-14       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.