Literature DB >> 17189479

Evaluation of features for catalytic residue prediction in novel folds.

Eunseog Youn1, Brandon Peters, Predrag Radivojac, Sean D Mooney.   

Abstract

Structural genomics projects are determining the three-dimensional structure of proteins without full characterization of their function. A critical part of the annotation process involves appropriate knowledge representation and prediction of functionally important residue environments. We have developed a method to extract features from sequence, sequence alignments, three-dimensional structure, and structural environment conservation, and used support vector machines to annotate homologous and nonhomologous residue positions based on a specific training set of residue functions. In order to evaluate this pipeline for automated protein annotation, we applied it to the challenging problem of prediction of catalytic residues in enzymes. We also ranked the features based on their ability to discriminate catalytic from noncatalytic residues. When applying our method to a well-annotated set of protein structures, we found that top-ranked features were a measure of sequence conservation, a measure of structural conservation, a degree of uniqueness of a residue's structural environment, solvent accessibility, and residue hydrophobicity. We also found that features based on structural conservation were complementary to those based on sequence conservation and that they were capable of increasing predictor performance. Using a family nonredundant version of the ASTRAL 40 v1.65 data set, we estimated that the true catalytic residues were correctly predicted in 57.0% of the cases, with a precision of 18.5%. When testing on proteins containing novel folds not used in training, the best features were highly correlated with the training on families, thus validating the approach to nonhomologous catalytic residue prediction in general. We then applied the method to 2781 coordinate files from the structural genomics target pipeline and identified both highly ranked and highly clustered groups of predicted catalytic residues.

Mesh:

Substances:

Year:  2006        PMID: 17189479      PMCID: PMC2203287          DOI: 10.1110/ps.062523907

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  30 in total

1.  A novel method of protein secondary structure prediction with high segment overlap measure: support vector machine approach.

Authors:  S Hua; Z Sun
Journal:  J Mol Biol       Date:  2001-04-27       Impact factor: 5.469

2.  Prediction of catalytic residues in enzymes based on known tertiary structure, stability profile, and sequence conservation.

Authors:  Motonori Ota; Kengo Kinoshita; Ken Nishikawa
Journal:  J Mol Biol       Date:  2003-04-11       Impact factor: 5.469

3.  Analysis of catalytic residues in enzyme active sites.

Authors:  Gail J Bartlett; Craig T Porter; Neera Borkakoti; Janet M Thornton
Journal:  J Mol Biol       Date:  2002-11-15       Impact factor: 5.469

4.  Predicted protein-protein interaction sites from local sequence information.

Authors:  Yanay Ofran; Burkhard Rost
Journal:  FEBS Lett       Date:  2003-06-05       Impact factor: 4.124

5.  Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions.

Authors:  E Krissinel; K Henrick
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2004-11-26

6.  Improved method for predicting beta-turn using support vector machine.

Authors:  Qidong Zhang; Sukjoon Yoon; William J Welsh
Journal:  Bioinformatics       Date:  2005-03-29       Impact factor: 6.937

7.  Characterizing the microenvironment surrounding protein sites.

Authors:  S C Bagley; R B Altman
Journal:  Protein Sci       Date:  1995-04       Impact factor: 6.725

8.  Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features.

Authors:  W Kabsch; C Sander
Journal:  Biopolymers       Date:  1983-12       Impact factor: 2.505

9.  Prediction and functional analysis of native disorder in proteins from the three kingdoms of life.

Authors:  J J Ward; J S Sodhi; L J McGuffin; B F Buxton; D T Jones
Journal:  J Mol Biol       Date:  2004-03-26       Impact factor: 5.469

10.  Prediction of catalytic residues using Support Vector Machine with selected protein sequence and structural properties.

Authors:  Natalia V Petrova; Cathy H Wu
Journal:  BMC Bioinformatics       Date:  2006-06-21       Impact factor: 3.169

View more
  35 in total

1.  Structure-based kernels for the prediction of catalytic residues and their involvement in human inherited disease.

Authors:  Fuxiao Xin; Steven Myers; Yong Fuga Li; David N Cooper; Sean D Mooney; Predrag Radivojac
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

2.  Structure-based identification of catalytic residues.

Authors:  Ran Yahalom; Dan Reshef; Ayana Wiener; Sagiv Frankel; Nir Kalisman; Boaz Lerner; Chen Keasar
Journal:  Proteins       Date:  2011-04-12

3.  Enhanced performance in prediction of protein active sites with THEMATICS and support vector machines.

Authors:  Wenxu Tong; Ronald J Williams; Ying Wei; Leonel F Murga; Jaeju Ko; Mary Jo Ondrechen
Journal:  Protein Sci       Date:  2007-12-20       Impact factor: 6.725

4.  LIBRUS: combined machine learning and homology information for sequence-based ligand-binding residue prediction.

Authors:  Chris Kauffman; George Karypis
Journal:  Bioinformatics       Date:  2009-09-28       Impact factor: 6.937

5.  Sequence conservation in the prediction of catalytic sites.

Authors:  Yongchao Dou; Xingbo Geng; Hongyun Gao; Jialiang Yang; Xiaoqi Zheng; Jun Wang
Journal:  Protein J       Date:  2011-04       Impact factor: 2.371

6.  In silico functional profiling of human disease-associated and polymorphic amino acid substitutions.

Authors:  Matthew Mort; Uday S Evani; Vidhya G Krishnan; Kishore K Kamati; Peter H Baenziger; Angshuman Bagchi; Brandon J Peters; Rakesh Sathyesh; Biao Li; Yanan Sun; Bin Xue; Nigam H Shah; Maricel G Kann; David N Cooper; Predrag Radivojac; Sean D Mooney
Journal:  Hum Mutat       Date:  2010-03       Impact factor: 4.878

7.  Predicting protein ligand binding sites by combining evolutionary sequence conservation and 3D structure.

Authors:  John A Capra; Roman A Laskowski; Janet M Thornton; Mona Singh; Thomas A Funkhouser
Journal:  PLoS Comput Biol       Date:  2009-12-04       Impact factor: 4.475

8.  INTREPID: a web server for prediction of functionally important residues by evolutionary analysis.

Authors:  Sriram Sankararaman; Bryan Kolaczkowski; Kimmen Sjölander
Journal:  Nucleic Acids Res       Date:  2009-05-13       Impact factor: 16.971

9.  Automatic prediction of catalytic residues by modeling residue structural neighborhood.

Authors:  Elisa Cilia; Andrea Passerini
Journal:  BMC Bioinformatics       Date:  2010-03-03       Impact factor: 3.169

10.  ResBoost: characterizing and predicting catalytic residues in enzymes.

Authors:  Ron Alterovitz; Aaron Arvey; Sriram Sankararaman; Carolina Dallett; Yoav Freund; Kimmen Sjölander
Journal:  BMC Bioinformatics       Date:  2009-06-27       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.