Literature DB >> 18096640

Enhanced performance in prediction of protein active sites with THEMATICS and support vector machines.

Wenxu Tong1, Ronald J Williams, Ying Wei, Leonel F Murga, Jaeju Ko, Mary Jo Ondrechen.   

Abstract

Theoretical microscopic titration curves (THEMATICS) is a computational method for the identification of active sites in proteins through deviations in computed titration behavior of ionizable residues. While the sensitivity to catalytic sites is high, the previously reported sensitivity to catalytic residues was not as high, about 50%. Here THEMATICS is combined with support vector machines (SVM) to improve sensitivity for catalytic residue prediction from protein 3D structure alone. For a test set of 64 proteins taken from the Catalytic Site Atlas (CSA), the average recall rate for annotated catalytic residues is 61%; good precision is maintained selecting only 4% of all residues. The average false positive rate, using the CSA annotations is only 3.2%, far lower than other 3D-structure-based methods. THEMATICS-SVM returns higher precision, lower false positive rate, and better overall performance, compared with other 3D-structure-based methods. Comparison is also made with the latest machine learning methods that are based on both sequence alignments and 3D structures. For annotated sets of well-characterized enzymes, THEMATICS-SVM performance compares very favorably with methods that utilize sequence homology. However, since THEMATICS depends only on the 3D structure of the query protein, no decline in performance is expected when applied to novel folds, proteins with few sequence homologues, or even orphan sequences. An extension of the method to predict non-ionizable catalytic residues is also presented. THEMATICS-SVM predicts a local network of ionizable residues with strong interactions between protonation events; this appears to be a special feature of enzyme active sites.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 18096640      PMCID: PMC2222725          DOI: 10.1110/ps.073213608

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  30 in total

1.  Three-dimensional cluster analysis identifies interfaces and functional residue clusters in proteins.

Authors:  R Landgraf; I Xenarios; D Eisenberg
Journal:  J Mol Biol       Date:  2001-04-13       Impact factor: 5.469

2.  Assessing annotation transfer for genomics: quantifying the relations between protein sequence, structure and function through traditional and probabilistic scores.

Authors:  C A Wilson; J Kreychman; M Gerstein
Journal:  J Mol Biol       Date:  2000-03-17       Impact factor: 5.469

3.  Automated structure-based prediction of functional sites in proteins: applications to assessing the validity of inheriting protein function from homology in genome annotation and to protein docking.

Authors:  P Aloy; E Querol; F X Aviles; M J Sternberg
Journal:  J Mol Biol       Date:  2001-08-10       Impact factor: 5.469

4.  Four-body potentials reveal protein-specific correlations to stability changes caused by hydrophobic core mutations.

Authors:  C W Carter; B C LeFebvre; S A Cammer; A Tropsha; M H Edgell
Journal:  J Mol Biol       Date:  2001-08-24       Impact factor: 5.469

5.  Prediction of functionally important residues based solely on the computed energetics of protein structure.

Authors:  A H Elcock
Journal:  J Mol Biol       Date:  2001-09-28       Impact factor: 5.469

6.  Prediction of catalytic residues in enzymes based on known tertiary structure, stability profile, and sequence conservation.

Authors:  Motonori Ota; Kengo Kinoshita; Ken Nishikawa
Journal:  J Mol Biol       Date:  2003-04-11       Impact factor: 5.469

7.  Analysis of catalytic residues in enzyme active sites.

Authors:  Gail J Bartlett; Craig T Porter; Neera Borkakoti; Janet M Thornton
Journal:  J Mol Biol       Date:  2002-11-15       Impact factor: 5.469

8.  Future directions in protein function prediction.

Authors:  Ihsan A Shehadi; Huyuan Yang; Mary Jo Ondrechen
Journal:  Mol Biol Rep       Date:  2002-12       Impact factor: 2.316

9.  THEMATICS: a simple computational predictor of enzyme function from structure.

Authors:  M J Ondrechen; J G Clifton; D Ringe
Journal:  Proc Natl Acad Sci U S A       Date:  2001-10-16       Impact factor: 11.205

10.  Structure-based activity prediction for an enzyme of unknown function.

Authors:  Johannes C Hermann; Ricardo Marti-Arbona; Alexander A Fedorov; Elena Fedorov; Steven C Almo; Brian K Shoichet; Frank M Raushel
Journal:  Nature       Date:  2007-07-01       Impact factor: 49.962

View more
  14 in total

1.  Structure-based kernels for the prediction of catalytic residues and their involvement in human inherited disease.

Authors:  Fuxiao Xin; Steven Myers; Yong Fuga Li; David N Cooper; Sean D Mooney; Predrag Radivojac
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

2.  Structure-based identification of catalytic residues.

Authors:  Ran Yahalom; Dan Reshef; Ayana Wiener; Sagiv Frankel; Nir Kalisman; Boaz Lerner; Chen Keasar
Journal:  Proteins       Date:  2011-04-12

3.  iCataly-PseAAC: Identification of Enzymes Catalytic Sites Using Sequence Evolution Information with Grey Model GM (2,1).

Authors:  Xuan Xiao; Meng-Juan Hui; Zi Liu; Wang-Ren Qiu
Journal:  J Membr Biol       Date:  2015-06-16       Impact factor: 1.843

Review 4.  Tailoring Proteins to Re-Evolve Nature: A Short Review.

Authors:  Angelica Jimenez-Rosales; Miriam V Flores-Merino
Journal:  Mol Biotechnol       Date:  2018-12       Impact factor: 2.695

Review 5.  A survey of computational intelligence techniques in protein function prediction.

Authors:  Arvind Kumar Tiwari; Rajeev Srivastava
Journal:  Int J Proteomics       Date:  2014-12-11

6.  SitesIdentify: a protein functional site prediction tool.

Authors:  Tracey Bray; Pedro Chan; Salim Bougouffa; Richard Greaves; Andrew J Doig; Jim Warwicker
Journal:  BMC Bioinformatics       Date:  2009-11-18       Impact factor: 3.169

7.  ResBoost: characterizing and predicting catalytic residues in enzymes.

Authors:  Ron Alterovitz; Aaron Arvey; Sriram Sankararaman; Carolina Dallett; Yoav Freund; Kimmen Sjölander
Journal:  BMC Bioinformatics       Date:  2009-06-27       Impact factor: 3.169

8.  Active site prediction using evolutionary and structural information.

Authors:  Sriram Sankararaman; Fei Sha; Jack F Kirsch; Michael I Jordan; Kimmen Sjölander
Journal:  Bioinformatics       Date:  2010-01-14       Impact factor: 6.937

9.  Partial order optimum likelihood (POOL): maximum likelihood prediction of protein active site residues using 3D Structure and sequence properties.

Authors:  Wenxu Tong; Ying Wei; Leonel F Murga; Mary Jo Ondrechen; Ronald J Williams
Journal:  PLoS Comput Biol       Date:  2009-01-16       Impact factor: 4.475

10.  Cutoff lensing: predicting catalytic sites in enzymes.

Authors:  Simon Aubailly; Francesco Piazza
Journal:  Sci Rep       Date:  2015-10-08       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.