Literature DB >> 18710875

Accurate sequence-based prediction of catalytic residues.

Tuo Zhang1, Hua Zhang, Ke Chen, Shiyi Shen, Jishou Ruan, Lukasz Kurgan.   

Abstract

MOTIVATION: Prediction of catalytic residues provides useful information for the research on function of enzymes. Most of the existing prediction methods are based on structural information, which limits their use. We propose a sequence-based catalytic residue predictor that provides predictions with quality comparable to modern structure-based methods and that exceeds quality of state-of-the-art sequence-based methods.
RESULTS: Our method (CRpred) uses sequence-based features and the sequence-derived PSI-BLAST profile. We used feature selection to reduce the dimensionality of the input (and explain the input) to support vector machine (SVM) classifier that provides predictions. Tests on eight datasets and side-by-side comparison with six modern structure- and sequence-based predictors show that CRpred provides predictions with quality comparable to current structure-based methods and better than sequence-based methods. The proposed method obtains 15-19% precision and 48-58% TP (true positive) rate, depending on the dataset used. CRpred also provides confidence values that allow selecting a subset of predictions with higher precision. The improved quality is due to newly designed features and careful parameterization of the SVM. The features incorporate amino acids characterized by the highest and the lowest propensities to constitute catalytic residues, Gly that provides flexibility for catalytic sites and sequence motifs characteristic to certain catalytic reactions. Our features indicate that catalytic residues are on average more conserved when compared with the general population of residues and that highly conserved amino acids characterized by high catalytic propensity are likely to form catalytic sites. We also show that local (with respect to the sequence) hydrophobicity contributes towards the prediction.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18710875     DOI: 10.1093/bioinformatics/btn433

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  36 in total

1.  Sequence conservation in the prediction of catalytic sites.

Authors:  Yongchao Dou; Xingbo Geng; Hongyun Gao; Jialiang Yang; Xiaoqi Zheng; Jun Wang
Journal:  Protein J       Date:  2011-04       Impact factor: 2.371

2.  iCataly-PseAAC: Identification of Enzymes Catalytic Sites Using Sequence Evolution Information with Grey Model GM (2,1).

Authors:  Xuan Xiao; Meng-Juan Hui; Zi Liu; Wang-Ren Qiu
Journal:  J Membr Biol       Date:  2015-06-16       Impact factor: 1.843

3.  The choice of sequence homologs included in multiple sequence alignments has a dramatic impact on evolutionary conservation analysis.

Authors:  Nelson Gil; Andras Fiser
Journal:  Bioinformatics       Date:  2019-01-01       Impact factor: 6.937

4.  Catalytic residues in hydrolases: analysis of methods designed for ligand-binding site prediction.

Authors:  Katarzyna Prymula; Tomasz Jadczyk; Irena Roterman
Journal:  J Comput Aided Mol Des       Date:  2010-11-21       Impact factor: 3.686

5.  Automatic prediction of catalytic residues by modeling residue structural neighborhood.

Authors:  Elisa Cilia; Andrea Passerini
Journal:  BMC Bioinformatics       Date:  2010-03-03       Impact factor: 3.169

6.  Networks of high mutual information define the structural proximity of catalytic sites: implications for catalytic residue identification.

Authors:  Cristina Marino Buslje; Elin Teppa; Tomas Di Doménico; José María Delfino; Morten Nielsen
Journal:  PLoS Comput Biol       Date:  2010-11-04       Impact factor: 4.475

7.  A series of Notch3 mutations in CADASIL; insights from 3D molecular modelling and evolutionary analyses.

Authors:  Dimitrios Vlachakis; Spyridon Champeris Tsaniras; Katerina Ioannidou; Louis Papageorgiou; Marc Baumann; Sophia Kossida
Journal:  J Mol Biochem       Date:  2014

8.  SitesIdentify: a protein functional site prediction tool.

Authors:  Tracey Bray; Pedro Chan; Salim Bougouffa; Richard Greaves; Andrew J Doig; Jim Warwicker
Journal:  BMC Bioinformatics       Date:  2009-11-18       Impact factor: 3.169

9.  Prediction of protein binding sites in protein structures using hidden Markov support vector machine.

Authors:  Bin Liu; Xiaolong Wang; Lei Lin; Buzhou Tang; Qiwen Dong; Xuan Wang
Journal:  BMC Bioinformatics       Date:  2009-11-20       Impact factor: 3.169

10.  Integrative approaches to the prediction of protein functions based on the feature selection.

Authors:  Seokha Ko; Hyunju Lee
Journal:  BMC Bioinformatics       Date:  2009-12-31       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.