Literature DB >> 19086060

Computational chemistry study of 3D-structure-function relationships for enzymes based on Markov models for protein electrostatic, HINT, and van der Waals potentials.

Riccardo Concu1, Gianni Podda, Eugenio Uriarte, Humberto González-Díaz.   

Abstract

In a significant work, Dobson and Doig (J Mol Biol 2003, 330, 771) illustrated protein prediction as enzymatic or not from spatial structure without resorting to alignments. They used 52 protein features and a nonlinear support vector machine model to classify more than 1000 proteins collected from the PDB with a 77% overall accuracy. The most useful features were: the secondary-structure content, the amino acid frequencies, the number of disulphide bonds, and the largest cleft size. Working on the same dataset used by D&D, in this article we reported a good and simple model, based on the Markov chain models (MCM), to classify protein 3D structures as enzymatic or not, taking into consideration the spatial structure without resorting to alignments. Here we define, for the first time, a general MCM to calculate the electrostatic potential, molecular vibrations, van der Waals (vdw) interactions, and hydrophobic interactions (HINT) and use them in comparative studies of potential fields and/or protein function prediction. The dataset is composed of 1371 proteins divided into 689 enzymes and 682 nonenzymes, all proteins were collected from the PDB. The best model we found was a linear model carried out with the linear discriminant analysis; it was able to classify 74.18% of the proteins using only two electrostatic potentials. In the work described here, we define 3D-HINT potentials (mu(k)) and use them for the first time to derive a classifier for protein enzymes. We analyzed ROC curves, domain of applicability, parametric assumptions, desirability maps, and also tested other nonlinear artificial neural network models which did not improve the linear model. In closing, this MCM allows a fast calculation and comparison of different potentials deriving into accurate protein 3D structure-function relationships, notably simpler than the previous. (c) 2008 Wiley Periodicals, Inc.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19086060     DOI: 10.1002/jcc.21170

Source DB:  PubMed          Journal:  J Comput Chem        ISSN: 0192-8651            Impact factor:   3.376


  4 in total

1.  Prediction of ketoacyl synthase family using reduced amino acid alphabets.

Authors:  Wei Chen; Pengmian Feng; Hao Lin
Journal:  J Ind Microbiol Biotechnol       Date:  2011-10-26       Impact factor: 3.346

2.  Non-Alignment Features Based Enzyme/Non-Enzyme Classification Using an Ensemble Method.

Authors:  Nicholas J Davidson; Xueyi Wang
Journal:  Proc Int Conf Mach Learn Appl       Date:  2010-12-12

3.  Automatic single- and multi-label enzymatic function prediction by machine learning.

Authors:  Shervine Amidi; Afshine Amidi; Dimitrios Vlachakis; Nikos Paragios; Evangelia I Zacharaki
Journal:  PeerJ       Date:  2017-03-29       Impact factor: 2.984

4.  Graph Theory-Based Sequence Descriptors as Remote Homology Predictors.

Authors:  Guillermin Agüero-Chapin; Deborah Galpert; Reinaldo Molina-Ruiz; Evys Ancede-Gallardo; Gisselle Pérez-Machado; Gustavo A de la Riva; Agostinho Antunes
Journal:  Biomolecules       Date:  2019-12-23
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.