Literature DB >> 12589769

Automatic methods for predicting functionally important residues.

Antonio del Sol1, Antonio del Sol Mesa, Florencio Pazos, Alfonso Valencia.   

Abstract

Sequence analysis is often the first guide for the prediction of residues in a protein family that may have functional significance. A few methods have been proposed which use the division of protein families into subfamilies in the search for those positions that could have some functional significance for the whole family, but at the same time which exhibit the specificity of each subfamily ("Tree-determinant residues"). However, there are still many unsolved questions like the best division of a protein family into subfamilies, or the accurate detection of sequence variation patterns characteristic of different subfamilies. Here we present a systematic study in a significant number of protein families, testing the statistical meaning of the Tree-determinant residues predicted by three different methods that represent the range of available approaches. The first method takes as a starting point a phylogenetic representation of a protein family and, following the principle of Relative Entropy from Information Theory, automatically searches for the optimal division of the family into subfamilies. The second method looks for positions whose mutational behavior is reminiscent of the mutational behavior of the full-length proteins, by directly comparing the corresponding distance matrices. The third method is an automation of the analysis of distribution of sequences and amino acid positions in the corresponding multidimensional spaces using a vector-based principal component analysis. These three methods have been tested on two non-redundant lists of protein families: one composed by proteins that bind a variety of ligand groups, and the other composed by proteins with annotated functionally relevant sites. In most cases, the residues predicted by the three methods show a clear tendency to be close to bound ligands of biological relevance and to those amino acids described as participants in key aspects of protein function. These three automatic methods provide a wide range of possibilities for biologists to analyze their families of interest, in a similar way to the one presented here for the family of proteins related with ras-p21.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12589769     DOI: 10.1016/s0022-2836(02)01451-1

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  82 in total

1.  Are protein-protein interfaces more conserved in sequence than the rest of the protein surface?

Authors:  Daniel R Caffrey; Shyamal Somaroo; Jason D Hughes; Julian Mintseris; Enoch S Huang
Journal:  Protein Sci       Date:  2004-01       Impact factor: 6.725

2.  PANDORA: keyword-based analysis of protein sets by integration of annotation sources.

Authors:  Noam Kaplan; Avishay Vaaknin; Michal Linial
Journal:  Nucleic Acids Res       Date:  2003-10-01       Impact factor: 16.971

3.  Prediction of functional sites by analysis of sequence and structure conservation.

Authors:  Anna R Panchenko; Fyodor Kondrashov; Stephen Bryant
Journal:  Protein Sci       Date:  2004-03-09       Impact factor: 6.725

4.  FamClash: a method for ranking the activity of engineered enzymes.

Authors:  Manish C Saraf; Alexander R Horswill; Stephen J Benkovic; Costas D Maranas
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-23       Impact factor: 11.205

5.  Automated prediction of protein function and detection of functional sites from structure.

Authors:  Florencio Pazos; Michael J E Sternberg
Journal:  Proc Natl Acad Sci U S A       Date:  2004-09-29       Impact factor: 11.205

6.  Predicting functional residues of the Solanum lycopersicum aspartic protease inhibitor (SLAPI) by combining sequence and structural analysis with molecular docking.

Authors:  Yasel Guerra; Pedro A Valiente; Colin Berry; Tirso Pons
Journal:  J Mol Model       Date:  2011-11-20       Impact factor: 1.810

7.  Sequence and structure continuity of evolutionary importance improves protein functional site discovery and annotation.

Authors:  A D Wilkins; R Lua; S Erdin; R M Ward; O Lichtarge
Journal:  Protein Sci       Date:  2010-07       Impact factor: 6.725

8.  EFICAz: a comprehensive approach for accurate genome-scale enzyme function inference.

Authors:  Weidong Tian; Adrian K Arakaki; Jeffrey Skolnick
Journal:  Nucleic Acids Res       Date:  2004-12-01       Impact factor: 16.971

9.  Rod and cone opsin families differ in spectral tuning domains but not signal transducing domains as judged by saturated evolutionary trace analysis.

Authors:  Karen L Carleton; Tyrone C Spady; Rick H Cote
Journal:  J Mol Evol       Date:  2005-06-16       Impact factor: 2.395

10.  Residue centrality, functionally important residues, and active site shape: analysis of enzyme and non-enzyme families.

Authors:  Antonio del Sol; Hirotomo Fujihashi; Dolors Amoros; Ruth Nussinov
Journal:  Protein Sci       Date:  2006-08-01       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.