Literature DB >> 11021970

Analysis and prediction of functional sub-types from protein sequence alignments.

S S Hannenhalli1, R B Russell.   

Abstract

The increasing number and diversity of protein sequence families requires new methods to define and predict details regarding function. Here, we present a method for analysis and prediction of functional sub-types from multiple protein sequence alignments. Given an alignment and set of proteins grouped into sub-types according to some definition of function, such as enzymatic specificity, the method identifies positions that are indicative of functional differences by comparison of sub-type specific sequence profiles, and analysis of positional entropy in the alignment. Alignment positions with significantly high positional relative entropy correlate with those known to be involved in defining sub-types for nucleotidyl cyclases, protein kinases, lactate/malate dehydrogenases and trypsin-like serine proteases. We highlight new positions for these proteins that suggest additional experiments to elucidate the basis of specificity. The method is also able to predict sub-type for unclassified sequences. We assess several variations on a prediction method, and compare them to simple sequence comparisons. For assessment, we remove close homologues to the sequence for which a prediction is to be made (by a sequence identity above a threshold). This simulates situations where a protein is known to belong to a protein family, but is not a close relative of another protein of known sub-type. Considering the four families above, and a sequence identity threshold of 30 %, our best method gives an accuracy of 96 % compared to 80 % obtained for sequence similarity and 74 % for BLAST. We describe the derivation of a set of sub-type groupings derived from an automated parsing of alignments from PFAM and the SWISSPROT database, and use this to perform a large-scale assessment. The best method gives an average accuracy of 94 % compared to 68 % for sequence similarity and 79 % for BLAST. We discuss implications for experimental design, genome annotation and the prediction of protein function and protein intra-residue distances. Copyright 2000 Academic Press.

Entities:  

Mesh:

Substances:

Year:  2000        PMID: 11021970     DOI: 10.1006/jmbi.2000.4036

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  103 in total

1.  Interrogating protein interaction networks through structural biology.

Authors:  Patrick Aloy; Robert B Russell
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-23       Impact factor: 11.205

2.  Are protein-protein interfaces more conserved in sequence than the rest of the protein surface?

Authors:  Daniel R Caffrey; Shyamal Somaroo; Jason D Hughes; Julian Mintseris; Enoch S Huang
Journal:  Protein Sci       Date:  2004-01       Impact factor: 6.725

3.  Automated selection of positions determining functional specificity of proteins by comparative analysis of orthologous groups in protein families.

Authors:  Olga V Kalinina; Andrey A Mironov; Mikhail S Gelfand; Aleksandra B Rakhmaninova
Journal:  Protein Sci       Date:  2004-02       Impact factor: 6.725

4.  Determining the basis of channel-tetramerization specificity by x-ray crystallography and a sequence-comparison algorithm: Family Values (FamVal).

Authors:  Max H Nanao; Wei Zhou; Paul J Pfaffinger; Senyon Choe
Journal:  Proc Natl Acad Sci U S A       Date:  2003-06-30       Impact factor: 11.205

5.  PANTHER: a library of protein families and subfamilies indexed by function.

Authors:  Paul D Thomas; Michael J Campbell; Anish Kejariwal; Huaiyu Mi; Brian Karlak; Robin Daverman; Karen Diemer; Anushya Muruganujan; Apurva Narechania
Journal:  Genome Res       Date:  2003-09       Impact factor: 9.043

6.  SDPpred: a tool for prediction of amino acid residues that determine differences in functional specificity of homologous proteins.

Authors:  Olga V Kalinina; Pavel S Novichkov; Andrey A Mironov; Mikhail S Gelfand; Aleksandra B Rakhmaninova
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

7.  Prediction of functional sites by analysis of sequence and structure conservation.

Authors:  Anna R Panchenko; Fyodor Kondrashov; Stephen Bryant
Journal:  Protein Sci       Date:  2004-03-09       Impact factor: 6.725

8.  Automated prediction of protein function and detection of functional sites from structure.

Authors:  Florencio Pazos; Michael J E Sternberg
Journal:  Proc Natl Acad Sci U S A       Date:  2004-09-29       Impact factor: 11.205

9.  Surveying the manifold divergence of an entire protein class for statistical clues to underlying biochemical mechanisms.

Authors:  Andrew F Neuwald
Journal:  Stat Appl Genet Mol Biol       Date:  2011-08-04

10.  A coiled coil trigger site is essential for rapid binding of synaptobrevin to the SNARE acceptor complex.

Authors:  Katrin Wiederhold; Tobias H Kloepper; Alexander M Walter; Alexander Stein; Nickias Kienle; Jakob B Sørensen; Dirk Fasshauer
Journal:  J Biol Chem       Date:  2010-04-20       Impact factor: 5.157

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.