Literature DB >> 16037208

Effective function annotation through catalytic residue conservation.

Richard A George1, Ruth V Spriggs, Gail J Bartlett, Alex Gutteridge, Malcolm W MacArthur, Craig T Porter, Bissan Al-Lazikani, Janet M Thornton, Mark B Swindells.   

Abstract

Because of the extreme impact of genome sequencing projects, protein sequences without accompanying experimental data now dominate public databases. Homology searches, by providing an opportunity to transfer functional information between related proteins, have become the de facto way to address this. Although a single, well annotated, close relationship will often facilitate sufficient annotation, this situation is not always the case, particularly if mutations are present in important functional residues. When only distant relationships are available, the transfer of function information is more tenuous, and the likelihood of encountering several well annotated proteins with different functions is increased. The consequence for a researcher is a range of candidate functions with little way of knowing which, if any, are correct. Here, we address the problem directly by introducing a computational approach to accurately identify and segregate related proteins into those with a functional similarity and those where function differs. This approach should find a wide range of applications, including the interpretation of genomics/proteomics data and the prioritization of targets for high-throughput structure determination. The method is generic, but here we concentrate on enzymes and apply high-quality catalytic site data. In addition to providing a series of comprehensive benchmarks to show the overall performance of our approach, we illustrate its utility with specific examples that include the correct identification of haptoglobin as a nonenzymatic relative of trypsin, discrimination of acid-d-amino acid ligases from a much larger ligase pool, and the successful annotation of BioH, a structural genomics target.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16037208      PMCID: PMC1178014          DOI: 10.1073/pnas.0504833102

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  36 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Evolution of function in protein superfamilies, from a structural perspective.

Authors:  A E Todd; C A Orengo; J M Thornton
Journal:  J Mol Biol       Date:  2001-04-06       Impact factor: 5.469

3.  Sources of systematic error in functional annotation of genomes: domain rearrangement, non-orthologous gene displacement and operon disruption.

Authors:  M Y Galperin; E V Koonin
Journal:  In Silico Biol       Date:  1998

Review 4.  Getting the most from PSI-BLAST.

Authors:  David T Jones; Mark B Swindells
Journal:  Trends Biochem Sci       Date:  2002-03       Impact factor: 13.807

5.  Annotation transfer for genomics: measuring functional divergence in multi-domain proteins.

Authors:  H Hegyi; M Gerstein
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

Review 6.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

7.  An evolutionary trace method defines binding surfaces common to protein families.

Authors:  O Lichtarge; H R Bourne; F E Cohen
Journal:  J Mol Biol       Date:  1996-03-29       Impact factor: 5.469

8.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

Authors:  J D Thompson; D G Higgins; T J Gibson
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

9.  Nicastrin, a presenilin-interacting protein, contains an aminopeptidase/transferrin receptor superfamily domain.

Authors:  R Fagan; M Swindells; J Overington; M Weir
Journal:  Trends Biochem Sci       Date:  2001-04       Impact factor: 13.807

10.  Quod erat demonstrandum? The mystery of experimental validation of apparently erroneous computational analyses of protein sequences.

Authors:  L M Iyer; L Aravind; P Bork; K Hofmann; A R Mushegian; I B Zhulin; E V Koonin
Journal:  Genome Biol       Date:  2001-11-13       Impact factor: 13.583

View more
  23 in total

1.  Profile of Janet M. Thornton.

Authors:  Nick Zagorski
Journal:  Proc Natl Acad Sci U S A       Date:  2005-08-23       Impact factor: 11.205

2.  LigProf: a simple tool for in silico prediction of ligand-binding sites.

Authors:  Grzegorz Koczyk; Lucjan S Wyrwicz; Leszek Rychlewski
Journal:  J Mol Model       Date:  2007-01-03       Impact factor: 1.810

3.  Missing in action: enzyme functional annotations in biological databases.

Authors:  Nicholas Furnham; John S Garavelli; Rolf Apweiler; Janet M Thornton
Journal:  Nat Chem Biol       Date:  2009-08       Impact factor: 15.040

4.  Computational protein design: validation and possible relevance as a tool for homology searching and fold recognition.

Authors:  Marcel Schmidt Am Busch; Audrey Sedano; Thomas Simonson
Journal:  PLoS One       Date:  2010-05-05       Impact factor: 3.240

5.  Cell cycle kinases predicted from conserved biophysical properties.

Authors:  Kazimierz O Wrzeszczynski; Burkhard Rost
Journal:  Proteins       Date:  2009-02-15

Review 6.  Protein function annotation by homology-based inference.

Authors:  Yaniv Loewenstein; Domenico Raimondo; Oliver C Redfern; James Watson; Dmitrij Frishman; Michal Linial; Christine Orengo; Janet Thornton; Anna Tramontano
Journal:  Genome Biol       Date:  2009-02-02       Impact factor: 13.583

7.  Structural and phylogenetic analysis of a conserved actinobacteria-specific protein (ASP1; SCO1997) from Streptomyces coelicolor.

Authors:  Beile Gao; Seiji Sugiman-Marangos; Murray S Junop; Radhey S Gupta
Journal:  BMC Struct Biol       Date:  2009-06-10

8.  Analysis of nucleoside-binding proteins by ligand-specific elution from dye resin: application to Mycobacterium tuberculosis aldehyde dehydrogenases.

Authors:  Chang-Yub Kim; Cecelia Webster; Justin K M Roberts; Jin Ho Moon; Emily Z Alipio Lyon; Heungbok Kim; Minmin Yu; Li-Wei Hung; Thomas C Terwilliger
Journal:  J Struct Funct Genomics       Date:  2009-11-13

9.  Characterization and prediction of residues determining protein functional specificity.

Authors:  John A Capra; Mona Singh
Journal:  Bioinformatics       Date:  2008-05-01       Impact factor: 6.937

10.  Active site prediction using evolutionary and structural information.

Authors:  Sriram Sankararaman; Fei Sha; Jack F Kirsch; Michael I Jordan; Kimmen Sjölander
Journal:  Bioinformatics       Date:  2010-01-14       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.