Literature DB >> 10338021

From fold predictions to function predictions: automation of functional site conservation analysis for functional genome predictions.

B Zhang1, L Rychlewski, K Pawłowski, J S Fetrow, J Skolnick, A Godzik.   

Abstract

A database of functional sites for proteins with known structures, SITE, is constructed and used in conjunction with a simple pattern matching program SiteMatch to evaluate possible function conservation in a recently constructed database of fold predictions for Escherichia coli proteins (Rychlewski L et al., 1999, Protein Sci 8:614-624). In this and other prediction databases, fold predictions are based on algorithms that can recognize weak sequence similarities and putatively assign new proteins into already characterized protein families. It is not clear whether such sequence similarities arise from distant homologies or general similarity of physicochemical features along the sequence. Leaving aside the important question of nature of relations within fold superfamilies, it is possible to assess possible function conservation by looking at the pattern of conservation of crucial functional residues. SITE consists of a multilevel function description based on structure annotations and structure analyses. In particular, active site residues, ligand binding residues, and patterns of hydrophobic residues on the protein surface are used to describe different functional features. SiteMatch, a simple pattern matching program, is designed to check the conservation of residues involved in protein activity in alignments generated by any alignment method. Here, this procedure is used to study conservation of functional features in alignments between protein sequences from the E. coli genome and their optimal structural templates. The optimal templates were identified and alignments taken from the database of genomic structural predictions was described in a previous publication (Rychlewski L et al., 1999, Protein Sci 8:614-624). An automated assessment of function conservation is used to analyze the relation between fold and function similarity for a large number of fold predictions. For instance, it is shown that identifying low significance predictions with a high level of functional residue conservations can be used to extend the prediction sensitivity for fold prediction methods. Over 100 new fold/function predictions in this class were obtained in the E. coli genome. At the same time, about 30% of our previous fold predictions are not confirmed as function predictions, further highlighting the problem of function divergence in fold superfamilies.

Entities:  

Mesh:

Year:  1999        PMID: 10338021      PMCID: PMC2144342          DOI: 10.1110/ps.8.5.1104

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  27 in total

1.  Topology fingerprint approach to the inverse protein folding problem.

Authors:  A Godzik; A Kolinski; J Skolnick
Journal:  J Mol Biol       Date:  1992-09-05       Impact factor: 5.469

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  Dynamic programming algorithms for biological sequence comparison.

Authors:  W R Pearson; W Miller
Journal:  Methods Enzymol       Date:  1992       Impact factor: 1.600

4.  A new approach to protein fold recognition.

Authors:  D T Jones; W R Taylor; J M Thornton
Journal:  Nature       Date:  1992-07-02       Impact factor: 49.962

5.  A method to identify protein sequences that fold into a known three-dimensional structure.

Authors:  J U Bowie; R Lüthy; D Eisenberg
Journal:  Science       Date:  1991-07-12       Impact factor: 47.728

6.  Selection of representative protein data sets.

Authors:  U Hobohm; M Scharf; R Schneider; C Sander
Journal:  Protein Sci       Date:  1992-03       Impact factor: 6.725

7.  Structural diversity in a family of homologous proteins.

Authors:  K Pawłowski; A Bierzyński; A Godzik
Journal:  J Mol Biol       Date:  1996-05-03       Impact factor: 5.469

8.  Protease II from Escherichia coli: sequencing and expression of the enzyme gene and characterization of the expressed enzyme.

Authors:  A Kanatani; T Masuda; T Shimoda; F Misoka; X S Lin; T Yoshimoto; D Tsuru
Journal:  J Biochem       Date:  1991-09       Impact factor: 3.387

9.  The crystal structure of GMP synthetase reveals a novel catalytic triad and is a structural paradigm for two enzyme families.

Authors:  J J Tesmer; T J Klem; M L Deras; V J Davisson; J L Smith
Journal:  Nat Struct Biol       Date:  1996-01

10.  The metal-ion-free oxidoreductase from Streptomyces aureofaciens has an alpha/beta hydrolase fold.

Authors:  H J Hecht; H Sobek; T Haag; O Pfeifer; K H van Pée
Journal:  Nat Struct Biol       Date:  1994-08
View more
  16 in total

1.  LiveBench-1: continuous benchmarking of protein structure prediction servers.

Authors:  J M Bujnicki; A Elofsson; D Fischer; L Rychlewski
Journal:  Protein Sci       Date:  2001-02       Impact factor: 6.725

2.  Motif-based fold assignment.

Authors:  L Salwinski; D Eisenberg
Journal:  Protein Sci       Date:  2001-12       Impact factor: 6.725

3.  Annotation transfer for genomics: measuring functional divergence in multi-domain proteins.

Authors:  H Hegyi; M Gerstein
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

4.  Analysis and prediction of functionally important sites in proteins.

Authors:  Saikat Chakrabarti; Christopher J Lanczycki
Journal:  Protein Sci       Date:  2007-01       Impact factor: 6.725

5.  Nonbonded terms extrapolated from nonlocal knowledge-based energy functions improve error detection in near-native protein structure models.

Authors:  Evandro Ferrada; Francisco Melo
Journal:  Protein Sci       Date:  2007-07       Impact factor: 6.725

6.  Operons in Escherichia coli: genomic analyses and predictions.

Authors:  H Salgado; G Moreno-Hagelsieb; T F Smith; J Collado-Vides
Journal:  Proc Natl Acad Sci U S A       Date:  2000-06-06       Impact factor: 11.205

7.  Predicting ligand-binding function in families of bacterial receptors.

Authors:  J M Johnson; G M Church
Journal:  Proc Natl Acad Sci U S A       Date:  2000-04-11       Impact factor: 11.205

8.  Fine-mapping, mutation analyses, and structural mapping of cerebrotendinous xanthomatosis in U.S. pedigrees.

Authors:  M H Lee; S Hazard; J D Carpten; S Yi; J Cohen; G T Gerhardt; G Salen; S B Patel
Journal:  J Lipid Res       Date:  2001-02       Impact factor: 5.922

9.  Catalytic residues in hydrolases: analysis of methods designed for ligand-binding site prediction.

Authors:  Katarzyna Prymula; Tomasz Jadczyk; Irena Roterman
Journal:  J Comput Aided Mol Des       Date:  2010-11-21       Impact factor: 3.686

10.  Prediction of functionally important sites from protein sequences using sparse kernel least squares classifiers.

Authors:  Ke Tang; Ganesan Pugalenthi; P N Suganthan; Christopher J Lanczycki; Saikat Chakrabarti
Journal:  Biochem Biophys Res Commun       Date:  2009-04-24       Impact factor: 3.575

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.