Literature DB >> 10329133

The relationship between protein structure and function: a comprehensive survey with application to the yeast genome.

H Hegyi1, M Gerstein.   

Abstract

For most proteins in the genome databases, function is predicted via sequence comparison. In spite of the popularity of this approach, the extent to which it can be reliably applied is unknown. We address this issue by systematically investigating the relationship between protein function and structure. We focus initially on enzymes functionally classified by the Enzyme Commission (EC) and relate these to by structurally classified domains the SCOP database. We find that the major SCOP fold classes have different propensities to carry out certain broad categories of functions. For instance, alpha/beta folds are disproportionately associated with enzymes, especially transferases and hydrolases, and all-alpha and small folds with non-enzymes, while alpha+beta folds have an equal tendency either way. These observations for the database overall are largely true for specific genomes. We focus, in particular, on yeast, analyzing it with many classifications in addition to SCOP and EC (i.e. COGs, CATH, MIPS), and find clear tendencies for fold-function association, across a broad spectrum of functions. Analysis with the COGs scheme also suggests that the functions of the most ancient proteins are more evenly distributed among different structural classes than those of more modern ones. For the database overall, we identify the most versatile functions, i.e. those that are associated with the most folds, and the most versatile folds, associated with the most functions. The two most versatile enzymatic functions (hydro-lyases and O-glycosyl glucosidases) are associated with seven folds each. The five most versatile folds (TIM-barrel, Rossmann, ferredoxin, alpha-beta hydrolase, and P-loop NTP hydrolase) are all mixed alpha-beta structures. They stand out as generic scaffolds, accommodating from six to as many as 16 functions (for the exceptional TIM-barrel). At the conclusion of our analysis we are able to construct a graph giving the chance that a functional annotation can be reliably transferred at different degrees of sequence and structural similarity. Supplemental information is available from http://bioinfo.mbb.yale.edu/genome/foldfunc++ +. Copyright 1999 Academic Press.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10329133     DOI: 10.1006/jmbi.1999.2661

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  99 in total

1.  PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information.

Authors:  J Qian; B Stenger; C A Wilson; J Lin; R Jansen; S A Teichmann; J Park; W G Krebs; H Yu; V Alexandrov; N Echols; M Gerstein
Journal:  Nucleic Acids Res       Date:  2001-04-15       Impact factor: 16.971

2.  Expectations from structural genomics.

Authors:  S E Brenner; M Levitt
Journal:  Protein Sci       Date:  2000-01       Impact factor: 6.725

3.  Whole-genome trees based on the occurrence of folds and orthologs: implications for comparing genomes on different levels.

Authors:  J Lin; M Gerstein
Journal:  Genome Res       Date:  2000-06       Impact factor: 9.043

4.  Analysis of the yeast transcriptome with structural and functional categories: characterizing highly expressed proteins.

Authors:  R Jansen; M Gerstein
Journal:  Nucleic Acids Res       Date:  2000-03-15       Impact factor: 16.971

5.  Directed evolution of a (beta alpha)8-barrel enzyme to catalyze related reactions in two different metabolic pathways.

Authors:  C Jürgens; A Strom; D Wegener; S Hettwer; M Wilmanns; R Sterner
Journal:  Proc Natl Acad Sci U S A       Date:  2000-08-29       Impact factor: 11.205

6.  Motif-based fold assignment.

Authors:  L Salwinski; D Eisenberg
Journal:  Protein Sci       Date:  2001-12       Impact factor: 6.725

7.  An efficient algorithm for large-scale detection of protein families.

Authors:  A J Enright; S Van Dongen; C A Ouzounis
Journal:  Nucleic Acids Res       Date:  2002-04-01       Impact factor: 16.971

8.  Molecular fossils in the human genome: identification and analysis of the pseudogenes in chromosomes 21 and 22.

Authors:  Paul M Harrison; Hedi Hegyi; Suganthi Balasubramanian; Nicholas M Luscombe; Paul Bertone; Nathaniel Echols; Ted Johnson; Mark Gerstein
Journal:  Genome Res       Date:  2002-02       Impact factor: 9.043

9.  Annotation transfer for genomics: measuring functional divergence in multi-domain proteins.

Authors:  H Hegyi; M Gerstein
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

10.  Functional versatility and molecular diversity of the metabolic map of Escherichia coli.

Authors:  S Tsoka; C A Ouzounis
Journal:  Genome Res       Date:  2001-09       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.