
Predicting enzyme function from sequence: a systematic appraisal.

I Shah, L Hunter.

Abstract

Gapped and ungapped sequence alignment were tested as possible methods to classify proteins into the functional classes defined by the International Enzyme Commission (EC). We exhaustively tested all 15,208 proteins labeled with any EC class in a recent release of the SwissProt database, evaluating all 1,327 relevant EC classes. We effectively tested all possible similarity thresholds that could be used for this assignment through the use of the ROC statistic. Approximately 60% of Enzyme Commission classes containing two or more proteins could not be perfectly discriminated by sequence similarity at any threshold. An analysis of the errors indicates that false positive matches dominate, and that various error mechanisms can be identified, including the multidomain nature of many proteins and polyproteins, convergent evolution, variation in enzyme specificity, and other factors. Many of the putative false positives are in fact biologically relevant. This work strongly suggests that functional assignment of enzymes should attempt to delimit functionally significant subregions, or domains, before matching to EC classes.
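
To illustrate why a ROC statistic covers every possible similarity threshold at once, here is a minimal Python sketch. It is not the authors' code: the function names and the example scores are hypothetical, and it assumes that for one EC class we already have similarity scores between a query protein and (a) members of the same class and (b) proteins from other classes. The area under the ROC curve equals the fraction of (same-class, other-class) score pairs that are ranked correctly, so a class is perfectly discriminated at some threshold only when that area is exactly 1.

```python
from typing import Sequence


def roc_auc(pos_scores: Sequence[float], neg_scores: Sequence[float]) -> float:
    """Area under the ROC curve, computed as the fraction of
    (positive, negative) score pairs ranked correctly; ties count half."""
    if not pos_scores or not neg_scores:
        raise ValueError("need at least one score in each group")
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


def perfectly_discriminated(pos_scores: Sequence[float],
                            neg_scores: Sequence[float]) -> bool:
    """True if some threshold separates every same-class score
    from every other-class score."""
    return min(pos_scores) > max(neg_scores)


# Hypothetical similarity scores for a single EC class (illustrative only).
same_class = [310.0, 250.0, 95.0]    # matches to proteins in the same class
other_class = [120.0, 60.0, 40.0]    # matches to proteins in other classes

auc = roc_auc(same_class, other_class)
separable = perfectly_discriminated(same_class, other_class)
```

In this made-up case one other-class score (120.0) outranks one same-class score (95.0), so no single threshold avoids both a false positive and a false negative, which is the situation the abstract reports for roughly 60% of classes.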


Year:  1997        PMID: 9322050      PMCID: PMC2709532     

Source DB:  PubMed          Journal:  Proc Int Conf Intell Syst Mol Biol        ISSN: 1553-0833


  18 in total

1.  Mining molecular binding terminology from biomedical text.

Authors:  T C Rindflesch; L Hunter; A R Aronson
Journal:  Proc AMIA Symp       Date:  1999

2.  Visual management of large scale data mining projects.

Authors:  I Shah; L Hunter
Journal:  Pac Symp Biocomput       Date:  2000

3.  Annotation transfer for genomics: measuring functional divergence in multi-domain proteins.

Authors:  H Hegyi; M Gerstein
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

4.  Proteome-wide protein interaction measurements of bacterial proteins of unknown function.

Authors:  Matthias Meier; Rene V Sit; Stephen R Quake
Journal:  Proc Natl Acad Sci U S A       Date:  2012-12-24       Impact factor: 11.205

5.  A top-down approach to classify enzyme functional classes and sub-classes using random forest.

Authors:  Chetan Kumar; Alok Choudhary
Journal:  EURASIP J Bioinform Syst Biol       Date:  2012-02-29

6.  Computational Approaches for Automated Classification of Enzyme Sequences.

Authors:  Akram Mohammed; Chittibabu Guda
Journal:  J Proteomics Bioinform       Date:  2011-08-23

Review 7.  The past, present and future of genome-wide re-annotation.

Authors:  Christos A Ouzounis; Peter D Karp
Journal:  Genome Biol       Date:  2002-01-31       Impact factor: 13.583

8.  Structural signatures of enzyme binding pockets from order-independent surface alignment: a study of metalloendopeptidase and NAD binding proteins.

Authors:  Joe Dundas; Larisa Adamian; Jie Liang
Journal:  J Mol Biol       Date:  2010-12-09       Impact factor: 5.469

9.  Charting the proteome of Cryptosporidium parvum sporozoites using sequence similarity-based BLAST searching.

Authors:  A M A M Z Siddiki; Jonathan M Wastling
Journal:  J Vet Sci       Date:  2009-09       Impact factor: 1.603

10.  Predicting protein linkages in bacteria: which method is best depends on task.

Authors:  Anis Karimpour-Fard; Sonia M Leach; Ryan T Gill; Lawrence E Hunter
Journal:  BMC Bioinformatics       Date:  2008-09-24       Impact factor: 3.169

