Literature DB >> 10944397

Practical limits of function prediction.

D Devos1, A Valencia.   

Abstract

The widening gap between known protein sequences and their functions has led to the practice of assigning a potential function to a protein on the basis of sequence similarity to proteins whose function has been experimentally investigated. We present here a critical view of the theoretical and practical bases for this approach. The results obtained by analyzing a significant number of true sequence similarities, derived directly from structural alignments, point to the complexity of function prediction. Different aspects of protein function, including (i) enzymatic function classification, (ii) functional annotations in the form of key words, (iii) classes of cellular function, and (iv) conservation of binding sites can only be reliably transferred between similar sequences to a modest degree. The reason for this difficulty is a combination of the unavoidable database inaccuracies and the plasticity of protein function. In addition, analysis of the relationship between sequence and functional descriptions defines an empirical limit for pairwise-based functional annotations, namely, the three first digits of the six numbers used as descriptors of protein folds in the FSSP database can be predicted at an average level as low as 7.5% sequence identity, two of the four EC digits at 15% identity, half of the SWISS-PROT key words related to protein function would require 20% identity, and the prediction of half of the residues in the binding site can be made at the 30% sequence identity level.

Mesh:

Substances:

Year:  2000        PMID: 10944397

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  121 in total

1.  Finding nuclear localization signals.

Authors:  M Cokol; R Nair; B Rost
Journal:  EMBO Rep       Date:  2000-11       Impact factor: 8.807

2.  Motif-based fold assignment.

Authors:  L Salwinski; D Eisenberg
Journal:  Protein Sci       Date:  2001-12       Impact factor: 6.725

3.  Comparing function and structure between entire proteomes.

Authors:  J Liu; B Rost
Journal:  Protein Sci       Date:  2001-10       Impact factor: 6.725

4.  Annotation transfer for genomics: measuring functional divergence in multi-domain proteins.

Authors:  H Hegyi; M Gerstein
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

5.  Structural characterization of the human proteome.

Authors:  Arne Müller; Robert M MacCallum; Michael J E Sternberg
Journal:  Genome Res       Date:  2002-11       Impact factor: 9.043

Review 6.  Bioinformatics methods to predict protein structure and function. A practical approach.

Authors:  Yvonne J K Edwards; Amanda Cottage
Journal:  Mol Biotechnol       Date:  2003-02       Impact factor: 2.695

7.  Sequence conserved for subcellular localization.

Authors:  Rajesh Nair; Burkhard Rost
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

8.  UniqueProt: Creating representative protein sequence sets.

Authors:  Sven Mika; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

9.  Identification of protein biochemical functions by similarity search using the molecular surface database eF-site.

Authors:  Kengo Kinoshita; Haruki Nakamura
Journal:  Protein Sci       Date:  2003-08       Impact factor: 6.725

10.  PrfA protein of Bacillus species: prediction and demonstration of endonuclease activity on DNA.

Authors:  Daniel J Rigden; Peter Setlow; Barbara Setlow; Irina Bagyan; Richard A Stein; Mark J Jedrzejas
Journal:  Protein Sci       Date:  2002-10       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.