Literature DB >> 9719646

Method for prediction of protein function from sequence using the sequence-to-structure-to-function paradigm with application to glutaredoxins/thioredoxins and T1 ribonucleases.

J S Fetrow1, J Skolnick.   

Abstract

The practical exploitation of the vast numbers of sequences in the genome sequence databases is crucially dependent on the ability to identify the function of each sequence. Unfortunately, current methods, including global sequence alignment and local sequence motif identification, are limited by the extent of sequence similarity between sequences of unknown and known function; these methods increasingly fail as the sequence identity diverges into and beyond the twilight zone of sequence identity. To address this problem, a novel method for identification of protein function based directly on the sequence-to-structure-to-function paradigm is described. Descriptors of protein active sites, termed "fuzzy functional forms" or FFFs, are created based on the geometry and conformation of the active site. By way of illustration, the active sites responsible for the disulfide oxidoreductase activity of the glutaredoxin/thioredoxin family and the RNA hydrolytic activity of the T1 ribonuclease family are presented. First, the FFFs are shown to correctly identify their corresponding active sites in a library of exact protein models produced by crystallography or NMR spectroscopy, most of which lack the specified activity. Next, these FFFs are used to screen for active sites in low-to-moderate resolution models produced by ab initio folding or threading prediction algorithms. Again, the FFFs can specifically identify the functional sites of these proteins from their predicted structures. The results demonstrate that low-to-moderate resolution models as produced by state-of-the-art tertiary structure prediction algorithms are sufficient to identify protein active sites. Prediction of a novel function for the gamma subunit of a yeast glycosyl transferase and prediction of the function of two hypothetical yeast proteins whose models were produced via threading are presented. This work suggests a means for the large-scale functional screening of genomic sequence databases based on the prediction of structure from sequence, then on the identification of functional active sites in the predicted structure. Copyright 1998 Academic Press

Entities:  

Mesh:

Substances:

Year:  1998        PMID: 9719646     DOI: 10.1006/jmbi.1998.1993

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  63 in total

1.  MODBASE, a database of annotated comparative protein structure models.

Authors:  R Sánchez; U Pieper; N Mirković; P I de Bakker; E Wittenstein; A Sali
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  A systematic study of low-resolution recognition in protein--protein complexes.

Authors:  I A Vakser; O G Matar; C F Lam
Journal:  Proc Natl Acad Sci U S A       Date:  1999-07-20       Impact factor: 11.205

3.  Motif-based fold assignment.

Authors:  L Salwinski; D Eisenberg
Journal:  Protein Sci       Date:  2001-12       Impact factor: 6.725

4.  TOUCHSTONE: an ab initio protein structure prediction method that uses threading-based tertiary restraints.

Authors:  D Kihara; H Lu; A Kolinski; J Skolnick
Journal:  Proc Natl Acad Sci U S A       Date:  2001-08-14       Impact factor: 11.205

5.  On the molecular discrimination between adenine and guanine by proteins.

Authors:  I Nobeli; R A Laskowski; W S Valdar; J M Thornton
Journal:  Nucleic Acids Res       Date:  2001-11-01       Impact factor: 16.971

6.  Arabinogalactan protein and wall-associated kinase in a plasmalemmal reticulum with specialized vertices.

Authors:  J S Gens; M Fujiki; B G Pickard
Journal:  Protoplasma       Date:  2000 Jul-Aug       Impact factor: 3.356

7.  MAMMOTH (matching molecular models obtained from theory): an automated method for model comparison.

Authors:  Angel R Ortiz; Charlie E M Strauss; Osvaldo Olmea
Journal:  Protein Sci       Date:  2002-11       Impact factor: 6.725

8.  Automated protein fold determination using a minimal NMR constraint strategy.

Authors:  Deyou Zheng; Yuanpeng J Huang; Hunter N B Moseley; Rong Xiao; James Aramini; G V T Swapna; Gaetano T Montelione
Journal:  Protein Sci       Date:  2003-06       Impact factor: 6.725

9.  Future directions in protein function prediction.

Authors:  Ihsan A Shehadi; Huyuan Yang; Mary Jo Ondrechen
Journal:  Mol Biol Rep       Date:  2002-12       Impact factor: 2.316

10.  Annotation in three dimensions. PINTS: Patterns in Non-homologous Tertiary Structures.

Authors:  Alexander Stark; Robert B Russell
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.