Literature DB >> 16410325

Automated discovery of 3D motifs for protein function annotation.

Benjamin J Polacco1, Patricia C Babbitt.   

Abstract

MOTIVATION: Function inference from structure is facilitated by the use of patterns of residues (3D motifs), normally identified by expert knowledge, that correlate with function. As an alternative to often limited expert knowledge, we use machine-learning techniques to identify patterns of 3-10 residues that maximize function prediction. This approach allows us to test the assumption that residues that provide function are the most informative for predicting function.
RESULTS: We apply our method, GASPS, to the haloacid dehalogenase, enolase, amidohydrolase and crotonase superfamilies and to the serine proteases. The motifs found by GASPS are as good at function prediction as 3D motifs based on expert knowledge. The GASPS motifs with the greatest ability to predict protein function consist mainly of known functional residues. However, several residues with no known functional role are equally predictive. For four groups, we show that the predictive power of our 3D motifs is comparable with or better than approaches that use the entire fold (Combinatorial-Extension) or sequence profiles (PSI-BLAST). AVAILABILITY: Source code is freely available for academic use by contacting the authors. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Mesh:

Substances:

Year:  2006        PMID: 16410325     DOI: 10.1093/bioinformatics/btk038

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  41 in total

1.  Sequence and structure continuity of evolutionary importance improves protein functional site discovery and annotation.

Authors:  A D Wilkins; R Lua; S Erdin; R M Ward; O Lichtarge
Journal:  Protein Sci       Date:  2010-07       Impact factor: 6.725

Review 2.  FINDSITE: a combined evolution/structure-based approach to protein function prediction.

Authors:  Jeffrey Skolnick; Michal Brylinski
Journal:  Brief Bioinform       Date:  2009-03-26       Impact factor: 11.622

3.  Prediction and experimental validation of enzyme substrate specificity in protein structures.

Authors:  Shivas R Amin; Serkan Erdin; R Matthew Ward; Rhonald C Lua; Olivier Lichtarge
Journal:  Proc Natl Acad Sci U S A       Date:  2013-10-21       Impact factor: 11.205

Review 4.  Exploring the structure and function paradigm.

Authors:  Oliver C Redfern; Benoit Dessailly; Christine A Orengo
Journal:  Curr Opin Struct Biol       Date:  2008-06       Impact factor: 6.809

5.  Analysis of substructural variation in families of enzymatic proteins with applications to protein function prediction.

Authors:  Drew H Bryant; Mark Moll; Brian Y Chen; Viacheslav Y Fofanov; Lydia E Kavraki
Journal:  BMC Bioinformatics       Date:  2010-05-11       Impact factor: 3.169

Review 6.  Protein function annotation by homology-based inference.

Authors:  Yaniv Loewenstein; Domenico Raimondo; Oliver C Redfern; James Watson; Dmitrij Frishman; Michal Linial; Christine Orengo; Janet Thornton; Anna Tramontano
Journal:  Genome Biol       Date:  2009-02-02       Impact factor: 13.583

7.  Structural descriptor database: a new tool for sequence-based functional site prediction.

Authors:  Juliana S Bernardes; Jorge H Fernandez; Ana Tereza R Vasconcelos
Journal:  BMC Bioinformatics       Date:  2008-11-25       Impact factor: 3.169

8.  A comprehensive analysis of the structure-function relationship in proteins based on local structure similarity.

Authors:  Torgeir R Hvidsten; Astrid Laegreid; Andriy Kryshtafovych; Gunnar Andersson; Krzysztof Fidelis; Jan Komorowski
Journal:  PLoS One       Date:  2009-07-15       Impact factor: 3.240

9.  Annotation of protein residues based on a literature analysis: cross-validation against UniProtKb.

Authors:  Kevin Nagel; Antonio Jimeno-Yepes; Dietrich Rebholz-Schuhmann
Journal:  BMC Bioinformatics       Date:  2009-08-27       Impact factor: 3.169

10.  FLORA: a novel method to predict protein function from structure in diverse superfamilies.

Authors:  Oliver C Redfern; Benoît H Dessailly; Timothy J Dallman; Ian Sillitoe; Christine A Orengo
Journal:  PLoS Comput Biol       Date:  2009-08-28       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.