Towards fully automated structure-based function prediction in structural genomics: a case study.

James D Watson, Steve Sanderson, Alexandra Ezersky, Alexei Savchenko, Aled Edwards, Christine Orengo, Andrzej Joachimiak, Roman A Laskowski, Janet M Thornton.

Abstract

As the global Structural Genomics projects have picked up pace, the number of structures annotated in the Protein Data Bank as "hypothetical protein" or "unknown function" has grown significantly. A major challenge now is to develop computational methods that assign functions to these proteins accurately and automatically. As part of the Midwest Center for Structural Genomics (MCSG) we have developed a fully automated functional analysis server, ProFunc, which performs a battery of analyses on a submitted structure. The analyses combine a number of sequence-based and structure-based methods to identify functional clues. After the first stage of the Protein Structure Initiative (PSI), we review the success of the pipeline and the importance of structure-based function prediction. As a dataset, we have chosen all structures solved by the MCSG during the 5 years of the first PSI. Our analysis suggests that two of the structure-based methods are particularly successful and provide examples of local similarity that is difficult to identify using current sequence-based methods. No one method is successful in all cases, so, by using a number of complementary sequence and structural approaches, the ProFunc server increases the chances that at least one method will find a significant hit that can help elucidate function. Manual assessment of the results is time-consuming and subject to individual interpretation and human error. We present a method based on the Gene Ontology (GO) schema using GO-slims that allows the automated assessment of hits with a success rate approaching that of expert manual assessment.
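
The abstract does not spell out how the GO-slim assessment works, so the following is only a minimal sketch of the general idea, under stated assumptions: each term is mapped up the ontology to the coarse "slim" categories above it, and a hit is counted as agreeing with the known function when the two slim sets overlap. The toy ontology, the term names, the slim set, and the functions `ancestors`, `to_slim` and `hit_agrees` are all hypothetical; they are not real GO identifiers and not the ProFunc implementation.

```python
from typing import Dict, Set

# Hypothetical toy ontology: child term -> parent terms.
# These names are illustrative only, not real GO identifiers.
PARENTS: Dict[str, Set[str]] = {
    "serine_protease": {"peptidase"},
    "peptidase": {"hydrolase"},
    "hydrolase": {"catalytic_activity"},
    "protein_kinase": {"kinase"},
    "kinase": {"transferase"},
    "transferase": {"catalytic_activity"},
    "catalytic_activity": set(),
}

# Hypothetical slim: the coarse categories used for comparison.
SLIM: Set[str] = {"hydrolase", "transferase"}


def ancestors(term: str, parents: Dict[str, Set[str]]) -> Set[str]:
    """Return the term itself plus every term reachable via parent links."""
    seen: Set[str] = set()
    stack = [term]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents.get(current, ()))
    return seen


def to_slim(terms: Set[str], parents: Dict[str, Set[str]], slim: Set[str]) -> Set[str]:
    """Map a set of specific terms onto the slim categories at or above them."""
    mapped: Set[str] = set()
    for term in terms:
        mapped |= ancestors(term, parents) & slim
    return mapped


def hit_agrees(known: Set[str], hit: Set[str]) -> bool:
    """Assumed criterion: a hit agrees if it shares at least one slim category."""
    return bool(to_slim(known, PARENTS, SLIM) & to_slim(hit, PARENTS, SLIM))
```

With this toy data, `hit_agrees({"serine_protease"}, {"peptidase"})` is true because both map to the `hydrolase` category, while a `protein_kinase` hit would not agree. Comparing at the slim level rather than on exact terms is what lets an automated check tolerate annotations made at different levels of detail.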

Year:  2007        PMID: 17316683      PMCID: PMC2566530          DOI: 10.1016/j.jmb.2007.01.063

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


