ESG: extended similarity group method for automated protein function prediction.

Meghana Chitale, Troy Hawkins, Changsoon Park, Daisuke Kihara.

Abstract

MOTIVATION: Accurate automatic protein function prediction is increasingly important given the large number of newly sequenced genomes and proteomics datasets awaiting biological interpretation. Conventional methods have focused on annotation transfer based on high sequence similarity, which relies on the concept of homology. However, many cases have been reported in which simply transferring function from the top hits of a homology search produces erroneous annotations. New methods are needed that handle sequence similarity more robustly, combining signals from strongly and weakly similar proteins to predict the function of uncharacterized proteins with high reliability.
RESULTS: We present the extended similarity group (ESG) method, which performs iterative sequence database searches and annotates a query sequence with Gene Ontology terms. Each annotation is assigned a probability based on its relative similarity score with the multiple-level neighbors in the protein similarity graph. We show how the statistical framework of ESG improves prediction accuracy by iteratively taking into account the neighborhood of the query protein in sequence similarity space. ESG outperforms conventional PSI-BLAST and the protein function prediction (PFP) algorithm. The iterative search is effective in capturing multiple domains in a query protein, enabling accurate prediction of several functions that originate from different domains.
AVAILABILITY: The ESG web server is available for automated protein function prediction at http://dragon.bio.purdue.edu/ESG/.
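The multi-level scoring idea in RESULTS can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the scoring formula, so the weighting scheme (raw similarity scores normalized to sum to 1), the mixing parameter `v`, the function name `esg_like_scores`, and the toy graph are all assumptions made for illustration. A real pipeline would obtain the neighbors and their scores from iterative PSI-BLAST searches.

```python
from collections import defaultdict


def normalize(scores):
    """Turn raw similarity scores into weights that sum to 1."""
    total = sum(scores.values())
    return {name: s / total for name, s in scores.items()}


def esg_like_scores(query, hits, annotations, levels=2, v=0.5):
    """Score GO terms for `query` from its multi-level similarity neighborhood.

    hits:        protein -> {neighbor: raw similarity score} (hypothetical search output)
    annotations: protein -> set of GO term ids
    levels:      number of search iterations to follow
    v:           assumed weight of a neighbor's own annotation versus the
                 scores propagated from that neighbor's own neighbors
    """
    neighbors = hits.get(query, {})
    if not neighbors:
        return {}
    scores = defaultdict(float)
    for nb, w in normalize(neighbors).items():
        # Terms the neighbor is annotated with directly count as 1.0.
        direct = {term: 1.0 for term in annotations.get(nb, ())}
        indirect = {}
        if levels > 1:
            # Recurse: treat the neighbor as a query at the next level.
            indirect = esg_like_scores(nb, hits, annotations, levels - 1, v)
        if indirect:
            for term in set(direct) | set(indirect):
                mixed = v * direct.get(term, 0.0) + (1 - v) * indirect.get(term, 0.0)
                scores[term] += w * mixed
        else:
            # No deeper neighborhood: rely on the direct annotation only.
            for term, value in direct.items():
                scores[term] += w * value
    return dict(scores)


# Toy similarity graph (hypothetical proteins, scores, and GO ids).
hits = {
    "Q": {"A": 3.0, "B": 1.0},
    "A": {"C": 1.0},
    "B": {"C": 1.0, "D": 1.0},
}
annotations = {"A": {"GO:1"}, "B": {"GO:2"}, "C": {"GO:1"}, "D": {"GO:3"}}

scores = esg_like_scores("Q", hits, annotations)
for term, p in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(term, p)
```

In this toy graph, GO:1 ranks first because it is supported both by the strongest first-level hit and by second-level neighbors, while terms seen only through weaker or indirect hits still receive a small nonzero score.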

Year:  2009        PMID: 19435743      PMCID: PMC2705228          DOI: 10.1093/bioinformatics/btp309

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


References (31 in total; first 10 shown)

1.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

Review 2.  Automated protein function prediction--the genomic challenge.

Authors:  Iddo Friedberg
Journal:  Brief Bioinform       Date:  2006-05-23       Impact factor: 11.622

3.  Enhanced automated function prediction using distantly related sequences and contextual association by PFP.

Authors:  Troy Hawkins; Stanislav Luban; Daisuke Kihara
Journal:  Protein Sci       Date:  2006-05-02       Impact factor: 6.725

4.  New avenues in protein function prediction.

Authors:  Iddo Friedberg; Martin Jambon; Adam Godzik
Journal:  Protein Sci       Date:  2006-06       Impact factor: 6.725

Review 5.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

6.  A new measure for functional similarity of gene products based on Gene Ontology.

Authors:  Andreas Schlicker; Francisco S Domingues; Jörg Rahnenführer; Thomas Lengauer
Journal:  BMC Bioinformatics       Date:  2006-06-15       Impact factor: 3.169

7.  ProtoNet 4.0: a hierarchical classification of one million protein sequences.

Authors:  Noam Kaplan; Ori Sasson; Uri Inbar; Moriah Friedlich; Menachem Fromer; Hillel Fleischer; Elon Portugaly; Nathan Linial; Michal Linial
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

8.  GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes.

Authors:  David M A Martin; Matthew Berriman; Geoffrey J Barton
Journal:  BMC Bioinformatics       Date:  2004-11-18       Impact factor: 3.169

9.  The Universal Protein Resource (UniProt).

Authors:  Amos Bairoch; Rolf Apweiler; Cathy H Wu; Winona C Barker; Brigitte Boeckmann; Serenella Ferro; Elisabeth Gasteiger; Hongzhan Huang; Rodrigo Lopez; Michele Magrane; Maria J Martin; Darren A Natale; Claire O'Donovan; Nicole Redaschi; Lai-Su L Yeh
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

10.  GOPET: a tool for automated predictions of Gene Ontology terms.

Authors:  Arunachalam Vinayagam; Coral del Val; Falk Schubert; Roland Eils; Karl-Heinz Glatting; Sándor Suhai; Rainer König
Journal:  BMC Bioinformatics       Date:  2006-03-20       Impact factor: 3.169

Cited by (46 in total; first 10 shown)

1.  Real-time ligand binding pocket database search using local surface descriptors.

Authors:  Rayan Chikhi; Lee Sael; Daisuke Kihara
Journal:  Proteins       Date:  2010-07

Review 2.  Computational characterization of moonlighting proteins.

Authors:  Ishita K Khan; Daisuke Kihara
Journal:  Biochem Soc Trans       Date:  2014-12       Impact factor: 5.407

Review 3.  The cell biology of schistosomes: a window on the evolution of the early metazoa.

Authors:  R Alan Wilson
Journal:  Protoplasma       Date:  2012-07       Impact factor: 3.356

4.  Structure- and sequence-based function prediction for non-homologous proteins.

Authors:  Lee Sael; Meghana Chitale; Daisuke Kihara
Journal:  J Struct Funct Genomics       Date:  2012-01-22

5.  Computational Methods for Predicting Protein-Protein Interactions Using Various Protein Features.

Authors:  Ziyun Ding; Daisuke Kihara
Journal:  Curr Protoc Protein Sci       Date:  2018-06-21

6.  EnzymeDetector: an integrated enzyme function prediction tool and database.

Authors:  Susanne Quester; Dietmar Schomburg
Journal:  BMC Bioinformatics       Date:  2011-09-23       Impact factor: 3.169

7.  An orthology-based analysis of pathogenic protozoa impacting global health: an improved comparative genomics approach with prokaryotes and model eukaryote orthologs.

Authors:  Rafael R C Cuadrat; Sérgio Manuel da Serra Cruz; Diogo Antônio Tschoeke; Edno Silva; Frederico Tosta; Henrique Jucá; Rodrigo Jardim; Maria Luiza M Campos; Marta Mattoso; Alberto M R Dávila
Journal:  OMICS       Date:  2014-06-24

8.  Identification of Moonlighting Proteins in Genomes Using Text Mining Techniques.

Authors:  Aashish Jain; Hareesh Gali; Daisuke Kihara
Journal:  Proteomics       Date:  2018-10-10       Impact factor: 3.984

9.  The PFP and ESG protein function prediction methods in 2014: effect of database updates and ensemble approaches.

Authors:  Ishita K Khan; Qing Wei; Samuel Chapman; Dukka B Kc; Daisuke Kihara
Journal:  Gigascience       Date:  2015-09-14       Impact factor: 6.524

10.  Functional enrichment analyses and construction of functional similarity networks with high confidence function prediction by PFP.

Authors:  Troy Hawkins; Meghana Chitale; Daisuke Kihara
Journal:  BMC Bioinformatics       Date:  2010-05-19       Impact factor: 3.169
