Phylogenetic molecular function annotation.

Barbara E Engelhardt, Michael I Jordan, Susanna T Repo, Steven E Brenner.

Abstract

It is now easier to discover thousands of protein sequences in a new microbial genome than it is to biochemically characterize the specific activity of a single protein of unknown function. The molecular functions of protein sequences have typically been predicted using homology-based computational methods, which rely on the principle that homologous proteins share a similar function. However, some protein families include groups of proteins with different molecular functions. Phylogenetic approaches (sometimes called "phylogenomics") are an effective means of predicting protein molecular function. These methods use the evolutionary history of a protein family to combine functional evidence from all of its characterized members, yielding robust predictions for the uncharacterized proteins. However, they are often difficult to apply on a genome-wide scale because reconstructing a phylogeny for each protein to be annotated is time-consuming. Our automated approach for function annotation using phylogeny, the SIFTER (Statistical Inference of Function Through Evolutionary Relationships) methodology, uses a statistical graphical model to compute the probabilities of molecular functions for unannotated proteins. In our benchmark tests, SIFTER provided accurate functional predictions on various protein families and outperformed other available methods.
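To make the idea of computing function probabilities over a phylogeny concrete, the following is a minimal sketch and not the SIFTER implementation. It assumes a symmetric model in which a protein switches between functions at a single rate along each branch, a uniform prior at the root, and a small hand-written tree. All names (`transition_matrix`, `partial_likelihood`, `function_posterior`, the toy proteins `p1` to `p4`, and the two functions) are hypothetical and chosen only for illustration. The posterior for an unannotated protein is obtained by clamping it to each candidate function in turn, computing the likelihood of the annotated proteins by recursion over the tree, and normalizing.

```python
import math


def transition_matrix(n_states, rate, branch_length):
    """Symmetric switching model (an assumption, not SIFTER's model)."""
    decay = math.exp(-rate * branch_length)
    same = 1.0 / n_states + (1.0 - 1.0 / n_states) * decay
    diff = 1.0 / n_states - decay / n_states
    return [[same if i == j else diff for j in range(n_states)]
            for i in range(n_states)]


def partial_likelihood(node, tree, observed, n_states, rate):
    """Return L[s] = P(observed leaves below node | node has state s)."""
    if node not in tree:  # leaf: annotated leaves are indicator vectors
        if node in observed:
            return [1.0 if s == observed[node] else 0.0
                    for s in range(n_states)]
        return [1.0] * n_states  # unannotated leaf carries no evidence
    result = [1.0] * n_states
    for child, length in tree[node]:
        child_like = partial_likelihood(child, tree, observed, n_states, rate)
        matrix = transition_matrix(n_states, rate, length)
        for s in range(n_states):
            result[s] *= sum(matrix[s][t] * child_like[t]
                             for t in range(n_states))
    return result


def function_posterior(target, root, tree, observed, functions, rate=1.0):
    """Posterior over functions for one unannotated leaf."""
    n = len(functions)
    index = {f: i for i, f in enumerate(functions)}
    evidence = {leaf: index[f] for leaf, f in observed.items()}
    joint = []
    for s in range(n):
        clamped = dict(evidence)
        clamped[target] = s  # hypothesize that the target has function s
        root_like = partial_likelihood(root, tree, clamped, n, rate)
        joint.append(sum(root_like) / n)  # uniform prior at the root
    total = sum(joint)
    return {f: joint[index[f]] / total for f in functions}


# Toy family: internal node -> list of (child, branch length).
tree = {
    "root": [("A", 0.5), ("B", 0.5)],
    "A": [("p1", 0.1), ("p2", 0.1)],
    "B": [("p3", 0.1), ("p4", 0.1)],
}
observed = {"p1": "kinase", "p3": "phosphatase", "p4": "phosphatase"}
posterior = function_posterior(
    "p2", "root", tree, observed, ["kinase", "phosphatase"]
)
print(posterior)
```

In this toy family the unannotated protein `p2` sits next to the annotated kinase `p1`, so the sketch favors "kinase" even though most annotated members are phosphatases. This is the kind of case in which a tree-aware prediction can differ from a simple majority vote over homologs.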

Year:  2009        PMID: 20664722      PMCID: PMC2909777          DOI: 10.1088/1742-6596/180/1/012024

Source DB:  PubMed          Journal:  J Phys Conf Ser        ISSN: 1742-6588


  6 in total

1.  Biochemical and mutational studies of the Bacillus cereus CECT 5050T formamidase support the existence of a C-E-E-K tetrad in several members of the nitrilase superfamily.

Authors:  Pablo Soriano-Maldonado; Ana Isabel Martínez-Gómez; Montserrat Andújar-Sánchez; José L Neira; Josefa María Clemente-Jiménez; Francisco Javier Las Heras-Vázquez; Felipe Rodríguez-Vico; Sergio Martínez-Rodríguez
Journal:  Appl Environ Microbiol       Date:  2011-06-24       Impact factor: 4.792

2.  The Amaryllidaceae alkaloids: biosynthesis and methods for enzyme discovery.

Authors:  Matthew B Kilgore; Toni M Kutchan
Journal:  Phytochem Rev       Date:  2015-12-17       Impact factor: 5.374

3.  Exploring the evolution of novel enzyme functions within structurally defined protein superfamilies.

Authors:  Nicholas Furnham; Ian Sillitoe; Gemma L Holliday; Alison L Cuff; Roman A Laskowski; Christine A Orengo; Janet M Thornton
Journal:  PLoS Comput Biol       Date:  2012-03-01       Impact factor: 4.475

Review 4.  Genomic Enzymology: Web Tools for Leveraging Protein Family Sequence-Function Space and Genome Context to Discover Novel Functions.

Authors:  John A Gerlt
Journal:  Biochemistry       Date:  2017-08-22       Impact factor: 3.162

5.  Simple topological properties predict functional misannotations in a metabolic network.

Authors:  Rodrigo Liberal; John W Pinney
Journal:  Bioinformatics       Date:  2013-07-01       Impact factor: 6.937

6.  Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer.

Authors:  Ruth Davidson; Pranjal Vachaspati; Siavash Mirarab; Tandy Warnow
Journal:  BMC Genomics       Date:  2015-10-02       Impact factor: 3.969
