Literature DB >> 17535793

Practical and theoretical advances in predicting the function of a protein by its phylogenetic distribution.

Philip R Kensche1, Vera van Noort, Bas E Dutilh, Martijn A Huynen.   

Abstract

The gap between the amount of genome information released by genome sequencing projects and our knowledge about the proteins' functions is rapidly increasing. To fill this gap, various 'genomic-context' methods have been proposed that exploit sequenced genomes to predict the functions of the encoded proteins. One class of methods, phylogenetic profiling, predicts protein function by correlating the phylogenetic distribution of genes with that of other genes or phenotypic characteristics. The functions of a number of proteins, including ones of medical relevance, have thus been predicted and subsequently confirmed experimentally. Additionally, various approaches to measure the similarity of phylogenetic profiles and to account for the phylogenetic bias in the data have been proposed. We review the successful applications of phylogenetic profiling and analyse the performance of various profile similarity measures with a set of one microsporidial and 25 fungal genomes. In the fungi, phylogenetic profiling yields high-confidence predictions for the highest and only the highest scoring gene pairs illustrating both the power and the limitations of the approach. Both practical examples and theoretical considerations suggest that in order to get a reliable and specific picture of a protein's function, results from phylogenetic profiling have to be combined with other sources of evidence.

Mesh:

Substances:

Year:  2008        PMID: 17535793      PMCID: PMC2405902          DOI: 10.1098/rsif.2007.1047

Source DB:  PubMed          Journal:  J R Soc Interface        ISSN: 1742-5662            Impact factor:   4.118


  154 in total

1.  Protein interaction maps for complete genomes based on gene fusion events.

Authors:  A J Enright; I Iliopoulos; N C Kyrpides; C A Ouzounis
Journal:  Nature       Date:  1999-11-04       Impact factor: 49.962

2.  SmpB, a unique RNA-binding protein essential for the peptide-tagging activity of SsrA (tmRNA).

Authors:  A W Karzai; M M Susskind; R T Sauer
Journal:  EMBO J       Date:  1999-07-01       Impact factor: 11.598

3.  Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis.

Authors:  J Castresana
Journal:  Mol Biol Evol       Date:  2000-04       Impact factor: 16.240

4.  Detecting protein function and protein-protein interactions from genome sequences.

Authors:  E M Marcotte; M Pellegrini; H L Ng; D W Rice; T O Yeates; D Eisenberg
Journal:  Science       Date:  1999-07-30       Impact factor: 47.728

5.  The use of gene clusters to infer functional coupling.

Authors:  R Overbeek; M Fonstein; M D'Souza; G D Pusch; N Maltsev
Journal:  Proc Natl Acad Sci U S A       Date:  1999-03-16       Impact factor: 11.205

Review 6.  Moonlighting proteins.

Authors:  C J Jeffery
Journal:  Trends Biochem Sci       Date:  1999-01       Impact factor: 13.807

7.  Microbial genescapes: phyletic and functional patterns of ORF distribution among prokaryotes.

Authors:  T Gaasterland; M A Ragan
Journal:  Microb Comp Genomics       Date:  1998

8.  Biosynthesis of terpenoids: YchB protein of Escherichia coli phosphorylates the 2-hydroxy group of 4-diphosphocytidyl-2C-methyl-D-erythritol.

Authors:  H Lüttgen; F Rohdich; S Herz; J Wungsintaweekul; S Hecht; C A Schuhr; M Fellermeier; S Sagner; M H Zenk; A Bacher; W Eisenreich
Journal:  Proc Natl Acad Sci U S A       Date:  2000-02-01       Impact factor: 11.205

9.  Assigning protein functions by comparative genome analysis: protein phylogenetic profiles.

Authors:  M Pellegrini; E M Marcotte; M J Thompson; D Eisenberg; T O Yeates
Journal:  Proc Natl Acad Sci U S A       Date:  1999-04-13       Impact factor: 11.205

10.  The COG database: a tool for genome-scale analysis of protein functions and evolution.

Authors:  R L Tatusov; M Y Galperin; D A Natale; E V Koonin
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

View more
  38 in total

Review 1.  The Code of Silence: Widespread Associations Between Synonymous Codon Biases and Gene Function.

Authors:  Fran Supek
Journal:  J Mol Evol       Date:  2015-11-04       Impact factor: 2.395

2.  Systematic Discovery of Human Gene Function and Principles of Modular Organization through Phylogenetic Profiling.

Authors:  Gautam Dey; Ariel Jaimovich; Sean R Collins; Akiko Seki; Tobias Meyer
Journal:  Cell Rep       Date:  2015-02-12       Impact factor: 9.423

3.  Aquerium: A web application for comparative exploration of domain-based protein occurrences on the taxonomically clustered genome tree.

Authors:  Ogun Adebali; Igor B Zhulin
Journal:  Proteins       Date:  2016-11-13

Review 4.  Using comparative genomics to drive new discoveries in microbiology.

Authors:  Daniel H Haft
Journal:  Curr Opin Microbiol       Date:  2015-01-21       Impact factor: 7.934

5.  Predicting phenotypic traits of prokaryotes from protein domain frequencies.

Authors:  Thomas Lingner; Stefanie Mühlhausen; Toni Gabaldón; Cedric Notredame; Peter Meinicke
Journal:  BMC Bioinformatics       Date:  2010-09-24       Impact factor: 3.169

6.  Expansion of biological pathways based on evolutionary inference.

Authors:  Yang Li; Sarah E Calvo; Roee Gutman; Jun S Liu; Vamsi K Mootha
Journal:  Cell       Date:  2014-07-03       Impact factor: 41.582

Review 7.  Prediction and redesign of protein-protein interactions.

Authors:  Rhonald C Lua; David C Marciano; Panagiotis Katsonis; Anbu K Adikesavan; Angela D Wilkins; Olivier Lichtarge
Journal:  Prog Biophys Mol Biol       Date:  2014-05-27       Impact factor: 3.667

8.  Comparison of eukaryotic phylogenetic profiling approaches using species tree aware methods.

Authors:  Valentín Ruano-Rubio; Olivier Poch; Julie D Thompson
Journal:  BMC Bioinformatics       Date:  2009-11-24       Impact factor: 3.169

9.  TMEM107 recruits ciliopathy proteins to subdomains of the ciliary transition zone and causes Joubert syndrome.

Authors:  Nils J Lambacher; Ange-Line Bruel; Teunis J P van Dam; Katarzyna Szymańska; Gisela G Slaats; Stefanie Kuhns; Gavin J McManus; Julie E Kennedy; Karl Gaff; Ka Man Wu; Robin van der Lee; Lydie Burglen; Diane Doummar; Jean-Baptiste Rivière; Laurence Faivre; Tania Attié-Bitach; Sophie Saunier; Alistair Curd; Michelle Peckham; Rachel H Giles; Colin A Johnson; Martijn A Huynen; Christel Thauvin-Robinet; Oliver E Blacque
Journal:  Nat Cell Biol       Date:  2015-11-23       Impact factor: 28.824

10.  Effect of reference genome selection on the performance of computational methods for genome-wide protein-protein interaction prediction.

Authors:  Vijaykumar Yogesh Muley; Akash Ranjan
Journal:  PLoS One       Date:  2012-07-26       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.