Literature DB >> 10200254

Assigning protein functions by comparative genome analysis: protein phylogenetic profiles.

M Pellegrini1, E M Marcotte, M J Thompson, D Eisenberg, T O Yeates.   

Abstract

Determining protein functions from genomic sequences is a central goal of bioinformatics. We present a method based on the assumption that proteins that function together in a pathway or structural complex are likely to evolve in a correlated fashion. During evolution, all such functionally linked proteins tend to be either preserved or eliminated in a new species. We describe this property of correlated evolution by characterizing each protein by its phylogenetic profile, a string that encodes the presence or absence of a protein in every known genome. We show that proteins having matching or similar profiles strongly tend to be functionally linked. This method of phylogenetic profiling allows us to predict the function of uncharacterized proteins.

Mesh:

Substances:

Year:  1999        PMID: 10200254      PMCID: PMC16324          DOI: 10.1073/pnas.96.8.4285

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  9 in total

Review 1.  Predicting function: from genes to genomes and back.

Authors:  P Bork; T Dandekar; Y Diaz-Lazcoz; F Eisenhaber; M Huynen; Y Yuan
Journal:  J Mol Biol       Date:  1998-11-06       Impact factor: 5.469

2.  Constructing multigenome views of whole microbial genomes.

Authors:  T Gaasterland; M A Ragan
Journal:  Microb Comp Genomics       Date:  1998

3.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1998.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  1998-01-01       Impact factor: 16.971

Review 4.  Bioinformatics: from genome data to biological knowledge.

Authors:  M A Andrade; C Sander
Journal:  Curr Opin Biotechnol       Date:  1997-12       Impact factor: 9.740

Review 5.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

6.  The complete genome sequence of Escherichia coli K-12.

Authors:  F R Blattner; G Plunkett; C A Bloch; N T Perna; V Burland; M Riley; J Collado-Vides; J D Glasner; C K Rode; G F Mayhew; J Gregor; N W Davis; H A Kirkpatrick; M A Goeden; D J Rose; B Mau; Y Shao
Journal:  Science       Date:  1997-09-05       Impact factor: 47.728

7.  EcoCyc: Encyclopedia of Escherichia coli genes and metabolism.

Authors:  P D Karp; M Riley; S M Paley; A Pellegrini-Toole; M Krummenacker
Journal:  Nucleic Acids Res       Date:  1998-01-01       Impact factor: 16.971

8.  Genes and proteins of Escherichia coli K-12.

Authors:  M Riley
Journal:  Nucleic Acids Res       Date:  1998-01-01       Impact factor: 16.971

9.  Metabolism and evolution of Haemophilus influenzae deduced from a whole-genome comparison with Escherichia coli.

Authors:  R L Tatusov; A R Mushegian; P Bork; N P Brown; W S Hayes; M Borodovsky; K E Rudd; E V Koonin
Journal:  Curr Biol       Date:  1996-03-01       Impact factor: 10.834

  9 in total
  633 in total

1.  Genomewide function conservation and phylogeny in the Herpesviridae.

Authors:  M M Albà; R Das; C A Orengo; P Kellam
Journal:  Genome Res       Date:  2001-01       Impact factor: 9.043

2.  Discovering regulatory elements in non-coding sequences by analysis of spaced dyads.

Authors:  J van Helden; A F Rios; J Collado-Vides
Journal:  Nucleic Acids Res       Date:  2000-04-15       Impact factor: 16.971

3.  Gene content phylogeny of herpesviruses.

Authors:  M G Montague; C A Hutchison
Journal:  Proc Natl Acad Sci U S A       Date:  2000-05-09       Impact factor: 11.205

4.  Predicting regulons and their cis-regulatory motifs by comparative genomics.

Authors:  A Manson McGuire; G M Church
Journal:  Nucleic Acids Res       Date:  2000-11-15       Impact factor: 16.971

5.  Motif-based fold assignment.

Authors:  L Salwinski; D Eisenberg
Journal:  Protein Sci       Date:  2001-12       Impact factor: 6.725

6.  Genes linked by fusion events are generally of the same functional category: a systematic analysis of 30 microbial genomes.

Authors:  I Yanai; A Derti; C DeLisi
Journal:  Proc Natl Acad Sci U S A       Date:  2001-07-03       Impact factor: 11.205

7.  Identification of potential interaction networks using sequence-based searches for conserved protein-protein interactions or "interologs".

Authors:  L R Matthews; P Vaglio; J Reboul; H Ge; B P Davis; J Garrels; S Vincent; M Vidal
Journal:  Genome Res       Date:  2001-12       Impact factor: 9.043

8.  Predictome: a database of putative functional links between proteins.

Authors:  Joseph C Mellor; Itai Yanai; Karl H Clodfelter; Julian Mintseris; Charles DeLisi
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

9.  GTOP: a database of protein structures predicted from genome sequences.

Authors:  Takeshi Kawabata; Satoshi Fukuchi; Keiichi Homma; Motonori Ota; Jiro Araki; Takehiko Ito; Nobuyuki Ichiyoshi; Ken Nishikawa
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

10.  The identification of functional modules from the genomic association of genes.

Authors:  Berend Snel; Peer Bork; Martijn A Huynen
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-30       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.