Literature DB >> 10958638

Predicting protein function by genomic context: quantitative evaluation and qualitative inferences.

M Huynen1, B Snel, W Lathe, P Bork.   

Abstract

Various new methods have been proposed to predict functional interactions between proteins based on the genomic context of their genes. The types of genomic context that they use are Type I: the fusion of genes; Type II: the conservation of gene-order or co-occurrence of genes in potential operons; and Type III: the co-occurrence of genes across genomes (phylogenetic profiles). Here we compare these types for their coverage, their correlations with various types of functional interaction, and their overlap with homology-based function assignment. We apply the methods to Mycoplasma genitalium, the standard benchmarking genome in computational and experimental genomics. Quantitatively, conservation of gene order is the technique with the highest coverage, applying to 37% of the genes. By combining gene order conservation with gene fusion (6%), the co-occurrence of genes in operons in absence of gene order conservation (8%), and the co-occurrence of genes across genomes (11%), significant context information can be obtained for 50% of the genes (the categories overlap). Qualitatively, we observe that the functional interactions between genes are stronger as the requirements for physical neighborhood on the genome are more stringent, while the fraction of potential false positives decreases. Moreover, only in cases in which gene order is conserved in a substantial fraction of the genomes, in this case six out of twenty-five, does a single type of functional interaction (physical interaction) clearly dominate (>80%). In other cases, complementary function information from homology searches, which is available for most of the genes with significant genomic context, is essential to predict the type of interaction. Using a combination of genomic context and homology searches, new functional features can be predicted for 10% of M. genitalium genes.

Entities:  

Mesh:

Substances:

Year:  2000        PMID: 10958638      PMCID: PMC310926          DOI: 10.1101/gr.10.8.1204

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  32 in total

Review 1.  Gene and context: integrative approaches to genome analysis.

Authors:  M A Huynen; B Snel
Journal:  Adv Protein Chem       Date:  2000

2.  Activities of enzymes of purine and pyrimidine metabolism in nine Mycoplasma species.

Authors:  M Hamet; C Bonissol; P Cartier
Journal:  Adv Exp Med Biol       Date:  1979       Impact factor: 2.622

3.  An evolutionary treasure: unification of a broad set of amidohydrolases related to urease.

Authors:  L Holm; C Sander
Journal:  Proteins       Date:  1997-05

4.  Covariation of mutations in the V3 loop of human immunodeficiency virus type 1 envelope protein: an information theoretic analysis.

Authors:  B T Korber; R M Farber; D H Wolpert; A S Lapedes
Journal:  Proc Natl Acad Sci U S A       Date:  1993-08-01       Impact factor: 11.205

5.  Gene order is not conserved in bacterial evolution.

Authors:  A R Mushegian; E V Koonin
Journal:  Trends Genet       Date:  1996-08       Impact factor: 11.639

6.  MSS1, a nuclear-encoded mitochondrial GTPase involved in the expression of COX1 subunit of cytochrome c oxidase.

Authors:  E Decoster; A Vassal; G Faye
Journal:  J Mol Biol       Date:  1993-07-05       Impact factor: 5.469

7.  Sequence similarity analysis of Escherichia coli proteins: functional and evolutionary implications.

Authors:  E V Koonin; R L Tatusov; K E Rudd
Journal:  Proc Natl Acad Sci U S A       Date:  1995-12-05       Impact factor: 11.205

8.  Enzymes of pyrimidine deoxyribonucleotide metabolism in Mycoplasma mycoides subsp. mycoides.

Authors:  G A Neale; A Mitchell; L R Finch
Journal:  J Bacteriol       Date:  1983-12       Impact factor: 3.490

9.  Molecular cloning and sequence of the thdF gene, which is involved in thiophene and furan oxidation by Escherichia coli.

Authors:  K Y Alam; D P Clark
Journal:  J Bacteriol       Date:  1991-10       Impact factor: 3.490

10.  Three-dimensional structure and stability of the KH domain: molecular insights into the fragile X syndrome.

Authors:  G Musco; G Stier; C Joseph; M A Castiglione Morelli; M Nilges; T J Gibson; A Pastore
Journal:  Cell       Date:  1996-04-19       Impact factor: 41.582

View more
  182 in total

1.  A DNA repair system specific for thermophilic Archaea and bacteria predicted by genomic context analysis.

Authors:  Kira S Makarova; L Aravind; Nick V Grishin; Igor B Rogozin; Eugene V Koonin
Journal:  Nucleic Acids Res       Date:  2002-01-15       Impact factor: 16.971

2.  Genes linked by fusion events are generally of the same functional category: a systematic analysis of 30 microbial genomes.

Authors:  I Yanai; A Derti; C DeLisi
Journal:  Proc Natl Acad Sci U S A       Date:  2001-07-03       Impact factor: 11.205

3.  Predictome: a database of putative functional links between proteins.

Authors:  Joseph C Mellor; Itai Yanai; Karl H Clodfelter; Julian Mintseris; Charles DeLisi
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

4.  The identification of functional modules from the genomic association of genes.

Authors:  Berend Snel; Peer Bork; Martijn A Huynen
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-30       Impact factor: 11.205

5.  Connected gene neighborhoods in prokaryotic genomes.

Authors:  Igor B Rogozin; Kira S Makarova; Janos Murvai; Eva Czabarka; Yuri I Wolf; Roman L Tatusov; Laszlo A Szekely; Eugene V Koonin
Journal:  Nucleic Acids Res       Date:  2002-05-15       Impact factor: 16.971

6.  Structural and nucleotide-binding properties of YajQ and YnaF, two Escherichia coli proteins of unknown function.

Authors:  Cosmin Saveanu; Simona Miron; Tudor Borza; Constantin T Craescu; Gilles Labesse; Cristina Gagyi; Aurel Popescu; Francis Schaeffer; Abdelkader Namane; Christine Laurent-Winter; Octavian Bârzu; Anne-Marie Gilles
Journal:  Protein Sci       Date:  2002-11       Impact factor: 6.725

7.  Predicting genetic modifier loci using functional gene networks.

Authors:  Insuk Lee; Ben Lehner; Tanya Vavouri; Junha Shin; Andrew G Fraser; Edward M Marcotte
Journal:  Genome Res       Date:  2010-06-09       Impact factor: 9.043

8.  A cross-genomic approach for systematic mapping of phenotypic traits to genes.

Authors:  Kam Jim; Kush Parmar; Mona Singh; Saeed Tavazoie
Journal:  Genome Res       Date:  2004-01       Impact factor: 9.043

9.  STRING: a database of predicted functional associations between proteins.

Authors:  Christian von Mering; Martijn Huynen; Daniel Jaeggi; Steffen Schmidt; Peer Bork; Berend Snel
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

10.  SVM-Prot: Web-based support vector machine software for functional classification of a protein from its primary sequence.

Authors:  C Z Cai; L Y Han; Z L Ji; X Chen; Y Z Chen
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.