Literature DB >> 21775308

Discovering novel subsystems using comparative genomics.

Luciana Ferrer1, Alexander G Shearer, Peter D Karp.   

Abstract

MOTIVATION: Key problems for computational genomics include discovering novel pathways in genome data, and discovering functional interaction partners for genes to define new members of partially elucidated pathways.
RESULTS: We propose a novel method for the discovery of subsystems from annotated genomes. For each gene pair, a score measuring the likelihood that the two genes belong to a same subsystem is computed using genome context methods. Genes are then grouped based on these scores, and the resulting groups are filtered to keep only high-confidence groups. Since the method is based on genome context analysis, it relies solely on structural annotation of the genomes. The method can be used to discover new pathways, find missing genes from a known pathway, find new protein complexes or other kinds of functional groups and assign function to genes. We tested the accuracy of our method in Escherichia coli K-12. In one configuration of the system, we find that 31.6% of the candidate groups generated by our method match a known pathway or protein complex closely, and that we rediscover 31.2% of all known pathways and protein complexes of at least 4 genes. We believe that a significant proportion of the candidates that do not match any known group in E.coli K-12 corresponds to novel subsystems that may represent promising leads for future laboratory research. We discuss in-depth examples of these findings. AVAILABILITY: Predicted subsystems are available at http://brg.ai.sri.com/pwy-discovery/journal.html. CONTACT: lferrer@ai.sri.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2011        PMID: 21775308      PMCID: PMC3167049          DOI: 10.1093/bioinformatics/btr428

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  33 in total

1.  mraW, an essential gene at the dcw cluster of Escherichia coli codes for a cytoplasmic protein with methyltransferase activity.

Authors:  M Carrión; M J Gómez; R Merchante-Schubert; S Dongarrá; J A Ayala
Journal:  Biochimie       Date:  1999 Aug-Sep       Impact factor: 4.079

2.  Protein interaction maps for complete genomes based on gene fusion events.

Authors:  A J Enright; I Iliopoulos; N C Kyrpides; C A Ouzounis
Journal:  Nature       Date:  1999-11-04       Impact factor: 49.962

3.  A Bayesian networks approach for predicting protein-protein interactions from genomic data.

Authors:  Ronald Jansen; Haiyuan Yu; Dov Greenbaum; Yuval Kluger; Nevan J Krogan; Sambath Chung; Andrew Emili; Michael Snyder; Jack F Greenblatt; Mark Gerstein
Journal:  Science       Date:  2003-10-17       Impact factor: 47.728

4.  Expression and regulation of a silent operon, hyf, coding for hydrogenase 4 isoenzyme in Escherichia coli.

Authors:  William T Self; Adnan Hasona; K T Shanmugam
Journal:  J Bacteriol       Date:  2004-01       Impact factor: 3.490

Review 5.  Genomic channeling in bacterial cell division.

Authors:  Jesús Mingorance; Javier Tamames; Miguel Vicente
Journal:  J Mol Recognit       Date:  2004 Sep-Oct       Impact factor: 2.137

6.  Characterization of Escherichia coli MoeB and its involvement in the activation of molybdopterin synthase for the biosynthesis of the molybdenum cofactor.

Authors:  S Leimkühler; M M Wuebbens; K V Rajagopalan
Journal:  J Biol Chem       Date:  2001-07-19       Impact factor: 5.157

7.  Thiocarboxylation of molybdopterin synthase provides evidence for the mechanism of dithiolene formation in metal-binding pterins.

Authors:  G Gutzke; B Fischer; R R Mendel; G Schwarz
Journal:  J Biol Chem       Date:  2001-07-17       Impact factor: 5.157

8.  Participation of hyf-encoded hydrogenase 4 in molecular hydrogen release coupled with proton-potassium exchange in Escherichia coli.

Authors:  K Bagramyan; A Vassilian; N Mnatsakanyan; A Trchounian
Journal:  Membr Cell Biol       Date:  2001

9.  Nucleotide sequence and expression of an operon in Escherichia coli coding for formate hydrogenlyase components.

Authors:  R Böhm; M Sauter; A Böck
Journal:  Mol Microbiol       Date:  1990-02       Impact factor: 3.501

10.  Prolinks: a database of protein functional linkages derived from coevolution.

Authors:  Peter M Bowers; Matteo Pellegrini; Mike J Thompson; Joe Fierro; Todd O Yeates; David Eisenberg
Journal:  Genome Biol       Date:  2004-04-16       Impact factor: 13.583

View more
  3 in total

1.  Genome composition and phylogeny of microbes predict their co-occurrence in the environment.

Authors:  Olga K Kamneva
Journal:  PLoS Comput Biol       Date:  2017-02-02       Impact factor: 4.475

2.  Finding sequences for over 270 orphan enzymes.

Authors:  Alexander G Shearer; Tomer Altman; Christine D Rhee
Journal:  PLoS One       Date:  2014-05-14       Impact factor: 3.240

3.  The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases.

Authors:  Ron Caspi; Richard Billington; Luciana Ferrer; Hartmut Foerster; Carol A Fulcher; Ingrid M Keseler; Anamika Kothari; Markus Krummenacker; Mario Latendresse; Lukas A Mueller; Quang Ong; Suzanne Paley; Pallavi Subhraveti; Daniel S Weaver; Peter D Karp
Journal:  Nucleic Acids Res       Date:  2015-11-02       Impact factor: 16.971

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.