Yuzhen Ye1, Andrei Osterman, Ross Overbeek, Adam Godzik. 1. Program in Bioinformatics and Systems Biology, The Burnham Institute 10901 N. Torrey Pines Road, La Jolla CA 92037, USA. yye@burnham.org
Abstract
MOTIVATION: Proteins work together in pathways and networks, collectively comprising the cellular machinery. A subsystem (a generalization of pathway concept) is a group of related functional roles (such as enzymes) jointly involved in a specific aspect of the cellular machinery. Subsystems provide a natural framework for comparative genome analysis and functional annotation. A subsystem may be implemented in a number of different functional variants in individual species. In order to reliably project functional assignments across multiple genomes, we have to be able to identify the variants implemented in each genome. The analysis of such variants across diverse species is an interesting problem by itself and may provide new evolutionary insights. However, no computational techniques are presently available for an automated detection and analysis of subsystem variants. RESULTS: Here we formulate the subsystem variant detection problem as finding the minimum number of subgraphs of a subsystem, which is represented as a graph, and solve the optimization problem by integer programming approach. The performance of our method was tested on subsystems encoded in the SEED, a genomic integration platform developed by the Fellowship for Interpretation of Genomes as a component of a large-scale effort on comparative analysis and annotation of multiple diverse genomes. Here we illustrate the results obtained for two expert-encoded subsystems of the biosynthesis of Coenzyme A and FMN/FAD cofactors. Applications of variant detection, to support genomic annotations and to assess divergence of species, are briefly discussed in the context of these universally conserved and essential metabolic subsystems. SUPPLEMENTARY INFORMATION: The details of the variant detection results are available at http://ffas.burnham.org/svar/supp.html.
MOTIVATION: Proteins work together in pathways and networks, collectively comprising the cellular machinery. A subsystem (a generalization of pathway concept) is a group of related functional roles (such as enzymes) jointly involved in a specific aspect of the cellular machinery. Subsystems provide a natural framework for comparative genome analysis and functional annotation. A subsystem may be implemented in a number of different functional variants in individual species. In order to reliably project functional assignments across multiple genomes, we have to be able to identify the variants implemented in each genome. The analysis of such variants across diverse species is an interesting problem by itself and may provide new evolutionary insights. However, no computational techniques are presently available for an automated detection and analysis of subsystem variants. RESULTS: Here we formulate the subsystem variant detection problem as finding the minimum number of subgraphs of a subsystem, which is represented as a graph, and solve the optimization problem by integer programming approach. The performance of our method was tested on subsystems encoded in the SEED, a genomic integration platform developed by the Fellowship for Interpretation of Genomes as a component of a large-scale effort on comparative analysis and annotation of multiple diverse genomes. Here we illustrate the results obtained for two expert-encoded subsystems of the biosynthesis of Coenzyme A and FMN/FAD cofactors. Applications of variant detection, to support genomic annotations and to assess divergence of species, are briefly discussed in the context of these universally conserved and essential metabolic subsystems. SUPPLEMENTARY INFORMATION: The details of the variant detection results are available at http://ffas.burnham.org/svar/supp.html.
Authors: Andrew R Joyce; Jennifer L Reed; Aprilfawn White; Robert Edwards; Andrei Osterman; Tomoya Baba; Hirotada Mori; Scott A Lesely; Bernhard Ø Palsson; Sanjay Agarwalla Journal: J Bacteriol Date: 2006-09-29 Impact factor: 3.490
Authors: Svetlana Y Gerdes; Oleg V Kurnasov; Konstantin Shatalin; Boris Polanuyer; Roman Sloutsky; Veronika Vonstein; Ross Overbeek; Andrei L Osterman Journal: J Bacteriol Date: 2006-04 Impact factor: 3.490
Authors: Leonardo Sorci; Dariusz Martynowski; Dmitry A Rodionov; Yvonne Eyobo; Xhavit Zogaj; Karl E Klose; Evgeni V Nikolaev; Giulio Magni; Hong Zhang; Andrei L Osterman Journal: Proc Natl Acad Sci U S A Date: 2009-02-09 Impact factor: 11.205