Literature DB >> 24812336

fastGapFill: efficient gap filling in metabolic networks.

Ines Thiele1, Nikos Vlassis1, Ronan M T Fleming1.   

Abstract

MOTIVATION: Genome-scale metabolic reconstructions summarize current knowledge about a target organism in a structured manner and as such highlight missing information. Such gaps can be filled algorithmically. Scalability limitations of available algorithms for gap filling hinder their application to compartmentalized reconstructions.
RESULTS: We present fastGapFill, a computationally efficient tractable extension to the COBRA toolbox that permits the identification of candidate missing knowledge from a universal biochemical reaction database (e.g. Kyoto Encyclopedia of Genes and Genomes) for a given (compartmentalized) metabolic reconstruction. The stoichiometric consistency of the universal reaction database and of the metabolic reconstruction can be tested for permitting the computation of biologically more relevant solutions. We demonstrate the efficiency and scalability of fastGapFill on a range of metabolic reconstructions.
AVAILABILITY AND IMPLEMENTATION: fastGapFill is freely available from http://thielelab.eu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2014        PMID: 24812336      PMCID: PMC4147887          DOI: 10.1093/bioinformatics/btu321

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

A biomolecular network reconstruction summarizes biochemical, physiological and genomic knowledge in a mathematically structured electronic format (Palsson, 2006). It can be converted into a computational model, and predictions have been used to accelerate biotechnological and biomedical discoveries (Oberhardt ). The predictive capacity and accuracy of a model depend on the comprehensiveness and biochemical fidelity of the reconstruction, with respect to the underlying biochemistry. The comprehensiveness of a genome-scale metabolic reconstruction can be improved by using the model to detect and fill network gaps (Rolfsson ). Similarly, reconstruction fidelity can be improved by using the model to detect reconstruction stoichiometry inconsistent with biochemistry (Gevorgyan ) or reactions inconsistent with steady state flux (Vlassis ). Existing gap-filling algorithms, reviewed by Orth and Palsson (2010), become intractable in high dimensions. Decompartmentalization of genome-scale compartmentalized metabolic networks reduces their dimension, rendering gap filling tractable (Rolfsson ). However, this approach underestimates the amount of missing information because it connects reactions that would normally not co-occur in the same cellular compartment. We present fastGapFill, the first scalable algorithm capable of efficiently detecting and filling network gaps in compartmentalized genome-scale models. fastGapFill draws on, and extends, fastcore (Vlassis ), an algorithm to approximate the cardinality function to identify a compact flux consist ent model, in which all reactions carry a non-zero flux in at least one flux distribution. fastGapFill allows integrating all three notions of model consistency, namely, gap-filling, flux consistency and stoichiometric consistency in a single tool.

2 METHODS

Formulation of the gap-filling problem. In the metabolic gap-filling problem (Reed ), one starts with a computational metabolic model, M, that contains at least one blocked reaction, which, though desired, does not admit a non-zero steady state flux. From a universal database, e.g. the Kyoto Encyclopedia of Genes and Genomes (KEGG) (Kanehisa and Goto, 2000), a search is made for at least one reaction that needs to be added to fill at least one gap in the model, such that at least one formerly blocked reaction can carry flux. Among other criteria, it may also be desirable to compute a compact flux consistent model, where the number of added universal reactions is minimal. A specific instance of this problem occurs in metabolic modeling, although our algorithm is applicable for any biochemical network model with gaps. Computing a compact flux consistent model. We repurposed the recently developed fastcore algorithm (Vlassis ) to compute a near-minimal set of reactions that need to be added to an input metabolic model M to render it flux consistent. fastcore takes inputs M and a core set of reactions . Then, it greedily expands C by computing a set of modes of M whose overall support contains the whole of C and a minimal set from . This is achieved by a series of L1-norm regularized linear programs that optimize a relaxed version of an (intractable) integer program under cardinality constrains (Vlassis ). Our implementation efficiently identifies blocked reactions. Preprocessing to generate a global model. A cellularly compartmentalized metabolic model (S) without blocked reactions (B), where , is expanded by a universal metabolic database U (e.g. KEGG), such that a copy of U is placed in each cellular compartment of S (including the extracellular space), to generate SU. For each metabolite occurring in a non-cytosolic compartment, a reversible intercompartmental transport reaction is added. For each extracellular metabolite, an exchange reaction is added. The sum of the latter two reaction sets (X) is added to SU to generate a global model, which is extended with solvable blocked reactions (), that is, reactions that were previously flux inconsistent but become flux consistent when added to the global model. In the extended global model (SUX), all reactions are flux consistent. Note that not all blocked reactions B may be solvable, and thus, they will not be present in SUX. All reactions of S and B represent the core set. Computing a compact flux consistent subnetwork of a global model. fastGapFill computes a subnetwork of SUX, consisting of all core reactions, plus a minimal number of reactions from UX, such that all reactions in the resulting compact subnetwork are flux consistent. This is achieved by using a slightly modified version of fastcore, in which a vector of linear weightings prioritizes the addition of reactions within UX. For instance, one may prioritize the addition of metabolic reactions from U over transport reactions from X, or, by varying the weightings on non-core reactions, alternate compact sets of gap-filling reactions may be identified. Optional analysis of gap-filling reactions. We provide the option to compute a flux vector that maximizes the flux through each blocked reaction in turn, while minimizing the Euclidean norm of flux through the subnetwork of SUX computed by one call to fastGapFill. Note that flux through more than one solvable blocked reaction may be necessary to fill a gap, and that the computed flux vector may not be of minimum cardinality. Computing stoichiometric consistency. Many reaction databases contain stoichiometric inconsistencies (Gevorgyan ), where the stoichiometry for at least two reactions is inconsistent with conservation of mass. For instance, the reactions and are stoichiometrically inconsistent, as no positive molecular mass can be assigned to A, B and C, such that the mass on both sides of both reactions is equal. fastGapFill allows to identify stoichiometrically inconsistent reactions from filling gaps, by using the scalable approach for approximate cardinality maximization used within fastcore, to compute a maximal set of metabolites in U that are involved in reactions that conserve mass.

3 IMPLEMENTATION

An open source, MATLAB (Mathworks, Inc.), implementation of fastGapFill is available as a cross-platform desktop computer extension to the openCOBRA toolbox (Schellenberger ).

4 DISCUSSION

We applied fastGapFill to five metabolic models (Table 1), demonstrating its broad applicability and scalability for various sizes of the gap-filling problem. Alternate gap-filling solutions can be computed by changing weightings on non-core reactions in the preprocessed problem. Note that all candidate metabolic and transport reactions are hypotheses requiring experimental validation (Rolfsson ). Our implementation provides an openCOBRA (Schellenberger ) compatible version of the KEGG reaction database; however, any other universal reaction database could be used with fastGapFill, so long as the same input format is maintained and care is taken to correctly identify identical metabolites. fastGapFill is the first scalable approach to identify candidate missing knowledge in compartmentalized metabolic reconstructions, and the approach is applicable to any form of biochemical network gap-filling problem.
Table 1.

Gap filling of metabolic reconstructions on a standard desktop computer (Dell, Intel Core i5, 16 GB RAM, 64 bit)

Model nameThermotoga maritimaEscherichia coliSynechocystis sp.sIECRecon 2
(Zhang et al., 2009)(Feist et al., 2007)(Nogales et al., 2012)(Sahoo and Thiele, 2013)(Thiele et al., 2013)
Sa418 × 5351501 × 2232632 × 731834 × 12603187 × 5837
SUXa14 020 × 31 56621 614 × 49 35528 174 × 62 86648 970 × 109 52258 672 × 132 622
Compb23478
B116196132221603
Bs8415910017490
Number of gap-filling reactions8713817214400
tpreprocessing (s)c5223734410035552
tfastGapFill (s)212384351941826

aThe dimensions are given as metabolites × reactions.

bComp, compartments.

cPreprocessing includes computing the flux consistent metabolic model, merging of UX for all compartments of S and adding solvable blocked reactions Bs.

Note: Equal weighting of all reactions was used. See Supplementary Table S1 for candidate gap-filling solutions.

Gap filling of metabolic reconstructions on a standard desktop computer (Dell, Intel Core i5, 16 GB RAM, 64 bit) aThe dimensions are given as metabolites × reactions. bComp, compartments. cPreprocessing includes computing the flux consistent metabolic model, merging of UX for all compartments of S and adding solvable blocked reactions Bs. Note: Equal weighting of all reactions was used. See Supplementary Table S1 for candidate gap-filling solutions. Funding: I.T. was supported by an ATTRACT program grant (FNR/A12/01) from the Luxembourg National Research Fund (FNR). R.F. was supported by the Interagency Modeling and Analysis Group, Multi-scale Modeling Consortium U01 awards from the National Institute of General Medical Sciences, award GM102098-01, and U.S. Department of Energy, Office of Science, Biological and Environmental Research Program, award ER65524. Conflict of Interest: none declared.
  13 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Detailing the optimality of photosynthesis in cyanobacteria through systems biology analysis.

Authors:  Juan Nogales; Steinn Gudmundsson; Eric M Knight; Bernhard O Palsson; Ines Thiele
Journal:  Proc Natl Acad Sci U S A       Date:  2012-01-30       Impact factor: 11.205

Review 3.  Systematizing the generation of missing metabolic knowledge.

Authors:  Jeffrey D Orth; Bernhard Ø Palsson
Journal:  Biotechnol Bioeng       Date:  2010-10-15       Impact factor: 4.530

4.  Systems approach to refining genome annotation.

Authors:  Jennifer L Reed; Trina R Patel; Keri H Chen; Andrew R Joyce; Margaret K Applebee; Christopher D Herring; Olivia T Bui; Eric M Knight; Stephen S Fong; Bernhard O Palsson
Journal:  Proc Natl Acad Sci U S A       Date:  2006-11-06       Impact factor: 11.205

5.  Detection of stoichiometric inconsistencies in biomolecular models.

Authors:  Albert Gevorgyan; Mark G Poolman; David A Fell
Journal:  Bioinformatics       Date:  2008-08-12       Impact factor: 6.937

6.  Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0.

Authors:  Jan Schellenberger; Richard Que; Ronan M T Fleming; Ines Thiele; Jeffrey D Orth; Adam M Feist; Daniel C Zielinski; Aarash Bordbar; Nathan E Lewis; Sorena Rahmanian; Joseph Kang; Daniel R Hyduke; Bernhard Ø Palsson
Journal:  Nat Protoc       Date:  2011-08-04       Impact factor: 13.491

7.  Three-dimensional structural view of the central metabolic network of Thermotoga maritima.

Authors:  Ying Zhang; Ines Thiele; Dana Weekes; Zhanwen Li; Lukasz Jaroszewski; Krzysztof Ginalski; Ashley M Deacon; John Wooley; Scott A Lesley; Ian A Wilson; Bernhard Palsson; Andrei Osterman; Adam Godzik
Journal:  Science       Date:  2009-09-18       Impact factor: 47.728

8.  The human metabolic reconstruction Recon 1 directs hypotheses of novel human metabolic functions.

Authors:  Ottar Rolfsson; Bernhard Ø Palsson; Ines Thiele
Journal:  BMC Syst Biol       Date:  2011-10-01

9.  Fast reconstruction of compact context-specific metabolic network models.

Authors:  Nikos Vlassis; Maria Pires Pacheco; Thomas Sauter
Journal:  PLoS Comput Biol       Date:  2014-01-16       Impact factor: 4.475

10.  A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information.

Authors:  Adam M Feist; Christopher S Henry; Jennifer L Reed; Markus Krummenacker; Andrew R Joyce; Peter D Karp; Linda J Broadbelt; Vassily Hatzimanikatis; Bernhard Ø Palsson
Journal:  Mol Syst Biol       Date:  2007-06-26       Impact factor: 11.429

View more
  42 in total

1.  Quantitative flux coupling analysis.

Authors:  Mojtaba Tefagh; Stephen P Boyd
Journal:  J Math Biol       Date:  2018-12-10       Impact factor: 2.259

2.  Reply to "Challenges in modeling the human gut microbiome".

Authors:  Stefanía Magnúsdóttir; Almut Heinken; Ronan M T Fleming; Ines Thiele
Journal:  Nat Biotechnol       Date:  2018-08-06       Impact factor: 54.908

3.  Metage2Metabo, microbiota-scale metabolic complementarity for the identification of key species.

Authors:  Arnaud Belcour; Clémence Frioux; Méziane Aite; Anthony Bretaudeau; Falk Hildebrand; Anne Siegel
Journal:  Elife       Date:  2020-12-29       Impact factor: 8.140

4.  A Consensus Genome-scale Reconstruction of Chinese Hamster Ovary Cell Metabolism.

Authors:  Hooman Hefzi; Kok Siong Ang; Michael Hanscho; Aarash Bordbar; David Ruckerbauer; Meiyappan Lakshmanan; Camila A Orellana; Deniz Baycin-Hizal; Yingxiang Huang; Daniel Ley; Veronica S Martinez; Sarantos Kyriakopoulos; Natalia E Jiménez; Daniel C Zielinski; Lake-Ee Quek; Tune Wulff; Johnny Arnsdorf; Shangzhong Li; Jae Seong Lee; Giuseppe Paglia; Nicolas Loira; Philipp N Spahn; Lasse E Pedersen; Jahir M Gutierrez; Zachary A King; Anne Mathilde Lund; Harish Nagarajan; Alex Thomas; Alyaa M Abdel-Haleem; Juergen Zanghellini; Helene F Kildegaard; Bjørn G Voldborg; Ziomara P Gerdtzen; Michael J Betenbaugh; Bernhard O Palsson; Mikael R Andersen; Lars K Nielsen; Nicole Borth; Dong-Yup Lee; Nathan E Lewis
Journal:  Cell Syst       Date:  2016-11-23       Impact factor: 10.304

5.  Mimoza: web-based semantic zooming and navigation in metabolic networks.

Authors:  Anna Zhukova; David J Sherman
Journal:  BMC Syst Biol       Date:  2015-02-26

6.  Systematically gap-filling the genome-scale metabolic model of CHO cells.

Authors:  Hamideh Fouladiha; Sayed-Amir Marashi; Shangzhong Li; Zerong Li; Helen O Masson; Behrouz Vaziri; Nathan E Lewis
Journal:  Biotechnol Lett       Date:  2020-10-10       Impact factor: 2.461

7.  Comparative Analysis of Yeast Metabolic Network Models Highlights Progress, Opportunities for Metabolic Reconstruction.

Authors:  Benjamin D Heavner; Nathan D Price
Journal:  PLoS Comput Biol       Date:  2015-11-13       Impact factor: 4.475

8.  Improved Metabolic Models for E. coli and Mycoplasma genitalium from GlobalFit, an Algorithm That Simultaneously Matches Growth and Non-Growth Data Sets.

Authors:  Daniel Hartleb; Florian Jarre; Martin J Lercher
Journal:  PLoS Comput Biol       Date:  2016-08-02       Impact factor: 4.475

Review 9.  Mechanistic models of microbial community metabolism.

Authors:  Lillian R Dillard; Dawson D Payne; Jason A Papin
Journal:  Mol Omics       Date:  2021-06-14

10.  Phenotypic differentiation of gastrointestinal microbes is reflected in their encoded metabolic repertoires.

Authors:  Eugen Bauer; Cedric Christian Laczny; Stefania Magnusdottir; Paul Wilmes; Ines Thiele
Journal:  Microbiome       Date:  2015-11-30       Impact factor: 14.650

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.