
CobraMod: A pathway-centric curation tool for constraint-based metabolic models.

Stefano Camborda, Jan-Niklas Weder, Nadine Töpfer.

Abstract

SUMMARY: COnstraint-Based Reconstruction and Analysis of genome-scale metabolic models has become a widely used tool to understand metabolic network behavior at a large scale. However, existing reconstruction tools lack functionalities to address modellers' common objective to study metabolic networks on the pathway level. Thus, we developed CobraMod, a Python package for pathway-centric modification and extension of genome-scale metabolic networks. CobraMod can integrate data from various metabolic pathway databases as well as user-curated information. Our tool tests newly added metabolites, reactions, and pathways against multiple curation criteria, suggests manual curation steps, and provides the user with records of changes to ensure high-quality metabolic reconstructions. CobraMod uses the visualization tool Escher for pathway representation and offers simple customization options for comparison of pathways and flux distributions. Our package enables coherent and reproducible workflows as it can be seamlessly integrated with COBRApy and Escher.

AVAILABILITY: The source code can be found at https://github.com/Toepfer-Lab/cobramod/ and the package can be installed with pip. The documentation, including tutorials, is available at https://cobramod.readthedocs.io/.
© The Author(s) 2022. Published by Oxford University Press.

Year:  2022        PMID: 35201291      PMCID: PMC9048663          DOI: 10.1093/bioinformatics/btac119

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


1 Introduction

Genome-scale metabolic models (GEMs) and their analysis by constraint-based modeling techniques are widely used to study metabolic systems at a large scale. Several software tools for Constraint-Based Reconstruction and Analysis (COBRA) are available, such as the COBRA Toolbox, ModelSEED, Pathway Tools, RAVEN, CarveMe and merlin (Dias et al.; Heirendt et al.; Karp, 2002; Machado et al.; Seaver et al.; Wang et al.), and have been evaluated elsewhere (Faria et al.; Mendoza et al.). In recent years, the freely available and community-supported software package COBRApy has gained particular popularity (Ebrahim et al.). COBRApy performs commonly used COBRA methods such as flux balance analysis, flux variability analysis and gene deletion analysis, and includes simple, object-oriented interfaces for model reconstruction. Several software packages complement COBRApy by implementing extended functionalities. For instance, Cameo and MEWpy offer functionalities for computational strain optimization, MEMOTE includes a suite of tests to assess GEM quality, Medusa facilitates generating and analyzing ensembles of GEMs, and the Escher visualization tool offers a user-friendly interface for designing and manipulating pathway maps (Lieven, 2020; Cardoso et al.; King et al.; Medlock et al.; Pereira et al.). However, currently available reconstruction tools rely either on error-prone, automated reconstruction procedures or on the laborious, manual addition of individual reactions or reaction sets. They therefore do not support extending and curating GEMs in terms of their biologically meaningful subsets, i.e. the metabolic pathways. Here, we present CobraMod, a pathway-centric curation tool for the modification and extension of GEMs. CobraMod offers a comprehensible set of functions for semi-automated network extension, testing and visualization, and enables easy, user-friendly manual curation and information logging to ensure high-quality network reconstructions.
CobraMod is written in Python 3; it builds upon and extends COBRApy and can directly interact with Escher for pathway and flux visualization.

2 Implementation

CobraMod is an open-source package that enables modifying and extending GEMs with metabolic pathway information from various databases or user-curated datasets. Our package converts these data into native COBRApy objects and quality-checks them against multiple curation criteria before incorporating them into the model (Fig. 1A). CobraMod’s main functions include downloading metabolic pathway information (get_data), creating COBRApy objects (create_object) and including new metabolites (add_metabolites), reactions (add_reactions) or pathways (add_pathway), as well as testing a reaction’s capability to carry a non-zero flux (test_non_zero_flux) and pathway visualization (visualize).
Fig. 1.

CobraMod’s main functionalities and pathway visualization example. (A) CobraMod’s pathway-centric functionalities bridge COBRApy methods and the visualization tool Escher. (B) Visualization of a metabolic engineering case study of the shikimate pathway in E. coli. Flux solutions for two strains of E. coli (control and engineered) are visualized. For simplicity, only three reactions of the whole pathway are shown. Reaction names and pathway fluxes are given in blue. For comparability, flux values were normalized; darker colors indicate higher flux values.

2.1 Data retrieval

CobraMod supports all databases from the BioCyc collection (Karp et al.), the KEGG database (Kanehisa and Goto, 2000) and the BiGG Models repository (King et al.). The user can retrieve metabolic pathway information by specifying a database and the corresponding identifiers for metabolites, reactions or pathways. CobraMod automatically gathers gene information when obtaining information for reactions or pathways. CobraMod then downloads these datasets, stores them locally to ensure reproducibility (get_data), and transforms them into COBRApy objects (create_object). In addition, CobraMod can integrate user-curated metabolites and reactions via text file or direct script input (add_metabolites, add_reactions).
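
The idea of storing retrieved data locally so that later runs reuse exactly the same records can be pictured with a short sketch. This is not CobraMod's implementation: the helper name `fetch_cached`, the directory layout and the `fetch` callable (a stand-in for the actual database request) are all hypothetical and serve only to illustrate the download-once, read-many pattern.

```python
import json
from pathlib import Path
from typing import Callable


def fetch_cached(
    directory: Path,
    database: str,
    identifier: str,
    fetch: Callable[[str, str], dict],
) -> dict:
    """Return the record for `identifier`, downloading it at most once.

    Hypothetical helper: `fetch` stands in for whatever performs the
    real database request and must return a JSON-serializable dict.
    """
    path = Path(directory) / database / f"{identifier}.json"
    if path.exists():
        # Reuse the stored copy so that later runs see identical data,
        # even if the remote database has changed in the meantime.
        return json.loads(path.read_text(encoding="utf-8"))
    record = fetch(database, identifier)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(
        json.dumps(record, indent=2, sort_keys=True), encoding="utf-8"
    )
    return record
```

Keeping one file per database and identifier makes the local data directory easy to inspect and to share alongside the model, which is what makes a reconstruction workflow repeatable.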

2.2 Curation steps

CobraMod enables modifying and analyzing GEMs on the metabolic pathway level. To this end, it combines sets of reactions into pathway-objects, which the user can directly add to the model (add_pathway). Reactions and metabolites of a given pathway-object undergo a curation process in which they are tested for duplicate elements, missing chemical formulas of the metabolites, mass balance of reactions and reaction reversibility (detailed in the documentation). To ensure that the added pathways are functional, we implemented a non-zero flux test (test_non_zero_flux). During the test, CobraMod can add auxiliary source reactions and suggests manual curation steps based on these auxiliary modifications. Moreover, CobraMod offers cross-referencing and meta-data curation and is MEMOTE-compliant. Our tool offers comprehensible and user-friendly tracking of the curation process. When a pathway-object is added to the model, a summary is printed and the complete curation procedure is written to a log file. If any of the curation criteria is not met or an exception is encountered, CobraMod issues a warning through the Python console and the log file.
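
Two of the criteria listed above, missing chemical formulas and mass balance, can be illustrated with a minimal, library-free sketch. It is not CobraMod's code: the function names `parse_formula` and `mass_imbalance` are hypothetical, and the formula parser assumes flat formulas without brackets or charges.

```python
import re
from collections import Counter

# One element symbol (e.g. "C", "Mg") followed by an optional count.
_ELEMENT = re.compile(r"([A-Z][a-z]?)(\d*)")


def parse_formula(formula: str) -> Counter:
    """Count atoms in a flat formula such as 'C6H12O6' (no brackets)."""
    counts = Counter()
    for element, number in _ELEMENT.findall(formula):
        counts[element] += int(number) if number else 1
    return counts


def mass_imbalance(stoichiometry: dict, formulas: dict) -> dict:
    """Return the net atom count per element; an empty dict means balanced.

    `stoichiometry` maps metabolite id -> coefficient (negative for
    substrates, positive for products). A missing or empty formula raises
    ValueError, because the balance cannot be decided without it.
    """
    total = Counter()
    for metabolite, coefficient in stoichiometry.items():
        formula = formulas.get(metabolite)
        if not formula:
            raise ValueError(f"missing formula for {metabolite}")
        for element, count in parse_formula(formula).items():
            total[element] += coefficient * count
    return {element: net for element, net in total.items() if net != 0}
```

For a reaction whose substrates and products carry the same atoms the function returns `{}`; any non-empty result names the elements that would need manual curation.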

2.3 Visualization

CobraMod uses Escher for pathway visualization. To this end, each pathway-object includes a visualization method (visualize) which automatically generates pathway maps of the respective set of reactions. These pathway maps can be easily customized to visualize flux distributions using default or user-defined colors and gradients (linear or quantile normalized).
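
The difference between a linear and a quantile-based gradient can be sketched in a few lines. The functions below are an illustration under stated assumptions, not the normalization code used by CobraMod or Escher: `linear_normalize` and `quantile_normalize` are hypothetical names, and rank-based scaling is only one common reading of "quantile normalized".

```python
def linear_normalize(values):
    """Scale values to [0, 1] by their position between min and max."""
    low, high = min(values), max(values)
    if high == low:
        return [0.0 for _ in values]
    return [(v - low) / (high - low) for v in values]


def quantile_normalize(values):
    """Scale values to [0, 1] by rank, so a single outlier cannot dominate.

    Tied values share the mean of their ranks.
    """
    ordered = sorted(values)
    n = len(ordered)
    if n < 2:
        return [0.0 for _ in values]
    # Collect the rank positions occupied by each distinct value.
    positions = {}
    for rank, v in enumerate(ordered):
        positions.setdefault(v, []).append(rank)
    return [sum(positions[v]) / len(positions[v]) / (n - 1) for v in values]
```

With fluxes such as 0, 1, 2 and 100, the linear scale compresses the three small fluxes into nearly the same color, whereas the rank-based scale spreads them evenly across the gradient.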

3 Test case

To demonstrate CobraMod’s functionalities, we implemented two test cases based on in vivo and in silico overproduction studies in Escherichia coli. In the first example, we used a core model of E. coli (Orth et al.) to reproduce engineering strategies for improved shikimate synthesis (Chen et al.). Using our Escher interface, we visualized shikimate production for the control and one of the engineered strains (Fig. 1B). In the second example, we used a genome-scale model of E. coli (Monk et al.) to reproduce in silico experiments that introduce a synthetic homoserine cycle as an efficient route for methylotrophic growth (He et al.) and to demonstrate the strength of CobraMod’s pathway-centric curation procedures. The test cases, with a step-by-step workflow, can be found in the documentation.

4 Conclusion

CobraMod offers user-friendly, pathway-centric extension, curation and flux visualization for large-scale metabolic networks. It thus addresses modellers' common objective to study metabolic network behavior on the pathway level. CobraMod employs as much automation as possible and suggests necessary manual curation steps to ensure high-quality metabolic reconstructions. Our tool can be directly linked with COBRApy and the Escher visualization tool and thus enables coherent and reproducible workflows.

References (21 in total)

1.  Reconstructing genome-scale metabolic models with merlin.

Authors:  Oscar Dias; Miguel Rocha; Eugénio C Ferreira; Isabel Rocha
Journal:  Nucleic Acids Res       Date:  2015-04-06       Impact factor: 16.971

2.  Methods for automated genome-scale metabolic model reconstruction (review).

Authors:  José P Faria; Miguel Rocha; Isabel Rocha; Christopher S Henry
Journal:  Biochem Soc Trans       Date:  2018-07-31       Impact factor: 5.407

3.  Metabolic engineering of Escherichia coli for improving shikimate synthesis from glucose.

Authors:  Xianzhong Chen; Mingming Li; Li Zhou; Wei Shen; Govender Algasan; You Fan; Zhengxiang Wang
Journal:  Bioresour Technol       Date:  2014-05-21       Impact factor: 9.642

4.  Reconstruction and Use of Microbial Metabolic Networks: the Core Escherichia coli Metabolic Model as an Educational Guide.

Authors:  Jeffrey D Orth; R M T Fleming; Bernhard Ø Palsson
Journal:  EcoSal Plus       Date:  2010-09

5.  Creation and analysis of biochemical constraint-based models using the COBRA Toolbox v.3.0.

Authors:  Laurent Heirendt; Sylvain Arreckx; Thomas Pfau; Sebastián N Mendoza; Anne Richelle; Almut Heinken; Hulda S Haraldsdóttir; Jacek Wachowiak; Sarah M Keating; Vanja Vlasov; Stefania Magnusdóttir; Chiam Yu Ng; German Preciat; Alise Žagare; Siu H J Chan; Maike K Aurich; Catherine M Clancy; Jennifer Modamio; John T Sauls; Alberto Noronha; Aarash Bordbar; Benjamin Cousins; Diana C El Assal; Luis V Valcarcel; Iñigo Apaolaza; Susan Ghaderi; Masoud Ahookhosh; Marouen Ben Guebila; Andrejs Kostromins; Nicolas Sompairac; Hoai M Le; Ding Ma; Yuekai Sun; Lin Wang; James T Yurkovich; Miguel A P Oliveira; Phan T Vuong; Lemmer P El Assal; Inna Kuperstein; Andrei Zinovyev; H Scott Hinton; William A Bryant; Francisco J Aragón Artacho; Francisco J Planes; Egils Stalidzans; Alejandro Maass; Santosh Vempala; Michael Hucka; Michael A Saunders; Costas D Maranas; Nathan E Lewis; Thomas Sauter; Bernhard Ø Palsson; Ines Thiele; Ronan M T Fleming
Journal:  Nat Protoc       Date:  2019-03       Impact factor: 13.491

6.  Escher: A Web Application for Building, Sharing, and Embedding Data-Rich Visualizations of Biological Pathways.

Authors:  Zachary A King; Andreas Dräger; Ali Ebrahim; Nikolaus Sonnenschein; Nathan E Lewis; Bernhard O Palsson
Journal:  PLoS Comput Biol       Date:  2015-08-27       Impact factor: 4.475

7.  COBRApy: COnstraints-Based Reconstruction and Analysis for Python.

Authors:  Ali Ebrahim; Joshua A Lerman; Bernhard O Palsson; Daniel R Hyduke
Journal:  BMC Syst Biol       Date:  2013-08-08

8.  BiGG Models: A platform for integrating, standardizing and sharing genome-scale models.

Authors:  Zachary A King; Justin Lu; Andreas Dräger; Philip Miller; Stephen Federowicz; Joshua A Lerman; Ali Ebrahim; Bernhard O Palsson; Nathan E Lewis
Journal:  Nucleic Acids Res       Date:  2015-10-17       Impact factor: 16.971

9.  MEWpy: A Computational Strain Optimization Workbench in Python.

Authors:  Vítor Pereira; Fernando Cruz; Miguel Rocha
Journal:  Bioinformatics       Date:  2021-01-18       Impact factor: 6.937

10.  RAVEN 2.0: A versatile toolbox for metabolic network reconstruction and a case study on Streptomyces coelicolor.

Authors:  Hao Wang; Simonas Marcišauskas; Benjamín J Sánchez; Iván Domenzain; Daniel Hermansson; Rasmus Agren; Jens Nielsen; Eduard J Kerkhoven
Journal:  PLoS Comput Biol       Date:  2018-10-18       Impact factor: 4.475
