| Literature DB >> 35875150 |
Rachel H Ng1,2, Jihoon W Lee3,4, Priyanka Baloni1, Christian Diener1, James R Heath1,2, Yapeng Su4,5.
Abstract
The influence of metabolism on signaling, epigenetic markers, and transcription is highly complex yet important for understanding cancer physiology. Despite the development of high-resolution multi-omics technologies, it is difficult to infer metabolic activity from these indirect measurements. Fortunately, genome-scale metabolic models and constraint-based modeling provide a systems biology framework to investigate the metabolic states and define the genotype-phenotype associations by integrations of multi-omics data. Constraint-Based Reconstruction and Analysis (COBRA) methods are used to build and simulate metabolic networks using mathematical representations of biochemical reactions, gene-protein reaction associations, and physiological and biochemical constraints. These methods have led to advancements in metabolic reconstruction, network analysis, perturbation studies as well as prediction of metabolic state. Most computational tools for performing these analyses are written for MATLAB, a proprietary software. In order to increase accessibility and handle more complex datasets and models, community efforts have started to develop similar open-source tools in Python. To date there is a comprehensive set of tools in Python to perform various flux analyses and visualizations; however, there are still missing algorithms in some key areas. This review summarizes the availability of Python software for several components of COBRA methods and their applications in cancer metabolism. These tools are evolving rapidly and should offer a readily accessible, versatile way to model the intricacies of cancer metabolism for identifying cancer-specific metabolic features that constitute potential drug targets.Entities:
Keywords: cancer; constraint-based modeling; genome-scale metabolic models; metabolism; omics; python; single-cell analysis; systems biology
Year: 2022 PMID: 35875150 PMCID: PMC9303011 DOI: 10.3389/fonc.2022.914594
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Figure 1Constraint-based metabolic modeling. (A) A genome-scale metabolic model is a compartmentalized network of mass-balanced reactions that convert products to reactants, and boundary pseudo-reactions that import or export metabolites. Biological objectives, such as biomass production, require activity through a subset of internal reactions. (B) The metabolic model is converted into a stoichiometric matrix (S) of size m × n, with rows representing m metabolites and columns n reactions. Reaction flux through all internal reaction (v) and exchange reactions (e) is represented by vector v of length n. Objective function Z = c v is formulated as a linear combination of desired fluxes, weighted by vector c. (C) At steady state, the rate of production and consumption of a metabolite must be zero, which is described by the system of equations Sv = 0. There are many solutions to this system of equations, but the solution space can be constrained by imposing flux bounds (v ≤ v ≤ v ) and optimization such as maximization of objective function.
Figure 2Overview of Python software for major components of COBRA methods. Constraint-based metabolic modeling first requires loading a metabolic model into software that handles the various parts of the modeling framework (grey), such as metabolites, reactions, genes, stoichiometric matrix, and flux solutions. New metabolic models can be reconstructed from genome sequences and database, quality-checked by model testing software, made consistent using gap-filling tools, and visualized using web-based packages. Using the metabolic model, FBA (yellow) finds an optimal flux distribution that follows stoichiometry under steady state and can further be extended to dynamic systems. Since there are alternative optima (blue) to FBA, FVA and geometric FBA can be used to characterize the solution space. We can perturb (red) the system to predict the effect of knockouts and use such predictions to design an optimal system (‘strain’). To improve FBA predictions, we can add biophysical (green) constraints based on thermodynamics, proteins, and macromolecular expression. Metabolic modeling can be further enhanced by integration of multi-omics (purple) data, such as extracting reduced models based on omics data and adding regulatory constraints. Using omics data, metabolic modeling can become high-dimensional (brown), through single cell modeling and community modeling. Multiple metabolic models can be reduced into ensemble objects. In contrast to FBA, unbiased (pink) approaches do not require an objective function. These include methods for sampling flux distributions and pathway analyses. Names of software packages are in bold.
Python tools for constraint-based modeling.
| Category | Method | Software | URL | Doc. |
|---|---|---|---|---|
| Modeling framework | Object-oriented programming | COBRApy ( |
| ✔ |
| Testing | MEMOTE ( |
| ✔ | |
| Reconstruction | Template-based | AuReMe ( |
| ✔ |
| Template-based, gap-filling | CarveMe ( |
| ✔ | |
| Template-based | MetaDraft ( |
| ✔ | |
| Homology-based, multi-species, gap-filling | CoReCo ( |
| ✔ | |
| FBA | FBA ( | COBRApy | See above | ✔ |
| Dynamic metabolic modeling | Dynamic FBA ( | dfba ( |
| ✔ |
| Michaelis-Menten kinetics | DMPy ( |
| ✔ | |
| Alternative optima | Geometric FBA ( | COBRApy | See above | ✔ |
| FVA ( | ||||
| VFFVA | VFFVA ( |
| ✔ | |
| Knockout Simulation | Single/Double deletions ( | COBRApy | See above | ✔ |
| MOMA ( | ||||
| ROOM ( | ||||
| Flux- and graph-based | Conquest ( |
| ✔ | |
| Strain Design | OptGene ( | Cameo ( |
| ✔ |
| OptKnock ( | ||||
| Differential FVA | ||||
| FSEOF ( | ||||
| OptRAM ( | MEWpy ( |
| ✔ | |
| OptORF ( | ||||
| Omics constraints | E-flux ( | ReFramed ( |
| ✔ |
| CORDA | CORDA ( |
| ✔ | |
| GIM3E | GIM3E ( |
| ✔ | |
| FASTCORE ( | Troppo ( |
| ✘ | |
| CORDA ( | ||||
| GIMME ( | ||||
| tINIT ( | ||||
| iMAT ( | ||||
| Regulatory constraints | rFBA ( | MEWpy | See above | ✔ |
| SR-FBA ( | ||||
| PROM | PROM ( |
| ✔ | |
| GEM-PRO ( | ssbio ( |
| ✔ | |
| arFBA | arFBA ( |
| ✘ | |
| Thermodynamics | ll-FBA ( | COBRApy | See above | ✔ |
| CycleFreeFlux ( | ||||
| PTA | PTA ( |
| ✔ | |
| TFA, TVA ( | ReFramed | See above | ✔ | |
| TFA, TVA ( | pyTFA ( |
| ✔ | |
| Protein constraints | pFBA ( | COBRApy | See above | ✔ |
| GECKO ( | MEWpy | See above | ✔ | |
| sMOMENT | AutoPACMEN ( |
| ✔ | |
| ECMpy | ECMpy ( |
| ✔ | |
| ME-modeling | COBRAme | COBRAme ( |
| ✔ |
| Gap filling | MILP | COBRApy | See above | ✔ |
| Ensemble modeling | FBA | Medusa ( |
| ✔ |
| FVA | ||||
| Deletion | ||||
| ML | ||||
| Single cell modeling | Compass | Compass ( |
| ✔ |
| scFEA | scFEA ( |
| ✔ | |
| Community modeling | MICOM | MICOM ( |
| ✔ |
| Dynamic FBA | surfin_fba ( |
| ✔ | |
| Sampling | ACHR ( | COBRApy | See above | ✔ |
| OPTPG ( | ||||
| Pathway analysis | EFM | EFMlrs ( |
| ✔ |
| EFM ( | CoBAMP ( |
| ✔ | |
| Minimal cut sets ( | ||||
| Elementary flux patterns ( | ||||
| Visualization and web apps | Plug-in, website | Escher ( |
| ✔ |
| Plug-in, website | SAMMIpy ( |
| ✔ | |
| Plug-in | d3flux ( |
| ✔ |
Methods and their associated software packages as illustrated in , organized by their general function. Weblinks to each software’s official website and documentation are provided if available. Each software is assessed for availability of documentation (Doc.) or any form of demonstrative examples.
Pros and cons of COBRA methods.
| Category | Method/Tool | Pros | Cons |
|---|---|---|---|
|
| AuReMe | - Support for eukaryotes model | - Some manual refinement assistance |
| CarveMe | - GEMs ready for FBA | - No manual refinement assistance | |
| MetaDraft | -Support for eukaryotes model | - No manual refinement assistance | |
| CoReCo | - Support for eukaryotes model | - Requires KEGG license | |
|
| FBA | - Does not require kinetic parameters | - Requires objective function |
|
| Dynamic FBA (SOA and DAE) | - Couples pseudo-steady states to dynamical systems | - SOA requires small steps and thus more computation |
| DMPy | - Infers missing kinetic parameters using thermodynamics constraints | - Requires >80% of kinetic parameters for accuracy | |
|
| Geometric FBA | - Gives single representative solution – Reproducible typical solution (avoids randomly picking one solution from flux cone) | - Weak correlation with protein levels (without omics constraint) |
| FVA/VFFVA | - Determines min and max flux for a reaction would achieve optimal objective state | - Varies one reaction at a time | |
| Sampling | - Estimates probability distribution of feasible fluxes | - Computationally intensive | |
|
| E-flux | - Constraints reaction bounds only | - May over-constrain model based on noisy data |
| GIMME | - LP problem (fast) | - Discretizes data | |
| GIM3E | - Ensures operability of required metabolic function | - Discretizes data | |
| (t)INIT | - Ensures operability of required metabolic functions | - MILP problem (slow) | |
| iMAT | - No objective required | - Discretizes data | |
| FASTCORE | - LP problem (fast) | - Requires specification of core reactions | |
| CORDA | - LP problem (fast) | - Requires specification of core reactions | |
|
| rFBA | - Predicts flux over time intervals | - Uses boolean TRN |
| SR-FBA | - Combined calculation using metabolic and regulatory constraints | - Uses boolean TRN | |
| PROM | - Uses continuous TRN | - Requires TF-target gene relationships | |
| GEM-PRO | - Models protein instability | - Requires protein structures | |
| arFBA | - Models allosteric regulation | - Requires regulation matrix defining effector-reaction relationship | |
|
| ll-FBA | - Does not require metabolite concentrations or free energies | - MILP problem (slow) |
| CycleFreeFlux | - Post-process using LP problem (fast) | - Biased towards solutions with small total flux and those with same direction as their overlapping internal cycles | |
| TFA, TVA | - Explicitly models thermodynamics | - Requires metabolite concentrations and free energies | |
| PTA | - Explicitly models thermodynamics for optimization and sampling | - Requires metabolite concentrations and free energies | |
|
| pFBA | - Predicts growth rate, uptake/secretion rates, and essential genes | - Assumes that flux distribution with smallest magnitude minimizes protein costs |
| Enzymatic constraints | - Model proteome limitation at enzyme resolution | - Requires experimentally measured enzyme turnover numbers | |
|
| COBRAme | - Modeling proteome composition improves predictive accuracy | - Large model size and complexity |
|
| Medusa | - Compresses multiple models into compact ensemble objects | - No standardized SBML format for ensemble objects |
|
| Compass | - Genome-scale modeling | - Map gene expression to reaction expression using boolean relationships (GPR) |
| scFEA | - Minimizes flux imbalance of all cells to simulate exchange of metabolites | - Not easily scalable due to large memory usage | |
|
| MICOM | - Models exchanges and interactions between communities and environment | - Assumes trade-offs between individual and community growth rate (gut microbiome specific) |
| Dynamic FBA (surfin_fba) | - Reduces optimizations problems (and parameter space) required for dynamic FBA for communities | - Non-biological approach to choosing between non-unique optima | |
|
| EFM | - Unbiased characterization of models (no objective function required) | - (EFMlrs) EFM calculation performed by other tools not included in program |
Some method comparisons extracted from literature for reconstruction (87, 88), dynamic modeling (89), omics constraints (90, 91), and regulatory constraints (92). Growth rate, uptake/secretion rates, and cancer essential gene prediction performances from Jamialahmadi et al. are based on human metabolic models and available only for GIMME, INIT, iMAT, FASTCORE, CORDA, and pFBA (91).
Figure 3Overview of metabolic interactions within the tumor microenvironment. The TME is composed of cancer cells, immune cells, and stromal cells embedded in extracellular matrix (ECM). Limited nutrients and oxygen lead to metabolic competition between cancer and various lymphocytes, especially hampering anti-tumor activity of effector T cells (TEFF). Cancer cells adapts via upregulating nutrient transport and altering cancer-associated fibroblasts (CAF) to replenish metabolites. T cell immunity is further suppressed by cancer cells’ release of lactate produced by glycolysis and by recruitment of immune-suppressive cells due to Indoleamine 2,3-dioxygenase (IDO) activity. TMEM, memory T cell; NK, natural killer cell; Treg, regulatory T cell; TAM, tumor-associated macrophage.
List of cancer metabolic modeling studies.
| Ref. | Cancer | Purpose | Method | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Drug design Predict drug targets Explore Cancer biology Explain Warburg effect Patient classification | Integration | Model | Analysis | Constraints | Objective | ||||||||
| ( | Breast | x | x | Lee-12 | HMR 1 | FBA, Comparative, Topological | Transcriptome, Fluxomic | Data Correlation | |||||
| ( | Colorectal | x | x | tINIT | Human1 | TFA, TFVA, pTFVA | Thermodynamic, Transcriptome, Biomass | Biomass | |||||
| ( | Eye | x | x | iMAT | Recon 2 | Gap filling, FBA, FVA, Knockout | Transcriptome | Biomass, Tasks | |||||
| ( | Head and Neck | x | x | Upper bound | Recon 2 | FBA, Sampling, Knockout | Thermodynamics, Enzyme kinetics, Transcriptome, Metabolome | ATP, NADPH | |||||
| ( | Liver | x | x | iMAT-like | Recon 1 | Comparative, FBA, Sampling | Transcriptome | Data Similarity | |||||
| ( | Multiple | x | x | tINIT | HMR 2 | Comparative, Knockout | Transcriptome | Tasks | |||||
| ( | Multiple | x | x | tINIT | HMR 2 | FBA, Knockout | Transcriptome | Biomass, Tasks | |||||
| ( | Generic | x | x | Small-scale | FBA, DFBA, FVA, Knockout, Sampling | Biomass | |||||||
| ( | Kidney | x | x | tINIT | iCancer-Core | FBA, Knockout | Transcriptome | Biomass | |||||
| ( | Brain | x | tINIT | HMR 2 | Comparative, FBA, Knockout | Transcriptome | Biomass, Tasks | ||||||
| ( | Breast, Lung, Multiple | x | Upper bound | HMR 1 | FBA, Sampling, Knockout | Transcriptome | Biomass | ||||||
| ( | Generic | x | MBA | Recon 1 | FBA, Knockout | Transcriptome | Biomass | ||||||
| ( | Kidney | x | MBA | Recon 1 | FBA, Knockout | Transcriptome | Biomass | ||||||
| ( | Liver | x | tINIT | HMR 2 | Comparative, FBA, Knockout | Proteome | Biomass, Tasks | ||||||
| ( | Liver | x | tINIT | HMR 2 | FBA, Knockout, Topological | Transcriptome | Biomass, Tasks | ||||||
| ( | Multiple | x | iMAT | Recon 1 | FBA, ML, Topological | Transcriptome | Data Similarity | ||||||
| ( | Multiple | x | GIMME | Recon 2 | FBA, Sampling, Knockout | Mutations, Transcriptome | Biomass | ||||||
| ( | Prostate | x | tINIT | iCancer-Core | FBA, Knockout, Sampling | Transcriptome, Proteome | Biomass, Tasks | ||||||
| ( | Breast, Kidney, Liver, Prostate | x | x | KEGG | Network Propagation, Knockout, ML | Transcriptome | |||||||
| ( | Colorectal | x | x | tINIT | HMR 2 | Comparative | Transcriptome | ||||||
| ( | Multiple | x | x | Recon 2 | Regulatory, Topological, ML | Transcriptome, Metabolome | |||||||
| ( | Multiple | x | x | mCADRE | Recon 1 | Comparative | Transcriptome | Tasks, Biomass | |||||
| ( | Multiple, Brain, Lung, Breast, Leukemia, Prostate | x | x | tINIT | Recon 3D | Comparative, Knockout, ML | Mutations, Protein Structures | Biomass | |||||
| ( | Prostate | x | x | iMAT | Recon 2 | FBA, FVA | Transcriptome | Data Similarity | |||||
| ( | Kidney, Prostate | x | INIT | HMR 1 | Knockout | Proteome, Fluxomic | Biomass | ||||||
| ( | Multiple | x | INIT | HMR 1 | Comparative | Proteome | |||||||
| ( | Multiple | x | Topological | ||||||||||
| ( | Generic | x | x | C2M2N | FBA | Biosynthesis, Biomass | |||||||
| ( | Breast, Colorectal | x | x | Recon 2 | FVA, ML | Metabolome | Metabolite Change | ||||||
| ( | Liver | x | x | tINIT | HMR 2 | Comparative | Transcriptome, Proteome | Biomass, Tasks | |||||
| ( | Brain | x | GIMME, MADE | iMS570 | pFBA, Sampling | Transcriptome | Biomass | ||||||
| ( | Breast | x | E-Flux | Recon2 | FBA | Proteome | Biomass | ||||||
| ( | Colorectal | x | Recon 2.2 | Comparative | Transcriptome | ||||||||
| ( | Colorectal | x | CORDA | Recon 2.2 | FBA, FVA, Topological | Proteome | Biomass, ATP | ||||||
| ( | Generic | x | HMRcore | popFBA, Sampling | Loopless | Biomass | |||||||
| ( | Kidney | x | Recon 1 | pFBA | Flux measurements | Biomass | |||||||
| ( | Kidney | x | INIT | HMR 1 | Comparative | Proteome | |||||||
| ( | Liver | x | tINIT | HMR 2 | Comparative, Gap filling, Regulatory, FBA | Transcriptome, Metabolome | Biomass | ||||||
| ( | Lung | x | 13C flux analysis | Flux measurements, Labeling measurements | |||||||||
| ( | Lung | x | Central Carbon, Recon 2 | Elementary modes, Structural fluxes, pFBA | Protein efficiency | Biomass, Biosynthesis | |||||||
| ( | Lung, Breast | x | E-Flux | HMRcore | scFBA | scRNA-seq, metabolomics | Biomass | ||||||
| ( | Lung, Prostate | x | E-Flux | Recon 1 | FVA | Transcriptome | Biomass | ||||||
| ( | Multiple | x | tINIT | HMR 2 | Comparative | Transcriptome | |||||||
| ( | Prostate | x | IMAT, GIMME, Gonçalves, MADE | HMR 2 | FBA | Transcriptome | Data Similarity | ||||||
| ( | Generic | x | Recon 1 | FBA, FVA, Sampling | Protein efficiency, Enzyme kinetics | Biomass | |||||||
| ( | Generic | x | ATP | FBA | Protein efficiency | ATP | |||||||
| ( | Generic | x | ATP, BiGG | FBA | Protein efficiency | ATP, Nutrient cost | |||||||
| ( | Liver | x | MADE | Recon 2 | Comparative | Transcriptome | Data Similarity | ||||||
| ( | Liver | x | Upper bound | Recon 3D | FBA, FVA | Transcriptome, Nutrient availability | Biomass | ||||||
| ( | Liver | x | Bounds | Recon 2 | FBA | Protein efficiency, Transcriptome | ATP | ||||||
| ( | Multiple | x | E-Flux | Recon 1 | FBA | Transcriptome | Biomass | ||||||
Figure 4Applications of COBRA methods to cancer research. Workflow diagram of using various COBRA methods (colored) in combination to achieve different objectives (grey).
Human metabolic generic models and cancer models.
| Model | Scale | No. of Reactions | No. of Metabolite | No. of Genes | Web link |
|---|---|---|---|---|---|
| HMR 1 ( | Genome | 8174 | 6006 | 3674 |
|
| HMR 2 ( | Genome | 8181 | 6007 | 3765 |
|
| Recon 1 ( | Genome | 3741 | 2766 | 1905 |
|
| Recon 2 ( | Genome | 7440 | 5063 | 2194 |
|
| Recon 3D ( | Genome | 10600 | 5835 | 2248 |
|
| Human1 ( | Genome | 13069 | 8366 | 3067 |
|
| Cancer central metabolism ( | Small | 80 | 66 | 46 |
|
| iCancer-Core (iHCC2578) ( | Genome | 7762 | 5566 | 2892 |
|
| C2M2N ( | Small | 77 | 54 | – |
|
| HMRcore ( | Intermediate | 315 | 256 | 418 |
|
| Central Carbon ( | Small | 114 | 120 | – |
|
| iMS570 (brain) ( | Genome | 630 | 524 | 570 |
|
This table describes various human reconstructions that are used as the starting reference models in various cancer applications listed in . The most updated online links to these models may be different than previously described in their original manuscripts.