| Literature DB >> 35625648 |
Xinyu Bi1,2, Yanfeng Liu1,2, Jianghua Li1,2, Guocheng Du1,2, Xueqin Lv1,2, Long Liu1,2.
Abstract
Genome-scale metabolic models (GEMs) are effective tools for metabolic engineering and have been widely used to guide cell metabolic regulation. However, the single gene-protein-reaction data type in GEMs limits the understanding of biological complexity. As a result, multiscale models that add constraints or integrate omics data based on GEMs have been developed to more accurately predict phenotype from genotype. This review summarized the recent advances in the development of multiscale GEMs, including multiconstraint, multiomic, and whole-cell models, and outlined machine learning applications in GEM construction. This review focused on the frameworks, toolkits, and algorithms for constructing multiscale GEMs. The challenges and perspectives of multiscale GEM development are also discussed.Entities:
Keywords: machine learning; multiconstraint models; multiomics models; multiscale genome-scale metabolic models; whole-cell models
Mesh:
Year: 2022 PMID: 35625648 PMCID: PMC9139095 DOI: 10.3390/biom12050721
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1The main development timeline of multiscale GEMs and the application of machine learning.
Algorithms and frameworks for the construction and application of multiscale models.
| Model Type | Year | Algorithm/Framework | Language | Task | Reference |
|---|---|---|---|---|---|
| Constraint-based models | 2007 | TMFA | MATLAB | Thermodynamic constraint model | [ |
| 2019 | MatTFA, pyTFA | MATLAB, Python | Toolkit of build thermodynamic constraint model | [ | |
| 2007 | FBAwMC | MATLAB | Calculation scheme of enzyme concentration | [ | |
| 2012 | MOMENT | MATLAB | Enzymatic constraint model | [ | |
| 2017 | GECKO | MATLAB | Comprehensive framework for enzyme constraint models | [ | |
| 2006 | Structural Kinetic Modeling | MATLAB | Dynamic analysis of metabolic systems | [ | |
| 2008 | MASS framework | MATLAB | Evaluate the dynamic properties of the model and formulate a timescale hierarchy | [ | |
| 2010 | ORACLE | MATLAB | Introducing the state space of the enzyme into the model | [ | |
| 2008 | Ensemble Modelling | MATLAB | Framework for Steady-State kinetics model | [ | |
| 2016 | ABC-GRASP | MATLAB | Framework for modeling uncertain dynamics data | [ | |
| 2021 | ETGEM | Python | Framework of enzyme constraints and thermodynamic constraints | [ | |
| 2020 | Expression and Thermodynamics Flux models | Python | Multi-omics integrated framework | [ | |
| Multi-scale Integrated models | 2011 | TIGER | MATLAB | Integrate TRN and GEM platforms | [ |
| 2015 | FlexFlux | Java | Integrate TRN and GEM platforms | [ | |
| 2010 | Probabilistic Regulation of Metabolism | MATLAB | Toolkit of integrate TRN and GEM | [ | |
| 2017 | TRFBA | MATLAB | Toolkit of integrate TRN and GEM | [ | |
| 2019 | OptRAM | MATLAB | Predict optimal metabolic flux in TRN-integrated GEM | [ | |
| 2016 | GEM-PRO | MATLAB | Integration of protein structure with GEM | [ | |
| 2019 | GEMMER | Python + Java | Database for multiscale modeling | [ | |
| Whole cell model | 2006 | GEM System | Java | Toolbox for building metabolic pathways in whole-cell models | [ |
| 2021 | Pathway Tools | Python + Java | Software for pathway and genetic data | [ | |
| 2013 | WholeCellKB | Python + SQL | Database of whole-cell models | [ | |
| 2020 | CellML | XML | Mathematical models describing cellular physiological systems | [ | |
| 2003 | E-Cell | C++ | Multiplatform cell simulation system | [ | |
| 2014 | CellDesigner | SBML | modeling tool for biochemical networks | [ | |
| 2009 | Complex pathway simulator | SBML | Software for biochemical network modeling and simulation | [ | |
| 2009 | Biochemical simulations | Python | Random mixture algorithm | [ | |
| 2014 | WholeCellSimDB | Python + Java | Database of whole-cell model predictions. | [ | |
| 2013 | WholeCellViz | Java + SOL | visualization for whole-cell models | [ | |
| Machine learning-based models | 2019 | DeepEC | Python | EC number prediction by deep learning | [ |
| 2020 | ART, TeselaGen EVOLVE | Python | Multi-level training datasets for accurate prediction | [ | |
| 2020 | BEMKL, bagged random forest, multimodal artificial neural network, sparse group lasso, NSGA-II, iterative random forests | Python | Multiomics and multimodal algorithms to predict phenotypes | [ | |
| 2020 | AMMEDEUS | Python | Tools to identify changes in model structure | [ | |
| 2014 | regularized multinomial logistic regression | MATLAB | Tool for phenotypic inverse prediction of growth conditions | [ | |
| 2016 | primary elementary modal analysis | Python | Identifying metabolic patterns in fluxomics based on metabolic pathways | [ | |
| 2018 | dynEMR-DA | MATLAB | Algorithm for environment-driven dynamic performance discrimination | [ | |
| 2016 | support vector machines, k-nearest neighbors, decision trees | MATLAB | Method for rapid prediction of bacterial heterotrophic fluxomics | [ |
ABC-GRASP: Approximate Bayesian Computation-General Reaction Assembly and Sampling Platform; AMMEDEUS: automated metabolic model ensemble-driven elimination of uncertainty with statistical learning; ART and TeselaGen EVOLVE: Automatic Recommendation Tool and TeselaGen EVOLVE; BEMKL: Bayesian efficient multiple-kernel learning; dynEMR-DA: Dynamic Fundamental Mode Regression Discriminant Analysis; ETGEM: Pyomo-based model framework integrating enzymatic constraints and thermodynamic constraints; FBAwMC: Flux Balance Analysis with Molecular Crowding; GECKO: GEMs with Enzymatic Constraints using Kinetic and Omics data; GEM-PRO: genome-scale model with protein structure; GEMMER: genome-wide tool for multi-scale modeling data extraction and representation; MOMENT: MetabOlic Modeling with ENzyme kineTics; NSGA-II: nondominated sorting genetic algorithm II; OptRAM: optimization of regulatory and metabolic networks; ORACLE: Optimization and Risk Analysis of Complex Living Entities; TMFA: thermodynamically based metabolic flux analysis; TIGER: toolbox for integrating genome-scale metabolism; TRFBA: transcriptional regulation flux balance analysis.
Figure 2The main classification and construction framework of multiscale GEMs.
Figure 3Representative application of machine learning in GEMs.