| Literature DB >> 30108559 |
Osvaldo D Kim1,2, Miguel Rocha2, Paulo Maia1.
Abstract
Mathematical modeling is a key process to describe the behavior of biological networks. One of the most difficult challenges is to build models that allow quantitative predictions of the cells' states along time. Recently, this issue started to be tackled through novel in silico approaches, such as the reconstruction of dynamic models, the use of phenotype prediction methods, and pathway design via efficient strain optimization algorithms. The use of dynamic models, which include detailed kinetic information of the biological systems, potentially increases the scope of the applications and the accuracy of the phenotype predictions. New efforts in metabolic engineering aim at bridging the gap between this approach and other different paradigms of mathematical modeling, as constraint-based approaches. These strategies take advantage of the best features of each method, and deal with the most remarkable limitation-the lack of available experimental information-which affects the accuracy and feasibility of solutions. Parameter estimation helps to solve this problem, but adding more computational cost to the overall process. Moreover, the existing approaches include limitations such as their scalability, flexibility, convergence time of the simulations, among others. The aim is to establish a trade-off between the size of the model and the level of accuracy of the solutions. In this work, we review the state of the art of dynamic modeling and related methods used for metabolic engineering applications, including approaches based on hybrid modeling. We describe approaches developed to undertake issues regarding the mathematical formulation and the underlying optimization algorithms, and that address the phenotype prediction by including available kinetic rate laws of metabolic processes. Then, we discuss how these have been used and combined as the basis to build computational strain optimization methods for metabolic engineering purposes, how they lead to bi-level schemes that can be used in the industry, including a consideration of their limitations.Entities:
Keywords: dynamic modeling; hybrid modeling; metabolic engineering; phenotype prediction; strain optimization
Year: 2018 PMID: 30108559 PMCID: PMC6079213 DOI: 10.3389/fmicb.2018.01690
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1Modeling framework based on dynamic systems. The process of engineering a wild-type strain starts by reconstructing a model from its genome sequence complemented with information extracted from biological databases and literature. Then, the process can be divided in three main interacting blocks. (A) Model: it can include stoichiometric information only, or a combination with kinetics, whose parameters need to be estimated. Both types are validated and improved within an iterative process of curation. (B) Simulation: the model is used to predict the phenotype of the system. For kinetic approaches, the behavior of the steady and transient states of metabolite concentrations and fluxes are calculated, while with purely stoichiometry approaches, typically, a sensible flux distribution obeying the imposed constraints and optimizing a given biological assumption is sought. This flux distribution can be further delimited, in a hybrid fashion, by using information from the solution of the ordinary differential equation (ODE) system, if available. (C) Strain optimization: the phenotype is evaluated and optimized until meeting a termination criterion. The cycle consists in integrating solutions as perturbations to the model, in the form of changes to the kinetic parameters or to the constraints, so that a new phenotype can be simulated. In the end, a set of candidate designs is obtained.
Classification of kinetic rate expressions.
| Mechanistic | Michaelis- Menten | To have the basic mechanistic expressions. With kinetics where enzyme concentration is much lower than the substrate concentration. To model the summation of the effects of two or more reversible inhibitors or activators | (+) For very complex and not fully-understood mechanisms. | Pharmacokinetic model | Sheiner and Beal, |
| Hill rate laws | To model structure functional features of molecular genetic systems that do not demand knowledge of their detailed mechanism | (+) Suitable for not well-known molecular mechanisms by using generalized functions | Regulation of the expression of the | Likhoshvai and Ratushny, | |
| Approximate | Lin-log | In gene regulatory systems where rates are proportional to enzyme levels. With number of parameters as small as possible | (+) Analytic solution of steady-state network balances are desirable. | Glycolisis in | del Rosario et al., |
| Log-lin | For metabolic systems subject to spatiotemporal variations of system parameters and the process operating conditions. | (+) Accurately describes dynamic responses of strongly non-linear systems with analytic solutions. | Yeast glycolytic system | Hatzimanikatis and Bailey, | |
| Power laws | For arbitrary systems of enzyme-catalyzed reactions. Able to simply model aggregation and consumption processes | (+) Suitable solution approximation for enormous non-linear chemical systems using conventional numerical methods | Basic growth of complex systems | Savageau, | |
| Convenience rate laws | To represent enzyme saturation and regulation by activators and inhibitors. It uses thermodynamically independent system parameters | (+) Small number of parameters that can be easily computed with least-squares estimation methods | Chinese hamster ovary cell metabolism | Nolan and Lee, | |
| Modular rate laws | In reversible rate laws for reactions with arbitrary stoichiometries and various types of regulation | (+) Simplifies thermodynamic-kinetic modeling formalisms being flexible and biochemically plausible. (−) Less accurate than detailed kinetic equations | Cycle of three reactions (illustrative example) | Liebermeister et al., | |
| Cooperativity and saturation | To fit experimental data using systems with saturable form | (+) Expected to be accurate over a wider range around the operating point if the approximated functions are saturated. (−) Need of a large number of parameters (increased estimation efforts). (−) Common canonical formalisms do not have saturable form | Illustrative example of metabolic network with one positive feedforward and one negative feedback | Sorribas et al., | |
| Stochastic | Continuous space | For stiff systems, which can evolve on slow and fast time scales, and having stability in the fastest modes | (+) Able to describe the common enzyme-catalyzed conversion of a substrate into a product. Dramatically speeds up the stiff reactions. | Simulation of the general stiff enzyme-substrate reaction | Cao et al., |
| Discrete space | For problems with identifiability issues, when changes in the species are discrete and random, rather than continuous and deterministic | (+) Capture variability for bistable systems; able to deal with noise (randomness) | Isomerization of proteins | Ullah and Wolkenhauer, |
A summary of dynamic modeling methods, condensing information about the cases for using the different methods, their advantages and disadvantages, and an illustrative example of their application.
Figure 2Phenotype prediction approaches: interaction networks, constraint-based and kinetics-based methods can be classified qualitatively according to the level of detail and accuracy (horizontal axis), and the usual size of network (vertical axis). Position toward upper levels means genome-scale networks. Constraint-based and kinetics-based approaches can be joined in hybrid methods that aim at taking the best advantages of each of them. Examples of applications of kinetics-based and hybrid approaches are given (from Tables 1, 3, respectively).
Examples of hybrid models used only for phenotype prediction or for computational strain optimization.
| Dynamic Flux Balance Analysis (DFBA) | Exact, nonlinear programming | Phenotype prediction | Diauxic growth in | The COBRA Toolbox | Experimental | Mahadevan et al., |
| Integrated DFBA | Exact, linear programming | Phenotype prediction | Integrated signaling, metabolism and transcription regulation in | Not available | Theoretical | Lee et al., |
| Integrated Flux Balance Analysis | Exact, linear programming | Phenotype prediction | Metabolism, regulation and signaling of | SimTK | Theoretical | Covert et al., |
| Integration of kinetic expressions as constraints into DFBA | Exact, nonlinear programming | Strain optimization | Production of glycerol and ethanol in | Not available | Theoretical | Gadkar et al., |
| Dynamic strain scanning optimization for balanced yield, titer and productivity | Exact, nonlinear programming | Strain optimization | Production of succinate and 1,4-butanediol in | Framed GitHub repository | Theoretical | Zhuang et al., |
| k-OptForce: integration of kinetics with Flux Balance Analysis | Exact, mixed-integer nonlinear programming | Strain optimization | Production of L-serine in mutant | Not available | Experimental | Chowdhury et al., |
The COBRA Toolbox. “Dynamic FBA”. Opencobra.github.io. https://opencobra.github.io/cobratoolbox/stable/modules/analysis/dynamicFBA/index.html?highlight=dynamicfba
SimTK. “Integrated Flux Balance Analysis Model of Escherichia coli”. SimTK.org. https://www.simtk.org/projects/ifba/
Framed GitHub repository. “A python FRAmework for Metabolic Engineering and Design”. Github.com. .
Figure 3Description of the overall parameter estimation procedure. First, quality of experimental data is studied to determine suitable parameters via structural identifiability analysis. Then, parameter estimation is performed, locally or globally, according to the type of problem formulation. The equivalence between the geometrical and statistical formulations is noted.
Applications of computational strain optimization methods using dynamic models.
| Linearization of kinetic model and iterative optimization | Approximate, mixed-integer linear programming | Serine overproduction in | Theoretical (experimental evidence) | Vital-Lopez et al., |
| Metaheuristics to identify modifications of parameters | Stochastic, evolutionary computation | Maximize production of dihydroxyacetone phosphate in | Theoretical (experimental evidence) | Evangelista et al., |
| Brute-force to simultaneously identify process and cell modifications | Exact, exhaustive search | Improvement of antibody production in Chinese Hamster Ovary cells | Experimental | Nolan and Lee, |
| Metaheuristics to identify modifications of parameters | Stochastic, evolutionary computation | Maximization of serine production by | Theoretical (experimental evidence) | Evangelista et al., |
| Multi-objective dynamic optimization | Exact, mixed-integer nonlinear programming | Sustained and robust growth of Chinese Hamster Ovary cells | Theoretical (experimental evidence) | Villaverde et al., |