Literature DB >> 26076722

REDEMPTION: reduced dimension ensemble modeling and parameter estimation.

Yang Liu1, Erica Manesso1, Rudiyanto Gunawan1.   

Abstract

UNLABELLED: Here, we present REDEMPTION ( RE: duced D: imension E: nsemble M: odeling and P: arameter estima TION: ), a toolbox for parameter estimation and ensemble modeling of ordinary differential equations (ODEs) using time-series data. For models with more reactions than measured species, a common scenario in biological modeling, the parameter estimation is formulated as a nested optimization problem based on incremental parameter estimation strategy. REDEMPTION also includes a tool for the identification of an ensemble of parameter combinations that provide satisfactory goodness-of-fit to the data. The functionalities of REDEMPTION are accessible through a MATLAB user interface (UI), as well as through programming script. For computational speed-up, REDEMPTION provides a numerical parallelization option using MATLAB Parallel Computing toolbox.
AVAILABILITY AND IMPLEMENTATION: REDEMPTION can be downloaded from http://www.cabsel.ethz.ch/tools/redemption. CONTACT: rudi.gunawan@chem.ethz.ch.
© The Author 2015. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2015        PMID: 26076722      PMCID: PMC4595898          DOI: 10.1093/bioinformatics/btv365

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Mathematical modeling plays an increasingly important role in understanding and predicting biological phenomena. In this regard, ordinary differential equation (ODE) models are often created to describe the dynamic behavior of biological systems based on mass or molar conservation, as follow: where t denotes time, is the species concentration vector, is the stoichiometric or connectivity matrix, is the reaction rate or flux equation vector, denotes the kinetic parameter vector and denotes the initial concentration vector. The estimation of unknown kinetic parameters from time-series data is the most challenging step in the creation of such biological models, due to: (i) high computational cost associated with global optimizations and with integrating ODEs (Chou and Voit, 2009), and (ii) the lack of parameter identifiability (Srinath and Gunawan, 2010). The high computational complexity could be addressed by performing the parameter estimation incrementally. Here, time-series data are first smoothened and differentiated, and subsequently the dynamic flux values are computed by viewing Equation (1) as an algebraic equation. The parameters are estimated one flux at a time while avoiding ODE integrations (Voit and Almeida, 2004). However, such an approach requires that S has a full column rank, which is often not satisfied in the modeling of biological systems. We recently developed incremental parameter estimation (IPE) and integrated flux parameter estimation (IFPE) to address the limitation related to the matrix S. In IPE (IFPE), we formulated the parameter estimation as a nested optimization, where the outer optimization was performed over a subset of parameters corresponding to the independent (integrated) fluxes, defined such that given their values, the remaining (integrated) fluxes could be estimated from the slope of time-series data (Jia ; Liu and Gunawan, 2014). The inner optimizations involved estimating parameters associated with the dependent flux subset using their estimated (integrated) flux values, performed one flux at a time. Meanwhile, incomplete parameter identifiability implies that many parameter combinations could fit the data equally well (Jia ). In ensemble modeling, instead of generating one best-fit parameter estimate, one looks for a set of parameter combinations which fit the data within an acceptable prediction accuracy, e.g. an upper bound for the error function. We extended our incremental estimation strategy above such that the identification of parameter ensemble could be efficiently performed over smaller parameter dimensions (Jia ). Reduced Dimension Ensemble Modeling and Parameter estimation (REDEMPTION) provides a UI for efficient parameter estimation and ensemble modeling based on IPE and IFPE in MATLAB. The functions within REDEMPTION were designed to address a common scenario in biological modeling, where the number of reactions exceeds that of the (measured) species. When this scenario does not apply, REDEMPTION automatically reverts to a standard IPE. For ensemble modeling, REDEMPTION employs an efficient parameter exploration algorithm HYPERSPACE, which combines an Adaptive Monte Carlo method and a multiple ellipsoids based sampling (Zamora-Sillero ).

2 Main features

Model and data specifications: REDEMPTION’s UI starts with the Main window (Fig. 1a), from which users can access all functionalities. The ODE model equations can be specified manually through the Model Editor using power-law or linear-logarithmic (lin-log) kinetics (Fig. 1c), or by importing an SBML file. In addition, REDEMPTION requires upper and lower bound values for the model parameters. A parameter will be estimated from data when the upper and lower bounds differ. Users also need to provide the time-series concentration data in Comma-Separated Values (CSV) format. For data pre-processing, REDEMPTION includes piecewise polynomial spline-fitting, where users can adjust the number of pieces and the order of polynomials (Fig 1d).
Fig. 1.

Workflow of parameter estimation and ensemble generation in REDEMPTION applied to branched pathway example

Workflow of parameter estimation and ensemble generation in REDEMPTION applied to branched pathway example Parameter estimation: Figure 1e depicts REDEMPTION’s parameter estimation UI. For parameter estimation, REDEMPTION provides four variants of the IPE and IFPE methods and two error functions using sum of squares of the absolute or relative model prediction errors (User Guide for more details). Here, users can select MATLAB’s Genetic Algorithm (GA) or enhanced Scatter Search (eSS) (Egea ) as the global optimization algorithm. Ensemble modeling: After a successful run of parameter estimation, users can generate parameter ensemble corresponding to parameter combinations whose error function values are smaller than a user-specified threshold (Fig. 1f). The optimal error function value obtained from the prior parameter estimation step is provided as a reference for setting this threshold. The appropriate threshold could for example be calculated by bootstrapping the original data (Jia ). Figure 1f shows a 2D parameter projection of the generated parameter ensemble, in which each blue dot represents a parameter combination in the ensemble and the largest dot corresponds to the optimal solution of the parameter estimation step. To include an additional dataset, users can perform parameter estimation and ensemble modeling again using the additional data and new parameter bounds based on the parameter ensemble (User Guide for more information).

3 Examples

Figure 1 illustrates the workflow of REDEMPTION for parameter estimation and ensemble generation using a branched pathway example as shown in Figure 1b (Voit and Almeida, 2004). REDEMPTION also includes the lin-log modeling of Lactococcus lactis glycolytic pathway as another example (del Rosario ). Details of these examples can be found in the User Guide.

4 Conclusions

REDEMPTION provides an integrated, numerically efficient tool for estimating kinetic parameters and parameter ensemble of ODE models. The tools within REDEMPTION are accessible through a user-friendly MATLAB UI and applicable for ODE models with power-law or lin-log kinetics, and those in SBML format.
  8 in total

1.  Decoupling dynamical systems for pathway identification from metabolic profiles.

Authors:  Eberhard O Voit; Jonas Almeida
Journal:  Bioinformatics       Date:  2004-02-26       Impact factor: 6.937

2.  Challenges in lin-log modelling of glycolysis in Lactococcus lactis.

Authors:  R C H del Rosario; E Mendoza; E O Voit
Journal:  IET Syst Biol       Date:  2008-05       Impact factor: 1.615

3.  Parameter identifiability of power-law biochemical system models.

Authors:  Sridharan Srinath; Rudiyanto Gunawan
Journal:  J Biotechnol       Date:  2010-03-01       Impact factor: 3.307

Review 4.  Recent developments in parameter estimation and structure identification of biochemical and genomic systems.

Authors:  I-Chun Chou; Eberhard O Voit
Journal:  Math Biosci       Date:  2009-03-25       Impact factor: 2.144

5.  Efficient characterization of high-dimensional parameter spaces for systems biology.

Authors:  Elías Zamora-Sillero; Marc Hafner; Ariane Ibig; Joerg Stelling; Andreas Wagner
Journal:  BMC Syst Biol       Date:  2011-09-15

6.  Ensemble kinetic modeling of metabolic networks from dynamic metabolic profiles.

Authors:  Gengjie Jia; Gregory Stephanopoulos; Rudiyanto Gunawan
Journal:  Metabolites       Date:  2012-11-12

7.  Parameter estimation of dynamic biological network models using integrated fluxes.

Authors:  Yang Liu; Rudiyanto Gunawan
Journal:  BMC Syst Biol       Date:  2014-11-18

8.  Incremental parameter estimation of kinetic metabolic network models.

Authors:  Gengjie Jia; Gregory Stephanopoulos; Rudiyanto Gunawan
Journal:  BMC Syst Biol       Date:  2012-11-21
  8 in total
  1 in total

1.  Understanding biochemical design principles with ensembles of canonical non-linear models.

Authors:  Lukas Bromig; Andreas Kremling; Alberto Marin-Sanguino
Journal:  PLoS One       Date:  2020-04-30       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.