Literature DB >> 29069312

PESTO: Parameter EStimation TOolbox.

Paul Stapor1,2, Daniel Weindl1, Benjamin Ballnus1,2, Sabine Hug1,2, Carolin Loos1,2, Anna Fiedler1,2, Sabrina Krause1,2, Sabrina Hroß1,2, Fabian Fröhlich1,2, Jan Hasenauer1,2.   

Abstract

Summary: PESTO is a widely applicable and highly customizable toolbox for parameter estimation in MathWorks MATLAB. It offers scalable algorithms for optimization, uncertainty and identifiability analysis, which work in a very generic manner, treating the objective function as a black box. Hence, PESTO can be used for any parameter estimation problem, for which the user can provide a deterministic objective function in MATLAB. Availability and implementation: PESTO is a MATLAB toolbox, freely available under the BSD license. The source code, along with extensive documentation and example code, can be downloaded from https://github.com/ICB-DCM/PESTO/. Contact: jan.hasenauer@helmholtz-muenchen.de. Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2017. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2018        PMID: 29069312      PMCID: PMC5860618          DOI: 10.1093/bioinformatics/btx676

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Background

In many disciplines such as computational biology, engineering and operations research, researchers are faced with the problem of finding a set of parameters optimizing a given objective function and determining uncertainties of those estimates. There are various toolboxes available which offer this functionality, however, most of them focus either on individual aspects or are tailored to specific model types, e.g. ordinary differential equations (ODEs) or partial differential equations (PDEs). Hence, the user must adapt to different tools, depending on the current problem structure. A feature comparison of a selection of toolboxes related to parameter estimation is provided as Supplementary Table S1. In an effort to provide a universally applicable parameter estimation toolbox, we developed PESTO. PESTO is a freely available MATLAB toolbox offering scalable state-of-the-art algorithms for optimization, uncertainty and identifiability analysis, which do not depend on any problem-specific assumptions.

2 Features

PESTO offers the following features (Supplementary Fig. S1): Multi-start local optimization and interfaces to global and hybrid optimizers Optimization-based, integration-based or hybrid profile calculation for uncertainty and identifiability analysis Several sampling methods for uncertainty and identifiability analysis Visualization of all analysis results Efficient work flow and optional parallelization High customizability PESTO is agnostic of the underlying model or objective function and can be used in conjunction with any deterministic optimization problem formulated as a MATLAB function. The objective function is treated as black box, no further assumptions are made. The optimization can be accelerated when gradient and Hessian of the objective function are provided. PESTO can be applied to many optimization problems within the systems biology field and beyond.

2.1 Optimization

Many biological problems are non-convex and exhibit several local optima of which usually the global one is of interest. These optima cannot be derived analytically and evaluating the whole parameter space is computationally prohibitive. Therefore, there are diverse stochastic, deterministic or hybrid algorithms available (Raue ). To deal with multiple local optima, PESTO provides a multi-start local optimization strategy which has been shown to perform well in terms of convergence and computation time for biological ODE and PDE models (Hross and Hasenauer, 2016; Raue ). The user can choose between different strategies for automated starting point selection, such as latin hypercube or uniform sampling, or provide custom starting points. Multi-start local optimization usually performs very well in terms of finding the assumingly global optimum (Raue ) (Supplementary Fig. S2). Nevertheless, PESTO additionally provides access to two global optimization toolboxes, MEIGO and PSwarm (Egea ; Vaz and Vicente, 2007) through a uniform interface.

2.2 Identifiability and uncertainty analysis

Optimization problems arising in biology are often ill-posed and the estimated parameters are subject to large uncertainties. PESTO provides profile calculation and Markov-chain Monte Carlo methods to assess the properties of the objective function and the uncertainty and practical identifiability of parameters as well as any functions thereof. For profile calculation, the user can choose between optimization- and integration-based or hybrid profile calculation methods which provide maximum projections of the objective function (Boiger ). The novel hybrid profile calculation method provides the robustness of optimization-based profile calculation and the efficiency of integration-based profile calculation for which, to the best of our knowledge, no other implementation exists so far. Markov-chain Monte Carlo methods sample from parameter distributions, e.g. posteriors arising in Bayesian inference. PESTO supports the classical Metropolis-Hastings algorithm, two adaptive Metropolis-Hastings algorithms, the Metropolis-adjusted Langevin algorithm as well as the schemes from the MATLAB-based Delayed Rejection Adaptive Metropolis (DRAM) toolbox (Haario ). Additionally, PESTO provides multi-chain algorithms parallel tempering (Lacki and Miasojedow, 2015; Vousden ) and parallel hierarchical sampling (Rigat and Mira, 2012). In case profile calculation and sampling are computationally too expensive, PESTO can perform uncertainty analysis based on local quadratic approximations which are, however, less reliable. PESTO is the first toolbox to provide generic implementations of several of the above-mentioned algorithms for MATLAB or in general (Supplementary Table S1).

2.3 Visualization

For all above-mentioned analyses, PESTO generates various kinds of result plots: line plots, parallel coordinate plots, scatter plots and bar plots, some of which are shown in Supplementary Figure S1. Plots for optimization, parameter sampling and profile calculation are updated continuously to monitor progress of the computations. All plots are extensively customizable and can be exported as vector or raster graphics.

3 Examples, documentation and scalability

PESTO comes with extensive documentation and examples demonstrating all of its functions. Examples include several ODE and PDE models from recent research in computational biology, such as Fröhlich or Boiger . The dimensionality of the example problems included in PESTO ranks among the highest currently found in systems biology and demonstrates the scalability of the implemented methods (Supplementary Fig. S3). PESTO has been successfully applied in more than 10 published studies referenced on the project website. Although these case studies come from computational biology, PESTO is also applicable in other research fields. Due to the high abstraction level of PESTO, it does not directly support systems biology model definition standards such as the systems biology markup language (SBML). However, such models can easily be analyzed using PESTO in combination with AMICI, a toolbox for scalable simulation of ODE systems (e.g. Fröhlich ). AMICI handles the time discretization necessary for ODEs and allows also for the simulation of discretized PDEs (Serban and Hindmarsh, 2005), which is demonstrated in the examples included in PESTO.

4 Implementation and availability

PESTO is implemented fully in MATLAB and compatible with recent MATLAB releases running on Windows, Mac and Linux. For local optimization, PESTO requires the MATLAB ‘Optimization Toolbox’ (MathWorks). With the MATLAB ‘Parallel Computing Toolbox’ or ‘Distributed Computing Toolbox’ (MathWorks) installed, several PESTO routines can make use of multi-core processors. PESTO is released under a BSD license. The source code, license terms, extensive documentation and example code are available on https://github.com/ICB-DCM/PESTO/.

5 Conclusion

Here we presented PESTO, a toolbox for many aspects of parameter estimation in MATLAB. Its strengths are the combination of many state-of-the-art methods which are applicable to a wide array of different parameter estimation problems through a single interface. This facilitates a synergistic use and an in-depth analysis of problems (Hug ).

Funding

This project has received funding through the European Union‘s Horizon 2020 research and innovation programme under grant agreement no. 686282, the German Federal Ministry of Education and Research under the grant agreement no. 01ZX1310B and the German Research Foundation via the Graduate School of Quantitative Biosciences Munich (QBM). Conflict of Interest: none declared. Click here for additional data file.
  5 in total

1.  High-dimensional Bayesian parameter estimation: case study for a model of JAK2/STAT5 signaling.

Authors:  S Hug; A Raue; J Hasenauer; J Bachmann; U Klingmüller; J Timmer; F J Theis
Journal:  Math Biosci       Date:  2013-04-16       Impact factor: 2.144

2.  Analysis of CFSE time-series data using division-, age- and label-structured population models.

Authors:  Sabrina Hross; Jan Hasenauer
Journal:  Bioinformatics       Date:  2016-03-08       Impact factor: 6.937

3.  Scalable Parameter Estimation for Genome-Scale Biochemical Reaction Networks.

Authors:  Fabian Fröhlich; Barbara Kaltenbacher; Fabian J Theis; Jan Hasenauer
Journal:  PLoS Comput Biol       Date:  2017-01-23       Impact factor: 4.475

4.  Lessons learned from quantitative dynamical modeling in systems biology.

Authors:  Andreas Raue; Marcel Schilling; Julie Bachmann; Andrew Matteson; Max Schelker; Max Schelke; Daniel Kaschek; Sabine Hug; Clemens Kreutz; Brian D Harms; Fabian J Theis; Ursula Klingmüller; Jens Timmer
Journal:  PLoS One       Date:  2013-09-30       Impact factor: 3.240

5.  MEIGO: an open-source software suite based on metaheuristics for global optimization in systems biology and bioinformatics.

Authors:  Jose A Egea; David Henriques; Thomas Cokelaer; Alejandro F Villaverde; Aidan MacNamara; Diana-Patricia Danciu; Julio R Banga; Julio Saez-Rodriguez
Journal:  BMC Bioinformatics       Date:  2014-05-10       Impact factor: 3.169

  5 in total
  23 in total

1.  Parameter Estimation and Uncertainty Quantification for Systems Biology Models.

Authors:  Eshan D Mitra; William S Hlavacek
Journal:  Curr Opin Syst Biol       Date:  2019-11-06

2.  Bayesian parameter estimation for biochemical reaction networks using region-based adaptive parallel tempering.

Authors:  Benjamin Ballnus; Steffen Schaper; Fabian J Theis; Jan Hasenauer
Journal:  Bioinformatics       Date:  2018-07-01       Impact factor: 6.937

3.  Bayesian calibration, process modeling and uncertainty quantification in biotechnology.

Authors:  Laura Marie Helleckes; Michael Osthege; Wolfgang Wiechert; Eric von Lieres; Marco Oldiges
Journal:  PLoS Comput Biol       Date:  2022-03-07       Impact factor: 4.475

4.  A protocol for dynamic model calibration.

Authors:  Alejandro F Villaverde; Dilan Pathirana; Fabian Fröhlich; Jan Hasenauer; Julio R Banga
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

5.  Combining hypothesis- and data-driven neuroscience modeling in FAIR workflows.

Authors:  Olivia Eriksson; Upinder Singh Bhalla; Kim T Blackwell; Sharon M Crook; Daniel Keller; Andrei Kramer; Marja-Leena Linne; Ausra Saudargienė; Rebecca C Wade; Jeanette Hellgren Kotaleski
Journal:  Elife       Date:  2022-07-06       Impact factor: 8.713

6.  Differential methods for assessing sensitivity in biological models.

Authors:  Rachel Mester; Alfonso Landeros; Chris Rackauckas; Kenneth Lange
Journal:  PLoS Comput Biol       Date:  2022-06-13       Impact factor: 4.779

7.  Relationship Between Dimensionality and Convergence of Optimization Algorithms: A Comparison Between Data-Driven Normalization and Scaling Factor-Based Methods Using PEPSSBI.

Authors:  Andrea Degasperi; Lan K Nguyen; Dirk Fey; Boris N Kholodenko
Journal:  Methods Mol Biol       Date:  2022

8.  Receptor-Driven ERK Pulses Reconfigure MAPK Signaling and Enable Persistence of Drug-Adapted BRAF-Mutant Melanoma Cells.

Authors:  Luca Gerosa; Christopher Chidley; Fabian Fröhlich; Gabriela Sanchez; Sang Kyun Lim; Jeremy Muhlich; Jia-Yun Chen; Sreeram Vallabhaneni; Gregory J Baker; Denis Schapiro; Mariya I Atanasova; Lily A Chylek; Tujin Shi; Lian Yi; Carrie D Nicora; Allison Claas; Thomas S C Ng; Rainer H Kohler; Douglas A Lauffenburger; Ralph Weissleder; Miles A Miller; Wei-Jun Qian; H Steven Wiley; Peter K Sorger
Journal:  Cell Syst       Date:  2020-10-27       Impact factor: 10.304

9.  Hierarchical optimization for the efficient parametrization of ODE models.

Authors:  Carolin Loos; Sabrina Krause; Jan Hasenauer
Journal:  Bioinformatics       Date:  2018-12-15       Impact factor: 6.937

10.  Individual variation in susceptibility or exposure to SARS-CoV-2 lowers the herd immunity threshold.

Authors:  M Gabriela M Gomes; Marcelo U Ferreira; Rodrigo M Corder; Jessica G King; Caetano Souto-Maior; Carlos Penha-Gonçalves; Guilherme Gonçalves; Maria Chikina; Wesley Pegden; Ricardo Aguas
Journal:  medRxiv       Date:  2022-02-14
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.