| Literature DB >> 31842943 |
Clemens Kreutz1,2.
Abstract
Insufficient performance of optimization-based approaches for the fitting of mathematical models is still a major bottleneck in systems biology. In this article, the reasons and methodological challenges are summarized as well as their impact in benchmark studies. Important aspects for achieving an increased level of evidence for benchmark results are discussed. Based on general guidelines for benchmarking in computational biology, a collection of tailored guidelines is presented for performing informative and unbiased benchmarking of optimization-based fitting approaches. Comprehensive benchmark studies based on these recommendations are urgently required for the establishment of a robust and reliable methodology for the systems biology community.Entities:
Keywords: Benchmarking; Differential equation models; Optimization; Parameter estimation; Systems biology
Mesh:
Year: 2019 PMID: 31842943 PMCID: PMC6915982 DOI: 10.1186/s13059-019-1887-9
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Tasks to be accomplished for fitting ODE models. The fitting of ODE models requires several generic tasks. The optimization problem has to be defined in terms of bounds of the search space and geometry (e.g., linear vs. log scale). Moreover, the selected generic optimization algorithm applied as the core of optimization-based fitting has to be initialized. There are many ways of combining global and local search strategies. A prominent global search strategy is random drawing of multiple initial guesses and performing local optimization for each starting point. In each optimization step of a local optimization run, the ODEs have to be solved for the evaluation of the objective function χ2(θ). Incremental improvement strategies are applied for suggesting a new parameter vector for the next iteration step in the core optimization routine which is usually performed based on derivatives or approximations thereof
Fig. 2Impact of hyperparameters. Multiple configurations can have an impact on performance. In this illustration example, two optimization approaches have different sensitivities with respect to the choice of tolerances controlling the numerical error of ODE integration. Moreover, both approaches have different optimal choices for this hyperparameter. For most integration tolerances, approach B is superior. However, approach A displays the overall best performance for optimally chosen tolerances. This illustration example highlights the importance of the evaluation of hyperparameters for drawing valid conclusions
Covariates
| Abbreviation | Covariate | Typical possible choices |
|---|---|---|
| C1 | Application problem | Model equations and the data set(s) |
| C2 | Primary performance criteria | Convergence per computation time, iteration steps |
| C3 | Secondary performance criteria | Documentation, user-friendliness, code quality |
| C4 | Parameter scale | Linear vs. log scale |
| C5 | Global search strategy | Multiple initial guesses, scatter search algorithms, stochastic search |
| C6 | Initial guess strategy | Fixed vs. random, normally distributed vs. uniform vs. latin-hypercube |
| C7 | Parameter constraints | Upper and lower bounds |
| C8 | Prior knowledge | None vs. (weakly) informative priors |
| C9 | ODE integration implementation | SUNDIALS, Matlab, R |
| C10 | ODE integration algorithm | Stiff vs. non-stiff approaches, Adams-Moulton vs. BDF |
| C11 | Integration accuracy | ODE integrator tolerances |
| C12 | Derivative calculation | Finite differences, sensitivity equations, adjoint sensitivities |
| C13 | Stopping rule | Optimization termination criteria |
| C14 | Handling of non-converging ODE integration | Termination of optimization vs. infinite loss |
| C15 | Algorithm-specific configurations | Cross-over rate, annealing temperature, number of particles |
The performance of an optimization approaches depend on many decisions and configurations C1–C15. For the comparison of several approaches, these attributes appear as covariates. Performance benefits for individual choices do not necessarily indicate a general advantage because benefits might merely originate from the chosen configurations
Fig. 3Ambiguous interpretation of optimization outcomes. For non-trivial optimization problems, the results of independent optimization runs are typically not the same. The upper left panel indicates an outcome for three optimization runs, e.g., generated with different starting points. If the objective function values after optimization are different (scenario A), such an outcome could be explained by local optima (explanation A1) or by convergence problems of the optimization algorithm (explanation A2). If the same values are obtained for objective function, there might be several local optima with the same value of the objective function (explanation B1), there might be a convergence problem (B2), or non-identifiabilities might exist (B3), i.e., the estimated parameters are not uniquely specified by the data and then the same value of the objective function is achieved in multi-dimensional sub-spaces