| Literature DB >> 30624608 |
Helge Hass1,2, Carolin Loos3,4, Elba Raimúndez-Álvarez3,4, Jens Timmer1,2,5,6, Jan Hasenauer3,4, Clemens Kreutz1,2,5.
Abstract
MOTIVATION: Dynamic models are used in systems biology to study and understand cellular processes like gene regulation or signal transduction. Frequently, ordinary differential equation (ODE) models are used to model the time and dose dependency of the abundances of molecular compounds as well as interactions and translocations. A multitude of computational approaches, e.g. for parameter estimation or uncertainty analysis have been developed within recent years. However, many of these approaches lack proper testing in application settings because a comprehensive set of benchmark problems is yet missing.Entities:
Mesh:
Year: 2019 PMID: 30624608 PMCID: PMC6735869 DOI: 10.1093/bioinformatics/btz020
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Table summarizing the 20 benchmark models and their properties
| Name | Description | Biochemical species | Observables | Data points | Experimental conditions | Parameters | Features |
|---|---|---|---|---|---|---|---|
| Bachmann | The model by | 25 | 11 | 542 | 23 | 113 | C, |
| Becker | The model by | 6 | 4 | 85 | 13 | 16 |
|
| Beer | The model by | 4 | 2 | 27 132 | 19 | 72 |
|
| Boehm | The model by | 8 | 3 | 48 | 1 | 9 | C, |
| Brannmark | The model by | 9 | 2 | 43 | 8 | 22 |
|
| Bruno | The model by | 7 | 6 | 77 | 6 | 13 | Ex, |
| Chen | The model by | 500 | 3 | 105 | 4 | 154 |
|
| Crauste | The model by | 5 | 4 | 21 | 1 | 12 | Ex, NI, |
| Fiedler | The model by | 6 | 2 | 72 | 3 | 19 |
|
| Fujita | The model by | 9 | 3 | 144 | 6 | 19 | Ex, ev, NI, |
| Hass | The model by | 9 | 6 | 221 | 17 | 49 | Ex, ev, |
| Isensee | The model by | 25 | 3 | 713 | 109 | 46 | C, |
| Lucarelli | The model by | 33 | 43 | 1755 | 12 | 84 |
|
| Merkle | The model by | 23 | 22 | 1141 | 62 | 197 | C, |
| Raia | The model by | 14 | 8 | 205 | 4 | 39 | C, |
| Schwen | The model by | 11 | 4 | 292 | 7 | 30 |
|
| Sobotta | The model by | 13 | 11 | 2220 | 110 | 260 | C, |
| Swameye | The model by | 9 | 3 | 46 | 1 | 13 | C, Ex, NI, |
| Weber | The model by | 7 | 8 | 135 | 3 | 36 |
|
| Zheng | The model is adapted from | 15 | 15 | 60 | 1 | 46 |
|
Note: The models are abbreviated with the last name of the first author. Many models are based on Western blot data. Number of parameters denotes unknown parameters that are estimated in the model. The number of experimental conditions is specified as the number of different simulation conditions. The feature abbreviations denote the following: C = several compartments, = constant error parameters, Eq. (3), = error model of Eq. (4), = error model of Eq. (5), Ex = known measurement errors, ev = events, NI = non-identifiable parameters, u(t) = time dependent input function, = input function with unknown parameter(s). Initial values are specified according to the following order: = known initial values, = initial condition given by unknown parameters, = parameter dependent functions and = pre-equilibration for initial steady state conditions. The models are described in more detail in Supplementary Section S4.
Fig. 1.Property distribution in the presented benchmark collection. (A) Histograms for numerical model properties: number of observables, conditions, data points and parameters. Properties of individual models are indicated with an overlayed parallel coordinate plot. The parallel coordinates facilitate highlight correlations: most lines are parallel positive correlations; most lines cross negative correlation. (B) Mosaic plot for the categoric model properties: initial conditions (indicated by columns), error models (indicated by colors) and input functions (indicated by saturation levels). The areas encode the percentage of models with a particular combination of properties. Combinations of model properties which are not observed are crossed out in the legend. Non-analytical parameter-dependent initial conditions cannot be solved analytically and are obtained by simulating the system to steady state (Color version of this figure is available at Bioinformatics online.)
Fig. 2.Linear versus logarithmic scale. (A) Performance of the multi-start local optimization scheme using the MATLAB optimizer lsqnonlin for: (x-axis) sampling of initial values in log scale and optimization in linear scale; and (y-axis) sampling and optimization in log scale. Performance is measured as average number of converged starts per minute. (B) Level-sets of the objective function for a synthesis-degradation process described in the Supplementary Section S6. (top) The level-sets in linear parameter are non-convex, implying that the objective function is non-convex. (bottom) The level sets in log-transformed parameters are convex. (C) Convexity properties of the benchmark problems in linear parameters and log-transformed parameters. It is indicated whether the two parameters are sampled in linear or log space and whether the connection between the two parameters is checked in linear or log space. Statistically significant differences are shown (P-value for rank sum test, * < 0.05, ** <0.01)
Fig. 3.Comparison of optimizer performance. Scatter plot of the average number of converged starts per minute for the interior-point algorithm versus trust-region-reflective algorithm
Fig. 4.Influence of problem size. (A) Average number of optimizer iterations and (B) average computation time versus the number of parameters. For optimization the trust-region-reflective algorithm implemented in the MATLAB function lsqnonlin was used and the averages across 1000 runs with different starting points were computed. The influence of the number of parameters was analyzed using correlation analysis and linear regression
Fig. 5.Eigenvalue spectra of the Hessians of the log-likelihood. Each spectrum was normalized by dividing through the maximal eigenvalue. According to the literature, a model is termed sloppy, if the eigenvalues spread over more than six orders of magnitude. This range is indicated by gray shading. The spectra of non-identifiable models are plotted in red. For our depiction at the log-scale, eigenvalues which are smaller than after normalization with respect to the maximal eigenvalue were set to and occur as line at the bottom of each panel (Color version of this figure is available at Bioinformatics online.)