| Literature DB >> 25880925 |
Alejandro F Villaverde1, David Henriques2,3, Kieran Smallbone4, Sophia Bongard5, Joachim Schmid6, Damjan Cicin-Sain7,8, Anton Crombach9,10, Julio Saez-Rodriguez11, Klaus Mauch12, Eva Balsa-Canto13, Pedro Mendes14, Johannes Jaeger15,16, Julio R Banga17.
Abstract
BACKGROUND: Dynamic modelling is one of the cornerstones of systems biology. Many research efforts are currently being invested in the development and exploitation of large-scale kinetic models. The associated problems of parameter estimation (model calibration) and optimal experimental design are particularly challenging. The community has already developed many methods and software packages which aim to facilitate these tasks. However, there is a lack of suitable benchmark problems which allow a fair and systematic evaluation and comparison of these contributions.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25880925 PMCID: PMC4342829 DOI: 10.1186/s12918-015-0144-4
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Models
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| Model Ref | [ | [ | [ | [ | [ | [ |
| Cell |
|
|
| CHO | Generic |
|
|
| ||||||
| Description | Metabolic: | Metabolic: | Metabolic: CCM | Metabolic | Signal | Developmental |
| level | genome scale | CCM | & transcription | transduction | GRN (spatial) | |
| Parameters | 1759 | 116 | 178 | 117 | 86 | 37 |
| Dynamic states | 276 | 18 | 47 | 34 | 26 | 108 –212 |
| Observed states | 44 | 9 | 47 | 13 | 6 | 108 –212 |
| Experiments | 1 | 1 | 1 | 1 | 10 | 1 |
| Data points | 5280 | 110 | 7567 | 169 | 96 | 1804 |
| Data type | simulated | measured | simulated | simulated | simulated | measured |
| Noise level |
| real | no noise | variable |
| real |
Main features of the benchmark models. The standard deviation, σ, is given in those cases where artificial noise with constant variance was added to the data.
Figure 1Benchmark 2: sensitivities. The two panels on top show the local rank of the parameters, i.e., the parameters ordered in decreasing order of their influence on the system’s behaviour (, as defined in equations (9) and (10)). Note that the middle panel is a continuation of the upper one with a smaller y-axis scale. The array in the bottom panel shows the sensitivity of the 9 state variables (metabolite concentrations, in columns) of the model with respect to the 116 parameters. The colour bar in the right shows the sensitivity range: high sensitivities are plotted in red, low sensitivities in blue.
Figure 2Benchmark 3. Histograms of local searches. The X axis shows the values of the solutions found by the DHC local method, and the Y axis shows their frequency. Of the total of 1000 local searches launched, only the 188 that converged are shown.
Figure 3Convergence curves. Representative results of parameter estimation runs of the six benchmarks, carried out with the eSS method. The curves plot the (logarithmic) objective function value as a function of the (logarithmic) computation time. For ease of visualization, the values in the curves have been divided by the final value reached by each of them, i.e. the y axis plots J/J . Note that, since the benchmarks have different number of variables and data points, and different noise levels, the objective function values are not equivalent for different models. Results obtained on a computer with Intel Xeon Quadcore processor, 2.50 GHz, using Matlab 7.9.0.529 (R2009b) 32-bit.
Figure 4Dispersion of convergence curves. Results of 20 parameter estimation runs of the B4 benchmark (CHO cells) with the eSS method. The figures plot the objective function value as a function of the computation time (in log-log scale). Results obtained on a computer with Intel Xeon Quadcore processor, 2.50 GHz, using Matlab 7.9.0.529 (R2009b) 32-bit.
Parameter estimation with eSS (AMIGO implementation): settings and results
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| 5· |
|
| 5· | varying | varying |
|
| 0.2· |
|
| 0.2· | varying | varying |
| Local method | DHC | FMINCON | none | FMINCON | DHC | FMINCON |
| CPU time | ≈170 hours | ≈3 hours | ≈336 hours | ≈1 hour | ≈16 hours | ≈24 hours |
| Evaluations | 6.9678·105 | 9.0728·104 | 7.2193·106 | 1.6193·105 | 8.8393·104 | 2.0751·106 |
|
| 5.8819·109 | 3.1136·104 | 4.6930·1016 | 6.6034·108 | 3.1485·104 | 8.5769·105 |
|
| 1.3753·104 | 2.3390·102 | 3.7029·10−1 | 4.5718·101 | 3.0725·103 | 1.0833·105 |
|
| 1.0846·106 | − | 0 | 3.9068·101 | 4.2737·103 | − |
|
| 3.5834·101 | 8.5995·10−2 | 3.5457·101 | 4.8005·101 | 4.0434·101 | 2.3808·102 |
|
| 5.7558 | 2.4921 | 2.9298·10−1 | 2.8010 | 2.7430·101 | 1.6212·102 |
|
| 3.8203 | − | 0 | 2.8273 | 3.0114·101 | − |
Optimization settings and results obtained for each of the benchmarks with the eSS method, using the implementation provided in the AMIGO toolbox. In some cases the lower (p ) and upper (p ) bounds in the parameters are specified as a function of the nominal parameter vector, p . There may be exceptions to these bounds, in cases where it makes sense biologically to have a different range of values (e.g. Hill coefficients in the range of 1–12). Cases with exceptions are marked by (. In other cases all the parameters have specific bounds; this is marked as “varying”. The initial objective function value, J 0, corresponds to the parameter vector p 0 used as initial guess in the optimizations, which is randomly selected between the bounds p and p . The only exception is benchmark B2, where p 0 is the parameter vector reported in the original publication. The final value achieved in the optimizations is J f, and the value obtained with the nominal parameter vector is J . More details about the definition of the objective functions J are given in section “Problem statement”. NRMSE is the cumulative normalized root-mean-square error as defined in eq. (12); the subscripts (, , ) have the same meaning as in the objective functions J. Results obtained on a computer with Intel Xeon Quadcore processor, 2.50 GHz, using Matlab 7.9.0.529 (R2009b) 32-bit.
Figure 5Benchmark 5. Data fits: time courses. Pseudo-experimental data (red circles) vs. optimal solution (solid blue lines) for the 6 observed states. X axis: time [minutes]. Y axis: activation level [0 ÷1].
Figure 6Benchmark 4, typical parameter estimation results. (A) Optimal vs. nominal parameters. (B) Pseudo-experimental (“measured states”) vs. simulated data (“predicted states”). (C) Errors in the parameters: histogram of the differences between the nominal parameter vector and the optimal solution, in %. (D) Errors in the predictions: histogram of the difference between pseudo-experimental and simulated data, in %.