| Literature DB >> 28952528 |
Rudiyanto Gunawan1,2, Sandro Hutter3,4.
Abstract
Metabolic flux analysis (MFA) is an indispensable tool in metabolic engineering. The simplest variant of MFA relies on an overdetermined stoichiometric model of the cell's metabolism under the pseudo-steady state assumption to evaluate the intracellular flux distribution. Despite its long history, the issue of model error in overdetermined MFA, particularly misspecifications of the stoichiometric matrix, has not received much attention. We evaluated the performance of statistical tests from linear least square regressions, namely Ramsey's Regression Equation Specification Error Test (RESET), the F-test, and the Lagrange multiplier test, in detecting model misspecifications in the overdetermined MFA, particularly missing reactions. We further proposed an iterative procedure using the F-test to correct such an issue. Using Chinese hamster ovary and random metabolic networks, we demonstrated that: (1) a statistically significant regression does not guarantee high accuracy of the flux estimates; (2) the removal of a reaction with a low flux magnitude can cause disproportionately large biases in the flux estimates; (3) the F-test could efficiently detect missing reactions; and (4) the proposed iterative procedure could robustly resolve the omission of reactions. Our work demonstrated that statistical analysis and tests could be used to systematically assess, detect, and resolve model misspecifications in the overdetermined MFA.Entities:
Keywords: Chinese hamster ovary cell culture; constraint-based model; metabolic flux analysis; model misspecification; stoichiometric model
Year: 2017 PMID: 28952528 PMCID: PMC5590471 DOI: 10.3390/bioengineering4020048
Source DB: PubMed Journal: Bioengineering (Basel) ISSN: 2306-5354
Figure 1Chinese hamster ovary metabolic network model in Case Study I and III (adapted from [16,27]). The dashed arrows indicate the measured exchange (uptake) fluxes. The magnitudes of the flux estimates are indicated by the thickness of the arrows, while the colors of the arrows represent the average relative magnitude of the specification bias caused by the removal of the reaction.
Case Study I: Specification bias in the Chinese hamster ovary (CHO) model.
| Reaction a | Absolute Specification Bias (%) d | |||||
|---|---|---|---|---|---|---|
| Min | Median | Mean | Max | |||
| 25 | −0.02 | 0.00 ± 0.00 | 0.00 | 0.41 | 2.73 | 54.1 |
| 19 | 0.03 | 0.00 ± 0.00 | 0.00 | 0.39 | 2.48 | 48.8 |
| 10 | −1.46 | 0.00 ± 0.00 | 0.00 | 0.15 | 1.96 | 11.6 |
| 45 | −0.21 | 0.00 ± 0.00 | 0.00 | 2.04 | 18.3 | 269 |
| 17 | −0.21 | 0.00 ± 0.00 | 0.00 | 2.11 | 19.0 | 280 |
| 31 | −0.24 | 0.00 ± 0.00 | 0.00 | 2.83 | 24.9 | 361 |
| 27 | 0.34 | 0.00 ± 0.00 | 0.00 | 2.12 | 15.6 | 229 |
| 14 | 12.50 | 0.00 ± 0.00 | 0.00 | 1.31 | 33.3 | 855 |
| 9 | 12.50 | 0.00 ± 0.00 | 0.00 | 1.31 | 33.3 | 855 |
| 46 | 15.04 | 0.00 ± 0.00 | 0.00 | 0.88 | 38.1 | 1020 |
| 8 | 15.04 | 0.00 ± 0.00 | 0.00 | 0.88 | 38.1 | 1020 |
| 37 | 0.27 | 0.00 ± 0.00 | 0.00 | 5.86 | 54.1 | 753 |
| 12 | 17.42 | 0.00 ± 0.00 | 0.02 | 1.28 | 43.1 | 1190 |
| 11 | 17.84 | 0.00 ± 0.00 | 0.02 | 1.09 | 44.0 | 1220 |
| 13 | 18.06 | 0.00 ± 0.00 | 0.02 | 1.66 | 46.7 | 1230 |
| 30 | −0.27 | 0.00 ± 0.00 | 0.00 | 6.87 | 63.8 | 889 |
| 24 | −0.38 | 0.00 ± 0.00 | 0.00 | 6.67 | 60.9 | 860 |
| 26 | 0.27 | 0.00 ± 0.00 | 0.00 | 4.19 | 36.1 | 509 |
| 35 | 0.13 | 0.00 ± 0.00 | 0.00 | 3.12 | 28.8 | 399 |
| 33 | 0.22 | 0.00 ± 0.00 | 0.00 | 11.7 | 124 | 2060 |
| 32 b | −1.18 | 0.01 ± 0.01 | 0.00 | 21.0 | 196 | 2770 |
| 29 b | 0.99 | 0.01 ± 0.01 | 0.00 | 13.9 | 170 | 3840 |
| 34 b | 0.47 | 0.01 ± 0.01 | 0.00 | 16.2 | 152 | 2110 |
| 36 b | 0.87 | 0.02 ± 0.01 | 0.00 | 23.1 | 217 | 3020 |
| 23 b | −1.34 | 0.02 ± 0.01 | 0.00 | 17.1 | 177 | 2470 |
| 28 b | 1.03 | 0.02 ± 0.01 | 0.00 | 17.6 | 158 | 2220 |
| 15 | 12.31 | 0.05 ± 0.02 | 0.00 | 14.3 | 121 | 2210 |
| 4 | 1.24 | 0.05 ± 0.02 | 0.00 | 5.25 | 65.1 | 2420 |
| 21 | −6.81 | 0.13 ± 0.03 | 0.00 | 53.9 | 475 | 6980 |
| 16 | 19.26 | 0.15 ± 0.03 | 0.00 | 61.4 | 573 | 8100 |
| 18 | −21.53 | 0.20 ± 0.04 | 0.00 | 61.6 | 632 | 8830 |
| 43 | 19.52 | 0.46 ± 0.04 | 0.00 | 74.0 | 477 | 8050 |
| 22 | 7.24 | 0.47 ± 0.04 | 0.00 | 121 | 988 | 14,700 |
| 7 | 19.63 | 0.52 ± 0.04 | 0.00 | 21.8 | 114 | 2360 |
| 2 | 157.77 | 0.97 ± 0.01 | 0.00 | 1.28 | 803 | 21,600 |
| 3 | 315.55 | 0.97 ± 0.01 | 0.00 | 1.28 | 803 | 21,600 |
The reaction numbers refer to the CHO metabolic network in Figure 1; b The omission of reactions with a low flux could cause large specification biases in the flux estimate; c The significance of regression was assessed by ANOVA. The average p value (mean ± standard error) was computed for 10,000 GLS regressions using independently generated in silico data; d The minimum, median, mean, and maximum biases were computed over the remaining reaction fluxes in the model.
Case Study II: Performance of model misspecification tests (values represent rates).
| CoV | RESET Test (p = 1) | RESET Test (p = 2) | F-Test | LM Test | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| TP | FN | FP | TN | TP | FN | FP | TN | TP | FN | FP | TN | TP | FN | FP | TN | |||||
| 0.18 | 0.82 | 0.56 | 0.44 | 0.33 | 0.67 | 0.75 | 0.25 | 0.86 | 0.14 | 0.09 | 0.91 | 0.68 | 0.32 | 0.11 | 0.89 | |||||
| 0.28 | 0.72 | 0.57 | 0.43 | 0.44 | 0.56 | 0.78 | 0.22 | 0.82 | 0.19 | 0.09 | 0.91 | 0.67 | 0.33 | 0.14 | 0.86 | |||||
| 0.32 | 0.69 | 0.58 | 0.42 | 0.51 | 0.49 | 0.76 | 0.24 | 0.82 | 0.19 | 0.10 | 0.90 | 0.66 | 0.34 | 0.16 | 0.84 | |||||
| 0.42 | 0.58 | 0.56 | 0.44 | 0.69 | 0.31 | 0.81 | 0.19 | 0.71 | 0.29 | 0.08 | 0.92 | 0.60 | 0.41 | 0.18 | 0.82 | |||||
| 0.11 | 0.89 | 0.57 | 0.43 | 0.33 | 0.67 | 0.76 | 0.25 | 0.99 | 0.01 | 0.14 | 0.87 | 0.71 | 0.29 | 0.07 | 0.93 | |||||
| 0.12 | 0.88 | 0.54 | 0.46 | 0.34 | 0.67 | 0.73 | 0.27 | 0.98 | 0.02 | 0.12 | 0.88 | 0.73 | 0.27 | 0.06 | 0.94 | |||||
| 0.19 | 0.81 | 0.54 | 0.46 | 0.41 | 0.59 | 0.75 | 0.25 | 0.97 | 0.03 | 0.13 | 0.87 | 0.71 | 0.29 | 0.11 | 0.90 | |||||
| 0.29 | 0.71 | 0.55 | 0.45 | 0.58 | 0.42 | 0.82 | 0.19 | 0.93 | 0.07 | 0.11 | 0.89 | 0.70 | 0.30 | 0.12 | 0.88 | |||||
| 0.11 | 0.89 | 0.57 | 0.43 | 0.40 | 0.60 | 0.73 | 0.27 | 1.00 | 0.00 | 0.11 | 0.89 | 0.47 | 0.53 | 0.00 | 1.00 | |||||
| 0.13 | 0.87 | 0.57 | 0.43 | 0.42 | 0.58 | 0.76 | 0.24 | 1.00 | 0.00 | 0.10 | 0.90 | 0.48 | 0.52 | 0.01 | 0.99 | |||||
| 0.16 | 0.84 | 0.54 | 0.46 | 0.47 | 0.53 | 0.75 | 0.26 | 1.00 | 0.00 | 0.13 | 0.87 | 0.48 | 0.52 | 0.01 | 0.99 | |||||
| 0.26 | 0.74 | 0.57 | 0.43 | 0.57 | 0.43 | 0.79 | 0.21 | 0.99 | 0.01 | 0.12 | 0.88 | 0.44 | 0.56 | 0.01 | 0.99 | |||||
Case Study II: Additional misspecification tests using the F-test (values represent rates).
| m | CoV | TP | FN | FP | TN | |||
|---|---|---|---|---|---|---|---|---|
| 0.01 | 0.86 | 0.14 | 0.11 | 0.89 | ||||
| 0.05 | 0.82 | 0.18 | 0.10 | 0.90 | ||||
| 0.1 | 0.75 | 0.25 | 0.09 | 0.91 | ||||
| 0.2 | 0.69 | 0.31 | 0.09 | 0.91 | ||||
| 0.01 | 0.99 | 0.01 | 0.10 | 0.90 | ||||
| 0.05 | 0.98 | 0.02 | 0.10 | 0.90 | ||||
| 0.1 | 0.97 | 0.03 | 0.10 | 0.90 | ||||
| 0.2 | 0.92 | 0.08 | 0.11 | 0.89 | ||||
| 0.01 | 1.00 | 0.00 | 0.10 | 0.90 | ||||
| 0.05 | 1.00 | 0.00 | 0.09 | 0.91 | ||||
| 0.1 | 1.00 | 0.00 | 0.09 | 0.91 | ||||
| 0.2 | 0.99 | 0.02 | 0.11 | 0.90 | ||||
| 0.01 | 0.76 | 0.24 | 0.11 | 0.89 | ||||
| 0.05 | 0.73 | 0.27 | 0.10 | 0.90 | ||||
| 0.1 | 0.67 | 0.33 | 0.07 | 0.93 | ||||
| 0.2 | 0.58 | 0.42 | 0.10 | 0.90 | ||||
| 0.01 | 0.97 | 0.03 | 0.16 | 0.84 | ||||
| 0.05 | 0.95 | 0.05 | 0.11 | 0.89 | ||||
| 0.1 | 0.94 | 0.07 | 0.13 | 0.87 | ||||
| 0.2 | 0.88 | 0.12 | 0.13 | 0.88 | ||||
| 0.01 | 1.00 | 0.00 | 0.15 | 0.85 | ||||
| 0.05 | 0.99 | 0.01 | 0.16 | 0.84 | ||||
| 0.1 | 1.00 | 0.01 | 0.13 | 0.87 | ||||
| 0.2 | 0.98 | 0.02 | 0.15 | 0.85 | ||||
| 0.01 | 1.00 | 0.00 | 0.14 | 0.86 | ||||
| 0.05 | 1.00 | 0.00 | 0.14 | 0.86 | ||||
| 0.1 | 1.00 | 0.00 | 0.15 | 0.86 | ||||
| 0.2 | 1.00 | 0.00 | 0.14 | 0.86 |
Case Study III: Iterative procedure for resolving model misspecification in the CHO model.
| Number of Remaining Reactions a | ||||
|---|---|---|---|---|
| Extra Reactions | Omitted Reactions | |||
| 1 | 3 | 3 | 2.82 ± 0.38 | 0.99 ± 0.10 |
| 5 | 5 | 4.13 ± 0.63 | 1.34 ± 0.46 | |
| 8 | 8 | 5.89 ± 0.83 | 2.21 ± 0.48 | |
| 1 then 2 | 5 | 5 | 3.66 ± 0.59 | 0.97 ± 0.17 |
| 8 | 8 | 5.03 ± 0.70 | 1.00 ± 0.29 | |
The number of remaining reactions (mean ± standard error) corresponds to the average over 100 generations of the stoichiometric matrix , of the median number across 100 in silico data simulations.