| Literature DB >> 28098205 |
Ding Ma1, Laurence Yang2, Ronan M T Fleming3, Ines Thiele3, Bernhard O Palsson2,4, Michael A Saunders1.
Abstract
Constraint-Based Reconstruction and Analysis (COBRA) is currently the only methodology that permits integrated modeling of Metabolism and macromolecular Expression (ME) at genome-scale. Linear optimization computes steady-state flux solutions to ME models, but flux values are spread over many orders of magnitude. Data values also have greatly varying magnitudes. Standard double-precision solvers may return inaccurate solutions or report that no solution exists. Exact simplex solvers based on rational arithmetic require a near-optimal warm start to be practical on large problems (current ME models have 70,000 constraints and variables and will grow larger). We have developed a quadruple-precision version of our linear and nonlinear optimizer MINOS, and a solution procedure (DQQ) involving Double and Quad MINOS that achieves reliability and efficiency for ME models and other challenging problems tested here. DQQ will enable extensive use of large linear and nonlinear models in systems biology and other applications involving multiscale data.Entities:
Mesh:
Year: 2017 PMID: 28098205 PMCID: PMC5241643 DOI: 10.1038/srep40863
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Runtime options for MINOS in each step of the DQQ procedure.
| Default | Step D | Step Q1 | Step Q2 | |
|---|---|---|---|---|
| Precision | Double | Double | Quad | Quad |
| Scale | Yes | Yes | Yes | No |
| Feasibility tol | 1e − 6 | 1e − 7 | 1e − 15 | 1e − 15 |
| Optimality tol | 1e − 6 | 1e − 7 | 1e − 15 | 1e − 15 |
| Expand frequency | 10000 | 100000 | 100000 | 100000 |
| LU Factor tol | 100.0 | 1.9 | 10.0 | 5.0 |
| LU Update tol | 10.0 | 1.9 | 10.0 | 5.0 |
Three large ME biochemical network models TMA_ME, GlcAerWT, GlcAlift111527.
| ME model | TMA_ME | GlcAerWT | GlcAlift |
|---|---|---|---|
| 18210 | 68300 | 69529 | |
| 17535 | 76664 | 77893 | |
| nnz( | 336302 | 926357 | 928815 |
| max | | 2.1e + 04 | 8.0e + 05 | 2.6e + 05 |
| || | 5.9e + 00 | 6.3e + 07 | 6.3e + 07 |
| || | 1.1e + 00 | 2.4e + 07 | 2.4e + 07 |
| D itns | 21026 | 47718 | 93857 |
| D time | 350.9 | 10567.8 | 15913.7 |
| Q1 itns | 597 | 4287 | 1631 |
| Q1 time | 29.0 | 1958.9 | 277.3 |
| Q2 itns | 0 | 4 | 1 |
| Q2 time | 5.4 | 72.1 | 44.0 |
| Total time | 385 | 12599 | 16235 |
Dimensions of m × n constraint matrices S, size of the largest optimal primal and dual variables v*, y*, number of iterations and runtimes in seconds for each step, and the total runtime of each model.
Three large ME biochemical network models TMA_ME, GlcAerWT, GlcAlift111527.
| ME model | Step | Objective | Pinf | Dinf |
|---|---|---|---|---|
| TMA_ME | D | 8.3789966820e − 07 | −06 | −05 |
| Q1 | 8.7036315385e − 07 | −25 | −32 | |
| Q2 | 8.7036315385e − 07 | − | ||
| GlcAerWT | D | −6.7687059922e + 05 | −04 | +00 |
| Q1 | −7.0382449681e + 05 | −07 | −26 | |
| Q2 | −7.0382449681e + 05 | − | − | |
| GlcAlift | D | −5.3319574961e + 05 | −03 | −01 |
| Q1 | −7.0434008750e + 05 | −08 | −22 | |
| Q2 | −7.0434008750e + 05 | − | − |
Optimal objective value of each step, Pinf and Dinf = final maximum primal and dual infeasibilities (log10 values tabulated, except – means 0). Bold figures show the final (step Q2) Pinf and Dinf.
Three pilot models from Netlib46 and eight problematic problems from Mészáros47.
| model | nnz( | max| | || | || | ||
|---|---|---|---|---|---|---|
| pilot4 | 411 | 1000 | 5145 | 2.8e + 04 | 9.6e + 04 | 2.7e + 02 |
| pilot | 1442 | 3652 | 43220 | 1.5e + 02 | 4.1e + 03 | 2.0e + 02 |
| pilot87 | 2031 | 4883 | 73804 | 1.0e + 03 | 2.4e + 04 | 1.1e + 01 |
| de063155 | 853 | 1488 | 5405 | 8.3e + 11 | 3.1e + 13 | 6.2e + 04 |
| de063157 | 937 | 1488 | 5551 | 2.3e + 18 | 2.3e + 17 | 6.2e + 04 |
| de080285 | 937 | 1488 | 5471 | 9.7e + 02 | 1.1e + 02 | 2.6e + 01 |
| gen1 | 770 | 2560 | 64621 | 1.0e + 00 | 3.0e + 00 | 1.0e + 00 |
| gen2 | 1122 | 3264 | 84095 | 1.0e + 00 | 3.3e + 00 | 1.0e + 00 |
| gen4 | 1538 | 4297 | 110174 | 1.0e + 00 | 3.0e + 00 | 1.0e + 00 |
| l30 | 2702 | 15380 | 64790 | 1.8e + 00 | 1.0e + 09 | 4.2e + 00 |
| iprob | 3002 | 3001 | 12000 | 9.9e + 03 | 3.1e + 02 | 1.1e + 00 |
Dimensions of m × n constraint matrices S, size of the largest nonzero in S, and norm of the optimal primal and dual variables v*, y*.
Iterations and runtimes in seconds for steps D, Q1, Q2 on the problems of Table 4.
| model | Itns | Times | Final objective | Pinf | Dinf |
|---|---|---|---|---|---|
| pilot4 | 1464 | 0.1 | −2.5811392619e + 03 | −05 | −12 |
| 7 | 0.0 | −2.5811392589e + 03 | −52 | −31 | |
| 0 | 0.0 | −2.5811392589e + 03 | — | − | |
| pilot | 16060 | 9.0 | −5.5739887685e + 02 | −06 | −03 |
| 29 | 0.3 | −5.5748972928e + 02 | — | −32 | |
| 0 | 0.1 | −5.5748972928e + 02 | − | ||
| pilot87 | 19340 | 22.6 | 3.0171038489e + 02 | −08 | −06 |
| 32 | 0.9 | 3.0171034733e + 02 | — | −32 | |
| 0 | 0.6 | 3.0171034733e + 02 | − | ||
| de063155 | 973 | 0.1 | 1.8968895791e + 10 | −14 | +03 |
| 90 | 0.1 | 9.8830944565e + 09 | — | −27 | |
| 0 | 0.0 | 9.8830944565e + 09 | − | ||
| de063157 | 1473 | 0.1 | 2.6170359397e + 12 | — | +08 |
| 286 | 0.2 | 2.1528501109e + 07 | −29 | −12 | |
| 0 | 0.0 | 2.1528501109e + 07 | − | ||
| de080285 | 418 | 0.0 | 1.4495817688e + 01 | −09 | −02 |
| 132 | 0.1 | 1.3924732864e + 01 | −35 | −32 | |
| 0 | 0.0 | 1.3924732864e + 01 | − | ||
| gen1 | 303212 | 156.9 | −8.1861282705e − 08 | −06 | −13 |
| 216746 | 3431.2 | 1.2939275026e − 06 | −12 | −31 | |
| 8304 | 112.5 | 1.2953925804e − 06 | − | − | |
| gen2 | 45905 | 60.0 | 3.2927907833e + 00 | −04 | −12 |
| 2192 | 359.9 | 3.2927907840e + 00 | — | −29 | |
| 0 | 10.4 | 3.2927907840e + 00 | − | ||
| gen4 | 38111 | 151.3 | −1.2724113149e − 07 | −07 | −12 |
| 58118 | 6420.2 | 2.8932557999e − 06 | −12 | −31 | |
| 50 | 4.3 | 2.8933064888e − 06 | − | − | |
| l30 | 1302602 | 805.6 | 9.5266141670e − 01 | −08 | −09 |
| 500000 | 6168.8 | −4.5793509329e − 26 | −25 | −00 | |
| 16292 | 204.4 | −6.6656750251e − 26 | − | − | |
| iprob | 1087 | 0.2 | 2.6891551285e + 03 | +02 | −11 |
| 0 | 0.0 | 2.6891551285e + 03 | +02 | −30 | |
| 0 | 0.0 | 2.6891551285e + 03 | +02 | − |
Pinf and Dinf = final maximum primal and dual infeasibilities (log10 values tabulated, except – means 0). Problem iprob is infeasible. Bold figures show Pinf and Dinf at the end of step Q2. Note that Pinf/||v*||∞ and Dinf/||y*||∞ are O(10−30) or smaller, even though only O(10−15) was requested.