Literature DB >> 34329395

PolyRound: Polytope rounding for random sampling in metabolic networks.

Axel Theorell1,2, Johann F Jadebeck2,3, Katharina Nöh2, Jörg Stelling1.   

Abstract

SUMMARY: Random flux sampling is a powerful tool for the constraint-based analysis of metabolic networks. The most efficient sampling method relies on a rounding transform of the constraint polytope, but no available rounding implementation can round all relevant models. By removing redundant polytope constraints on the go, PolyRound simplifies the numerical problem and rounds all the 108 models in the BiGG database without parameter tuning, compared to about 50% for the state-of-the-art implementation. AVAILABILITY: The implementation is available on gitlab: https://gitlab.com/csb.ethz/PolyRound. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2021. Published by Oxford University Press.

Entities:  

Year:  2021        PMID: 34329395      PMCID: PMC8723145          DOI: 10.1093/bioinformatics/btab552

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Random sampling of constraint-based models of metabolism is a powerful approach to characterize the potential behaviors of metabolic networks (Schellenberger and Palsson, 2009; Herrmann ). The field is actively developed, for example, with recent extensions to model inference (Theorell and Nöh, 2020). Algorithmically, Markov Chain Monte Carlo (MCMC) based coordinate hit-and-run with rounding (CHRR) (Haraldsdóttir ) showed superior performance in computational benchmarks (Herrmann ) and it is available in a highly efficient and modular implementation (Jadebeck ). The relevant sampling space is a polytope P: a bounded set in constrained by hyperplanes. In constraint based models, P originates from stoichiometric reaction constraints and capacity constraints that give rise to equalities and inequalities. with matrices and , and vectors and . For Hit-and-Run samplers, such as CHRR, the asymptotic mixing time (a common efficiency measure in MCMC) depends quadratically on the sandwiching ratio, the ratio of radii of the largest sphere contained in P and the smallest sphere containing P (Lovász and Vempala, 2006). Efficient random sampling therefore relies on an efficient ’rounding’ preprocessing step: it applies a linear transformation to make the polytope more spherical. In practice, sampling the rounded polytope converges within minutes, whereas sampling the unrounded polytope fails to converge in reasonable time. Deterministic and stochastic algorithms for polytope rounding exist (Mangoubi and Vishnoi, 2019; Martino ). All current implementations of CHRR rely on deterministic search for the maximum volume ellipsoid (MVE) (Zhang and Gao, 2003); after rounding via a linear transform, the MVE equals the unit sphere. An implementation that handles polytopes formulated as P is interfaced from the CobraToolbox (CT) (Heirendt ). However, we found that it rounds only about half of the models in the BiGG repository (King ) due to numerical failures. Because this strongly limits the scope of models for random sampling, we provide PolyRound, an open source Python toolbox that uses a modified reformulation and rounding scheme optimized for robustness.

2 Materials and methods

2.1 Workflow

In a first step, PolyRound reformulates P in a form with only inequality constraints and embeds it in a space where it has non-zero hypervolume. With the null space matrix of A (computed by SVD), we express P in the null space coordinates u = Nx + x0 (x0 is an arbitrary feasible point) using only inequality constraints. However, P may still have zero hyper volume, since the inequality constraints may contain the so-called 0-facets, directions in which the width of P is 0. Therefore, prior to reformulation, PolyRound computes all facet widths by sequentially locating the minimal and maximal feasible point in the direction orthogonal to each inequality constraint. This requires solving two linear programs (LPs) per constraint. If the width is smaller than a threshold (e.g. ), the corresponding constraint is a de-facto equality constraint and is moved to the equality system. PolyRound also checks for redundant inequality constraints and removes these, using a third LP in which the right hand side of the constraint under investigation is relaxed. After all redundant constraints have been removed and 0-facet constraints have been refunctioned, the now smaller problem can be solved more accurately, so that new redundant constraints and 0-facets may be detected. Therefore, PolyRound iterates until no more changes of P are induced. In a second step, PolyRound computes the MVE using the F2PD algorithm by Zhang and Gao (2003). The PolyRound MVE is an optimized python implementation of CT’s default routine Bounciness/Volume-and-Sampling (Haraldsdóttir ), including an iterative scheme that improves numerical stability (Supplementary Information S3).

2.2 Implementation

PolyRound is implemented in Python 3 and equipped with an easy-to-use command line interface. It reads constraint based models in SBML format (Hucka ) using COBRApy (Ebrahim ), and generic polytopes in plain HDF5 and csv representations. LPs are solved via optlang (Jensen ), enabling easy use of different solvers. To reproduce the benchmarks, see Supplementary Information.

3 Benchmarks

We first assessed success in terms of obtaining a rounded polytope. PolyRound successfully rounded 100% of the 108 models in the BiGG database (King ) (Fig. 1A) , compared to at most 51% (Supplementary Fig. S1) for CT, using different parameters and versions. The performance difference is due to PolyRound’s primary invention, removal of redundant constraints: without it, PolyRound’s success rate dropped to 67% (see Supplementary Information). PolyRound and CT produced similar reductions in dimensionality, compared to the expected original dimensionality, but PolyRound achieved substantially larger reductions of the number of constraints (Fig. 1A, Supplementary Fig. S2), thus easing numerical computations in the rounding workflow. To validate that the polytopes generated by PolyRound yield efficient sampling, we collected 11 models spanning a range of sizes (Supplementary Table S1). The effective sample size (ESS) per time for uniform sampling with the HOPS library (Jadebeck ) was consistently higher for PolyRound than for CT (Fig. 1B).
Fig. 1.

(a) Fraction of constraints after rounding (number of rows of the processed inequality matrix), relative to the number for unrounded BiGG models (number of rows of A). Results for CT: best commit and default parameters. Due to failed rounding, the orange bars have ∼50% of the surface area of the blue bars. (B) ESS per time for a selection of models (Supplementary Table S1). The number of dimensions refers to the PolyRound processed models

(a) Fraction of constraints after rounding (number of rows of the processed inequality matrix), relative to the number for unrounded BiGG models (number of rows of A). Results for CT: best commit and default parameters. Due to failed rounding, the orange bars have ∼50% of the surface area of the blue bars. (B) ESS per time for a selection of models (Supplementary Table S1). The number of dimensions refers to the PolyRound processed models

4 Conclusion

PolyRound is an open-source, robust implementation of polytope rounding, which, by numerical craftsmanship, strongly widens the number of constraint based models for random sampling. Rounded models are maximally constraint reduced, which speeds up later computations. Click here for additional data file.
  10 in total

1.  The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models.

Authors:  M Hucka; A Finney; H M Sauro; H Bolouri; J C Doyle; H Kitano; A P Arkin; B J Bornstein; D Bray; A Cornish-Bowden; A A Cuellar; S Dronov; E D Gilles; M Ginkel; V Gor; I I Goryanin; W J Hedley; T C Hodgman; J-H Hofmeyr; P J Hunter; N S Juty; J L Kasberger; A Kremling; U Kummer; N Le Novère; L M Loew; D Lucio; P Mendes; E Minch; E D Mjolsness; Y Nakayama; M R Nelson; P F Nielsen; T Sakurada; J C Schaff; B E Shapiro; T S Shimizu; H D Spence; J Stelling; K Takahashi; M Tomita; J Wagner; J Wang
Journal:  Bioinformatics       Date:  2003-03-01       Impact factor: 6.937

Review 2.  Use of randomized sampling for analysis of metabolic networks.

Authors:  Jan Schellenberger; Bernhard Ø Palsson
Journal:  J Biol Chem       Date:  2008-10-20       Impact factor: 5.157

3.  HOPS: high-performance library for (non-)uniform sampling of convex-constrained models.

Authors:  Johann F Jadebeck; Axel Theorell; Samuel Leweke; Katharina Nöh
Journal:  Bioinformatics       Date:  2021-07-19       Impact factor: 6.937

4.  Reversible jump MCMC for multi-model inference in Metabolic Flux Analysis.

Authors:  Axel Theorell; Katharina Nöh
Journal:  Bioinformatics       Date:  2020-01-01       Impact factor: 6.937

5.  Creation and analysis of biochemical constraint-based models using the COBRA Toolbox v.3.0.

Authors:  Laurent Heirendt; Sylvain Arreckx; Thomas Pfau; Sebastián N Mendoza; Anne Richelle; Almut Heinken; Hulda S Haraldsdóttir; Jacek Wachowiak; Sarah M Keating; Vanja Vlasov; Stefania Magnusdóttir; Chiam Yu Ng; German Preciat; Alise Žagare; Siu H J Chan; Maike K Aurich; Catherine M Clancy; Jennifer Modamio; John T Sauls; Alberto Noronha; Aarash Bordbar; Benjamin Cousins; Diana C El Assal; Luis V Valcarcel; Iñigo Apaolaza; Susan Ghaderi; Masoud Ahookhosh; Marouen Ben Guebila; Andrejs Kostromins; Nicolas Sompairac; Hoai M Le; Ding Ma; Yuekai Sun; Lin Wang; James T Yurkovich; Miguel A P Oliveira; Phan T Vuong; Lemmer P El Assal; Inna Kuperstein; Andrei Zinovyev; H Scott Hinton; William A Bryant; Francisco J Aragón Artacho; Francisco J Planes; Egils Stalidzans; Alejandro Maass; Santosh Vempala; Michael Hucka; Michael A Saunders; Costas D Maranas; Nathan E Lewis; Thomas Sauter; Bernhard Ø Palsson; Ines Thiele; Ronan M T Fleming
Journal:  Nat Protoc       Date:  2019-03       Impact factor: 13.491

6.  COBRApy: COnstraints-Based Reconstruction and Analysis for Python.

Authors:  Ali Ebrahim; Joshua A Lerman; Bernhard O Palsson; Daniel R Hyduke
Journal:  BMC Syst Biol       Date:  2013-08-08

7.  Uniform sampling of steady states in metabolic networks: heterogeneous scales and rounding.

Authors:  Daniele De Martino; Matteo Mori; Valerio Parisi
Journal:  PLoS One       Date:  2015-04-07       Impact factor: 3.240

8.  BiGG Models: A platform for integrating, standardizing and sharing genome-scale models.

Authors:  Zachary A King; Justin Lu; Andreas Dräger; Philip Miller; Stephen Federowicz; Joshua A Lerman; Ali Ebrahim; Bernhard O Palsson; Nathan E Lewis
Journal:  Nucleic Acids Res       Date:  2015-10-17       Impact factor: 16.971

9.  Flux sampling is a powerful tool to study metabolism under changing environmental conditions.

Authors:  Helena A Herrmann; Beth C Dyson; Lucy Vass; Giles N Johnson; Jean-Marc Schwartz
Journal:  NPJ Syst Biol Appl       Date:  2019-09-02

10.  CHRR: coordinate hit-and-run with rounding for uniform sampling of constraint-based models.

Authors:  Hulda S Haraldsdóttir; Ben Cousins; Ines Thiele; Ronan M T Fleming; Santosh Vempala
Journal:  Bioinformatics       Date:  2017-06-01       Impact factor: 6.937

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.