Literature DB >> 26543173

Exact quantification of cellular robustness in genome-scale metabolic networks.

Matthias P Gerstl1, Steffen Klamt2, Christian Jungreuthmayer1, Jürgen Zanghellini1.   

Abstract

MOTIVATION: Robustness, the ability of biological networks to uphold their functionality in spite of perturbations, is a key characteristic of all living systems. Although several theoretical approaches have been developed to formalize robustness, it still eludes an exact quantification. Here, we present a rigorous and quantitative approach for the structural robustness of metabolic networks by measuring their ability to tolerate random reaction (or gene) knockouts.
RESULTS: In analogy to reliability theory, based on an explicit consideration of all possible knockout sets, we exactly quantify the probability of failure for a given network function (e.g. growth). This measure can be computed if the network's minimal cut sets (MSCs) are known. We show that even in genome-scale metabolic networks the probability of (network) failure can be reliably estimated from MSCs with lowest cardinalities. We demonstrate the applicability of our theory by analyzing the structural robustness of multiple Enterobacteriaceae and Blattibacteriaceae and show a dramatically low structural robustness for the latter. We find that structural robustness develops from the ability to proliferate in multiple growth environments consistent with experimentally found knowledge.
CONCLUSION: The probability of (network) failure provides thus a reliable and easily computable measure of structural robustness and redundancy in (genome-scale) metabolic networks.
AVAILABILITY AND IMPLEMENTATION: Source code is available under the GNU General Public License at https://github.com/mpgerstl/networkRobustnessToolbox CONTACT: juergen.zanghellini@boku.ac.at SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2015        PMID: 26543173      PMCID: PMC4795620          DOI: 10.1093/bioinformatics/btv649

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

A major characteristic of biological systems is their ‘robustness’, i.e. the ability to perform normally under the presence of perturbations (Kitano, 2002). Despite its fundamental importance, robustness is hard to quantify and a comprehensive, quantitative understanding of robustness has yet to be developed (Kitano, 2007). The problem is augmented by the fact that robustness is an extremely general concept which obstructs efforts for an exact definition (Stelling ). Clearly, robustness arises as a consequence of the interactions between the components in a system. It therefore requires a network-based approach for its analysis (Larhlimi ). In the following, we study a key aspect of cellular robustness, namely structural robustness in metabolic networks (Wilhelm ), which is a measure of the apparent redundancy in metabolic networks. Redundancy is one reason why specific (network) functions may persist in spite of changes in the networks’ topologies. Metabolic networks are particularly suitable to study structural robustness as their topologies are either well known or can be reliably reconstructed by well-established protocols (Thiele and Palsson, 2010; Vlassis ). An unbiased way to characterize all topologically feasible flux distributions in a metabolic network can be achieved in terms of elementary flux modes (EFMs) (Schuster ). EFMs are steady-state pathways that use a minimal set of reactions (enzymes) in the appropriate thermodynamic directions. Biologically, they can be interpreted as simple, indivisible functional building blocks, which represent all feasible metabolic capabilities of the organism. Thus, EFMs are likely candidates to be used in a definition of structural robustness. In fact, Wilhelm were the first to define structural robustness by counting the number of EFMs in different situations. An alternative approach to EFMs is minimal cut sets (MCSs) (Klamt and Gilles, 2004). These are minimal sets of reaction (enzyme) knockouts that completely disable a particular network functionality. MCSs are intimately connected to EFMs. Any network function can be represented by a set of EFMs. Thus, MCSs can equivalently be defined as the minimal set of reaction knockouts that prevent a set of target EFMs from carrying a steady-state flux. An MCS is therefore a so-called minimal hitting set of the target EFMs (Klamt, 2006). If the EFMs or the MCSs are known, the respective other can be calculated. Mathematically, this relationship is known as duality (Berge, 1989). While EFMs describe the possibilities to achieve a particular function, MCSs describe the effort it takes to remove that function from the network. Thus, the size distribution of the MCSs provides a measure for the structural fragility of a network (Klamt and Gilles, 2004). One major disadvantage for both measures, the EFM-based structural robustness measure as well as the MCS-based structural fragility measure, is that they presuppose the availability of the complete set of EFMs and MCSs, respectively. This is a computationally intractable problem in genome-scale metabolic models (GSMMs). Here, we present a novel method that overcomes the computational limitations and allows one for the first time to estimate the structural robustness even in GSMMs. We use MCSs to calculate the exact (structural) probability of failure (PoF) of a network. This puts the estimation of structural robustness on a theoretically sound fundament.

2 Theory

We consider a metabolic network made up of m (internal) metabolites and r reactions. The network contains n EFMs. We use the support representation to uniquely characterize an EFM, . E is the set of all (reaction) indices i for which the EFM vector carries any flux, i.e. with . We collect all EFMs in the set . If we delete d reactions, several EFMs will get disabled. We represent a set of d deleted reactions by a cut set (CS) C of cardinality d.C contains the set of (reaction) indices that get deleted. A CS C disables all EFMs involving any of the d deleted reactions. To study the impact of a deletion strategy, C, on the network, we collect all EFMs not affected by C in the set of the remaining EFMs, . The complement of is denoted by and contains all EFMs that are disabled by C: . A minimal (irreducible) set of reaction (enzyme) deletions that completely disables a given set of target EFMs, , is called an MCS, M (Klamt and Gilles, 2004). For any target set we collect all μ MCSs in the set of MCSs, . Finally, we collect all possible CSs of cardinality d in the set of CSs, .

2.1 Structural robustness:

For a given number d of deletions, Behre defined the network’s structural robustness as the average ratio of the number of remaining EFMs to the total number of EFMs over all possible combinations of exactly d knockouts, i.e. We show in the Supplementary material that the network’s structural robustness only depends on the cardinality of the network’s EFMs and can easily be calculated for any number of deletions d by Thus, network structural robustness is merely determined by the EFMs’ length.

2.2 PoF: F(d)

For a given number of deletions, d, we define the network’s PoF, F(d), as the ratio of the number of CSs that disable a particular network objective, obj, to the total number of all possible combinations of exactly d knockouts, i.e. CSs that disable a particular objective (i.e. the elements of ) are either MCSs or supersets of MCSs disabling ‘obj’. We show in the Supplementary material that the PoF can be calculated by where and J is a (multi)-index over all power sets of the indices of the set of MCSs blocking ‘obj’. Note that the definition of the PoF requires specification of an objective, that can be the operation of the whole system or of a particular function. A typical cellular function is the production of biomass (BM). Thus, measures the probability that the deletion of d reactions is lethal. In what follows we will drop the index for the objective and implicitly assume that the production of BM is targeted, unless stated otherwise.

2.3 Overall structural robustness and overall PoF (OPoF)

Both and F(d) are defined for a specific number of reaction deletions. Following Behre , we define the overall structural robustness, , if at most (all) r reactions are knocked out, as the weighted sum of all possible (Behre ), Analogous to the structural robustness we define the OPoF, as Behre suggested to use the following weights However, their choice is difficult to interpret in terms of a probability distribution. That is why we opted for an appropriately selected probability distribution function for the calculation of . may be naturally modeled by a binomial distribution, which estimates the number of loss-of-function mutations with exactly d reaction deletions assuming a constant (reaction) mutation rate p. In typical metabolic networks, the number of reactions r is large and the mutation rate is small. We therefore approximate the binomial distribution with the simpler Poisson distribution, with . and all range between 0 and 1. However, while indicates a totally robust network, denotes the reverse, a completely fragile network.

3 Implementation and computation

The computation of the structural robustness according to Equation (2) is straight forward as long as the complete set of EFMs is known. The computation of the PoF is computationally more demanding as the number of summands in the sum of Equation (4) explodes combinatorially. In the following, we will outline countermeasures to estimate the PoF even in GSMMs. Algorithm 1. Recursive CS count Require: 1: forlength(mcs) do 2:   get_cs_count() 3: endfor 4: 5: functionget_cs_count() 6:   false 7: 8:   cardinality(combCs) 9:   choose() 10:  for do 11: 12:    cardinality(testCs) 13:    if then 14:     iftestCd > combCdthen 15: get_cs_count(, ) 16:     else 17:       true 18:       exit for loop 19:     endif 20:    endif 21:  endfor 22:  if then 23:    ifinRecursionthen 24: 25:    else 26: 27:    endif 28:  endif 29:  returncsCount 30: endfunction Algorithm 1 implements Equation (4) by calculating the total number of CSs for given deletions d in a recursive way. The number of reactions in the model, as well as all mcs with a cardinality up to the current number of reaction deletions are needed for the calculation. The algorithm calculates the number of all possible CSs for a single MCS by the function choose, which computes the binomial coefficient for , where combCd is the number of deletions resulting from the combination of MCSs. In order to avoid the consideration of a CS multiple times those CSs that have already been calculated by other MCS, indicated by their index, are subtracted. This is done by creating an intermediate CS by a logical OR operation and then following the inclusion–exclusion principal (Ryser, 1963) in a recursive procedure. Algorithm 1 was implemented as multi-threaded C program which is available at https://github.com/mpgerstl/networkRobustnessToolbox.

4 Results

In the following analysis, we used , unless stated otherwise.

4.1 reliably identifies structural fragility

We used a medium-scale metabolic model (MSMM) that describes the core metabolism of Escherichia coli (see Table 1 for an overview of the topological properties of the models used) to study the influence of the carbon source on the structural robustness and the PoF (see Table 2). In all cases, a complete EFM and MCS analysis was feasible.
Table 1.

Major topological properties of the MSMMs and GSMMs for the investigated Enterobacteriaceae and Blattibacteriaceae

Model IDOrganismMedium M r Rank(S) n nBM
EColi_core + glc (Gerstl et al., 2015) E. coli K-12 MG1655Glucose709065169 916121 753
EColi_core + gly (Gerstl et al., 2015) E. coli K-12 MG1655Glycerol71916660 49548 944
EColi_core + ac (Gerstl et al., 2015) E. coli K-12 MG1655Acetate6988641299736
iJO1366 + glc (Monk et al., 2013) E. coli K-12 MG1655Glucose116517261131n/an/a
iJO1366 + mel (Monk et al., 2013) E. coli K-12 MG1655Melibiose116317181128n/an/a
iECs_1301 + mel (Monk et al., 2013) E. coli O157:H7 SakaiMelibiose109816661057n/an/a
iS_1188 + mel (Monk et al., 2013) S. flexneri 2a 2457TMelibiose1 0261 517982n/an/a
iCG230 + full (González-Domenech et al., 2012) B. cuenoti PamFull299342292n/an/a
iCG238 + full (González-Domenech et al., 2012) B. cuenoti BgeFull306350299n/an/a

Models were taken from Gerstl , Monk and González-Domenech . Aerobic growth for Enterobacteriaceae was simulated on minimal medium with glucose or melibiose as sole carbon source and for Blattibacteriaceae with full medium (for media compositions see Supplementary Table S1). All reactions that could not carry a steady state flux under any circumstances for the given growth media were removed from the original models. m and r refer to the (remaining) number of internal metabolites and reactions, respectively, for fully consistent models (represented by the respective internal stoichiometric matrix, ). n and refer to the total number and the number of BM producing EFMs in the respective models. An EFM analysis was not applicable (n/a) for GSMMs.

Table 2.

Structural robustness and PoF in a core metabolic model of E. coli growing on minimal media (see Supplementary Table S1) and three different carbon sources (glc, glucose; gly, glycerol; ac, acetate)

model ID R(1) R(2) R¯ F(1) F(2) F¯
EColi_core + glc46.3221.5437.9821.1140.4510.31
EColi_core + gly46.1921.6937.9531.8754.6314.84
EColi_core + ac50.8626.9842.7542.0567.4219.06

We used all EFMs for the calculation of and all synthetic lethals of groups of up to reactions for the estimation of with . All values are listed in %

Major topological properties of the MSMMs and GSMMs for the investigated Enterobacteriaceae and Blattibacteriaceae Models were taken from Gerstl , Monk and González-Domenech . Aerobic growth for Enterobacteriaceae was simulated on minimal medium with glucose or melibiose as sole carbon source and for Blattibacteriaceae with full medium (for media compositions see Supplementary Table S1). All reactions that could not carry a steady state flux under any circumstances for the given growth media were removed from the original models. m and r refer to the (remaining) number of internal metabolites and reactions, respectively, for fully consistent models (represented by the respective internal stoichiometric matrix, ). n and refer to the total number and the number of BM producing EFMs in the respective models. An EFM analysis was not applicable (n/a) for GSMMs. Structural robustness and PoF in a core metabolic model of E. coli growing on minimal media (see Supplementary Table S1) and three different carbon sources (glc, glucose; gly, glycerol; ac, acetate) We used all EFMs for the calculation of and all synthetic lethals of groups of up to reactions for the estimation of with . All values are listed in % We found that both measures indicated a strong impact of the carbon sources although with opposing trends. The OPoF was lowest for growth on glucose and highest for growth on acetate. We observed that in all three growth environments the PoFs F(d) could be well approximated (coefficient of determination, ) by with three free parameters α, β and γ (see Supplementary Fig. S1). however ranked growth on acetate to be most robust and growth on glucose or glycerol similarly robust but far behind growth on acetate. According to Equation (2) this implied that for acetate EFMs were on average shorter then for glucose (see Supplementary Fig. S2). Finally, the simple ratio between the number of BM producing EFMs and the total number of EFMs revealed yet another ranking with the highest ratio for growth on glycerol and the lowest ratio for growth on acetate (see Supplementary Table S2). Comparing the different measures, we found that only the OPoF scored the (lack of) structural robustness in E. coli consistently to previous analysis (Klamt and Gilles, 2004; Stelling ).

4.2 and can be calculated exactly in MSMMs

Judged by the number of EFMs, the three growth models significantly differed in size (see Table 1). Yet all structural robustness-values were calculated within few seconds for any number of reaction deletions. As the number of summands in Equation (4) grew excessively for high number of reaction deletions, d, the calculation of F(d) was more demanding. We found an exponential increase in the number of recursions for Algorithm 1 (see Fig. 1). Our optimized implementation scaled consistently better than that and allowed us to evaluate F(d) in the largest model, EColi_core + glc, up to d = 9 in < 10 min using 10 threads on an Intel® CoreTM i7-3930 K with two CPUs á six cores and 3.20 GHz operated with Ubuntu 12.04.
Fig. 1.

Number of recursions in Algorithm 1 as function of the number of reaction deletions, d evaluated for the model EColi_core+glc in different computation scenarios. In addition, we plotted the number of MCS (dotted line) as function of their cardinality for the model used

Number of recursions in Algorithm 1 as function of the number of reaction deletions, d evaluated for the model EColi_core+glc in different computation scenarios. In addition, we plotted the number of MCS (dotted line) as function of their cardinality for the model used

4.3 and are determined by the first few deletions

We analyzed the contribution of higher-order deletions to the overall structural robustness and OPoF, respectively. In the best case higher-order terms and F(d) vanish for causing no error in the sums of Equations (5) and (6). In the worst case higher-order terms maximally contribute to the sums. Thus, the maximum error ε is easily calculated by setting for all terms , where the last sum represents the cumulative distribution function of with . We found that the resulting maximum error dropped quickly with d0 even in huge GSMMs (see Fig. 2). For instance, for λ ≤ 1 the ε drops below 10−4 if all MCSs up to cardinality six are known. Thus, we conclude that for practical means the overall structural robustness and the OPoF can be approximated by the first few terms in their respective sums For fixed λ the OPoF depends only on the d0 smallest MCS and consistently converges toward its true value with increasing MCS cardinality. In the E. coli example used above (see Table 2), all MCSs up to cardinality 4 were sufficient to estimate within an error of 0.02 percentage points. However, a similar argument for the overall structural robustness is not available. The knowledge of the d0 shortest EFMs does not allow one to accurately estimate . Supplementary Figure S5 illustrates the error in the overall structural robustness as function of the d0 shortest EFMs for the E. coli model used above. In that case, >55% of all EFMs were required to get within a 10% error margin.
Fig. 2.

Maximum error, ε, as function of the expansion length, d0, for various λ values. Note that according to Equation (10) the maximum error only depends on λ and is independent of the specific topology and size of the metabolic model

Maximum error, ε, as function of the expansion length, d0, for various λ values. Note that according to Equation (10) the maximum error only depends on λ and is independent of the specific topology and size of the metabolic model

4.4 For small λ, is well approximated in GSMMs

According to Equation (11), the OPoF could be estimated if all low cardinality MCSs were known. Recently, von Kamp and Klamt (2014) showed that it is in fact possible to enumerate the smallest MCSs in GSMMs. In the following, we used their approach to calculate low cardinality MCSs. We calculated all lethal MCSs up to a cardinality of 4 in GSMMs of three E. coli strains growing on minimal medium and glucose or melibiose as the sole carbon source and estimated the OPoF (see Table 3). The models represented E. coli K-12 MG1655 (iJO1366), the pathogenic enterohemorrhagic E. coli O157:H7 Sakai (iECs_1301) and the Shigella flexneri 2a 2457T (iS_1188).
Table 3.

PoFs in GSMMs of different Enterobacteriaceae growing on minimal media (see Supplementary Table S1) and glucose (glc) or melibiose (mel)

Model ID F(1) F(2) F(3) F(4) F¯ ε(4)
iJO1366 + glc16.7430.7142.3552.068.020.02
iJO1366 + mel17.1731.4243.2453.048.220.02
iECs_1301 + mel17.5932.1144.1053.988.410.02
iS_1188 + mel19.9735.9948.8359.129.510.00

The OPoF was estimated by considering all synthetic lethals of groups of up to reactions and . All values are listed in %

PoFs in GSMMs of different Enterobacteriaceae growing on minimal media (see Supplementary Table S1) and glucose (glc) or melibiose (mel) The OPoF was estimated by considering all synthetic lethals of groups of up to reactions and . All values are listed in % Comparing the three organisms against each other revealed that growth on glucose is more failsafe than growth on the alternative carbon source melibiose. In fact, growth on glucose (and glucose 1-phosphate) was found to be more robust than on any other single, nitrogen-free carbon source (see Supplementary Fig. S4). The two E. coli were similarly robust (with a small advantage for E. coli K-12 MG1655), while the S.flexneri was most fragile. Although the difference in the OPoF was small, it is indicative as the difference in for any two models is larger than . We observed the same trend not only for but also for F(d) for all tested cardinalities , as well. The GSMMs were also found to be more robust than the E. coli core model. Again, in all models, F(d) could be nicely approximated by (see Supplementary Table S4). Finally, we calculated all synthetic lethal reactions in groups of up to reaction deletions and evaluated the OPoF in the GSMMs of the endosymbiotic Blattabacterium cuenoti Bge (iCG238 + full) and B. cuenoti Pam (iCG230 + full), see Table 4, for full growth media containing all nutrients that were possibly taken up, see Supplementary Table S1 for detailed media compositions. Both strains were found to be extremely fragile with OPoFs > 26% and a maximal inaccuracy of percentage points. Any three reaction deletions almost certainly killed these two strains (with a small chance of survival of <5%).
Table 4.

PoFs in GSMMs of B. cuenoti Bge (iCG238 + full) and B. cuenoti Pam (iCG230 + full) growing on full media (see Supplementary Table S1)

Model ID F(1) F(2) F(3) F(4) F(5) F(6) F¯ ε(6)
iCG230 + full64.6287.6895.8198.6199.5599.8627.63 104
iCG238 + full62.5786.3695.1698.3199.4399.8126.89 104

The OPoF was estimated by considering all synthetic lethals of groups of up to reactions and . All values are listed in %

PoFs in GSMMs of B. cuenoti Bge (iCG238 + full) and B. cuenoti Pam (iCG230 + full) growing on full media (see Supplementary Table S1) The OPoF was estimated by considering all synthetic lethals of groups of up to reactions and . All values are listed in %

4.5 Gene–centric PoFs showed same behavior as reaction–centric PoFs

For simplicity so far we analyzed structural robustness from a reaction–centric (RC) viewpoint. However, it is possible to change from an RC to a gene–centric (GC) viewpoint while retaining our formalism for the PoF calculation. PoF only requires the calculation of MCSs. These can be calculated in an RC model as well as in a GC model provided that a gene–reaction mapping is available as Boolean function of the genes (as it is typically the case). GC MCSs consider the effect of gene knock-outs on reactions based on the evaluation of the provided Boolean rules. If the complete set of EFMs is known, GC MCSs can be calculated using an established integer programming procedure (Jungreuthmayer and Zanghellini, 2012). In GSMMs, where the complete set of EFMs is unavailable, the dual systems approach (von Kamp and Klamt, 2014) can be adopted to compute GC MCSs. The common dual approach calculates RC MCSs (ordered by ascending cardinality) by iteratively solving a mixed integer linear program (MILP) (De Figueiredo ). In order to obtain a GC MCS instead of an RC MCS the gene–reaction association has to be integrated into the MILP, which can be achieved by means of the so-called indicator constraints. These concepts are provided by modern MILP solvers, which allow for the Boolean coupling of reactions and their associated genes. We evaluated GC PoFs for models listed in Table 1 and compared them against the corresponding RC PoFs (see Table 5). We found comparable (O)PoF values except for the MSMM of E. coli, where we observed larger differences. Yet the absolute ranking of the organisms with respect to structural robustness remained unchanged no matter if the GC or RC PoF was used.
Table 5.

GC and RC PoF for various MSMMs and GSMMs

Model ID F(1)
F(2)
F¯
GCRCGCRCGCRC
EColi_core + glc5.0021.1110.0140.452.5010.31
EColi_core + gly5.7131.8711.4654.632.8614.84
EColi_core + ac15.0042.0529.1067.427.3819.06
iJO1366 + glc16.9416.7431.0330.718.118.02
iJO1366 + mel17.5517.1732.0531.428.398.22
iECs_1301 + mel19.1417.5934.6532.119.128.41
iS_1188 + mel21.8219.9738.9235.9910.339.51
iCG230 + full66.3464.6288.9687.6828.2427.63
iCG238+full62.2662.5786.3386.3626.7926.89

For the GSMMs the OPoFs were estimated by considering all synthetic lethals of groups of up to reactions and . For these settings the estimation error was < 0.02 percentage points. All values are listed in %

GC and RC PoF for various MSMMs and GSMMs For the GSMMs the OPoFs were estimated by considering all synthetic lethals of groups of up to reactions and . For these settings the estimation error was < 0.02 percentage points. All values are listed in %

5 Discussion

Here, we developed a new general measure to quantify the structural robustness (or fragility) of metabolic networks based on the (O)PoF. Both, PoF F(d) as well as OPoF are based on MCSs and share analogies with the previously defined structural robustness measures R(d) and which rely on EFMs (Behre ). In contrast to , the OPoF remains computationally feasible even in GSMMs for two reasons: (i) the OPoF can be well estimated from the shortest MCSs with up to d0 reaction cuts; and (ii) these d0 shortest MCSs can be calculated with a MILP even in GSMMs (von Kamp and Klamt, 2014). Previously, it was shown that MCSs in the (primal) network are EFMs in a network dual to the original network (Ballerstein ). Thus, finding the shortest EFMs in this dual network is equivalent to finding the low cardinality MCSs in the primal GSMM (von Kamp and Klamt, 2014). A high performance computing infrastructure is not required for this task. Thus, the OPoF is in principle an easily computable measure. However, due to the combinatorial explosion of the number of summands in F(d), see Equation (4), the evaluation is practically limited to low cardinalities. This does not cause practical problems for the OPoF as the impact of high-cardinality MCSs on quickly decreases (see Fig. 2). The slope of this decrease depends on the selected probability density that models the number of deletions that occur per mutation. Here, we used a Poisson distribution which is typically used for the study of rare events, like mutations (Lee , and references therein). We were able to show that up to our analysis is computationally feasible on standard computer infrastructure. This is sufficient for the study of naturally occurring mutations. For instance, in E. coli a mutation rate of about per genome and generation was found (Lee ), which is far below our limit of , as we can assume that not all mutations lead to a loss of function of metabolic enzymes. All our conclusions remained valid (data not shown) if we used other probability densities like a binomial distribution or the one used by Behre , see Equation (7). Previously, MCSs were already used to compute a so-called fragility coefficient which characterized the sensitivity of a metabolic network to deletions (Klamt, 2006; Klamt and Gilles, 2004). Fragility was defined for each reaction as the inverse of the average cardinality of all MCSs that were supported by that reaction. Upon averaging the reaction-specific fragility over all reactions one obtained a measure for the overall network fragility. In contrast to the here introduced concept of the OPoF, network fragility lacks a rigorous probabilistic definition and more importantly requires the complete set of MCSs for its calculation. Network fragility is therefore not applicable to GSMMs. A clear difference between the PoF and structural robustness is that the former is function oriented while the latter characterizes the network. is calculated with respect to some particular function that fails. Typically, networks perform multiple functions at the same time, which may be differentially robust. Thus, the OPoF is not only a topological property but also a property of the associated function. For instance, it is interesting to ask (and scope of further analysis) whether two essential functions like cellular energy production and BM production are similarly robust. on the other hand characterizes the network topology by essentially measuring the proportion of short EFMs in a network. This definition is independent of any functions performed by the network. counterintuitively identified growth on acetate to be more robust than growth on glucose which indicates that the length of EFMs may not be an appropriate proxy for structural robustness. Even if we selected only those EFMs that support a particular target function (like BM production), and calculated for the reduced set of EFMs, we had misidentified growth on acetate as the most robust growth environment as growth supporting EFMs on acetate were on average ‘shorter’ than on glucose (see Supplementary Fig. S2). For better understanding we illustrated this effect in a toy network in Supplementary Figure S3. Thus, we propose to use the complement of the OPoF: as a new measure for structural robustness in metabolic networks. PoFs can be calculated equally well on the basis of reaction deletions as well as on the basis of gene deletions. Some reactions in an organism are not associated with a gene (e.g. non-catalyzed reactions, reactions with unknown annotations) and can therefore not be knocked out. Other reactions are catalyzed by several different enzymes and are therefore more difficult to knock-out. However, our analysis indicates that the question if an organism or a growth condition is more robust than another can be identically answered in both approaches. It was criticized that structural robustness (as defined by Behre ) lacks the ability to identify critical enzymes, such as knot enzymes (Min ). This is true for the OPoF as well. For instance, according to our measure, network A in Figure 3 was correctly identified to be more robust than network B. The cause for the lack of structural robustness is not apparent from the numerical value of the OPoF. Min proposed to solve this dilemma by considering the number of independent (i.e. non-overlapping) EFMs, which are not available in GSMMs. However, by measuring the specific PoF for each individual reaction (enzyme) R we are at least able to identify critical enzymes. For this purpose, we define the (reaction) specific PoF to be the ratio between all specific CSs, , that are supported by reaction R and the number of all possible CSs, (and similarly for ). By ranking reactions according to their impact on the network we are able to identify critical reactions which may be useful drug targets.
Fig. 3.

Two differently robust networks (panel A, ; panel B, ) and their overall specific PoF, with (panel C)

Two differently robust networks (panel A, ; panel B, ) and their overall specific PoF, with (panel C) We calculated OPoFs in MSMMs and GSMMs of multiple Enterobacteriaceae growing on minimal media to illustrate our concept. Among these, the ordinary lab strain E. coli K-12 MG1655 growing on glucose was found to be most robust. This was consistently observed in MSMMs and GSMMs in agreement with expectations (Stelling ). Moreover, catabolite repression mechanisms support this observation of glucose as a preferential growth source. Escherichia coli K-12 MG1655 was also found to be more robust than the pathogenic E. coli O157:H7 Sakai, and the least robust S.flexneri 2a 2457T. While the difference between K-12 MG1655 and O157:H7 Sakai was small (but larger than the possible error), S.flexneri was clearly set apart. Due to its adaption to a specific growth niche, S.flexneri has lost many catabolic pathways for various nutrient sources (Bliven and Maurelli, 2012). It was demonstrated computationally as well as experimentally that the number of growth-supporting conditions is larger for E. coli K-12 MG1655 than for E. coli O157:H7 Sakai than for S.flexneri 2a 2457T (Monk ). In contrast we here investigated the OPoF as a measure of structural robustness for these organisms in a single growth condition. Yet we found the same ranking as obtained from counting the number of growth-supporting conditions. Structural robustness is thought to come about from the need to survive in multiple environments. Many of these adaptations occur through rewiring of existing enzymes (Wagner, 2013), thus increasing the connectivity in the network, and—as a byproduct—also introducing many alternative routes. These topological alternatives can be exploited for other growth sources as well and were mirrored in our analysis in low OPoFs. The two strains of the highly specialized endosymbiont B. cuenoti, which for millions of years have lived in the relatively constant environment provided by specialized cells of cockroaches (Lo ), were found to be extremely sensitive to loss-of-function mutations (González-Domenech ). This was confirmed by an extremely high OPoF of >26%. B. cuenoti was more than three times less robust than E. coli. It is known that B. cuenoti grows only on an extremely limited palette of metabolic substrates (Sabree ), which results in their low structural robustness as documented here. Consistent with our findings above on S.flexneri this lack of structural robustness is thought to be shaped by massive gene loss (Sabree ). A characteristic difference between B. cuenoti Pam and Bge is a defunct TCA cycle in the former due to the absence of the first three enzymatic steps (González-Domenech ). This fact was mirrored in a lower structural robustness of B. cuenoti Pam compared with Bge. Thus, we conclude that the OPoF is indeed an appropriate measure to correctly assess the structural robustness of cells. Here, we analyzed the structural robustness of multiple organisms in several growth environments. Consistent with previous findings our results confirm that structural robustness is correlated with the ability to utilize multiple carbon sources (Papp ). They concluded that structural robustness is not a trait selected by evolution but a byproduct of environmental flexibility. However, very recently Yang showed that structural redundancy also impacts a cell factory’s ability to produce a product of interest in the presence of perturbations. Based on predictions from GSMMs, they argue that pathway diversification leads to robust production capabilities under large perturbations. For instance, we show in Supplementary Table S5 two strain designs with different OPoFs but identical production/growth characteristics. Thus, calculating the OPoF will identify promising designs for cell factories. Interestingly, we found that the raise in F(d) with increasing d was well approximated by an exponential function in all investigated models and independent of the investigated organism. Whether this is an expression of some general topological properties of metabolic networks or a coincidence will be scope of further work.

6 Conclusion

We developed a consistent theory of cellular redundancy based on a rigorous probabilistic definition of failure in metabolic networks. The new measure, called OPoF, allows quantification of the structural robustness of biochemical reaction networks with respect to certain functionalities. OPoF can be reliably estimated—even in GSMMs—from low cardinality MSCs. In contrast to other measures, the OPoF quantifies structural robustness and fragility of cellular metabolism concurrent with current biological paradigms and experimental findings. More specifically we showed that the number of growth-supporting nutrients indirectly correlates with the organisms’ OPoF. As the OPoF is easily computable, we expect that it will be useful in increasing our understanding of key properties of metabolic networks. Click here for additional data file.
  28 in total

1.  A general definition of metabolic pathways useful for systematic organization and analysis of complex metabolic networks.

Authors:  S Schuster; D A Fell; T Dandekar
Journal:  Nat Biotechnol       Date:  2000-03       Impact factor: 54.908

2.  Metabolic network structure determines key aspects of functionality and regulation.

Authors:  Jörg Stelling; Steffen Klamt; Katja Bettenbrock; Stefan Schuster; Ernst Dieter Gilles
Journal:  Nature       Date:  2002-11-14       Impact factor: 49.962

3.  Minimal cut sets in biochemical reaction networks.

Authors:  Steffen Klamt; Ernst Dieter Gilles
Journal:  Bioinformatics       Date:  2004-01-22       Impact factor: 6.937

4.  Metabolic network analysis of the causes and evolution of enzyme dispensability in yeast.

Authors:  Balázs Papp; Csaba Pál; Laurence D Hurst
Journal:  Nature       Date:  2004-06-10       Impact factor: 49.962

Review 5.  Robustness of cellular functions.

Authors:  Jörg Stelling; Uwe Sauer; Zoltan Szallasi; Francis J Doyle; John Doyle
Journal:  Cell       Date:  2004-09-17       Impact factor: 41.582

6.  Generalized concept of minimal cut sets in biochemical networks.

Authors:  Steffen Klamt
Journal:  Biosystems       Date:  2005-11-21       Impact factor: 1.973

7.  Analysis of structural robustness of metabolic networks.

Authors:  T Wilhelm; J Behre; S Schuster
Journal:  Syst Biol (Stevenage)       Date:  2004-06

8.  Cockroaches that lack Blattabacterium endosymbionts: the phylogenetically divergent genus Nocticola.

Authors:  Nathan Lo; Tiziana Beninati; Fred Stone; James Walker; Luciano Sacchi
Journal:  Biol Lett       Date:  2007-06-22       Impact factor: 3.703

9.  Towards a theory of biological robustness.

Authors:  Hiroaki Kitano
Journal:  Mol Syst Biol       Date:  2007-09-18       Impact factor: 11.429

10.  Metabolomics integrated elementary flux mode analysis in large metabolic networks.

Authors:  Matthias P Gerstl; David E Ruckerbauer; Diethard Mattanovich; Christian Jungreuthmayer; Jürgen Zanghellini
Journal:  Sci Rep       Date:  2015-03-10       Impact factor: 4.379

View more
  3 in total

1.  Stoichiometric Representation of Gene-Protein-Reaction Associations Leverages Constraint-Based Analysis from Reaction to Gene-Level Phenotype Prediction.

Authors:  Daniel Machado; Markus J Herrgård; Isabel Rocha
Journal:  PLoS Comput Biol       Date:  2016-10-06       Impact factor: 4.475

2.  Speeding up the core algorithm for the dual calculation of minimal cut sets in large metabolic networks.

Authors:  Steffen Klamt; Radhakrishnan Mahadevan; Axel von Kamp
Journal:  BMC Bioinformatics       Date:  2020-11-09       Impact factor: 3.169

3.  Which sets of elementary flux modes form thermodynamically feasible flux distributions?

Authors:  Matthias P Gerstl; Christian Jungreuthmayer; Stefan Müller; Jürgen Zanghellini
Journal:  FEBS J       Date:  2016-03-31       Impact factor: 5.542

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.