| Literature DB >> 23009214 |
David Byrne1, Alexandra Dumitriu, Daniel Segrè.
Abstract
BACKGROUND: Metabolic engineering design methodology has evolved from using pathway-centric, random and empirical-based methods to using systems-wide, rational and integrated computational and experimental approaches. Persistent during these advances has been the desire to develop design strategies that address multiple simultaneous engineering goals, such as maximizing productivity, while minimizing raw material costs.Entities:
Mesh:
Year: 2012 PMID: 23009214 PMCID: PMC3484036 DOI: 10.1186/1752-0509-6-127
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1Procedural overview. The multi-goal metabolic engineering process incorporates three main decision-variable components: (1) organism model definitions, (2) imposed environmental and genetic conditions, and (3) extraction of desirable engineering metrics and goals. The resultant extracellular engineering phenotypes, relationships between the decision variables, and intracellular pathway activity can then be analyzed and experimentally verified to provide mechanistic insight and achieve optimal engineering designs.
Figure 2Engineering meta-phenotypes for economic data set. Each column (listed by Cluster Id and grouped by organism) is the centroid associated with a corresponding k-means phenotype cluster (meta-phenotype) for the simulation data subset with economic metrics. Cluster Ids increase sequentially, from left to right: 1 to 10 for E. coli, 11 to 30 for S. cerevisiae, and 41 to 60 for S. oneidensis. To compare across phenotype clusters and organisms, the metric values have been transformed into row-wise z-scores. All the rows and organism-specific columns were then hierarchically clustered. The “cluster sizes” are the number of individual phenotype simulations associated to each phenotype cluster. The “cluster heterogeneity” is the within-cluster sum of squared errors (SSE) for each phenotype cluster normalized by the maximum for each organism.
Selected Pareto optimal engineering designs
| 1 | 8 | Succinate production rate (0.99), Succinate purity (0.01) | candidate | edd, gnd | malthx, fum, gam, pi, so4 | ec | 0 | 0 | 1.85 | NA | 195.34 | 0.63 | 19314.4 |
| 2 | 8 | Succinate production rate (0.5), Succinate purity (0.5) | candidate | atpH, caiD | malthx, fum, gam, pi, so4 | ec | 0 | 0 | 1.85 | NA | 181.57 | 0.71 | 19305.4 |
| 3 | 3 | Succinate purity (1) | candidate | SO4417, SO3136 | ac, fum, nh4, pi, so4 | so | 0 | 0 | 0.01 | NA | 20 | 1 | 3.72 |
| 4 | 1 | Succinate production rate (0.01), Total economic cost rate (−0.99) | candidate | kgtP, lysP | sucr, o2, gam, ppt, so4 | ec | 36.17 | 0.35 | 1.85 | NA | 51.45 | 0.49 | 4.33 |
| 5 | 30 | Succinate purity (0.5), Total economic cost rate (−0.5) | candidate | YBR196C, YMR256C | glc, o2, urea, pi, so4 | sc | 0 | 0 | 0.26 | NA | 14.77 | 0.43 | 0.05 |
| 6 | 1 | Succinate production rate (0.33), Succinate purity (0.33), Total economic cost rate (−0.33) | candidate | SO4417, SO3136 | glyclt, fum, nh4, pi, so4 | so | 0 | 0 | 0.06 | NA | 20 | 1 | 2.26 |
| 7 | 1 | Succinate production rate (1) | validated | ptsG, pykFA, pfl | glc, NA, nh4, pi, so4 | ec | 4.28 | 0.24 | 0.12 | NA | 9.12 | 0.5 | 0.03 |
| 8 | 1 | Acetate production rate (1) | microarray | appY | glc, NA, nh4, pi, so4 | ec | 6.76 | 0.24 | 0.17 | 0.46 | 0.06 | 0 | 0.75 |
| 9 | 1 | Acetate purity (1) | microarray | arcA | glc, o2, nh4, pi, so4 | ec | 1.1 | 1 | 0.63 | 0.02 | 0 | 0 | 2.35 |
| 10 | 1 | Microarray consistency (1) | microarray | arcA | glc, NA, nh4, pi, so4 | ec | 0 | 0 | 0.11 | 0.56 | 0.04 | 0 | 0.02 |
| 11 | 1 | Acetate production rate (0.33), Microarray consistency (0.33), Acetate purity (0.33) | microarray | appY | glc, NA, nh4, pi, so4 | ec | 6.76 | 0.24 | 0.17 | 0.46 | 0.06 | 0 | 0.75 |
a Engineering designs are referenced in the text by this identifier. b Each engineering design is associated with an engineering phenotype cluster (meta-phenotype) and identified according to the “Clusters” shown in Figure 6(A) for E. coli, Additional file 1: Figure S13(A) for S. cerevisiae, and Additional file 1: Figure S14(A) for S. oneidensis. c The specified engineering design goals and weights (shown in parentheses, where positive weights indicate goal maximization and negative weights indicate goal minimization) are used to determine the optimal engineering designs for the corresponding “Design type”. d “Validated” designs are designs that were experimentally validated, “microarray” designs are designs that had matching experimental microarray data, and “candidate” designs are simulated designs that are neither “validated” nor “microarray” designs. e Nutrients are ordered by nutrient type: carbon, electron acceptor, nitrogen, phosphorous, and sulfur source. “NA” indicates that no nutrient source of that type was provided. Metabolite abbreviations: ac = Acetate, fum = Fumarate, gam = D-Glucosamine, glc = D-Glucose, glu-L = L-Glutamate, glyclt = Glycolate, malthx = Maltohexoase, nh4 = Ammonium, o2 = O2, pi = Phosphate, ppt = Phosphonate, sbt-D = D-Sorbitol, so4 = Sulfate, sucr = Sucrose, urea = Urea. f Organism abbreviations: ec = Escherichia coli, sc = Saccharomyces cerevisiae, so = Shewanella oneidensis. g Engineering design simulations were compared with microarray data to compute experimental microarray consistency. “NA” indicates that no microarray data was available for the corresponding design.
Figure 6Perturbation effects on phenotype changes in .(A) A subset of the engineering metrics associated with E. coli phenotype clusters (meta-phenotypes) shown in Figure 2. For values of engineering metrics (z-scores) and cluster sizes, refer to legend in Figure 2. (B) Meta-phenotype transition network for E. coli. Nodes i and j represent two viable-growth engineering meta-phenotypes (the nonviable-growth meta-phenotype is not shown). Node labels correspond to Clusters shown in (A). Node sizes are proportional to cluster sizes shown in (A). Edge t represents the cumulative phenotype-cluster transition frequency between Nodes i and j due to a specified perturbation type. Edges are bidirectional, so t is equivalent to t. Edge thickness is proportional to the cumulative transition frequency for environmental or genetic perturbations. Edges with relative frequency < 1% have been filtered out, primarily omitting low relative frequency single and double gene-deletion perturbations. (C) Legend for meta-phenotype transition network in (B). Node faces are divided into quadrants that correspond to the selected engineering metrics shown in (A). Quadrant colors indicate the associated metric z-scores for the corresponding Cluster. Perturbation type (edge attribute) abbreviations: C = carbon sources, EA = electron acceptor sources, N = nitrogen sources, P = phosphorous sources, S = sulfur sources, SGD = single gene deletions, and DGD = double gene deletions.
Figure 3Pareto optima and trade-offs for multiple engineering goals.(A-C). Two-dimensional (2D) and three-dimensional (3D) candidate and Pareto optimal designs and frontiers for engineering goals related to succinate production. Sub-optimal candidate designs for all organisms and experimentally validated E. coli designs are shown in (A) and (B), but are not shown in (C) for clarity. (D-F). 2D and 3D candidate and Pareto optimal designs that had matching experimental microarray data and frontiers for engineering goals related to acetate production. Error bars are too small to be visualized. For comparison, candidate designs that use the same media conditions as the microarray designs are shown in (D), however only the microarray designs are used to compute Pareto optima and frontiers. Sub-optimal microarray designs are shown in (D) and (E), but are not shown in (F) for clarity. The 3D Pareto frontiers shown in (C) and (F) are colored in proportion to the total economic cost and acetate production rates, respectively. Trade-offs can be determined by evaluating the rate of change of the Pareto frontier over the desired range of Pareto optimal designs. “Validated” designs are designs that were experimentally validated, “microarray” designs are designs that had matching experimental microarray data, and “candidate” designs are simulated designs that are neither “validated” nor “microarray” designs. Abbreviations: ec = Escherichia coli, sc = Saccharomyces cerevisiae, so = Shewanella oneidensis. The legend on top of the panels summarizes the symbols used.
Figure 4Metabolic network pathway activity in (A-C). Differential pathway activity between the wild-type E. coli grown anaerobically on glucose minimal media and the mutant E. coli Δedd Δgnd grown anaerobically on maltohexoase, fumarate, D-glucosamine, phosphate, and sulfate minimal media (Design 1 in Table 1). (A) The complete metabolic network map, and close-ups (regions framed by red boxes) of (B) the citric acid cycle and (C) the lipid metabolism. Metabolic pathway reactions are color-coded according to the relative flux differences (color legend for flux values in (C) applies to (A-C)) between the engineering designs. Similar metabolic network maps for all organisms and Pareto optimal designs can be viewed using the Multi-Goal Metabolic Engineering Website (Methods).
Figure 5Pair-wise correlations of engineering metrics. Pearson correlations are computed between all pairs of engineering metrics for the combined organism data. Rows and columns are hierarchically clustered. Column labels are in the same order as the row labels, with the top metric row associated with the left-most metric column.