| Literature DB >> 27857205 |
Laurence Yang1, James T Yurkovich1,2, Colton J Lloyd1, Ali Ebrahim1, Michael A Saunders3, Bernhard O Palsson1,2,4.
Abstract
Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the "generalist" (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thus represents a generalist ME model reflecting both growth rate maximization and "hedging" against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. This flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27857205 PMCID: PMC5114563 DOI: 10.1038/srep36734
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Model-based interpretation of proteomic data.
(a) Predicted mass fractions are validated by proteins grouped by COG for optimal and generalist ME models. Ellipses show 95% confidence intervals. (b) Growth rate predictions improve due to proteome sector constraints. (c) The model predicts global proteome reallocation due to sector constraints for Metabolic (M) and Expression (E) systems. (d) The ME model computes global metabolic shifts due to proteome reallocation. Abbreviations: C, Energy production and conversion; E, Amino acid transport and metabolism; G, Carbohydrate transport and metabolism; J, Translation, ribosomal structure and biogenesis; M, Cell wall/membrane/envelope biogenesis; O, Posttranslational modification, protein turnover, chaperones; CPS, Cellular Processes and Signaling; ISP, Information Storage and Processing; MET, Metabolism; MUU, Mobilome, Unknown, and Ungrouped; SSE, sum of squared errors.
Sectors and their mass fractions defined from proteomics data and used to constrain the ME model.
| Sector | Mass fraction |
|---|---|
| Amino acid transport and metabolism | 0.115 |
| Carbohydrate transport and metabolism | 0.089 |
| Cell wall/membrane/envelope biogenesis | 0.068 |
| Energy production and conversion | 0.096 |
| Posttranslational modification, protein turnover, chaperones | 0.054 |
| Translation, ribosomal structure and biogenesis | 0.156 |
Predicted protein percent of cell dry weight.
| Condition | Optimal | Generalist | ||
|---|---|---|---|---|
| Growth rate, h−1 | Protein, % dry weight | Growth rate, h−1 | Protein, % dry weight | |
| Acetate | 0.672 | 60.1 | 0.344 | 70.0 |
| Chemostat (µ = 0.12) | 0.120 | 58.2 | 0.120 | 72.7 |
| Chemostat (µ = 0.20) | 0.200 | 58.0 | 0.200 | 71.0 |
| Chemostat (µ = 0.35) | 0.350 | 57.9 | 0.350 | 68.8 |
| Chemostat (µ = 0.50) | 0.500 | 58.1 | 0.500 | 66.7 |
| Fructose | 1.140 | 57.1 | 0.876 | 62.5 |
| Fumarate | 1.020 | 59.4 | 0.581 | 67.5 |
| Galactose | 1.020 | 57.8 | 0.760 | 62.8 |
| Glucosamine | 1.130 | 57.0 | 0.854 | 62.7 |
| Glucose | 1.140 | 57.4 | 0.886 | 62.4 |
| Glycerol | 1.140 | 57.2 | 0.821 | 63.4 |
| Mannose | 1.110 | 57.2 | 0.848 | 62.7 |
| Pyruvate | 1.010 | 58.7 | 0.950 | 61.9 |
| Succinate | 1.060 | 59.6 | 0.601 | 67.7 |
| Xylose | 1.020 | 57.9 | 0.734 | 63.6 |
Figure 2Sensitivity to model parameters.
(a) Probability density of the coefficients of variation of simulated protein mass fractions across random perturbations of effective rate constants. (b) Variation in simulated metabolic fluxes upon perturbing effective rate constants. (c) Variation in simulated growth rate upon perturbing effective rate constants.