| Literature DB >> 28691262 |
Ayca Cankorur-Cetinkaya1, Duygu Dikicioglu1, Stephen G Oliver1.
Abstract
Genome-scale metabolic models are valuable tools for the design of novel strains of industrial microorganisms, such as Komagataella phaffii (syn. Pichia pastoris). However, as is the case for many industrial microbes, there is no executable metabolic model for K. phaffiii that confirms to current standards by providing the metabolite and reactions IDs, to facilitate model extension and reuse, and gene-reaction associations to enable identification of targets for genetic manipulation. In order to remedy this deficiency, we decided to reconstruct the genome-scale metabolic model of K. phaffii by reconciling the extant models and performing extensive manual curation in order to construct an executable model (Kp.1.0) that conforms to current standards. We then used this model to study the effect of biomass composition on the predictive success of the model. Twelve different biomass compositions obtained from published empirical data obtained under a range of growth conditions were employed in this investigation. We found that the success of Kp1.0 in predicting both gene essentiality and growth characteristics was relatively unaffected by biomass composition. However, we found that biomass composition had a profound effect on the distribution of the fluxes involved in lipid, DNA, and steroid biosynthetic processes, cellular alcohol metabolic process, and oxidation-reduction process. Furthermore, we investigated the effect of biomass composition on the identification of suitable target genes for strain development. The analyses revealed that around 40% of the predictions of the effect of gene overexpression or deletion changed depending on the representation of biomass composition in the model. Considering the robustness of the in silico flux distributions to the changing biomass representations enables better interpretation of experimental results, reduces the risk of wrong target identification, and so both speeds and improves the process of directed strain development.Entities:
Keywords: Komagataella phaffii; Pichia pastoris; biomass composition; genome-scale metabolic model; metabolic target identification; recombinant protein production
Mesh:
Year: 2017 PMID: 28691262 PMCID: PMC5659126 DOI: 10.1002/bit.26380
Source DB: PubMed Journal: Biotechnol Bioeng ISSN: 0006-3592 Impact factor: 4.530
Datasets used to constrain the model
|
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|---|
| Carnicer et al. ( | ||||||||
| Wild type | ||||||||
| Normoxic | 0.99 | n/a | n/a | 2.35 | n/a | n/a | n/a | |
| Oxygen‐limited | 1.28 | n/a | n/a | 2.01 | 0.31 | 0.13 | n/a | |
| Hypoxic | 1.72 | n/a | n/a | 2.01 | 0.84 | 0.33 | n/a | |
| Fab producing | ||||||||
| Normoxic | 1.01 | n/a | n/a | 2.44 | n/a | n/a | 0.0004 | |
| Oxygen‐limited | 1.37 | n/a | n/a | 1.99 | 0.41 | 0.19 | 0.0007 | |
| Hypoxic | 1.56 | n/a | n/a | 1.81 | 0.83 | 0.24 | 0.0007 | |
| Rußmayer et al. ( | ||||||||
| Wild type | ||||||||
| Grown on Glu | 1.02 | n/a | n/a | 2.39 | n/a | n/a | n/a | |
| Grown on Met + Gly | n/a | 0.81 | 1.64 | 3.09 | n/a | n/a | n/a | |
Glu, Met, Gly, O2, EtOH, Ara, and Fab denote glucose, methanol, glycerol, oxygen, ethanol, arabitol and antibody Fab fragment, respectively. q denotes specific utilization rates for glucose, methanol, glycerol, oxygen, and specific production rates for ethanol, arabitol and antibody Fab fragment and given in mmol/gCDW/hr.
Evaluation of the predictive ability of the gene essentiality
| Kp.1.0 prediction for deletion of | Deletion mutant phenotype of | |
|---|---|---|
| Viable | Viable | True positive (TP) |
| Viable | Inviable | False positive (FP) |
| Inviable | Inviable | True negative (TN) |
| Inviable | Viable | False negative (FN) |
Figure 1Comparison of the existing GEMs of K. phaffii. The numbers in the intersection sets of the models for the metabolites and reactions represent the number of entities commonly found either two or all of the models. The remaining metabolites and reactions were classified as unique to each model
A comparison of the predictive ability of Kp.1.0 and Y7.6
| Kp1.0 | Y.7.6 | iLC915 | |
|---|---|---|---|
| # of genes | 720 | 909 | 915 |
| Essential genes | 26% | 17% | 9% |
| # of orthologous genes in GEM | 612 | 517 | 726 |
| Essential genes (among orthologs) | 29% | 21% | 10% |
| # of TP | 370 | 338 | 483 |
| # of FP | 62 | 32 | 171 |
| # of TN | 82 | 74 | 21 |
| # of FN | 95 | 73 | 51 |
| Sensitivity | 80% | 82% | 90% |
| Specificity | 57% | 70% | 11% |
| Positive predictive value | 86% | 91% | 74% |
| Negative predictive value | 46% | 50% | 29% |
| % Correct prediction | 74% | 80% | 69% |
Model predictions on growth characteristics
| Experimental values | Predicted values | |||
|---|---|---|---|---|
| Growth rate (hr−1) | CER (mmol/gCDW/hr) | Growth rate (hr−1) | CER (mmol/gCDW/hr) | |
| Wild type | ||||
| Normoxic | 0.1 | 2.43 | 0.10 | 2.62 |
| Oxygen‐limited | 0.1 | 2.55 | 0.09 | 2.71 |
| Hypoxic | 0.1 | 3.21 | 0.09 | 3.37 |
| Fab producing | ||||
| Normoxic | 0.1 | 2.52 | 0.09 | 2.70 |
| Oxygen‐limited | 0.1 | 2.68 | 0.09 | 2.88 |
| Hypoxic | 0.1 | 2.94 | 0.09 | 3.16 |
| Wild type | ||||
| Grown on Glu | 1 | 2.11 | 0.10 | 2.66 |
| Grown on Met + Gly | 0.1 | 1.86 | 0.10 | 2.15 |
The model predictions provided in this table were obtained when the biomass composition for cells grown under normoxic conditions for wild‐type cell were used.
Biomass effect on model prediction
| Biomass content data under description of condition | Predicted CER (mmol/gCDW × hr) (experimentally determined: 1.86 mmol/gCDW × hr) | Predicted growth rate (hr−1) (experimentally determined: 0.1 hr−1) |
|---|---|---|
| 1. Wild type—normoxic | 2.15 | 0.10 |
| 2. Wild type—oxygen limited | 2.18 | 0.09 |
| 3. Wild type—hypoxic | 2.21 | 0.09 |
| 4. Fab producing—normoxic | 2.14 | 0.09 |
| 5. Fab producing—oxygen limited | 2.24 | 0.09 |
| 6. Fab producing—hypoxic | 2.28 | 0.10 |
| 7. 80/20 glycerol/methanol— | 2.23 | 0.10 |
| 8. 60/40 glycerol/methanol— | 2.16 | 0.11 |
| 9. 40/60 glycerol/methanol— | 2.17 | 0.10 |
| 10. 80/20 glycerol/methanol— | 2.03 | 0.11 |
| 11. 60/40 glycerol/methanol— | 2.02 | 0.11 |
| 12. 40/60 glycerol/methanol— | 2.16 | 0.11 |
Figure 2Correlations between flux distributions and corresponding biomass compositions. The plot represents (a) the correlations between the flux distributions when the simulation were conducted using different biomass compositions (b) the correlation between the corresponding biomass compositions. The labels in the abscissa and ordinate corresponds to conditions where, wild type cells grown under normoxic (1), oxygen limited (2), hypoxic environment (3), Fab producing cells grown under normoxic (4), oxygen limited (5) and hypoxic environment (6), cells grown at a dilution are of 0.05 hr−1 using 80/20 glycerol/methanol (7), 60/40 glycerol/methanol (8), 40/60 glycerol/methanol (9) as the carbon source and cells grown at a dilution are of 0.16 hr−1 using 80/20 glycerol/methanol (10), 60/40 glycerol/methanol (11), 40/60 glycerol/methanol (12) as the carbon source. The color bar indicates values of the Pearson correlation coefficient: an increase from red to blue means transition from high correlation to low correlation
Figure 3Volcano plot of the flux distributions obtained using different biomass composition reactions. The plot represents the comparison of the flux values for every possible combinations of 12 different distributions. The x‐axis shows the log 2 fold changes and the y‐axis shows the log 10 of the p‐values. Each blue dot corresponds to a reaction for a specific comparison. The orange dots represents the threshold for fold change values (FC > 2) and the black dots represent the threshold used for p‐values (p‐value < 0.01)