| Literature DB >> 27895706 |
Sandra Castillo1, Dorothee Barth1, Mikko Arvas1, Tiina M Pakula1, Esa Pitkänen2, Peter Blomberg1, Tuulikki Seppanen-Laakso1, Heli Nygren1, Dhinakaran Sivasiddarthan1, Merja Penttilä1, Merja Oja1.
Abstract
BACKGROUND: Trichoderma reesei is one of the main sources of biomass-hydrolyzing enzymes for the biotechnology industry. There is a need for improving its enzyme production efficiency. The use of metabolic modeling for the simulation and prediction of this organism's metabolism is potentially a valuable tool for improving its capabilities. An accurate metabolic model is needed to perform metabolic modeling analysis.Entities:
Year: 2016 PMID: 27895706 PMCID: PMC5117618 DOI: 10.1186/s13068-016-0665-0
Source DB: PubMed Journal: Biotechnol Biofuels ISSN: 1754-6834 Impact factor: 6.040
Fig. 1CoReCo metabolic model reconstruction pipeline. The algorithm has three main inputs: (1) genomes of the organisms (set of proteins), (2) phylogeny of the organisms, and (3) reaction database containing lists of metabolic reactions, reaction directionalities, and sequences of the enzymes catalyzing the reactions
Shows the origin of the reactions added to the reaction database
| Species | Source | Reactions | % of matched reactions | % of balanced reactions | Selected as representative | Representatives that balance |
|---|---|---|---|---|---|---|
|
| ymn6.06 | 1888 | 96 | 90 | 1020 | 923 |
|
| iMA871 | 1399 | 57 | 95 | 266 | 262 |
|
| iHD666 | 1303 | 85 | 69 | 88 | 72 |
|
| iWV1314 | 2360 | 93 | 74 | 265 | 235 |
|
| iLC915 | 1423 | 88 | 91 | 237 | 207 |
|
| iSS884 | 1332 | 88 | 92 | 6 | 5 |
|
| iAL1006 | 1636 | 83 | 95 | 75 | 67 |
| KEGG reaction | KEGG | 9236 | 93 | 90 | 6618 | 6376 |
| MetaCyc | MetaCyc | 11,181 | 62 | 90 | 3145 | 3054 |
| Total | 31,758 | 77 | 88 | 11,720 | 11,201 |
A fully resolved reaction is a reaction having all reactants identified by metabolites in the updated database. All representative reactions balance elements other than hydrogen. Reactions that also balance electrons are denoted “reactions that balance"
Fig. 2Example illustrating the benefit of adding reaction directions to the CoReCo gapless reconstruction step and improving the reaction database. The reactions that contain a T. reesei gene annotation in KEGG are colored blue. The reactions wrongly added in our previous metabolic model version for the production of zymosterol are colored red. The model presented in this work contains only the reactions colored blue and yellow
The macromolecular composition of T. reesei
| Biomass component | % (w/w) | ||
|---|---|---|---|
| Proteins |
| ||
| Ala | 2.715b | ||
| Arg | 3.944b | ||
| Asn | 1.613b | ||
| Asp | 2.677b | ||
| Cys | 0.549b | ||
| Gln | 2.063b | ||
| Glu | 3.200b | ||
| Gly | 1.649b | ||
| His | 1.346b | ||
| Ile | 2.186b | ||
| Leu | 4.101b | ||
| Lys | 2.595b | ||
| Met | 1.202b | ||
| Phe | 2.220b | ||
| Pro | 2.363b | ||
| Ser | 2.808b | ||
| Thr | 2.354b | ||
| Trp | 1.144b | ||
| Tyr | 1.846b | ||
| Val | 2.523b | ||
| Carbohydrates |
| ||
| Chitin | 7.820c | ||
| Other carbohydrates | 15.370d | ||
| RNA |
| ||
| DNA |
| ||
| Lipids |
| ||
| Fatty acids-esters | |||
| Myristic acid (C14:0) est | 0.004a | ||
| Palmitic acid (C16:0) est | 0.613a | ||
| Palmitoleic acid (C16:1n-7) est | 0.011a | ||
| Stearic acid (C18:0) est | 0.070a | ||
| Oleic acid (C18:1n-9) est | 0.126a | ||
| Linoleic acid (C18:2n-6) est | 1.425a | ||
| | 0.292a | ||
| Arachidic acid (C20:0) est | 0.004a | ||
| Lignoceric acid (C24:0) est | 0.005a | ||
| Fatty acids-free | |||
| Palmitic acid (C16:0) FFA | 0.060a | ||
| Stearic acid (C18:0) FFA | 0.012a | ||
| Oleic acid (C18:1n-9) FFA | 0.107a | ||
| Linoleic acid (C18:2n-6) FFA | 0.274a | ||
| Ergosterol | 0.278a | ||
| Triacylglycerol | 1.792e | ||
| Phosphatidylethanolamine | 0.551e | ||
| Phosphatidylcholine | 1.102e | ||
| Ash |
| ||
aMeasured as described in "Methods" section
b Amino acid ratios calculated based on codon ratios in RNAseq transcriptome data, with transcripts encoding secreted proteins removed
c Estimated based on the chitin content (% w/w) of A. oryzae biomass [8], and corrected for different ash contents in A. oryzae and T. reesei preparates
d Carbohydrates other than chitin
e Calculated based on the measured ratio of TAG:PE:PC 52:16:32 and the measured amount of fatty acid esters
f Measured from lactose-limited chemostat cultures of T. reesei Rut-C30 (D = 0.051/h) described in [40]
g Sum of measured free fatty acids, ergosterol, triacylglycerol, phosphatidylethanolamine and phosphatidylcholine
The coefficients (mmol/g CDW) used in the biomass equation of the T. reesei model
| Kegg Id | Compound name | Coefficient |
|---|---|---|
| C00133 | Alanine | −0.382a |
| C00062 | Arginine | −0.253a |
| C00152 | Asparagine | −0.141a |
| C00049 | Aspartate | −0.233a |
| C00097 | Cysteine | −0.053a |
| C00064 | Glutamine | −0.161a |
| C00025 | Glutamate | −0.248a |
| C00037 | Glycine | −0.289a |
| C00135 | Histidine | −0.098a |
| C00407 | Isoleucine | −0.193a |
| C00123 | Leucine | −0.362a |
| C00047 | Lysine | −0.202a |
| C00073 | Methionine | −0.092a |
| C00079 | Phenylalanine | −0.151a |
| C00148 | Proline | −0.243a |
| C00065 | Serine | −0.322a |
| C00188 | Threonine | −0.233a |
| C00078 | Tryptophan | −0.061a |
| C00082 | Tyrosine | −0.113a |
| C00183 | Valine | −0.254a |
| C06424 | Myristic acid | −0.00015b |
| C08362 | Palmitoleic acid | −0.00043b |
| C06427 |
| −0.01b |
| C00219 | Arachidic acid | −0.00013b |
| C08320 | Lignoceric acid | −0.00014b |
| C00249 | Palmitic acid | −0.026c |
| C01530 | Stearic acid | −0.003c |
| C00712 | Oleic acid | −0.008c |
| C01595 | Linoleic acid | −0.061c |
| C01694 | Ergosterol | −0.007d |
| C00093 | Glycerol−3− | −0.04e |
| C00588 | Phosphocholine | −0.014e |
| C00346 | Phosphoethanolamine | −0.007e |
| C00031 |
| −0.385f |
| C00140 |
| −0.948f |
| C00360 | Deoxyadenosine monophosphate | −0.007g |
| C00239 | Deoxycytidine monophosphate | −0.008g |
| C00362 | Deoxyguanosine monophosphate | −0.008g |
| C00364 | Deoxythymidine monophosphate | −0.007g |
| C00020 | Adenosine-monophosphate | −0.047h |
| C00055 | Cytidine monophosphate | −0.045h |
| C00144 | Guanosine monophosphate | −0.053h |
| C00105 | Uridine monophosphate | −0.045h |
| C00001 | Water | −59.276i |
| C00002 | Adenosine triphosphate | −59.276i |
| C00008 | Adenosine diphosphate | 59.276i |
| C00009 | Organic phosphorous | 59.305i |
| C00059 | Sulfate | −0.02i |
| C00255 | Riboflavin | −0.001i |
| C00010 | Coenzyme A | −0.000001j |
| C00003 | Nicotinamide adenine dinucleotide (NAD) | −0.000001j |
| C00016 | Flavin adenine dinucleotide (FAD) | −0.000001j |
| C00051 | Glutathione | −0.000001j |
| C00101 | tetrahydrofolate | −0.000001j |
| C00575 | 3′,5′-cyclic AMP | −0.000001j |
| C00096 | GDP-mannose | −0.000001j |
| C01083 |
| −0.000001j |
The coefficients correspond to the measured or estimated molar amounts of the compound in the cells, as described in "Methods" section
aThe amount of amino acids calculated based on the measured cellular protein and the ratio of amino acids in cellular proteins calculated based on the codon abundancy in the RNAseq data of transcriptome
b Esterified fatty acid measured using GC-MS
c Sum of measured esterified and free fatty acid measured using GC-MS
d Measured using GC-MS
e The amount estimated to be needed for synthesis of triacylglycerols, phophatidylethanolamines or phosphatidylcholines (1 mol of glycerol-3-P, phosphoethanolamine or phosphocholine per 1 mol of triacylglycerol, phosphatidylethanolamine or phosphatidylcholine, respectively)
f The measured total carbohydrate was assumed to consist of polymers of d-glucose subunits (glucan) and polymers of N-acetyl-d-glucosamine (chitin). Chitin content of the cells was estimated based on the amount of chitin in Aspergillus oryzae [8], and the rest of the measured carbohydrate as glucan
g The amount of deoxyribonucleotides in DNA calculated based on the cellular DNA amount and the GC content of the genome
h The amount of ribonucleotides in RNA was estimated based on the measured total RNA amount the nucleotide ratio in genome region encoding ribosomal 18S-28S pre-rRNA
i As in S. cerevisiae model iMM904 [42]
j Trace amount of the compound was added
Fig. 3Overview of growth simulation results in the predicted fungal models. Shown on the left is the phylogenetic tree of the fungal species colored by taxons. Shown on the right are four sets of barplots: simulated growth rate (1/h), tetrahydrofolate production rate (mmol/g Cell Dry Weight (CDW) h), AMP production rate (mmol/g CDW h), respectively; and lastly, the number of reactions in the model (exchange reactions and orphan reactions are not included in this number). The rates are computed by FBA from 1 mmol/g CDW h of glucose uptake rate. Growth was simulated using the yeast biomass equation. Of the yeast biomass components, tetrahydrofolate and AMP are the two compounds most commonly unsuccessfully produced by the model, causing failure of simulating growth
Fig. 4Production rates for components of the T. reesei biomass equation, simulated by FBA. As carbon sources, the models were given tenunits of glucose. Shown at the top is the phylogenetic tree of the fungal species colored by taxons. Shown below is a heatmap of production rates for each compound and species. For reference, the compound production rates have also been computed for the Yeast 7.6 model [43, 44]. Numbers have been rounded to integers
Carbon sources used by T. reesei found in the literature
| Carbon source | Growth rate (per unit of compound) |
|---|---|
|
| Not found in the model |
|
| Not found in the model |
| Arbutin | 0.171 |
| Cellobiose | 0.190 |
|
| 0.080 |
|
| 0.090 |
|
| 0.095 |
|
| 0.090 |
|
| 0.095 |
|
| 0.090 |
|
| 0.100 |
|
| 0.075 |
| Esculin | Not found in the model |
| Glycerol | 0.049 |
| Glycerol-1-monoacetate | Not found in the model |
|
| 0.075 |
|
| 0.095 |
|
| 0.115 |
| Salicin | 0.185 |
| Trehalose | 0.190 |
The exchange reaction for each of these compounds has been opened to allow the model to have one unit of uptake. Growth has been maximized using FBA
Fig. 5Growth rate simulations. On the y-axes, the predicted growth rate (h−1) and on x-axes a measured growth rate (h−1) and b measured carbon source, i.e., lactose consumption rate (mmol/(g CDW h)) are shown. The data points are marked with the fermentation identifiers (i.e., F32 is fermentation number 32) and surrounded by a box, colored to indicate the case corresponding to the fermentation. See further details from text
Comparison of the previously reconstructed models [21] with the new models produced by CoReCo in this article
| CoReCo model |
|
| Average of all modelsa | |||
|---|---|---|---|---|---|---|
| Pitkänen [ | This article (%) | Pitkänen [ | This article (%) | Pitkänen | This article (%) | |
| Dead end reactions | 58 | 44 | 61 | 44 | 59 | 44 |
| Dead end metabolites | 68 | 59 | 70 | 58 | 69 | 58 |
| Reactions added during the gap-filling step | 37 | 20 | 35 | 22 | 39 | 21 |
| Biomass components produced | 7 | 100 | 7 | 100 | 7 | 89 |
| Biomass components produced* | 98* | 100 | 100* | 100 | 91* | 89 |
| Biomass components produced with stoichiometrically unrealistic yield | 98* | 0 | 64* | 0 | 85* | 0 |
* When eight extra compounds were added as a source metabolites into the model simulations
a The obligate intracellular parasite Encephalitozoon cuniculi has been excluded from the averages