| Literature DB >> 30721240 |
Yanping Xi1,2,3, Fei Wang1,2.
Abstract
Cellular systems shift metabolic states by adjusting gene expression and enzyme activities to adapt to physiological and environmental changes. Biochemical and genetic studies are identifying how metabolic regulation affects the selection of metabolic phenotypes. However, how metabolism influences its regulatory architecture still remains unexplored. We present a new method of extreme pathway analysis (the minimal set of conically independent metabolic pathways) to deduce regulatory structures from pure pathway information. Applying our method to metabolic networks of human red blood cells and Escherichia coli, we shed light on how metabolic regulation are organized by showing which reactions within metabolic networks are more prone to transcriptional or allosteric regulation. Applied to a human genome-scale metabolic system, our method detects disease-associated reactions. Thus, our study deepens the understanding of the organizing principle of cellular metabolic regulation and may contribute to metabolic engineering, synthetic biology, and disease treatment.Entities:
Mesh:
Year: 2019 PMID: 30721240 PMCID: PMC6363282 DOI: 10.1371/journal.pone.0210539
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Logic table for determining the constraints of the exchange reaction that transports metabolite in and out of the target subsystem based on flux variability analysis of the reaction .
The forward direction of is defined as from to . And the forward direction of the exchange reaction is defined as taking away from the target subsystem. Min, minimum; Max, maximum.
| Case | Exchange flux of a subnetwork | Available direction | |
|---|---|---|---|
| Min. | Max. | ||
| 1 | < 0 | > 0 | Out and in (unconstrained) |
| 2 | = 0 | > 0 | Out (positive) |
| 3 | < 0 | = 0 | In (negative) |
| 4 | = 0 | = 0 | None (zero) |
Fig 1Heat map showing conditional entropy of internal reaction pairs of the hRBC metabolic network.
The heat map colors represent the conditional entropy of each reaction at the beginning of each line, given the reaction listed at the bottom of each column. The black blocks on the diagonal represent the internal EqSets.
Fig 2P-value (the line with circles) and evaluation score σ (the line with squares) of the resulting EqSet sequence of hRBC metabolic network as a function of the parameter μ.
As μ ranges from 0 to 1 with an increasing step of 0.025, p-value varies between 2.10 × 10−4 and 5.1 × 10−3, and σ varies between 44.5 and 55.
Fig 3Evaluation score distribution of randomly organized EqSet sequences of hRBC metabolic network.
The arrow denotes the EqSet sequence shown in Table 2.
The internal EqSet sequence of hRBC metabolic network in descending order of regulatory importance.
The regulated reactions are denoted in boldface type. Full names for the abbreviations are listed in Table B in S1 Text.
| EqSet | Rank |
|---|---|
| AMPase | 7 |
Fig 4The resulting EqSet sequences calculated from extreme pathways (EPs; the lines with soft dots) versus those calculated from elementary modes (EMs; the lines with hard dots) in p-value (the lines with circles) and evaluation score σ (the lines with squares).
As μ ranges from 0 to 1, p-value of the EqSet sequences calculated from EMs varies between 0.0408 and 0.9173, and the corresponding σ varies between 66 and 112.5. These values are always higher than those of the sequences calculated from EPs.
Fig 5P-value distribution of the EqSet sequences calculated from artificial metabolic pathways of hRBC metabolic network.
A set of artificial metabolic pathways are built as follows: (1) An artificial pathway is a summation of several randomly selected EPs. (2) The probability an EP being selected is p. (3) Altogether, t artificial pathways are generated, and the unique ones form the set. One hundred different artificial pathway sets are built for certain values of p and t. The distributions of the number of artificial pathways and the number of EPs contained in an artificial pathway in the sets generated at different values of p and t are shown in Fig I in S1 Text.
Subsystems contained in each metabolic sub network of E.coli.
| Subnetwork | Subsystems |
|---|---|
| Amino acid metabolism | Alanine and aspartate metabolism; alternate carbon metabolism; arginine and proline metabolism; cofactor and prosthetic group biosynthesis; cysteine metabolism; folate metabolism; glutamate metabolism; glycine and serine metabolism; histidine metabolism; methionine metabolism; threonine and lysine metabolism; tyrosine tryptophan and phenylalanine metabolism; unassigned; valine leucine and isoleucine metabolism |
| Hydrocarbon metabolism | Citric acid cycle; cofactor and prosthetic group biosynthesis; folate metabolism; glycolysis gluconeogenesis; glyoxylate metabolism; methylglyoxal metabolism; oxidative phosphorylation; pentose phosphate pathway; putative; unassigned |
| Lipid metabolism | Anaplerotic reactions; cell envelope biosynthesis; citric acid cycle; cofactor and prosthetic group biosynthesis; folate metabolism; membrane lipid metabolism; methylglyoxal metabolism; nitrogen metabolism; oxidative phosphorylation; pyruvate metabolism |
The number of internal reactions, exchange reactions, extreme pathways and regulatory or disease-associated reactions in each subnetwork.
The values in the parentheses are the proportion of regulatory or disease-associated reactions among the internal reactions. EP, extreme pathway; RXN, reaction; Reg-, regulatory; Dis, disease-associated.
| Subnetwork | No. internal RXNs | No. exchange RXNs | No. EPs | No. Reg- or Dis- RXNs (Rate) |
|---|---|---|---|---|
| Amino acid metabolism ( | 255 | 141 | 385 | 167 (65.49%) |
| Hydrocarbon metabolism ( | 114 | 87 | 396 | 54 (47.37%) |
| Lipid metabolism ( | 127 | 83 | 239 | 65 (51.18%) |
| Hydrocarbon metabolism (human) | 333 | 320 | 922 | 101 (30.33%) |
| Amino acid metabolism (human) | 365 | 288 | 402 | 107 (29.32%) |
The span of the four parameters required in EqSet sequence calculation.
| Parameter | Span |
|---|---|
| 0, 0.025, 0.05, 0.075, 0.1, 0.125, 0.15, 0.175, 0.2, 0.25, 0.3, 0.35, 0.4, 0.45, 0.5, 0.6, 0.7, 0.8, 0.9, 1 | |
| 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 18, 20, 22, 24, 28, 32, 36, 40, 44, 48, 52, ∞ | |
| 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 18, 20, 22, 24, 28, 32, 36, 40, 44, 48, 52, ∞ | |
| 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40 |
Summary of the distribution of p-values corresponding to the EqSet sequences calculated on the candidate parameter values.
The two right-most columns show the proportion of candidate parameter values that result in an EqSet sequence with p-value less than 0.05 or 0.01, respectively. Min, minimum; Max, maximum.
| Subnetwork | Min. | Max. | Median | Average | Variance | ||
|---|---|---|---|---|---|---|---|
| Amino acid metabolism ( | 3.10 × 10−4 | 0.9778 | 0.0266 | 0.1076 | 0.1924 | 62.07% | 7.27% |
| Hydrocarbon metabolism ( | 4.78 × 10−6 | 0.9943 | 0.0232 | 0.1207 | 0.2389 | 78.94% | 29.90% |
| Lipid metabolism ( | 2.08 × 10−9 | 0.7448 | 7.19 × 10−6 | 0.0098 | 0.0456 | 96.13% | 90.83% |
| Amino acid metabolism (human) | 1.41 × 10−7 | 0.7621 | 5.37 × 10−4 | 0.0185 | 0.0047 | 91.93% | 86.83% |
| Hydrocarbon metabolism (human) | 7.33 × 10−6 | 0.7892 | 0.0665 | 0.0830 | 0.0049 | 35.42% | 5.73% |
Fig 6Box plots show the distribution of the lowest 5% of the p-values of the EqSet sequences of amino acid metabolism in E. coli when the parameter specified by the label of each subgraph is fixed to the value under the box.
The ordinate axis is plotted in log scale.
Fig 7Box plots show the distribution of the lowest 5% of the p-values of the EqSet sequences of hydrocarbon metabolism of E. coli when the parameter specified by the label of each subgraph is fixed to the value under the box.
The ordinate axis is plotted in log scale.
Fig 8Box plots show the distribution of the lowest 5% of the p-values of the EqSet sequences of lipid metabolism of E. coli when the parameter specified by the label of each subgraph is fixed to the value under the box.
The ordinate axis is plotted in log scale.
Subsystems contained in each metabolic sub network of human.
| Subnetwork | Subsystems |
|---|---|
| Amino acid metabolism | Alanine and aspartate metabolism; aminosugar metabolism; arginine and proline metabolism; citric acid cycle; CoA biosynthesis; CoA catabolism; cysteine metabolism; D-alanine metabolism; folate metabolism; glutamate metabolism; glutathione metabolism; glycine, serine, and threonine metabolism; heme biosynthesis; heme degradation; histidine metabolism; lysine metabolism; methionine metabolism; phenylalanine metabolism; salvage pathway; taurine and hypotaurine metabolism; tetrahydrobiopterin; tryptophan metabolism; Tyr, Phe, Trp biosynthesis; tyrosine metabolism; urea cycle/amino group metabolism; valine, leucine, and isoleucine metabolism; vitamin b6 metabolism; beta-alanine metabolism |
| Hydrocarbon metabolism | Ascorbate and aldarate metabolism; biotin metabolism; CYP metabolism; citric acid cycle; CoA biosynthesis; CoA catabolism; folate metabolism; miscellaneous; fructose and mannose metabolism; NAD metabolism; glycolysis/gluconeogenesis; galactose metabolism; glyoxylate and dicarboxylate metabolism; hyaluronan metabolism; IMP biosynthesis; keratan sulfate degradation; N-glycan biosynthesis; unassigned; N-glycan degradation; oxidative phosphorylation; pentose phosphate pathway; pyruvate metabolism; pentose and glucuronate interconversions; putative; propanoate metabolism; riboflavin metabolism; salvage pathway; starch and sucrose metabolism; thiamine metabolism; oxidative phosphorylation; pentose phosphate pathway |
Fig 9Box plots show the distribution of the lowest 5% of the p-values of the EqSet sequences of human amino acid metabolism when the parameter specified by the label of each subgraph is fixed to the value under the box.
The ordinate axis is plotted in log scale.
Fig 10Box plots show the distribution of the lowest 5% of the p-values of the EqSet sequences of human hydrocarbon metabolism when the parameter specified by the label of each subgraph is fixed to the value under the box.
The ordinate axis is plotted in log scale.
Non-disease-associated internal EqSets in top 30 of the sequences of the human amino acid or hydrocarbon metabolic networks with the lowest p-value.
The reaction abbreviation and reaction name look-up table is listed in Table F in S1 Text.
| EqSet | Rank | Disease-related reaction | Related diseases | References |
|---|---|---|---|---|
| 3HAO, HKYNH, KYN3OX | 13 | 3HAO | pellagra, olivopontocerebellar atrophy | [ |
| MTHFCm | 24 | MTHFCm | myelomeningocele, spina bifida | [ |
| MTHFD2 | 26 | MTHFD2 | lung cancer, spina bifida, breast cancer | [ |
| FKYNH, TRPO2 | 29 | TRPO2 | pellagra, haemophilus influenzae | [ |
| GHMT2r | 30 | GHMT2r | adult acute lymphocytic leukemia, pediatric osteosarcoma | [ |
| NMNS | 9 | NMNS | idiopathic recurrent pericarditis, gestational diabetes | [ |
| NNDPR, EX_Sub_quln[c] | 11 | NNDPR | pellagra, follicular thyroid carcinoma | [ |
| DOLGLCP_Lter, DOL_GPP_Ler, UDPDOLPT_L | 12 | - | - | - |
| DOLGLCP_Uter, DOL_GPP_Uer, UDPDOLPT_U | 14 | - | - | - |
| GAPD | 19 | GAPD | diffuse large B-cell lymphoma, obesity | [ |
| RBK, EX_Sub_rib-D[c] | 27 | - | - | - |
| FPGS4 | 28 | FPGS4 | Rheumatoid arthritis, psoriasis, colorectal cancer, non-small-cell lung cancer, non-Hodgkin lymphoma | [ |
Fig 11The average true positive rate versus the background proportion of a disease-associated EqSet.
The average true positive rate equals the proportion of disease-associated EqSets among those that participate in the top 10% of the sequence composed of the test EqSets. The background proportion equals the ratio of disease-associated EqSets among all the test EqSets. The horizontal axis represents the fraction of EqSets that used as training data.
Fig 12The heuristic parameter searching algorithm quickly identified the best or nearly best among all the candidates after a few attempts.
The horizontal axis represents the number of attempts in which an EqSet sequence is calculated on certain values of the 4 parameters. The number of attempts ranges from 1,000 to 50,000, with a step of 1,000. The vertical axis represents the number of the candidate parameter values for which the resulting p-values are no higher than those of the parameters suggested by the search algorithm. The algorithm was repeated 100 times at each attempt. The average ranks are shown by the line marks and standard deviations are shown with the error bars. The boxed off portion of the graph is zoomed in as a separate one to show more details.