| Literature DB >> 17897474 |
Matthew Yeung1, Ines Thiele, Bernard O Palsson.
Abstract
ABSTRACT:Entities:
Year: 2007 PMID: 17897474 PMCID: PMC2089122 DOI: 10.1186/1471-2105-8-363
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Identification of Potential Contributing Factors
| * ∑( | ∑ | * ∑( | |||||||||
| 0.459 | 0.656 | 0.509 | 0.666 | 0.6 | 0.0178 | 0.62 | 0.135 | 0.604 | 0.408 | ||
| 0.827 | 0.875 | 0.764 | 0.870 | 0.860 | 0.343 | 0.856 | 0.426 | 0.561 | 0.623 | ||
| 0.841 | 0.876 | 0.943 | 0.876 | 0.855 | -0.059 | 0.845 | 0.478 | 0.496 | 0.603 | ||
| 8 | 15 | 5 | 2 | 1.67 | 0.09 | 0.06 | 2 | 2 | 6 | ||
| 174 | 89132 | 35.08 | 5781.73 | 665.38 | 0.9 | 47.78 | 25 | 47 | 414 | ||
A good factor for an estimating function must have a high correlation to that is being estimated. We further required that the factor must grow consistently with the number of ExPas. The rows labelled 'Pre-' and 'Post-' Log show the Pearson's correlations, r, between the number of ExPas and the corresponding factors. These factors were created using the following basic networks measurements: R = Rthe number of active reactions given the environmental conditions, d± = d±(r) the incoming/outgoing connectivity of reaction r, c = c(r) the clustering coefficient of the ireaction, the set of input reactions, and the set of output reactions. Both Pearson's and Spearman's Rank correlation coefficients, r and ρ were used a guide to identify reliable contributing factors. Given this information, the final chosen factors are emphasized by an asterisk (*).
Figure 1Relationship between the Number of ExPas and Factor . Graphs displaying the two relationships derived from the factor . It was observed that θ1 had an exponential relation to p as shown in (i). The use of Ras a scaling factor was found to improve the fit of the data (ii).
Figure 2Relationship between the Number of ExPas and Factor . Similar to Figure 1, it was observed that also had an exponential relation to log [p] (i), which could be improved if scaled by R(ii).
Fit of Training Data to Estimating Functions
| Fit of Factors | Over-Estimation | Under-Estimation | |||||
| Factor | Correlation | m.a.e. | r.m.s. | Max. Value | Members>1 | Min. Value | Members<1 |
| 0.883439 | 0.407565 | 0.539834 | 1.512650 | 5 | -1.138180 | 1 | |
| †0.887057 | 0.406882 | 0.544633 | 1.585230 | 3 | -1.071100 | 1 | |
| ‡0.900704 | 0.383868 | 0.512515 | 1.629060 | 3 | -0.918259 | 0 | |
| 0.898332 | 0.381380 | 0.518272 | 1.674300 | 3 | -0.963172 | 0 | |
This table summarizes the statistics describing the relationships between the four different estimations f(θ) and the 52 data points. The results show that the two factors θ1 and θ2 allowed better estimations when the scaling factor Rwas included. The top two rows and the bottom two rows of the table, respectively, represent the factors before and after scaling by the factor R. The cells with the highest Pearson's correlation, r, before and after scaling are emphasized with † and ‡, respectively.
Figure 3Comparison of Test Models to the Estimating Functions. Figures displaying the relationships amongst the test data points and the four estimation functions given by equations (2), (3), (4) and (5). These are shown in (i), (ii), (iii) and (iv) respectively. The red lines in each case are given by f(θ) ± 1 and indicate the boundaries of the regions that are within a factor of 10 of the respective estimations.
Fit of Test Data to Estimating Functions
| Fit of Factors | Over-Estimation | Under-Estimation | |||||
| Factor | Correlation | m.a.e. | r.m.s. | Max. Value | Members>1 | Min. Value | Members<1 |
| 0.893058 | 0.956777 | 1.042668 | 1.513670 | 7 | -0.785898 | 0 | |
| *0.894445 | 0.806018 | 0.919502 | 1.353102 | 7 | -0.765522 | 0 | |
| 0.864965 | 0.891715 | 1.004652 | 1.501480 | 7 | -0.851316 | 0 | |
| 0.876409 | 0.777881 | 0.897856 | 1.401680 | 7 | -0.808617 | 0 | |
Table detailing the relationships between the 16 test data points and the four estimating functions. Although all four functions failed to predict 7 models to within a factor of 10 of their actual number of ExPas, the function f2 (θ1) had the least range for over- and under-estimation. Rows representing information of estimating functions using factors θ1 and θ2, respectively, in the top two and bottom two rows. The cell corresponding to the highest correlation, r, is emphasized by an asterisk (*).
Figure 4Connectivity of Reactions. Diagram describing different types of connectivities. Reaction rutilizes metabolite A, which is produced by three reactions, and produces metabolite B, which is consumed by three reactions. Reaction rthen has an incoming degree of d-(r) = 3 due to metabolite A, and outgoing degree of d+(r) = 3.
Figure 5Projection from Directed Hypergraph to One-mode Graph. Projection from directed hypergraph to one-mode graph, where the hyperedges on the left-hand side become the nodes of the the graph on the right-hand side. A thick black arrow in the graph on the right signifies an edge ris adjacent to r, whereas a thin blue line signifies two edges connected that are not adjacent.
Figure 6Relationship between Reactions with Non-zero Clustering-coefficient and Alternative Routes. Diagram showing relationship between non-zero clustering coefficients and alternative pathways. (i) shows three possible routes for a simple system; (ii) is the non-directed representation of this system using the above projection. The system has non-zero clustering coefficients, emphasizing alternative routes are possible; (iii) is the projection conforming to that shown in Figure 5, where non-zero clustering coefficient is found for 5 of the reactions that are involved in the branching points of alterative routes.
Basic Information of Models Used
| 479 | 166 | 485 | 13 | [3] | |
| Mitochondria | 200 | 92 | 238 | 16 | [4] |
| Core | 62 | 35 | 63 | 9 | [28] |
| Central | 56 | 19 | 62 | 2 | [30] |
| Toy | 50 | 37 | 53 | 4 | [31] |
| Red Blood Cell | 32 | 17 | 39 | 8 | [5] |
Simple measurements, statistics and sources of the models used in this study.
Environmental Conditions of Models. Table detailing the models used for estimation formulation, along with their environmental conditions.
| co2 | Free | Ac | ac | mal-L | Single Input along with core metabolite, allowing all outputs (unless specified as input only) | ||
| fe2 | Free | acac | akg | orn | |||
| fe3 | Free | ad | asp-L | phe-L | |||
| h2o | Free | ade | acald | pro-L | |||
| h | Free | akg | etoh | pyr | |||
| nh4 | Free | etoh | fum | ser-L | |||
| pi | Free | fum | glu-L | succ | |||
| so4 | Free | gsn | gsn | thr-L | |||
| o2 | Free | lac | gua | trp-L | |||
| mal | h2co3 | tyr-L | |||||
| pyr | hxan | urea | |||||
| succ | lac-L | ||||||
| urea | lys-L | ||||||
| co2 | Free | arachd | atp | glu-L | Single Input along with core metabolite, allowing all outputs (unless specified as input only) | ||
| h | Free | bhb | pheme | gly | |||
| h2o | Free | crvnc | phs-L | glyc | |||
| fe2 | Input | glc-D | 12dgr_m | glyc3p | |||
| o2 | Input | glu-L | acac | hdca | |||
| pi | Input | hdca | arachd | lac-L | |||
| urea | Output | lac-L | bhb | ocdca | |||
| ocdc All | coa | ocdcea | |||||
| ocdca | crvnc | ocdcya | |||||
| ocdcea | cys-L | pheme | |||||
| ocdcya | glc-D | ps_m | |||||
| co2, h, h20 fe2, fe3, o2 urea | Free | acac | atp | 12dgr_m | (1) Single K/O of: CYOOm3, SUCD3-u10m | ||
| Input | arachd | lac-L | coa | ||||
| Output | Bhb | pheme | cys-L | ||||
| crvnc | phs-L | glu-L | (2) Individual Request for: atp, phs-L, pheme | ||||
| glc-D | gly | ||||||
| glyc | ps_m | ||||||
| glyc3p | |||||||
| hdca | |||||||
| ocdca | |||||||
| ocdcea | |||||||
| ocdcya | |||||||
| co2 | Free | ac | ac | Single Input along with core metabolite, allowing all outputs (unless specified as input only) | |||
| h | Free | akg | akg | ||||
| h2o | Free | etoh | etoh | ||||
| o2 | Input | for | for | ||||
| pi | Input | fum | fum | ||||
| glc-D | lac-D | ||||||
| lac-D | pyr | ||||||
| pyr | succ | ||||||
| succ | |||||||
| adp, atp, co2, h, h2o, nad, nadh nadp, nadph, nh3, pi | Free | ade | 23dpg | ino | (1) Single Input along with core metabolite, allowing all outputs (unless specified as input only) | ||
| Free | ado | ade | lac | ||||
| Free | glc | ado | pyr | ||||
| hx | glc | ||||||
| ino | hx | ||||||
| lac | |||||||
| pyr | |||||||
| glc | 23dpg | ade | (2) Multiple Inputs | ||||
| hx | ado | ||||||
| ino | |||||||
| pyr | |||||||
| ino | |||||||
| adp, atp, coa, co2, h, h2o, nad, nadh, nadp, nadph, pi, ppi | Free | glycogen | 2dmmql8 | mqn8 | (1) Glycogen as Primary Input | ||
| Free | 2dmmq8 | mql8 | |||||
| Free | 3pg | oaa | |||||
| akg | pep | ||||||
| amp | pyr | ||||||
| e4p | q8 | ||||||
| fad | q8h2 | ||||||
| fadh2 | r5p | ||||||
| fad | 2dmmq8 | pyr | (2) A different set of Primary Input | ||||
| q8 | fadh2 | pep | |||||
| 2dmmql8 | q8h2 | e4p | |||||
| amp | r5p | ||||||
| oaa | 3pg | ||||||
| akg | |||||||
External Metabolites Abbreviation
| 12dgr_m | 1,2-Diacylglycerol | hx | Hypoxanthine |
| 23dpg | 2,3-Phospho-D-glyceroyl phosphate | hxan | Hypoxanthine |
| 2dmmq8 | 2-Demethylmenaquinone 8 | ile-L | L-Isoleucine |
| 2dmmql8 | 2-Demethylmenaquinol 8 | ino | Inosine |
| 3pg | 3-Phospho-D-glycerate | lac | Lactate |
| ac | Acetate | lac-L | L-Lactate |
| acac | Acetoacetate | leu-L | L-Leucine |
| acald | Acetaldehyde | lys-L | L-Lysine |
| ad | Acetamide | mal | Malate |
| ade | Adenine | mal-L | L-Malate |
| ado | Adenosine | meoh | Methanol |
| adp | ADP | mql8 | Menaquinol 8 |
| akg | 2-Oxoglutarate | mqn8 | Menaquinone 8 |
| ala-L | L-Alanine | nad | Nicotinamide adenine dinucleotide |
| ala-S | S-Alanine | nadh | Nicotinamide adenine dinucleotide – reduced |
| amp | AMP | nadp | Nicotinamide adenine dinucleotide phosphate |
| arachd | Arachidonic Acid (C20:4) | nadph | Nicotinamide adenine dinucleotide phosphate – reduced |
| asp-L | L-Asparagine | nh3 | Ammonium |
| atp | ATP | nh4 | Ammonium Ion |
| bhb | (2)-3-Hydroxybuanoate | o2 | O2 |
| ch4 | Methan | oaa | Oxaloacetate |
| co2 | CO2 | ocdc All | octadecanoate, octadecenoate, octadecynoate |
| coa | Coenzyme A | ocdca | octadecanoate |
| crvnc | Cervonic Acid (C22:6, n-3) | ocdcea | octadecenoate |
| cys-L | L-Cysteine | ocdcya | octadecynoate |
| e4p | D-Erythrose 4-phosphate | orn | Ornithine |
| etoh | Ethanol | pep | Phosphoenolpyruvate |
| fad | Flavin adenine dinucleotide | phe-L | L-Phenylalanine |
| fadh2 | Flavin adenine dinucleotide (reduced form) | pheme | Protoheme |
| fe2 | Iron (II) | phs-L | Phospholipid |
| fe3 | Iron (III) | pi | Phosphate |
| for | Formate | ppi | Diphosphate |
| fum | Fumarate | pro-L | L-Proline |
| glc | Glucose | ps_m | Phosphatidylserine |
| glc-D | D-Glucose | pyr | Pyruvate |
| glu-L | L-Glutamate | q8 | Ubiquinone-8 |
| gly | Glycine | q8h2 | Ubiquinol-8 |
| glyc | Glycerol | r5p | alpha-D-Ribose 5-phosphate |
| glyc3p | Glycerol 3-phosphate | ser-L | L-Serine |
| gsn | Guanosine | so4 | Sulfate |
| gua | Guanine | succ | Succinate |
| h | H+ | thr-L | L-Threonine |
| h2 | H2 | trp-L | L-Tryptophan |
| h2co3 | carbonic acid | tyr-L | L-Tyrosine |
| h2o | H2O | urea | Urea |
| hdca | Hexadecanoate (n-C16:0) | val-L | L-Valine |
Abbreviations of all external metabolite found in the 68 models used in this study.
Environmental Conditions of Test Models
| 84 | 121 | ac, ala-L, alac-S, ch4, co2, cys-L, gly, h, h2 h2o, ile-L, leu-L, meoh, pi, pyr, val-L | |
| 61 | 83 | ac, akg, co2, for, fum, glc-D, h, hxan nh4, mal-L, pi, pyr | |
| 48 | 65 | acald, akg, co2, etoh, for, fum, glc-D, h h2co3, lac-L, mal-L, o2, pi | |
| 168 | 170 | ac, acald, akg, asp-L, co2, etoh, fum, glc-D, glu-L, h, h2co3, h2o, lac-L, lys-L, mal-L, nh4, o2, phe-L, pi, pyr, ser-L, succ, thr-L, trp-L, tyr-L, urea |
Simple network measurements and the lists of external metabolites for the networks used to test the capability of estimations developed in the section 'Single Factor Estimate'.