| Literature DB >> 24489532 |
Rainer Breitling1, Shawn Ritchie2, Dayan Goodenowe2, Mhairi L Stewart3, Michael P Barrett3.
Abstract
Fourier transform mass spectrometry has recently been introduced into the field of metabolomics as a technique that enables the mass separation of complex mixtures at very high resolution and with ultra high mass accuracy. Here we show that this enhanced mass accuracy can be exploited to predict large metabolic networks ab initio, based only on the observed metabolites without recourse to predictions based on the literature. The resulting networks are highly information-rich and clearly non-random. They can be used to infer the chemical identity of metabolites and to obtain a global picture of the structure of cellular metabolic networks. This represents the first reconstruction of metabolic networks based on unbiased metabolomic data and offers a breakthrough in the systems-wide analysis of cellular metabolism.Entities:
Keywords: Fourier transform mass spectrometry; computational methods; metabolic networks; network reconstruction
Year: 2006 PMID: 24489532 PMCID: PMC3906711 DOI: 10.1007/s11306-006-0029-z
Source DB: PubMed Journal: Metabolomics ISSN: 1573-3882 Impact factor: 4.290
Figure 1.Zipf plots of the reaction (A) and metabolite (B) degrees in the ab initio metabolic network of Trypanosoma brucei. In (A), the red dots correspond to the reactions inferred automatically from the mass–mass differences, while the green dots are based on a pre-defined set of common biochemical reactions. The fitted lines are based on power-law (A) and exponential distributions (B), respectively.
Molecular standards detected by FTMS
| Compound | Predicted mass | Measured mass | ppm | Average S/N |
|---|---|---|---|---|
| Glutathione | 307.083807 | 307.0835 | 1 | 438 |
| Oxidized glutathione | 612.152 | 612.1516 | 1 | 328 |
| Trypanothione | 723.3044 | 723.3036 | 1 | 16 |
| Oxidized trypanothione | 721.2887 | 721.2889 | 0 | 281 |
| NADP | 743.075458 | 743.0766 | 2 | 442 |
| NAD | 663.109125 | 663.1096 | 1 | 1229 |
| ATP | 506.99575 | 506.9945 | 2 | 289 |
| ADP | 427.029418 | 427.0293 | 0 | 118 |
| AMP | 347.063086 | 347.0633 | 1 | 14 |
| Berenil | 281.138894 | 281.139 | 0 | 9 |
| Pentamidine | 340.1899 | 340.1897 | 1 | 67 |
| DB75 | 304.132411 | 304.1325 | 0 | 115 |
| Melarsen oxide | 292.00538 | 292.0053 | 0 | 113 |
Several of the standards were detected in multiple ionization modes. Only the measured mass with the strongest signal is listed in these cases. Average S/N is the average signal-to-noise ratio over all ionization modes that gave a detectable signal.
Average number of matching empirical formulae identified in PubChem at various mass accuracies, averaging over all masses in the present release of the database (mass range 2–9200, median mass 438)
| ppm | 0.01 | 0.02 | 0.05 | 0.08 | 0.1 | 0.2 | 0.5 | 0.8 | 1 | 2 | 5 | 10 | 20 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Hits | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.03 | 1.06 | 1.09 | 1.32 | 2.03 | 3.40 | 6.08 |
Common metabolic transformations and corresponding formulae
| Alanine | C3H5NO | Guanosine 5-diphosphate (–H2O) | C10H13N5O10P2 |
| Arginine | C6H12N4O | Guanosine 5-monophosphate (–H2O) | C10H12N5O7P |
| Asparagine | C4H6N2O2 | Guanine (–H) | C5H4N5O |
| Aspartic acid | C4H5NO3 | Guanosine (–H2O) | C10H11N5O4 |
| Cysteine | C3H5NOS | Deoxythymidine 5′-diphosphate (–H2O) | C10H14N2O10P2 |
| Cystine | C6H10N2O3S2 | Thymidine (–H2O) | C10H12N2O4 |
| Glutamic acid | C5H7NO3 | Thymine (–H) | C5H5N2O2 |
| Glutamine | C5H8N2O2 | Thymidine 5′-monophosphate (–H2O) | C10H13N2O7P |
| Glycine | C2H3NO | Uridine 5′-diphosphate (–H2O) | C9H12N2O11P2 |
| Histidine | C6H7N3O | Uridine 5′-monophosphate (–H2O) | C9H11N2O8P |
| Isoleucine | C6H11NO | Uracil (–H) | C4H3N2O2 |
| Leucine | C6H11NO | Uridine (–H2O) | C9H10N2O5 |
| Lysine | C6H12N2O | Acetylation (–H) | C2H3O2 |
| Methionine | C5H9NOS | Acetylation (–H2O) | C2H2O |
| Phenylalanine | C9H9NO | C2H2 | C2H2 |
| Proline | C5H7NO | Carboxylation | CO2 |
| Serine | C3H5NO2 | CHO2 | CHO2 |
| Threonine | C4H7NO2 | Condensation/dehydration | H2O |
| Tryptophan | C11H10N2O | Diphosphate | H3O6P2 |
| Tyrosine | C9H9NO2 | Ethyl addition (–H2O) | C2H4 |
| Valine | C5H9NO | Formic Acid (–H2O) | CO |
| Acetotacetate (–H2O) | C4H4O2 | Glyoxylate (–H2O) | C2O2 |
| Acetone (–H) | C3H5O | Hydrogenation/dehydrogenation | H2 |
| Adenylate (–H2O) | C10H12N5O6P | Hydroxylation (–H) | O |
| Biotinyl (–H) | C10H15N2O3S | Inorganic phosphate | P |
| Biotinyl (–H2O) | C10H14N2O2S | Ketol group (–H2O) | C2H2O |
| Carbamoyl P transfer (–H2PO4) | CH2ON | Methanol (–H2O) | CH2 |
| Co-enzyme A (–H) | C21H34N7O16P3S | Phosphate | HPO3 |
| Co-enzyme A (–H2O) | C21H33N7O15P3S | Primary amine | NH2 |
| Glutathione (–H2O) | C10H15N3O5S | Pyrophosphate | PP |
| Isoprene addition (–H) | C5H7 | Secondary amine | NH |
| Malonyl group (–H2O) | C3H2O3 | Sulfate (–H2O) | SO3 |
| Palmitoylation (–H2O) | C16H30O | Tertiary amine | N |
| Pyridoxal phosphate (–H2O) | C8H8NO5P | C6H10O5 | C6H10O5 |
| Urea addition (–H) | CH3N2O | C6H10O6 | C6H10O6 |
| Adenine (–H) | C5H4N5 | D-ribose (–H2O) (ribosylation) | C5H8O4 |
| Adenosine (–H2O) | C10H11N5O3 | Disaccharide (–H2O) | C12H20O11 |
| Adenosine 5′-diphosphate (–H2O) | C10H13N5O9P2 | Glucose-N-phosphate (–H2O) | C6H11O8P |
| Adenosine 5′-monophosphate (–H2O) | C10H12N5O6P | Glucuronic acid (–H2O) | C6H8O6 |
| Cytidine 5′-diphosphate (–H2O) | C9H13N3O10P2 | Monosaccharide (–H2O) | C6H10O5 |
| Cytidine 5′-monophsophate (–H2O) | C9H12N3O7P | Trisaccharide (–H2O) | C18H30O15 |
| Cytosine (–H) | C4H4N3O |
Comparison of the most common mass differences in observed and random metabolite networks
| Observed metabolite network | Random metabolite network | ||||
|---|---|---|---|---|---|
| Mass difference | Frequency | Formula | Exact mass | Mass difference | Frequency |
| 2.01595 | 382 | H2 | 2.01565 | 92.70975 | 7 |
| 21.98312 | 326 | Na–H | 21.98194 | 205.30491 | 7 |
| 1.00320 | 284 | 13C isotope | 1.0033 | 52.82462 | 7 |
| 24.00000 | 260 | C2 | 24 | 193.60014 | 6 |
| 26.01629 | 237 | C2H2 | 26.01565 | 243.29213 | 6 |
| 28.03188 | 218 | C2H4 | 28.03130 | 254.75355 | 6 |
| 4.03201 | 197 | H4 | 4.03130 | 6.46724 | 6 |
| 1.01259 | 164 | H2−13C isotope | 1.01229 | 52.69339 | 6 |
| 3.01910 | 148 | H2+13C isotope | 3.01900 | 21.98649 | 6 |
| 22.99695 | 140 | C2−13C isotope | 22.99664 | 22.12482 | 6 |
| Total | 25,370 (in 2472 clusters of >5 members) | Total | 115 ± 22 (in 19 ± 4 clusters of >5 members) | ||
Stability of network inference against noise
| Real data | 1 ppm | 2 ppm | 5 ppm | 10 ppm | 20 ppm | 100 | 1000 | 10,000 | Random | |
|---|---|---|---|---|---|---|---|---|---|---|
| Clusters | 2472 | 2254 | 2040 | 1660 | 1365 | 1113 | 470 | 53 | 35 | 19 |
| Explained distances | 25,370 | 22,988 | 21,094 | 16,966 | 13,213 | 10,003 | 3261 | 328 | 213 | 115 |
| % Excess over random | 100 | 90.5 | 83.1 | 66.7 | 51.9 | 39.2 | 12.5 | 0.8 | 0.4 | 0 |
Uniformly distributed random noise of the indicated size was added to all observed masses and the network reconstructed as described in the text.
Figure 2.Extract of the ab initio metabolic network of Trypanosoma brucei. For clarity we show only metabolites that correlate in abundance with mass 809.5939, an unsaturated phosphatidyl choline phospholipid that is part of an enriched metabolite family in the parasite. The inset (A) highlights the first generation of transformations originating from mass 809.5939, the main figure (B) shows the entire subgraph, which connects more than 60% of the most strongly correlating masses (Pearson correlation r > 0.85). Assigned molecular identities for each metabolite are indicated in a shorthand notation, where Cn:m stands for a phosphatidyl choline with n carbon atoms in the side chains and m unsaturated bonds. Alternative headgroups are explicitly mentioned in the labels. Shades of green indicate the abundance of the metabolites in the parasite. The graph layout was generated using aiSee (http://www.aisee.com).
Figure 3.Abundance profile of various phospholipid classes. Diacyl cholines, alkylacyl cholines and alkylacyl ethanolamines are shown. The number of unsaturated bonds increases from left to right, the number of carbons in the side chains from top to bottom. The upper left corresponds to a saturated C16, C16 phospholipid, the bottom right to an 8-fold unsaturated C22, C22 molecule. The left column shows the absolute signal strength in trypanosomes in vivo. The right column shows the relative abdundance of the lipids compared to their concentration in the serum supernatant. Shades of blue indicate depletion in the parasites, yellow and red enrichment. One star denotes that the difference is significant at p < 0.05 (two-tailed t-test), two stars indicate that the same significant difference is also seen in vitro. A bar highlights masses that correlate in abundance with mass 809.5939. All three lipid classes show the same overall trends, with higher unsaturation at higher chain length. The highest abundance is found for three types of alkylacyl lipids, which can be putatively identified as C18:2,C18:0 alkylacyl phosphatidyl choline, C18:2, C18:0 alk-1-enylacyl phosphatidyl choline, and C18:2, C18:0 alk-1-enylacyl phosphatidyl ethanolamine.
Masses that occur in at least 30 putative condensation reactions among masses in our dataset. Their relative abundance in the various types of samples is indicated on an arbitrary scale. n.d., not detectable
| Mass | # Of condensations | Putative identity |
|
| Serum | Medium |
|---|---|---|---|---|---|---|
| 280.2395 | 42 | Linoleic acid | ++ | ++ | +++ | +++ |
| 302.2245 | 39 | Icosapentaenoic acid | + | n.d. | n.d. | n.d. |
| 320.2342 | 39 | Hydroxy icosatetraenoic acid? | + | n.d. | n.d. | n.d. |
| 309.2753 | 35 | ? | n.d. | + | + | + |
| 312.2666 | 34 | Hydroxy nonadecenoic acid | + | n.d. | n.d. | n.d. |
| 328.2407 | 30 | Docosahexaenoic acid | + | n.d. | n.d. | n.d. |