| Literature DB >> 19015140 |
Sebastian Böcker1, Matthias C Letzel, Zsuzsanna Lipták, Anton Pervukhin.
Abstract
MOTIVATION: High-resolution mass spectrometry (MS) is among the most widely used technologies in metabolomics. Metabolites participate in almost all cellular processes, but most metabolites still remain uncharacterized. Determination of the sum formula is a crucial step in the identification of an unknown metabolite, as it reduces its possible structures to a hopefully manageable set.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19015140 PMCID: PMC2639009 DOI: 10.1093/bioinformatics/btn603
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Natural isotopic distribution: relative abundance of isotopes and their masses in Dalton
| Element | Isotope | mass | Mass difference | Abundance (%) |
|---|---|---|---|---|
| Hydrogen | 1H | 1.007825 | 99.985 | |
| 2H | 2.014102 | +1.006277 | 0.015 | |
| Carbon | 12C | 12.0 | 98.890 | |
| 13C | 13.003355 | +1.003355 | 1.110 | |
| Nitrogen | 14N | 14.003074 | 99.634 | |
| 15N | 15.000109 | +0.997035 | 0.366 | |
| Oxygen | 16O | 15.994915 | 99.762 | |
| 17O | 16.999132 | +1.004217 | 0.038 | |
| 18O | 17.999161 | +2.004246 | 0.200 | |
| Phosphor | 31P | 30.973762 | 100 | |
| Sulfur | 32S | 31.972071 | 95.020 | |
| 33S | 32.971459 | +0.999388 | 0.750 | |
| 34S | 33.967867 | +1.995796 | 4.210 | |
| 36S | 35.967081 | +3.995010 | 0.020 |
Fig. 1.Number of decompositions over the elements CHNOPS for intervals of width 0.001 Da. Minima and maxima taken in intervals of width 1 Da. True number of decompositions in comparison with approximate (2) (approx). As is shown in the inlay, γ(m) varies with a periodic function of period ∼1 Da.
Number of correct sum formulas at certain positions of the output list, for the FT-ICR dataset and the oa-TOF dataset
| Mass range | No. sp. | Rank in output list | No. sum formulas | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3–5 | 6–10 | 11+ | Int. | Real | Chem. | Time | ||
| FT-ICR dataset | ||||||||||
| 200–300 | 13 | 11 | 2 | 0 | 0 | 0 | 67 | 37.2 | 8.6 | 1.5 |
| 300–400 | 37 | 28 | 5 | 2 | 2 | 0 | 200 | 109 | 10.4 | 4.3 |
| 400–500 | 57 | 39 | 6 | 6 | 5 | 1 | 579.5 | 318.2 | 22.8 | 13.5 |
| 500–600 | 7 | 5 | 2 | 0 | 0 | 0 | 1800.4 | 990.3 | 59.6 | 40.1 |
| 600–700 | 4 | 3 | 0 | 0 | 1 | 0 | 2668.5 | 1454 | 37.3 | 55 |
| 700–800 | 5 | 0 | 1 | 1 | 1 | 2 | 8797.8 | 4812 | 247 | 232 |
| 800–900 | 14 | 3 | 2 | 1 | 3 | 5 | 14781.6 | 8101 | 534.6 | 485 |
| 900–1000 | 16 | 0 | 1 | 1 | 1 | 13 | 31805.7 | 17448 | 1570 | 1281 |
We report the number of spectra in this mass range (no. sp.), as well as the average number of sum formulas over all molecules in the mass range (no. sum formulas). We distinguish between the number of integer decompositions (int.), the number of real decompositions (real) and the number of those sum formulas that pass Senior's third theorem (chem.). Finally, we give the average running time in milliseconds per spectrum (time).
Fig. 2.Screenshot of the SIRIUS software, available from http://bio.informatik.uni-jena.de/sirius.