| Literature DB >> 24958003 |
Kai Dührkop1, Kerstin Scheubert2, Sebastian Böcker3.
Abstract
We present results of the SIRIUS2 submission to the 2012 CASMI contest. Only results for Category 1 (molecular formula identification) were submitted. The SIRIUS method and the parameters used are briefly described, followed by detailed analysis of the results and a discussion of cases where SIRIUS2 was unable to come up with the correct molecular formula. SIRIUS2 returns consistently high quality results, with the exception of fragmentation pattern analysis of time-of-flight data. We then discuss possibilities for further improving SIRIUS2 in the future.Entities:
Year: 2013 PMID: 24958003 PMCID: PMC3901276 DOI: 10.3390/metabo3020506
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Figure 1Parameter values depending on the peak intensity. In SIRIUS2 0.9, α(·) and β(·) are linear functions (green dashed lines). In SIRIUS2 0.9.5, these are replaced by piecewise linear functions. Left: Mass accuracy factor α for varying peak intensity; Right: Intensity accuracy β for varying peak intensity. We choose a higher intensity accuracy for TOF than for Orbitrap. Bottom: Interpolation points for linear interpolation of α(·) and β(·).
Details of the challenge compounds and rank of the correct molecular formula for isotope pattern data, fragmentation pattern data and a combination of both scores. TOF data (1–6) and Orbitrap data (10–17). For Challenges 2, 4, 6 we used the recalibrated data. For Challenge 5 we used the original data, as mass accuracy after recalibration appears to be even worse.
| Chal. No. | molecular formula | parent peak (m/z) | mode | No. of isotopes | rank isot. pattern | rank frag. pattern | combined rank |
|---|---|---|---|---|---|---|---|
| 1 | C18H36N4O11 | 485.245 | positive | 3 | 15 | 7 | |
| 2 | C28H32O14 | 591.171 | negative | 4 | 604 | 2 | |
| 3 | C14H27NO9S3 | 448.075 | negative | 5 | 12 | ||
| 4 | C19H17NO4 | 324.122 | positive | 3 | 8 | 2 | |
| 5 | C19H23NO4 | 330.171 | positive | 4 | 18 | 2 | |
| 6 | C21H21NO6 | 384.144 | positive | 4 | 15 | 2 | |
| 10 | C14H9NO2 | 224.071 | APCI positive | 3 | 5 | ||
| 11 | C17H12O | 231.080 * | APCI positive | 2 * | 1 * | 5 * | 1 * |
| 12 | C17H16N4O4 | 341.126 | APCI positive | 2 | 21 | 18 | |
| 13 | C19H17OP | 293.110 | ESI positive | 3 | 2 | ||
| 14 | C12H9N | 168.080 | APCI positive | 2 | |||
| 15 | C12H13NO2 | 204.102 | APCI positive | 2 | |||
| 16 | C18H21N3O5 | - | APCI positive | - | N/A | 5 (2 **) | 5 (2 **) |
| 17 | C13H13N3 | 212.119 | ESI positive | 3 | 2 |
* For Challenge 11 we wrongly interpreted the parent peak and its fragment as isotope pattern. We reported a molecular formula of C17H11O which differs only by a single hydrogen from the correct molecular formula of C17H12O; ** Due to an operating error we used the wrong parent mass in our initial analysis; the number in brackets is the rank when using the correct parent mass.
Figure 2Left: Relative mass deviation plotted against relative intensities of isotope peaks. For all but the monoisotopic peak, we use the mass deviation of the difference between the peak mass and the monoisotopic peak mass; Right: Logarithmic ratio of measured to theoretical isotopic peak intensity (log-ratio) plotted against relative peak intensities. For Orbitrap data, the log-ratio of the monoisotopic peaks is mostly negative and is always positive for the +1 peaks. The variance of the log-ratios is higher for lower intensities.