| Literature DB >> 24957992 |
Anne-Christin Hauschild1, Dominik Kopczynski2, Marianna D'Addario3, Jörg Ingo Baumbach4, Sven Rahmann5, Jan Baumbach1.
Abstract
Ion mobility spectrometry with pre-separation by multi-capillary columns (MCC/IMS) has become an established inexpensive, non-invasive bioanalytics technology for detecting volatile organic compounds (VOCs) with various metabolomics applications in medical research. To pave the way for this technology towards daily usage in medical practice, different steps still have to be taken. With respect to modern biomarker research, one of the most important tasks is the automatic classification of patient-specific data sets into different groups, healthy or not, for instance. Although sophisticated machine learning methods exist, an inevitable preprocessing step is reliable and robust peak detection without manual intervention. In this work we evaluate four state-of-the-art approaches for automated IMS-based peak detection: local maxima search, watershed transformation with IPHEx, region-merging with VisualNow, and peak model estimation (PME).We manually generated Metabolites 2013, 3 278 a gold standard with the aid of a domain expert (manual) and compare the performance of the four peak calling methods with respect to two distinct criteria. We first utilize established machine learning methods and systematically study their classification performance based on the four peak detectors' results. Second, we investigate the classification variance and robustness regarding perturbation and overfitting. Our main finding is that the power of the classification accuracy is almost equally good for all methods, the manually created gold standard as well as the four automatic peak finding methods. In addition, we note that all tools, manual and automatic, are similarly robust against perturbations. However, the classification performance is more robust against overfitting when using the PME as peak calling preprocessor. In summary, we conclude that all methods, though small differences exist, are largely reliable and enable a wide spectrum of real-world biomedical applications.Entities:
Year: 2013 PMID: 24957992 PMCID: PMC3901270 DOI: 10.3390/metabo3020277
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Figure 1Heat map of an MCC/IMS measurement. X-axis: inverse reduced mobility 1/K0 in Vs/cm2; Y-axis: retention time r in seconds; signal: white (lowest) < blue < purple < red < yellow (highest), reaction ion peak (RIP) at 1/K0 = 0.46 Vs/cm2 .
Figure 2Overview: Evaluation pipeline.
Figure 3Schematic view of an IMS device. After ionization the analytes (charged molecules) are accelerated by an electric field and move towards a Faraday plate to which they transfer their charge. This is measured as a voltage signal. A drift gas flows in the opposite direction, thereby causing collisions that separate the analytes by their chemical properties. See text for details.
The number of peaks detected by all methods. The second column gives the number of peak clusters after merging the peak lists (postprocessing).
| # Peaks | # Peak Clusters | |
|---|---|---|
| Manual VisualNow | 1661 | 41 |
| Local Maxima Search | 1477 | 69 |
| Automatic VisualNow | 4292 | 88 |
| Automatic IPHEX | 5697 | 420 |
| Peak Model Estimation | 1358 | 69 |
Overlap of the five peak detection methods. The overlap of the peak list A (row) and peak list B (column) is defined as the number of peaks in V that can be mapped to at least one peak in W. Note that the resulting mapping count table is not symmetric.
| Manual | LMS | VisualNow | IPHEx | PME | |
|---|---|---|---|---|---|
| Manual | 1661 | 911 | 1522 | 1184 | 791 |
| Local Maxima | 868 | 1477 | 1096 | 1074 | 1128 |
| VisualNow | 2667 | 2233 | 4292 | 2341 | 2082 |
| IPHEx | 1112 | 1009 | 1157 | 5697 | 912 |
| PME | 737 | 1086 | 983 | 926 | 1358 |
Classification Results of the linear support vector machine. The quality measures are the AUC, accuracy (ACC), sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV).
| AUC | ACC | Sensitivity | Specificity | PPV | NPV | |
|---|---|---|---|---|---|---|
| Manual VisualNow | 77.4 | 70.9 | 69.7 | 72.4 | 75.7 | 65.9 |
| Local Maxima Search | 77 | 67.8 | 70.6 | 64.4 | 71 | 64 |
| Automatic VisualNow | 76.6 | 68.3 | 66.8 | 70.1 | 73.4 | 63.1 |
| Automatic IPHEx | 79.8 | 73 | 70.5 | 76 | 78.4 | 67.6 |
| Peak Model Estimation | 82.2 | 72.2 | 77.2 | 66.1 | 73.7 | 70.1 |
Classification Results of the random forest. The quality measures are the AUC, accuracy (ACC), sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV).
| AUC | ACC | Sensitivity | Specificity | PPV | NPV | |
|---|---|---|---|---|---|---|
| Manual VisualNow | 86.9 | 76.3 | 78.7 | 73.4 | 78.5 | 73.6 |
| Local Maxima Search | 80.8 | 70.5 | 75 | 64.9 | 72.5 | 67.8 |
| Automatic VisualNow | 81.1 | 71.9 | 75.6 | 67.3 | 74.1 | 69.1 |
| Automatic IPHEx | 80 | 68.9 | 72.8 | 64 | 71.4 | 65.6 |
| Peak Model Estimation | 81.9 | 74.2 | 81.6 | 65 | 74.2 | 74.1 |
Figure 4Boxplots of 100 runs of the ten-fold cross validation for both, the linear SVM and the random forest method.
Figure 5Boxplots illustrating the variation within the linear SVM tuning results in a single ten-fold cross validation run. The yellow boxes show the results when tuning the original feature sets. The green boxes show the results when tuning the randomly labeled feature sets.