| Literature DB >> 35604083 |
Christoph Bueschl1,2, Maria Doppler2,3, Elisabeth Varga4, Bernhard Seidl2, Mira Flasch4, Benedikt Warth4, Juergen Zanghellini1.
Abstract
MOTIVATION: Chromatographic peak picking is among the first steps in data processing workflows of raw LC-HRMS datasets in untargeted metabolomics applications. Its performance is crucial for the holistic detection of all metabolic features as well as their relative quantification for statistical analysis and metabolite identification. Random noise, non-baseline separated compounds and unspecific background signals complicate this task.Entities:
Year: 2022 PMID: 35604083 PMCID: PMC9237678 DOI: 10.1093/bioinformatics/btac344
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.931
Fig. 1.Overview of the detection and training steps of PeakBot. The left panel shows the steps for detecting chromatographic peaks with an already trained PeakBot model, while the right panel shows the steps of training a new PeakBot model from reference LC-HRMS data
Overview of loss/metric values
| Loss/metric | Description |
|---|---|
| box_iou | Intersection over union (IOU) of the predicted and ground-truth bounding box of a true chromatographic peak. Higher values indicate a better overlap between the ground-truth and the prediction ( |
| center_loss | Mean-Squared-Error (MSE) of the predicted and ground-truth peak center (retention-time and |
| peakType_ACCPeakNoPeak | Accuracy for correctly reporting a chromatographic peak or not. Higher values indicate a better overlap. |
Fig. 2.Overview of training losses and metrices on the training dataset T and the additional validation sets V, iV, iT
Fig. 3.Comparison of results from PeakBot, XCMS, MS-Dial and peakOnly for a selected sample of the PHM dataset
Fig. 4.(a) RSD values of the peak areas of the WheatEar dataset. (b) Comparison of peak areas integrated with PeakBot and XCMS
Run times of PeakBot for the WheatEar dataset
| Workstation PC | HPC server | |
|---|---|---|
| CPU | Intel i7-4790K | AMD Epyc 7542 |
| Main memory | 16 GB | 2096 GB |
| GPU | Nvidia GTX 970 | Nvidia Tesla v100S |
| GPU memory | 4 GB | 32 GB |
| OS | Windows 10 | Debian 10.9 |
| Generating training and validation sets | 48 min | 8.9 min |
| Training CNN model | 33 min | 13.4 min |
| Average processing time of 31 259 local maxima | 45 s | 17 s |