| Literature DB >> 35699774 |
Nils Paulhe1, Cécile Canlet2, Annelaure Damont3, Lindsay Peyriga4, Stéphanie Durand1, Catherine Deborde5, Sandra Alves3, Stephane Bernillon5, Thierry Berton5, Raphael Bir1, Alyssa Bouville2, Edern Cahoreau4, Delphine Centeno1, Robin Costantino2, Laurent Debrauwer2, Alexis Delabrière3, Christophe Duperier1, Sylvain Emery1, Amelie Flandin5, Ulli Hohenester3, Daniel Jacob5, Charlotte Joly1, Cyril Jousse1, Marie Lagree1, Nadia Lamari5, Marie Lefebvre5, Claire Lopez-Piffet1, Bernard Lyan1, Mickael Maucourt5, Carole Migne1, Marie-Francoise Olivier3, Estelle Rathahao-Paris3, Pierre Petriacq5, Julie Pinelli5, Léa Roch5, Pierrick Roger3, Simon Roques5, Jean-Claude Tabet3, Marie Tremblay-Franco2, Mounir Traïkia1, Anna Warnet3, Vanessa Zhendre5, Dominique Rolin5, Fabien Jourdan2, Etienne Thévenot3, Annick Moing5, Emilien Jamin2, François Fenaille3, Christophe Junot3, Estelle Pujos-Guillot1, Franck Giacomoni6.
Abstract
INTRODUCTION: Accuracy of feature annotation and metabolite identification in biological samples is a key element in metabolomics research. However, the annotation process is often hampered by the lack of spectral reference data in experimental conditions, as well as logistical difficulties in the spectral data management and exchange of annotations between laboratories.Entities:
Keywords: Curation; Database; FAIR; Interoperability; Metabolite identification; Spectral library
Mesh:
Year: 2022 PMID: 35699774 PMCID: PMC9197906 DOI: 10.1007/s11306-022-01899-3
Source DB: PubMed Journal: Metabolomics ISSN: 1573-3882 Impact factor: 4.747
Fig. 1PeakForest database inputs and outputs
Fig. 2Example of a chemical compound card (L-tryptophan) with compound basic information (A), related names and spectra parts (B) and externals identifiers and related Metabolights studies (C)
Spectral data and metadata group view with links of corresponding templates (available in the supplementary material section) and potential usage for users
| Metadata group label | Context and template part link | Metadata usage |
|---|---|---|
All spectra templates “sample” sheet | Informs users about the spectrum’s type (single chemical compound, mix of chemical compounds, NIST plasma or biological matrix) – Extra information is available for NMR spectra like the optional isotopic labelling – Information about sample preparation will be available in a future planned release | |
Only fullscan and fragmentation LC spectra (LC–MS / LC–MS/MS) “chromatography” sheet | Informs users about chromatography data (column brand, type, name, length, diameter, flow rate, injection volume, gradient…) – The column characteristics can be used as filter in Web service | |
Only full scan GC spectra (GC–MS) “chromatography” sheet | Informs users about chromatography data (column brand, type, name, length, diameter, …) – The column characteristics can be used as filter in Web service | |
Only fullscan IC spectra (IC-MS) “chromatography” sheet | Informs users about chromatography data (column brand, type, name, length, diameter, …) | |
All mass spectra “MS_analyzer” sheet “GCMS_analyzer” sheet for LC and IC spectra | Informs users about instrument characteristics and settings for the acquisition | |
All mass spectra “MS_analyzer” sheet for LC and IC spectra “GCMS_analyzer” sheet for LC and IC spectra | Informs users about instrument characteristics and settings for the acquisition | |
For all NMR spectra “NMR_analyzer” sheet | Informs users about instrument and software characteristics and settings for the acquisition and processing parameters | |
For all spectra “Other” sheet | Informs users about the spectrum’s authors, ownership, raw file, … |
Fig. 3Spectral card examples related to L-Tryptophan; pH 7.0; HSQC-2D (hsqcetgp)—600 MHz card (A) and Urea; GC-EI-QTOF; MS; 2 TMS card (B)
Fig. 4Screenshot of the PeakForest search modules with the “quick search” tool (A), and the advanced search tool for compounds (B) and for NMR data (C)
Fig. 5Example of an inter-platform workflow strategy used to generate LC-HRMS/MS, NMR and other orthogonal data related to unknown metabolites in biological matrices, and subsequent PeakForest database enrichment with biological compounds identified with a confidence “MSI level 2”