Literature DB >> 18689839

Towards de novo identification of metabolites by analyzing tandem mass spectra.

Sebastian Böcker1, Florian Rasche.   

Abstract

MOTIVATION: Mass spectrometry is among the most widely used technologies in proteomics and metabolomics. Being a high-throughput method, it produces large amounts of data that necessitates an automated analysis of the spectra. Clearly, database search methods for protein analysis can easily be adopted to analyze metabolite mass spectra. But for metabolites, de novo interpretation of spectra is even more important than for protein data, because metabolite spectra databases cover only a small fraction of naturally occurring metabolites: even the model plant Arabidopsis thaliana has a large number of enzymes whose substrates and products remain unknown. The field of bio-prospection searches biologically diverse areas for metabolites which might serve as pharmaceuticals. De novo identification of metabolite mass spectra requires new concepts and methods since, unlike proteins, metabolites possess a non-linear molecular structure.
RESULTS: In this work, we introduce a method for fully automated de novo identification of metabolites from tandem mass spectra. Mass spectrometry data is usually assumed to be insufficient for identification of molecular structures, so we want to estimate the molecular formula of the unknown metabolite, a crucial step for its identification. The method first calculates all molecular formulas that explain the parent peak mass. Then, a graph is build where vertices correspond to molecular formulas of all peaks in the fragmentation mass spectra, whereas edges correspond to hypothetical fragmentation steps. Our algorithm afterwards calculates the maximum scoring subtree of this graph: each peak in the spectra must be scored at most once, so the subtree shall contain only one explanation per peak. Unfortunately, finding this subtree is NP-hard. We suggest three exact algorithms (including one fixed parameter tractable algorithm) as well as two heuristics to solve the problem. Tests on real mass spectra show that the FPT algorithm and the heuristics solve the problem suitably fast and provide excellent results: for all 32 test compounds the correct solution was among the top five suggestions, for 26 compounds the first suggestion of the exact algorithm was correct. AVAILABILITY: http://www.bio.inf.uni-jena.de/tandemms

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18689839     DOI: 10.1093/bioinformatics/btn270

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  31 in total

1.  Searching molecular structure databases with tandem mass spectra using CSI:FingerID.

Authors:  Kai Dührkop; Huibin Shen; Marvin Meusel; Juho Rousu; Sebastian Böcker
Journal:  Proc Natl Acad Sci U S A       Date:  2015-09-21       Impact factor: 11.205

2.  Using fragmentation trees and mass spectral trees for identifying unknown compounds in metabolomics.

Authors:  Arpana Vaniya; Oliver Fiehn
Journal:  Trends Analyt Chem       Date:  2015-06-01       Impact factor: 12.296

3.  Winners of CASMI2013: Automated Tools and Challenge Data.

Authors:  Takaaki Nishioka; Takeshi Kasama; Tomoya Kinumi; Hidefumi Makabe; Fumio Matsuda; Daisuke Miura; Masahiro Miyashita; Takemichi Nakamura; Ken Tanaka; Atsushi Yamamoto
Journal:  Mass Spectrom (Tokyo)       Date:  2014-09-02

4.  Molecular Formula Identification Using Isotope Pattern Analysis and Calculation of Fragmentation Trees.

Authors:  Kai Dührkop; Franziska Hufsky; Sebastian Böcker
Journal:  Mass Spectrom (Tokyo)       Date:  2014-07-18

5.  Recent advances and prospects of computational methods for metabolite identification: a review with emphasis on machine learning approaches.

Authors:  Dai Hai Nguyen; Canh Hao Nguyen; Hiroshi Mamitsuka
Journal:  Brief Bioinform       Date:  2019-11-27       Impact factor: 11.622

6.  Identification of Vitamin D3 Oxidation Products Using High-Resolution and Tandem Mass Spectrometry.

Authors:  Fatemeh Mahmoodani; Conrad O Perera; Grant Abernethy; Bruno Fedrizzi; David Greenwood; Hong Chen
Journal:  J Am Soc Mass Spectrom       Date:  2018-03-19       Impact factor: 3.109

7.  SIRIUS: decomposing isotope patterns for metabolite identification.

Authors:  Sebastian Böcker; Matthias C Letzel; Zsuzsanna Lipták; Anton Pervukhin
Journal:  Bioinformatics       Date:  2008-11-17       Impact factor: 6.937

8.  Assessment of metabolome annotation quality: a method for evaluating the false discovery rate of elemental composition searches.

Authors:  Fumio Matsuda; Yoko Shinbo; Akira Oikawa; Masami Yokota Hirai; Oliver Fiehn; Shigehiko Kanaya; Kazuki Saito
Journal:  PLoS One       Date:  2009-10-16       Impact factor: 3.240

9.  Computational mass spectrometry for small molecules.

Authors:  Kerstin Scheubert; Franziska Hufsky; Sebastian Böcker
Journal:  J Cheminform       Date:  2013-03-01       Impact factor: 5.514

10.  Fragmentation trees for the structural characterisation of metabolites.

Authors:  Piotr T Kasper; Miguel Rojas-Chertó; Robert Mistrik; Theo Reijmers; Thomas Hankemeier; Rob J Vreeken
Journal:  Rapid Commun Mass Spectrom       Date:  2012-10-15       Impact factor: 2.419

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.