Yuanyue Li1, Michael Kuhn1, Anne-Claude Gavin1,2, Peer Bork1,2,3,4. 1. Structural and Computational Biology Unit, European Molecular Biology Laboratory, 69117 Heidelberg, Germany. 2. Molecular Medicine Partnership Unit (MMPU), 69117 Heidelberg, Germany. 3. Max Delbrück Center for Molecular Medicine, 13125 Berlin, Germany. 4. Department of Bioinformatics, Biocenter, University of Würzburg, 97074 Würzburg, Germany.
Abstract
MOTIVATION: Untargeted mass spectrometry (MS/MS) is a powerful method for detecting metabolites in biological samples. However, fast and accurate identification of the metabolites' structures from MS/MS spectra is still a great challenge. RESULTS: We present a new analysis method, called SubFragment-Matching (SF-Matching) that is based on the hypothesis that molecules with similar structural features will exhibit similar fragmentation patterns. We combine information on fragmentation patterns of molecules with shared substructures and then use random forest models to predict whether a given structure can yield a certain fragmentation pattern. These models can then be used to score candidate molecules for a given mass spectrum. For rapid identification, we pre-compute such scores for common biological molecular structure databases. Using benchmarking datasets, we find that our method has similar performance to CSI: FingerID and those very high accuracies can be achieved by combining our method with CSI: FingerID. Rarefaction analysis of the training dataset shows that the performance of our method will increase as more experimental data become available. AVAILABILITY AND IMPLEMENTATION: SF-Matching is available from http://www.bork.embl.de/Docu/sf_matching. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Untargeted mass spectrometry (MS/MS) is a powerful method for detecting metabolites in biological samples. However, fast and accurate identification of the metabolites' structures from MS/MS spectra is still a great challenge. RESULTS: We present a new analysis method, called SubFragment-Matching (SF-Matching) that is based on the hypothesis that molecules with similar structural features will exhibit similar fragmentation patterns. We combine information on fragmentation patterns of molecules with shared substructures and then use random forest models to predict whether a given structure can yield a certain fragmentation pattern. These models can then be used to score candidate molecules for a given mass spectrum. For rapid identification, we pre-compute such scores for common biological molecular structure databases. Using benchmarking datasets, we find that our method has similar performance to CSI: FingerID and those very high accuracies can be achieved by combining our method with CSI: FingerID. Rarefaction analysis of the training dataset shows that the performance of our method will increase as more experimental data become available. AVAILABILITY AND IMPLEMENTATION: SF-Matching is available from http://www.bork.embl.de/Docu/sf_matching. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Carlos Hernan Becerra-Mojica; Miguel Antonio Parra-Saavedra; Luis Alfonso Diaz-Martinez; Raigam Jafet Martinez-Portilla; Bladimiro Rincon Orozco Journal: BMJ Open Date: 2022-05-30 Impact factor: 3.006
Authors: Anupriya Tripathi; Yoshiki Vázquez-Baeza; Julia M Gauglitz; Mingxun Wang; Kai Dührkop; Mélissa Nothias-Esposito; Deepa D Acharya; Madeleine Ernst; Justin J J van der Hooft; Qiyun Zhu; Daniel McDonald; Asker D Brejnrod; Antonio Gonzalez; Jo Handelsman; Markus Fleischauer; Marcus Ludwig; Sebastian Böcker; Louis-Félix Nothias; Rob Knight; Pieter C Dorrestein Journal: Nat Chem Biol Date: 2020-11-16 Impact factor: 15.040
Authors: Flávia Luísa Dias-Audibert; Luiz Claudio Navarro; Diogo Noin de Oliveira; Jeany Delafiori; Carlos Fernando Odir Rodrigues Melo; Tatiane Melina Guerreiro; Flávia Troncon Rosa; Diego Lima Petenuci; Maria Angelica Ehara Watanabe; Licio Augusto Velloso; Anderson Rezende Rocha; Rodrigo Ramos Catharino Journal: Front Bioeng Biotechnol Date: 2020-01-24
Authors: Marcus Miethke; Marco Pieroni; Tilmann Weber; Mark Brönstrup; Peter Hammann; Ludovic Halby; Paola B Arimondo; Philippe Glaser; Bertrand Aigle; Helge B Bode; Rui Moreira; Yanyan Li; Andriy Luzhetskyy; Marnix H Medema; Jean-Luc Pernodet; Marc Stadler; José Rubén Tormo; Olga Genilloud; Andrew W Truman; Kira J Weissman; Eriko Takano; Stefano Sabatini; Evi Stegmann; Heike Brötz-Oesterhelt; Wolfgang Wohlleben; Myriam Seemann; Martin Empting; Anna K H Hirsch; Brigitta Loretz; Claus-Michael Lehr; Alexander Titz; Jennifer Herrmann; Timo Jaeger; Silke Alt; Thomas Hesterkamp; Mathias Winterhalter; Andrea Schiefer; Kenneth Pfarr; Achim Hoerauf; Heather Graz; Michael Graz; Mika Lindvall; Savithri Ramurthy; Anders Karlén; Maarten van Dongen; Hrvoje Petkovic; Andreas Keller; Frédéric Peyrane; Stefano Donadio; Laurent Fraisse; Laura J V Piddock; Ian H Gilbert; Heinz E Moser; Rolf Müller Journal: Nat Rev Chem Date: 2021-08-19 Impact factor: 34.571