Literature DB >> 31277571

Improving MetFrag with statistical learning of fragment annotations.

Christoph Ruttkies1, Steffen Neumann2,3, Stefan Posch4.   

Abstract

BACKGROUND: Molecule identification is a crucial step in metabolomics and environmental sciences. Besides in silico fragmentation, as performed by MetFrag, also machine learning and statistical methods evolved, showing an improvement in molecule annotation based on MS/MS data. In this work we present a new statistical scoring method where annotations of m/z fragment peaks to fragment-structures are learned in a training step. Based on a Bayesian model, two additional scoring terms are integrated into the new MetFrag2.4.5 and evaluated on the test data set of the CASMI 2016 contest.
RESULTS: The results on the 87 MS/MS spectra from positive and negative mode show a substantial improvement of the results compared to submissions made by the former MetFrag approach. Top1 rankings increased from 5 to 21 and Top10 rankings from 39 to 55 both showing higher values than for CSI:IOKR, the winner of the CASMI 2016 contest. For the negative mode spectra, MetFrag's statistical scoring outperforms all other participants which submitted results for this type of spectra.
CONCLUSIONS: This study shows how statistical learning can improve molecular structure identification based on MS/MS data compared on the same method using combinatorial in silico fragmentation only. MetFrag2.4.5 shows especially in negative mode a better performance compared to the other participating approaches.

Entities:  

Keywords:  Identification; Mass spectrometry; Statistical modeling

Mesh:

Year:  2019        PMID: 31277571      PMCID: PMC6612146          DOI: 10.1186/s12859-019-2954-7

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  15 in total

1.  Searching molecular structure databases with tandem mass spectra using CSI:FingerID.

Authors:  Kai Dührkop; Huibin Shen; Marvin Meusel; Juho Rousu; Sebastian Böcker
Journal:  Proc Natl Acad Sci U S A       Date:  2015-09-21       Impact factor: 11.205

2.  Automatic Compound Annotation from Mass Spectrometry Data Using MAGMa.

Authors:  Lars Ridder; Justin J J van der Hooft; Stefan Verhoeven
Journal:  Mass Spectrom (Tokyo)       Date:  2014-07-02

3.  Metabolite identification and molecular fingerprint prediction through machine learning.

Authors:  Markus Heinonen; Huibin Shen; Nicola Zamboni; Juho Rousu
Journal:  Bioinformatics       Date:  2012-07-18       Impact factor: 6.937

4.  Hydrogen Rearrangement Rules: Computational MS/MS Fragmentation and Structure Elucidation Using MS-FINDER Software.

Authors:  Hiroshi Tsugawa; Tobias Kind; Ryo Nakabayashi; Daichi Yukihira; Wataru Tanaka; Tomas Cajka; Kazuki Saito; Oliver Fiehn; Masanori Arita
Journal:  Anal Chem       Date:  2016-08-04       Impact factor: 6.986

5.  In silico fragmentation for computer assisted identification of metabolite mass spectra.

Authors:  Sebastian Wolf; Stephan Schmidt; Matthias Müller-Hannemann; Steffen Neumann
Journal:  BMC Bioinformatics       Date:  2010-03-22       Impact factor: 3.169

6.  Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking.

Authors:  Mingxun Wang; Jeremy J Carver; Vanessa V Phelan; Laura M Sanchez; Neha Garg; Yao Peng; Don Duy Nguyen; Jeramie Watrous; Clifford A Kapono; Tal Luzzatto-Knaan; Carla Porto; Amina Bouslimani; Alexey V Melnik; Michael J Meehan; Wei-Ting Liu; Max Crüsemann; Paul D Boudreau; Eduardo Esquenazi; Mario Sandoval-Calderón; Roland D Kersten; Laura A Pace; Robert A Quinn; Katherine R Duncan; Cheng-Chih Hsu; Dimitrios J Floros; Ronnie G Gavilan; Karin Kleigrewe; Trent Northen; Rachel J Dutton; Delphine Parrot; Erin E Carlson; Bertrand Aigle; Charlotte F Michelsen; Lars Jelsbak; Christian Sohlenkamp; Pavel Pevzner; Anna Edlund; Jeffrey McLean; Jörn Piel; Brian T Murphy; Lena Gerwick; Chih-Chuang Liaw; Yu-Liang Yang; Hans-Ulrich Humpf; Maria Maansson; Robert A Keyzers; Amy C Sims; Andrew R Johnson; Ashley M Sidebottom; Brian E Sedio; Andreas Klitgaard; Charles B Larson; Cristopher A Boya P; Daniel Torres-Mendoza; David J Gonzalez; Denise B Silva; Lucas M Marques; Daniel P Demarque; Egle Pociute; Ellis C O'Neill; Enora Briand; Eric J N Helfrich; Eve A Granatosky; Evgenia Glukhov; Florian Ryffel; Hailey Houson; Hosein Mohimani; Jenan J Kharbush; Yi Zeng; Julia A Vorholt; Kenji L Kurita; Pep Charusanti; Kerry L McPhail; Kristian Fog Nielsen; Lisa Vuong; Maryam Elfeki; Matthew F Traxler; Niclas Engene; Nobuhiro Koyama; Oliver B Vining; Ralph Baric; Ricardo R Silva; Samantha J Mascuch; Sophie Tomasi; Stefan Jenkins; Venkat Macherla; Thomas Hoffman; Vinayak Agarwal; Philip G Williams; Jingqui Dai; Ram Neupane; Joshua Gurr; Andrés M C Rodríguez; Anne Lamsa; Chen Zhang; Kathleen Dorrestein; Brendan M Duggan; Jehad Almaliti; Pierre-Marie Allard; Prasad Phapale; Louis-Felix Nothias; Theodore Alexandrov; Marc Litaudon; Jean-Luc Wolfender; Jennifer E Kyle; Thomas O Metz; Tyler Peryea; Dac-Trung Nguyen; Danielle VanLeer; Paul Shinn; Ajit Jadhav; Rolf Müller; Katrina M Waters; Wenyuan Shi; Xueting Liu; Lixin Zhang; Rob Knight; Paul R Jensen; Bernhard O Palsson; Kit Pogliano; Roger G Linington; Marcelino Gutiérrez; Norberto P Lopes; William H Gerwick; Bradley S Moore; Pieter C Dorrestein; Nuno Bandeira
Journal:  Nat Biotechnol       Date:  2016-08-09       Impact factor: 54.908

7.  InChI, the IUPAC International Chemical Identifier.

Authors:  Stephen R Heller; Alan McNaught; Igor Pletnev; Stephen Stein; Dmitrii Tchekhovskoi
Journal:  J Cheminform       Date:  2015-05-30       Impact factor: 5.514

8.  Critical Assessment of Small Molecule Identification 2016: automated methods.

Authors:  Emma L Schymanski; Christoph Ruttkies; Martin Krauss; Céline Brouard; Tobias Kind; Kai Dührkop; Felicity Allen; Arpana Vaniya; Dries Verdegem; Sebastian Böcker; Juho Rousu; Huibin Shen; Hiroshi Tsugawa; Tanvir Sajed; Oliver Fiehn; Bart Ghesquière; Steffen Neumann
Journal:  J Cheminform       Date:  2017-03-27       Impact factor: 5.514

9.  MetFrag relaunched: incorporating strategies beyond in silico fragmentation.

Authors:  Christoph Ruttkies; Emma L Schymanski; Sebastian Wolf; Juliane Hollender; Steffen Neumann
Journal:  J Cheminform       Date:  2016-01-29       Impact factor: 5.514

10.  PubChem Substance and Compound databases.

Authors:  Sunghwan Kim; Paul A Thiessen; Evan E Bolton; Jie Chen; Gang Fu; Asta Gindulyte; Lianyi Han; Jane He; Siqian He; Benjamin A Shoemaker; Jiyao Wang; Bo Yu; Jian Zhang; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2015-09-22       Impact factor: 16.971

View more
  9 in total

Review 1.  Prevalence and Implications of Per- and Polyfluoroalkyl Substances (PFAS) in Settled Dust.

Authors:  Tina Savvaides; Jeremy P Koelmel; Yakun Zhou; Elizabeth Z Lin; Paul Stelben; Juan J Aristizabal-Henao; John A Bowden; Krystal J Godri Pollitt
Journal:  Curr Environ Health Rep       Date:  2022-01-05

Review 2.  Operationalizing the Exposome Using Passive Silicone Samplers.

Authors:  Zoe Coates Fuentes; Yuri Levin Schwartz; Anna R Robuck; Douglas I Walker
Journal:  Curr Pollut Rep       Date:  2022-01-04

Review 3.  Marine dissolved organic matter: a vast and unexplored molecular space.

Authors:  Teresa S Catalá; Spencer Shorte; Thorsten Dittmar
Journal:  Appl Microbiol Biotechnol       Date:  2021-09-18       Impact factor: 4.813

Review 4.  Networks and Graphs Discovery in Metabolomics Data Analysis and Interpretation.

Authors:  Adam Amara; Clément Frainay; Fabien Jourdan; Thomas Naake; Steffen Neumann; Elva María Novoa-Del-Toro; Reza M Salek; Liesa Salzer; Sarah Scharfenberg; Michael Witting
Journal:  Front Mol Biosci       Date:  2022-03-08

5.  Augmentation of MS/MS Libraries with Spectral Interpolation for Improved Identification.

Authors:  Ethan King; Richard Overstreet; Julia Nguyen; Danielle Ciesielski
Journal:  J Chem Inf Model       Date:  2022-07-29       Impact factor: 6.162

Review 6.  Metabolomics in clinical and forensic toxicology, sports anti-doping and veterinary residues.

Authors:  Bethany Keen; Adam Cawley; Brian Reedy; Shanlin Fu
Journal:  Drug Test Anal       Date:  2022-03-08       Impact factor: 3.234

7.  Probabilistic framework for integration of mass spectrum and retention time information in small molecule identification.

Authors:  Eric Bach; Simon Rogers; John Williamson; Juho Rousu
Journal:  Bioinformatics       Date:  2021-07-19       Impact factor: 6.937

Review 8.  From Samples to Insights into Metabolism: Uncovering Biologically Relevant Information in LC-HRMS Metabolomics Data.

Authors:  Julijana Ivanisevic; Elizabeth J Want
Journal:  Metabolites       Date:  2019-12-17

9.  MassGenie: A Transformer-Based Deep Learning Method for Identifying Small Molecules from Their Mass Spectra.

Authors:  Aditya Divyakant Shrivastava; Neil Swainston; Soumitra Samanta; Ivayla Roberts; Marina Wright Muelas; Douglas B Kell
Journal:  Biomolecules       Date:  2021-11-30
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.