Literature DB >> 22815355

Metabolite identification and molecular fingerprint prediction through machine learning.

Markus Heinonen1, Huibin Shen, Nicola Zamboni, Juho Rousu.   

Abstract

MOTIVATION: Metabolite identification from tandem mass spectra is an important problem in metabolomics, underpinning subsequent metabolic modelling and network analysis. Yet, currently this task requires matching the observed spectrum against a database of reference spectra originating from similar equipment and closely matching operating parameters, a condition that is rarely satisfied in public repositories. Furthermore, the computational support for identification of molecules not present in reference databases is lacking. Recent efforts in assembling large public mass spectral databases such as MassBank have opened the door for the development of a new genre of metabolite identification methods.
RESULTS: We introduce a novel framework for prediction of molecular characteristics and identification of metabolites from tandem mass spectra using machine learning with the support vector machine. Our approach is to first predict a large set of molecular properties of the unknown metabolite from salient tandem mass spectral signals, and in the second step to use the predicted properties for matching against large molecule databases, such as PubChem. We demonstrate that several molecular properties can be predicted to high accuracy and that they are useful in de novo metabolite identification, where the reference database does not contain any spectra of the same molecule. AVAILABILITY: An Matlab/Python package of the FingerID tool is freely available on the web at http://www.sourceforge.net/p/fingerid. CONTACT: markus.heinonen@cs.helsinki.fi.

Mesh:

Year:  2012        PMID: 22815355     DOI: 10.1093/bioinformatics/bts437

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  54 in total

1.  Illuminating the dark matter in metabolomics.

Authors:  Ricardo R da Silva; Pieter C Dorrestein; Robert A Quinn
Journal:  Proc Natl Acad Sci U S A       Date:  2015-10-01       Impact factor: 11.205

2.  Searching molecular structure databases with tandem mass spectra using CSI:FingerID.

Authors:  Kai Dührkop; Huibin Shen; Marvin Meusel; Juho Rousu; Sebastian Böcker
Journal:  Proc Natl Acad Sci U S A       Date:  2015-09-21       Impact factor: 11.205

3.  Using fragmentation trees and mass spectral trees for identifying unknown compounds in metabolomics.

Authors:  Arpana Vaniya; Oliver Fiehn
Journal:  Trends Analyt Chem       Date:  2015-06-01       Impact factor: 12.296

4.  Molecular Formula Identification Using Isotope Pattern Analysis and Calculation of Fragmentation Trees.

Authors:  Kai Dührkop; Franziska Hufsky; Sebastian Böcker
Journal:  Mass Spectrom (Tokyo)       Date:  2014-07-18

5.  Target-Decoy-Based False Discovery Rate Estimation for Large-Scale Metabolite Identification.

Authors:  Xusheng Wang; Drew R Jones; Timothy I Shaw; Ji-Hoon Cho; Yuanyuan Wang; Haiyan Tan; Boer Xie; Suiping Zhou; Yuxin Li; Junmin Peng
Journal:  J Proteome Res       Date:  2018-05-29       Impact factor: 4.466

6.  Discriminating precursors of common fragments for large-scale metabolite profiling by triple quadrupole mass spectrometry.

Authors:  Igor Nikolskiy; Gary Siuzdak; Gary J Patti
Journal:  Bioinformatics       Date:  2015-02-16       Impact factor: 6.937

7.  Cognitive analysis of metabolomics data for systems biology.

Authors:  Erica L-W Majumder; Elizabeth M Billings; H Paul Benton; Richard L Martin; Amelia Palermo; Carlos Guijas; Markus M Rinschen; Xavier Domingo-Almenara; J Rafael Montenegro-Burke; Bradley A Tagtow; Robert S Plumb; Gary Siuzdak
Journal:  Nat Protoc       Date:  2021-01-22       Impact factor: 13.491

8.  Recent advances and prospects of computational methods for metabolite identification: a review with emphasis on machine learning approaches.

Authors:  Dai Hai Nguyen; Canh Hao Nguyen; Hiroshi Mamitsuka
Journal:  Brief Bioinform       Date:  2019-11-27       Impact factor: 11.622

9.  Gas Chromatography-Tandem Mass Spectrometry of Lignin Pyrolyzates with Dopant-Assisted Atmospheric Pressure Chemical Ionization and Molecular Structure Search with CSI:FingerID.

Authors:  Evan A Larson; Carolyn P Hutchinson; Young Jin Lee
Journal:  J Am Soc Mass Spectrom       Date:  2018-06-12       Impact factor: 3.109

10.  Chemically informed analyses of metabolomics mass spectrometry data with Qemistree.

Authors:  Anupriya Tripathi; Yoshiki Vázquez-Baeza; Julia M Gauglitz; Mingxun Wang; Kai Dührkop; Mélissa Nothias-Esposito; Deepa D Acharya; Madeleine Ernst; Justin J J van der Hooft; Qiyun Zhu; Daniel McDonald; Asker D Brejnrod; Antonio Gonzalez; Jo Handelsman; Markus Fleischauer; Marcus Ludwig; Sebastian Böcker; Louis-Félix Nothias; Rob Knight; Pieter C Dorrestein
Journal:  Nat Chem Biol       Date:  2020-11-16       Impact factor: 15.040

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.