Literature DB >> 27856765

Topic modeling for untargeted substructure exploration in metabolomics.

Justin Johan Jozias van der Hooft1,2, Joe Wandy1,3, Michael P Barrett1,4, Karl E V Burgess1, Simon Rogers5,3.   

Abstract

The potential of untargeted metabolomics to answer important questions across the life sciences is hindered because of a paucity of computational tools that enable extraction of key biochemically relevant information. Available tools focus on using mass spectrometry fragmentation spectra to identify molecules whose behavior suggests they are relevant to the system under study. Unfortunately, fragmentation spectra cannot identify molecules in isolation but require authentic standards or databases of known fragmented molecules. Fragmentation spectra are, however, replete with information pertaining to the biochemical processes present, much of which is currently neglected. Here, we present an analytical workflow that exploits all fragmentation data from a given experiment to extract biochemically relevant features in an unsupervised manner. We demonstrate that an algorithm originally used for text mining, latent Dirichlet allocation, can be adapted to handle metabolomics datasets. Our approach extracts biochemically relevant molecular substructures ("Mass2Motifs") from spectra as sets of co-occurring molecular fragments and neutral losses. The analysis allows us to isolate molecular substructures, whose presence allows molecules to be grouped based on shared substructures regardless of classical spectral similarity. These substructures, in turn, support putative de novo structural annotation of molecules. Combining this spectral connectivity to orthogonal correlations (e.g., common abundance changes under system perturbation) significantly enhances our ability to provide mechanistic explanations for biological behavior.

Entities:  

Keywords:  bioinformatics; fragmentation; mass spectrometry; metabolomics; topic modeling

Mesh:

Year:  2016        PMID: 27856765      PMCID: PMC5137707          DOI: 10.1073/pnas.1608041113

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  25 in total

1.  Searching molecular structure databases with tandem mass spectra using CSI:FingerID.

Authors:  Kai Dührkop; Huibin Shen; Marvin Meusel; Juho Rousu; Sebastian Böcker
Journal:  Proc Natl Acad Sci U S A       Date:  2015-09-21       Impact factor: 11.205

2.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification.

Authors:  Colin A Smith; Elizabeth J Want; Grace O'Maille; Ruben Abagyan; Gary Siuzdak
Journal:  Anal Chem       Date:  2006-02-01       Impact factor: 6.986

3.  The latent process decomposition of cDNA microarray data sets.

Authors:  Simon Rogers; Mark Girolami; Colin Campbell; Rainer Breitling
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2005 Apr-Jun       Impact factor: 3.710

4.  MS/MS networking guided analysis of molecule and gene cluster families.

Authors:  Don Duy Nguyen; Cheng-Hsuan Wu; Wilna J Moree; Anne Lamsa; Marnix H Medema; Xiling Zhao; Ronnie G Gavilan; Marystella Aparicio; Librada Atencio; Chanaye Jackson; Javier Ballesteros; Joel Sanchez; Jeramie D Watrous; Vanessa V Phelan; Corine van de Wiel; Roland D Kersten; Samina Mehnaz; René De Mot; Elizabeth A Shank; Pep Charusanti; Harish Nagarajan; Brendan M Duggan; Bradley S Moore; Nuno Bandeira; Bernhard Ø Palsson; Kit Pogliano; Marcelino Gutiérrez; Pieter C Dorrestein
Journal:  Proc Natl Acad Sci U S A       Date:  2013-06-24       Impact factor: 11.205

5.  PeakML/mzMatch: a file format, Java library, R library, and tool-chain for mass spectrometry data analysis.

Authors:  Richard A Scheltema; Andris Jankevics; Ritsert C Jansen; Morris A Swertz; Rainer Breitling
Journal:  Anal Chem       Date:  2011-03-14       Impact factor: 6.986

6.  Automatic chemical structure annotation of an LC-MS(n) based metabolic profile from green tea.

Authors:  Lars Ridder; Justin J J van der Hooft; Stefan Verhoeven; Ric C H de Vos; Raoul J Bino; Jacques Vervoort
Journal:  Anal Chem       Date:  2013-05-31       Impact factor: 6.986

7.  HMDB 3.0--The Human Metabolome Database in 2013.

Authors:  David S Wishart; Timothy Jewison; An Chi Guo; Michael Wilson; Craig Knox; Yifeng Liu; Yannick Djoumbou; Rupasri Mandal; Farid Aziat; Edison Dong; Souhaila Bouatra; Igor Sinelnikov; David Arndt; Jianguo Xia; Philip Liu; Faizath Yallou; Trent Bjorndahl; Rolando Perez-Pineiro; Roman Eisner; Felicity Allen; Vanessa Neveu; Russ Greiner; Augustin Scalbert
Journal:  Nucleic Acids Res       Date:  2012-11-17       Impact factor: 16.971

8.  MS2Analyzer: A software for small molecule substructure annotations from accurate tandem mass spectra.

Authors:  Yan Ma; Tobias Kind; Dawei Yang; Carlos Leon; Oliver Fiehn
Journal:  Anal Chem       Date:  2014-10-14       Impact factor: 6.986

9.  MS-DIAL: data-independent MS/MS deconvolution for comprehensive metabolome analysis.

Authors:  Hiroshi Tsugawa; Tomas Cajka; Tobias Kind; Yan Ma; Brendan Higgins; Kazutaka Ikeda; Mitsuhiro Kanazawa; Jean VanderGheynst; Oliver Fiehn; Masanori Arita
Journal:  Nat Methods       Date:  2015-05-04       Impact factor: 28.547

10.  Urinary antihypertensive drug metabolite screening using molecular networking coupled to high-resolution mass spectrometry fragmentation.

Authors:  Justin J J van der Hooft; Sandosh Padmanabhan; Karl E V Burgess; Michael P Barrett
Journal:  Metabolomics       Date:  2016-07-05       Impact factor: 4.290

View more
  66 in total

1.  Recent advances and prospects of computational methods for metabolite identification: a review with emphasis on machine learning approaches.

Authors:  Dai Hai Nguyen; Canh Hao Nguyen; Hiroshi Mamitsuka
Journal:  Brief Bioinform       Date:  2019-11-27       Impact factor: 11.622

2.  compMS2Miner: An Automatable Metabolite Identification, Visualization, and Data-Sharing R Package for High-Resolution LC-MS Data Sets.

Authors:  William M B Edmands; Lauren Petrick; Dinesh K Barupal; Augustin Scalbert; Mark J Wilson; Jeffrey K Wickliffe; Stephen M Rappaport
Journal:  Anal Chem       Date:  2017-03-27       Impact factor: 6.986

Review 3.  Metabolomics and genomics in natural products research: complementary tools for targeting new chemical entities.

Authors:  Lindsay K Caesar; Rana Montaser; Nancy P Keller; Neil L Kelleher
Journal:  Nat Prod Rep       Date:  2021-11-17       Impact factor: 13.423

4.  Multi-Omic Profiling of Melophlus Sponges Reveals Diverse Metabolomic and Microbiome Architectures that Are Non-overlapping with Ecological Neighbors.

Authors:  Ipsita Mohanty; Sheila Podell; Jason S Biggs; Neha Garg; Eric E Allen; Vinayak Agarwal
Journal:  Mar Drugs       Date:  2020-02-19       Impact factor: 5.118

5.  Chemically informed analyses of metabolomics mass spectrometry data with Qemistree.

Authors:  Anupriya Tripathi; Yoshiki Vázquez-Baeza; Julia M Gauglitz; Mingxun Wang; Kai Dührkop; Mélissa Nothias-Esposito; Deepa D Acharya; Madeleine Ernst; Justin J J van der Hooft; Qiyun Zhu; Daniel McDonald; Asker D Brejnrod; Antonio Gonzalez; Jo Handelsman; Markus Fleischauer; Marcus Ludwig; Sebastian Böcker; Louis-Félix Nothias; Rob Knight; Pieter C Dorrestein
Journal:  Nat Chem Biol       Date:  2020-11-16       Impact factor: 15.040

6.  Systematic classification of unknown metabolites using high-resolution fragmentation mass spectra.

Authors:  Kai Dührkop; Louis-Félix Nothias; Markus Fleischauer; Raphael Reher; Marcus Ludwig; Martin A Hoffmann; Daniel Petras; William H Gerwick; Juho Rousu; Pieter C Dorrestein; Sebastian Böcker
Journal:  Nat Biotechnol       Date:  2020-11-23       Impact factor: 54.908

7.  Gestational age-dependent development of the neonatal metabolome.

Authors:  Madeleine Ernst; Simon Rogers; Ulrik Lausten-Thomsen; Anders Björkbom; Susan Svane Laursen; Julie Courraud; Anders Børglum; Merete Nordentoft; Thomas Werge; Preben Bo Mortensen; David M Hougaard; Arieh S Cohen
Journal:  Pediatr Res       Date:  2020-09-17       Impact factor: 3.756

Review 8.  Microbial natural product databases: moving forward in the multi-omics era.

Authors:  Jeffrey A van Santen; Satria A Kautsar; Marnix H Medema; Roger G Linington
Journal:  Nat Prod Rep       Date:  2020-08-28       Impact factor: 13.423

Review 9.  Mining genomes to illuminate the specialized chemistry of life.

Authors:  Marnix H Medema; Tristan de Rond; Bradley S Moore
Journal:  Nat Rev Genet       Date:  2021-06-03       Impact factor: 53.242

Review 10.  Ecology and genomics of Actinobacteria: new concepts for natural product discovery.

Authors:  Doris A van Bergeijk; Barbara R Terlouw; Marnix H Medema; Gilles P van Wezel
Journal:  Nat Rev Microbiol       Date:  2020-06-01       Impact factor: 60.633

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.