Literature DB >> 33085002

MetaClean: a machine learning-based classifier for reduced false positive peak detection in untargeted LC-MS metabolomics data.

Kelsey Chetnik1, Lauren Petrick2,3, Gaurav Pandey4,5.   

Abstract

INTRODUCTION: Despite the availability of several pre-processing software, poor peak integration remains a prevalent problem in untargeted metabolomics data generated using liquid chromatography high-resolution mass spectrometry (LC-MS). As a result, the output of these pre-processing software may retain incorrectly calculated metabolite abundances that can perpetuate in downstream analyses.
OBJECTIVES: To address this problem, we propose a computational methodology that combines machine learning and peak quality metrics to filter out low quality peaks.
METHODS: Specifically, we comprehensively and systematically compared the performance of 24 different classifiers generated by combining eight classification algorithms and three sets of peak quality metrics on the task of distinguishing reliably integrated peaks from poorly integrated ones. These classifiers were compared to using a residual standard deviation (RSD) cut-off in pooled quality-control (QC) samples, which aims to remove peaks with analytical error.
RESULTS: The best performing classifier was found to be a combination of the AdaBoost algorithm and a set of 11 peak quality metrics previously explored in untargeted metabolomics and proteomics studies. As a complementary approach, applying our framework to peaks retained after filtering by 30% RSD across pooled QC samples was able to further distinguish poorly integrated peaks that were not removed from filtering alone. An R implementation of these classifiers and the overall computational approach is available as the MetaClean package at https://CRAN.R-project.org/package=MetaClean .
CONCLUSION: Our work represents an important step forward in developing an automated tool for filtering out unreliable peak integrations in untargeted LC-MS metabolomics data.

Entities:  

Keywords:  Machine learning; Metabolomics; Peak integration; Pre-processing; Quality control; Untargeted

Mesh:

Year:  2020        PMID: 33085002      PMCID: PMC7895495          DOI: 10.1007/s11306-020-01738-3

Source DB:  PubMed          Journal:  Metabolomics        ISSN: 1573-3882            Impact factor:   4.290


  22 in total

1.  Procedures for large-scale metabolic profiling of serum and plasma using gas chromatography and liquid chromatography coupled to mass spectrometry.

Authors:  Warwick B Dunn; David Broadhurst; Paul Begley; Eva Zelena; Sue Francis-McIntyre; Nadine Anderson; Marie Brown; Joshau D Knowles; Antony Halsall; John N Haselden; Andrew W Nicholls; Ian D Wilson; Douglas B Kell; Royston Goodacre
Journal:  Nat Protoc       Date:  2011-06-30       Impact factor: 13.491

2.  Detailed Investigation and Comparison of the XCMS and MZmine 2 Chromatogram Construction and Chromatographic Peak Detection Methods for Preprocessing Mass Spectrometry Metabolomics Data.

Authors:  Owen D Myers; Susan J Sumner; Shuzhao Li; Stephen Barnes; Xiuxia Du
Journal:  Anal Chem       Date:  2017-08-17       Impact factor: 6.986

3.  Metabolomic Profiling of Submaximal Exercise at a Standardised Relative Intensity in Healthy Adults.

Authors:  Ali Muhsen Ali; Mia Burleigh; Evangelia Daskalaki; Tong Zhang; Chris Easton; David G Watson
Journal:  Metabolites       Date:  2016-02-26

4.  Metabolomics Workbench: An international repository for metabolomics data and metadata, metabolite standards, protocols, tutorials and training, and analysis tools.

Authors:  Manish Sud; Eoin Fahy; Dawn Cotter; Kenan Azam; Ilango Vadivelu; Charles Burant; Arthur Edison; Oliver Fiehn; Richard Higashi; K Sreekumaran Nair; Susan Sumner; Shankar Subramaniam
Journal:  Nucleic Acids Res       Date:  2015-10-13       Impact factor: 16.971

Review 5.  Guidelines and considerations for the use of system suitability and quality control samples in mass spectrometry assays applied in untargeted clinical metabolomic studies.

Authors:  David Broadhurst; Royston Goodacre; Stacey N Reinke; Julia Kuligowski; Ian D Wilson; Matthew R Lewis; Warwick B Dunn
Journal:  Metabolomics       Date:  2018-05-18       Impact factor: 4.290

6.  Quality assessment and interference detection in targeted mass spectrometry data using machine learning.

Authors:  Shadi Toghi Eshghi; Paul Auger; W Rodney Mathews
Journal:  Clin Proteomics       Date:  2018-10-06       Impact factor: 3.988

7.  Filtering procedures for untargeted LC-MS metabolomics data.

Authors:  Courtney Schiffman; Lauren Petrick; Kelsi Perttula; Yukiko Yano; Henrik Carlsson; Todd Whitehead; Catherine Metayer; Josie Hayes; Stephen Rappaport; Sandrine Dudoit
Journal:  BMC Bioinformatics       Date:  2019-06-14       Impact factor: 3.169

8.  WiPP: Workflow for Improved Peak Picking for Gas Chromatography-Mass Spectrometry (GC-MS) Data.

Authors:  Nico Borgsmüller; Yoann Gloaguen; Tobias Opialla; Eric Blanc; Emilie Sicard; Anne-Lise Royer; Bruno Le Bizec; Stéphanie Durand; Carole Migné; Mélanie Pétéra; Estelle Pujos-Guillot; Franck Giacomoni; Yann Guitton; Dieter Beule; Jennifer Kirwan
Journal:  Metabolites       Date:  2019-08-21

9.  Lipid metabolites as potential diagnostic and prognostic biomarkers for acute community acquired pneumonia.

Authors:  Kelvin K W To; Kim-Chung Lee; Samson S Y Wong; Kong-Hung Sze; Yi-Hong Ke; Yin-Ming Lui; Bone S F Tang; Iris W S Li; Susanna K P Lau; Ivan F N Hung; Chun-Yiu Law; Ching-Wan Lam; Kwok-Yung Yuen
Journal:  Diagn Microbiol Infect Dis       Date:  2016-03-14       Impact factor: 2.803

10.  xMSanalyzer: automated pipeline for improved feature detection and downstream analysis of large-scale, non-targeted metabolomics data.

Authors:  Karan Uppal; Quinlyn A Soltow; Frederick H Strobel; W Stephen Pittard; Kim M Gernert; Tianwei Yu; Dean P Jones
Journal:  BMC Bioinformatics       Date:  2013-01-16       Impact factor: 3.169

View more
  11 in total

1.  IDSL.IPA Characterizes the Organic Chemical Space in Untargeted LC/HRMS Data Sets.

Authors:  Sadjad Fakouri Baygi; Yashwant Kumar; Dinesh Kumar Barupal
Journal:  J Proteome Res       Date:  2022-05-17       Impact factor: 5.370

Review 2.  New software tools, databases, and resources in metabolomics: updates from 2020.

Authors:  Biswapriya B Misra
Journal:  Metabolomics       Date:  2021-05-11       Impact factor: 4.290

3.  Tooth biomarkers to characterize the temporal dynamics of the fetal and early-life exposome.

Authors:  Miao Yu; Peijun Tu; Georgia Dolios; Priyanthi S Dassanayake; Heather Volk; Craig Newschaffer; M Daniele Fallin; Lisa Croen; Kristen Lyall; Rebecca Schmidt; Irva Hertz-Piccioto; Christine Austin; Manish Arora; Lauren M Petrick
Journal:  Environ Int       Date:  2021-09-02       Impact factor: 9.621

Review 4.  Mining plant metabolomes: Methods, applications, and perspectives.

Authors:  Aimin Ma; Xiaoquan Qi
Journal:  Plant Commun       Date:  2021-09-04

5.  Comprehensive Peak Characterization (CPC) in Untargeted LC-MS Analysis.

Authors:  Kristian Pirttilä; David Balgoma; Johannes Rainer; Curt Pettersson; Mikael Hedeland; Carl Brunius
Journal:  Metabolites       Date:  2022-02-02

6.  Deep Learning-Assisted Peak Curation for Large-Scale LC-MS Metabolomics.

Authors:  Yoann Gloaguen; Jennifer A Kirwan; Dieter Beule
Journal:  Anal Chem       Date:  2022-03-15       Impact factor: 6.986

7.  AI/ML-driven advances in untargeted metabolomics and exposomics for biomedical applications.

Authors:  Lauren M Petrick; Noam Shomron
Journal:  Cell Rep Phys Sci       Date:  2022-07-20

8.  Recurrent Topics in Mass Spectrometry-Based Metabolomics and Lipidomics-Standardization, Coverage, and Throughput.

Authors:  Evelyn Rampler; Yasin El Abiead; Harald Schoeny; Mate Rusz; Felina Hildebrand; Veronika Fitz; Gunda Koellensperger
Journal:  Anal Chem       Date:  2020-11-28       Impact factor: 6.986

9.  Commentary: Data Processing Thresholds for Abundance and Sparsity and Missed Biological Insights in an Untargeted Chemical Analysis of Blood Specimens for Exposomics.

Authors:  Pekka Keski-Rahkonen; Oliver Robinson; Rossella Alfano; Michelle Plusquin; Augustin Scalbert
Journal:  Front Public Health       Date:  2022-01-17

10.  Data Processing Thresholds for Abundance and Sparsity and Missed Biological Insights in an Untargeted Chemical Analysis of Blood Specimens for Exposomics.

Authors:  Dinesh Kumar Barupal; Sadjad Fakouri Baygi; Robert O Wright; Manish Arora
Journal:  Front Public Health       Date:  2021-06-10
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.