Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 MetaClean: a machine learning-based classifier for reduced false positive peak detection in untargeted LC-MS metabolomics data.

Literature DB >> 33085002

MetaClean: a machine learning-based classifier for reduced false positive peak detection in untargeted LC-MS metabolomics data.

Kelsey Chetnik¹, Lauren Petrick^2,3, Gaurav Pandey^4,5.

Abstract

INTRODUCTION: Despite the availability of several pre-processing software, poor peak integration remains a prevalent problem in untargeted metabolomics data generated using liquid chromatography high-resolution mass spectrometry (LC-MS). As a result, the output of these pre-processing software may retain incorrectly calculated metabolite abundances that can perpetuate in downstream analyses.
OBJECTIVES: To address this problem, we propose a computational methodology that combines machine learning and peak quality metrics to filter out low quality peaks.
METHODS: Specifically, we comprehensively and systematically compared the performance of 24 different classifiers generated by combining eight classification algorithms and three sets of peak quality metrics on the task of distinguishing reliably integrated peaks from poorly integrated ones. These classifiers were compared to using a residual standard deviation (RSD) cut-off in pooled quality-control (QC) samples, which aims to remove peaks with analytical error.
RESULTS: The best performing classifier was found to be a combination of the AdaBoost algorithm and a set of 11 peak quality metrics previously explored in untargeted metabolomics and proteomics studies. As a complementary approach, applying our framework to peaks retained after filtering by 30% RSD across pooled QC samples was able to further distinguish poorly integrated peaks that were not removed from filtering alone. An R implementation of these classifiers and the overall computational approach is available as the MetaClean package at https://CRAN.R-project.org/package=MetaClean .
CONCLUSION: Our work represents an important step forward in developing an automated tool for filtering out unreliable peak integrations in untargeted LC-MS metabolomics data.

Entities: Chemical Disease Gene Species

Keywords: Machine learning; Metabolomics; Peak integration; Pre-processing; Quality control; Untargeted

Mesh：

Year: 2020 PMID： 33085002 PMCID： PMC7895495 DOI： 10.1007/s11306-020-01738-3

Source DB: PubMed Journal: Metabolomics ISSN： 1573-3882 Impact factor: 4.290

22 in total

1. Procedures for large-scale metabolic profiling of serum and plasma using gas chromatography and liquid chromatography coupled to mass spectrometry.

Authors: Warwick B Dunn; David Broadhurst; Paul Begley; Eva Zelena; Sue Francis-McIntyre; Nadine Anderson; Marie Brown; Joshau D Knowles; Antony Halsall; John N Haselden; Andrew W Nicholls; Ian D Wilson; Douglas B Kell; Royston Goodacre
Journal: Nat Protoc Date: 2011-06-30 Impact factor: 13.491

2. Detailed Investigation and Comparison of the XCMS and MZmine 2 Chromatogram Construction and Chromatographic Peak Detection Methods for Preprocessing Mass Spectrometry Metabolomics Data.

Authors: Owen D Myers; Susan J Sumner; Shuzhao Li; Stephen Barnes; Xiuxia Du
Journal: Anal Chem Date: 2017-08-17 Impact factor: 6.986

3. Metabolomic Profiling of Submaximal Exercise at a Standardised Relative Intensity in Healthy Adults.

Authors: Ali Muhsen Ali; Mia Burleigh; Evangelia Daskalaki; Tong Zhang; Chris Easton; David G Watson
Journal: Metabolites Date: 2016-02-26

4. Metabolomics Workbench: An international repository for metabolomics data and metadata, metabolite standards, protocols, tutorials and training, and analysis tools.

Authors: Manish Sud; Eoin Fahy; Dawn Cotter; Kenan Azam; Ilango Vadivelu; Charles Burant; Arthur Edison; Oliver Fiehn; Richard Higashi; K Sreekumaran Nair; Susan Sumner; Shankar Subramaniam
Journal: Nucleic Acids Res Date: 2015-10-13 Impact factor: 16.971

Review 5. Guidelines and considerations for the use of system suitability and quality control samples in mass spectrometry assays applied in untargeted clinical metabolomic studies.

Authors: David Broadhurst; Royston Goodacre; Stacey N Reinke; Julia Kuligowski; Ian D Wilson; Matthew R Lewis; Warwick B Dunn
Journal: Metabolomics Date: 2018-05-18 Impact factor: 4.290

6. Quality assessment and interference detection in targeted mass spectrometry data using machine learning.

Authors: Shadi Toghi Eshghi; Paul Auger; W Rodney Mathews
Journal: Clin Proteomics Date: 2018-10-06 Impact factor: 3.988

7. Filtering procedures for untargeted LC-MS metabolomics data.

Authors: Courtney Schiffman; Lauren Petrick; Kelsi Perttula; Yukiko Yano; Henrik Carlsson; Todd Whitehead; Catherine Metayer; Josie Hayes; Stephen Rappaport; Sandrine Dudoit
Journal: BMC Bioinformatics Date: 2019-06-14 Impact factor: 3.169

8. WiPP: Workflow for Improved Peak Picking for Gas Chromatography-Mass Spectrometry (GC-MS) Data.

Authors: Nico Borgsmüller; Yoann Gloaguen; Tobias Opialla; Eric Blanc; Emilie Sicard; Anne-Lise Royer; Bruno Le Bizec; Stéphanie Durand; Carole Migné; Mélanie Pétéra; Estelle Pujos-Guillot; Franck Giacomoni; Yann Guitton; Dieter Beule; Jennifer Kirwan
Journal: Metabolites Date: 2019-08-21

9. Lipid metabolites as potential diagnostic and prognostic biomarkers for acute community acquired pneumonia.

Authors: Kelvin K W To; Kim-Chung Lee; Samson S Y Wong; Kong-Hung Sze; Yi-Hong Ke; Yin-Ming Lui; Bone S F Tang; Iris W S Li; Susanna K P Lau; Ivan F N Hung; Chun-Yiu Law; Ching-Wan Lam; Kwok-Yung Yuen
Journal: Diagn Microbiol Infect Dis Date: 2016-03-14 Impact factor: 2.803

10. xMSanalyzer: automated pipeline for improved feature detection and downstream analysis of large-scale, non-targeted metabolomics data.

Authors: Karan Uppal; Quinlyn A Soltow; Frederick H Strobel; W Stephen Pittard; Kim M Gernert; Tianwei Yu; Dean P Jones
Journal: BMC Bioinformatics Date: 2013-01-16 Impact factor: 3.169

11 in total

1. IDSL.IPA Characterizes the Organic Chemical Space in Untargeted LC/HRMS Data Sets.

Authors: Sadjad Fakouri Baygi; Yashwant Kumar; Dinesh Kumar Barupal
Journal: J Proteome Res Date: 2022-05-17 Impact factor: 5.370

Review 2. New software tools, databases, and resources in metabolomics: updates from 2020.

Authors: Biswapriya B Misra
Journal: Metabolomics Date: 2021-05-11 Impact factor: 4.290

3. Tooth biomarkers to characterize the temporal dynamics of the fetal and early-life exposome.

Authors: Miao Yu; Peijun Tu; Georgia Dolios; Priyanthi S Dassanayake; Heather Volk; Craig Newschaffer; M Daniele Fallin; Lisa Croen; Kristen Lyall; Rebecca Schmidt; Irva Hertz-Piccioto; Christine Austin; Manish Arora; Lauren M Petrick
Journal: Environ Int Date: 2021-09-02 Impact factor: 9.621

Review 4. Mining plant metabolomes: Methods, applications, and perspectives.

Authors: Aimin Ma; Xiaoquan Qi
Journal: Plant Commun Date: 2021-09-04

5. Comprehensive Peak Characterization (CPC) in Untargeted LC-MS Analysis.

Authors: Kristian Pirttilä; David Balgoma; Johannes Rainer; Curt Pettersson; Mikael Hedeland; Carl Brunius
Journal: Metabolites Date: 2022-02-02

6. Deep Learning-Assisted Peak Curation for Large-Scale LC-MS Metabolomics.

Authors: Yoann Gloaguen; Jennifer A Kirwan; Dieter Beule
Journal: Anal Chem Date: 2022-03-15 Impact factor: 6.986

7. AI/ML-driven advances in untargeted metabolomics and exposomics for biomedical applications.

Authors: Lauren M Petrick; Noam Shomron
Journal: Cell Rep Phys Sci Date: 2022-07-20

8. Recurrent Topics in Mass Spectrometry-Based Metabolomics and Lipidomics-Standardization, Coverage, and Throughput.

Authors: Evelyn Rampler; Yasin El Abiead; Harald Schoeny; Mate Rusz; Felina Hildebrand; Veronika Fitz; Gunda Koellensperger
Journal: Anal Chem Date: 2020-11-28 Impact factor: 6.986

9. Commentary: Data Processing Thresholds for Abundance and Sparsity and Missed Biological Insights in an Untargeted Chemical Analysis of Blood Specimens for Exposomics.

Authors: Pekka Keski-Rahkonen; Oliver Robinson; Rossella Alfano; Michelle Plusquin; Augustin Scalbert
Journal: Front Public Health Date: 2022-01-17

10. Data Processing Thresholds for Abundance and Sparsity and Missed Biological Insights in an Untargeted Chemical Analysis of Blood Specimens for Exposomics.

Authors: Dinesh Kumar Barupal; Sadjad Fakouri Baygi; Robert O Wright; Manish Arora
Journal: Front Public Health Date: 2021-06-10