Literature DB >> 30926040

WaveICA: A novel algorithm to remove batch effects for large-scale untargeted metabolomics data based on wavelet analysis.

Kui Deng1, Fan Zhang2, Qilong Tan1, Yue Huang1, Wei Song1, Zhiwei Rong1, Zheng-Jiang Zhu3, Kang Li4, Zhenzi Li5.   

Abstract

Metabolomics provides new insights into disease pathogenesis and biomarker discovery. Samples from large-scale untargeted metabolomics studies are typically analyzed using a liquid chromatography-mass spectrometry platform in several batches. Batch effects that are caused by non-biological systematic biases are unavoidable in large-scale metabolomics studies, even with properly designed experiments. The statistical analysis of large-scale metabolomics data without managing batch effects will yield misleading results. In this study, we propose a novel algorithm, called WaveICA, which is based on the wavelet transform method with independent component analysis, as the threshold processing method to capture and remove batch effects for large-scale metabolomics data. The WaveICA method uses the time trend of samples over the injection order, decomposes the original data into multi-scale data with different features, extracts and removes the batch effect information in multi-scale data, and obtains clean data. The WaveICA method was tested on real metabolomics data. After applying the WaveICA method, scattered quality control samples (QCS) and subject samples in a PCA score plot of the original data were closely clustered, respectively. The average Pearson correlation coefficients for all peaks of the QCS increased from 0.872 to 0.972. Additionally, WaveICA significantly improved the classification accuracy for metabolomics data. The method was compared with three representative methods, and outperformed all of them. To conclude, WaveICA can efficiently remove batch effects while revealing more biological information. This method can be used in large-scale untargeted metabolomics studies to preprocess raw metabolomics data.
Copyright © 2019 Elsevier B.V. All rights reserved.

Keywords:  Batch effect; Data normalization; Independent component analysis; Metabolomics; Wavelet transform

Mesh:

Year:  2019        PMID: 30926040     DOI: 10.1016/j.aca.2019.02.010

Source DB:  PubMed          Journal:  Anal Chim Acta        ISSN: 0003-2670            Impact factor:   6.558


  10 in total

1.  Child serum metabolome and traffic-related air pollution exposure in pregnancy.

Authors:  Beate Ritz; Qi Yan; Di He; Jun Wu; Douglas I Walker; Karan Uppal; Dean P Jones; Julia E Heck
Journal:  Environ Res       Date:  2021-08-19       Impact factor: 6.498

2.  TIGER: technical variation elimination for metabolomics data using ensemble learning architecture.

Authors:  Siyu Han; Jialing Huang; Francesco Foppiano; Cornelia Prehn; Jerzy Adamski; Karsten Suhre; Ying Li; Giuseppe Matullo; Freimut Schliess; Christian Gieger; Annette Peters; Rui Wang-Sattler
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

3.  Data Processing and Analysis in Mass Spectrometry-Based Metabolomics.

Authors:  Ángela Peralbo-Molina; Pol Solà-Santos; Alexandre Perera-Lluna; Eduardo Chicano-Gálvez
Journal:  Methods Mol Biol       Date:  2023

Review 4.  Optimization of metabolomic data processing using NOREVA.

Authors:  Jianbo Fu; Ying Zhang; Yunxia Wang; Hongning Zhang; Jin Liu; Jing Tang; Qingxia Yang; Huaicheng Sun; Wenqi Qiu; Yinghui Ma; Zhaorong Li; Mingyue Zheng; Feng Zhu
Journal:  Nat Protoc       Date:  2021-12-24       Impact factor: 13.491

5.  High-Resolution Metabolomic Assessment of Pesticide Exposure in Central Valley, California.

Authors:  Qi Yan; Kimberly C Paul; Douglas I Walker; Melissa A Furlong; Irish Del Rosario; Yu Yu; Keren Zhang; Myles G Cockburn; Dean P Jones; Beate R Ritz
Journal:  Chem Res Toxicol       Date:  2021-04-29       Impact factor: 3.973

6.  DBnorm as an R package for the comparison and selection of appropriate statistical methods for batch effect correction in metabolomic studies.

Authors:  Nasim Bararpour; Federica Gilardi; Cristian Carmeli; Jonathan Sidibe; Julijana Ivanisevic; Tiziana Caputo; Marc Augsburger; Silke Grabherr; Béatrice Desvergne; Nicolas Guex; Murielle Bochud; Aurelien Thomas
Journal:  Sci Rep       Date:  2021-03-11       Impact factor: 4.379

7.  Application of Lipidomics for Assessing Tissue Lipid Profiles of Patients With Squamous Cell Carcinoma.

Authors:  Weibiao Zeng; Wen Zheng; Sheng Hu; Jianyong Zhang; Wenxiong Zhang; Jianjun Xu; Dongliang Yu; Jinhua Peng; Lu Zhang; Meng Gong; Yiping Wei
Journal:  Technol Cancer Res Treat       Date:  2021 Jan-Dec

8.  A novel meta-analysis based on data augmentation and elastic data shared lasso regularization for gene expression.

Authors:  Hai-Hui Huang; Hao Rao; Rui Miao; Yong Liang
Journal:  BMC Bioinformatics       Date:  2022-08-23       Impact factor: 3.307

9.  MetaboAnalystR 3.0: Toward an Optimized Workflow for Global Metabolomics.

Authors:  Zhiqiang Pang; Jasmine Chong; Shuzhao Li; Jianguo Xia
Journal:  Metabolites       Date:  2020-05-07

10.  Addressing the batch effect issue for LC/MS metabolomics data in data preprocessing.

Authors:  Qin Liu; Douglas Walker; Karan Uppal; Zihe Liu; Chunyu Ma; ViLinh Tran; Shuzhao Li; Dean P Jones; Tianwei Yu
Journal:  Sci Rep       Date:  2020-08-17       Impact factor: 4.379

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.