Examining the practical limits of batch effect-correction algorithms: When should you care about batch effects?

Longjian Zhou¹, Andrew Chi-Hau Sue¹, Wilson Wen Bin Goh².

Abstract

Batch effects are technical sources of variation and can confound analysis. While many performance ranking exercises have been conducted to establish the best batch effect-correction algorithm (BECA), we hold the viewpoint that the notion of best is context-dependent. Moreover, alternative questions beyond the simplistic notion of "best" are also interesting: are BECAs robust against various degrees of confounding and if so, what is the limit? Using two different methods for simulating class (phenotype) and batch effects and taking various representative datasets across both genomics (RNA-Seq) and proteomics platforms, we demonstrate that under situations where sample classes and batch factors are moderately confounded, most BECAs are remarkably robust and only weakly affected by upstream normalization procedures. This observation is consistently supported across the multitude of test datasets. BECAs do have limits: When sample classes and batch factors are strongly confounded, BECA performance declines, with variable performance in precision, recall and also batch correction. We also report that while conventional normalization methods have minimal impact on batch effect correction, they do not affect downstream statistical feature selection, and in strongly confounded scenarios, may even outperform BECAs. In other words, removing batch effects is no guarantee of optimal functional analysis. Overall, this study suggests that simplistic performance ranking exercises are quite trivial, and all BECAs are compromises in some context or another.
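The effect of confounding described above can be illustrated with a small simulation. The sketch below is not the authors' simulation method and does not use any of the BECAs they evaluated. It assumes additive class and batch shifts on Gaussian noise, uses per-batch mean centering as a stand-in correction, and selects features with a Welch t-statistic threshold. All function names, parameters, and default values are hypothetical.

```python
import numpy as np


def simulate(n_per_class=20, n_features=200, n_true=20, class_shift=2.0,
             batch_shift=1.5, confounding=0.5, seed=0):
    """Simulate two classes measured in two batches (illustrative assumptions only).

    confounding is the fraction of class-0 samples placed in batch 0; class 1
    gets the complement. 0.5 is a balanced design, 1.0 is full confounding.
    """
    rng = np.random.default_rng(seed)
    labels = np.repeat([0, 1], n_per_class)
    n_in_b0 = int(round(confounding * n_per_class))
    batch_class0 = [0] * n_in_b0 + [1] * (n_per_class - n_in_b0)
    batch_class1 = [1] * n_in_b0 + [0] * (n_per_class - n_in_b0)
    batch = np.array(batch_class0 + batch_class1)

    x = rng.normal(size=(2 * n_per_class, n_features))
    # True class effect on the first n_true features only.
    x[labels == 1, :n_true] += class_shift
    # One random offset per feature, applied to every sample in batch 1.
    x[batch == 1] += rng.normal(scale=batch_shift, size=n_features)
    return x, labels, batch


def center_by_batch(x, batch):
    """Subtract each batch's per-feature mean (a simple stand-in correction)."""
    out = x.copy()
    for b in np.unique(batch):
        out[batch == b] -= out[batch == b].mean(axis=0)
    return out


def select_features(x, labels, threshold=4.0):
    """Return indices of features whose absolute Welch t-statistic exceeds threshold."""
    a, b = x[labels == 0], x[labels == 1]
    se = np.sqrt(a.var(axis=0, ddof=1) / len(a) + b.var(axis=0, ddof=1) / len(b))
    t = (b.mean(axis=0) - a.mean(axis=0)) / se
    return np.flatnonzero(np.abs(t) > threshold)


def precision_recall(selected, true_idx):
    """Precision and recall of selected features against the known true set."""
    sel = {int(i) for i in selected}
    true = {int(i) for i in true_idx}
    tp = len(sel & true)
    precision = tp / len(sel) if sel else 0.0
    recall = tp / len(true) if true else 0.0
    return precision, recall


if __name__ == "__main__":
    for c in (0.5, 0.75, 1.0):
        x, labels, batch = simulate(confounding=c)
        selected = select_features(center_by_batch(x, batch), labels)
        print(c, precision_recall(selected, range(20)))
```

Under full confounding (`confounding=1.0`), each batch contains a single class, so centering by batch removes the class difference along with the batch offset and no features are selected. This is an extreme instance of the limit the abstract describes for strongly confounded designs.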

Keywords: Batch effects; Bioinformatics; Feature selection; Normalization; Statistics

Year: 2019 PMID: 31611172 DOI: 10.1016/j.jgg.2019.08.002

Source DB: PubMed Journal: J Genet Genomics ISSN: 1673-8527 Impact factor: 4.275

Cited by

5 in total

1. Simulating ComBat: how batch correction can lead to the systematic introduction of false positive results in DNA methylation microarray studies.

Authors: Tristan Zindler; Helge Frieling; Alexandra Neyazi; Stefan Bleich; Eva Friedel
Journal: BMC Bioinformatics Date: 2020-06-30 Impact factor: 3.169

2. The role of gene to gene interaction in the breast's genomic signature of pregnancy.

3. Mathematical-based microbiome analytics for clinical translation. (Review)

4. Doppelgänger spotting in biomedical gene expression data.

5. Perspectives for better batch effect correction in mass-spectrometry-based proteomics. (Review)