
Tackling the widespread and critical impact of batch effects in high-throughput data.

Jeffrey T Leek, Robert B Scharpf, Héctor Corrada Bravo, David Simcha, Benjamin Langmead, W Evan Johnson, Donald Geman, Keith Baggerly, Rafael A Irizarry.

Abstract

High-throughput technologies are widely used, for example to assay genetic variants, gene and protein expression, and epigenetic modifications. One often overlooked complication with such studies is batch effects, which occur because measurements are affected by laboratory conditions, reagent lots and personnel differences. This becomes a major problem when batch effects are correlated with an outcome of interest and lead to incorrect conclusions. Using both published studies and our own analyses, we argue that batch effects (as well as other technical and biological artefacts) are widespread and critical to address. We review experimental and computational approaches for doing so.
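The abstract's central point, that a batch effect correlated with the outcome can produce an incorrect conclusion, can be illustrated with a small simulation. This is a minimal sketch and not taken from the paper: the function name `simulate`, the shift size `batch_shift` and the sample size are all hypothetical choices. It generates one feature with no true association with the outcome and compares the case-minus-control difference under a confounded design and a randomized one.

```python
import random
import statistics


def simulate(confounded, n=2000, batch_shift=2.0, seed=0):
    """Case-minus-control mean difference for a single feature that has
    NO true association with the outcome (illustrative assumption)."""
    rng = random.Random(seed)
    cases, controls = [], []
    for i in range(n):
        is_case = i % 2 == 0
        if confounded:
            # Every case is processed in batch 1, every control in batch 0.
            batch = 1 if is_case else 0
        else:
            # Batch is assigned at random, independently of the outcome.
            batch = rng.randint(0, 1)
        # Measurement = pure noise + a technical shift that depends on batch.
        value = rng.gauss(0.0, 1.0) + batch_shift * batch
        (cases if is_case else controls).append(value)
    return statistics.mean(cases) - statistics.mean(controls)


confounded_diff = simulate(confounded=True)
balanced_diff = simulate(confounded=False)
print(f"confounded design: {confounded_diff:.2f}")
print(f"randomized design: {balanced_diff:.2f}")
```

In the confounded design the group difference is close to the batch shift even though the feature carries no biological signal, whereas random batch assignment leaves a difference near zero. The randomized design does not remove the batch effect; it only keeps it from being mistaken for an outcome effect.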

Year: 2010; PMID: 20838408; PMCID: PMC3880143; DOI: 10.1038/nrg2825

Source DB: PubMed; Journal: Nat Rev Genet; ISSN: 1471-0056; Impact factor: 53.242


  771 in total

1.  Adding the Team into T1 Translational Research: A Case Study of Multidisciplinary Team Science in the Evaluation of Biomarkers of Prostate Cancer Risk and Prognosis.

Authors:  Michael T Marrone; Corinne E Joshu; Sarah B Peskoe; Angelo M De Marzo; Christopher M Heaphy; Shawn E Lupold; Alan K Meeker; Elizabeth A Platz
Journal:  Clin Chem       Date:  2018-12-05       Impact factor: 8.327

Review 2.  Application of metabolomics to prostate cancer.

Authors:  Bruce J Trock
Journal:  Urol Oncol       Date:  2011 Sep-Oct       Impact factor: 3.498

Review 3.  Statistical approaches for the analysis of DNA methylation microarray data.

Authors:  Kimberly D Siegmund
Journal:  Hum Genet       Date:  2011-04-26       Impact factor: 4.132

4.  STrengthening the reporting of OBservational studies in Epidemiology-Molecular Epidemiology (STROBE-ME): an extension of the STROBE statement.

Authors:  Valentina Gallo; Matthias Egger; Valerie McCormack; Peter B Farmer; John P A Ioannidis; Micheline Kirsch-Volders; Giuseppe Matullo; David H Phillips; Bernadette Schoket; Ulf Stromberg; Roel Vermeulen; Christopher Wild; Miquel Porta; Paolo Vineis
Journal:  Eur J Epidemiol       Date:  2011-10-29       Impact factor: 8.082

5.  Using control genes to correct for unwanted variation in microarray data.

Authors:  Johann A Gagnon-Bartsch; Terence P Speed
Journal:  Biostatistics       Date:  2011-11-17       Impact factor: 5.899

6.  The sva package for removing batch effects and other unwanted variation in high-throughput experiments.

Authors:  Jeffrey T Leek; W Evan Johnson; Hilary S Parker; Andrew E Jaffe; John D Storey
Journal:  Bioinformatics       Date:  2012-01-17       Impact factor: 6.937

Review 7.  A critical analysis of cancer biobank practices in relation to biospecimen quality.

Authors:  Amanda Rush; Kevin Spring; Jennifer A Byrne
Journal:  Biophys Rev       Date:  2015-10-22

Review 8.  The role of replicates for error mitigation in next-generation sequencing.

Authors:  Kimberly Robasky; Nathan E Lewis; George M Church
Journal:  Nat Rev Genet       Date:  2013-12-10       Impact factor: 53.242

Review 9.  Epigenetics and development of food allergy (FA) in early childhood.

Authors:  Xiumei Hong; Xiaobin Wang
Journal:  Curr Allergy Asthma Rep       Date:  2014-09       Impact factor: 4.806

Review 10.  Considerations when processing and interpreting genomics data of the placenta.

Authors:  Chaini Konwar; Giulia Del Gobbo; Victor Yuan; Wendy P Robinson
Journal:  Placenta       Date:  2019-01-07       Impact factor: 3.481
