Literature DB >> 24687561

Covariance adjustment for batch effect in gene expression data.

Jung Ae Lee1, Kevin K Dobbin, Jeongyoun Ahn.   

Abstract

Batch bias has been found in many microarray gene expression studies that involve multiple batches of samples. A serious batch effect can alter not only the distribution of individual genes but also the inter-gene relationships. Even though some efforts have been made to remove such bias, there has been relatively less development on a multivariate approach, mainly because of the analytical difficulty due to the high-dimensional nature of gene expression data. We propose a multivariate batch adjustment method that effectively eliminates inter-gene batch effects. The proposed method utilizes high-dimensional sparse covariance estimation based on a factor model and a hard thresholding. Another important aspect of the proposed method is that if it is known that one of the batches is produced in a superior condition, the other batches can be adjusted so that they resemble the target batch. We study high-dimensional asymptotic properties of the proposed estimator and compare the performance of the proposed method with some popular existing methods with simulated data and gene expression data sets.
Copyright © 2014 John Wiley & Sons, Ltd.

Entities:  

Keywords:  batch effect; factor model; gene expression; high-dimensional covariance estimation

Mesh:

Year:  2014        PMID: 24687561      PMCID: PMC4065794          DOI: 10.1002/sim.6157

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  24 in total

1.  Efficient quadratic regularization for expression arrays.

Authors:  Trevor Hastie; Robert Tibshirani
Journal:  Biostatistics       Date:  2004-07       Impact factor: 5.899

2.  Using control genes to correct for unwanted variation in microarray data.

Authors:  Johann A Gagnon-Bartsch; Terence P Speed
Journal:  Biostatistics       Date:  2011-11-17       Impact factor: 5.899

3.  Regularized linear discriminant analysis and its application in microarrays.

Authors:  Yaqian Guo; Trevor Hastie; Robert Tibshirani
Journal:  Biostatistics       Date:  2006-04-07       Impact factor: 5.899

4.  Thousands of samples are needed to generate a robust gene list for predicting outcome in cancer.

Authors:  Liat Ein-Dor; Or Zuk; Eytan Domany
Journal:  Proc Natl Acad Sci U S A       Date:  2006-04-03       Impact factor: 11.205

5.  The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements.

Authors:  Leming Shi; Laura H Reid; Wendell D Jones; Richard Shippy; Janet A Warrington; Shawn C Baker; Patrick J Collins; Francoise de Longueville; Ernest S Kawasaki; Kathleen Y Lee; Yuling Luo; Yongming Andrew Sun; James C Willey; Robert A Setterquist; Gavin M Fischer; Weida Tong; Yvonne P Dragan; David J Dix; Felix W Frueh; Frederico M Goodsaid; Damir Herman; Roderick V Jensen; Charles D Johnson; Edward K Lobenhofer; Raj K Puri; Uwe Schrf; Jean Thierry-Mieg; Charles Wang; Mike Wilson; Paul K Wolber; Lu Zhang; Shashi Amur; Wenjun Bao; Catalin C Barbacioru; Anne Bergstrom Lucas; Vincent Bertholet; Cecilie Boysen; Bud Bromley; Donna Brown; Alan Brunner; Roger Canales; Xiaoxi Megan Cao; Thomas A Cebula; James J Chen; Jing Cheng; Tzu-Ming Chu; Eugene Chudin; John Corson; J Christopher Corton; Lisa J Croner; Christopher Davies; Timothy S Davison; Glenda Delenstarr; Xutao Deng; David Dorris; Aron C Eklund; Xiao-hui Fan; Hong Fang; Stephanie Fulmer-Smentek; James C Fuscoe; Kathryn Gallagher; Weigong Ge; Lei Guo; Xu Guo; Janet Hager; Paul K Haje; Jing Han; Tao Han; Heather C Harbottle; Stephen C Harris; Eli Hatchwell; Craig A Hauser; Susan Hester; Huixiao Hong; Patrick Hurban; Scott A Jackson; Hanlee Ji; Charles R Knight; Winston P Kuo; J Eugene LeClerc; Shawn Levy; Quan-Zhen Li; Chunmei Liu; Ying Liu; Michael J Lombardi; Yunqing Ma; Scott R Magnuson; Botoul Maqsodi; Tim McDaniel; Nan Mei; Ola Myklebost; Baitang Ning; Natalia Novoradovskaya; Michael S Orr; Terry W Osborn; Adam Papallo; Tucker A Patterson; Roger G Perkins; Elizabeth H Peters; Ron Peterson; Kenneth L Philips; P Scott Pine; Lajos Pusztai; Feng Qian; Hongzu Ren; Mitch Rosen; Barry A Rosenzweig; Raymond R Samaha; Mark Schena; Gary P Schroth; Svetlana Shchegrova; Dave D Smith; Frank Staedtler; Zhenqiang Su; Hongmei Sun; Zoltan Szallasi; Zivana Tezak; Danielle Thierry-Mieg; Karol L Thompson; Irina Tikhonova; Yaron Turpaz; Beena Vallanat; Christophe Van; Stephen J Walker; Sue Jane Wang; Yonghong Wang; Russ Wolfinger; Alex Wong; Jie Wu; Chunlin Xiao; Qian Xie; Jun Xu; Wen Yang; Liang Zhang; Sheng Zhong; Yaping Zong; William Slikker
Journal:  Nat Biotechnol       Date:  2006-09       Impact factor: 54.908

6.  Frozen robust multiarray analysis (fRMA).

Authors:  Matthew N McCall; Benjamin M Bolstad; Rafael A Irizarry
Journal:  Biostatistics       Date:  2010-01-22       Impact factor: 5.899

7.  The practical effect of batch on genomic prediction.

Authors:  Hilary S Parker; Jeffrey T Leek
Journal:  Stat Appl Genet Mol Biol       Date:  2012

Review 8.  Tackling the widespread and critical impact of batch effects in high-throughput data.

Authors:  Jeffrey T Leek; Robert B Scharpf; Héctor Corrada Bravo; David Simcha; Benjamin Langmead; W Evan Johnson; Donald Geman; Keith Baggerly; Rafael A Irizarry
Journal:  Nat Rev Genet       Date:  2010-09-14       Impact factor: 53.242

9.  Can survival prediction be improved by merging gene expression data sets?

Authors:  Haleh Yasrebi; Peter Sperisen; Viviane Praz; Philipp Bucher
Journal:  PLoS One       Date:  2009-10-23       Impact factor: 3.240

10.  A comparison of batch effect removal methods for enhancement of prediction performance using MAQC-II microarray gene expression data.

Authors:  J Luo; M Schumacher; A Scherer; D Sanoudou; D Megherbi; T Davison; T Shi; W Tong; L Shi; H Hong; C Zhao; F Elloumi; W Shi; R Thomas; S Lin; G Tillinghast; G Liu; Y Zhou; D Herman; Y Li; Y Deng; H Fang; P Bushel; M Woods; J Zhang
Journal:  Pharmacogenomics J       Date:  2010-08       Impact factor: 3.550

View more
  6 in total

Review 1.  Statistical Issues in the Design and Analysis of nCounter Projects.

Authors:  Sin-Ho Jung; Insuk Sohn
Journal:  Cancer Inform       Date:  2014-12-14

2.  Combining location-and-scale batch effect adjustment with data cleaning by latent factor adjustment.

Authors:  Roman Hornung; Anne-Laure Boulesteix; David Causeur
Journal:  BMC Bioinformatics       Date:  2016-01-12       Impact factor: 3.169

3.  Correcting nuisance variation using Wasserstein distance.

Authors:  Gil Tabak; Minjie Fan; Samuel Yang; Stephan Hoyer; Geoffrey Davis
Journal:  PeerJ       Date:  2020-02-28       Impact factor: 2.984

4.  Stability of Imbalanced Triangles in Gene Regulatory Networks of Cancerous and Normal Cells.

Authors:  Abbas Karimi Rizi; Mina Zamani; Amirhossein Shirazi; G Reza Jafari; János Kertész
Journal:  Front Physiol       Date:  2021-01-20       Impact factor: 4.566

5.  A Novel Statistical Method to Diagnose, Quantify and Correct Batch Effects in Genomic Studies.

Authors:  Gift Nyamundanda; Pawan Poudel; Yatish Patil; Anguraj Sadanandam
Journal:  Sci Rep       Date:  2017-09-07       Impact factor: 4.379

6.  Comparative study of joint analysis of microarray gene expression data in survival prediction and risk assessment of breast cancer patients.

Authors:  Haleh Yasrebi
Journal:  Brief Bioinform       Date:  2015-10-26       Impact factor: 11.622

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.