Using control genes to correct for unwanted variation in microarray data.

Johann A Gagnon-Bartsch, Terence P Speed.

Abstract

Microarray expression studies suffer from the problem of batch effects and other unwanted variation. Many methods have been proposed to adjust microarray data to mitigate the problems of unwanted variation. Several of these methods rely on factor analysis to infer the unwanted variation from the data. A central problem with this approach is the difficulty in discerning the unwanted variation from the biological variation that is of interest to the researcher. We present a new method, intended for use in differential expression studies, that attempts to overcome this problem by restricting the factor analysis to negative control genes. Negative control genes are genes known a priori not to be differentially expressed with respect to the biological factor of interest. Variation in the expression levels of these genes can therefore be assumed to be unwanted variation. We name this method "Remove Unwanted Variation, 2-step" (RUV-2). We discuss various techniques for assessing the performance of an adjustment method and compare the performance of RUV-2 with that of other commonly used adjustment methods such as ComBat and Surrogate Variable Analysis (SVA). We present several example studies, each concerning genes differentially expressed with respect to gender in the brain, and find that RUV-2 performs as well as or better than other methods. Finally, we discuss the possibility of adapting RUV-2 for use in studies not concerned with differential expression and conclude that there may be promise but that substantial challenges remain.
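
The two-step idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a truncated SVD as the factor analysis and ordinary least squares as the regression step, neither of which the abstract specifies. The function name `ruv2_sketch`, its parameters, and the toy noise-free data are all hypothetical and chosen only for illustration.

```python
import numpy as np

def ruv2_sketch(Y, X, control_idx, k):
    """Illustrative two-step adjustment (assumed details, not the published code).

    Y           : (samples x genes) expression matrix
    X           : (samples x p) factor(s) of interest
    control_idx : column indices of negative control genes
    k           : assumed number of unwanted factors
    """
    # Step 1: factor analysis restricted to the negative control genes.
    # Here the factor analysis is taken to be a truncated SVD (an assumption).
    Yc = Y[:, control_idx]
    U, s, _ = np.linalg.svd(Yc, full_matrices=False)
    W_hat = U[:, :k] * s[:k]          # estimated unwanted factors, (samples x k)

    # Step 2: regress every gene on the factor of interest plus W_hat.
    design = np.column_stack([X, W_hat])
    coef, *_ = np.linalg.lstsq(design, Y, rcond=None)
    p = X.shape[1]
    return coef[:p], W_hat            # effects of X after adjustment, and W_hat

# Toy, noise-free data: the first 10 genes are negative controls (beta = 0).
rng = np.random.default_rng(0)
m, n, k = 20, 50, 2
X = np.repeat([0.0, 1.0], m // 2)[:, None]   # two groups of samples
W = rng.normal(size=(m, k))
W[:, 0] += X[:, 0]                            # unwanted factor confounded with X
beta = rng.normal(size=(1, n))
beta[:, :10] = 0.0
alpha = rng.normal(size=(k, n))
Y = X @ beta + W @ alpha

beta_hat, W_hat = ruv2_sketch(Y, X, np.arange(10), k)
print("max abs error:", np.max(np.abs(beta_hat - beta)))
```

In this idealized setting the control genes carry only unwanted variation, so the leading singular vectors of the control submatrix span the same space as the simulated unwanted factors, and the second-step regression recovers the simulated effects. With real data, noise, an unknown `k`, and imperfect controls all complicate the picture.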

Year:  2011        PMID: 22101192      PMCID: PMC3577104          DOI: 10.1093/biostatistics/kxr034

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


