Literature DB >> 20810919

Correction for hidden confounders in the genetic analysis of gene expression.

Jennifer Listgarten1, Carl Kadie, Eric E Schadt, David Heckerman.   

Abstract

Understanding the genetic underpinnings of disease is important for screening, treatment, drug development, and basic biological insight. One way of getting at such an understanding is to find out which parts of our DNA, such as single-nucleotide polymorphisms, affect particular intermediary processes such as gene expression. Naively, such associations can be identified using a simple statistical test on all paired combinations of genetic variants and gene transcripts. However, a wide variety of confounders lie hidden in the data, leading to both spurious associations and missed associations if not properly addressed. We present a statistical model that jointly corrects for two particular kinds of hidden structure--population structure (e.g., race, family-relatedness), and microarray expression artifacts (e.g., batch effects), when these confounders are unknown. Applying our method to both real and synthetic, human and mouse data, we demonstrate the need for such a joint correction of confounders, and also the disadvantages of other possible approaches based on those in the current literature. In particular, we show that our class of models has maximum power to detect eQTL on synthetic data, and has the best performance on a bronze standard applied to real data. Lastly, our software and the associations we found with it are available at http://www.microsoft.com/science.

Entities:  

Mesh:

Year:  2010        PMID: 20810919      PMCID: PMC2944732          DOI: 10.1073/pnas.1002425107

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  30 in total

1.  Association mapping in structured populations.

Authors:  J K Pritchard; M Stephens; N A Rosenberg; P Donnelly
Journal:  Am J Hum Genet       Date:  2000-05-26       Impact factor: 11.025

2.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

3.  Statistical significance for genomewide studies.

Authors:  John D Storey; Robert Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-25       Impact factor: 11.205

4.  A unified mixed-model method for association mapping that accounts for multiple levels of relatedness.

Authors:  Jianming Yu; Gael Pressoir; William H Briggs; Irie Vroh Bi; Masanori Yamasaki; John F Doebley; Michael D McMullen; Brandon S Gaut; Dahlia M Nielsen; James B Holland; Stephen Kresovich; Edward S Buckler
Journal:  Nat Genet       Date:  2005-12-25       Impact factor: 38.330

5.  An integrative genomics approach to infer causal associations between gene expression and disease.

Authors:  Eric E Schadt; John Lamb; Xia Yang; Jun Zhu; Steve Edwards; Debraj Guhathakurta; Solveig K Sieberts; Stephanie Monks; Marc Reitman; Chunsheng Zhang; Pek Yee Lum; Amy Leonardson; Rolf Thieringer; Joseph M Metzger; Liming Yang; John Castle; Haoyuan Zhu; Shera F Kash; Thomas A Drake; Alan Sachs; Aldons J Lusis
Journal:  Nat Genet       Date:  2005-06-19       Impact factor: 38.330

6.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

7.  Assessing the prospects of genome-wide association studies performed in inbred mice.

Authors:  Wan-Lin Su; Solveig K Sieberts; Robert R Kleinhanz; Karine Lux; Joshua Millstein; Cliona Molony; Eric E Schadt
Journal:  Mamm Genome       Date:  2010-02-05       Impact factor: 2.957

8.  Genetics of gene expression surveyed in maize, mouse and man.

Authors:  Eric E Schadt; Stephanie A Monks; Thomas A Drake; Aldons J Lusis; Nam Che; Veronica Colinayo; Thomas G Ruff; Stephen B Milligan; John R Lamb; Guy Cavet; Peter S Linsley; Mao Mao; Roland B Stoughton; Stephen H Friend
Journal:  Nature       Date:  2003-03-20       Impact factor: 49.962

9.  Genomewide association analysis in diverse inbred mice: power and population structure.

Authors:  Phillip McClurg; Jeff Janes; Chunlei Wu; David L Delano; John R Walker; Serge Batalov; Joseph S Takahashi; Kazuhiro Shimomura; Akira Kohsaka; Joseph Bass; Tim Wiltshire; Andrew I Su
Journal:  Genetics       Date:  2007-04-03       Impact factor: 4.562

Review 10.  A tutorial on statistical methods for population association studies.

Authors:  David J Balding
Journal:  Nat Rev Genet       Date:  2006-10       Impact factor: 53.242

View more
  68 in total

1.  Using control genes to correct for unwanted variation in microarray data.

Authors:  Johann A Gagnon-Bartsch; Terence P Speed
Journal:  Biostatistics       Date:  2011-11-17       Impact factor: 5.899

2.  Matrix eQTL: ultra fast eQTL analysis via large matrix operations.

Authors:  Andrey A Shabalin
Journal:  Bioinformatics       Date:  2012-04-06       Impact factor: 6.937

Review 3.  Computational tools for discovery and interpretation of expression quantitative trait loci.

Authors:  Fred A Wright; Andrey A Shabalin; Ivan Rusyn
Journal:  Pharmacogenomics       Date:  2012-02       Impact factor: 2.533

4.  The Dissection of Expression Quantitative Trait Locus Hotspots.

Authors:  Jianan Tian; Mark P Keller; Aimee Teo Broman; Christina Kendziorski; Brian S Yandell; Alan D Attie; Karl W Broman
Journal:  Genetics       Date:  2016-02-02       Impact factor: 4.562

5.  Normalization of RNA-seq data using factor analysis of control genes or samples.

Authors:  Davide Risso; John Ngai; Terence P Speed; Sandrine Dudoit
Journal:  Nat Biotechnol       Date:  2014-08-24       Impact factor: 54.908

6.  HEFT: eQTL analysis of many thousands of expressed genes while simultaneously controlling for hidden factors.

Authors:  Chuan Gao; Nicole L Tignor; Jacqueline Salit; Yael Strulovici-Barel; Neil R Hackett; Ronald G Crystal; Jason G Mezey
Journal:  Bioinformatics       Date:  2013-12-04       Impact factor: 6.937

Review 7.  A systems view of genetics in chronic kidney disease.

Authors:  Benjamin J Keller; Sebastian Martini; John R Sedor; Matthias Kretzler
Journal:  Kidney Int       Date:  2011-10-19       Impact factor: 10.612

8.  Identification of the Bile Acid Transporter Slco1a6 as a Candidate Gene That Broadly Affects Gene Expression in Mouse Pancreatic Islets.

Authors:  Jianan Tian; Mark P Keller; Angie T Oler; Mary E Rabaglia; Kathryn L Schueler; Donald S Stapleton; Aimee Teo Broman; Wen Zhao; Christina Kendziorski; Brian S Yandell; Bruno Hagenbuch; Karl W Broman; Alan D Attie
Journal:  Genetics       Date:  2015-09-18       Impact factor: 4.562

9.  PERSONALIZED MEDICINE: FROM GENOTYPES AND MOLECULAR PHENOTYPES TOWARDS COMPUTED THERAPY.

Authors:  Oliver Stegle; Steven E Brenner; Quaid Morris; Jennifer Listgarten
Journal:  Pac Symp Biocomput       Date:  2013

10.  Correcting gene expression data when neither the unwanted variation nor the factor of interest are observed.

Authors:  Laurent Jacob; Johann A Gagnon-Bartsch; Terence P Speed
Journal:  Biostatistics       Date:  2015-08-17       Impact factor: 5.899

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.