Literature DB >> 31439967

CONFOUNDER ADJUSTMENT IN MULTIPLE HYPOTHESIS TESTING.

Jingshu Wang1, Qingyuan Zhao1, Trevor Hastie2, Art B Owen2.   

Abstract

We consider large-scale studies in which thousands of significance tests are performed simultaneously. In some of these studies, the multiple testing procedure can be severely biased by latent confounding factors such as batch effects and unmeasured covariates that correlate with both primary variable(s) of interest (e.g., treatment variable, phenotype) and the outcome. Over the past decade, many statistical methods have been proposed to adjust for the confounders in hypothesis testing. We unify these methods in the same framework, generalize them to include multiple primary variables and multiple nuisance variables, and analyze their statistical properties. In particular, we provide theoretical guarantees for RUV-4 [Gagnon-Bartsch, Jacob and Speed (2013)] and LEAPP [Ann. Appl. Stat. 6 (2012) 1664-1688], which correspond to two different identification conditions in the framework: the first requires a set of "negative controls" that are known a priori to follow the null distribution; the second requires the true nonnulls to be sparse. Two different estimators which are based on RUV-4 and LEAPP are then applied to these two scenarios. We show that if the confounding factors are strong, the resulting estimators can be asymptotically as powerful as the oracle estimator which observes the latent confounding factors. For hypothesis testing, we show the asymptotic z-tests based on the estimators can control the type I error. Numerical experiments show that the false discovery rate is also controlled by the Benjamini-Hochberg procedure when the sample size is reasonably large.

Entities:  

Keywords:  Empirical null; Primary 62J15; batch effect; robust regression; secondary 62H25; surrogate variable analysis; unwanted variation

Year:  2017        PMID: 31439967      PMCID: PMC6706069          DOI: 10.1214/16-AOS1511

Source DB:  PubMed          Journal:  Ann Stat        ISSN: 0090-5364            Impact factor:   4.028


  23 in total

1.  Significance analysis of microarrays applied to the ionizing radiation response.

Authors:  V G Tusher; R Tibshirani; G Chu
Journal:  Proc Natl Acad Sci U S A       Date:  2001-04-17       Impact factor: 11.205

2.  Singular value decomposition for genome-wide expression data processing and modeling.

Authors:  O Alter; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  2000-08-29       Impact factor: 11.205

3.  Incipient Alzheimer's disease: microarray correlation analyses reveal major transcriptional and tumor suppressor responses.

Authors:  Eric M Blalock; James W Geddes; Kuey Chu Chen; Nada M Porter; William R Markesbery; Philip W Landfield
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-09       Impact factor: 11.205

4.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data.

Authors:  Rafael A Irizarry; Bridget Hobbs; Francois Collin; Yasmin D Beazer-Barclay; Kristen J Antonellis; Uwe Scherf; Terence P Speed
Journal:  Biostatistics       Date:  2003-04       Impact factor: 5.899

5.  Gender-specific gene expression in post-mortem human brain: localization to sex chromosomes.

Authors:  Marquis P Vawter; Simon Evans; Prabhakara Choudary; Hiroaki Tomita; Jim Meador-Woodruff; Margherita Molnar; Jun Li; Juan F Lopez; Rick Myers; David Cox; Stanley J Watson; Huda Akil; Edward G Jones; William E Bunney
Journal:  Neuropsychopharmacology       Date:  2004-02       Impact factor: 7.853

6.  Discovery of meaningful associations in genomic data using partial correlation coefficients.

Authors:  Alberto de la Fuente; Nan Bing; Ina Hoeschele; Pedro Mendes
Journal:  Bioinformatics       Date:  2004-07-29       Impact factor: 6.937

Review 7.  Bias as a threat to the validity of cancer molecular-marker research.

Authors:  David F Ransohoff
Journal:  Nat Rev Cancer       Date:  2005-02       Impact factor: 60.716

Review 8.  Integrative analysis of the cancer transcriptome.

Authors:  Daniel R Rhodes; Arul M Chinnaiyan
Journal:  Nat Genet       Date:  2005-06       Impact factor: 38.330

9.  Genomic expression programs in the response of yeast cells to environmental changes.

Authors:  A P Gasch; P T Spellman; C M Kao; O Carmel-Harel; M B Eisen; G Storz; D Botstein; P O Brown
Journal:  Mol Biol Cell       Date:  2000-12       Impact factor: 4.138

10.  Effects of atmospheric ozone on microarray data quality.

Authors:  Thomas L Fare; Ernest M Coffey; Hongyue Dai; Yudong D He; Deborah A Kessler; Kristopher A Kilian; John E Koch; Eric LeProust; Matthew J Marton; Michael R Meyer; Roland B Stoughton; George Y Tokiwa; Yanqun Wang
Journal:  Anal Chem       Date:  2003-09-01       Impact factor: 6.986

View more
  15 in total

1.  Multiply robust causal inference with double-negative control adjustment for categorical unmeasured confounding.

Authors:  Xu Shi; Wang Miao; Jennifer C Nelson; Eric J Tchetgen Tchetgen
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2020-01-22       Impact factor: 4.488

2.  ESTIMATION AND INFERENCE IN METABOLOMICS WITH NON-RANDOM MISSING DATA AND LATENT FACTORS.

Authors:  Chris McKennan; Carole Ober; Dan Nicolae
Journal:  Ann Appl Stat       Date:  2020-06-29       Impact factor: 2.083

3.  Robust high dimensional factor models with applications to statistical machine learning.

Authors:  Jianqing Fan; Kaizheng Wang; Yiqiao Zhong; Ziwei Zhu
Journal:  Stat Sci       Date:  2021-04-19       Impact factor: 2.901

4.  Negative Control Exposures: Causal Effect Identifiability and Use in Probabilistic-bias and Bayesian Analyses With Unmeasured Confounders.

Authors:  W Dana Flanders; Lance A Waller; Qi Zhang; Darios Getahun; Michael Silverberg; Michael Goodman
Journal:  Epidemiology       Date:  2022-07-27       Impact factor: 4.860

5.  DOUBLY DEBIASED LASSO: HIGH-DIMENSIONAL INFERENCE UNDER HIDDEN CONFOUNDING.

Authors:  Zijian Guo; Domagoj Ćevid; Peter Bühlmann
Journal:  Ann Stat       Date:  2022-06-16       Impact factor: 4.904

6.  Empirical Bayes shrinkage and false discovery rate estimation, allowing for unwanted variation.

Authors:  David Gerard; Matthew Stephens
Journal:  Biostatistics       Date:  2020-01-01       Impact factor: 5.899

7.  Benchmarking association analyses of continuous exposures with RNA-seq in observational studies.

Authors:  Tamar Sofer; Nuzulul Kurniansyah; François Aguet; Kristin Ardlie; Peter Durda; Deborah A Nickerson; Joshua D Smith; Yongmei Liu; Sina A Gharib; Susan Redline; Stephen S Rich; Jerome I Rotter; Kent D Taylor
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

8.  A Selective Review of Negative Control Methods in Epidemiology.

Authors:  Xu Shi; Wang Miao; Eric Tchetgen Tchetgen
Journal:  Curr Epidemiol Rep       Date:  2020-10-15

9.  Accounting for unobserved covariates with varying degrees of estimability in high-dimensional biological data.

Authors:  Chris McKennan; Dan Nicolae
Journal:  Biometrika       Date:  2019-09-16       Impact factor: 3.028

10.  A spectral theory for Wright's inbreeding coefficients and related quantities.

Authors:  Olivier François; Clément Gain
Journal:  PLoS Genet       Date:  2021-07-19       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.