Resampling-based empirical Bayes multiple testing procedures for controlling generalized tail probability and expected value error rates: focus on the false discovery rate and simulation study.

Sandrine Dudoit, Houston N Gilbert, Mark J van der Laan.

Abstract

This article proposes resampling-based empirical Bayes multiple testing procedures for controlling a broad class of Type I error rates, defined as generalized tail probability (gTP) error rates, gTP(q, g) = Pr(g(V_n, S_n) > q), and generalized expected value (gEV) error rates, gEV(g) = E[g(V_n, S_n)], for arbitrary functions g(V_n, S_n) of the numbers of false positives V_n and true positives S_n. Of particular interest are error rates based on the proportion g(V_n, S_n) = V_n / (V_n + S_n) of Type I errors among the rejected hypotheses, such as the false discovery rate (FDR), FDR = E[V_n / (V_n + S_n)]. The proposed procedures offer several advantages over existing methods. They provide Type I error control for general data generating distributions, with arbitrary dependence structures among variables. Gains in power are achieved by deriving rejection regions based on guessed sets of true null hypotheses and null test statistics randomly sampled from joint distributions that account for the dependence structure of the data. The Type I error and power properties of an FDR-controlling version of the resampling-based empirical Bayes approach are investigated and compared to those of widely-used FDR-controlling linear step-up procedures in a simulation study. The Type I error and power trade-off achieved by the empirical Bayes procedures under a variety of testing scenarios allows this approach to be competitive with or outperform the Storey and Tibshirani (2003) linear step-up procedure, as an alternative to the classical Benjamini and Hochberg (1995) procedure.
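The error-rate definitions in the abstract can be made concrete with a small Monte Carlo sketch. The code below is not the authors' resampling-based empirical Bayes procedure; it only applies the classical Benjamini and Hochberg (1995) linear step-up rule, which the abstract names as a comparator, to simulated independent one-sided z-tests, and then estimates gEV(g) and gTP(q, g) for g(V_n, S_n) = V_n / (V_n + S_n). All function names, parameter values, and the simulation design are assumptions made for illustration, as is the convention that the proportion equals 0 when nothing is rejected.

```python
import random
from statistics import NormalDist


def bh_step_up(p_values, alpha):
    """Benjamini-Hochberg linear step-up: return the set of rejected indices."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k (1-based) with p_(k) <= k * alpha / m,
    # then reject the k hypotheses with the smallest p-values.
    k = 0
    for rank, i in enumerate(order, start=1):
        if p_values[i] <= rank * alpha / m:
            k = rank
    return set(order[:k])


def fdp(v, s):
    """g(V_n, S_n) = V_n / (V_n + S_n); assumed to be 0 when nothing is rejected."""
    return v / (v + s) if v + s > 0 else 0.0


def simulate(n_rep=500, m=50, m0=40, shift=3.0, alpha=0.05, q=0.1, seed=0):
    """Monte Carlo estimates of gEV(g) (the FDR here) and gTP(q, g).

    Hypothetical design: hypotheses 0..m0-1 are true nulls with N(0, 1)
    test statistics; the remaining m - m0 have N(shift, 1) statistics.
    """
    rng = random.Random(seed)
    std_normal = NormalDist()
    g_values = []
    for _ in range(n_rep):
        z = [rng.gauss(0.0 if j < m0 else shift, 1.0) for j in range(m)]
        p = [1.0 - std_normal.cdf(zj) for zj in z]  # one-sided p-values
        rejected = bh_step_up(p, alpha)
        v = sum(1 for j in rejected if j < m0)  # false positives V_n
        s = len(rejected) - v                   # true positives S_n
        g_values.append(fdp(v, s))
    gev = sum(g_values) / n_rep                     # estimate of E[g(V_n, S_n)]
    gtp = sum(1 for g in g_values if g > q) / n_rep  # estimate of Pr(g(V_n, S_n) > q)
    return gev, gtp


if __name__ == "__main__":
    gev_hat, gtp_hat = simulate()
    print(f"estimated gEV (FDR): {gev_hat:.3f}")
    print(f"estimated gTP(q=0.1): {gtp_hat:.3f}")
```

Because the simulated tests are independent, this sketch does not exercise the arbitrary dependence structures that the proposed procedures are designed to handle; it only shows how the two error-rate families summarize the same distribution of g(V_n, S_n) in different ways.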

Year:  2008        PMID: 18932138      PMCID: PMC4130579          DOI: 10.1002/bimj.200710473

Source DB:  PubMed          Journal:  Biom J        ISSN: 0323-3847            Impact factor:   2.207


  7 in total

1.  Statistical significance for genomewide studies.

Authors:  John D Storey; Robert Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-25       Impact factor: 11.205

2.  Augmentation procedures for control of the generalized family-wise error rate and tail probabilities for the proportion of false positives.

Authors:  Mark J van der Laan; Sandrine Dudoit; Katherine S Pollard
Journal:  Stat Appl Genet Mol Biol       Date:  2004-06-15

3.  Multiple testing. Part II. Step-down procedures for control of the family-wise error rate.

Authors:  Mark J van der Laan; Sandrine Dudoit; Katherine S Pollard
Journal:  Stat Appl Genet Mol Biol       Date:  2004-06-14

4.  Empirical Bayes and resampling based multiple testing procedure controlling tail probability of the proportion of false positives.

Authors:  Mark J van der Laan; Merrill D Birkner; Alan E Hubbard
Journal:  Stat Appl Genet Mol Biol       Date:  2005-10-07

5.  Multiple testing. Part I. Single-step procedures for control of general type I error rates.

Authors:  Sandrine Dudoit; Mark J van der Laan; Katherine S Pollard
Journal:  Stat Appl Genet Mol Biol       Date:  2004-06-09

6.  Quantile-function based null distribution in resampling based multiple testing.

Authors:  Mark J van der Laan; Alan E Hubbard
Journal:  Stat Appl Genet Mol Biol       Date:  2006-05-21

7.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.

Authors:  T R Golub; D K Slonim; P Tamayo; C Huard; M Gaasenbeek; J P Mesirov; H Coller; M L Loh; J R Downing; M A Caligiuri; C D Bloomfield; E S Lander
Journal:  Science       Date:  1999-10-15       Impact factor: 47.728

  8 in total

1.  Statistical inference and reverse engineering of gene regulatory networks from observational expression data.

Authors:  Frank Emmert-Streib; Galina V Glazko; Gökmen Altay; Ricardo de Matos Simoes
Journal:  Front Genet       Date:  2012-02-03       Impact factor: 4.599

2.  Spiked Dirichlet Process Prior for Bayesian Multiple Hypothesis Testing in Random Effects Models.

Authors:  Sinae Kim; David B Dahl; Marina Vannucci
Journal:  Bayesian Anal       Date:  2009       Impact factor: 3.728

3.  Filtering, FDR and power.

Authors:  Maarten van Iterson; Judith M Boer; Renée X Menezes
Journal:  BMC Bioinformatics       Date:  2010-09-07       Impact factor: 3.169

4.  POWER-ENHANCED MULTIPLE DECISION FUNCTIONS CONTROLLING FAMILY-WISE ERROR AND FALSE DISCOVERY RATES.

Authors:  Edsel A Peña; Joshua D Habiger; Wensong Wu
Journal:  Ann Stat       Date:  2011-02       Impact factor: 4.028

5.  Statistical approaches to analyzing HIV-1 neutralizing antibody assay data.

Authors:  Xuesong Yu; Peter B Gilbert; Catarina E Hioe; Susan Zolla-Pazner; Steven G Self
Journal:  Stat Biopharm Res       Date:  2012-01-01       Impact factor: 1.452

6.  A hierarchical Bayesian approach to multiple testing in disease mapping.

Authors:  Dolores Catelan; Corrado Lagazio; Annibale Biggeri
Journal:  Biom J       Date:  2010-12       Impact factor: 2.207

7.  Identification of differentially expressed genes regulated by transcription factors in glioblastomas by bioinformatics analysis.

Authors:  Bo Wei; Le Wang; Chao Du; Guozhang Hu; Lina Wang; Ying Jin; Daliang Kong
Journal:  Mol Med Rep       Date:  2014-12-15       Impact factor: 2.952

8.  Multiplex gene regulation by CRISPR-ddCpf1.

Authors:  Xiaochun Zhang; Jingman Wang; Qiuxiang Cheng; Xuan Zheng; Guoping Zhao; Jin Wang
Journal:  Cell Discov       Date:  2017-06-06       Impact factor: 10.849
