Literature DB >> 17049030

A method to increase the power of multiple testing procedures through sample splitting.

Daniel Rubin1, Sandrine Dudoit, Mark van der Laan.   

Abstract

Consider the standard multiple testing problem where many hypotheses are to be tested, each hypothesis is associated with a test statistic, and large test statistics provide evidence against the null hypotheses. One proposal to provide probabilistic control of Type-I errors is the use of procedures ensuring that the expected number of false positives does not exceed a user-supplied threshold. Among such multiple testing procedures, we derive the most powerful method, meaning the test statistic cutoffs that maximize the expected number of true positives. Unfortunately, these optimal cutoffs depend on the true unknown data generating distribution, so could never be used in a practical setting. We instead consider splitting the sample so that the optimal cutoffs are estimated from a portion of the data, and then testing on the remaining data using these estimated cutoffs. When the null distributions for all test statistics are the same, the obvious way to control the expected number of false positives would be to use a common cutoff for all tests. In this work, we consider the common cutoff method as a benchmark multiple testing procedure. We show that in certain circumstances the use of estimated optimal cutoffs via sample splitting can dramatically outperform this benchmark method, resulting in increased true discoveries, while retaining Type-I error control. This paper is an updated version of the work presented in Rubin et al. (2005), later expanded upon by Wasserman and Roeder (2006).

Mesh:

Year:  2006        PMID: 17049030     DOI: 10.2202/1544-6115.1148

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  15 in total

1.  Independent filtering increases detection power for high-throughput experiments.

Authors:  Richard Bourgon; Robert Gentleman; Wolfgang Huber
Journal:  Proc Natl Acad Sci U S A       Date:  2010-05-11       Impact factor: 11.205

2.  Increasing power in association studies by using linkage disequilibrium structure and molecular function as prior information.

Authors:  Eleazar Eskin
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

3.  Weighted multiple hypothesis testing procedures.

Authors:  Guolian Kang; Keying Ye; Nianjun Liu; David B Allison; Guimin Gao
Journal:  Stat Appl Genet Mol Biol       Date:  2009-04-16

4.  Compound p-value statistics for multiple testing procedures.

Authors:  Joshua D Habiger; Edsel A Peña
Journal:  J Multivar Anal       Date:  2014-04-01       Impact factor: 1.473

5.  Genome-Wide Significance Levels and Weighted Hypothesis Testing.

Authors:  Kathryn Roeder; Larry Wasserman
Journal:  Stat Sci       Date:  2009-11       Impact factor: 2.901

6.  POWER-ENHANCED MULTIPLE DECISION FUNCTIONS CONTROLLING FAMILY-WISE ERROR AND FALSE DISCOVERY RATES.

Authors:  Edsel A Peña; Joshua D Habiger; Wensong Wu
Journal:  Ann Stat       Date:  2011-02       Impact factor: 4.028

7.  Weighted mining of massive collections of [Formula: see text]-values by convex optimization.

Authors:  Edgar Dobriban
Journal:  Inf inference       Date:  2017-12-08

8.  Optimal multiple testing under a Gaussian prior on the effect sizes.

Authors:  Edgar Dobriban; Kristen Fortney; Stuart K Kim; Art B Owen
Journal:  Biometrika       Date:  2015-11-04       Impact factor: 2.445

9.  Weighted multiple testing procedures for genomic studies.

Authors:  Jiang Gui; Tor D Tosteson; Mark Borsuk
Journal:  BioData Min       Date:  2012-06-07       Impact factor: 2.522

10.  Power and stability properties of resampling-based multiple testing procedures with applications to gene oncology studies.

Authors:  Dongmei Li; Timothy D Dye
Journal:  Comput Math Methods Med       Date:  2013-11-20       Impact factor: 2.238

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.