Literature DB >> 30387496

A non-randomized procedure for large-scale heterogeneous multiple discrete testing based on randomized tests.

Xiaoyu Dai1, Nan Lin1,2, Daofeng Li3, Ting Wang3.   

Abstract

In the analysis of next-generation sequencing technology, massive discrete data are generated from short read counts with varying biological coverage. Conducting conditional hypothesis testing such as Fisher's Exact Test at every genomic region of interest thus leads to a heterogeneous multiple discrete testing problem. However, most existing multiple testing procedures for controlling the false discovery rate (FDR) assume that test statistics are continuous and become conservative for discrete tests. To overcome the conservativeness, in this article, we propose a novel multiple testing procedure for better FDR control on heterogeneous discrete tests. Our procedure makes decisions based on the marginal critical function (MCF) of randomized tests, which enables achieving a powerful and non-randomized multiple testing procedure. We provide upper bounds of the positive FDR (pFDR) and the positive false non-discovery rate (pFNR) corresponding to our procedure. We also prove that the set of detections made by our method contains every detection made by a naive application of the widely-used q-value method. We further demonstrate the improvement of our method over other existing multiple testing procedures by simulations and a real example of differentially methylated region (DMR) detection using whole-genome bisulfite sequencing (WGBS) data.
© 2019 Wiley Periodicals, Inc.

Entities:  

Keywords:  Discrete P-value; differentially methylated regions; marginal critical function; multiple testing; randomized test; whole-genome bisulfite sequencing

Mesh:

Substances:

Year:  2019        PMID: 30387496      PMCID: PMC6565503          DOI: 10.1111/biom.12996

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  16 in total

1.  False Discovery Rate Control With Groups.

Authors:  James X Hu; Hongyu Zhao; Harrison H Zhou
Journal:  J Am Stat Assoc       Date:  2010-09-01       Impact factor: 5.033

2.  Improving false discovery rate estimation.

Authors:  Stan Pounds; Cheng Cheng
Journal:  Bioinformatics       Date:  2004-02-26       Impact factor: 6.937

3.  False discovery rate estimation for large-scale homogeneous discrete p-values.

Authors:  Kun Liang
Journal:  Biometrics       Date:  2015-10-22       Impact factor: 2.571

4.  Robust estimation of the false discovery rate.

Authors:  Stan Pounds; Cheng Cheng
Journal:  Bioinformatics       Date:  2006-06-15       Impact factor: 6.937

5.  Exploring the information in p-values for the analysis and planning of multiple-test experiments.

Authors:  David Ruppert; Dan Nettleton; J T Gene Hwang
Journal:  Biometrics       Date:  2007-06       Impact factor: 2.571

6.  Extension of the Neyman-Pearson theory of tests to discontinuous variates.

Authors:  K D TOCHER
Journal:  Biometrika       Date:  1950-06       Impact factor: 2.445

7.  Detection of significantly differentially methylated regions in targeted bisulfite sequencing data.

Authors:  Katja Hebestreit; Martin Dugas; Hans-Ulrich Klein
Journal:  Bioinformatics       Date:  2013-05-08       Impact factor: 6.937

8.  A modified Bonferroni method for discrete data.

Authors:  R E Tarone
Journal:  Biometrics       Date:  1990-06       Impact factor: 2.571

9.  Dnmt3a is essential for hematopoietic stem cell differentiation.

Authors:  Grant A Challen; Deqiang Sun; Mira Jeong; Min Luo; Jaroslav Jelinek; Jonathan S Berg; Christoph Bock; Aparna Vasanthakumar; Hongcang Gu; Yuanxin Xi; Shoudan Liang; Yue Lu; Gretchen J Darlington; Alexander Meissner; Jean-Pierre J Issa; Lucy A Godley; Wei Li; Margaret A Goodell
Journal:  Nat Genet       Date:  2011-12-04       Impact factor: 38.330

10.  Genome-scale DNA methylation mapping of clinical samples at single-nucleotide resolution.

Authors:  Hongcang Gu; Christoph Bock; Tarjei S Mikkelsen; Natalie Jäger; Zachary D Smith; Eleni Tomazou; Andreas Gnirke; Eric S Lander; Alexander Meissner
Journal:  Nat Methods       Date:  2010-01-10       Impact factor: 28.547

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.