Literature DB >> 31662591

Bayesian Hidden Markov Models for Dependent Large-Scale Multiple Testing.

Xia Wang1, Ali Shojaie2, Jian Zou3.   

Abstract

An optimal and flexible multiple hypotheses testing procedure is constructed for dependent data based on Bayesian techniques, aiming at handling two challenges, namely dependence structure and non-null distribution specification. Ignoring dependence among hypotheses tests may lead to loss of efficiency and bias in decision. Misspecification in the non-null distribution, on the other hand, can result in both false positive and false negative errors. Hidden Markov models are used to accommodate the dependence structure among the tests. Dirichlet mixture process prior is applied on the non-null distribution to overcome the potential pitfalls in distribution misspecification. The testing algorithm based on Bayesian techniques optimizes the false negative rate (FNR) while controlling the false discovery rate (FDR). The procedure is applied to pointwise and clusterwise analysis. Its performance is compared with existing approaches using both simulated and real data examples.

Entities:  

Keywords:  Bayesian hierarchical model; Dirichlet mixture process prior; False discovery rate; Hidden Markov model; Multiple hypotheses testing

Year:  2019        PMID: 31662591      PMCID: PMC6818740          DOI: 10.1016/j.csda.2019.01.009

Source DB:  PubMed          Journal:  Comput Stat Data Anal        ISSN: 0167-9473            Impact factor:   1.681


  13 in total

1.  False Discovery Control in Large-Scale Spatial Multiple Testing.

Authors:  Wenguang Sun; Brian J Reich; T Tony Cai; Michele Guindani; Armin Schwartzman
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2015-01-01       Impact factor: 4.488

2.  Multiple testing in genome-wide association studies via hidden Markov models.

Authors:  Zhi Wei; Wenguang Sun; Kai Wang; Hakon Hakonarson
Journal:  Bioinformatics       Date:  2009-08-04       Impact factor: 6.937

3.  The effect of correlation in false discovery rate estimation.

Authors:  Armin Schwartzman; Xihong Lin
Journal:  Biometrika       Date:  2011-03       Impact factor: 2.445

4.  High-Dimensional Heteroscedastic Regression with an Application to eQTL Data Analysis.

Authors:  Z John Daye; Jinbo Chen; Hongzhe Li
Journal:  Biometrics       Date:  2011-08-12       Impact factor: 2.571

5.  Comment on "Correlated z-values and the accuracy of large-scale statistical estimates" by Bradley Efron.

Authors:  Armin Schwartzman
Journal:  J Am Stat Assoc       Date:  2010-09-01       Impact factor: 5.033

6.  A Bayesian Discovery Procedure.

Authors:  Michele Guindani; Peter Müller; Song Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2009-11-01       Impact factor: 4.488

7.  Multiple testing for neuroimaging via hidden Markov random field.

Authors:  Hai Shu; Bin Nan; Robert Koeppe
Journal:  Biometrics       Date:  2015-05-26       Impact factor: 2.571

8.  Graphical-model Based Multiple Testing under Dependence, with Applications to Genome-wide Association Studies.

Authors:  Jie Liu; Peggy Peissig; Chunming Zhang; Elizabeth Burnside; Catherine McCarty; David Page
Journal:  Uncertain Artif Intell       Date:  2012

9.  Bayesian optimal discovery procedure for simultaneous significance testing.

Authors:  Jing Cao; Xian-Jin Xie; Song Zhang; Angelique Whitehurst; Michael A White
Journal:  BMC Bioinformatics       Date:  2009-01-06       Impact factor: 3.169

10.  Large-scale multiple testing in genome-wide association studies via region-specific hidden Markov models.

Authors:  Jian Xiao; Wensheng Zhu; Jianhua Guo
Journal:  BMC Bioinformatics       Date:  2013-09-25       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.