Literature DB >> 24729644

Estimating False Discovery Proportion Under Arbitrary Covariance Dependence.

Jianqing Fan1, Xu Han2, Weijie Gu3.   

Abstract

Multiple hypothesis testing is a fundamental problem in high dimensional inference, with wide applications in many scientific fields. In genome-wide association studies, tens of thousands of tests are performed simultaneously to find if any SNPs are associated with some traits and those tests are correlated. When test statistics are correlated, false discovery control becomes very challenging under arbitrary dependence. In the current paper, we propose a novel method based on principal factor approximation, which successfully subtracts the common dependence and weakens significantly the correlation structure, to deal with an arbitrary dependence structure. We derive an approximate expression for false discovery proportion (FDP) in large scale multiple testing when a common threshold is used and provide a consistent estimate of realized FDP. This result has important applications in controlling FDR and FDP. Our estimate of realized FDP compares favorably with Efron (2007)'s approach, as demonstrated in the simulated examples. Our approach is further illustrated by some real data applications. We also propose a dependence-adjusted procedure, which is more powerful than the fixed threshold procedure.

Entities:  

Keywords:  Multiple hypothesis testing; arbitrary dependence structure; false discovery rate; genome-wide association studies; high dimensional inference

Year:  2012        PMID: 24729644      PMCID: PMC3983872          DOI: 10.1080/01621459.2012.720478

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  5 in total

1.  Variance estimation using refitted cross-validation in ultrahigh dimensional regression.

Authors:  Jianqing Fan; Shaojun Guo; Ning Hao
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2012-01-01       Impact factor: 4.488

2.  A general framework for multiple testing dependence.

Authors:  Jeffrey T Leek; John D Storey
Journal:  Proc Natl Acad Sci U S A       Date:  2008-11-24       Impact factor: 11.205

3.  Correlated z-values and the accuracy of large-scale statistical estimates.

Authors:  Bradley Efron
Journal:  J Am Stat Assoc       Date:  2010-09-01       Impact factor: 5.033

4.  Gene expression variation and expression quantitative trait mapping of human chromosome 21 genes.

Authors:  Samuel Deutsch; Robert Lyle; Emmanouil T Dermitzakis; Homa Attar; Lakshman Subrahmanyan; Corinne Gehrig; Leila Parand; Maryline Gagnebin; Jacques Rougemont; C Victor Jongeneel; Stylianos E Antonarakis
Journal:  Hum Mol Genet       Date:  2005-10-26       Impact factor: 6.150

5.  Penalized Composite Quasi-Likelihood for Ultrahigh-Dimensional Variable Selection.

Authors:  Jelena Bradic; Jianqing Fan; Weiwei Wang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2011-06       Impact factor: 4.488

  5 in total
  36 in total

1.  PROJECTED PRINCIPAL COMPONENT ANALYSIS IN FACTOR MODELS.

Authors:  Jianqing Fan; Yuan Liao; Weichen Wang
Journal:  Ann Stat       Date:  2016-02       Impact factor: 4.028

2.  Pathway crosstalk effects: Shrinkage and disentanglement using a Bayesian hierarchical model.

Authors:  Alin Tomoiaga; Peter Westfall; Michele Donato; Sorin Draghici; Sonia Hassan; Roberto Romero; Paola Tellaroli
Journal:  Stat Biosci       Date:  2016-07-26

3.  Sufficient Forecasting Using Factor Models.

Authors:  Jianqing Fan; Lingzhou Xue; Jiawei Yao
Journal:  J Econom       Date:  2017-08-26       Impact factor: 2.388

Review 4.  Beyond smartphones and sensors: choosing appropriate statistical methods for the analysis of longitudinal data.

Authors:  Ian Barnett; John Torous; Patrick Staples; Matcheri Keshavan; Jukka-Pekka Onnela
Journal:  J Am Med Inform Assoc       Date:  2018-12-01       Impact factor: 4.497

5.  Structured variable selection with q-values.

Authors:  Tanya P Garcia; Samuel Müller; Raymond J Carroll; Tamara N Dunn; Anthony P Thomas; Sean H Adams; Suresh D Pillai; Rosemary L Walzem
Journal:  Biostatistics       Date:  2013-04-10       Impact factor: 5.899

6.  Testing a single regression coefficient in high dimensional linear models.

Authors:  Wei Lan; Ping-Shou Zhong; Runze Li; Hansheng Wang; Chih-Ling Tsai
Journal:  J Econom       Date:  2016-06-15       Impact factor: 2.388

7.  IPAD: Stable Interpretable Forecasting with Knockoffs Inference.

Authors:  Yingying Fan; Jinchi Lv; Mahrad Sharifvaghefi; Yoshimasa Uematsu
Journal:  J Am Stat Assoc       Date:  2019-09-17       Impact factor: 5.033

8.  NOISY MATRIX COMPLETION: UNDERSTANDING STATISTICAL GUARANTEES FOR CONVEX RELAXATION VIA NONCONVEX OPTIMIZATION.

Authors:  Yuxin Chen; Yuejie Chi; Jianqing Fan; Cong Ma; Yuling Yan
Journal:  SIAM J Optim       Date:  2020-10-28       Impact factor: 2.850

9.  Robust high dimensional factor models with applications to statistical machine learning.

Authors:  Jianqing Fan; Kaizheng Wang; Yiqiao Zhong; Ziwei Zhu
Journal:  Stat Sci       Date:  2021-04-19       Impact factor: 2.901

10.  Detecting and Testing Altered Brain Connectivity Networks with K-partite Network Topology.

Authors:  Shuo Chen; F DuBois Bowman; Yishi Xing
Journal:  Comput Stat Data Anal       Date:  2019-07-09       Impact factor: 1.681

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.