
Cauchy combination test: a powerful test with analytic p-value calculation under arbitrary dependency structures.

Yaowu Liu, Jun Xie.

Abstract

Combining individual p-values to aggregate multiple small effects has long been of interest in statistics, dating back to Fisher's classic combination test. In modern large-scale data analysis, correlation and sparsity are common features, and efficient computation is necessary for dealing with massive data. To overcome these challenges, we propose a new test that takes advantage of the Cauchy distribution. Our test statistic has a simple form and is defined as a weighted sum of Cauchy transformations of the individual p-values. We prove a non-asymptotic result that the tail of the null distribution of our proposed test statistic can be well approximated by a Cauchy distribution under arbitrary dependency structures. Based on this theoretical result, the p-value calculation of our proposed test is not only accurate but also as simple as that of the classic z-test or t-test, making our test well suited for analyzing massive data. We further show that the power of the proposed test is asymptotically optimal in a strong sparsity setting. Extensive simulations demonstrate that the proposed test has both strong power against sparse alternatives and good accuracy with respect to p-value calculations, especially for very small p-values. The proposed test has also been applied to a genome-wide association study of Crohn's disease and compared with several existing tests.
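
The construction described in the abstract can be illustrated with a minimal sketch. The abstract does not spell out the formula, so the sketch assumes the form commonly associated with this test: each p-value p_i is mapped to tan((0.5 - p_i)π), which is standard Cauchy when p_i is uniform on (0, 1); the statistic is the weighted sum with non-negative weights normalized to sum to one; and the combined p-value is read off the standard Cauchy tail. The function name `cauchy_combination` and its arguments are hypothetical and not taken from the authors' software.

```python
import math


def cauchy_combination(pvalues, weights=None):
    """Sketch of a Cauchy combination of p-values (assumed form, see lead-in).

    Returns (statistic, combined_p). Weights default to equal weights and
    are normalized to sum to one.
    """
    n = len(pvalues)
    if n == 0:
        raise ValueError("need at least one p-value")
    if weights is None:
        weights = [1.0] * n
    if len(weights) != n:
        raise ValueError("weights and pvalues must have the same length")
    total = sum(weights)
    if total <= 0 or any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative with a positive sum")
    if any(not (0.0 < p < 1.0) for p in pvalues):
        raise ValueError("p-values must lie strictly between 0 and 1")

    # Cauchy transformation: tan((0.5 - p) * pi) is standard Cauchy
    # when p is uniform on (0, 1).
    statistic = sum(
        (w / total) * math.tan((0.5 - p) * math.pi)
        for w, p in zip(weights, pvalues)
    )

    # Upper-tail probability of a standard Cauchy at the observed statistic.
    combined_p = 0.5 - math.atan(statistic) / math.pi
    return statistic, combined_p


if __name__ == "__main__":
    stat, p = cauchy_combination([1e-8, 0.5, 0.5, 0.5])
    print(f"statistic = {stat:.4g}, combined p-value = {p:.4g}")
```

In this sketch a single very small p-value dominates the sum, which is consistent with the abstract's emphasis on sparse alternatives. No correlation matrix is needed because the tail approximation is stated to hold under arbitrary dependency structures. For extremely small inputs the subtraction `0.5 - atan(...)/pi` loses floating-point precision, so a production implementation would need a more careful tail evaluation.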


Keywords:  Cauchy distribution; Correlation matrix; Global hypothesis testing; High dimensional data; Non-asymptotic approximation; Sparse alternative

Year:  2019        PMID: 33012899      PMCID: PMC7531765          DOI: 10.1080/01621459.2018.1554485

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


