Literature DB >> 35707063

The Chi-Square Test of Distance Correlation.

Cencheng Shen1, Sambit Panda2, Joshua T Vogelstein2,3.   

Abstract

Distance correlation has gained much recent attention in the data science community: the sample statistic is straightforward to compute and asymptotically equals zero if and only if independence, making it an ideal choice to discover any type of dependency structure given sufficient sample size. One major bottleneck is the testing process: because the null distribution of distance correlation depends on the underlying random variables and metric choice, it typically requires a permutation test to estimate the null and compute the p-value, which is very costly for large amount of data. To overcome the difficulty, in this paper we propose a chi-square test for distance correlation. Method-wise, the chi-square test is non-parametric, extremely fast, and applicable to bias-corrected distance correlation using any strong negative type metric or characteristic kernel. The test exhibits a similar testing power as the standard permutation test, and can be utilized for K-sample and partial testing. Theory-wise, we show that the underlying chi-square distribution well approximates and dominates the limiting null distribution in upper tail, prove the chi-square test can be valid and universally consistent for testing independence, and establish a testing power inequality with respect to the permutation test.

Entities:  

Keywords:  centered chi-square distribution; nonparametric test; testing independence; unbiased distance covariance

Year:  2021        PMID: 35707063      PMCID: PMC9191842          DOI: 10.1080/10618600.2021.1938585

Source DB:  PubMed          Journal:  J Comput Graph Stat        ISSN: 1061-8600            Impact factor:   1.884


  6 in total

Review 1.  Advances in biomarker research for pancreatic cancer.

Authors:  Kruttika Bhat; Fengfei Wang; Qingyong Ma; Qinyu Li; Sanku Mallik; Tze-Chen Hsieh; Erxi Wu
Journal:  Curr Pharm Des       Date:  2012       Impact factor: 3.116

2.  On Brownian Distance Covariance and High Dimensional Data.

Authors:  Michael R Kosorok
Journal:  Ann Appl Stat       Date:  2009-01-01       Impact factor: 2.083

3.  CONDITIONAL DISTANCE CORRELATION.

Authors:  Xueqin Wang; Wenliang Pan; Wenhao Hu; Yuan Tian; Heping Zhang
Journal:  J Am Stat Assoc       Date:  2015-01-23       Impact factor: 5.033

4.  Mutant proteins as cancer-specific biomarkers.

Authors:  Qing Wang; Raghothama Chaerkady; Jian Wu; Hee Jung Hwang; Nick Papadopoulos; Levy Kopelovich; Anirban Maitra; Hanno Matthaei; James R Eshleman; Ralph H Hruban; Kenneth W Kinzler; Akhilesh Pandey; Bert Vogelstein
Journal:  Proc Natl Acad Sci U S A       Date:  2011-01-19       Impact factor: 11.205

5.  Discovering and deciphering relationships across disparate data modalities.

Authors:  Joshua T Vogelstein; Eric W Bridgeford; Qing Wang; Carey E Priebe; Mauro Maggioni; Cencheng Shen
Journal:  Elife       Date:  2019-01-15       Impact factor: 8.140

6.  Feature Screening via Distance Correlation Learning.

Authors:  Runze Li; Wei Zhong; Liping Zhu
Journal:  J Am Stat Assoc       Date:  2012-07-01       Impact factor: 5.033

  6 in total
  1 in total

1.  Distinct noncoding RNAs and RNA binding proteins associated with high-risk pediatric and adult acute myeloid leukemias detected by regulatory network analysis.

Authors:  Zhenqiu Liu; Vladimir S Spiegelman; Hong-Gang Wang
Journal:  Cancer Rep (Hoboken)       Date:  2021-12-04
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.