Literature DB >> 26858467

The Empirical Distribution of a Large Number of Correlated Normal Variables.

David Azriel1, Armin Schwartzman2.   

Abstract

Motivated by the advent of high dimensional highly correlated data, this work studies the limit behavior of the empirical cumulative distribution function (ecdf) of standard normal random variables under arbitrary correlation. First, we provide a necessary and sufficient condition for convergence of the ecdf to the standard normal distribution. Next, under general correlation, we show that the ecdf limit is a random, possible infinite, mixture of normal distribution functions that depends on a number of latent variables and can serve as an asymptotic approximation to the ecdf in high dimensions. We provide conditions under which the dimension of the ecdf limit, defined as the smallest number of effective latent variables, is finite. Estimates of the latent variables are provided and their consistency proved. We demonstrate these methods in a real high-dimensional data example from brain imaging where it is shown that, while the study exhibits apparently strongly significant results, they can be entirely explained by correlation, as captured by the asymptotic approximation developed here.

Entities:  

Keywords:  asymptotic approximation; dependent random variables; empirical null; factor analysis; high dimensional data; strong correlation

Year:  2014        PMID: 26858467      PMCID: PMC4742377          DOI: 10.1080/01621459.2014.958156

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  6 in total

1.  Brain gray matter deficits at 33-year follow-up in adults with attention-deficit/hyperactivity disorder established in childhood.

Authors:  Erika Proal; Philip T Reiss; Rachel G Klein; Salvatore Mannuzza; Kristin Gotimer; Maria A Ramos-Olazagasti; Jason P Lerch; Yong He; Alex Zijdenbos; Clare Kelly; Michael P Milham; F Xavier Castellanos
Journal:  Arch Gen Psychiatry       Date:  2011-11

2.  The effect of correlation in false discovery rate estimation.

Authors:  Armin Schwartzman; Xihong Lin
Journal:  Biometrika       Date:  2011-03       Impact factor: 2.445

3.  Correlated z-values and the accuracy of large-scale statistical estimates.

Authors:  Bradley Efron
Journal:  J Am Stat Assoc       Date:  2010-09-01       Impact factor: 5.033

4.  Comment on "Correlated z-values and the accuracy of large-scale statistical estimates" by Bradley Efron.

Authors:  Armin Schwartzman
Journal:  J Am Stat Assoc       Date:  2010-09-01       Impact factor: 5.033

5.  Paradoxical results of adaptive false discovery rate procedures in neuroimaging studies.

Authors:  Philip T Reiss; Armin Schwartzman; Feihan Lu; Lei Huang; Erika Proal
Journal:  Neuroimage       Date:  2012-07-27       Impact factor: 6.556

6.  Estimating False Discovery Proportion Under Arbitrary Covariance Dependence.

Authors:  Jianqing Fan; Xu Han; Weijie Gu
Journal:  J Am Stat Assoc       Date:  2012       Impact factor: 5.033

  6 in total
  1 in total

1.  Estimation of the false discovery proportion with unknown dependence.

Authors:  Jianqing Fan; Xu Han
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2016-09-26       Impact factor: 4.488

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.