Literature DB >> 28835726

Asymptotics of empirical eigenstructure for high dimensional spiked covariance.

Weichen Wang1, Jianqing Fan1,2.   

Abstract

We derive the asymptotic distributions of the spiked eigenvalues and eigenvectors under a generalized and unified asymptotic regime, which takes into account the magnitude of spiked eigenvalues, sample size, and dimensionality. This regime allows high dimensionality and diverging eigenvalues and provides new insights into the roles that the leading eigenvalues, sample size, and dimensionality play in principal component analysis. Our results are a natural extension of those in Paul (2007) to a more general setting and solve the rates of convergence problems in Shen et al. (2013). They also reveal the biases of estimating leading eigenvalues and eigenvectors by using principal component analysis, and lead to a new covariance estimator for the approximate factor model, called shrinkage principal orthogonal complement thresholding (S-POET), that corrects the biases. Our results are successfully applied to outstanding problems in estimation of risks of large portfolios and false discovery proportions for dependent test statistics and are illustrated by simulation studies.

Entities:  

Keywords:  Approximate factor model; Asymptotic distributions; Diverging eigenvalues; False discovery proportion; Principal component analysis; Relative risk management; Spiked covariance model

Year:  2017        PMID: 28835726      PMCID: PMC5563862          DOI: 10.1214/16-AOS1487

Source DB:  PubMed          Journal:  Ann Stat        ISSN: 0090-5364            Impact factor:   4.028


  17 in total

1.  Noise reduction in BOLD-based fMRI using component analysis.

Authors:  Christopher G Thomas; Richard A Harshman; Ravi S Menon
Journal:  Neuroimage       Date:  2002-11       Impact factor: 6.556

2.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

3.  What is principal component analysis?

Authors:  Markus Ringnér
Journal:  Nat Biotechnol       Date:  2008-03       Impact factor: 54.908

4.  A general framework for multiple testing dependence.

Authors:  Jeffrey T Leek; John D Storey
Journal:  Proc Natl Acad Sci U S A       Date:  2008-11-24       Impact factor: 11.205

5.  Japanese population structure, based on SNP genotypes from 7003 individuals compared to other ethnic groups: effects on population-based association studies.

Authors:  Yumi Yamaguchi-Kabata; Kazuyuki Nakazono; Atsushi Takahashi; Susumu Saito; Naoya Hosono; Michiaki Kubo; Yusuke Nakamura; Naoyuki Kamatani
Journal:  Am J Hum Genet       Date:  2008-09-25       Impact factor: 11.025

6.  CONVERGENCE AND PREDICTION OF PRINCIPAL COMPONENT SCORES IN HIGH-DIMENSIONAL SETTINGS.

Authors:  Seunggeun Lee; Fei Zou; Fred A Wright
Journal:  Ann Stat       Date:  2010-01-01       Impact factor: 4.028

7.  On Consistency and Sparsity for Principal Components Analysis in High Dimensions.

Authors:  Iain M Johnstone; Arthur Yu Lu
Journal:  J Am Stat Assoc       Date:  2009-06-01       Impact factor: 5.033

Review 8.  Tackling the widespread and critical impact of batch effects in high-throughput data.

Authors:  Jeffrey T Leek; Robert B Scharpf; Héctor Corrada Bravo; David Simcha; Benjamin Langmead; W Evan Johnson; Donald Geman; Keith Baggerly; Rafael A Irizarry
Journal:  Nat Rev Genet       Date:  2010-09-14       Impact factor: 53.242

9.  Large Covariance Estimation by Thresholding Principal Orthogonal Complements.

Authors:  Jianqing Fan; Yuan Liao; Martina Mincheva
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2013-09-01       Impact factor: 4.488

10.  Permutation-validated principal components analysis of microarray data.

Authors:  Jobst Landgrebe; Wolfgang Wurst; Gerhard Welzl
Journal:  Genome Biol       Date:  2002-03-22       Impact factor: 13.583

View more
  12 in total

1.  Robust Covariance Estimation for Approximate Factor Models.

Authors:  Jianqing Fan; Weichen Wang; Yiqiao Zhong
Journal:  J Econom       Date:  2018-10-06       Impact factor: 2.388

2.  Robust high dimensional factor models with applications to statistical machine learning.

Authors:  Jianqing Fan; Kaizheng Wang; Yiqiao Zhong; Ziwei Zhu
Journal:  Stat Sci       Date:  2021-04-19       Impact factor: 2.901

3.  Distributed estimation of principal eigenspaces.

Authors:  Jianqing Fan; Dong Wang; Kaizheng Wang; Ziwei Zhu
Journal:  Ann Stat       Date:  2019-10-31       Impact factor: 4.028

4.  Bayesian Factor-adjusted Sparse Regression.

Authors:  Jianqing Fan; Bai Jiang; Qiang Sun
Journal:  J Econom       Date:  2021-11-01       Impact factor: 3.363

5.  D-GCCA: Decomposition-based Generalized Canonical Correlation Analysis for Multi-view High-dimensional Data.

Authors:  Hai Shu; Zhe Qu; Hongtu Zhu
Journal:  J Mach Learn Res       Date:  2022       Impact factor: 5.177

6.  DOUBLY DEBIASED LASSO: HIGH-DIMENSIONAL INFERENCE UNDER HIDDEN CONFOUNDING.

Authors:  Zijian Guo; Domagoj Ćevid; Peter Bühlmann
Journal:  Ann Stat       Date:  2022-06-16       Impact factor: 4.904

7.  CDPA: Common and Distinctive Pattern Analysis between High-dimensional Datasets.

Authors:  Hai Shu; Zhe Qu
Journal:  Electron J Stat       Date:  2022-04-04       Impact factor: 1.225

8.  PCA in High Dimensions: An orientation.

Authors:  Iain M Johnstone; Debashis Paul
Journal:  Proc IEEE Inst Electr Electron Eng       Date:  2018-07-18       Impact factor: 10.961

9.  Factor-Adjusted Regularized Model Selection.

Authors:  Jianqing Fan; Yuan Ke; Kaizheng Wang
Journal:  J Econom       Date:  2020-02-07       Impact factor: 2.388

10.  Heterogeneity adjustment with applications to graphical model inference.

Authors:  Jianqing Fan; Han Liu; Weichen Wang; Ziwei Zhu
Journal:  Electron J Stat       Date:  2018-12-05       Impact factor: 1.125

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.