Literature DB >> 34321713

Robust high dimensional factor models with applications to statistical machine learning.

Jianqing Fan1, Kaizheng Wang2, Yiqiao Zhong3, Ziwei Zhu4.   

Abstract

Factor models are a class of powerful statistical models that have been widely used to deal with dependent measurements that arise frequently from various applications from genomics and neuroscience to economics and finance. As data are collected at an ever-growing scale, statistical machine learning faces some new challenges: high dimensionality, strong dependence among observed variables, heavy-tailed variables and heterogeneity. High-dimensional robust factor analysis serves as a powerful toolkit to conquer these challenges. This paper gives a selective overview on recent advance on high-dimensional factor models and their applications to statistics including Factor-Adjusted Robust Model selection (FarmSelect) and Factor-Adjusted Robust Multiple testing (FarmTest). We show that classical methods, especially principal component analysis (PCA), can be tailored to many new problems and provide powerful tools for statistical estimation and inference. We highlight PCA and its connections to matrix perturbation theory, robust statistics, random projection, false discovery rate, etc., and illustrate through several applications how insights from these fields yield solutions to modern challenges. We also present far-reaching connections between factor models and popular statistical learning problems, including network analysis and low-rank matrix recovery.

Keywords:  Factor model; FarmSelect; FarmTest; PCA; covariance estimation; perturbation bounds; random sketch; robustness

Year:  2021        PMID: 34321713      PMCID: PMC8315369          DOI: 10.1214/20-sts785

Source DB:  PubMed          Journal:  Stat Sci        ISSN: 0883-4237            Impact factor:   2.901


  21 in total

1.  HIGH DIMENSIONAL COVARIANCE MATRIX ESTIMATION IN APPROXIMATE FACTOR MODELS.

Authors:  Jianqing Fan; Yuan Liao; Martina Mincheva
Journal:  Ann Stat       Date:  2011-01-01       Impact factor: 4.028

2.  A RATIONALE AND TEST FOR THE NUMBER OF FACTORS IN FACTOR ANALYSIS.

Authors:  J L HORN
Journal:  Psychometrika       Date:  1965-06       Impact factor: 2.500

Review 3.  An introduction to the five-factor model and its applications.

Authors:  R R McCrae; O P John
Journal:  J Pers       Date:  1992-06

4.  Optimal M-estimation in high-dimensional regression.

Authors:  Derek Bean; Peter J Bickel; Noureddine El Karoui; Bin Yu
Journal:  Proc Natl Acad Sci U S A       Date:  2013-08-16       Impact factor: 11.205

5.  LARGE COVARIANCE ESTIMATION THROUGH ELLIPTICAL FACTOR MODELS.

Authors:  Jianqing Fan; Han Liu; Weichen Wang
Journal:  Ann Stat       Date:  2018-06-27       Impact factor: 4.028

6.  ENTRYWISE EIGENVECTOR ANALYSIS OF RANDOM MATRICES WITH LOW EXPECTED RANK.

Authors:  Emmanuel Abbe; Jianqing Fan; Kaizheng Wang; Yiqiao Zhong
Journal:  Ann Stat       Date:  2020-07-17       Impact factor: 4.028

7.  Asymptotics of empirical eigenstructure for high dimensional spiked covariance.

Authors:  Weichen Wang; Jianqing Fan
Journal:  Ann Stat       Date:  2017-06-13       Impact factor: 4.028

8.  On Consistency and Sparsity for Principal Components Analysis in High Dimensions.

Authors:  Iain M Johnstone; Arthur Yu Lu
Journal:  J Am Stat Assoc       Date:  2009-06-01       Impact factor: 5.033

9.  Viewing Direction Estimation in Cryo-EM Using Synchronization.

Authors:  Yoel Shkolnisky; Amit Singer
Journal:  SIAM J Imaging Sci       Date:  2012-09-01       Impact factor: 2.867

10.  Large Covariance Estimation by Thresholding Principal Orthogonal Complements.

Authors:  Jianqing Fan; Yuan Liao; Martina Mincheva
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2013-09-01       Impact factor: 4.488

View more
  1 in total

1.  Bayesian Factor-adjusted Sparse Regression.

Authors:  Jianqing Fan; Bai Jiang; Qiang Sun
Journal:  J Econom       Date:  2021-11-01       Impact factor: 3.363

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.