Literature DB >> 28018116

The Statistics and Mathematics of High Dimension Low Sample Size Asymptotics.

Dan Shen1, Haipeng Shen2, Hongtu Zhu3, J S Marron3.   

Abstract

The aim of this paper is to establish several deep theoretical properties of principal component analysis for multiple-component spike covariance models. Our new results reveal an asymptotic conical structure in critical sample eigendirections under the spike models with distinguishable (or indistinguishable) eigenvalues, when the sample size and/or the number of variables (or dimension) tend to infinity. The consistency of the sample eigenvectors relative to their population counterparts is determined by the ratio between the dimension and the product of the sample size with the spike size. When this ratio converges to a nonzero constant, the sample eigenvector converges to a cone, with a certain angle to its corresponding population eigenvector. In the High Dimension, Low Sample Size case, the angle between the sample eigenvector and its population counterpart converges to a limiting distribution. Several generalizations of the multi-spike covariance models are also explored, and additional theoretical results are presented.

Entities:  

Keywords:  Big data; Conical behavior; High dimension low sample size; PCA

Year:  2016        PMID: 28018116      PMCID: PMC5173295          DOI: 10.5705/ss.202015.0088

Source DB:  PubMed          Journal:  Stat Sin        ISSN: 1017-0405            Impact factor:   1.261


  9 in total

1.  RNA-Seq-quantitative measurement of expression through massively parallel RNA-sequencing.

Authors:  Brian T Wilhelm; Josette-Renée Landry
Journal:  Methods       Date:  2009-03-29       Impact factor: 3.608

Review 2.  The incredible shrinking world of DNA microarrays.

Authors:  Sarah J Wheelan; Francisco Martínez Murillo; Jef D Boeke
Journal:  Mol Biosyst       Date:  2008-04-17

3.  Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.

Authors:  Hao Helen Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-11       Impact factor: 4.488

Review 4.  Overview of object oriented data analysis.

Authors:  J Steve Marron; Andrés M Alonso
Journal:  Biom J       Date:  2014-01-13       Impact factor: 2.207

5.  CONVERGENCE AND PREDICTION OF PRINCIPAL COMPONENT SCORES IN HIGH-DIMENSIONAL SETTINGS.

Authors:  Seunggeun Lee; Fei Zou; Fred A Wright
Journal:  Ann Stat       Date:  2010-01-01       Impact factor: 4.028

6.  On Consistency and Sparsity for Principal Components Analysis in High Dimensions.

Authors:  Iain M Johnstone; Arthur Yu Lu
Journal:  J Am Stat Assoc       Date:  2009-06-01       Impact factor: 5.033

7.  Distributions of Angles in Random Packing on Spheres.

Authors:  Tony Cai; Jianqing Fan; Tiefeng Jiang
Journal:  J Mach Learn Res       Date:  2013-01       Impact factor: 3.654

8.  SWISS MADE: Standardized WithIn Class Sum of Squares to evaluate methodologies and dataset elements.

Authors:  Christopher R Cabanski; Yuan Qi; Xiaoying Yin; Eric Bair; Michele C Hayward; Cheng Fan; Jianying Li; Matthew D Wilkerson; J S Marron; Charles M Perou; D Neil Hayes
Journal:  PLoS One       Date:  2010-03-26       Impact factor: 3.240

9.  Annotating genomes with massive-scale RNA sequencing.

Authors:  France Denoeud; Jean-Marc Aury; Corinne Da Silva; Benjamin Noel; Odile Rogier; Massimo Delledonne; Michele Morgante; Giorgio Valle; Patrick Wincker; Claude Scarpelli; Olivier Jaillon; François Artiguenave
Journal:  Genome Biol       Date:  2008-12-16       Impact factor: 13.583

  9 in total
  5 in total

1.  A survey of high dimension low sample size asymptotics.

Authors:  Makoto Aoshima; Dan Shen; Haipeng Shen; Kazuyoshi Yata; Yi-Hui Zhou; J S Marron
Journal:  Aust N Z J Stat       Date:  2018-03-14       Impact factor: 0.640

2.  Distributed estimation of principal eigenspaces.

Authors:  Jianqing Fan; Dong Wang; Kaizheng Wang; Ziwei Zhu
Journal:  Ann Stat       Date:  2019-10-31       Impact factor: 4.028

3.  PCA in High Dimensions: An orientation.

Authors:  Iain M Johnstone; Debashis Paul
Journal:  Proc IEEE Inst Electr Electron Eng       Date:  2018-07-18       Impact factor: 10.961

4.  Factor-Adjusted Regularized Model Selection.

Authors:  Jianqing Fan; Yuan Ke; Kaizheng Wang
Journal:  J Econom       Date:  2020-02-07       Impact factor: 2.388

5.  A Guide for Sparse PCA: Model Comparison and Applications.

Authors:  Rosember Guerra-Urzola; Katrijn Van Deun; Juan C Vera; Klaas Sijtsma
Journal:  Psychometrika       Date:  2021-06-29       Impact factor: 2.290

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.