Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 On Consistency and Sparsity for Principal Components Analysis in High Dimensions.

Literature DB >> 20617121

On Consistency and Sparsity for Principal Components Analysis in High Dimensions.

Abstract

Principal components analysis (PCA) is a classic method for the reduction of dimensionality of data in the form of n observations (or cases) of a vector with p variables. Contemporary datasets often have p comparable with or even much larger than n. Our main assertions, in such settings, are (a) that some initial reduction in dimensionality is desirable before applying any PCA-type search for principal modes, and (b) the initial reduction in dimensionality is best achieved by working in a basis in which the signals have a sparse representation. We describe a simple asymptotic model in which the estimate of the leading principal component vector via standard PCA is consistent if and only if p(n)/n→0. We provide a simple algorithm for selecting a subset of coordinates with largest sample variances, and show that if PCA is done on the selected subset, then consistency is recovered, even if p(n) ⪢ n.

Entities: Chemical Disease Gene Species

Year: 2009 PMID： 20617121 PMCID： PMC2898454 DOI： 10.1198/jasa.2009.0121

Source DB: PubMed Journal: J Am Stat Assoc ISSN： 0162-1459 Impact factor: 5.033

1 in total

1. Principal-component-analysis eigenvalue spectra from data with symmetry-breaking structure.

Authors: D C Hoyle; M Rattray
Journal: Phys Rev E Stat Nonlin Soft Matter Phys Date: 2004-02-27

1 in total

72 in total

On Consistency and Sparsity for Principal Components Analysis in High Dimensions.

1. Principal-component-analysis eigenvalue spectra from data with symmetry-breaking structure.

1. Limitations of GCTA as a solution to the missing heritability problem.

2. Biclustering with heterogeneous variance.

3. TESTING HIGH-DIMENSIONAL COVARIANCE MATRICES, WITH APPLICATION TO DETECTING SCHIZOPHRENIA RISK GENES.

4. Two-Step Hypothesis Testing When the Number of Variables Exceeds the Sample Size.

5. LARGE COVARIANCE ESTIMATION THROUGH ELLIPTICAL FACTOR MODELS.

6. Statistical challenges of high-dimensional data.

7. FLCRM: Functional linear cox regression model.

8. Scale-Invariant Sparse PCA on High Dimensional Meta-elliptical Data.

9. PCA in High Dimensions: An orientation.

10. Sparse principal component analysis by choice of norm.