Literature DB >> 26352632

High Dimensional Semiparametric Scale-Invariant Principal Component Analysis.

Fang Han, Han Liu.   

Abstract

We propose a new high dimensional semiparametric principal component analysis (PCA) method, named Copula Component Analysis (COCA). The semiparametric model assumes that, after unspecified marginally monotone transformations, the distributions are multivariate Gaussian. COCA improves upon PCA and sparse PCA in three aspects: (i) It is robust to modeling assumptions; (ii) It is robust to outliers and data contamination; (iii) It is scale-invariant and yields more interpretable results. We prove that the COCA estimators obtain fast estimation rates and are feature selection consistent when the dimension is nearly exponentially large relative to the sample size. Careful experiments confirm that COCA outperforms sparse PCA on both synthetic and real-world data sets.

Entities:  

Mesh:

Year:  2014        PMID: 26352632      PMCID: PMC5266498          DOI: 10.1109/TPAMI.2014.2307886

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  8 in total

1.  The isomap algorithm and topological stability.

Authors:  Mukund Balasubramanian; Eric L Schwartz
Journal:  Science       Date:  2002-01-04       Impact factor: 47.728

2.  A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis.

Authors:  Daniela M Witten; Robert Tibshirani; Trevor Hastie
Journal:  Biostatistics       Date:  2009-04-17       Impact factor: 5.899

3.  Frozen robust multiarray analysis (fRMA).

Authors:  Matthew N McCall; Benjamin M Bolstad; Rafael A Irizarry
Journal:  Biostatistics       Date:  2010-01-22       Impact factor: 5.899

4.  Statistical analysis of latent generalized correlation matrix estimation in transelliptical distribution.

Authors:  Fang Han; Han Liu
Journal:  Bernoulli (Andover)       Date:  2016-09-27       Impact factor: 1.595

5.  On Consistency and Sparsity for Principal Components Analysis in High Dimensions.

Authors:  Iain M Johnstone; Arthur Yu Lu
Journal:  J Am Stat Assoc       Date:  2009-06-01       Impact factor: 5.033

6.  Functional network organization of the human brain.

Authors:  Jonathan D Power; Alexander L Cohen; Steven M Nelson; Gagan S Wig; Kelly Anne Barnes; Jessica A Church; Alecia C Vogel; Timothy O Laumann; Fran M Miezin; Bradley L Schlaggar; Steven E Petersen
Journal:  Neuron       Date:  2011-11-17       Impact factor: 17.173

7.  Automated diagnoses of attention deficit hyperactive disorder using magnetic resonance imaging.

Authors:  Ani Eloyan; John Muschelli; Mary Beth Nebel; Han Liu; Fang Han; Tuo Zhao; Anita D Barber; Suresh Joel; James J Pekar; Stewart H Mostofsky; Brian Caffo
Journal:  Front Syst Neurosci       Date:  2012-08-30

8.  Capturing heterogeneity in gene expression studies by surrogate variable analysis.

Authors:  Jeffrey T Leek; John D Storey
Journal:  PLoS Genet       Date:  2007-08-01       Impact factor: 5.917

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.