Literature DB >> 35693984

Eigenvectors from Eigenvalues Sparse Principal Component Analysis (EESPCA).

H Robert Frost1.   

Abstract

We present a novel technique for sparse principal component analysis. This method, named Eigenvectors from Eigenvalues Sparse Principal Component Analysis (EESPCA), is based on the formula for computing squared eigenvector loadings of a Hermitian matrix from the eigenvalues of the full matrix and associated sub-matrices. We explore two versions of the EESPCA method: a version that uses a fixed threshold for inducing sparsity and a version that selects the threshold via cross-validation. Relative to the state-of-the-art sparse PCA methods of Witten et al., Yuan & Zhang and Tan et al., the fixed threshold EESPCA technique offers an order-of-magnitude improvement in computational speed, does not require estimation of tuning parameters via cross-validation, and can more accurately identify true zero principal component loadings across a range of data matrix sizes and covariance structures. Importantly, the EESPCA method achieves these benefits while maintaining out-of-sample reconstruction error and PC estimation error close to the lowest error generated by all evaluated approaches. EESPCA is a practical and effective technique for sparse PCA with particular relevance to computationally demanding statistical problems such as the analysis of high-dimensional data sets or application of statistical techniques like resampling that involve the repeated calculation of sparse PCs.

Entities:  

Keywords:  eigenvector-eigenvalue identity; principal component analysis; sparse eigenvalue decomposition; sparse principal component analysis

Year:  2021        PMID: 35693984      PMCID: PMC9187050          DOI: 10.1080/10618600.2021.1987254

Source DB:  PubMed          Journal:  J Comput Graph Stat        ISSN: 1061-8600            Impact factor:   1.884


  16 in total

1.  Comprehensive Integration of Single-Cell Data.

Authors:  Tim Stuart; Andrew Butler; Paul Hoffman; Christoph Hafemeister; Efthymia Papalexi; William M Mauck; Yuhan Hao; Marlon Stoeckius; Peter Smibert; Rahul Satija
Journal:  Cell       Date:  2019-06-06       Impact factor: 41.582

Review 2.  Principal component analysis: a review and recent developments.

Authors:  Ian T Jolliffe; Jorge Cadima
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2016-04-13       Impact factor: 4.226

3.  On Consistency and Sparsity for Principal Components Analysis in High Dimensions.

Authors:  Iain M Johnstone; Arthur Yu Lu
Journal:  J Am Stat Assoc       Date:  2009-06-01       Impact factor: 5.033

4.  Gene ontology analysis for RNA-seq: accounting for selection bias.

Authors:  Matthew D Young; Matthew J Wakefield; Gordon K Smyth; Alicia Oshlack
Journal:  Genome Biol       Date:  2010-02-04       Impact factor: 13.583

Review 5.  Revealing the vectors of cellular identity with single-cell genomics.

Authors:  Allon Wagner; Aviv Regev; Nir Yosef
Journal:  Nat Biotechnol       Date:  2016-11-08       Impact factor: 54.908

Review 6.  Scaling single-cell genomics from phenomenology to mechanism.

Authors:  Amos Tanay; Aviv Regev
Journal:  Nature       Date:  2017-01-18       Impact factor: 49.962

7.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

8.  Pathway level analysis of gene expression using singular value decomposition.

Authors:  John Tomfohr; Jun Lu; Thomas B Kepler
Journal:  BMC Bioinformatics       Date:  2005-09-12       Impact factor: 3.169

9.  Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression.

Authors:  Christoph Hafemeister; Rahul Satija
Journal:  Genome Biol       Date:  2019-12-23       Impact factor: 13.583

10.  The Gene Ontology in 2010: extensions and refinements.

Authors: 
Journal:  Nucleic Acids Res       Date:  2009-11-17       Impact factor: 16.971

View more
  2 in total

1.  Technology-mediated teaching and learning process: A conceptual study of educators' response amidst the Covid-19 pandemic.

Authors:  Kingsley Okoye; Jorge Alfonso Rodriguez-Tort; Jose Escamilla; Samira Hosseini
Journal:  Educ Inf Technol (Dordr)       Date:  2021-05-18

2.  Impact of digital technologies upon teaching and learning in higher education in Latin America: an outlook on the reach, barriers, and bottlenecks.

Authors:  Kingsley Okoye; Haruna Hussein; Arturo Arrona-Palacios; Héctor Nahún Quintero; Luis Omar Peña Ortega; Angela Lopez Sanchez; Elena Arias Ortiz; Jose Escamilla; Samira Hosseini
Journal:  Educ Inf Technol (Dordr)       Date:  2022-08-15
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.