Literature DB >> 30828106

Streaming PCA and Subspace Tracking: The Missing Data Case.

Laura Balzano1, Yuejie Chi2, Yue M Lu3.   

Abstract

For many modern applications in science and engineering, data are collected in a streaming fashion carrying time-varying information, and practitioners need to process them with a limited amount of memory and computational resources in a timely manner for decision making. This often is coupled with the missing data problem, such that only a small fraction of data attributes are observed. These complications impose significant, and unconventional, constraints on the problem of streaming Principal Component Analysis (PCA) and subspace tracking, which is an essential building block for many inference tasks in signal processing and machine learning. This survey article reviews a variety of classical and recent algorithms for solving this problem with low computational and memory complexities, particularly those applicable in the big data regime with missing data. We illustrate that streaming PCA and subspace tracking algorithms can be understood through algebraic and geometric perspectives, and they need to be adjusted carefully to handle missing data. Both asymptotic and non-asymptotic convergence guarantees are reviewed. Finally, we benchmark the performance of several competitive algorithms in the presence of missing data for both well-conditioned and ill-conditioned systems.

Entities:  

Keywords:  ODE analysis; missing data; streaming PCA; subspace and low-rank models; subspace tracking

Year:  2018        PMID: 30828106      PMCID: PMC6395049          DOI: 10.1109/JPROC.2018.2847041

Source DB:  PubMed          Journal:  Proc IEEE Inst Electr Electron Eng        ISSN: 0018-9219            Impact factor:   10.961


  3 in total

1.  Multivariate Time Series Imputation: An Approach Based on Dictionary Learning.

Authors:  Xiaomeng Zheng; Bogdan Dumitrescu; Jiamou Liu; Ciprian Doru Giurcăneanu
Journal:  Entropy (Basel)       Date:  2022-07-31       Impact factor: 2.738

2.  Adaptive dimensionality reduction for neural network-based online principal component analysis.

Authors:  Nico Migenda; Ralf Möller; Wolfram Schenck
Journal:  PLoS One       Date:  2021-03-30       Impact factor: 3.240

3.  Benchmarking principal component analysis for large-scale single-cell RNA-sequencing.

Authors:  Koki Tsuyuzaki; Hiroyuki Sato; Kenta Sato; Itoshi Nikaido
Journal:  Genome Biol       Date:  2020-01-20       Impact factor: 13.583

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.