Literature DB >> 23003134

Number of relevant directions in principal component analysis and Wishart random matrices.

Satya N Majumdar1, Pierpaolo Vivo.   

Abstract

We compute analytically, for large N, the probability P(N+,N) that a N×N Wishart random matrix has N+ eigenvalues exceeding a threshold Nζ, including its large deviation tails. This probability plays a benchmark role when performing the principal component analysis of a large empirical data set. We find that P(N+,N)≈exp[-βN2ψζ(N+/N)], where β is the Dyson index of the ensemble and ψζ(κ) is a rate function that we compute explicitly in the full range 0≤κ≤1 and for any ζ. The rate function ψζ(κ) displays a quadratic behavior modulated by a logarithmic singularity close to its minimum κ⋆(ζ). This is shown to be a consequence of a phase transition in an associated Coulomb gas problem. The variance Δ(N) of the number of relevant components is also shown to grow universally (independent of ζ) as Δ(N)∼(βπ2)-1 lnN for large N.

Year:  2012        PMID: 23003134     DOI: 10.1103/PhysRevLett.108.200601

Source DB:  PubMed          Journal:  Phys Rev Lett        ISSN: 0031-9007            Impact factor:   9.161


  1 in total

1.  Tail sums of Wishart and Gaussian eigenvalues beyond the bulk edge.

Authors:  Iain M Johnstone
Journal:  Aust N Z J Stat       Date:  2018-03-14       Impact factor: 0.640

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.