| Literature DB >> 35845818 |
Minerva Mukhopadhyay1, Didong Li2, David B Dunson2.
Abstract
Current tools for multivariate density estimation struggle when the density is concentrated near a non-linear subspace or manifold. Most approaches require the choice of a kernel, with the multivariate Gaussian kernel by far the most commonly used. Although heavy-tailed and skewed extensions have been proposed, such kernels cannot capture curvature in the support of the data. This leads to poor performance unless the sample size is very large relative to the dimension of the data. The paper proposes a novel generalization of the Gaussian distribution, which includes an additional curvature parameter. We refer to the proposed class as Fisher-Gaussian kernels, since they arise by sampling from a von Mises-Fisher density on the sphere and adding Gaussian noise. The Fisher-Gaussian density has an analytic form and is amenable to straightforward implementation within Bayesian mixture models by using Markov chain Monte Carlo sampling. We provide theory on large support and illustrate gains relative to competitors in simulated and real data applications.Entities:
Keywords: Bayesian mixture models; Kernel density estimation; Manifold learning; Markov chain Monte Carlo methods; Mixture model; Spherical data; von Mises–Fisher density
Year: 2020 PMID: 35845818 PMCID: PMC9286319 DOI: 10.1111/rssb.12390
Source DB: PubMed Journal: J R Stat Soc Series B Stat Methodol ISSN: 1369-7412 Impact factor: 4.933