| Literature DB >> 32929389 |
Xiuyuan Cheng1, Alexander Cloninger2, Ronald R Coifman3.
Abstract
The paper introduces a new kernel-based Maximum Mean Discrepancy (MMD) statistic for measuring the distance between two distributions given finitely many multivariate samples. When the distributions are locally low-dimensional, the proposed test can be made more powerful to distinguish certain alternatives by incorporating local covariance matrices and constructing an anisotropic kernel. The kernel matrix is asymmetric; it computes the affinity between [Formula: see text] data points and a set of [Formula: see text] reference points, where [Formula: see text] can be drastically smaller than [Formula: see text]. While the proposed statistic can be viewed as a special class of Reproducing Kernel Hilbert Space MMD, the consistency of the test is proved, under mild assumptions of the kernel, as long as [Formula: see text], and a finite-sample lower bound of the testing power is obtained. Applications to flow cytometry and diffusion MRI datasets are demonstrated, which motivate the proposed approach to compare distributions.Keywords: anisotropic kernel; maximum mean discrepancy; two-sample statistics
Year: 2019 PMID: 32929389 PMCID: PMC7478116 DOI: 10.1093/imaiai/iaz018
Source DB: PubMed Journal: Inf inference ISSN: 2049-8764