
Two-sample statistics based on anisotropic kernels.

Xiuyuan Cheng, Alexander Cloninger, Ronald R Coifman.

Abstract

The paper introduces a new kernel-based Maximum Mean Discrepancy (MMD) statistic for measuring the distance between two distributions given finitely many multivariate samples. When the distributions are locally low-dimensional, the proposed test can be made more powerful at distinguishing certain alternatives by incorporating local covariance matrices and constructing an anisotropic kernel. The kernel matrix is asymmetric; it computes the affinity between $n$ data points and a set of $n_R$ reference points, where $n_R$ can be drastically smaller than $n$. While the proposed statistic can be viewed as a special class of Reproducing Kernel Hilbert Space MMD, the consistency of the test is proved, under mild assumptions on the kernel, as long as $\|p - q\| \sqrt{n} \to \infty$, and a finite-sample lower bound on the testing power is obtained. Applications to flow cytometry and diffusion MRI datasets, which motivate the proposed approach to comparing distributions, are demonstrated.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Institute of Mathematics and its Applications. All rights reserved.

Keywords:  anisotropic kernel; maximum mean discrepancy; two-sample statistics
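
The construction summarized in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes a Gaussian anisotropic kernel built from a Mahalanobis distance, local covariances estimated from the k nearest pooled points around each reference point, a statistic equal to the mean squared difference of the two samples' average affinities at the reference points, and a permutation test for calibration. The function names, the ridge term, and all parameter defaults are hypothetical and do not come from the record above.

```python
import numpy as np


def local_covariances(refs, data, k=20, ridge=1e-3):
    """Estimate one covariance matrix per reference point from its k nearest data points.

    The ridge term is an assumption added here to keep each matrix invertible.
    """
    d = data.shape[1]
    covs = np.empty((len(refs), d, d))
    for j, r in enumerate(refs):
        nearest = np.argsort(np.sum((data - r) ** 2, axis=1))[:k]
        covs[j] = np.cov(data[nearest], rowvar=False) + ridge * np.eye(d)
    return covs


def anisotropic_affinity(points, refs, ref_covs):
    """Asymmetric (n_R, n) kernel matrix: A[r, i] = exp(-0.5 * Mahalanobis_r(x_i)^2)."""
    diffs = points[None, :, :] - refs[:, None, :]      # shape (n_R, n, d)
    precisions = np.linalg.inv(ref_covs)               # shape (n_R, d, d)
    sq_dist = np.einsum("rnd,rde,rne->rn", diffs, precisions, diffs)
    return np.exp(-0.5 * sq_dist)


def two_sample_statistic(x, y, refs, ref_covs):
    """Mean squared difference between the samples' average affinities at the references."""
    h_x = anisotropic_affinity(x, refs, ref_covs).mean(axis=1)
    h_y = anisotropic_affinity(y, refs, ref_covs).mean(axis=1)
    return float(np.mean((h_x - h_y) ** 2))


def permutation_p_value(x, y, refs, ref_covs, n_perm=200, seed=0):
    """Calibrate the statistic by reshuffling the pooled sample labels."""
    rng = np.random.default_rng(seed)
    observed = two_sample_statistic(x, y, refs, ref_covs)
    pooled = np.vstack([x, y])
    n_x = len(x)
    count = 0
    for _ in range(n_perm):
        perm = rng.permutation(len(pooled))
        stat = two_sample_statistic(
            pooled[perm[:n_x]], pooled[perm[n_x:]], refs, ref_covs
        )
        count += stat >= observed
    return (count + 1) / (n_perm + 1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=(300, 2))
    y = rng.normal(size=(300, 2)) + np.array([0.5, 0.0])  # shifted alternative
    pooled = np.vstack([x, y])
    # n_R = 20 reference points, far fewer than the n = 600 pooled data points.
    refs = pooled[rng.choice(len(pooled), size=20, replace=False)]
    covs = local_covariances(refs, pooled)
    print(two_sample_statistic(x, y, refs, covs))
    print(permutation_p_value(x, y, refs, covs))
```

Because the affinity matrix has only n_R rows, its cost grows with n times n_R rather than n squared, which is the practical appeal of using few reference points. Here the references and covariances are held fixed across permutations for simplicity; whether they should be re-estimated per permutation is a design choice this sketch does not settle.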

Year: 2019    PMID: 32929389    PMCID: PMC7478116    DOI: 10.1093/imaiai/iaz018

Source DB: PubMed    Journal: Inf inference    ISSN: 2049-8764


