Literature DB >> 21709771

Hierarchical Bayesian nonparametric mixture models for clustering with variable relevance determination.

Christopher Yau1, Chris Holmes.   

Abstract

We propose a hierarchical Bayesian nonparametric mixture model for clustering when some of the covariates are assumed to be of varying relevance to the clustering problem. This can be thought of as an issue in variable selection for unsupervised learning. We demonstrate that by defining a hierarchical population based nonparametric prior on the cluster locations scaled by the inverse covariance matrices of the likelihood we arrive at a 'sparsity prior' representation which admits a conditionally conjugate prior. This allows us to perform full Gibbs sampling to obtain posterior distributions over parameters of interest including an explicit measure of each covariate's relevance and a distribution over the number of potential clusters present in the data. This also allows for individual cluster specific variable selection. We demonstrate improved inference on a number of canonical problems.

Entities:  

Year:  2011        PMID: 21709771      PMCID: PMC3121559          DOI: 10.1214/11-BA612

Source DB:  PubMed          Journal:  Bayesian Anal        ISSN: 1931-6690            Impact factor:   3.728


  3 in total

1.  Simultaneous feature selection and clustering using mixture models.

Authors:  Martin H C Law; Mário A T Figueiredo; Anil K Jain
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2004-09       Impact factor: 6.226

2.  Variable selection for clustering with Gaussian mixture models.

Authors:  Cathy Maugis; Gilles Celeux; Marie-Laure Martin-Magniette
Journal:  Biometrics       Date:  2009-02-04       Impact factor: 2.571

3.  Genomic aberrations and survival in chronic lymphocytic leukemia.

Authors:  H Döhner; S Stilgenbauer; A Benner; E Leupolt; A Kröber; L Bullinger; K Döhner; M Bentz; P Lichter
Journal:  N Engl J Med       Date:  2000-12-28       Impact factor: 91.245

  3 in total
  2 in total

1.  Digital phenotyping of sleep patterns among heterogenous samples of Latinx adults using unsupervised learning.

Authors:  Ipek Ensari; Billy A Caceres; Kasey B Jackman; Niurka Suero-Tejeda; Ari Shechter; Michelle L Odlum; Suzanne Bakken
Journal:  Sleep Med       Date:  2021-07-19       Impact factor: 4.842

2.  Model-based clustering based on sparse finite Gaussian mixtures.

Authors:  Gertraud Malsiner-Walli; Sylvia Frühwirth-Schnatter; Bettina Grün
Journal:  Stat Comput       Date:  2014-08-26       Impact factor: 2.559

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.