Literature DB >> 29983475

Mixture models with a prior on the number of components.

Jeffrey W Miller1, Matthew T Harrison2.   

Abstract

A natural Bayesian approach for mixture models with an unknown number of components is to take the usual finite mixture model with symmetric Dirichlet weights, and put a prior on the number of components-that is, to use a mixture of finite mixtures (MFM). The most commonly-used method of inference for MFMs is reversible jump Markov chain Monte Carlo, but it can be nontrivial to design good reversible jump moves, especially in high-dimensional spaces. Meanwhile, there are samplers for Dirichlet process mixture (DPM) models that are relatively simple and are easily adapted to new applications. It turns out that, in fact, many of the essential properties of DPMs are also exhibited by MFMs-an exchangeable partition distribution, restaurant process, random measure representation, and stick-breaking representation-and crucially, the MFM analogues are simple enough that they can be used much like the corresponding DPM properties. Consequently, many of the powerful methods developed for inference in DPMs can be directly applied to MFMs as well; this simplifies the implementation of MFMs and can substantially improve mixing. We illustrate with real and simulated data, including high-dimensional gene expression data used to discriminate cancer subtypes.

Entities:  

Keywords:  Bayesian; clustering; density estimation; model selection; nonparametric

Year:  2017        PMID: 29983475      PMCID: PMC6035010          DOI: 10.1080/01621459.2016.1255636

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  19 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  A mixture model-based approach to the clustering of microarray expression data.

Authors:  G J McLachlan; R W Bean; D Peel
Journal:  Bioinformatics       Date:  2002-03       Impact factor: 6.937

3.  Bayesian mixture model based clustering of replicated microarray data.

Authors:  M Medvedovic; K Y Yeung; R E Bumgarner
Journal:  Bioinformatics       Date:  2004-02-10       Impact factor: 6.937

4.  A Dirichlet process mixture model for brain MRI tissue classification.

Authors:  Adelino R Ferreira da Silva
Journal:  Med Image Anal       Date:  2006-12-21       Impact factor: 8.545

5.  Bayesian haplotype inference via the Dirichlet process.

Authors:  Eric P Xing; Michael I Jordan; Roded Sharan
Journal:  J Comput Biol       Date:  2007-04       Impact factor: 1.479

6.  Multichannel electrophysiological spike sorting via joint dictionary learning and mixture modeling.

Authors:  David E Carlson; Joshua T Vogelstein; Colin R Stoetzner; Daryl Kipke; Douglas Weber; David B Dunson; Lawrence Carin
Journal:  IEEE Trans Biomed Eng       Date:  2013-07-30       Impact factor: 4.538

7.  Nonparametric Bayesian models through probit stick-breaking processes.

Authors:  Abel Rodríguez; David B Dunson
Journal:  Bayesian Anal       Date:  2011-03-01       Impact factor: 3.728

8.  Evolution and impact of subclonal mutations in chronic lymphocytic leukemia.

Authors:  Dan A Landau; Scott L Carter; Petar Stojanov; Aaron McKenna; Kristen Stevenson; Michael S Lawrence; Carrie Sougnez; Chip Stewart; Andrey Sivachenko; Lili Wang; Youzhong Wan; Wandi Zhang; Sachet A Shukla; Alexander Vartanov; Stacey M Fernandes; Gordon Saksena; Kristian Cibulskis; Bethany Tesar; Stacey Gabriel; Nir Hacohen; Matthew Meyerson; Eric S Lander; Donna Neuberg; Jennifer R Brown; Gad Getz; Catherine J Wu
Journal:  Cell       Date:  2013-02-14       Impact factor: 41.582

9.  Random Partition Models with Regression on Covariates.

Authors:  Peter Müller; Fernando Quintana
Journal:  J Stat Plan Inference       Date:  2010-10-01       Impact factor: 1.111

10.  Clustering cancer gene expression data: a comparative study.

Authors:  Marcilio C P de Souto; Ivan G Costa; Daniel S A de Araujo; Teresa B Ludermir; Alexander Schliep
Journal:  BMC Bioinformatics       Date:  2008-11-27       Impact factor: 3.169

View more
  9 in total

1.  Bayesian hierarchical finite mixture of regression for histopathological imaging-based cancer data analysis.

Authors:  Yunju Im; Yuan Huang; Jian Huang; Shuangge Ma
Journal:  Stat Med       Date:  2022-01-13       Impact factor: 2.373

2.  Generalized infinite factorization models.

Authors:  L Schiavon; A Canale; D B Dunson
Journal:  Biometrika       Date:  2022-01-19       Impact factor: 3.028

3.  How many data clusters are in the Galaxy data set?: Bayesian cluster analysis in action.

Authors:  Bettina Grün; Gertraud Malsiner-Walli; Sylvia Frühwirth-Schnatter
Journal:  Adv Data Anal Classif       Date:  2021-08-26

4.  Point process models for sequence detection in high-dimensional neural spike trains.

Authors:  Alex H Williams; Anthony Degleris; Yixin Wang; Scott W Linderman
Journal:  Adv Neural Inf Process Syst       Date:  2020-12

5.  Robust Clustering with Subpopulation-specific Deviations.

Authors:  Briana J K Stephenson; Amy H Herring; Andrew Olshan
Journal:  J Am Stat Assoc       Date:  2019-06-19       Impact factor: 5.033

6.  Joint analysis of recurrence and termination: A Bayesian latent class approach.

Authors:  Zhixing Xu; Debajyoti Sinha; Jonathan R Bradley
Journal:  Stat Methods Med Res       Date:  2020-10-13       Impact factor: 3.021

7.  Bayesian Nonparametric Modeling of Categorical Data for Information Fusion and Causal Inference.

Authors:  Sihan Xiong; Yiwei Fu; Asok Ray
Journal:  Entropy (Basel)       Date:  2018-05-23       Impact factor: 2.524

8.  Bayesian inference for continuous-time hidden Markov models with an unknown number of states.

Authors:  Yu Luo; David A Stephens
Journal:  Stat Comput       Date:  2021-08-10       Impact factor: 2.559

9.  Consensus clustering for Bayesian mixture models.

Authors:  Stephen Coleman; Paul D W Kirk; Chris Wallace
Journal:  BMC Bioinformatics       Date:  2022-07-21       Impact factor: 3.307

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.