Sampling from Dirichlet process mixture models with unknown concentration parameter: mixing issues in large data implementations.

David I Hastie, Silvia Liverani, Sylvia Richardson.

Abstract

We consider the question of Markov chain Monte Carlo sampling from a general stick-breaking Dirichlet process mixture model, with concentration parameter α. This paper introduces a Gibbs sampling algorithm that combines the slice sampling approach of Walker (Communications in Statistics - Simulation and Computation 36:45-54, 2007) and the retrospective sampling approach of Papaspiliopoulos and Roberts (Biometrika 95(1):169-186, 2008). Our general algorithm is implemented as efficient open source C++ software, available as an R package, and is based on a blocking strategy similar to that suggested by Papaspiliopoulos (A note on posterior sampling from Dirichlet mixture models, 2008) and implemented by Yau et al. (Journal of the Royal Statistical Society, Series B (Statistical Methodology) 73:37-57, 2011). We discuss the difficulties of achieving good mixing in MCMC samplers of this nature in large data sets and investigate sensitivity to initialisation. We also consider the challenges that arise when an additional layer of hierarchy is added such that joint inference is to be made on α. We introduce a new label-switching move and compute the marginal partition posterior to help surmount these difficulties. Our work is illustrated using a profile regression (Molitor et al. Biostatistics 11(3):484-498, 2010) application, where we demonstrate good mixing behaviour for both synthetic and real examples.
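
As a rough illustration of the two ideas the abstract combines, the sketch below draws stick-breaking weights and extends them only as far as a set of slice variables requires. It is a minimal sketch under stated assumptions, not the authors' C++ implementation: the function name `extend_sticks`, the fixed concentration value `alpha`, and the random allocations `z` are all hypothetical and chosen for illustration only.

```python
import numpy as np

def extend_sticks(weights, remaining, alpha, stop_below, rng):
    """Append stick-breaking weights until the unassigned mass is below stop_below."""
    if stop_below <= 0:
        raise ValueError("stop_below must be positive")
    weights = list(weights)
    while remaining >= stop_below:
        v = rng.beta(1.0, alpha)       # V_c ~ Beta(1, alpha)
        weights.append(remaining * v)  # psi_c = V_c * prod_{l<c} (1 - V_l)
        remaining *= 1.0 - v           # mass left for later components
    return np.array(weights), remaining

rng = np.random.default_rng(0)
alpha = 1.0  # assumption: concentration parameter held fixed for illustration

# Start with enough sticks to cover at least half of the total mass.
weights, remaining = extend_sticks([], 1.0, alpha, 0.5, rng)

# Hypothetical allocations of 8 observations to the existing components.
z = rng.integers(0, len(weights), size=8)

# Slice variables: u_i ~ Uniform(0, psi_{z_i}).
u = rng.uniform(0.0, weights[z])

# Retrospective step: add components only until the leftover mass is below
# min(u), so no unrepresented component can have weight above any u_i.
weights, remaining = extend_sticks(weights, remaining, alpha, u.min(), rng)

# For each observation, only components with psi_c > u_i remain candidates.
candidates = [np.flatnonzero(weights > u_i) for u_i in u]
```

The point of the construction is that each observation has a finite candidate set, so the allocation update never needs the infinite sequence of weights.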

Keywords:  Bayesian clustering; Dirichlet process; Mixture model; Profile regression

Year:  2014        PMID: 26321800      PMCID: PMC4550296          DOI: 10.1007/s11222-014-9471-3

Source DB:  PubMed          Journal:  Stat Comput        ISSN: 0960-3174            Impact factor:   2.559


  10 in total

1.  Bayesian semiparametric joint models for functional predictors.

Authors:  Jamie L Bigelow; David B Dunson
Journal:  J Am Stat Assoc       Date:  2012-01-01       Impact factor: 5.033

2.  Bayesian profile regression with an application to the National Survey of Children's Health.

Authors:  John Molitor; Michail Papathomas; Michael Jerrett; Sylvia Richardson
Journal:  Biostatistics       Date:  2010-03-29       Impact factor: 5.899

3.  Bayesian Inference on Changes in Response Densities over Predictor Clusters.

Authors:  David B Dunson; Amy Herring; Anna Maria Siega-Riz
Journal:  J Am Stat Assoc       Date:  2012-01-01       Impact factor: 5.033

4.  Identifying vulnerable populations through an examination of the association between multipollutant profiles and poverty.

Authors:  John Molitor; Jason G Su; Nuoo-Ting Molitor; Virgilio Gómez Rubio; Sylvia Richardson; David Hastie; Rachel Morello-Frosch; Michael Jerrett
Journal:  Environ Sci Technol       Date:  2011-08-19       Impact factor: 9.028

5.  Nonparametric Bayes local partition models for random effects.

Authors:  David B Dunson
Journal:  Biometrika       Date:  2009       Impact factor: 2.445

6.  Exploring data from genetic association studies using Bayesian variable selection and the Dirichlet process: application to searching for gene × gene patterns.

Authors:  Michail Papathomas; John Molitor; Clive Hoggart; David Hastie; Sylvia Richardson
Journal:  Genet Epidemiol       Date:  2012-07-31       Impact factor: 2.135

7.  Bayesian Nonparametric Hidden Markov Models with application to the analysis of copy-number-variation in mammalian genomes.

Authors:  C Yau; O Papaspiliopoulos; G O Roberts; C Holmes
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2011-01-01       Impact factor: 4.488

8.  PReMiuM: An R Package for Profile Regression Mixture Models Using Dirichlet Processes.

Authors:  Silvia Liverani; David I Hastie; Lamiae Azizi; Michail Papathomas; Sylvia Richardson
Journal:  J Stat Softw       Date:  2015-03-20       Impact factor: 6.440

9.  Examining the joint effect of multiple risk factors using exposure risk profiles: lung cancer in nonsmokers.

Authors:  Michail Papathomas; John Molitor; Sylvia Richardson; Elio Riboli; Paolo Vineis
Journal:  Environ Health Perspect       Date:  2010-10-04       Impact factor: 9.031

10.  A semi-parametric approach to estimate risk functions associated with multi-dimensional exposure profiles: application to smoking and lung cancer.

Authors:  David I Hastie; Silvia Liverani; Lamiae Azizi; Sylvia Richardson; Isabelle Stücker
Journal:  BMC Med Res Methodol       Date:  2013-10-23       Impact factor: 4.615

  6 in total

1.  A Bayesian semiparametric latent variable approach to causal mediation.

Authors:  Chanmin Kim; Michael Daniels; Yisheng Li; Kathrin Milbury; Lorenzo Cohen
Journal:  Stat Med       Date:  2017-12-18       Impact factor: 2.373

2.  How Short Is Long Enough? Modeling Temporal Aspects of Human Mobility Behavior Using Mobile Phone Data.

Authors:  Eun-Hye Yoo
Journal:  Ann Am Assoc Geogr       Date:  2019-05-20

3.  Pattern learning reveals brain asymmetry to be linked to socioeconomic status.

Authors:  Timm B Poeppl; Emile Dimas; Katrin Sakreida; Julius M Kernbach; Ross D Markello; Oliver Schöffski; Alain Dagher; Philipp Koellinger; Gideon Nave; Martha J Farah; Bratislav Mišić; Danilo Bzdok
Journal:  Cereb Cortex Commun       Date:  2022-05-20

4.  PReMiuM: An R Package for Profile Regression Mixture Models Using Dirichlet Processes.

Authors:  Silvia Liverani; David I Hastie; Lamiae Azizi; Michail Papathomas; Sylvia Richardson
Journal:  J Stat Softw       Date:  2015-03-20       Impact factor: 6.440

5.  Optimal Bayesian estimators for latent variable cluster models.

Authors:  Riccardo Rastelli; Nial Friel
Journal:  Stat Comput       Date:  2017-10-31       Impact factor: 2.559

6.  Bayesian Profile Regression to Deal With Multiple Highly Correlated Exposures and a Censored Survival Outcome. First Application in Ionizing Radiation Epidemiology.

Authors:  Marion Belloni; Olivier Laurent; Chantal Guihenneuc; Sophie Ancelet
Journal:  Front Public Health       Date:  2020-10-27