| Literature DB >> 25870462 |
Edoardo M Airoldi1, Thiago Costa2, Federico Bassetti3, Fabrizio Leisen4, Michele Guindani5.
Abstract
Many popular Bayesian nonparametric priors can be characterized in terms of exchangeable species sampling sequences. However, in some applications, exchangeability may not be appropriate. We introduce a novel and probabilistically coherent family of non-exchangeable species sampling sequences characterized by a tractable predictive probability function with weights driven by a sequence of independent Beta random variables. We compare their theoretical clustering properties with those of the Dirichlet Process and the two parameters Poisson-Dirichlet process. The proposed construction provides a complete characterization of the joint process, differently from existing work. We then propose the use of such process as prior distribution in a hierarchical Bayes modeling framework, and we describe a Markov Chain Monte Carlo sampler for posterior inference. We evaluate the performance of the prior and the robustness of the resulting inference in a simulation study, providing a comparison with popular Dirichlet Processes mixtures and Hidden Markov Models. Finally, we develop an application to the detection of chromosomal aberrations in breast cancer by leveraging array CGH data.Entities:
Keywords: Bayesian non-parametrics; Cancer; Genomics; MCMC; Predictive Probability Functions; Random Partitions; Species Sampling Priors
Year: 2014 PMID: 25870462 PMCID: PMC4392726 DOI: 10.1080/01621459.2014.950735
Source DB: PubMed Journal: J Am Stat Assoc ISSN: 0162-1459 Impact factor: 5.033