Literature DB >> 31011242

Double-Parallel Monte Carlo for Bayesian Analysis of Big Data.

Jingnan Xue1, Faming Liang2.   

Abstract

This paper proposes a simple, practical and efficient MCMC algorithm for Bayesian analysis of big data. The proposed algorithm suggests to divide the big dataset into some smaller subsets and provides a simple method to aggregate the subset posteriors to approximate the full data posterior. To further speed up computation, the proposed algorithm employs the population stochastic approximation Monte Carlo (Pop-SAMC) algorithm, a parallel MCMC algorithm, to simulate from each subset posterior. Since this algorithm consists of two levels of parallel, data parallel and simulation parallel, it is coined as "Double Parallel Monte Carlo". The validity of the proposed algorithm is justified mathematically and numerically.

Entities:  

Keywords:  Divide-and-Combine; Embarrassingly Parallel; MCMC; Pop-SAMC; Subset Posterior Aggregation

Year:  2017        PMID: 31011242      PMCID: PMC6474686          DOI: 10.1007/s11222-017-9791-1

Source DB:  PubMed          Journal:  Stat Comput        ISSN: 0960-3174            Impact factor:   2.559


  2 in total

1.  Extended Stochastic Gradient MCMC for Large-Scale Bayesian Variable Selection.

Authors:  Qifan Song; Yan Sun; Mao Ye; Faming Liang
Journal:  Biometrika       Date:  2020-07-13       Impact factor: 2.445

2.  Bayesian mendelian randomization with study heterogeneity and data partitioning for large studies.

Authors:  Linyi Zou; Hui Guo; Carlo Berzuini
Journal:  BMC Med Res Methodol       Date:  2022-06-03       Impact factor: 4.612

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.