Literature DB >> 26681992

Reuse, Recycle, Reweigh: Combating Influenza through Efficient Sequential Bayesian Computation for Massive Data.

Jennifer A Tom1, Janet S Sinsheimer2, Marc A Suchard2.   

Abstract

Massive datasets in the gigabyte and terabyte range combined with the availability of increasingly sophisticated statistical tools yield analyses at the boundary of what is computationally feasible. Compromising in the face of this computational burden by partitioning the dataset into more tractable sizes results in stratified analyses, removed from the context that justified the initial data collection. In a Bayesian framework, these stratified analyses generate intermediate realizations, often compared using point estimates that fail to account for the variability within and correlation between the distributions these realizations approximate. However, although the initial concession to stratify generally precludes the more sensible analysis using a single joint hierarchical model, we can circumvent this outcome and capitalize on the intermediate realizations by extending the dynamic iterative reweighting MCMC algorithm. In doing so, we reuse the available realizations by reweighting them with importance weights, recycling them into a now tractable joint hierarchical model. We apply this technique to intermediate realizations generated from stratified analyses of 687 influenza A genomes spanning 13 years allowing us to revisit hypotheses regarding the evolutionary history of influenza within a hierarchical statistical framework.

Entities:  

Keywords:  Gibbs variable selection; Markov chain Monte Carlo; hierarchical Bayesian model; importance sampling; influenza A; massive data

Year:  2010        PMID: 26681992      PMCID: PMC4679157          DOI: 10.1214/10-AOAS349

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  26 in total

1.  Model parameterization, prior distributions, and the general time-reversible model in Bayesian phylogenetics.

Authors:  Derrick Zwickl; Mark Holder
Journal:  Syst Biol       Date:  2004-12       Impact factor: 15.683

2.  Bayesian coalescent inference of past population dynamics from molecular sequences.

Authors:  A J Drummond; A Rambaut; B Shapiro; O G Pybus
Journal:  Mol Biol Evol       Date:  2005-02-09       Impact factor: 16.240

3.  Evolution in Mendelian Populations.

Authors:  S Wright
Journal:  Genetics       Date:  1931-03       Impact factor: 4.562

4.  Improving phylogenetic analyses by incorporating additional information from genetic sequence databases.

Authors:  Li-Jung Liang; Robert E Weiss; Benjamin Redelings; Marc A Suchard
Journal:  Bioinformatics       Date:  2009-08-06       Impact factor: 6.937

5.  A Sequential Monte Carlo Method for Bayesian Analysis of Massive Datasets.

Authors:  Greg Ridgeway; David Madigan
Journal:  Data Min Knowl Discov       Date:  2003-07-01       Impact factor: 3.670

6.  Reuse, Recycle, Reweigh: Combating Influenza through Efficient Sequential Bayesian Computation for Massive Data.

Authors:  Jennifer A Tom; Janet S Sinsheimer; Marc A Suchard
Journal:  Ann Appl Stat       Date:  2010       Impact factor: 2.083

7.  Origins and evolutionary genomics of the 2009 swine-origin H1N1 influenza A epidemic.

Authors:  Gavin J D Smith; Dhanasekaran Vijaykrishna; Justin Bahl; Samantha J Lycett; Michael Worobey; Oliver G Pybus; Siu Kit Ma; Chung Lam Cheung; Jayna Raghwani; Samir Bhatt; J S Malik Peiris; Yi Guan; Andrew Rambaut
Journal:  Nature       Date:  2009-06-25       Impact factor: 49.962

8.  The influenza virus enigma.

Authors:  Rachelle Salomon; Robert G Webster
Journal:  Cell       Date:  2009-02-06       Impact factor: 41.582

9.  Hierarchical phylogenetic models for analyzing multipartite sequence data.

Authors:  Marc A Suchard; Christina M R Kitchen; Janet S Sinsheimer; Robert E Weiss
Journal:  Syst Biol       Date:  2003-10       Impact factor: 15.683

10.  Whole-genome analysis of human influenza A virus reveals multiple persistent lineages and reassortment among recent H3N2 viruses.

Authors:  Edward C Holmes; Elodie Ghedin; Naomi Miller; Jill Taylor; Yiming Bao; Kirsten St George; Bryan T Grenfell; Steven L Salzberg; Claire M Fraser; David J Lipman; Jeffery K Taubenberger
Journal:  PLoS Biol       Date:  2005-07-26       Impact factor: 8.029

View more
  7 in total

1.  Does history repeat itself? Wavelets and the phylodynamics of influenza A.

Authors:  Jennifer A Tom; Janet S Sinsheimer; Marc A Suchard
Journal:  Mol Biol Evol       Date:  2011-12-08       Impact factor: 16.240

2.  Joining and splitting models with Markov melding.

Authors:  Robert J B Goudie; Anne M Presanis; David Lunn; Daniela De Angelis; Lorenz Wernisch
Journal:  Bayesian Anal       Date:  2019-01       Impact factor: 3.728

3.  Reuse, Recycle, Reweigh: Combating Influenza through Efficient Sequential Bayesian Computation for Massive Data.

Authors:  Jennifer A Tom; Janet S Sinsheimer; Marc A Suchard
Journal:  Ann Appl Stat       Date:  2010       Impact factor: 2.083

4.  Phylogenetic inference via sequential Monte Carlo.

Authors:  Alexandre Bouchard-Côté; Sriram Sankararaman; Michael I Jordan
Journal:  Syst Biol       Date:  2012-01-04       Impact factor: 15.683

5.  Graph hierarchies for phylogeography.

Authors:  Gabriela B Cybis; Janet S Sinsheimer; Philippe Lemey; Marc A Suchard
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2013-02-04       Impact factor: 6.237

6.  Four key challenges in infectious disease modelling using data from multiple sources.

Authors:  Daniela De Angelis; Anne M Presanis; Paul J Birrell; Gianpaolo Scalia Tomba; Thomas House
Journal:  Epidemics       Date:  2014-09-28       Impact factor: 4.396

7.  A numerically stable algorithm for integrating Bayesian models using Markov melding.

Authors:  Andrew A Manderson; Robert J B Goudie
Journal:  Stat Comput       Date:  2022-02-18       Impact factor: 2.324

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.