
Online Bayesian Phylogenetic Inference: Theoretical Foundations via Sequential Monte Carlo.

Vu Dinh, Aaron E Darling, Frederick A Matsen IV.

Abstract

Phylogenetics, the inference of evolutionary trees from molecular sequence data such as DNA, is an enterprise that yields valuable evolutionary understanding of many biological systems. Bayesian phylogenetic algorithms, which approximate a posterior distribution on trees, have become a popular if computationally expensive means of doing phylogenetics. Modern data collection technologies are quickly adding new sequences to already substantial databases. With all current techniques for Bayesian phylogenetics, computation must start anew each time a sequence becomes available, making it costly to maintain an up-to-date estimate of a phylogenetic posterior. These considerations highlight the need for an online Bayesian phylogenetic method which can update an existing posterior with new sequences. Here, we provide theoretical results on the consistency and stability of methods for online Bayesian phylogenetic inference based on Sequential Monte Carlo (SMC) and Markov chain Monte Carlo. We first show a consistency result, demonstrating that the method samples from the correct distribution in the limit of a large number of particles. Next, we derive the first reported set of bounds on how phylogenetic likelihood surfaces change when new sequences are added. These bounds enable us to characterize the theoretical performance of sampling algorithms by bounding the effective sample size (ESS) with a given number of particles from below. We show that the ESS is guaranteed to grow linearly as the number of particles in an SMC sampler grows. Surprisingly, this result holds even though the dimensions of the phylogenetic model grow with each new added sequence.
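The effective sample size mentioned in the abstract can be illustrated with a minimal sketch. It assumes the standard Kish estimator, ESS = (Σ w)² / Σ w², computed from unnormalized log importance weights; the abstract does not state the exact definition the paper uses, and the function name `effective_sample_size` is hypothetical.

```python
import math


def effective_sample_size(log_weights):
    """Kish ESS from unnormalized log importance weights (assumed definition)."""
    if not log_weights:
        raise ValueError("need at least one particle")
    # Subtract the maximum before exponentiating for numerical stability;
    # the ESS is invariant to a common rescaling of the weights.
    m = max(log_weights)
    scaled = [math.exp(lw - m) for lw in log_weights]
    total = sum(scaled)
    total_sq = sum(s * s for s in scaled)
    return total * total / total_sq


# Equal weights: every particle contributes, so the ESS equals the particle count.
print(effective_sample_size([0.0, 0.0, 0.0, 0.0]))  # 4.0
```

When one particle carries almost all of the weight, this quantity falls toward 1. A lower bound on the ESS that grows linearly with the number of particles, as the abstract describes, rules out that kind of collapse.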

Year: 2018; PMID: 29244177; PMCID: PMC5920340; DOI: 10.1093/sysbio/syx087

Source DB: PubMed; Journal: Syst Biol; ISSN: 1063-5157; Impact factor: 15.683


