Literature DB >> 31504998

19 Dubious Ways to Compute the Marginal Likelihood of a Phylogenetic Tree Topology.

Mathieu Fourment1, Andrew F Magee2, Chris Whidden3, Arman Bilge3, Frederick A Matsen3, Vladimir N Minin4.   

Abstract

The marginal likelihood of a model is a key quantity for assessing the evidence provided by the data in support of a model. The marginal likelihood is the normalizing constant for the posterior density, obtained by integrating the product of the likelihood and the prior with respect to model parameters. Thus, the computational burden of computing the marginal likelihood scales with the dimension of the parameter space. In phylogenetics, where we work with tree topologies that are high-dimensional models, standard approaches to computing marginal likelihoods are very slow. Here, we study methods to quickly compute the marginal likelihood of a single fixed tree topology. We benchmark the speed and accuracy of 19 different methods to compute the marginal likelihood of phylogenetic topologies on a suite of real data sets under the JC69 model. These methods include several new ones that we develop explicitly to solve this problem, as well as existing algorithms that we apply to phylogenetic models for the first time. Altogether, our results show that the accuracy of these methods varies widely, and that accuracy does not necessarily correlate with computational burden. Our newly developed methods are orders of magnitude faster than standard approaches, and in some cases, their accuracy rivals the best established estimators.
© The Author(s) 2019. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Keywords:  Bayesian inference; evidence; importance sampling; model selection; variational Bayes

Mesh:

Year:  2020        PMID: 31504998      PMCID: PMC7571498          DOI: 10.1093/sysbio/syz046

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  26 in total

1.  Efficient approximations for learning phylogenetic HMM models from data.

Authors:  Vladimir Jojic; Nebojsa Jojic; Chris Meek; Dan Geiger; Adam Siepel; David Haussler; D Heckerman
Journal:  Bioinformatics       Date:  2004-08-04       Impact factor: 6.937

2.  Robustness of compound Dirichlet priors for Bayesian inference of branch lengths.

Authors:  Chi Zhang; Bruce Rannala; Ziheng Yang
Journal:  Syst Biol       Date:  2012-02-10       Impact factor: 15.683

3.  Genealogical Working Distributions for Bayesian Model Testing with Phylogenetic Uncertainty.

Authors:  Guy Baele; Philippe Lemey; Marc A Suchard
Journal:  Syst Biol       Date:  2015-11-01       Impact factor: 15.683

4.  Bayesian estimation of divergence times from large sequence alignments.

Authors:  Stéphane Guindon
Journal:  Mol Biol Evol       Date:  2010-03-01       Impact factor: 16.240

5.  Model Selection and Parameter Inference in Phylogenetics Using Nested Sampling.

Authors:  Patricio Maturana Russel; Brendon J Brewer; Steffen Klaere; Remco R Bouckaert
Journal:  Syst Biol       Date:  2019-03-01       Impact factor: 15.683

6.  The estimation of tree posterior probabilities using conditional clade probability distributions.

Authors:  Bret Larget
Journal:  Syst Biol       Date:  2013-03-11       Impact factor: 15.683

7.  Estimating Bayesian Phylogenetic Information Content.

Authors:  Paul O Lewis; Ming-Hui Chen; Lynn Kuo; Louise A Lewis; Karolina Fučíková; Suman Neupane; Yu-Bo Wang; Daoyuan Shi
Journal:  Syst Biol       Date:  2016-05-06       Impact factor: 15.683

8.  Improving the accuracy of demographic and molecular clock model comparison while accommodating phylogenetic uncertainty.

Authors:  Guy Baele; Philippe Lemey; Trevor Bedford; Andrew Rambaut; Marc A Suchard; Alexander V Alekseyenko
Journal:  Mol Biol Evol       Date:  2012-03-07       Impact factor: 16.240

9.  A tutorial on bridge sampling.

Authors:  Quentin F Gronau; Alexandra Sarafoglou; Dora Matzke; Alexander Ly; Udo Boehm; Maarten Marsman; David S Leslie; Jonathan J Forster; Eric-Jan Wagenmakers; Helen Steingroever
Journal:  J Math Psychol       Date:  2017-12       Impact factor: 2.223

10.  Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10.

Authors:  Marc A Suchard; Philippe Lemey; Guy Baele; Daniel L Ayres; Alexei J Drummond; Andrew Rambaut
Journal:  Virus Evol       Date:  2018-06-08
View more
  7 in total

1.  Testing Phylogenetic Stability with Variable Taxon Sampling.

Authors:  Christopher Lowell Edward Powell; Fabia Ursula Battistuzzi
Journal:  Methods Mol Biol       Date:  2022

2.  QuCo: quartet-based co-estimation of species trees and gene trees.

Authors:  Maryam Rabiee; Siavash Mirarab
Journal:  Bioinformatics       Date:  2022-06-24       Impact factor: 6.931

3.  Dynamic causal modelling of COVID-19 and its mitigations.

Authors:  Karl J Friston; Guillaume Flandin; Adeel Razi
Journal:  Sci Rep       Date:  2022-07-20       Impact factor: 4.996

4.  Systematic Exploration of the High Likelihood Set of Phylogenetic Tree Topologies.

Authors:  Chris Whidden; Brian C Claywell; Thayer Fisher; Andrew F Magee; Mathieu Fourment; Frederick A Matsen
Journal:  Syst Biol       Date:  2020-03-01       Impact factor: 15.683

5.  The Emergence of SARS-CoV-2 Variants of Concern Is Driven by Acceleration of the Substitution Rate.

Authors:  John H Tay; Ashleigh F Porter; Wytamma Wirth; Sebastian Duchene
Journal:  Mol Biol Evol       Date:  2022-02-03       Impact factor: 16.240

6.  Bayesian model averaging for nonparametric discontinuity design.

Authors:  Max Hinne; David Leeftink; Marcel A J van Gerven; Luca Ambrogioni
Journal:  PLoS One       Date:  2022-06-30       Impact factor: 3.752

7.  Variational Phylodynamic Inference Using Pandemic-scale Data.

Authors:  Caleb Ki; Jonathan Terhorst
Journal:  Mol Biol Evol       Date:  2022-08-03       Impact factor: 8.800

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.