Literature DB >> 31985810

Using Parsimony-Guided Tree Proposals to Accelerate Convergence in Bayesian Phylogenetic Inference.

Chi Zhang1,2, John P Huelsenbeck3, Fredrik Ronquist4.   

Abstract

Sampling across tree space is one of the major challenges in Bayesian phylogenetic inference using Markov chain Monte Carlo (MCMC) algorithms. Standard MCMC tree moves consider small random perturbations of the topology, and select from candidate trees at random or based on the distance between the old and new topologies. MCMC algorithms using such moves tend to get trapped in tree space, making them slow in finding the globally most probable trees (known as "convergence") and in estimating the correct proportions of the different types of them (known as "mixing"). Here, we introduce a new class of moves, which propose trees based on their parsimony scores. The proposal distribution derived from the parsimony scores is a quickly computable albeit rough approximation of the conditional posterior distribution over candidate trees. We demonstrate with simulations that parsimony-guided moves correctly sample the uniform distribution of topologies from the prior. We then evaluate their performance against standard moves using six challenging empirical data sets, for which we were able to obtain accurate reference estimates of the posterior using long MCMC runs, a mix of topology proposals, and Metropolis coupling. On these data sets, ranging in size from 357 to 934 taxa and from 1740 to 5681 sites, we find that single chains using parsimony-guided moves usually converge an order of magnitude faster than chains using standard moves. They also exhibit better mixing, that is, they cover the most probable trees more quickly. Our results show that tree moves based on quick and dirty estimates of the posterior probability can significantly outperform standard moves. Future research will have to show to what extent the performance of such moves can be improved further by finding better ways of approximating the posterior probability, taking the trade-off between accuracy and speed into account. [Bayesian phylogenetic inference; MCMC; parsimony; tree proposal.].
© The Author(s) 2020. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

Entities:  

Mesh:

Year:  2020        PMID: 31985810      PMCID: PMC7440752          DOI: 10.1093/sysbio/syaa002

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  31 in total

1.  MRBAYES: Bayesian inference of phylogenetic trees.

Authors:  J P Huelsenbeck; F Ronquist
Journal:  Bioinformatics       Date:  2001-08       Impact factor: 6.937

2.  Robustness of compound Dirichlet priors for Bayesian inference of branch lengths.

Authors:  Chi Zhang; Bruce Rannala; Ziheng Yang
Journal:  Syst Biol       Date:  2012-02-10       Impact factor: 15.683

3.  Hastings ratio of the LOCAL proposal used in Bayesian phylogenetics.

Authors:  Mark T Holder; Paul O Lewis; David L Swofford; Bret Larget
Journal:  Syst Biol       Date:  2005-12       Impact factor: 15.683

4.  Efficiency of Markov chain Monte Carlo tree proposals in Bayesian phylogenetics.

Authors:  Clemens Lakner; Paul van der Mark; John P Huelsenbeck; Bret Larget; Fredrik Ronquist
Journal:  Syst Biol       Date:  2008-02       Impact factor: 15.683

5.  A Bayesian perspective on a non-parsimonious parsimony model.

Authors:  John P Huelsenbeck; Cécile Ané; Bret Larget; Fredrik Ronquist
Journal:  Syst Biol       Date:  2008-06       Impact factor: 15.683

6.  Guided tree topology proposals for Bayesian phylogenetic inference.

Authors:  Sebastian Höhna; Alexei J Drummond
Journal:  Syst Biol       Date:  2011-08-09       Impact factor: 15.683

7.  The estimation of tree posterior probabilities using conditional clade probability distributions.

Authors:  Bret Larget
Journal:  Syst Biol       Date:  2013-03-11       Impact factor: 15.683

8.  Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods.

Authors:  Z Yang
Journal:  J Mol Evol       Date:  1994-09       Impact factor: 2.395

9.  A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.

Authors:  M Kimura
Journal:  J Mol Evol       Date:  1980-12       Impact factor: 2.395

10.  Quantifying MCMC exploration of phylogenetic tree space.

Authors:  Chris Whidden; Frederick A Matsen
Journal:  Syst Biol       Date:  2015-01-27       Impact factor: 15.683

View more
  5 in total

1.  StarBeast3: Adaptive Parallelized Bayesian Inference under the Multispecies Coalescent.

Authors:  Jordan Douglas; Cinthy L Jiménez-Silva; Remco Bouckaert
Journal:  Syst Biol       Date:  2022-06-16       Impact factor: 9.160

2.  Practical Speedup of Bayesian Inference of Species Phylogenies by Restricting the Space of Gene Trees.

Authors:  Yaxuan Wang; Huw A Ogilvie; Luay Nakhleh
Journal:  Mol Biol Evol       Date:  2020-06-01       Impact factor: 16.240

3.  Adaptive dating and fast proposals: Revisiting the phylogenetic relaxed clock model.

Authors:  Jordan Douglas; Rong Zhang; Remco Bouckaert
Journal:  PLoS Comput Biol       Date:  2021-02-02       Impact factor: 4.475

Review 4.  Recent progress on methods for estimating and updating large phylogenies.

Authors:  Paul Zaharias; Tandy Warnow
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2022-08-22       Impact factor: 6.671

5.  The origin of Rhinocerotoidea and phylogeny of Ceratomorpha (Mammalia, Perissodactyla).

Authors:  Bin Bai; Jin Meng; Chi Zhang; Yan-Xin Gong; Yuan-Qing Wang
Journal:  Commun Biol       Date:  2020-09-14
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.