Literature DB >> 18852106

Empirical evaluation of a prior for Bayesian phylogenetic inference.

Ziheng Yang1.   

Abstract

The Bayesian method of phylogenetic inference often produces high posterior probabilities (PPs) for trees or clades, even when the trees are clearly incorrect. The problem appears to be mainly due to large sizes of molecular datasets and to the large-sample properties of Bayesian model selection and its sensitivity to the prior when several of the models under comparison are nearly equally correct (or nearly equally wrong) and are of the same dimension. A previous suggestion to alleviate the problem is to let the internal branch lengths in the tree become increasingly small in the prior with the increase in the data size so that the bifurcating trees are increasingly star-like. In particular, if the internal branch lengths are assigned the exponential prior, the prior mean mu0 should approach zero faster than 1/square root n but more slowly than 1/n, where n is the sequence length. This paper examines the usefulness of this data size-dependent prior using a dataset of the mitochondrial protein-coding genes from the baleen whales, with the prior mean fixed at mu0=0.1n(-2/3). In this dataset, phylogeny reconstruction is sensitive to the assumed evolutionary model, species sampling and the type of data (DNA or protein sequences), but Bayesian inference using the default prior attaches high PPs for conflicting phylogenetic relationships. The data size-dependent prior alleviates the problem to some extent, giving weaker support for unstable relationships. This prior may be useful in reducing apparent conflicts in the results of Bayesian analysis or in making the method less sensitive to model violations.

Mesh:

Year:  2008        PMID: 18852106      PMCID: PMC2607411          DOI: 10.1098/rstb.2008.0164

Source DB:  PubMed          Journal:  Philos Trans R Soc Lond B Biol Sci        ISSN: 0962-8436            Impact factor:   6.237


  36 in total

1.  Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability.

Authors:  Christophe J Douady; Frédéric Delsuc; Yan Boucher; W Ford Doolittle; Emmanuel J P Douzery
Journal:  Mol Biol Evol       Date:  2003-02       Impact factor: 16.240

2.  Model misspecification and probabilistic tests of topology: evidence from empirical data sets.

Authors:  Thomas R Buckley
Journal:  Syst Biol       Date:  2002-06       Impact factor: 15.683

3.  MrBayes 3: Bayesian phylogenetic inference under mixed models.

Authors:  Fredrik Ronquist; John P Huelsenbeck
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

4.  Bayes estimation of species divergence times and ancestral population sizes using DNA sequences from multiple loci.

Authors:  Bruce Rannala; Ziheng Yang
Journal:  Genetics       Date:  2003-08       Impact factor: 4.562

5.  Hastings ratio of the LOCAL proposal used in Bayesian phylogenetics.

Authors:  Mark T Holder; Paul O Lewis; David L Swofford; Bret Larget
Journal:  Syst Biol       Date:  2005-12       Impact factor: 15.683

6.  Fair-balance paradox, star-tree paradox, and Bayesian phylogenetics.

Authors:  Ziheng Yang
Journal:  Mol Biol Evol       Date:  2007-05-07       Impact factor: 16.240

7.  Maximum-Likelihood Models for Combined Analyses of Multiple Sequence Data

Authors: 
Journal:  J Mol Evol       Date:  1996-05       Impact factor: 2.395

8.  Probability distribution of molecular evolutionary trees: a new method of phylogenetic inference.

Authors:  B Rannala; Z Yang
Journal:  J Mol Evol       Date:  1996-09       Impact factor: 2.395

9.  A space-time process model for the evolution of DNA sequences.

Authors:  Z Yang
Journal:  Genetics       Date:  1995-02       Impact factor: 4.562

10.  A codon-based model of nucleotide substitution for protein-coding DNA sequences.

Authors:  N Goldman; Z Yang
Journal:  Mol Biol Evol       Date:  1994-09       Impact factor: 16.240

View more
  4 in total

Review 1.  Statistics and truth in phylogenomics.

Authors:  Sudhir Kumar; Alan J Filipski; Fabia U Battistuzzi; Sergei L Kosakovsky Pond; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2011-08-26       Impact factor: 16.240

2.  Bayesian selection of misspecified models is overconfident and may cause spurious posterior probabilities for phylogenetic trees.

Authors:  Ziheng Yang; Tianqi Zhu
Journal:  Proc Natl Acad Sci U S A       Date:  2018-02-05       Impact factor: 11.205

Review 3.  Understanding phylogenetic incongruence: lessons from phyllostomid bats.

Authors:  Liliana M Dávalos; Andrea L Cirranello; Jonathan H Geisler; Nancy B Simmons
Journal:  Biol Rev Camb Philos Soc       Date:  2012-08-14

4.  Introduction. Statistical and computational challenges in molecular phylogenetics and evolution.

Authors:  Nick Goldman; Ziheng Yang
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-12-27       Impact factor: 6.237

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.