Literature DB >> 17661232

The importance of data partitioning and the utility of Bayes factors in Bayesian phylogenetics.

Jeremy M Brown1, Alan R Lemmon.   

Abstract

As larger, more complex data sets are being used to infer phylogenies, accuracy of these phylogenies increasingly requires models of evolution that accommodate heterogeneity in the processes of molecular evolution. We investigated the effect of improper data partitioning on phylogenetic accuracy, as well as the type I error rate and sensitivity of Bayes factors, a commonly used method for choosing among different partitioning strategies in Bayesian analyses. We also used Bayes factors to test empirical data for the need to divide data in a manner that has no expected biological meaning. Posterior probability estimates are misleading when an incorrect partitioning strategy is assumed. The error was greatest when the assumed model was underpartitioned. These results suggest that model partitioning is important for large data sets. Bayes factors performed well, giving a 5% type I error rate, which is remarkably consistent with standard frequentist hypothesis tests. The sensitivity of Bayes factors was found to be quite high when the across-class model heterogeneity reflected that of empirical data. These results suggest that Bayes factors represent a robust method of choosing among partitioning strategies. Lastly, results of tests for the inclusion of unexpected divisions in empirical data mirrored the simulation results, although the outcome of such tests is highly dependent on accounting for rate variation among classes. We conclude by discussing other approaches for partitioning data, as well as other applications of Bayes factors.

Mesh:

Year:  2007        PMID: 17661232     DOI: 10.1080/10635150701546249

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  65 in total

1.  Inferring the evolutionary history of Mo-dependent nitrogen fixation from phylogenetic studies of nifK and nifDK.

Authors:  Linda S Hartmann; Susan R Barnum
Journal:  J Mol Evol       Date:  2010-07-17       Impact factor: 2.395

2.  Assessment of substitution model adequacy using frequentist and Bayesian methods.

Authors:  Jennifer Ripplinger; Jack Sullivan
Journal:  Mol Biol Evol       Date:  2010-07-08       Impact factor: 16.240

3.  Source identification in two criminal cases using phylogenetic analysis of HIV-1 DNA sequences.

Authors:  Diane I Scaduto; Jeremy M Brown; Wade C Haaland; Derrick J Zwickl; David M Hillis; Michael L Metzker
Journal:  Proc Natl Acad Sci U S A       Date:  2010-11-15       Impact factor: 11.205

4.  Improving marginal likelihood estimation for Bayesian phylogenetic model selection.

Authors:  Wangang Xie; Paul O Lewis; Yu Fan; Lynn Kuo; Ming-Hui Chen
Journal:  Syst Biol       Date:  2010-12-27       Impact factor: 15.683

5.  Newly discovered sister lineage sheds light on early ant evolution.

Authors:  Christian Rabeling; Jeremy M Brown; Manfred Verhaagh
Journal:  Proc Natl Acad Sci U S A       Date:  2008-09-15       Impact factor: 11.205

Review 6.  Statistics and truth in phylogenomics.

Authors:  Sudhir Kumar; Alan J Filipski; Fabia U Battistuzzi; Sergei L Kosakovsky Pond; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2011-08-26       Impact factor: 16.240

7.  Evolutionary history of mammalian sucking lice (Phthiraptera: Anoplura).

Authors:  Jessica E Light; Vincent S Smith; Julie M Allen; Lance A Durden; David L Reed
Journal:  BMC Evol Biol       Date:  2010-09-22       Impact factor: 3.260

8.  Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships.

Authors:  Björn M von Reumont; Karen Meusemann; Nikolaus U Szucsich; Emiliano Dell'Ampio; Vivek Gowri-Shankar; Daniela Bartel; Sabrina Simon; Harald O Letsch; Roman R Stocsits; Yun-xia Luan; Johann Wolfgang Wägele; Günther Pass; Heike Hadrys; Bernhard Misof
Journal:  BMC Evol Biol       Date:  2009-05-27       Impact factor: 3.260

9.  Data mining approach identifies research priorities and data requirements for resolving the red algal tree of life.

Authors:  Heroen Verbruggen; Christine A Maggs; Gary W Saunders; Line Le Gall; Hwan Su Yoon; Olivier De Clerck
Journal:  BMC Evol Biol       Date:  2010-01-20       Impact factor: 3.260

10.  Snake mitochondrial genomes: phylogenetic relationships and implications of extended taxon sampling for interpretations of mitogenomic evolution.

Authors:  Desirée A Douglas; David J Gower
Journal:  BMC Genomics       Date:  2010-01-07       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.