Literature DB >> 19851702

Measuring fit of sequence data to phylogenetic model: gain of power using marginal tests.

Peter J Waddell1, Rissa Ota, David Penny.   

Abstract

Testing fit of data to model is fundamentally important to any science, but publications in the field of phylogenetics rarely do this. Such analyses discard fundamental aspects of science as prescribed by Karl Popper. Indeed, not without cause, Popper (Unended quest: an intellectual autobiography. Fontana, London, 1976) once argued that evolutionary biology was unscientific as its hypotheses were untestable. Here we trace developments in assessing fit from Penny et al. (Nature 297:197-200, 1982) to the present. We compare the general log-likelihood ratio (the G or G (2) statistic) statistic between the evolutionary tree model and the multinomial model with that of marginalized tests applied to an alignment (using placental mammal coding sequence data). It is seen that the most general test does not reject the fit of data to model (P approximately 0.5), but the marginalized tests do. Tests on pairwise frequency (F) matrices, strongly (P < 0.001) reject the most general phylogenetic (GTR) models commonly in use. It is also clear (P < 0.01) that the sequences are not stationary in their nucleotide composition. Deviations from stationarity and homogeneity seem to be unevenly distributed amongst taxa; not necessarily those expected from examining other regions of the genome. By marginalizing the 4( t ) patterns of the i.i.d. model to observed and expected parsimony counts, that is, from constant sites, to singletons, to parsimony informative characters of a minimum possible length, then the likelihood ratio test regains power, and it too rejects the evolutionary model with P << 0.001. Given such behavior over relatively recent evolutionary time, readers in general should maintain a healthy skepticism of results, as the scale of the systematic errors in published trees may really be far larger than the analytical methods (e.g., bootstrap) report.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19851702     DOI: 10.1007/s00239-009-9268-8

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


  28 in total

1.  Cluster inference methods and graphical models evaluated on NCI60 microarray gene expression data.

Authors:  P J Waddell; H Kishino
Journal:  Genome Inform Ser Workshop Genome Inform       Date:  2000

2.  Heterogeneity in the substitution process of amino acid sites of proteins coded for by mitochondrial DNA.

Authors:  J H Reeves
Journal:  J Mol Evol       Date:  1992-07       Impact factor: 2.395

3.  Modeling compositional heterogeneity.

Authors:  Peter G Foster
Journal:  Syst Biol       Date:  2004-06       Impact factor: 15.683

4.  Statistical tests of models of DNA substitution.

Authors:  N Goldman
Journal:  J Mol Evol       Date:  1993-02       Impact factor: 2.395

5.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

6.  Resolution of the early placental mammal radiation using Bayesian phylogenetics.

Authors:  W J Murphy; E Eizirik; S J O'Brien; O Madsen; M Scally; C J Douady; E Teeling; O A Ryder; M J Stanhope; W W de Jong; M S Springer
Journal:  Science       Date:  2001-12-14       Impact factor: 47.728

7.  General time-reversible distances with unequal rates across sites: mixing gamma and inverse Gaussian distributions with invariant sites.

Authors:  P J Waddell; M A Steel
Journal:  Mol Phylogenet Evol       Date:  1997-12       Impact factor: 4.286

8.  Pika and vole mitochondrial genomes increase support for both rodent monophyly and glires.

Authors:  Yu-Hsin Lin; Peter J Waddell; David Penny
Journal:  Gene       Date:  2002-07-10       Impact factor: 3.688

9.  Evolutionary history of LINE-1 in the major clades of placental mammals.

Authors:  Paul D Waters; Gauthier Dobigny; Peter J Waddell; Terence J Robinson
Journal:  PLoS One       Date:  2007-01-17       Impact factor: 3.240

10.  Retroposed elements as archives for the evolutionary history of placental mammals.

Authors:  Jan Ole Kriegs; Gennady Churakov; Martin Kiefmann; Ursula Jordan; Jürgen Brosius; Jürgen Schmitz
Journal:  PLoS Biol       Date:  2006-03-14       Impact factor: 8.029

View more
  5 in total

1.  Assessment of substitution model adequacy using frequentist and Bayesian methods.

Authors:  Jennifer Ripplinger; Jack Sullivan
Journal:  Mol Biol Evol       Date:  2010-07-08       Impact factor: 16.240

2.  The Multispecies Coalescent Model Outperforms Concatenation Across Diverse Phylogenomic Data Sets.

Authors:  Xiaodong Jiang; Scott V Edwards; Liang Liu
Journal:  Syst Biol       Date:  2020-07-01       Impact factor: 15.683

3.  Patterns of care in testicular torsion: influence of hospital transfer on testicular outcomes.

Authors:  Jenny H Yiee; Lynne Chang; Alan Kaplan; Lorna Kwan; Paul J Chung; Mark S Litwin
Journal:  J Pediatr Urol       Date:  2013-07-27       Impact factor: 1.830

4.  Phylogenomic analysis resolves the interordinal relationships and rapid diversification of the laurasiatherian mammals.

Authors:  Xuming Zhou; Shixia Xu; Junxiao Xu; Bingyao Chen; Kaiya Zhou; Guang Yang
Journal:  Syst Biol       Date:  2011-09-07       Impact factor: 15.683

5.  An Algorithm for Calculating the Probability of Classes of Data Patterns on a Genealogy.

Authors:  Jordan M Koch; Mark T Holder
Journal:  PLoS Curr       Date:  2012-12-14
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.