Literature DB >> 29309694

Information Criteria for Comparing Partition Schemes.

Tae-Kun Seo1, Jeffrey L Thorne2.   

Abstract

When inferring phylogenies, one important decision is whether and how nucleotide substitution parameters should be shared across different subsets or partitions of the data. One sort of partitioning error occurs when heterogeneous subsets are mistakenly lumped together and treated as if they share parameter values. The opposite kind of error is mistakenly treating homogeneous subsets as if they result from distinct sets of parameters. Lumping and splitting errors are not equally bad. Lumping errors can yield parameter estimates that do not accurately reflect any of the subsets that were combined whereas splitting errors yield estimates that did not benefit from sharing information across partitions. Phylogenetic partitioning decisions are often made by applying information criteria such as the Akaike information criterion (AIC). As with other information criteria, the AIC evaluates a model or partition scheme by combining the maximum log-likelihood value with a penalty that depends on the number of parameters being estimated. For the purpose of selecting an optimal partitioning scheme, we derive an adjustment to the AIC that we refer to as the AIC$^{(p)}$ and that is motivated by the idea that splitting errors are less serious than lumping errors. We also introduce a similar adjustment to the Bayesian information criterion (BIC) that we refer to as the BIC$^{(p)}$. Via simulation and empirical data analysis, we contrast AIC and BIC behavior to our suggested adjustments. We discuss these results and also emphasize why we expect the probability of lumping errors with the AIC$^{(p)}$ and the BIC$^{(p)}$ to be relatively robust to model parameterization.

Mesh:

Year:  2018        PMID: 29309694      PMCID: PMC6005138          DOI: 10.1093/sysbio/syx097

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  32 in total

1.  Partitionfinder: combined selection of partitioning schemes and substitution models for phylogenetic analyses.

Authors:  Robert Lanfear; Brett Calcott; Simon Y W Ho; Stephane Guindon
Journal:  Mol Biol Evol       Date:  2012-01-20       Impact factor: 16.240

2.  Estimation of the number of nucleotide substitutions when there are strong transition-transversion and G+C-content biases.

Authors:  K Tamura
Journal:  Mol Biol Evol       Date:  1992-07       Impact factor: 16.240

3.  Calculating bootstrap probabilities of phylogeny using multilocus sequence data.

Authors:  Tae-Kun Seo
Journal:  Mol Biol Evol       Date:  2008-02-14       Impact factor: 16.240

4.  Optimal data partitioning and a test case for ray-finned fishes (Actinopterygii) based on ten nuclear loci.

Authors:  Chenhong Li; Guoqing Lu; Guillermo Ortí
Journal:  Syst Biol       Date:  2008-08       Impact factor: 15.683

5.  Maximum-Likelihood Models for Combined Analyses of Multiple Sequence Data

Authors: 
Journal:  J Mol Evol       Date:  1996-05       Impact factor: 2.395

6.  Estimating the pattern of nucleotide substitution.

Authors:  Z Yang
Journal:  J Mol Evol       Date:  1994-07       Impact factor: 2.395

7.  Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees.

Authors:  K Tamura; M Nei
Journal:  Mol Biol Evol       Date:  1993-05       Impact factor: 16.240

8.  Mitochondrial phylogeny of hedgehogs and monophyly of Eulipotyphla.

Authors:  Masato Nikaido; Ying Cao; Masashi Harada; Norihiro Okada; Masami Hasegawa
Journal:  Mol Phylogenet Evol       Date:  2003-08       Impact factor: 4.286

9.  Bayesian selection of nucleotide substitution models and their site assignments.

Authors:  Chieh-Hsi Wu; Marc A Suchard; Alexei J Drummond
Journal:  Mol Biol Evol       Date:  2012-12-11       Impact factor: 16.240

10.  Selecting optimal partitioning schemes for phylogenomic datasets.

Authors:  Robert Lanfear; Brett Calcott; David Kainer; Christoph Mayer; Alexandros Stamatakis
Journal:  BMC Evol Biol       Date:  2014-04-17       Impact factor: 3.260

View more
  4 in total

1.  Measuring Phylogenetic Information of Incomplete Sequence Data.

Authors:  Tae-Kun Seo; Olivier Gascuel; Jeffrey L Thorne
Journal:  Syst Biol       Date:  2022-04-19       Impact factor: 9.160

2.  Comparing Partitioned Models to Mixture Models: Do Information Criteria Apply?

Authors:  Stephen M Crotty; Barbara R Holland
Journal:  Syst Biol       Date:  2022-10-12       Impact factor: 9.160

3.  Sphenodontian phylogeny and the impact of model choice in Bayesian morphological clock estimates of divergence times and evolutionary rates.

Authors:  Tiago R Simões; Michael W Caldwell; Stephanie E Pierce
Journal:  BMC Biol       Date:  2020-12-07       Impact factor: 7.431

4.  Evidence for sponges as sister to all other animals from partitioned phylogenomics with mixture models and recoding.

Authors:  Anthony K Redmond; Aoife McLysaght
Journal:  Nat Commun       Date:  2021-03-19       Impact factor: 14.919

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.