Literature DB >> 31688943

On the Use of Information Criteria for Model Selection in Phylogenetics.

Edward Susko1, Andrew J Roger2.   

Abstract

The information criteria Akaike information criterion (AIC), AICc, and Bayesian information criterion (BIC) are widely used for model selection in phylogenetics, however, their theoretical justification and performance have not been carefully examined in this setting. Here, we investigate these methods under simple and complex phylogenetic models. We show that AIC can give a biased estimate of its intended target, the expected predictive log likelihood (EPLnL) or, equivalently, expected Kullback-Leibler divergence between the estimated model and the true distribution for the data. Reasons for bias include commonly occurring issues such as small edge-lengths or, in mixture models, small weights. The use of partitioned models is another issue that can cause problems with information criteria. We show that for partitioned models, a different BIC correction is required for it to be a valid approximation to a Bayes factor. The commonly used AICc correction is not clearly defined in partitioned models and can actually create a substantial bias when the number of parameters gets large as is the case with larger trees and partitioned models. Bias-corrected cross-validation corrections are shown to provide better approximations to EPLnL than AIC. We also illustrate how EPLnL, the estimation target of AIC, can sometimes favor an incorrect model and give reasons for why selection of incorrectly under-partitioned models might be desirable in partitioned model settings.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  Akaike information criteria; Bayesian information criteria; cross-validation; mixture model; model selection; partition model; phylogenetics

Mesh:

Year:  2020        PMID: 31688943     DOI: 10.1093/molbev/msz228

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  5 in total

1.  Measuring Phylogenetic Information of Incomplete Sequence Data.

Authors:  Tae-Kun Seo; Olivier Gascuel; Jeffrey L Thorne
Journal:  Syst Biol       Date:  2022-04-19       Impact factor: 9.160

2.  Molecular detection of Coxiella-like endosymbionts and absence of Coxiella burnetii in Amblyomma mixtum from Veracruz, Mexico.

Authors:  Estefanía Grostieta; Héctor M Zazueta-Islas; Timoteo Cruz-Valdez; Gerardo G Ballados-González; Lucía Álvarez-Castillo; Sandra M García-Esparza; Anabel Cruz-Romero; Dora Romero-Salas; Mariel Aguilar-Domínguez; Ingeborg Becker; Sokani Sánchez-Montes
Journal:  Exp Appl Acarol       Date:  2022-10-16       Impact factor: 2.380

3.  Comparing Partitioned Models to Mixture Models: Do Information Criteria Apply?

Authors:  Stephen M Crotty; Barbara R Holland
Journal:  Syst Biol       Date:  2022-10-12       Impact factor: 9.160

4.  Positive Selection Drives the Adaptive Evolution of Mitochondrial Antiviral Signaling (MAVS) Proteins-Mediating Innate Immunity in Mammals.

Authors:  Hafiz Ishfaq Ahmad; Gulnaz Afzal; Muhammad Nouman Iqbal; Muhammad Arslan Iqbal; Borhan Shokrollahi; Muhammad Khalid Mansoor; Jinping Chen
Journal:  Front Vet Sci       Date:  2022-01-31

5.  Identification and Validation of a Ferroptosis-Related Long Non-coding RNA Signature for Predicting the Outcome of Lung Adenocarcinoma.

Authors:  Zhiyuan Zheng; Qian Zhang; Wei Wu; Yan Xue; Shuhan Liu; Qiaoqian Chen; Donghong Lin
Journal:  Front Genet       Date:  2021-07-22       Impact factor: 4.599

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.