Literature DB >> 27633354

Who Let the CAT Out of the Bag? Accurately Dealing with Substitutional Heterogeneity in Phylogenomic Analyses.

Nathan V Whelan1, Kenneth M Halanych1.   

Abstract

As phylogenetic datasets have increased in size, site-heterogeneous substitution models such as CAT-F81 and CAT-GTR have been advocated in favor of other models because they purportedly suppress long-branch attraction (LBA). These models are two of the most commonly used models in phylogenomics, and they have been applied to a variety of taxa, ranging from Drosophila to land plants. However, many arguments in favor of CAT models have been based on tenuous assumptions about the true phylogeny, rather than rigorous testing with known trees via simulation. Moreover, CAT models have not been compared to other approaches for handling substitutional heterogeneity such as data partitioning with site-homogeneous substitution models. We simulated amino acid sequence datasets with substitutional heterogeneity on a variety of tree shapes including those susceptible to LBA. Data were analyzed with both CAT models and partitioning to explore model performance; in total over 670,000 CPU hours were used, of which over 97% was spent running analyses with CAT models. In many cases, all models recovered branching patterns that were identical to the known tree. However, CAT-F81 consistently performed worse than other models in inferring the correct branching patterns, and both CAT models often overestimated substitutional heterogeneity. Additionally, reanalysis of two empirical metazoan datasets supports the notion that CAT-F81 tends to recover less accurate trees than data partitioning and CAT-GTR. Given these results, we conclude that partitioning and CAT-GTR perform similarly in recovering accurate branching patterns. However, computation time can be orders of magnitude less for data partitioning, with commonly used implementations of CAT-GTR often failing to reach completion in a reasonable time frame (i.e., for Bayesian analyses to converge). Practices such as removing constant sites and parsimony uninformative characters, or using CAT-F81 when CAT-GTR is deemed too computationally expensive, cannot be logically justified. Given clear problems with CAT-F81, phylogenies previously inferred with this model should be reassessed. [Data partitioning; phylogenomics, simulation, site-heterogeneity, substitution models.].
© The Author(s) 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2017        PMID: 27633354     DOI: 10.1093/sysbio/syw084

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  19 in total

1.  Tetraconatan phylogeny with special focus on Malacostraca and Branchiopoda: highlighting the strength of taxon-specific matrices in phylogenomics.

Authors:  Martin Schwentner; Stefan Richter; D Christopher Rogers; Gonzalo Giribet
Journal:  Proc Biol Sci       Date:  2018-08-22       Impact factor: 5.349

2.  mtProtEvol: the resource presenting molecular evolution analysis of proteins involved in the function of Vertebrate mitochondria.

Authors:  Anastasia A Kuzminkova; Anastasia D Sokol; Kristina E Ushakova; Konstantin Yu Popadin; Konstantin V Gunbin
Journal:  BMC Evol Biol       Date:  2019-02-26       Impact factor: 3.260

3.  Model Choice, Missing Data, and Taxon Sampling Impact Phylogenomic Inference of Deep Basidiomycota Relationships.

Authors:  Arun N Prasanna; Daniel Gerber; Teeratas Kijpornyongpan; M Catherine Aime; Vinson P Doyle; Laszlo G Nagy
Journal:  Syst Biol       Date:  2020-01-01       Impact factor: 15.683

4.  Different phylogenomic methods support monophyly of enigmatic 'Mesozoa' (Dicyemida + Orthonectida, Lophotrochozoa).

Authors:  Marie Drábková; Kevin M Kocot; Kenneth M Halanych; Todd H Oakley; Leonid L Moroz; Johanna T Cannon; Armand Kuris; Ana Elisa Garcia-Vedrenne; M Sabrina Pankey; Emily A Ellis; Rebecca Varney; Jan Štefka; Jan Zrzavý
Journal:  Proc Biol Sci       Date:  2022-07-06       Impact factor: 5.530

5.  New Insights Into the Relationships Within Subtribe Scorzonerinae (Cichorieae, Asteraceae) Using Hybrid Capture Phylogenomics (Hyb-Seq).

Authors:  Elham Hatami; Katy E Jones; Norbert Kilian
Journal:  Front Plant Sci       Date:  2022-07-01       Impact factor: 6.627

6.  The Making of Calibration Sausage Exemplified by Recalibrating the Transcriptomic Timetree of Jawed Vertebrates.

Authors:  David Marjanović
Journal:  Front Genet       Date:  2021-05-12       Impact factor: 4.599

7.  Comparing Partitioned Models to Mixture Models: Do Information Criteria Apply?

Authors:  Stephen M Crotty; Barbara R Holland
Journal:  Syst Biol       Date:  2022-10-12       Impact factor: 9.160

8.  Revisiting metazoan phylogeny with genomic sampling of all phyla.

Authors:  Christopher E Laumer; Rosa Fernández; Sarah Lemer; David Combosch; Kevin M Kocot; Ana Riesgo; Sónia C S Andrade; Wolfgang Sterrer; Martin V Sørensen; Gonzalo Giribet
Journal:  Proc Biol Sci       Date:  2019-07-10       Impact factor: 5.349

Review 9.  Embracing Uncertainty in Reconstructing Early Animal Evolution.

Authors:  Nicole King; Antonis Rokas
Journal:  Curr Biol       Date:  2017-10-09       Impact factor: 10.834

10.  Ctenophore relationships and their placement as the sister group to all other animals.

Authors:  Nathan V Whelan; Kevin M Kocot; Tatiana P Moroz; Krishanu Mukherjee; Peter Williams; Gustav Paulay; Leonid L Moroz; Kenneth M Halanych
Journal:  Nat Ecol Evol       Date:  2017-10-09       Impact factor: 15.460

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.