Literature DB >> 15805016

Missing the forest for the trees: phylogenetic compression and its implications for inferring complex evolutionary histories.

Cécile Ané1, Michael Sanderson.   

Abstract

Phylogenetic tree reconstruction is difficult in the presence of lateral gene transfer and other processes generating conflicting signals. We develop a new approach to this problem using ideas borrowed from algorithmic information theory. It selects the hypothesis that simultaneously minimizes the descriptive complexity of the tree(s) plus the data when encoded using those tree(s). In practice this is the hypothesis that can compress the data the most. We show not only that phylogenetic compression is an efficient method for encoding most phylogenetic data sets and is more efficient than compression schemes designed for single sequences, but also that it provides a clear information theoretic rule for determining when a collection of conflicting trees is a better explanation of the data than a single tree. By casting the parsimony problem in this more general framework, we also conclude that the so-called total-evidence tree--the tree constructed from all the data simultaneously--is not always the most economical explanation of the data.

Mesh:

Year:  2005        PMID: 15805016     DOI: 10.1080/10635150590905984

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  9 in total

1.  Normalized Compression Distance of Multisets with Applications.

Authors:  Andrew R Cohen; Paul M B Vitányi
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2015-08       Impact factor: 6.226

2.  A Daily-Updated Database and Tools for Comprehensive SARS-CoV-2 Mutation-Annotated Trees.

Authors:  Jakob McBroome; Bryan Thornlow; Angie S Hinrichs; Alexander Kramer; Nicola De Maio; Nick Goldman; David Haussler; Russell Corbett-Detig; Yatish Turakhia
Journal:  Mol Biol Evol       Date:  2021-12-09       Impact factor: 8.800

3.  Detecting phylogenetic breakpoints and discordance from genome-wide alignments for species tree reconstruction.

Authors:  Cécile Ané
Journal:  Genome Biol Evol       Date:  2011-02-28       Impact factor: 3.416

4.  Multigenic phylogeny and analysis of tree incongruences in Triticeae (Poaceae).

Authors:  Juan S Escobar; Céline Scornavacca; Alberto Cenci; Claire Guilhaumon; Sylvain Santoni; Emmanuel J P Douzery; Vincent Ranwez; Sylvain Glémin; Jacques David
Journal:  BMC Evol Biol       Date:  2011-06-24       Impact factor: 3.260

5.  STBase: one million species trees for comparative biology.

Authors:  Michelle M McMahon; Akshay Deepak; David Fernández-Baca; Darren Boss; Michael J Sanderson
Journal:  PLoS One       Date:  2015-02-13       Impact factor: 3.240

6.  Phylesystem: a git-based data store for community-curated phylogenetic estimates.

Authors:  Emily Jane McTavish; Cody E Hinchliff; James F Allman; Joseph W Brown; Karen A Cranston; Mark T Holder; Jonathan A Rees; Stephen A Smith
Journal:  Bioinformatics       Date:  2015-05-04       Impact factor: 6.937

7.  Efficiently Summarizing Relationships in Large Samples: A General Duality Between Statistics of Genealogies and Genomes.

Authors:  Peter Ralph; Kevin Thornton; Jerome Kelleher
Journal:  Genetics       Date:  2020-05-01       Impact factor: 4.562

8.  Fine-scale phylogenetic discordance across the house mouse genome.

Authors:  Michael A White; Cécile Ané; Colin N Dewey; Bret R Larget; Bret A Payseur
Journal:  PLoS Genet       Date:  2009-11-20       Impact factor: 5.917

9.  Exploring Parallel MPI Fault Tolerance Mechanisms for Phylogenetic Inference with RAxML-NG.

Authors:  Lukas Hübner; Alexey M Kozlov; Demian Hespe; Peter Sanders; Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2021-05-26       Impact factor: 6.931

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.