Literature DB >> 11928477

The accuracy of fast phylogenetic methods for large datasets.

Luay Nakhleh1, Bernard M E Moret, Usman Roshan, Katherine St John, Jerry Sun, Tandy Warnow.   

Abstract

Whole-genome phylogenetic studies require various sources of phylogenetic signals to produce an accurate picture of the evolutionary history of a group of genomes. In particular, sequence-based reconstruction will play an important role, especially in resolving more recent events. But using sequences at the level of whole genomes means working with very large amounts of data--large numbers of sequences--as well as large phylogenetic distances, so that reconstruction methods must be both fast and robust as well as accurate. We study the accuracy, convergence rate, and speed of several fast reconstruction methods: neighbor-joining, Weighbor (a weighted version of neighbor-joining), greedy parsimony, and a new phylogenetic reconstruction method based on disk-covering and parsimony search (DCM-NJ + MP). Our study uses extensive simulations based on random birth-death trees, with controlled deviations from ultrametricity. We find that Weighbor, thanks to its sophisticated handling of probabilities, outperforms other methods for short sequences, while our new method is the best choice for sequence lengths above 100. For very large sequence lengths, all four methods have similar accuracy, so that the speed of neighbor-joining and greedy parsimony makes them the two methods of choice.

Mesh:

Substances:

Year:  2002        PMID: 11928477

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  4 in total

1.  Distance-based genome rearrangement phylogeny.

Authors:  Li-San Wang; Tandy Warnow; Bernard M E Moret; Robert K Jansen; Linda A Raubeson
Journal:  J Mol Evol       Date:  2006-10-04       Impact factor: 2.395

2.  Standard maximum likelihood analyses of alignments with gaps can be statistically inconsistent.

Authors:  Tandy Warnow
Journal:  PLoS Curr       Date:  2012-03-09

3.  DACTAL: divide-and-conquer trees (almost) without alignments.

Authors:  Serita Nelesen; Kevin Liu; Li-San Wang; C Randal Linder; Tandy Warnow
Journal:  Bioinformatics       Date:  2012-06-15       Impact factor: 6.937

4.  Evolution of proteins and proteomes: a phylogenetics approach.

Authors:  Toni Gabaldón
Journal:  Evol Bioinform Online       Date:  2007-02-24       Impact factor: 1.625

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.