Literature DB >> 17691890

Efficiently computing the Robinson-Foulds metric.

Nicholas D Pattengale1, Eric J Gottlieb, Bernard M E Moret.   

Abstract

The Robinson-Foulds (RF) metric is the measure most widely used in comparing phylogenetic trees; it can be computed in linear time using Day's algorithm. When faced with the need to compare large numbers of large trees, however, even linear time becomes prohibitive. We present a randomized approximation scheme that provides, in sublinear time and with high probability, a (1 + epsilon) approximation of the true RF metric. Our approach is to use a sublinear-space embedding of the trees, combined with an application of the Johnson-Lindenstrauss lemma to approximate vector norms very rapidly. We complement our algorithm by presenting an efficient embedding procedure, thereby resolving an open issue from the preliminary version of this paper. We have also improved the performance of Day's (exact) algorithm in practice by using techniques discovered while implementing our approximation scheme. Indeed, we give a unified framework for edge-based tree algorithms in which implementation tradeoffs are clear. Finally, we present detailed experimental results illustrating the precision and running-time tradeoffs as well as demonstrating the speed of our approach. Our new implementation, FastRF, is available as an open-source tool for phylogenetic analysis.

Mesh:

Substances:

Year:  2007        PMID: 17691890     DOI: 10.1089/cmb.2007.R012

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  14 in total

1.  PhyLAT: a phylogenetic local alignment tool.

Authors:  Hongtao Sun; Jeremy D Buhler
Journal:  Bioinformatics       Date:  2012-04-06       Impact factor: 6.937

2.  Robinson-Foulds supertrees.

Authors:  Mukul S Bansal; J Gordon Burleigh; Oliver Eulenstein; David Fernández-Baca
Journal:  Algorithms Mol Biol       Date:  2010-02-24       Impact factor: 1.405

3.  A Linear Time Solution to the Labeled Robinson-Foulds Distance Problem.

Authors:  Samuel Briand; Christophe Dessimoz; Nadia El-Mabrouk; Yannis Nevers
Journal:  Syst Biol       Date:  2022-10-12       Impact factor: 9.160

4.  On the use of cartographic projections in visualizing phylo-genetic tree space.

Authors:  Kenneth Sundberg; Mark Clement; Quinn Snell
Journal:  Algorithms Mol Biol       Date:  2010-06-08       Impact factor: 1.405

5.  Constructing majority-rule supertrees.

Authors:  Jianrong Dong; David Fernández-Baca; F R McMorris
Journal:  Algorithms Mol Biol       Date:  2010-01-04       Impact factor: 1.405

6.  Dental Data Perform Relatively Poorly in Reconstructing Mammal Phylogenies: Morphological Partitions Evaluated with Molecular Benchmarks.

Authors:  Robert S Sansom; Matthew Albion Wills; Tamara Williams
Journal:  Syst Biol       Date:  2017-09-01       Impact factor: 9.160

7.  Refining discordant gene trees.

Authors:  Pawel Górecki; Oliver Eulenstein
Journal:  BMC Bioinformatics       Date:  2014-11-13       Impact factor: 3.169

8.  Clustering Genes of Common Evolutionary History.

Authors:  Kevin Gori; Tomasz Suchan; Nadir Alvarez; Nick Goldman; Christophe Dessimoz
Journal:  Mol Biol Evol       Date:  2016-02-17       Impact factor: 16.240

9.  Phylogenetic inference under recombination using Bayesian stochastic topology selection.

Authors:  Alex Webb; John M Hancock; Chris C Holmes
Journal:  Bioinformatics       Date:  2008-11-20       Impact factor: 6.937

10.  Phylogenetic search through partial tree mixing.

Authors:  Kenneth Sundberg; Mark Clement; Quinn Snell; Dan Ventura; Michael Whiting; Keith Crandall
Journal:  BMC Bioinformatics       Date:  2012-08-24       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.