
Efficient Detection of Repeating Sites to Accelerate Phylogenetic Likelihood Calculations.

K Kobert, A Stamatakis, T Flouri.

Abstract

The phylogenetic likelihood function (PLF) is the major computational bottleneck in several applications of evolutionary biology such as phylogenetic inference, species delimitation, model selection, and divergence times estimation. Given the alignment, a tree and the evolutionary model parameters, the likelihood function computes the conditional likelihood vectors for every node of the tree. Vector entries for which all input data are identical result in redundant likelihood operations which, in turn, yield identical conditional values. Such operations can be omitted for improving run-time and, using appropriate data structures, reducing memory usage. We present a fast, novel method for identifying and omitting such redundant operations in phylogenetic likelihood calculations, and assess the performance improvement and memory savings attained by our method. Using empirical and simulated data sets, we show that a prototype implementation of our method yields up to 12-fold speedups and uses up to 78% less memory than one of the fastest and most highly tuned implementations of the PLF currently available. Our method is generic and can seamlessly be integrated into any phylogenetic likelihood implementation. [Algorithms; maximum likelihood; phylogenetic likelihood function; phylogenetics].
© The Author(s) 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
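
The idea in the abstract, that alignment sites with identical input data below a node yield identical conditional likelihood entries, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `label_sites`, the nested-tuple tree encoding, and the toy alignment are all assumptions made for illustration. Each node receives one class identifier per site; two sites share an identifier at a node exactly when their tip states within that node's subtree are identical, so a likelihood kernel would only need to compute one vector entry per class.

```python
def label_sites(node, alignment, table):
    """Assign a class id to every alignment site for the subtree at `node`.

    Hypothetical encoding: a tip is a taxon name (str), an inner node is a
    (left, right) tuple. `table` maps each node to its list of class ids.
    """
    if isinstance(node, str):
        # At a tip, the class of a site is determined by its character state.
        keys = alignment[node]
    else:
        # At an inner node, the class is determined by the pair of child classes.
        left, right = node
        keys = zip(label_sites(left, alignment, table),
                   label_sites(right, alignment, table))
    ids = {}
    # Number the distinct keys in order of first appearance.
    table[node] = [ids.setdefault(k, len(ids)) for k in keys]
    return table[node]


# Toy data (assumed, not from the paper): three taxa, six sites.
alignment = {"A": "ACGTAC", "B": "ACGAAC", "C": "TCGTGC"}
tree = (("A", "B"), "C")

table = {}
root_classes = label_sites(tree, alignment, table)
inner_classes = table[("A", "B")]

print(inner_classes)  # 4 distinct classes among 6 sites
print(root_classes)   # 5 distinct classes among 6 sites
```

In this toy case sites 1 and 5 differ as full alignment columns, yet they repeat within the subtree (A, B), so the entry for that inner node would need to be computed only once for both.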

Year:  2017        PMID: 27576546      PMCID: PMC5837535          DOI: 10.1093/sysbio/syw075

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   9.160


References (20 in total, first 10 listed)

1.  A Dirichlet process prior for estimating lineage-specific substitution rates.

Authors:  Tracy A Heath; Mark T Holder; John P Huelsenbeck
Journal:  Mol Biol Evol       Date:  2011-11-02       Impact factor: 16.240

2.  ProtTest: selection of best-fit models of protein evolution.

Authors:  Federico Abascal; Rafael Zardoya; David Posada
Journal:  Bioinformatics       Date:  2005-01-12       Impact factor: 6.937

3.  PAML 4: phylogenetic analysis by maximum likelihood.

Authors:  Ziheng Yang
Journal:  Mol Biol Evol       Date:  2007-05-04       Impact factor: 16.240

4.  Evolutionary trees from DNA sequences: a maximum likelihood approach.

Authors:  J Felsenstein
Journal:  J Mol Evol       Date:  1981       Impact factor: 2.395

5.  Time and memory efficient likelihood-based tree searches on phylogenomic alignments with missing data.

Authors:  Alexandros Stamatakis; Nikolaos Alachiotis
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

6.  Algorithms, data structures, and numerics for likelihood-based phylogenetic inference of huge trees.

Authors:  Fernando Izquierdo-Carrasco; Stephen A Smith; Alexandros Stamatakis
Journal:  BMC Bioinformatics       Date:  2011-12-13       Impact factor: 3.169

7.  MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space.

Authors:  Fredrik Ronquist; Maxim Teslenko; Paul van der Mark; Daniel L Ayres; Aaron Darling; Sebastian Höhna; Bret Larget; Liang Liu; Marc A Suchard; John P Huelsenbeck
Journal:  Syst Biol       Date:  2012-02-22       Impact factor: 15.683

8.  The phylogenetic likelihood library.

Authors:  T Flouri; F Izquierdo-Carrasco; D Darriba; A J Aberer; L-T Nguyen; B Q Minh; A Von Haeseler; A Stamatakis
Journal:  Syst Biol       Date:  2014-10-30       Impact factor: 15.683

9.  Optimization strategies for fast detection of positive selection on phylogenetic trees.

Authors:  Mario Valle; Hannes Schabauer; Christoph Pacher; Heinz Stockinger; Alexandros Stamatakis; Marc Robinson-Rechavi; Nicolas Salamin
Journal:  Bioinformatics       Date:  2014-01-02       Impact factor: 6.937

10.  ExaBayes: massively parallel Bayesian tree inference for the whole-genome era.

Authors:  Andre J Aberer; Kassian Kobert; Alexandros Stamatakis
Journal:  Mol Biol Evol       Date:  2014-08-18       Impact factor: 16.240

Cited by (8 in total)

1.  BEAGLE 3: Improved Performance, Scaling, and Usability for a High-Performance Computing Library for Statistical Phylogenetics.

Authors:  Daniel L Ayres; Michael P Cummings; Guy Baele; Aaron E Darling; Paul O Lewis; David L Swofford; John P Huelsenbeck; Philippe Lemey; Andrew Rambaut; Marc A Suchard
Journal:  Syst Biol       Date:  2019-11-01       Impact factor: 15.683

2.  A LASSO-based approach to sample sites for phylogenetic tree search.

Authors:  Noa Ecker; Dana Azouri; Ben Bettisworth; Alexandros Stamatakis; Yishay Mansour; Itay Mayrose; Tal Pupko
Journal:  Bioinformatics       Date:  2022-06-24       Impact factor: 6.931

3.  RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference.

Authors:  Alexey M Kozlov; Diego Darriba; Tomáš Flouri; Benoit Morel; Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

4.  ModelTest-NG: A New and Scalable Tool for the Selection of DNA and Protein Evolutionary Models.

Authors:  Diego Darriba; David Posada; Alexey M Kozlov; Alexandros Stamatakis; Benoit Morel; Tomas Flouri
Journal:  Mol Biol Evol       Date:  2020-01-01       Impact factor: 16.240

5.  Harnessing machine learning to guide phylogenetic-tree search algorithms.

Authors:  Dana Azouri; Shiran Abadi; Yishay Mansour; Itay Mayrose; Tal Pupko
Journal:  Nat Commun       Date:  2021-03-31       Impact factor: 14.919

6.  Felsenstein Phylogenetic Likelihood.

Authors:  David Posada; Keith A Crandall
Journal:  J Mol Evol       Date:  2021-01-13       Impact factor: 2.395

7.  Fast algorithms for computing phylogenetic divergence time.

Authors:  Ralph W Crosby; Tiffani L Williams
Journal:  BMC Bioinformatics       Date:  2017-12-06       Impact factor: 3.169

8.  Improving the performance of Bayesian phylogenetic inference under relaxed clock models.

Authors:  Rong Zhang; Alexei Drummond
Journal:  BMC Evol Biol       Date:  2020-05-14       Impact factor: 3.260
