Literature DB >> 32458974

Gradients Do Grow on Trees: A Linear-Time O(N)-Dimensional Gradient for Statistical Phylogenetics.

Xiang Ji1,2, Zhenyu Zhang3, Andrew Holbrook3, Akihiko Nishimura4, Guy Baele5, Andrew Rambaut6, Philippe Lemey5, Marc A Suchard1,3,7.   

Abstract

Calculation of the log-likelihood stands as the computational bottleneck for many statistical phylogenetic algorithms. Even worse is its gradient evaluation, often used to target regions of high probability. Order O(N)-dimensional gradient calculations based on the standard pruning algorithm require O(N2) operations, where N is the number of sampled molecular sequences. With the advent of high-throughput sequencing, recent phylogenetic studies have analyzed hundreds to thousands of sequences, with an apparent trend toward even larger data sets as a result of advancing technology. Such large-scale analyses challenge phylogenetic reconstruction by requiring inference on larger sets of process parameters to model the increasing data heterogeneity. To make these analyses tractable, we present a linear-time algorithm for O(N)-dimensional gradient evaluation and apply it to general continuous-time Markov processes of sequence substitution on a phylogenetic tree without a need to assume either stationarity or reversibility. We apply this approach to learn the branch-specific evolutionary rates of three pathogenic viruses: West Nile virus, Dengue virus, and Lassa virus. Our proposed algorithm significantly improves inference efficiency with a 126- to 234-fold increase in maximum-likelihood optimization and a 16- to 33-fold computational performance increase in a Bayesian framework.
© The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  Bayesian inference; linear-time gradient algorithm; maximum likelihood; random-effects molecular clock model

Mesh:

Year:  2020        PMID: 32458974      PMCID: PMC7530611          DOI: 10.1093/molbev/msaa130

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  37 in total

1.  Effects of models of rate evolution on estimation of divergence dates with special reference to the metazoan 18S ribosomal RNA phylogeny.

Authors:  Stéphane Aris-Brosou; Ziheng Yang
Journal:  Syst Biol       Date:  2002-10       Impact factor: 15.683

Review 2.  Molecular clocks: four decades of evolution.

Authors:  Sudhir Kumar
Journal:  Nat Rev Genet       Date:  2005-08       Impact factor: 53.242

3.  Estimating the rate of evolution of the rate of molecular evolution.

Authors:  J L Thorne; H Kishino; I S Painter
Journal:  Mol Biol Evol       Date:  1998-12       Impact factor: 16.240

4.  Phylogeography takes a relaxed random walk in continuous space and time.

Authors:  Philippe Lemey; Andrew Rambaut; John J Welch; Marc A Suchard
Journal:  Mol Biol Evol       Date:  2010-03-04       Impact factor: 16.240

Review 5.  Computational advances in maximum likelihood methods for molecular phylogeny.

Authors:  E E Schadt; J S Sinsheimer; K Lange
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

6.  Emerging infectious diseases: A proactive approach.

Authors:  David E Bloom; Steven Black; Rino Rappuoli
Journal:  Proc Natl Acad Sci U S A       Date:  2017-04-10       Impact factor: 11.205

7.  Phylogeography and population dynamics of dengue viruses in the Americas.

Authors:  Orchid M Allicock; Philippe Lemey; Andrew J Tatem; Oliver G Pybus; Shannon N Bennett; Brandi A Mueller; Marc A Suchard; Jerome E Foster; Andrew Rambaut; Christine V F Carrington
Journal:  Mol Biol Evol       Date:  2012-01-06       Impact factor: 16.240

8.  Bayesian random local clocks, or one rate to rule them all.

Authors:  Alexei J Drummond; Marc A Suchard
Journal:  BMC Biol       Date:  2010-08-31       Impact factor: 7.431

9.  Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10.

Authors:  Marc A Suchard; Philippe Lemey; Guy Baele; Daniel L Ayres; Alexei J Drummond; Andrew Rambaut
Journal:  Virus Evol       Date:  2018-06-08

10.  Metagenomic sequencing at the epicenter of the Nigeria 2018 Lassa fever outbreak.

Authors:  E Ogbaini-Emovon; S Günther; S Duraffour; L E Kafetzopoulou; S T Pullan; P Lemey; M A Suchard; D U Ehichioya; M Pahlmann; A Thielebein; J Hinzmann; L Oestereich; D M Wozniak; K Efthymiadis; D Schachten; F Koenig; J Matjeschk; S Lorenzen; S Lumley; Y Ighodalo; D I Adomeh; T Olokor; E Omomoh; R Omiunu; J Agbukor; B Ebo; J Aiyepada; P Ebhodaghe; B Osiemi; S Ehikhametalor; P Akhilomen; M Airende; R Esumeh; E Muoebonam; R Giwa; A Ekanem; G Igenegbale; G Odigie; G Okonofua; R Enigbe; J Oyakhilome; E O Yerumoh; I Odia; C Aire; M Okonofua; R Atafo; E Tobin; D Asogun; N Akpede; P O Okokhere; M O Rafiu; K O Iraoyah; C O Iruolagbe; P Akhideno; C Erameh; G Akpede; E Isibor; D Naidoo; R Hewson; J A Hiscox; R Vipond; M W Carroll; C Ihekweazu; P Formenty; S Okogbenin
Journal:  Science       Date:  2019-01-04       Impact factor: 47.728

View more
  4 in total

1.  Hamiltonian Monte Carlo sampling to estimate past population dynamics using the skygrid coalescent model in a Bayesian phylogenetics framework.

Authors:  Guy Baele; Mandev S Gill; Philippe Lemey; Marc A Suchard
Journal:  Wellcome Open Res       Date:  2020-03-30

2.  Felsenstein Phylogenetic Likelihood.

Authors:  David Posada; Keith A Crandall
Journal:  J Mol Evol       Date:  2021-01-13       Impact factor: 2.395

Review 3.  Scalable Bayesian phylogenetics.

Authors:  Alexander A Fisher; Gabriel W Hassler; Xiang Ji; Guy Baele; Marc A Suchard; Philippe Lemey
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2022-08-22       Impact factor: 6.671

4.  Variational Phylodynamic Inference Using Pandemic-scale Data.

Authors:  Caleb Ki; Jonathan Terhorst
Journal:  Mol Biol Evol       Date:  2022-08-03       Impact factor: 8.800

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.