Marc A Suchard1, Andrew Rambaut. 1. Department of Biomathematics, University of California, Los Angeles, CA 90095, USA. msuchard@ucla.edu
Abstract
MOTIVATION: Statistical phylogenetics is computationally intensive, resulting in considerable attention meted on techniques for parallelization. Codon-based models allow for independent rates of synonymous and replacement substitutions and have the potential to more adequately model the process of protein-coding sequence evolution with a resulting increase in phylogenetic accuracy. Unfortunately, due to the high number of codon states, computational burden has largely thwarted phylogenetic reconstruction under codon models, particularly at the genomic-scale. Here, we describe novel algorithms and methods for evaluating phylogenies under arbitrary molecular evolutionary models on graphics processing units (GPUs), making use of the large number of processing cores to efficiently parallelize calculations even for large state-size models. RESULTS: We implement the approach in an existing Bayesian framework and apply the algorithms to estimating the phylogeny of 62 complete mitochondrial genomes of carnivores under a 60-state codon model. We see a near 90-fold speed increase over an optimized CPU-based computation and a >140-fold increase over the currently available implementation, making this the first practical use of codon models for phylogenetic inference over whole mitochondrial or microorganism genomes. AVAILABILITY AND IMPLEMENTATION: Source code provided in BEAGLE: Broad-platform Evolutionary Analysis General Likelihood Evaluator, a cross-platform/processor library for phylogenetic likelihood computation (http://beagle-lib.googlecode.com/). We employ a BEAGLE-implementation using the Bayesian phylogenetics framework BEAST (http://beast.bio.ed.ac.uk/).
MOTIVATION: Statistical phylogenetics is computationally intensive, resulting in considerable attention meted on techniques for parallelization. Codon-based models allow for independent rates of synonymous and replacement substitutions and have the potential to more adequately model the process of protein-coding sequence evolution with a resulting increase in phylogenetic accuracy. Unfortunately, due to the high number of codon states, computational burden has largely thwarted phylogenetic reconstruction under codon models, particularly at the genomic-scale. Here, we describe novel algorithms and methods for evaluating phylogenies under arbitrary molecular evolutionary models on graphics processing units (GPUs), making use of the large number of processing cores to efficiently parallelize calculations even for large state-size models. RESULTS: We implement the approach in an existing Bayesian framework and apply the algorithms to estimating the phylogeny of 62 complete mitochondrial genomes of carnivores under a 60-state codon model. We see a near 90-fold speed increase over an optimized CPU-based computation and a >140-fold increase over the currently available implementation, making this the first practical use of codon models for phylogenetic inference over whole mitochondrial or microorganism genomes. AVAILABILITY AND IMPLEMENTATION: Source code provided in BEAGLE: Broad-platform Evolutionary Analysis General Likelihood Evaluator, a cross-platform/processor library for phylogenetic likelihood computation (http://beagle-lib.googlecode.com/). We employ a BEAGLE-implementation using the Bayesian phylogenetics framework BEAST (http://beast.bio.ed.ac.uk/).
Authors: Elizabeth R Dumont; Liliana M Dávalos; Aaron Goldberg; Sharlene E Santana; Katja Rex; Christian C Voigt Journal: Proc Biol Sci Date: 2011-11-23 Impact factor: 5.349
Authors: Diana Edo-Matas; Philippe Lemey; Jennifer A Tom; Cèlia Serna-Bolea; Agnes E van den Blink; Angélique B van 't Wout; Hanneke Schuitemaker; Marc A Suchard Journal: Mol Biol Evol Date: 2010-12-06 Impact factor: 16.240
Authors: Marc A Suchard; Quanli Wang; Cliburn Chan; Jacob Frelinger; Andrew Cron; Mike West Journal: J Comput Graph Stat Date: 2010-06-01 Impact factor: 2.302
Authors: Marcio R T Nunes; Gustavo Palacios; Jedson F Cardoso; Livia C Martins; Edivaldo C Sousa; Clayton P S de Lima; Daniele B A Medeiros; Nazir Savji; Aaloki Desai; Sueli G Rodrigues; Valeria L Carvalho; W Ian Lipkin; Pedro F C Vasconcelos Journal: J Virol Date: 2012-09-26 Impact factor: 5.103
Authors: S N Bennett; A J Drummond; D D Kapan; M A Suchard; J L Muñoz-Jordán; O G Pybus; E C Holmes; D J Gubler Journal: Mol Biol Evol Date: 2009-12-04 Impact factor: 16.240
Authors: Martha I Nelson; Angel Balmaseda; Guillermina Kuan; Saira Saborio; Xudong Lin; Rebecca A Halpin; Timothy B Stockwell; David E Wentworth; Eva Harris; Aubree Gordon Journal: Virology Date: 2014-06-22 Impact factor: 3.616
Authors: Karsten Salomo; James F Smith; Taylor S Feild; Marie-Stéphanie Samain; Laura Bond; Christopher Davidson; Jay Zimmers; Christoph Neinhuis; Stefan Wanke Journal: Syst Bot Date: 2017-12-18 Impact factor: 1.101