Literature DB >> 9501494

A genetic algorithm for maximum-likelihood phylogeny inference using nucleotide sequence data.

P O Lewis1.   

Abstract

Phylogeny reconstruction is a difficult computational problem, because the number of possible solutions increases with the number of included taxa. For example, for only 14 taxa, there are more than seven trillion possible unrooted phylogenetic trees. For this reason, phylogenetic inference methods commonly use clustering algorithms (e.g., the neighbor-joining method) or heuristic search strategies to minimize the amount of time spent evaluating nonoptimal trees. Even heuristic searches can be painfully slow, especially when computationally intensive optimality criteria such as maximum likelihood are used. I describe here a different approach to heuristic searching (using a genetic algorithm) that can tremendously reduce the time required for maximum-likelihood phylogenetic inference, especially for data sets involving large numbers of taxa. Genetic algorithms are simulations of natural selection in which individuals are encoded solutions to the problem of interest. Here, labeled phylogenetic trees are the individuals, and differential reproduction is effected by allowing the number of offspring produced by each individual to be proportional to that individual's rank likelihood score. Natural selection increases the average likelihood in the evolving population of phylogenetic trees, and the genetic algorithm is allowed to proceed until the likelihood of the best individual ceases to improve over time. An example is presented involving rbcL sequence data for 55 taxa of green plants. The genetic algorithm described here required only 6% of the computational effort required by a conventional heuristic search using tree bisection/reconnection (TBR) branch swapping to obtain the same maximum-likelihood topology.

Mesh:

Year:  1998        PMID: 9501494     DOI: 10.1093/oxfordjournals.molbev.a025924

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  19 in total

1.  The metapopulation genetic algorithm: An efficient solution for the problem of large phylogeny estimation.

Authors:  Alan R Lemmon; Michel C Milinkovitch
Journal:  Proc Natl Acad Sci U S A       Date:  2002-07-25       Impact factor: 11.205

Review 2.  New methods for inferring population dynamics from microbial sequences.

Authors:  Marcos Pérez-Losada; Megan L Porter; Loubna Tazi; Keith A Crandall
Journal:  Infect Genet Evol       Date:  2006-04-19       Impact factor: 3.342

3.  Matchings and phylogenetic trees.

Authors:  P W Diaconis; S P Holmes
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

4.  A LASSO-based approach to sample sites for phylogenetic tree search.

Authors:  Noa Ecker; Dana Azouri; Ben Bettisworth; Alexandros Stamatakis; Yishay Mansour; Itay Mayrose; Tal Pupko
Journal:  Bioinformatics       Date:  2022-06-24       Impact factor: 6.931

5.  HIV-1 molecular epidemiology in Guinea-Bissau, West Africa: origin, demography and migrations.

Authors:  Joakim Esbjörnsson; Mattias Mild; Fredrik Månsson; Hans Norrgren; Patrik Medstrand
Journal:  PLoS One       Date:  2011-02-18       Impact factor: 3.240

6.  MetaPIGA v2.0: maximum likelihood large phylogeny estimation using the metapopulation genetic algorithm and other stochastic heuristics.

Authors:  Raphaël Helaers; Michel C Milinkovitch
Journal:  BMC Bioinformatics       Date:  2010-07-15       Impact factor: 3.169

7.  PHYML Online--a web server for fast maximum likelihood-based phylogenetic inference.

Authors:  Stéphane Guindon; Franck Lethiec; Patrice Duroux; Olivier Gascuel
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

8.  Efficient tree searches with available algorithms.

Authors:  Gonzalo Giribet
Journal:  Evol Bioinform Online       Date:  2007-11-12       Impact factor: 1.625

9.  Efficient implementation of MrBayes on multi-GPU.

Authors:  Jie Bao; Hongju Xia; Jianfu Zhou; Xiaoguang Liu; Gang Wang
Journal:  Mol Biol Evol       Date:  2013-03-14       Impact factor: 16.240

10.  Estimation of phylogeny using a general Markov model.

Authors:  Vivek Jayaswal; Lars S Jermiin; John Robinson
Journal:  Evol Bioinform Online       Date:  2007-02-25       Impact factor: 1.625

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.