Literature DB >> 33020189

A Model of Indel Evolution by Finite-State, Continuous-Time Machines.

Ian Holmes1.   

Abstract

We introduce a systematic method of approximating finite-time transition probabilities for continuous-time insertion-deletion models on sequences. The method uses automata theory to describe the action of an infinitesimal evolutionary generator on a probability distribution over alignments, where both the generator and the alignment distribution can be represented by pair hidden Markov models (HMMs). In general, combining HMMs in this way induces a multiplication of their state spaces; to control this, we introduce a coarse-graining operation to keep the state space at a constant size. This leads naturally to ordinary differential equations for the evolution of the transition probabilities of the approximating pair HMM. The TKF91 model emerges as an exact solution to these equations for the special case of single-residue indels. For the more general case of multiple-residue indels, the equations can be solved by numerical integration. Using simulated data, we show that the resulting distribution over alignments, when compared to previous approximations, is a better fit over a broader range of parameters. We also propose a related approach to develop differential equations for sufficient statistics to estimate the underlying instantaneous indel rates by expectation maximization. Our code and data are available at https://github.com/ihh/trajectory-likelihood.
Copyright © 2020 Holmes by the Genetics Society of America.

Entities:  

Keywords:  Markov processes; automata; hidden Markov models; indels; molecular evolution; phylogenetics

Mesh:

Year:  2020        PMID: 33020189      PMCID: PMC7768254          DOI: 10.1534/genetics.120.303630

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


  14 in total

1.  Evolutionary HMMs: a Bayesian approach to multiple alignment.

Authors:  I Holmes; W J Bruno
Journal:  Bioinformatics       Date:  2001-09       Impact factor: 6.937

Review 2.  Inching toward reality: an improved likelihood model of sequence evolution.

Authors:  J L Thorne; H Kishino; J Felsenstein
Journal:  J Mol Evol       Date:  1992-01       Impact factor: 2.395

3.  An algorithm for progressive multiple alignment of sequences with insertions.

Authors:  Ari Löytynoja; Nick Goldman
Journal:  Proc Natl Acad Sci U S A       Date:  2005-07-06       Impact factor: 11.205

4.  HOMSTRAD: a database of protein structure alignments for homologous families.

Authors:  K Mizuguchi; C M Deane; T L Blundell; J P Overington
Journal:  Protein Sci       Date:  1998-11       Impact factor: 6.725

5.  A Simulation-Based Approach to Statistical Alignment.

Authors:  Eli Levy Karin; Haim Ashkenazy; Jotun Hein; Tal Pupko
Journal:  Syst Biol       Date:  2019-03-01       Impact factor: 15.683

6.  Evolutionary inference via the Poisson Indel Process.

Authors:  Alexandre Bouchard-Côté; Michael I Jordan
Journal:  Proc Natl Acad Sci U S A       Date:  2012-12-28       Impact factor: 11.205

7.  The Cumulative Indel Model: Fast and Accurate Statistical Evolutionary Alignment.

Authors:  Nicola De Maio
Journal:  Syst Biol       Date:  2021-02-10       Impact factor: 15.683

8.  A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.

Authors:  M Kimura
Journal:  J Mol Evol       Date:  1980-12       Impact factor: 2.395

9.  Accurate reconstruction of insertion-deletion histories by statistical phylogenetics.

Authors:  Oscar Westesson; Gerton Lunter; Benedict Paten; Ian Holmes
Journal:  PLoS One       Date:  2012-04-20       Impact factor: 3.240

10.  Parameterizing sequence alignment with an explicit evolutionary model.

Authors:  Elena Rivas; Sean R Eddy
Journal:  BMC Bioinformatics       Date:  2015-12-10       Impact factor: 3.169

View more
  2 in total

1.  Correlations between alignment gaps and nucleotide substitution or amino acid replacement.

Authors:  Tae-Kun Seo; Benjamin D Redelings; Jeffrey L Thorne
Journal:  Proc Natl Acad Sci U S A       Date:  2022-08-16       Impact factor: 12.779

2.  Measuring Phylogenetic Information of Incomplete Sequence Data.

Authors:  Tae-Kun Seo; Olivier Gascuel; Jeffrey L Thorne
Journal:  Syst Biol       Date:  2022-04-19       Impact factor: 9.160

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.