Literature DB >> 14694074

A "Long Indel" model for evolutionary sequence alignment.

I Miklós1, G A Lunter, I Holmes.   

Abstract

We present a new probabilistic model of sequence evolution, allowing indels of arbitrary length, and give sequence alignment algorithms for our model. Previously implemented evolutionary models have allowed (at most) single-residue indels or have introduced artifacts such as the existence of indivisible "fragments." We compare our algorithm to these previous methods by applying it to the structural homology dataset HOMSTRAD, evaluating the accuracy of (1) alignments and (2) evolutionary time estimates. With our method, it is possible (for the first time) to integrate probabilistic sequence alignment, with reliability indicators and arbitrary gap penalties, in the same framework as phylogenetic reconstruction. Our alignment algorithm requires that we evaluate the likelihood of any specific path of mutation events in a continuous-time Markov model, with the event times integrated out. To this effect, we introduce a "trajectory likelihood" algorithm (Appendix A). We anticipate that this algorithm will be useful in more general contexts, such as Markov Chain Monte Carlo simulations.

Mesh:

Year:  2003        PMID: 14694074     DOI: 10.1093/molbev/msh043

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  41 in total

1.  HandAlign: Bayesian multiple sequence alignment, phylogeny and ancestral reconstruction.

Authors:  Oscar Westesson; Lars Barquist; Ian Holmes
Journal:  Bioinformatics       Date:  2012-01-28       Impact factor: 6.937

2.  A stochastic evolutionary model for protein structure alignment and phylogeny.

Authors:  Christopher J Challis; Scott C Schmidler
Journal:  Mol Biol Evol       Date:  2012-06-21       Impact factor: 16.240

3.  Basing population genetic inferences and models of molecular evolution upon desired stationary distributions of DNA or protein sequences.

Authors:  Sang Chul Choi; Benjamin D Redelings; Jeffrey L Thorne
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-12-27       Impact factor: 6.237

4.  Problems and solutions for estimating indel rates and length distributions.

Authors:  Reed A Cartwright
Journal:  Mol Biol Evol       Date:  2008-11-28       Impact factor: 16.240

Review 5.  Multilocus phylogeography and phylogenetics using sequence-based markers.

Authors:  Patrícia H Brito; Scott V Edwards
Journal:  Genetica       Date:  2008-07-24       Impact factor: 1.082

6.  Uncertainty in homology inferences: assessing and improving genomic sequence alignment.

Authors:  Gerton Lunter; Andrea Rocco; Naila Mimouni; Andreas Heger; Alexandre Caldeira; Jotun Hein
Journal:  Genome Res       Date:  2007-12-11       Impact factor: 9.043

7.  Genome-wide nucleotide-level mammalian ancestor reconstruction.

Authors:  Benedict Paten; Javier Herrero; Stephen Fitzgerald; Kathryn Beal; Paul Flicek; Ian Holmes; Ewan Birney
Journal:  Genome Res       Date:  2008-10-10       Impact factor: 9.043

8.  Evolutionary triplet models of structured RNA.

Authors:  Robert K Bradley; Ian Holmes
Journal:  PLoS Comput Biol       Date:  2009-08-28       Impact factor: 4.475

9.  Tools for simulating evolution of aligned genomic regions with integrated parameter estimation.

Authors:  Avinash Varadarajan; Robert K Bradley; Ian H Holmes
Journal:  Genome Biol       Date:  2008-10-08       Impact factor: 13.583

10.  BigFoot: Bayesian alignment and phylogenetic footprinting with MCMC.

Authors:  Rahul Satija; Adám Novák; István Miklós; Rune Lyngsø; Jotun Hein
Journal:  BMC Evol Biol       Date:  2009-08-28       Impact factor: 3.260

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.