Literature DB >> 12611804

Statistical alignment based on fragment insertion and deletion models.

Dirk Metzler1.   

Abstract

MOTIVATION: The topic of this paper is the estimation of alignments and mutation rates based on stochastic sequence-evolution models that allow insertions and deletions of subsequences ('fragments') and not just single bases. The model we propose is a variant of a model introduced by Thorne et al., (J. Mol. Evol., 34, 3-16, 1992). The computational tractability of the model depends on certain restrictions in the insertion/deletion process; possible effects we discuss.
RESULTS: The process of fragment insertion and deletion in the sequence-evolution model induces a hidden Markov structure at the level of alignments and thus makes possible efficient statistical alignment algorithms. As an example we apply a sampling procedure to assess the variability in alignment and mutation parameter estimates for HVR1 sequences of human and orangutan, improving results of previous work. Simulation studies give evidence that estimation methods based on the proposed model also give satisfactory results when applied to data for which the restrictions in the insertion/deletion process do not hold. AVAILABILITY: The source code of the software for sampling alignments and mutation rates for a pair of DNA sequences according to the fragment insertion and deletion model is freely available from http://www.math.uni-frankfurt.de/~stoch/software/mcmcsalut under the terms of the GNU public license (GPL, 2000).

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12611804     DOI: 10.1093/bioinformatics/btg026

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  14 in total

1.  Predicting RNA secondary structures with pseudoknots by MCMC sampling.

Authors:  Dirk Metzler; Markus E Nebel
Journal:  J Math Biol       Date:  2007-06-23       Impact factor: 2.259

2.  Problems and solutions for estimating indel rates and length distributions.

Authors:  Reed A Cartwright
Journal:  Mol Biol Evol       Date:  2008-11-28       Impact factor: 16.240

3.  Uncertainty in homology inferences: assessing and improving genomic sequence alignment.

Authors:  Gerton Lunter; Andrea Rocco; Naila Mimouni; Andreas Heger; Alexandre Caldeira; Jotun Hein
Journal:  Genome Res       Date:  2007-12-11       Impact factor: 9.043

4.  Measuring Phylogenetic Information of Incomplete Sequence Data.

Authors:  Tae-Kun Seo; Olivier Gascuel; Jeffrey L Thorne
Journal:  Syst Biol       Date:  2022-04-19       Impact factor: 9.160

5.  Treelength optimization for phylogeny estimation.

Authors:  Kevin Liu; Tandy Warnow
Journal:  PLoS One       Date:  2012-03-19       Impact factor: 3.240

6.  SIMPROT: using an empirically determined indel distribution in simulations of protein evolution.

Authors:  Andy Pang; Andrew D Smith; Paulo A S Nuin; Elisabeth R M Tillier
Journal:  BMC Bioinformatics       Date:  2005-09-27       Impact factor: 3.169

7.  Evolutionary models for insertions and deletions in a probabilistic modeling framework.

Authors:  Elena Rivas
Journal:  BMC Bioinformatics       Date:  2005-03-21       Impact factor: 3.169

8.  Bayesian coestimation of phylogeny and sequence alignment.

Authors:  Gerton Lunter; István Miklós; Alexei Drummond; Jens Ledet Jensen; Jotun Hein
Journal:  BMC Bioinformatics       Date:  2005-04-01       Impact factor: 3.169

9.  Benchmarking tools for the alignment of functional noncoding DNA.

Authors:  Daniel A Pollard; Casey M Bergman; Jens Stoye; Susan E Celniker; Michael B Eisen
Journal:  BMC Bioinformatics       Date:  2004-01-21       Impact factor: 3.169

10.  Dinucleotide controlled null models for comparative RNA gene prediction.

Authors:  Tanja Gesell; Stefan Washietl
Journal:  BMC Bioinformatics       Date:  2008-05-27       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.