Literature DB >> 17110370

Indelign: a probabilistic framework for annotation of insertions and deletions in a multiple alignment.

Jaebum Kim1, Saurabh Sinha.   

Abstract

MOTIVATION: A quantitative study of molecular evolutionary events such as substitutions, insertions and deletions from closely related genomes requires (1) an accurate multiple sequence alignment program and (2) a method to annotate the insertions and deletions that explain the 'gaps' in the alignment. Although the former requirement has been extensively addressed, the latter problem has received little attention, especially in a comprehensive probabilistic framework.
RESULTS: Here, we present Indelign, a program that uses a probabilistic evolutionary model to compute the most likely scenario of insertions and deletions consistent with an input multiple alignment. It is also capable of modifying the given alignment so as to obtain a better agreement with the evolutionary model. We find close to optimal performance and substantial improvement over alternative methods, in tests of Indelign on synthetic data. We use Indelign to analyze regulatory sequences in Drosophila, and find an excess of insertions over deletions, which is different from what has been reported for neutral sequences. AVAILABILITY: The Indelign program may be downloaded from the website http://veda.cs.uiuc.edu/indelign/ SUPPLEMENTARY INFORMATION: Supplementary material is available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 17110370     DOI: 10.1093/bioinformatics/btl578

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  20 in total

1.  Comparative analysis of the primate X-inactivation center region and reconstruction of the ancestral primate XIST locus.

Authors:  Julie E Horvath; Christina B Sheedy; Stephanie L Merrett; Abdoulaye Banire Diallo; David L Swofford; Eric D Green; Huntington F Willard
Journal:  Genome Res       Date:  2011-04-25       Impact factor: 9.043

2.  PhyLAT: a phylogenetic local alignment tool.

Authors:  Hongtao Sun; Jeremy D Buhler
Journal:  Bioinformatics       Date:  2012-04-06       Impact factor: 6.937

3.  Problems and solutions for estimating indel rates and length distributions.

Authors:  Reed A Cartwright
Journal:  Mol Biol Evol       Date:  2008-11-28       Impact factor: 16.240

4.  Genome-wide nucleotide-level mammalian ancestor reconstruction.

Authors:  Benedict Paten; Javier Herrero; Stephen Fitzgerald; Kathryn Beal; Paul Flicek; Ian Holmes; Ewan Birney
Journal:  Genome Res       Date:  2008-10-10       Impact factor: 9.043

Review 5.  Systems genetics in "-omics" era: current and future development.

Authors:  Hong Li
Journal:  Theory Biosci       Date:  2012-11-09       Impact factor: 1.919

6.  Local conservation scores without a priori assumptions on neutral substitution rates.

Authors:  Janis Dingel; Pavol Hanus; Niccolò Leonardi; Joachim Hagenauer; Jürgen Zech; Jakob C Mueller
Journal:  BMC Bioinformatics       Date:  2008-04-11       Impact factor: 3.169

7.  Evolutionary triplet models of structured RNA.

Authors:  Robert K Bradley; Ian Holmes
Journal:  PLoS Comput Biol       Date:  2009-08-28       Impact factor: 4.475

8.  In silico evidence for functional specialization after genome duplication in yeast.

Authors:  Ossi Turunen; Ralph Seelke; Jed Macosko
Journal:  FEMS Yeast Res       Date:  2009-02       Impact factor: 2.796

9.  Towards realistic benchmarks for multiple alignments of non-coding sequences.

Authors:  Jaebum Kim; Saurabh Sinha
Journal:  BMC Bioinformatics       Date:  2010-01-26       Impact factor: 3.169

10.  Evolution of regulatory sequences in 12 Drosophila species.

Authors:  Jaebum Kim; Xin He; Saurabh Sinha
Journal:  PLoS Genet       Date:  2009-01-09       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.