Literature DB >> 16000407

An algorithm for progressive multiple alignment of sequences with insertions.

Ari Löytynoja1, Nick Goldman.   

Abstract

Dynamic programming algorithms guarantee to find the optimal alignment between two sequences. For more than a few sequences, exact algorithms become computationally impractical, and progressive algorithms iterating pairwise alignments are widely used. These heuristic methods have a serious drawback because pairwise algorithms do not differentiate insertions from deletions and end up penalizing single insertion events multiple times. Such an unrealistically high penalty for insertions typically results in overmatching of sequences and an underestimation of the number of insertion events. We describe a modification of the traditional alignment algorithm that can distinguish insertion from deletion and avoid repeated penalization of insertions and illustrate this method with a pair hidden Markov model that uses an evolutionary scoring function. In comparison with a traditional progressive alignment method, our algorithm infers a greater number of insertion events and creates gaps that are phylogenetically consistent but spatially less concentrated. Our results suggest that some insertion/deletion "hot spots" may actually be artifacts of traditional alignment algorithms.

Mesh:

Year:  2005        PMID: 16000407      PMCID: PMC1180752          DOI: 10.1073/pnas.0409137102

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  14 in total

1.  A workbench for multiple alignment construction and analysis.

Authors:  G D Schuler; S F Altschul; D J Lipman
Journal:  Proteins       Date:  1991

Review 2.  Profile hidden Markov models.

Authors:  S R Eddy
Journal:  Bioinformatics       Date:  1998       Impact factor: 6.937

3.  A new method that simultaneously aligns and reconstructs ancestral sequences for any number of homologous sequences, when the phylogeny is given.

Authors:  J Hein
Journal:  Mol Biol Evol       Date:  1989-11       Impact factor: 16.240

4.  Optimal alignments in linear space.

Authors:  E W Myers; W Miller
Journal:  Comput Appl Biosci       Date:  1988-03

5.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

6.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

Authors:  J D Thompson; D G Higgins; T J Gibson
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

7.  The alignment of sets of sequences and the construction of phyletic trees: an integrated method.

Authors:  P Hogeweg; B Hesper
Journal:  J Mol Evol       Date:  1984       Impact factor: 2.395

8.  Evolutionary trees from DNA sequences: a maximum likelihood approach.

Authors:  J Felsenstein
Journal:  J Mol Evol       Date:  1981       Impact factor: 2.395

9.  An improved algorithm for matching biological sequences.

Authors:  O Gotoh
Journal:  J Mol Biol       Date:  1982-12-15       Impact factor: 5.469

10.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA.

Authors:  M Hasegawa; H Kishino; T Yano
Journal:  J Mol Evol       Date:  1985       Impact factor: 2.395

View more
  403 in total

1.  Hydractinia allodeterminant alr1 resides in an immunoglobulin superfamily-like gene complex.

Authors:  Sabrina F P Rosa; Anahid E Powell; Rafael D Rosengarten; Matthew L Nicotra; Maria A Moreno; Jane Grimwood; Fadi G Lakkis; Stephen L Dellaporta; Leo W Buss
Journal:  Curr Biol       Date:  2010-05-27       Impact factor: 10.834

Review 2.  Molecular phylogenetics: principles and practice.

Authors:  Ziheng Yang; Bruce Rannala
Journal:  Nat Rev Genet       Date:  2012-03-28       Impact factor: 53.242

3.  Anti-predator defence drives parallel morphological evolution in flea beetles.

Authors:  Deyan Ge; Douglas Chesters; Jesús Gómez-Zurita; Lijie Zhang; Xingke Yang; Alfried P Vogler
Journal:  Proc Biol Sci       Date:  2010-12-15       Impact factor: 5.349

4.  Nucleotide diversity of a genomic sequence similar to SHATTERPROOF (PvSHP1) in domesticated and wild common bean (Phaseolus vulgaris L.).

Authors:  L Nanni; E Bitocchi; E Bellucci; M Rossi; D Rau; G Attene; P Gepts; R Papa
Journal:  Theor Appl Genet       Date:  2011-08-10       Impact factor: 5.699

5.  Conservation of Regional Variation in Sex-Specific Sex Chromosome Regulation.

Authors:  Alison E Wright; Fabian Zimmer; Peter W Harrison; Judith E Mank
Journal:  Genetics       Date:  2015-08-05       Impact factor: 4.562

6.  DNA-dependent formation of transcription factor pairs alters their binding specificity.

Authors:  Arttu Jolma; Yimeng Yin; Kazuhiro R Nitta; Kashyap Dave; Alexander Popov; Minna Taipale; Martin Enge; Teemu Kivioja; Ekaterina Morgunova; Jussi Taipale
Journal:  Nature       Date:  2015-11-09       Impact factor: 49.962

7.  Genomic analysis of snub-nosed monkeys (Rhinopithecus) identifies genes and processes related to high-altitude adaptation.

Authors:  Li Yu; Guo-Dong Wang; Jue Ruan; Yong-Bin Chen; Cui-Ping Yang; Xue Cao; Hong Wu; Yan-Hu Liu; Zheng-Lin Du; Xiao-Ping Wang; Jing Yang; Shao-Chen Cheng; Li Zhong; Lu Wang; Xuan Wang; Jing-Yang Hu; Lu Fang; Bing Bai; Kai-Le Wang; Na Yuan; Shi-Fang Wu; Bao-Guo Li; Jin-Guo Zhang; Ye-Qin Yang; Cheng-Lin Zhang; Yong-Cheng Long; Hai-Shu Li; Jing-Yuan Yang; David M Irwin; Oliver A Ryder; Ying Li; Chung-I Wu; Ya-Ping Zhang
Journal:  Nat Genet       Date:  2016-07-11       Impact factor: 38.330

8.  A genome-wide identification of genes potentially associated with host specificity of Brucella species.

Authors:  Kyung Mo Kim; Kyu-Won Kim; Samsun Sung; Heebal Kim
Journal:  J Microbiol       Date:  2011-11-09       Impact factor: 3.422

9.  Sequential duplications of an ancient member of the DnaJ-family expanded the functional chaperone network in the eukaryotic cytosol.

Authors:  Chandan Sahi; Jacek Kominek; Thomas Ziegelhoffer; Hyun Young Yu; Maciej Baranowski; Jaroslaw Marszalek; Elizabeth A Craig
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

10.  Multiple evolution of flavonoid 3',5'-hydroxylase.

Authors:  Christian Seitz; Stefanie Ameres; Karin Schlangen; Gert Forkmann; Heidi Halbwirth
Journal:  Planta       Date:  2015-04-28       Impact factor: 4.116

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.