Literature DB >> 15520295

A novel method for multiple alignment of sequences with repeated and shuffled elements.

Benjamin Raphael1, Degui Zhi, Haixu Tang, Pavel Pevzner.   

Abstract

We describe ABA (A-Bruijn alignment), a new method for multiple alignment of biological sequences. The major difference between ABA and existing multiple alignment methods is that ABA represents an alignment as a directed graph, possibly containing cycles. This representation provides more flexibility than does a traditional alignment matrix or the recently introduced partial order alignment (POA) graph by allowing a larger class of evolutionary relationships between the aligned sequences. Our graph representation is particularly well-suited to the alignment of protein sequences with shuffled and/or repeated domain structure, and allows one to construct multiple alignments of proteins containing (1) domains that are not present in all proteins, (2) domains that are present in different orders in different proteins, and (3) domains that are present in multiple copies in some proteins. In addition, ABA is useful in the alignment of genomic sequences that contain duplications and inversions. We provide several examples illustrating the applications of ABA.

Entities:  

Mesh:

Year:  2004        PMID: 15520295      PMCID: PMC525693          DOI: 10.1101/gr.2657504

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  39 in total

1.  An Eulerian path approach to global multiple alignment for DNA sequences.

Authors:  Yu Zhang; Michael S Waterman
Journal:  J Comput Biol       Date:  2003       Impact factor: 1.479

2.  A workbench for multiple alignment construction and analysis.

Authors:  G D Schuler; S F Altschul; D J Lipman
Journal:  Proteins       Date:  1991

Review 3.  1-Tuple DNA sequencing: computer analysis.

Authors:  P A Pevzner
Journal:  J Biomol Struct Dyn       Date:  1989-08

4.  On the complexity of multiple sequence alignment.

Authors:  L Wang; T Jiang
Journal:  J Comput Biol       Date:  1994       Impact factor: 1.479

5.  Approximate matching of network expressions with spacers.

Authors:  E W Myers
Journal:  J Comput Biol       Date:  1996       Impact factor: 1.479

6.  Multiple DNA and protein sequence alignment based on segment-to-segment comparison.

Authors:  B Morgenstern; A Dress; T Werner
Journal:  Proc Natl Acad Sci U S A       Date:  1996-10-29       Impact factor: 11.205

7.  Extracting protein alignment models from the sequence database.

Authors:  A F Neuwald; J S Liu; D J Lipman; C E Lawrence
Journal:  Nucleic Acids Res       Date:  1997-05-01       Impact factor: 16.971

Review 8.  The multiplicity of domains in proteins.

Authors:  R F Doolittle
Journal:  Annu Rev Biochem       Date:  1995       Impact factor: 23.643

9.  A new algorithm for DNA sequence assembly.

Authors:  R M Idury; M S Waterman
Journal:  J Comput Biol       Date:  1995       Impact factor: 1.479

10.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

Authors:  J D Thompson; D G Higgins; T J Gibson
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

View more
  40 in total

1.  CAGE: Combinatorial Analysis of Gene-cluster Evolution.

Authors:  Giltae Song; Louxin Zhang; Tomas Vinar; Webb Miller
Journal:  J Comput Biol       Date:  2010-09       Impact factor: 1.479

2.  Short read fragment assembly of bacterial genomes.

Authors:  Mark J Chaisson; Pavel A Pevzner
Journal:  Genome Res       Date:  2007-12-14       Impact factor: 9.043

3.  Multiple whole-genome alignments without a reference organism.

Authors:  Inna Dubchak; Alexander Poliakov; Andrey Kislyuk; Michael Brudno
Journal:  Genome Res       Date:  2009-01-28       Impact factor: 9.043

4.  Enredo and Pecan: genome-wide mammalian consistency-based multiple alignment with paralogs.

Authors:  Benedict Paten; Javier Herrero; Kathryn Beal; Stephen Fitzgerald; Ewan Birney
Journal:  Genome Res       Date:  2008-10-10       Impact factor: 9.043

5.  Cactus: Algorithms for genome multiple sequence alignment.

Authors:  Benedict Paten; Dent Earl; Ngan Nguyen; Mark Diekhans; Daniel Zerbino; David Haussler
Journal:  Genome Res       Date:  2011-06-10       Impact factor: 9.043

Review 6.  Visualization of multiple alignments, phylogenies and gene family evolution.

Authors:  James B Procter; Julie Thompson; Ivica Letunic; Chris Creevey; Fabrice Jossinet; Geoffrey J Barton
Journal:  Nat Methods       Date:  2010-03       Impact factor: 28.547

7.  progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement.

Authors:  Aaron E Darling; Bob Mau; Nicole T Perna
Journal:  PLoS One       Date:  2010-06-25       Impact factor: 3.240

8.  The construction and use of log-odds substitution scores for multiple sequence alignment.

Authors:  Stephen F Altschul; John C Wootton; Elena Zaslavsky; Yi-Kuo Yu
Journal:  PLoS Comput Biol       Date:  2010-07-15       Impact factor: 4.475

9.  Mapping sequences by parts.

Authors:  Gilles Didier; Carito Guziolowski
Journal:  Algorithms Mol Biol       Date:  2007-09-19       Impact factor: 1.405

10.  Evolutionary history reconstruction for Mammalian complex gene clusters.

Authors:  Yu Zhang; Giltae Song; Tomás Vinar; Eric D Green; Adam Siepel; Webb Miller
Journal:  J Comput Biol       Date:  2009-08       Impact factor: 1.479

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.