
Multiple sequence alignment using partial order graphs.

Christopher Lee, Catherine Grasso, Mark F Sharlow.

Abstract

MOTIVATION: Progressive Multiple Sequence Alignment (MSA) methods depend on reducing an MSA to a linear profile for each alignment step. However, this leads to loss of information needed for accurate alignment, and gap scoring artifacts.
RESULTS: We present a graph representation of an MSA that can itself be aligned directly by pairwise dynamic programming, eliminating the need to reduce the MSA to a profile. This enables our algorithm, Partial Order Alignment (POA), to guarantee that the optimal alignment of each new sequence versus each sequence in the MSA will be considered. Moreover, this algorithm introduces a new edit operator, homologous recombination, important for multidomain sequences. The algorithm has improved speed (linear time complexity) over existing MSA algorithms, enabling construction of massive and complex alignments (e.g. an alignment of 5000 sequences in 4 h on a Pentium II). We demonstrate the utility of this algorithm on a family of multidomain SH2 proteins, and on EST assemblies containing alternative splicing and polymorphism.
AVAILABILITY: The partial order alignment program POA is available at http://www.bioinformatics.ucla.edu/poa.
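
ILLUSTRATION: The idea of aligning a sequence directly against a graph, rather than against a linear profile, can be sketched as follows. This is a minimal sketch and not the POA program itself. The function name `align_to_pog`, the `labels`/`preds` graph encoding, the linear gap penalty, the global (end-to-end) scoring, and the score values are all assumptions made for illustration; the abstract does not specify any of them. The only change from ordinary pairwise dynamic programming is that each graph node takes the best score over all of its predecessor nodes instead of over a single previous position.

```python
from typing import List


def align_to_pog(seq: str, labels: List[str], preds: List[List[int]],
                 match: int = 1, mismatch: int = -1, gap: int = -1) -> int:
    """Return the best global alignment score of `seq` against any path
    through a partial order graph (hypothetical encoding).

    labels[i] is the residue stored at node i.
    preds[i]  lists the predecessor nodes of node i.
    Nodes are assumed to be numbered in topological order, and the graph
    is assumed to be non-empty.
    """
    n = len(seq)
    # Row of a virtual start node: a prefix of seq aligned to nothing.
    start = [j * gap for j in range(n + 1)]
    rows: List[List[int]] = []
    for i, label in enumerate(labels):
        # Nodes without predecessors follow the virtual start node.
        prev_rows = [rows[p] for p in preds[i]] or [start]
        # Column 0: this node is aligned to a gap in seq.
        row = [max(r[0] for r in prev_rows) + gap]
        for j in range(1, n + 1):
            s = match if seq[j - 1] == label else mismatch
            # Best of (mis)match or node-vs-gap over ALL predecessors.
            best = max(max(r[j - 1] + s, r[j] + gap) for r in prev_rows)
            # Residue-vs-gap stays on the same node, as in the linear case.
            row.append(max(best, row[j - 1] + gap))
        rows.append(row)
    # A global alignment must finish on a node that has no successor.
    has_successor = {p for ps in preds for p in ps}
    ends = [i for i in range(len(labels)) if i not in has_successor]
    return max(rows[i][n] for i in ends)


# Toy graph holding two aligned sequences, ACGT and AGGT, which share
# nodes 0 (A), 3 (G) and 4 (T) and branch at nodes 1 (C) and 2 (G).
labels = ["A", "C", "G", "G", "T"]
preds = [[], [0], [0], [1, 2], [3]]
score_acgt = align_to_pog("ACGT", labels, preds)
score_aggt = align_to_pog("AGGT", labels, preds)
```

In this toy graph both member sequences are scored as exact matches, because each one corresponds to its own path through the branch; a single consensus profile column at the branch point would not preserve that distinction.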

Year:  2002        PMID: 11934745     DOI: 10.1093/bioinformatics/18.3.452

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


Cited by: 329 in total

1.  Morphological and molecular identification of the ectomycorrhizal association of Lactarius fumosibrunneus and Fagus grandifolia var. mexicana trees in eastern Mexico.

Authors:  Edith Garay-Serrano; Victor Manuel Bandala; Leticia Montoya
Journal:  Mycorrhiza       Date:  2012-03-09       Impact factor: 3.387

2.  A large complement of the predicted Arabidopsis ARM repeat proteins are members of the U-box E3 ubiquitin ligase family.

Authors:  Yashwanti Mudgil; Shin-Han Shiu; Sophia L Stone; Jennifer N Salt; Daphne R Goring
Journal:  Plant Physiol       Date:  2003-12-04       Impact factor: 8.340

3.  Finding functional sequence elements by multiple local alignment.

Authors:  Martin C Frith; Ulla Hansen; John L Spouge; Zhiping Weng
Journal:  Nucleic Acids Res       Date:  2004-01-02       Impact factor: 16.971

4.  Contact-based sequence alignment.

Authors:  Jens Kleinjung; John Romein; Kuang Lin; Jaap Heringa
Journal:  Nucleic Acids Res       Date:  2004-04-30       Impact factor: 16.971

5.  DIALIGN: multiple DNA and protein sequence alignment at BiBiServ.

Authors:  Burkhard Morgenstern
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

6.  Mauve: multiple alignment of conserved genomic sequence with rearrangements.

Authors:  Aaron C E Darling; Bob Mau; Frederick R Blattner; Nicole T Perna
Journal:  Genome Res       Date:  2004-07       Impact factor: 9.043

7.  De novo repeat classification and fragment assembly.

Authors:  Pavel A Pevzner; Haixu Tang; Glenn Tesler
Journal:  Genome Res       Date:  2004-09       Impact factor: 9.043

8.  Aligning multiple genomic sequences with the threaded blockset aligner.

Authors:  Mathieu Blanchette; W James Kent; Cathy Riemer; Laura Elnitski; Arian F A Smit; Krishna M Roskin; Robert Baertsch; Kate Rosenbloom; Hiram Clawson; Eric D Green; David Haussler; Webb Miller
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

9.  Evidence for a subpopulation of conserved alternative splicing events under selection pressure for protein reading frame preservation.

Authors:  Alissa Resch; Yi Xing; Alexander Alekseyenko; Barmak Modrek; Christopher Lee
Journal:  Nucleic Acids Res       Date:  2004-02-24       Impact factor: 16.971

10.  MIPS: analysis and annotation of proteins from whole genomes.

Authors:  H W Mewes; C Amid; R Arnold; D Frishman; U Güldener; G Mannhaupt; M Münsterkötter; P Pagel; N Strack; V Stümpflen; J Warfsmann; A Ruepp
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971
