Literature DB >> 15374876

A memory-efficient algorithm for multiple sequence alignment with constraints.

Chin Lung Lu1, Yen Pin Huang.   

Abstract

MOTIVATION: Recently, the concept of the constrained sequence alignment was proposed to incorporate the knowledge of biologists about structures/functionalities/consensuses of their datasets into sequence alignment such that the user-specified residues/nucleotides are aligned together in the computed alignment. The currently developed programs use the so-called progressive approach to efficiently obtain a constrained alignment of several sequences. However, the kernels of these programs, the dynamic programming algorithms for computing an optimal constrained alignment between two sequences, run in (gamman2) memory, where gamma is the number of the constraints and n is the maximum of the lengths of sequences. As a result, such a high memory requirement limits the overall programs to align short sequences only.
RESULTS: We adopt the divide-and-conquer approach to design a memory-efficient algorithm for computing an optimal constrained alignment between two sequences, which greatly reduces the memory requirement of the dynamic programming approaches at the expense of a small constant factor in CPU time. This new algorithm consumes only O(alphan) space, where alpha is the sum of the lengths of constraints and usually alpha << n in practical applications. Based on this algorithm, we have developed a memory-efficient tool for multiple sequence alignment with constraints. AVAILABILITY: http://genome.life.nctu.edu.tw/MUSICME.

Mesh:

Year:  2004        PMID: 15374876      PMCID: PMC7109922          DOI: 10.1093/bioinformatics/bth468

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  29 in total

1.  Improvement of the A(*) Algorithm for Multiple Sequence Alignment.

Authors: 
Journal:  Genome Inform Ser Workshop Genome Inform       Date:  1998

Review 2.  Recent progress in multiple sequence alignment: a survey.

Authors:  Cédric Notredame
Journal:  Pharmacogenomics       Date:  2002-01       Impact factor: 2.533

3.  MATCH-BOX: a fundamentally new algorithm for the simultaneous alignment of several protein sequences.

Authors:  E Depiereux; E Feytmans
Journal:  Comput Appl Biosci       Date:  1992-10

4.  Constrained multiple sequence alignment tool development and its application to RNase family alignment.

Authors:  Chuan Yi Tang; Chin Lung Lu; Margaret Dah-Tsyr Chang; Yin-Te Tsai; Yuh-Ju Sun; Kun-Mao Chao; Jia-Ming Chang; Yu-Han Chiou; Chia-Mao Wu; Hao-Teng Chang; Wei-I Chou
Journal:  J Bioinform Comput Biol       Date:  2003-07       Impact factor: 1.122

5.  MuSiC: a tool for multiple sequence alignment with constraints.

Authors:  Yin Te Tsai; Yen Pin Huang; Ching Ta Yu; Chin Lung Lu
Journal:  Bioinformatics       Date:  2004-04-01       Impact factor: 6.937

6.  A workbench for multiple alignment construction and analysis.

Authors:  G D Schuler; S F Altschul; D J Lipman
Journal:  Proteins       Date:  1991

7.  A general method for fast multiple sequence alignment.

Authors:  U Tönges; S W Perrey; J Stoye; A W Dress
Journal:  Gene       Date:  1996-06-12       Impact factor: 3.688

8.  Progressive multiple alignment with constraints.

Authors:  G Myers; S Selznick; Z Zhang; W Miller
Journal:  J Comput Biol       Date:  1996       Impact factor: 1.479

9.  Optimal alignments in linear space.

Authors:  E W Myers; W Miller
Journal:  Comput Appl Biosci       Date:  1988-03

10.  Comparative analysis of multiple protein-sequence alignment methods.

Authors:  M A McClure; T K Vasi; W M Fitch
Journal:  Mol Biol Evol       Date:  1994-07       Impact factor: 16.240

View more
  1 in total

1.  RE-MuSiC: a tool for multiple sequence alignment with regular expression constraints.

Authors:  Yun-Sheng Chung; Wei-Hsun Lee; Chuan Yi Tang; Chin Lung Lu
Journal:  Nucleic Acids Res       Date:  2007-05-08       Impact factor: 16.971

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.