Literature DB >> 7680269

Efficient methods for multiple sequence alignment with guaranteed error bounds.

D Gusfield1.   

Abstract

Multiple string (sequence) alignment is a difficult and important problem in computational biology, where it is central in two related tasks: finding highly conserved subregions or embedded patterns of a set of biological sequences (strings of DNA, RNA or amino acids), and inferring the evolutionary history of a set of taxa from their associated biological sequences. Several precise measures have been proposed for evaluating the goodness of a multiple alignment, but no efficient methods are known which compute the optimal alignment for any of these measures in any but small cases. In this paper, we consider two previously proposed measures, and give two computationaly efficient multiple alignment methods (one for each measure) whose deviation from the optimal value is guaranteed to be less than a factor of two. This is the novel feature of thse methods. but the methods have additional virtues as well. For both methods, the guaranteed bounds are much smaller than two when the number of strings is small (1.33 for three strings of any length); for one of the methods we give a related randomized method which is much faster and which gives, with high probability, multiple alignments with fairly small error bounds; and for the other measure, the method given yields a non-obvious lower bound on the optimal alignment.

Entities:  

Mesh:

Substances:

Year:  1993        PMID: 7680269     DOI: 10.1007/bf02460299

Source DB:  PubMed          Journal:  Bull Math Biol        ISSN: 0092-8240            Impact factor:   1.758


  8 in total

1.  A workbench for multiple alignment construction and analysis.

Authors:  G D Schuler; S F Altschul; D J Lipman
Journal:  Proteins       Date:  1991

2.  Gap costs for multiple sequence alignment.

Authors:  S F Altschul
Journal:  J Theor Biol       Date:  1989-06-08       Impact factor: 2.691

3.  A method for the simultaneous alignment of three or more amino acid sequences.

Authors:  M S Johnson; R F Doolittle
Journal:  J Mol Evol       Date:  1986       Impact factor: 2.395

4.  A tool for multiple sequence alignment.

Authors:  D J Lipman; S F Altschul; J D Kececioglu
Journal:  Proc Natl Acad Sci U S A       Date:  1989-06       Impact factor: 11.205

5.  Multiple sequence alignment.

Authors:  D J Bacon; W F Anderson
Journal:  J Mol Biol       Date:  1986-09-20       Impact factor: 5.469

6.  Multiple sequence alignment by consensus.

Authors:  M S Waterman
Journal:  Nucleic Acids Res       Date:  1986-11-25       Impact factor: 16.971

7.  Simultaneous comparison of three protein sequences.

Authors:  M Murata; J S Richardson; J L Sussman
Journal:  Proc Natl Acad Sci U S A       Date:  1985-05       Impact factor: 11.205

8.  Progressive sequence alignment as a prerequisite to correct phylogenetic trees.

Authors:  D F Feng; R F Doolittle
Journal:  J Mol Evol       Date:  1987       Impact factor: 2.395

  8 in total
  8 in total

1.  Multiple structural alignment by secondary structures: algorithm and applications.

Authors:  Oranit Dror; Hadar Benyamini; Ruth Nussinov; Haim J Wolfson
Journal:  Protein Sci       Date:  2003-11       Impact factor: 6.725

2.  A memory-efficient algorithm for multiple sequence alignment with constraints.

Authors:  Chin Lung Lu; Yen Pin Huang
Journal:  Bioinformatics       Date:  2004-09-16       Impact factor: 6.937

3.  Hidden Markov models of biological primary sequence information.

Authors:  P Baldi; Y Chauvin; T Hunkapiller; M A McClure
Journal:  Proc Natl Acad Sci U S A       Date:  1994-02-01       Impact factor: 11.205

4.  A combinatorial optimization approach for diverse motif finding applications.

Authors:  Elena Zaslavsky; Mona Singh
Journal:  Algorithms Mol Biol       Date:  2006-08-17       Impact factor: 1.405

5.  A multiple-template approach to protein threading.

Authors:  Jian Peng; Jinbo Xu
Journal:  Proteins       Date:  2011-04-04

6.  VSEARCH: a versatile open source tool for metagenomics.

Authors:  Torbjørn Rognes; Tomáš Flouri; Ben Nichols; Christopher Quince; Frédéric Mahé
Journal:  PeerJ       Date:  2016-10-18       Impact factor: 2.984

7.  Lower bounds on multiple sequence alignment using exact 3-way alignment.

Authors:  Charles J Colbourn; Sudhir Kumar
Journal:  BMC Bioinformatics       Date:  2007-04-30       Impact factor: 3.169

8.  A multi-template combination algorithm for protein comparative modeling.

Authors:  Jianlin Cheng
Journal:  BMC Struct Biol       Date:  2008-03-17
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.