Literature DB >> 3753770

An efficient string matching algorithm with k differences for nucleotide and amino acid sequences.

G M Landau, U Vishkin, R Nussinov.   

Abstract

There are a few algorithms designed to solve the problem of the optimal alignment of one sequence, the pattern, of length m, with another, longer sequence the text, of length n. These algorithms allow mismatches, deletions and insertions. Algorithms to date run in O(mn) time. Let us define an integer, k, which is the maximal number of differences allowed. We present a simple algorithm showing that sequences can be optimally aligned in O(k2n) time. For long sequences the gain factor over the currently used algorithms is very large.

Mesh:

Year:  1986        PMID: 3753770      PMCID: PMC339353          DOI: 10.1093/nar/14.1.31

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  12 in total

1.  Pattern recognition in genetic sequences.

Authors:  P H Sellers
Journal:  Proc Natl Acad Sci U S A       Date:  1979-07       Impact factor: 11.205

2.  Matching sequences under deletion-insertion constraints.

Authors:  D Sankoff
Journal:  Proc Natl Acad Sci U S A       Date:  1972-01       Impact factor: 11.205

3.  Estimation of secondary structure in ribonucleic acids.

Authors:  I Tinoco; O C Uhlenbeck; M D Levine
Journal:  Nature       Date:  1971-04-09       Impact factor: 49.962

4.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

5.  Fast algorithm for predicting the secondary structure of single-stranded RNA.

Authors:  R Nussinov; A B Jacobson
Journal:  Proc Natl Acad Sci U S A       Date:  1980-11       Impact factor: 11.205

6.  Efficient algorithms for folding and comparing nucleic acid sequences.

Authors:  J P Dumas; J Ninio
Journal:  Nucleic Acids Res       Date:  1982-01-11       Impact factor: 16.971

7.  Fast optimal alignment.

Authors:  J W Fickett
Journal:  Nucleic Acids Res       Date:  1984-01-11       Impact factor: 16.971

8.  Enhanced graphic matrix analysis of nucleic acid and protein sequences.

Authors:  J V Maizel; R P Lenk
Journal:  Proc Natl Acad Sci U S A       Date:  1981-12       Impact factor: 11.205

9.  Pattern recognition in nucleic acid sequences. I. A general method for finding local homologies and symmetries.

Authors:  W B Goad; M I Kanehisa
Journal:  Nucleic Acids Res       Date:  1982-01-11       Impact factor: 16.971

10.  Rapid similarity searches of nucleic acid and protein data banks.

Authors:  W J Wilbur; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1983-02       Impact factor: 11.205

View more
  5 in total

1.  ASEtrap: a biological method for speeding up the exploration of spliceomes.

Authors:  Gilbert Thill; Vanina Castelli; Sophie Pallud; Marcel Salanoubat; Patrick Wincker; Pierre de la Grange; Didier Auboeuf; Vincent Schächter; Jean Weissenbach
Journal:  Genome Res       Date:  2006-05-08       Impact factor: 9.043

2.  GOSSIP: a method for fast and accurate global alignment of protein structures.

Authors:  I Kifer; R Nussinov; H J Wolfson
Journal:  Bioinformatics       Date:  2011-02-03       Impact factor: 6.937

3.  Formal language theory and DNA: an analysis of the generative capacity of specific recombinant behaviors.

Authors:  T Head
Journal:  Bull Math Biol       Date:  1987       Impact factor: 1.758

4.  Edlib: a C/C ++ library for fast, exact sequence alignment using edit distance.

Authors:  Martin Šošic; Mile Šikic
Journal:  Bioinformatics       Date:  2017-05-01       Impact factor: 6.937

5.  RNA structure prediction using positive and negative evolutionary information.

Authors:  Elena Rivas
Journal:  PLoS Comput Biol       Date:  2020-10-30       Impact factor: 4.475

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.