An efficient string matching algorithm with k differences for nucleotide and amino acid sequences.

Abstract

There are a few algorithms designed to solve the problem of the optimal alignment of one sequence, the pattern, of length m, with another, longer sequence, the text, of length n. These algorithms allow mismatches, deletions and insertions. Algorithms to date run in O(mn) time. Let us define an integer, k, which is the maximal number of differences allowed. We present a simple algorithm showing that sequences can be optimally aligned in O(k²n) time. For long sequences the gain factor over the currently used algorithms is very large.
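To make the problem statement concrete, the following is a minimal sketch of the classical O(mn) dynamic-programming baseline that the abstract compares against. It is not the O(k²n) algorithm presented in the paper. The function name `k_difference_matches` and its return format are assumptions made for illustration.

```python
from __future__ import annotations


def k_difference_matches(pattern: str, text: str, k: int) -> list[tuple[int, int]]:
    """Baseline O(mn) search (illustrative, not the paper's method).

    Returns (end_index, differences) for every position in `text` where some
    substring ending there matches `pattern` with at most k differences
    (mismatches, insertions, deletions).
    """
    m = len(pattern)
    # col[i] = fewest differences between pattern[:i] and any substring of
    # text ending at the current position. col[0] is always 0, so an
    # occurrence may start anywhere in the text.
    col = list(range(m + 1))
    hits = []
    for j, c in enumerate(text):
        prev_diag = col[0]  # col[i - 1] as it was in the previous column
        for i in range(1, m + 1):
            old = col[i]
            col[i] = min(
                prev_diag + (pattern[i - 1] != c),  # match or mismatch
                old + 1,                            # extra character in text
                col[i - 1] + 1,                     # pattern character missing from text
            )
            prev_diag = old
        if col[m] <= k:
            hits.append((j, col[m]))
    return hits
```

For example, `k_difference_matches("ACGT", "AGGT", 1)` reports a single occurrence ending at index 3 with one difference. Every text position costs O(m) work here, which is where the O(mn) bound comes from; by the abstract's own bounds, replacing it with O(k²n) gives a gain on the order of m/k² when k is small relative to m.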

Year: 1986
PMID: 3753770
PMCID: PMC339353
DOI: 10.1093/nar/14.1.31

Source DB: PubMed
Journal: Nucleic Acids Res
ISSN: 0305-1048
Impact factor: 16.971

1. Pattern recognition in genetic sequences.

2. Matching sequences under deletion-insertion constraints.

3. Estimation of secondary structure in ribonucleic acids.

4. A general method applicable to the search for similarities in the amino acid sequence of two proteins.

5. Fast algorithm for predicting the secondary structure of single-stranded RNA.

6. Efficient algorithms for folding and comparing nucleic acid sequences.

7. Fast optimal alignment.

8. Enhanced graphic matrix analysis of nucleic acid and protein sequences.

9. Pattern recognition in nucleic acid sequences. I. A general method for finding local homologies and symmetries.

10. Rapid similarity searches of nucleic acid and protein data banks.

1. ASEtrap: a biological method for speeding up the exploration of spliceomes.

2. GOSSIP: a method for fast and accurate global alignment of protein structures.

3. Formal language theory and DNA: an analysis of the generative capacity of specific recombinant behaviors.

4. Edlib: a C/C++ library for fast, exact sequence alignment using edit distance.

5. RNA structure prediction using positive and negative evolutionary information.