Improving the efficiency of dot-matrix similarity searches through use of an oligomer table.

B Fristensky.   

Abstract

Dot-matrix sequence similarity searches can be greatly sped up by using a table that lists all locations of short oligomers in one sequence to find potential similarities with a second sequence. The algorithm described finds similarities between two sequences of lengths M and N, comparing L residues at a time, with an efficiency of L × M × N / S^k, where S is the alphabet size and k is the length of the oligomer. For nucleic acids, in which S = 4, use of a tetranucleotide table (k = 4) results in an efficiency of L × M × N / 256. The simplicity of the approach allows for a straightforward calculation of the level of similarity expected to be found for given search parameters. Furthermore, the storage required is minimal, allowing even large sequences to be compared on small microcomputers. Theoretical considerations regarding the use of this search are discussed.
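The oligomer-table idea in the abstract can be illustrated with a minimal Python sketch. This is not the author's implementation: the function names, the `window` and `min_matches` parameters, and the rule that a window is scored only when it begins with a shared k-mer are all assumptions made for illustration. The last function simply evaluates the cost estimate L × M × N / S^k quoted in the abstract.

```python
from collections import defaultdict


def build_oligomer_table(seq, k):
    """Map each k-mer in seq to the list of 0-based positions where it starts."""
    table = defaultdict(list)
    for i in range(len(seq) - k + 1):
        table[seq[i:i + k]].append(i)
    return table


def dot_matrix_hits(seq_a, seq_b, k=4, window=8, min_matches=7):
    """Return (i, j) pairs whose windows share at least min_matches residues.

    Assumption (hypothetical seeding rule): a window pair is scored only if
    the k-mer at seq_b[j] also occurs at seq_a[i], so most of the M x N
    matrix is never examined.
    """
    table = build_oligomer_table(seq_a, k)
    hits = []
    for j in range(len(seq_b) - window + 1):
        # Look up every position in seq_a that starts with the same k-mer.
        for i in table.get(seq_b[j:j + k], ()):
            if i + window > len(seq_a):
                continue  # window would run off the end of seq_a
            matches = sum(
                a == b
                for a, b in zip(seq_a[i:i + window], seq_b[j:j + window])
            )
            if matches >= min_matches:
                hits.append((i, j))
    return hits


def expected_comparisons(L, M, N, S, k):
    """Evaluate the abstract's cost estimate L * M * N / S**k."""
    return L * M * N / S ** k
```

Under these assumptions, comparing two 1000-residue nucleic acid sequences with L = 8 and a tetranucleotide table gives `expected_comparisons(8, 1000, 1000, 4, 4)`, that is 8 × 10^6 / 256 residue comparisons, rather than the 8 × 10^6 needed to score every cell of the matrix.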

Year:  1986        PMID: 3753792      PMCID: PMC339447          DOI: 10.1093/nar/14.1.597

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


