Literature DB >> 8127677

A fast word search algorithm for the representation of sequence similarity in genomic DNA.

C Lefèvre1, J E Ikeda.   

Abstract

Representation of sequence similarity by dot matrix plots is a method widely used for comparing biological sequences. The user is presented with an overall view of similarity between two sequences. Computation of this plot has been reconsidered here. An improvement is proposed through the preprocessing of the data into an automation recognizing the word structure of a sequence. The main advantage of this approach is to systematically eliminate the repetitions during word comparison. Simple heuristics are also considered to greatly speed up pattern matching. As a result, large sequences are handled very efficiently. This is illustrated by a comparison of large genomic DNA. The algorithm has been implemented in an interactive application on a microcomputer.

Mesh:

Substances:

Year:  1994        PMID: 8127677      PMCID: PMC523596          DOI: 10.1093/nar/22.3.404

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  18 in total

1.  Locating gaps in amino acid sequences to optimize the homology between two proteins.

Authors:  W M Fitch
Journal:  Biochem Genet       Date:  1969-04       Impact factor: 1.890

2.  The diagram, a method for comparing sequences. Its use with amino acid and nucleotide sequences.

Authors:  A J Gibbs; G A McIntyre
Journal:  Eur J Biochem       Date:  1970-09

3.  Locating well-conserved regions within a pairwise alignment.

Authors:  K M Chao; R C Hardison; W Miller
Journal:  Comput Appl Biosci       Date:  1993-08

4.  Matrix program to analyze primary structure homology.

Authors:  J Novotny
Journal:  Nucleic Acids Res       Date:  1982-01-11       Impact factor: 16.971

5.  Enhanced graphic matrix analysis of nucleic acid and protein sequences.

Authors:  J V Maizel; R P Lenk
Journal:  Proc Natl Acad Sci U S A       Date:  1981-12       Impact factor: 11.205

6.  Two-dimensional graphic analysis of DNA sequence homologies.

Authors:  R Harr; P Hagblom; P Gustafsson
Journal:  Nucleic Acids Res       Date:  1982-01-11       Impact factor: 16.971

7.  An interactive graphics program for comparing and aligning nucleic acid and amino acid sequences.

Authors:  R Staden
Journal:  Nucleic Acids Res       Date:  1982-05-11       Impact factor: 16.971

8.  Rapid similarity searches of nucleic acid and protein data banks.

Authors:  W J Wilbur; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1983-02       Impact factor: 11.205

9.  Tests for comparing related amino-acid sequences. Cytochrome c and cytochrome c 551 .

Authors:  A D McLachlan
Journal:  J Mol Biol       Date:  1971-10-28       Impact factor: 5.469

10.  Three cDNA clones encoding mouse transplantation antigens: homology to immunoglobulin genes.

Authors:  M Steinmetz; J G Frelinger; D Fisher; T Hunkapiller; D Pereira; S M Weissman; H Uehara; S Nathenson; L Hood
Journal:  Cell       Date:  1981-04       Impact factor: 41.582

View more
  2 in total

1.  Fast analysis of genomic homologies: primate immunodeficiency virus.

Authors:  M L Moncany; P R Courtois
Journal:  J Mol Evol       Date:  1996-08       Impact factor: 2.395

2.  Support vector machine (SVM) based multiclass prediction with basic statistical analysis of plasminogen activators.

Authors:  Selvaraj Muthukrishnan; Munish Puri; Christophe Lefevre
Journal:  BMC Res Notes       Date:  2014-01-27
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.