Literature DB >> 6572363

Rapid similarity searches of nucleic acid and protein data banks.

W J Wilbur, D J Lipman.   

Abstract

With the development of large data banks of protein and nucleic acid sequences, the need for efficient methods of searching such banks for sequences similar to a given sequence has become evident. We present an algorithm for the global comparison of sequences based on matching k-tuples of sequence elements for a fixed k. The method results in substantial reduction in the time required to search a data bank when compared with prior techniques of similarity analysis, with minimal loss in sensitivity. The algorithm has also been adapted, in a separate implementation, to produce rigorous sequence alignments. Currently, using the DEC KL-10 system, we can compare all sequences in the entire Protein Data Bank of the National Biomedical Research Foundation with a 350-residue query sequence in less than 3 min and carry out a similar analysis with a 500-base query sequence against all eukaryotic sequences in the Los Alamos Nucleic Acid Data Base in less than 2 min.

Mesh:

Substances:

Year:  1983        PMID: 6572363      PMCID: PMC393452          DOI: 10.1073/pnas.80.3.726

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  11 in total

1.  Pattern recognition in genetic sequences.

Authors:  P H Sellers
Journal:  Proc Natl Acad Sci U S A       Date:  1979-07       Impact factor: 11.205

2.  Computer analysis of nucleic acid regulatory sequences.

Authors:  L J Korn; C L Queen; M N Wegman
Journal:  Proc Natl Acad Sci U S A       Date:  1977-10       Impact factor: 11.205

3.  Matching sequences under deletion-insertion constraints.

Authors:  D Sankoff
Journal:  Proc Natl Acad Sci U S A       Date:  1972-01       Impact factor: 11.205

4.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

5.  Efficient algorithms for folding and comparing nucleic acid sequences.

Authors:  J P Dumas; J Ninio
Journal:  Nucleic Acids Res       Date:  1982-01-11       Impact factor: 16.971

6.  Enhanced graphic matrix analysis of nucleic acid and protein sequences.

Authors:  J V Maizel; R P Lenk
Journal:  Proc Natl Acad Sci U S A       Date:  1981-12       Impact factor: 11.205

7.  Pattern recognition in nucleic acid sequences. I. A general method for finding local homologies and symmetries.

Authors:  W B Goad; M I Kanehisa
Journal:  Nucleic Acids Res       Date:  1982-01-11       Impact factor: 16.971

8.  Comparative biosequence metrics.

Authors:  T F Smith; M S Waterman; W M Fitch
Journal:  J Mol Evol       Date:  1981       Impact factor: 2.395

9.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

10.  An improved method of testing for evolutionary homology.

Authors:  W M Fitch
Journal:  J Mol Biol       Date:  1966-03       Impact factor: 5.469

View more
  563 in total

1.  Nonlinear methods in the analysis of protein sequences: a case study in rubredoxins.

Authors:  A Giuliani; R Benigni; P Sirabella; J P Zbilut; A Colosimo
Journal:  Biophys J       Date:  2000-01       Impact factor: 4.033

2.  Prediction of protein functional domains from sequences using artificial neural networks.

Authors:  J Murvai; K Vlahovicek; C Szepesvári; S Pongor
Journal:  Genome Res       Date:  2001-08       Impact factor: 9.043

3.  The VANA glycopeptide resistance protein is related to D-alanyl-D-alanine ligase cell wall biosynthesis enzymes.

Authors:  S Dutka-Malen; C Molinas; M Arthur; P Courvalin
Journal:  Mol Gen Genet       Date:  1990-12

4.  Multiple sequence alignment with the Clustal series of programs.

Authors:  Ramu Chenna; Hideaki Sugawara; Tadashi Koike; Rodrigo Lopez; Toby J Gibson; Desmond G Higgins; Julie D Thompson
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

5.  Identification of sequence types among the M-nontypeable group A streptococci.

Authors:  W A Relf; D R Martin; K S Sriprakash
Journal:  J Clin Microbiol       Date:  1992-12       Impact factor: 5.948

6.  Molecular characterization and determination of the coding capacity of the genome of equine herpesvirus type 2 between the genome coordinates 0.235 and 0.258 (the EcoRI DNA fragment N; 4.2 kbp).

Authors:  H J Rode; J J Bugert; M Handermann; P Schnitzler; R Kehm; W Janssen; H Delius; G Darai
Journal:  Virus Genes       Date:  1994-09       Impact factor: 2.332

7.  Characterization of a Cold-Regulated Wheat Gene Related to Arabidopsis cor47.

Authors:  W Guo; R W Ward; M F Thomashow
Journal:  Plant Physiol       Date:  1992-10       Impact factor: 8.340

8.  Illegitimate recombination generates a class switch from C mu to C delta in an IgD-secreting plasmacytoma.

Authors:  A C Gilliam; A Shen; J E Richards; F R Blattner; J F Mushinski; P W Tucker
Journal:  Proc Natl Acad Sci U S A       Date:  1984-07       Impact factor: 11.205

9.  Structure of the chromosome VII centromere region in Neurospora crassa: degenerate transposons and simple repeats.

Authors:  E B Cambareri; R Aisner; J Carbon
Journal:  Mol Cell Biol       Date:  1998-09       Impact factor: 4.272

10.  Sequence and structure of the Drosophila melanogaster ovarian tumor gene and generation of an antibody specific for the ovarian tumor protein.

Authors:  W R Steinhauer; R C Walsh; L J Kalfayan
Journal:  Mol Cell Biol       Date:  1989-12       Impact factor: 4.272

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.