Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Rapid similarity searches of nucleic acid and protein data banks.

Literature DB >> 6572363

Rapid similarity searches of nucleic acid and protein data banks.

Abstract

With the development of large data banks of protein and nucleic acid sequences, the need for efficient methods of searching such banks for sequences similar to a given sequence has become evident. We present an algorithm for the global comparison of sequences based on matching k-tuples of sequence elements for a fixed k. The method results in substantial reduction in the time required to search a data bank when compared with prior techniques of similarity analysis, with minimal loss in sensitivity. The algorithm has also been adapted, in a separate implementation, to produce rigorous sequence alignments. Currently, using the DEC KL-10 system, we can compare all sequences in the entire Protein Data Bank of the National Biomedical Research Foundation with a 350-residue query sequence in less than 3 min and carry out a similar analysis with a 500-base query sequence against all eukaryotic sequences in the Los Alamos Nucleic Acid Data Base in less than 2 min.

Mesh：

Substances：
Nucleic Acids
Proteins

Year: 1983 PMID： 6572363 PMCID： PMC393452 DOI： 10.1073/pnas.80.3.726

Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN： 0027-8424 Impact factor: 11.205

11 in total

1. Pattern recognition in genetic sequences.

Authors: P H Sellers
Journal: Proc Natl Acad Sci U S A Date: 1979-07 Impact factor: 11.205

2. Computer analysis of nucleic acid regulatory sequences.

Authors: L J Korn; C L Queen; M N Wegman
Journal: Proc Natl Acad Sci U S A Date: 1977-10 Impact factor: 11.205

3. Matching sequences under deletion-insertion constraints.

Authors: D Sankoff
Journal: Proc Natl Acad Sci U S A Date: 1972-01 Impact factor: 11.205

4. A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors: S B Needleman; C D Wunsch
Journal: J Mol Biol Date: 1970-03 Impact factor: 5.469

5. Efficient algorithms for folding and comparing nucleic acid sequences.

Authors: J P Dumas; J Ninio
Journal: Nucleic Acids Res Date: 1982-01-11 Impact factor: 16.971

6. Enhanced graphic matrix analysis of nucleic acid and protein sequences.

Authors: J V Maizel; R P Lenk
Journal: Proc Natl Acad Sci U S A Date: 1981-12 Impact factor: 11.205

7. Pattern recognition in nucleic acid sequences. I. A general method for finding local homologies and symmetries.

Authors: W B Goad; M I Kanehisa
Journal: Nucleic Acids Res Date: 1982-01-11 Impact factor: 16.971

8. Comparative biosequence metrics.

Authors: T F Smith; M S Waterman; W M Fitch
Journal: J Mol Evol Date: 1981 Impact factor: 2.395

9. Identification of common molecular subsequences.

Authors: T F Smith; M S Waterman
Journal: J Mol Biol Date: 1981-03-25 Impact factor: 5.469

10. An improved method of testing for evolutionary homology.

Authors: W M Fitch
Journal: J Mol Biol Date: 1966-03 Impact factor: 5.469

563 in total

1. Nonlinear methods in the analysis of protein sequences: a case study in rubredoxins.

Authors: A Giuliani; R Benigni; P Sirabella; J P Zbilut; A Colosimo
Journal: Biophys J Date: 2000-01 Impact factor: 4.033

2. Prediction of protein functional domains from sequences using artificial neural networks.

Authors: J Murvai; K Vlahovicek; C Szepesvári; S Pongor
Journal: Genome Res Date: 2001-08 Impact factor: 9.043

3. The VANA glycopeptide resistance protein is related to D-alanyl-D-alanine ligase cell wall biosynthesis enzymes.

Authors: S Dutka-Malen; C Molinas; M Arthur; P Courvalin
Journal: Mol Gen Genet Date: 1990-12

4. Multiple sequence alignment with the Clustal series of programs.

Authors: Ramu Chenna; Hideaki Sugawara; Tadashi Koike; Rodrigo Lopez; Toby J Gibson; Desmond G Higgins; Julie D Thompson
Journal: Nucleic Acids Res Date: 2003-07-01 Impact factor: 16.971

5. Identification of sequence types among the M-nontypeable group A streptococci.

Authors: W A Relf; D R Martin; K S Sriprakash
Journal: J Clin Microbiol Date: 1992-12 Impact factor: 5.948

6. Molecular characterization and determination of the coding capacity of the genome of equine herpesvirus type 2 between the genome coordinates 0.235 and 0.258 (the EcoRI DNA fragment N; 4.2 kbp).

Authors: H J Rode; J J Bugert; M Handermann; P Schnitzler; R Kehm; W Janssen; H Delius; G Darai
Journal: Virus Genes Date: 1994-09 Impact factor: 2.332

10. Sequence and structure of the Drosophila melanogaster ovarian tumor gene and generation of an antibody specific for the ovarian tumor protein.

Authors: W R Steinhauer; R C Walsh; L J Kalfayan
Journal: Mol Cell Biol Date: 1989-12 Impact factor: 4.272