Literature DB >> 2207748

Improved sensitivity of biological sequence database searches.

D L Brutlag1, J P Dautricourt, S Maulik, J Relph.   

Abstract

We have increased the sensitivity of DNA and protein sequence database searches by allowing similar but non-identical amino acids or nucleotides to match. In addition, one can match k-tuples or words instead of matching individual residues in order to speed the search. A matching matrix species which k-tuples match each other. The matching matrix can be calculated from a similarity matrix of amino acids and a threshold of similarity required for matching. This permits amino acid similarity matrices or replacement matrices (PAM matrices) to be used in the first step of a sequence comparison rather than in a secondary scoring phase. The concept of matching non-identical k-tuples also increases the power of DNA database searches. For example, a matrix that specifies that any 3-tuple in a DNA sequence can match any other 3-tuple encoding the same amino acid permits a DNA database search using a DNA query sequence for regions that would encode a similar amino acid sequence.

Entities:  

Mesh:

Year:  1990        PMID: 2207748     DOI: 10.1093/bioinformatics/6.3.237

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  48 in total

1.  Which came first, MHC class I or class II?

Authors:  M F Flajnik; C Canel; J Kramer; M Kasahara
Journal:  Immunogenetics       Date:  1991       Impact factor: 2.846

2.  Predicted sequence and structure of a vegetative lectin in Pisum sativum.

Authors:  J H Pak; T Hendrickson; M S Dobres
Journal:  Plant Mol Biol       Date:  1992-03       Impact factor: 4.076

3.  Cloning, expression and localization of an RNA helicase gene from a human lymphoid cell line with chromosomal breakpoint 11q23.3.

Authors:  D Lu; J J Yunis
Journal:  Nucleic Acids Res       Date:  1992-04-25       Impact factor: 16.971

4.  Molecular characterization of two novel crystal protein genes from Bacillus thuringiensis subsp. thompsoni.

Authors:  K L Brown; H R Whiteley
Journal:  J Bacteriol       Date:  1992-01       Impact factor: 3.490

5.  Molecular characterization of flgM, a gene encoding a negative regulator of flagellin synthesis in Salmonella typhimurium.

Authors:  K L Gillen; K T Hughes
Journal:  J Bacteriol       Date:  1991-10       Impact factor: 3.490

6.  Patterns in protein primary sequences: classification, display and analysis.

Authors:  P N Saurugger; B A Metfessel
Journal:  Proc Annu Symp Comput Appl Med Care       Date:  1991

7.  Identification and characterization of T-cell antigen receptor-related genes in phylogenetically diverse vertebrate species.

Authors:  J P Rast; R N Haire; R T Litman; S Pross; G W Litman
Journal:  Immunogenetics       Date:  1995       Impact factor: 2.846

8.  Unification of the ferritin family of proteins.

Authors:  M J Grossman; S M Hinton; V Minak-Bernero; C Slaughter; E I Stiefel
Journal:  Proc Natl Acad Sci U S A       Date:  1992-03-15       Impact factor: 11.205

9.  T helper cell recognition of muscle acetylcholine receptor in myasthenia gravis. Epitopes on the gamma and delta subunits.

Authors:  A A Manfredi; M P Protti; M W Dalton; J F Howard; B M Conti-Tronconi
Journal:  J Clin Invest       Date:  1993-08       Impact factor: 14.808

10.  The Drosophila Stubble-stubbloid gene encodes an apparent transmembrane serine protease required for epithelial morphogenesis.

Authors:  L F Appel; M Prout; R Abu-Shumays; A Hammonds; J C Garbe; D Fristrom; J Fristrom
Journal:  Proc Natl Acad Sci U S A       Date:  1993-06-01       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.