Literature DB >> 1754394

Automated assembly of protein blocks for database searching.

S Henikoff1, J G Henikoff.   

Abstract

A system is described for finding and assembling the most highly conserved regions of related proteins for database searching. First, an automated version of Smith's algorithm for finding motifs is used for sensitive detection of multiple local alignments. Next, the local alignments are converted to blocks and the best set of non-overlapping blocks is determined. When the automated system was applied successively to all 437 groups of related proteins in the PROSITE catalog, 1764 blocks resulted; these could be used for very sensitive searches of sequence databases. Each block was calibrated by searching the SWISS-PROT database to obtain a measure of the chance distribution of matches, and the calibrated blocks were concatenated into a database that could itself be searched. Examples are provided in which distant relationships are detected either using a set of blocks to search a sequence database or using sequences to search the database of blocks. The practical use of the blocks database is demonstrated by detecting previously unknown relationships between oxidoreductases and by evaluating a proposed relationship between HIV Vif protein and thiol proteases.

Mesh:

Substances:

Year:  1991        PMID: 1754394      PMCID: PMC329220          DOI: 10.1093/nar/19.23.6565

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  25 in total

1.  PROSITE: a dictionary of sites and patterns in proteins.

Authors:  A Bairoch
Journal:  Nucleic Acids Res       Date:  1991-04-25       Impact factor: 16.971

2.  The SWISS-PROT protein sequence data bank.

Authors:  A Bairoch; B Boeckmann
Journal:  Nucleic Acids Res       Date:  1991-04-25       Impact factor: 16.971

3.  PROMOT: a FORTRAN program to scan protein sequences against a library of known motifs.

Authors:  M J Sternberg
Journal:  Comput Appl Biosci       Date:  1991-04

4.  A workbench for multiple alignment construction and analysis.

Authors:  G D Schuler; S F Altschul; D J Lipman
Journal:  Proteins       Date:  1991

5.  Multiple sequence alignment of protein families showing low sequence homology: a methodological approach using database pattern-matching discriminators for G-protein-linked receptors.

Authors:  T K Attwood; E E Eliopoulos; J B Findlay
Journal:  Gene       Date:  1991-02-15       Impact factor: 3.688

6.  Phosphotransferase sequence homology.

Authors:  S Brenner
Journal:  Nature       Date:  1987 Sep 3-9       Impact factor: 49.962

7.  Finding protein similarities with nucleotide sequence databases.

Authors:  S Henikoff; J C Wallace; J P Brown
Journal:  Methods Enzymol       Date:  1990       Impact factor: 1.600

8.  A fast and sensitive multiple sequence alignment algorithm.

Authors:  M Vingron; P Argos
Journal:  Comput Appl Biosci       Date:  1989-04

9.  Detecting homology of distantly related proteins with consensus sequences.

Authors:  L Patthy
Journal:  J Mol Biol       Date:  1987-12-20       Impact factor: 5.469

10.  Profile analysis: detection of distantly related proteins.

Authors:  M Gribskov; A D McLachlan; D Eisenberg
Journal:  Proc Natl Acad Sci U S A       Date:  1987-07       Impact factor: 11.205

View more
  105 in total

1.  Increased coverage of protein families with the blocks database servers.

Authors:  J G Henikoff; E A Greene; S Pietrokovski; S Henikoff
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Recognition of nascent RNA by the human La antigen: conserved and divergent features of structure and function.

Authors:  R J Maraia; R V Intine
Journal:  Mol Cell Biol       Date:  2001-01       Impact factor: 4.272

Review 3.  Microbial relatives of the seed storage proteins of higher plants: conservation of structure and diversification of function during evolution of the cupin superfamily.

Authors:  J M Dunwell; S Khuri; P J Gane
Journal:  Microbiol Mol Biol Rev       Date:  2000-03       Impact factor: 11.056

4.  Evolutionary relationship between K(+) channels and symporters.

Authors:  S R Durell; Y Hao; T Nakamura; E P Bakker; H R Guy
Journal:  Biophys J       Date:  1999-08       Impact factor: 4.033

Review 5.  La protein and its associated small nuclear and nucleolar precursor RNAs.

Authors:  Richard J Maraia; Robert V Intine
Journal:  Gene Expr       Date:  2002

6.  Predicting deleterious amino acid substitutions.

Authors:  P C Ng; S Henikoff
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

7.  GenDB--an open source genome annotation system for prokaryote genomes.

Authors:  Folker Meyer; Alexander Goesmann; Alice C McHardy; Daniela Bartels; Thomas Bekel; Jörn Clausen; Jörn Kalinowski; Burkhard Linke; Oliver Rupp; Robert Giegerich; Alfred Pühler
Journal:  Nucleic Acids Res       Date:  2003-04-15       Impact factor: 16.971

8.  PANTHER: a library of protein families and subfamilies indexed by function.

Authors:  Paul D Thomas; Michael J Campbell; Anish Kejariwal; Huaiyu Mi; Brian Karlak; Robin Daverman; Karen Diemer; Anushya Muruganujan; Apurva Narechania
Journal:  Genome Res       Date:  2003-09       Impact factor: 9.043

9.  A putative homolog of U2AF65 in S. cerevisiae.

Authors:  E Birney; S Kumar; A R Krainer
Journal:  Nucleic Acids Res       Date:  1992-09-11       Impact factor: 16.971

10.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.