Literature DB >> 3474607

Profile analysis: detection of distantly related proteins.

M Gribskov, A D McLachlan, D Eisenberg.   

Abstract

Profile analysis is a method for detecting distantly related proteins by sequence comparison. The basis for comparison is not only the customary Dayhoff mutational-distance matrix but also the results of structural studies and information implicit in the alignments of the sequences of families of similar proteins. This information is expressed in a position-specific scoring table (profile), which is created from a group of sequences previously aligned by structural or sequence similarity. The similarity of any other sequence (target) to the group of aligned sequences (probe) can be tested by comparing the target to the profile using dynamic programming algorithms. The profile method differs in two major respects from methods of sequence comparison in common use: (i) Any number of known sequences can be used to construct the profile, allowing more information to be used in the testing of the target than is possible with pairwise alignment methods. (ii) The profile includes the penalties for insertion or deletion at each position, which allow one to include the probe secondary structure in the testing scheme. Tests with globin and immunoglobulin sequences show that profile analysis can distinguish all members of these families from all other sequences in a database containing 3800 protein sequences.

Mesh:

Substances:

Year:  1987        PMID: 3474607      PMCID: PMC305087          DOI: 10.1073/pnas.84.13.4355

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  23 in total

1.  Sigma factors from E. coli, B. subtilis, phage SP01, and phage T4 are homologous proteins.

Authors:  M Gribskov; R R Burgess
Journal:  Nucleic Acids Res       Date:  1986-08-26       Impact factor: 16.971

Review 2.  Empirical predictions of protein conformation.

Authors:  P Y Chou; G D Fasman
Journal:  Annu Rev Biochem       Date:  1978       Impact factor: 23.643

3.  Analysis of gene duplication repeats in the myosin rod.

Authors:  A D McLachlan
Journal:  J Mol Biol       Date:  1983-09-05       Impact factor: 5.469

4.  Correlation of sequence hydrophobicities measures similarity in three-dimensional protein structure.

Authors:  R M Sweet; D Eisenberg
Journal:  J Mol Biol       Date:  1983-12-25       Impact factor: 5.469

5.  Enhanced graphic matrix analysis of nucleic acid and protein sequences.

Authors:  J V Maizel; R P Lenk
Journal:  Proc Natl Acad Sci U S A       Date:  1981-12       Impact factor: 11.205

6.  How different amino acid sequences determine similar protein structures: the structure and evolutionary dynamics of the globins.

Authors:  A M Lesk; C Chothia
Journal:  J Mol Biol       Date:  1980-01-25       Impact factor: 5.469

7.  Similar amino acid sequences: chance or common ancestry?

Authors:  R F Doolittle
Journal:  Science       Date:  1981-10-09       Impact factor: 47.728

8.  Rapid similarity searches of nucleic acid and protein data banks.

Authors:  W J Wilbur; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1983-02       Impact factor: 11.205

9.  An improved method of testing for evolutionary homology.

Authors:  W M Fitch
Journal:  J Mol Biol       Date:  1966-03       Impact factor: 5.469

10.  Three-dimensional structure, specificity and catalytic mechanism of renin.

Authors:  T Blundell; B L Sibanda; L Pearl
Journal:  Nature       Date:  1983 Jul 21-27       Impact factor: 49.962

View more
  307 in total

1.  The MetaFam Server: a comprehensive protein family resource.

Authors:  K A Silverstein; E Shoop; J E Johnson; A Kilian; J L Freeman; T M Kunau; I A Awad; M Mayer; E F Retzel
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations.

Authors:  A Bahr; J D Thompson; J C Thierry; O Poch
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

3.  Identification of related proteins with weak sequence identity using secondary structure information.

Authors:  C Geourjon; C Combet; C Blanchet; G Deléage
Journal:  Protein Sci       Date:  2001-04       Impact factor: 6.725

4.  Comparison of sequence profiles. Strategies for structural predictions using sequence information.

Authors:  L Rychlewski; L Jaroszewski; W Li; A Godzik
Journal:  Protein Sci       Date:  2000-02       Impact factor: 6.725

5.  Prediction of amino acid sequence from structure.

Authors:  K Raha; A M Wollacott; M J Italia; J R Desjarlais
Journal:  Protein Sci       Date:  2000-06       Impact factor: 6.725

6.  Motif-based fold assignment.

Authors:  L Salwinski; D Eisenberg
Journal:  Protein Sci       Date:  2001-12       Impact factor: 6.725

7.  A comparison of position-specific score matrices based on sequence and structure alignments.

Authors:  Anna R Panchenko; Stephen H Bryant
Journal:  Protein Sci       Date:  2002-02       Impact factor: 6.725

8.  The PROSITE database, its status in 2002.

Authors:  Laurent Falquet; Marco Pagni; Philipp Bucher; Nicolas Hulo; Christian J A Sigrist; Kay Hofmann; Amos Bairoch
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

9.  CDD: a database of conserved domain alignments with links to domain three-dimensional structure.

Authors:  Aron Marchler-Bauer; Anna R Panchenko; Benjamin A Shoemaker; Paul A Thiessen; Lewis Y Geer; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

10.  Improved detection of homologous membrane proteins by inclusion of information from topology predictions.

Authors:  Maria Hedman; Hans Deloof; Gunnar Von Heijne; Arne Elofsson
Journal:  Protein Sci       Date:  2002-03       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.