Literature DB >> 7511453

Improving the sensitivity of the sequence profile method.

R Lüthy1, I Xenarios, P Bucher.   

Abstract

The sequence profile method (Gribskov M, McLachlan AD, Eisenberg D, 1987, Proc Natl Acad Sci USA 84:4355-4358) is a powerful tool to detect distant relationships between amino acid sequences. A profile is a table of position-specific scores and gap penalties, providing a generalized description of a protein motif, which can be used for sequence alignments and database searches instead of an individual sequence. A sequence profile is derived from a multiple sequence alignment. We have found 2 ways to improve the sensitivity of sequence profiles: (1) Sequence weights: Usage of individual weights for each sequence avoids bias toward closely related sequences. These weights are automatically assigned based on the distance of the sequences using a published procedure (Sibbald PR, Argos P, 1990, J Mol Biol 216:813-818). (2) Amino acid substitution table: In addition to the alignment, the construction of a profile also needs an amino acid substitution table. We have found that in some cases a new table, the BLOSUM45 table (Henikoff S, Henikoff JG, 1992, Proc Natl Acad Sci USA 89:10915-10919), is more sensitive than the original Dayhoff table or the modified Dayhoff table used in the current implementation. Profiles derived by the improved method are more sensitive and selective in a number of cases where previous methods have failed to completely separate true members from false positives.

Entities:  

Mesh:

Substances:

Year:  1994        PMID: 7511453      PMCID: PMC2142471          DOI: 10.1002/pro.5560030118

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  10 in total

Review 1.  SH2 and SH3 domains: from structure to function.

Authors:  T Pawson; G D Gish
Journal:  Cell       Date:  1992-10-30       Impact factor: 41.582

2.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

3.  Phylogeny of the alpha-crystallin-related heat-shock proteins.

Authors:  N Plesofsky-Vig; J Vig; R Brambl
Journal:  J Mol Evol       Date:  1992-12       Impact factor: 2.395

4.  The SWISS-PROT protein sequence data bank.

Authors:  A Bairoch; B Boeckmann
Journal:  Nucleic Acids Res       Date:  1992-05-11       Impact factor: 16.971

5.  PROSITE: a dictionary of sites and patterns in proteins.

Authors:  A Bairoch
Journal:  Nucleic Acids Res       Date:  1992-05-11       Impact factor: 16.971

6.  Weighting aligned protein or nucleic acid sequences to correct for unequal representation.

Authors:  P R Sibbald; P Argos
Journal:  J Mol Biol       Date:  1990-12-20       Impact factor: 5.469

7.  Recognition of functional regions in primary structures using a set of property patterns.

Authors:  P Bork
Journal:  FEBS Lett       Date:  1989-10-23       Impact factor: 4.124

8.  Weights for data related by a tree.

Authors:  S F Altschul; R J Carroll; D J Lipman
Journal:  J Mol Biol       Date:  1989-06-20       Impact factor: 5.469

9.  Profile analysis: detection of distantly related proteins.

Authors:  M Gribskov; A D McLachlan; D Eisenberg
Journal:  Proc Natl Acad Sci U S A       Date:  1987-07       Impact factor: 11.205

10.  A comprehensive set of sequence analysis programs for the VAX.

Authors:  J Devereux; P Haeberli; O Smithies
Journal:  Nucleic Acids Res       Date:  1984-01-11       Impact factor: 16.971

  10 in total
  20 in total

1.  NikR is a ribbon-helix-helix DNA-binding protein.

Authors:  P T Chivers; R T Sauer
Journal:  Protein Sci       Date:  1999-11       Impact factor: 6.725

2.  The PROSITE database, its status in 2002.

Authors:  Laurent Falquet; Marco Pagni; Philipp Bucher; Nicolas Hulo; Christian J A Sigrist; Kay Hofmann; Amos Bairoch
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

Review 3.  Statistical and Bayesian approaches to RNA secondary structure prediction.

Authors:  Ye Ding
Journal:  RNA       Date:  2006-03       Impact factor: 4.942

4.  Phylogenetic profiles reveal evolutionary relationships within the "twilight zone" of sequence similarity.

Authors:  Gue Su Chang; Yoojin Hong; Kyung Dae Ko; Gaurav Bhardwaj; Edward C Holmes; Randen L Patterson; Damian B van Rossum
Journal:  Proc Natl Acad Sci U S A       Date:  2008-09-02       Impact factor: 11.205

5.  Cloning and characterization of the mammalian brain-specific, Mg2+-dependent neutral sphingomyelinase.

Authors:  K Hofmann; S Tomiuk; G Wolff; W Stoffel
Journal:  Proc Natl Acad Sci U S A       Date:  2000-05-23       Impact factor: 11.205

6.  Graph pyramids for protein function prediction.

Authors:  Tushar Sandhan; Youngjun Yoo; Jin Choi; Sun Kim
Journal:  BMC Med Genomics       Date:  2015-05-29       Impact factor: 3.063

7.  Self-organizing hierarchic networks for pattern recognition in protein sequence.

Authors:  J Hanke; G Beckmann; P Bork; J G Reich
Journal:  Protein Sci       Date:  1996-01       Impact factor: 6.725

8.  Computational characterisation of potential RNA-binding sites in arenavirus nucleocapsid proteins.

Authors:  G Parisi; J Echave; D Ghiringhelli; V Romanowski
Journal:  Virus Genes       Date:  1996       Impact factor: 2.332

9.  The PROSITE database, its status in 1997.

Authors:  A Bairoch; P Bucher; K Hofmann
Journal:  Nucleic Acids Res       Date:  1997-01-01       Impact factor: 16.971

10.  Embedding strategies for effective use of information from multiple sequence alignments.

Authors:  S Henikoff; J G Henikoff
Journal:  Protein Sci       Date:  1997-03       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.