Literature DB >> 10716175

Comparison of sequence profiles. Strategies for structural predictions using sequence information.

L Rychlewski1, L Jaroszewski, W Li, A Godzik.   

Abstract

Distant homologies between proteins are often discovered only after three-dimensional structures of both proteins are solved. The sequence divergence for such proteins can be so large that simple comparison of their sequences fails to identify any similarity. New generation of sensitive alignment tools use averaged sequences of entire homologous families (profiles) to detect such homologies. Several algorithms, including the newest generation of BLAST algorithms and BASIC, an algorithm used in our group to assign fold predictions for proteins from several genomes, are compared to each other on the large set of structurally similar proteins with little sequence similarity. Proteins in the benchmark are classified according to the level of their similarity, which allows us to demonstrate that most of the improvement of the new algorithms is achieved for proteins with strong functional similarities, with almost no progress in recognizing distant fold similarities. It is also shown that details of profile calculation strongly influence its sensitivity in recognizing distant homologies. The most important choice is how to include information from diverging members of the family, avoiding generating false predictions, while accounting for entire sequence divergence within a family. PSI-BLAST takes a conservative approach, deriving a profile from core members of the family, providing a solid improvement without almost any false predictions. BASIC strives for better sensitivity by increasing the weight of divergent family members and paying the price in lower reliability. A new FFAS algorithm introduced here uses a new procedure for profile generation that takes into account all the relations within the family and matches BASIC sensitivity with PSI-BLAST like reliability.

Mesh:

Substances:

Year:  2000        PMID: 10716175      PMCID: PMC2144550          DOI: 10.1110/ps.9.2.232

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  25 in total

1.  Structure classification-based assessment of CASP3 predictions for the fold recognition targets.

Authors:  A G Murzin
Journal:  Proteins       Date:  1999

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  A hidden Markov model that finds genes in E. coli DNA.

Authors:  A Krogh; I S Mian; D Haussler
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

4.  Applying motif and profile searches.

Authors:  P Bork; T J Gibson
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

5.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

6.  SCOP: a structural classification of proteins database for the investigation of sequences and structures.

Authors:  A G Murzin; S E Brenner; T Hubbard; C Chothia
Journal:  J Mol Biol       Date:  1995-04-07       Impact factor: 5.469

7.  The PROSITE database, its status in 1995.

Authors:  A Bairoch; P Bucher; K Hofmann
Journal:  Nucleic Acids Res       Date:  1996-01-01       Impact factor: 16.971

8.  The HSSP data base of protein structure-sequence alignments.

Authors:  C Sander; R Schneider
Journal:  Nucleic Acids Res       Date:  1993-07-01       Impact factor: 16.971

9.  Position-based sequence weights.

Authors:  S Henikoff; J G Henikoff
Journal:  J Mol Biol       Date:  1994-11-04       Impact factor: 5.469

10.  Detection of conserved segments in proteins: iterative scanning of sequence databases with alignment blocks.

Authors:  R L Tatusov; S F Altschul; E V Koonin
Journal:  Proc Natl Acad Sci U S A       Date:  1994-12-06       Impact factor: 11.205

View more
  195 in total

1.  Evaluation of PSI-BLAST alignment accuracy in comparison to structural alignments.

Authors:  I Friedberg; T Kaplan; H Margalit
Journal:  Protein Sci       Date:  2000-11       Impact factor: 6.725

2.  LiveBench-1: continuous benchmarking of protein structure prediction servers.

Authors:  J M Bujnicki; A Elofsson; D Fischer; L Rychlewski
Journal:  Protein Sci       Date:  2001-02       Impact factor: 6.725

3.  Improved recognition of native-like protein structures using a family of designed sequences.

Authors:  Patrice Koehl; Michael Levitt
Journal:  Proc Natl Acad Sci U S A       Date:  2002-01-08       Impact factor: 11.205

4.  Protein topology and stability define the space of allowed sequences.

Authors:  Patrice Koehl; Michael Levitt
Journal:  Proc Natl Acad Sci U S A       Date:  2002-01-22       Impact factor: 11.205

5.  A comparison of position-specific score matrices based on sequence and structure alignments.

Authors:  Anna R Panchenko; Stephen H Bryant
Journal:  Protein Sci       Date:  2002-02       Impact factor: 6.725

Review 6.  Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.

Authors:  A A Schäffer; L Aravind; T L Madden; S Shavirin; J L Spouge; Y I Wolf; E V Koonin; S F Altschul
Journal:  Nucleic Acids Res       Date:  2001-07-15       Impact factor: 16.971

7.  Solution structure and stability of the anti-sigma factor AsiA: implications for novel functions.

Authors:  Jeffrey L Urbauer; Mario F Simeonov; Ramona J Bieber Urbauer; Karen Adelman; Joshua M Gilmore; Edward N Brody
Journal:  Proc Natl Acad Sci U S A       Date:  2002-02-05       Impact factor: 11.205

8.  Pcons: a neural-network-based consensus predictor that improves fold recognition.

Authors:  J Lundström; L Rychlewski; J Bujnicki; A Elofsson
Journal:  Protein Sci       Date:  2001-11       Impact factor: 6.725

9.  Structural similarity to link sequence space: new potential superfamilies and implications for structural genomics.

Authors:  Patrick Aloy; Baldomero Oliva; Enrique Querol; Francesc X Aviles; Robert B Russell
Journal:  Protein Sci       Date:  2002-05       Impact factor: 6.725

10.  Improving the quality of twilight-zone alignments.

Authors:  L Jaroszewski; L Rychlewski; A Godzik
Journal:  Protein Sci       Date:  2000-08       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.