Literature DB >> 9521122

Comprehensive assessment of automatic structural alignment against a manual standard, the scop classification of proteins.

M Gerstein1, M Levitt.   

Abstract

We apply a simple method for aligning protein sequences on the basis of a 3D structure, on a large scale, to the proteins in the scop classification of fold families. This allows us to assess, understand, and improve our automatic method against an objective, manually derived standard, a type of comprehensive evaluation that has not yet been possible for other structural alignment algorithms. Our basic approach directly matches the backbones of two structures, using repeated cycles of dynamic programming and least-squares fitting to determine an alignment minimizing coordinate difference. Because of simplicity, our method can be readily modified to take into account additional features of protein structure such as the orientation of side chains or the location-dependent cost of opening a gap. Our basic method, augmented by such modifications, can find reasonable alignments for all but 1.5% of the known structural similarities in scop, i.e., all but 32 of the 2,107 superfamily pairs. We discuss the specific protein structural features that make these 32 pairs so difficult to align and show how our procedure effectively partitions the relationships in scop into different categories, depending on what aspects of protein structure are involved (e.g., depending on whether or not consideration of side-chain orientation is necessary for proper alignment). We also show how our pairwise alignment procedure can be extended to generate a multiple alignment for a group of related structures. We have compared these alignments in detail with corresponding manual ones culled from the literature. We find good agreement (to within 95% for the core regions), and detailed comparison highlights how particular protein structural features (such as certain strands) are problematical to align, giving somewhat ambiguous results. With these improvements and systematic tests, our procedure should be useful for the development of scop and the future classification of protein folds.

Mesh:

Substances:

Year:  1998        PMID: 9521122      PMCID: PMC2143933          DOI: 10.1002/pro.5560070226

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  53 in total

1.  A variable gap penalty function and feature weights for protein 3-D structure comparisons.

Authors:  Z Y Zhu; A Sali; T L Blundell
Journal:  Protein Eng       Date:  1992-01

2.  The structural alignment between two proteins: is there a unique answer?

Authors:  A Godzik
Journal:  Protein Sci       Date:  1996-07       Impact factor: 6.725

3.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

4.  An assessment of amino acid exchange matrices in aligning protein sequences: the twilight zone revisited.

Authors:  G Vogt; T Etzold; P Argos
Journal:  J Mol Biol       Date:  1995-06-16       Impact factor: 5.469

5.  Domain closure in adenylate kinase. Joints on either side of two helices close like neighboring fingers.

Authors:  M Gerstein; G Schulz; C Chothia
Journal:  J Mol Biol       Date:  1993-01-20       Impact factor: 5.469

6.  Identification of tertiary structure resemblance in proteins using a maximal common subgraph isomorphism algorithm.

Authors:  H M Grindley; P J Artymiuk; D W Rice; P Willett
Journal:  J Mol Biol       Date:  1993-02-05       Impact factor: 5.469

7.  The FSSP database of structurally aligned protein fold families.

Authors:  L Holm; C Sander
Journal:  Nucleic Acids Res       Date:  1994-09       Impact factor: 16.971

8.  Volume changes in protein evolution.

Authors:  M Gerstein; E L Sonnhammer; C Chothia
Journal:  J Mol Biol       Date:  1994-03-04       Impact factor: 5.469

9.  Evolution of proteins formed by beta-sheets. II. The core of the immunoglobulin domains.

Authors:  A M Lesk; C Chothia
Journal:  J Mol Biol       Date:  1982-09-15       Impact factor: 5.469

10.  Derivation of rules for comparative protein modeling from a database of protein structure alignments.

Authors:  A Sali; J P Overington
Journal:  Protein Sci       Date:  1994-09       Impact factor: 6.725

View more
  52 in total

1.  Detection of protein fold similarity based on correlation of amino acid properties.

Authors:  I V Grigoriev; S H Kim
Journal:  Proc Natl Acad Sci U S A       Date:  1999-12-07       Impact factor: 11.205

2.  The ASTRAL compendium for protein structure and sequence analysis.

Authors:  S E Brenner; P Koehl; M Levitt
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information.

Authors:  J Qian; B Stenger; C A Wilson; J Lin; R Jansen; S A Teichmann; J Park; W G Krebs; H Yu; V Alexandrov; N Echols; M Gerstein
Journal:  Nucleic Acids Res       Date:  2001-04-15       Impact factor: 16.971

4.  The morph server: a standardized system for analyzing and visualizing macromolecular motions in a database framework.

Authors:  W G Krebs; M Gerstein
Journal:  Nucleic Acids Res       Date:  2000-04-15       Impact factor: 16.971

5.  Whole-genome trees based on the occurrence of folds and orthologs: implications for comparing genomes on different levels.

Authors:  J Lin; M Gerstein
Journal:  Genome Res       Date:  2000-06       Impact factor: 9.043

6.  Sequence variations within protein families are linearly related to structural variations.

Authors:  Patrice Koehl; Michael Levitt
Journal:  J Mol Biol       Date:  2002-10-25       Impact factor: 5.469

7.  Detection of homologous proteins by an intermediate sequence search.

Authors:  Bino John; Andrej Sali
Journal:  Protein Sci       Date:  2004-01       Impact factor: 6.725

8.  Matching protein structures with fuzzy alignments.

Authors:  Richard Blankenbecler; Mattias Ohlsson; Carsten Peterson; Markus Ringner
Journal:  Proc Natl Acad Sci U S A       Date:  2003-10-02       Impact factor: 11.205

9.  Automatic classification of protein structure by using Gauss integrals.

Authors:  Peter Rogen; Boris Fain
Journal:  Proc Natl Acad Sci U S A       Date:  2002-12-27       Impact factor: 11.205

10.  A consensus view of fold space: combining SCOP, CATH, and the Dali Domain Dictionary.

Authors:  Ryan Day; David A C Beck; Roger S Armen; Valerie Daggett
Journal:  Protein Sci       Date:  2003-10       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.