Literature DB >> 8355272

Prediction of protein structure by evaluation of sequence-structure fitness. Aligning sequences to contact profiles derived from three-dimensional structures.

C Ouzounis1, C Sander, M Scharf, R Schneider.   

Abstract

The problem of protein structure prediction is formulated here as that of evaluating how well an amino acid sequence fits a hypothetical structure. The simplest and most complicated approaches, secondary structure prediction and all-atom free energy calculations, can be viewed as sequence-structure fitness problems. Here, an approach of intermediate complexity is described, which involves; (1) description of a protein structure in terms of contact interface vectors, with both intra-protein and protein-solvent contacts counted, (2) derivation of sequence preferences for 2 up to 29 contact interface types, (3) generation of numerous hypothetical model structures by placing the input sequence into a large set of known three-dimensional structures in all possible alignments, (4) evaluation of these models by summing the sequence preferences over all structural positions and (5) choice of predicted three-dimensional structure as that with the best sequence-structure fitness. Evolutionary information is incorporated by using position-dependent core weights derived from multiple sequence alignments. A number of tests of the method are performed: (1) evaluation of cyclic shifts of a sequence in its native structure; (2) alignment of a sequence in its native structure, allowing gaps; (3) alignment search with a sequence or sequence fragment in a database of structures; and (4) alignment search with a structure in a database of sequences. The main results are: (1) a native sequence can very well find its native structure among a large number of alternatives, in correct alignment; (2) substructures, such as (beta alpha)n units, can be detected in spite of very low sequence similarity; (3) remote homologous can be detected, with some dependence on the set of parameters used; (4) contact interface parameters are clearly superior to classical secondary structure parameters; (5) a simple interface description in terms of just two states, protein-protein and protein-water contacts, performs surprisingly well; (6) the use of core weights considerably improves accuracy in detection of remote homologues; (7) based on a sequence database search with a myoglobin contact profile, the C-terminal domain of a viral origin of replication binding protein is predicted to have an all-helical fold. The sequence-structure fitness concept is sufficiently general to accommodate a large variety of protein structure prediction methods, including new models of intermediate complexity currently being developed.

Entities:  

Mesh:

Substances:

Year:  1993        PMID: 8355272     DOI: 10.1006/jmbi.1993.1433

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  16 in total

1.  Factors limiting the performance of prediction-based fold recognition methods.

Authors:  X de la Cruz; J M Thornton
Journal:  Protein Sci       Date:  1999-04       Impact factor: 6.725

2.  Statistical potentials for fold assessment.

Authors:  Francisco Melo; Roberto Sánchez; Andrej Sali
Journal:  Protein Sci       Date:  2002-02       Impact factor: 6.725

3.  Feasibility in the inverse protein folding protocol.

Authors:  M Ota; K Nishikawa
Journal:  Protein Sci       Date:  1999-05       Impact factor: 6.725

4.  The directional atomic solvation energy: an atom-based potential for the assignment of protein sequences to known folds.

Authors:  Parag Mallick; Robert Weiss; David Eisenberg
Journal:  Proc Natl Acad Sci U S A       Date:  2002-12-02       Impact factor: 11.205

5.  Design of an optimal Chebyshev-expanded discrimination function for globular proteins.

Authors:  Boris Fain; Yu Xia; Michael Levitt
Journal:  Protein Sci       Date:  2002-08       Impact factor: 6.725

6.  Statistical potential for assessment and prediction of protein structures.

Authors:  Min-Yi Shen; Andrej Sali
Journal:  Protein Sci       Date:  2006-11       Impact factor: 6.725

7.  Analysis of the peroxiredoxin family: using active-site structure and sequence information for global classification and residue analysis.

Authors:  Kimberly J Nelson; Stacy T Knutson; Laura Soito; Chananat Klomsiri; Leslie B Poole; Jacquelyn S Fetrow
Journal:  Proteins       Date:  2010-12-22

8.  Protein fold recognition using sequence-derived predictions.

Authors:  D Fischer; D Eisenberg
Journal:  Protein Sci       Date:  1996-05       Impact factor: 6.725

9.  Fold prediction by a hierarchy of sequence, threading, and modeling methods.

Authors:  L Jaroszewski; L Rychlewski; B Zhang; A Godzik
Journal:  Protein Sci       Date:  1998-06       Impact factor: 6.725

10.  Protein structural similarities predicted by a sequence-structure compatibility method.

Authors:  Y Matsuo; K Nishikawa
Journal:  Protein Sci       Date:  1994-11       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.