Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Prediction of protein structure by evaluation of sequence-structure fitness. Aligning sequences to contact profiles derived from three-dimensional structures.

Literature DB >> 8355272

Prediction of protein structure by evaluation of sequence-structure fitness. Aligning sequences to contact profiles derived from three-dimensional structures.

C Ouzounis¹, C Sander, M Scharf, R Schneider.

Abstract

The problem of protein structure prediction is formulated here as that of evaluating how well an amino acid sequence fits a hypothetical structure. The simplest and most complicated approaches, secondary structure prediction and all-atom free energy calculations, can be viewed as sequence-structure fitness problems. Here, an approach of intermediate complexity is described, which involves; (1) description of a protein structure in terms of contact interface vectors, with both intra-protein and protein-solvent contacts counted, (2) derivation of sequence preferences for 2 up to 29 contact interface types, (3) generation of numerous hypothetical model structures by placing the input sequence into a large set of known three-dimensional structures in all possible alignments, (4) evaluation of these models by summing the sequence preferences over all structural positions and (5) choice of predicted three-dimensional structure as that with the best sequence-structure fitness. Evolutionary information is incorporated by using position-dependent core weights derived from multiple sequence alignments. A number of tests of the method are performed: (1) evaluation of cyclic shifts of a sequence in its native structure; (2) alignment of a sequence in its native structure, allowing gaps; (3) alignment search with a sequence or sequence fragment in a database of structures; and (4) alignment search with a structure in a database of sequences. The main results are: (1) a native sequence can very well find its native structure among a large number of alternatives, in correct alignment; (2) substructures, such as (beta alpha)n units, can be detected in spite of very low sequence similarity; (3) remote homologous can be detected, with some dependence on the set of parameters used; (4) contact interface parameters are clearly superior to classical secondary structure parameters; (5) a simple interface description in terms of just two states, protein-protein and protein-water contacts, performs surprisingly well; (6) the use of core weights considerably improves accuracy in detection of remote homologues; (7) based on a sequence database search with a myoglobin contact profile, the C-terminal domain of a viral origin of replication binding protein is predicted to have an all-helical fold. The sequence-structure fitness concept is sufficiently general to accommodate a large variety of protein structure prediction methods, including new models of intermediate complexity currently being developed.

Entities: Disease

Mesh：

Substances：
Proteins

Year: 1993 PMID： 8355272 DOI： 10.1006/jmbi.1993.1433

Source DB: PubMed Journal: J Mol Biol ISSN： 0022-2836 Impact factor: 5.469

Keyword Cloud
Cited

16 in total

1. Factors limiting the performance of prediction-based fold recognition methods.

Authors: X de la Cruz; J M Thornton
Journal: Protein Sci Date: 1999-04 Impact factor: 6.725

2. Statistical potentials for fold assessment.

Authors: Francisco Melo; Roberto Sánchez; Andrej Sali
Journal: Protein Sci Date: 2002-02 Impact factor: 6.725

3. Feasibility in the inverse protein folding protocol.

Authors: M Ota; K Nishikawa
Journal: Protein Sci Date: 1999-05 Impact factor: 6.725

4. The directional atomic solvation energy: an atom-based potential for the assignment of protein sequences to known folds.

Authors: Parag Mallick; Robert Weiss; David Eisenberg
Journal: Proc Natl Acad Sci U S A Date: 2002-12-02 Impact factor: 11.205

5. Design of an optimal Chebyshev-expanded discrimination function for globular proteins.

Authors: Boris Fain; Yu Xia; Michael Levitt
Journal: Protein Sci Date: 2002-08 Impact factor: 6.725

6. Statistical potential for assessment and prediction of protein structures.

Authors: Min-Yi Shen; Andrej Sali
Journal: Protein Sci Date: 2006-11 Impact factor: 6.725

7. Analysis of the peroxiredoxin family: using active-site structure and sequence information for global classification and residue analysis.

Authors: Kimberly J Nelson; Stacy T Knutson; Laura Soito; Chananat Klomsiri; Leslie B Poole; Jacquelyn S Fetrow
Journal: Proteins Date: 2010-12-22