| Literature DB >> 24688737 |
Antonin Guilloux1, Bernard Caudron2, Jean-Luc Jestin3.
Abstract
There is a need for rules allowing three-dimensional structure information to be derived from protein sequences. In this work, consideration of an elementary protein folding step allows protein sub-sequences which optimize folding to be derived for any given protein sequence. Classical mechanics applied to this system and the energy conservation law during the elementary folding step yields an equation whose solutions are taken over the field of rational numbers. This formalism is applied to beta-sheets containing two edge strands and at least two central strands. The number of protein sub-sequences optimized for folding per amino acid in beta-strands is shown in particular to predict edge strands from protein sequences. Topological information on beta-strands and loops connecting them is derived for protein sequences with a prediction accuracy of 75%. The statistical significance of the finding is given. Applications in protein structure prediction are envisioned such as for the quality assessment of protein structure models.Entities:
Keywords: Protein structure; beta-strands; central strand; loop; structure prediction; topology; turn
Year: 2013 PMID: 24688737 PMCID: PMC3962219 DOI: 10.5936/csbj.201305001
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Figure 3Set of sequences with optimal folding properties (SOF) highlighted in red for human transthyretin whose structure was described (PDB reference 1eta) [48]. The amino acid numbers are drawn on the horizontal axis. Each red segment corresponds to a peptide sequence with optimal folding properties (SOF).
Figure 1Elementary step for the folding of a small group of mass m onto the folding entity to yield a larger folding entity modelled by a sphere of mass M. Symmetry is gained during this elementary folding step.
Figure 2Representation of predicted edge strands in the structure of human transthyretin (PDB reference 1eta) [48]. Lines represent virtual bonds between the alpha carbons of adjacent amino acids in the protein. Two superimposed beta-sheets (blue and green) consisting of four beta-strands each contain two edge strands (dark blue and dark green) and predicted according to the rule.
Edge strands and central strands of the human transthyretin structure. Extreme values of the number of SOF (Vm) highlighted in bold predict the edge strands noted in bold. The position of the strand in the sheet is central (C) or on the edge (E). Amino acids in the single-letter code are numbered according to the structure (PDB reference 1eta) [48].
| Beta-strand | Position | Vm |
|---|---|---|
| P11-D18 | C2 | 1.9 |
| G53-H56 |
|
|
| R103-S112 | C1 | 1.1 |
| S115-T123 | E1 | 1 |
| V28-A36 | C1 | 3 |
| A45-T49 |
|
|
| E66-D74 | C2 | 2.8 |
| H90-A97 | E2 | 2.3 |
Figure 4Bidimensional representations of polypeptide main chains containing 2D knots within loops between two beta-strands (A and B) or between a beta-strand and a helix (C and D). A beta-sheet's axis is represented by a small circle (B and D).