| Literature DB >> 31842350 |
Piotr Fabian1, Katarzyna Stapor1, Irena Roterman2.
Abstract
The model, describing a method of determining the structure of an early intermediate in the process of protein folding to analyze nonredundant PDB protein bases, allows determining the relationship between the sequence of tetrapeptides and their structural forms expressed by structural codes. The contingency table expressing such a relationship can be used to predict the structure of polypeptides by proposing a structural form with a precision limited to the structural code. However, by analyzing structural forms in native forms of proteins based on the fuzzy oil drop model, one can also determine the status of polypeptide chain fragments with respect to the assumptions of this model. Whether the probability distributions for both compliant and noncompliant forms were similar or whether the tetrapeptide sequences showed some differences at a level of a set of structural codes was investigated. The analysis presented here indicated that some sequences in both forms revealed differences in probability distributions expressed as a negative statistically significant correlation coefficient. This meant that the identified sections (tetrapeptides) took different forms against the fuzzy oil drop model. It may suggest that the information of the final status with respect to hydrophobic core formation is already carried by the structure of the early-stage intermediate.Entities:
Keywords: early stage; hydrophobicity; protein folding; structural codes
Year: 2019 PMID: 31842350 PMCID: PMC6995543 DOI: 10.3390/biom9120866
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1Determination of early intermediate structural codes: (A) low-energy part of a Ramachandran plot; (B) relationship between the radius of a curvature (In(R)) and the angle of an aperture between adjacent peptide bond planes (dihedral angle) (V angle). Structures fulfilling the condition in the form of an idealized relationship (according to the regression function) located on the Ramachandran map form an elliptical path; (C) locations of structures meeting the conditions resulting from the approximation function—relaxed structural forms; (D) determination of the elliptical path by approximation; (E) relation of a secondary structure to the elliptic curve.
Figure 2The probability distribution of the Phie and Psie angles. The blue line represents the histogram of the number of occurrences of individual t angles for intervals of one degree. The vertical red lines denote successive local maxima, of which structural codes are given below (red letters). The angle was calculated clockwise along the ellipse, starting from the point in the lower right quadrant of the Ramachandran map.
Boundaries of intervals corresponding to structural codes (local maxima) expressed by the angle values in the parametric equation of the ellipse.
| Code | From | To |
|---|---|---|
| A | 0 | 50 |
| B | 51 | 85 |
| C | 86 | 110 |
| D | 111 | 150 |
| E | 151 | 193 |
| F | 194 | 225 |
| G | 226 | 359 |
A set of tetrapeptide sequences showing the opposite distribution of structural codes for comparing the status compliant or noncompliant with the status expected by the fuzzy oil drop model.
| A | C | D | E | F | G | H | I | K | L | N | P | Q |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AENN | CVAS | DADE | EFYT | FGAD | GADE | HAEN | IAPV | KDFT | LATP | NADS | PAVG | QIST |
Figure 3Radically different levels of self-hydrophobicity of amino acids in tetrapeptides (sequences given in the legend).
Figure 4Parabolic change in the self-hydrophobicity of tetrapeptide amino acids. Sequences are given in the legend.