| Literature DB >> 16351757 |
Maria Vittoria Cubellis1, Fabien Cailliez, Simon C Lovell.
Abstract
BACKGROUND: Secondary structure is used in hierarchical classification of protein structures, identification of protein features, such as helix caps and loops, for fold recognition, and as a precursor to ab initio structure prediction. There are several methods available for assigning secondary structure if the three-dimensional structure of the protein is known. Unfortunately they differ in their definitions, particularly in the exact positions of the termini. Additionally, most existing methods rely on hydrogen bonding, which means that important secondary structural classes, such as isolated beta-strands and poly-proline helices cannot be identified as they do not have characteristic hydrogen-bonding patterns. For this reason we have developed a more accurate method for assigning secondary structure based on main chain geometry, which also allows a more comprehensive assignment of secondary structure.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16351757 PMCID: PMC1866377 DOI: 10.1186/1471-2105-6-S4-S8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Position of helix N-caps. The letter under the position number corresponds to the assignment of the residue relatively to the concerned helix. Thus the position 1 corresponds to the first helical residue and the N-cap residue should correspond to the position 0. Absolute counts and percentages are given.
| Position of the N-cap | -1 C | 0 C | 1 H | 2 H | 3 H |
| Segno | 8 (0.8%) | 1049 (99.2%) | 0 (0%) | 0 (0%) | 0 (0%) |
| Stride | 29 (2,9%) | 1016 (97.1%) | 0 (0%) | 0 (0%) | 0 (0%) |
| DSSP | 32 (3.1%) | 1013 (96.9%) | 0 (0%) | 1 (0.1%) | 1 (0.1%) |
C-caps of alpha helices: position of the first residue with φ > 0. The letter under the position number corresponds to the assignment of the residue relatively to the helix concerned. Thus the position -1 is the last helical residue and should correspond to the C-cap, while the position 0 must correspond to the first residue with a positive φ.
| Position of the φ > 0 residue | -3 H | -2 H | -1 H | 0 C | 1 C | 2 C | 3 C |
| Segno | 0 (0%) | 0 (0%) | 0 (0%) | 2458 (81%) | 277 (9,1%) | 182 (6%) | 119 (3,9%) |
| Stride | 3 (0,1%) | 0 (0%) | 1 (0%) | 2335 (84,8%) | 183 (6,6%) | 124 (4,5%) | 107 (3,9%) |
| DSSP | 13 (0.4%) | 14 (0.5%) | 30 (1%) | 1911 (63,9%) | 709 (23.7%) | 193 (6,5%) | 119 (4%) |
Figure 1Curvature in secondary structures for (a) α-helices and (b) β-strands for the three programs' definitions. The x-axis represents curvature in degrees in a given bin, with the y-axis representing the percentage of residues in that bin. In (a) the bins >170° are omitted to show the distribution in the more distorted region more clearly.
Figure 2Examples of secondary structural elements showing extreme curvature, which are most likely to be due to mis-assignment of the secondary structure (a) 1qb7 (b) 1myr (c) and (d) 2baa. (c) and (d) show different views of the same region of the protein. In all cases all residues shown are assigned to be in continuous secondary structures by DSSP and STRIDE. SEGNO assignments are shown in black. Hydrogen bonds are shown in red. Figures are shown as stereo pairs.
Figure 3Secondary-structure guided sequence alignments for SEGNO and DSSP over a range of sequence similarities. Bars indicate proportion of cases where SEGNO gives a more accurate alignment than DSSP (dark bars) or where DSSP gives a more accurate alignment than SEGNO (light bars). Total number of observations in each bin are: 15–25%, 122; 25–35%, 135; 35–45%, 28.