| Literature DB >> 22745680 |
Agnel Praveen Joseph1, Hélène Valadié, Narayanaswamy Srinivasan, Alexandre G de Brevern.
Abstract
The constant increase in the number of solved protein structures is of great help in understanding the basic principles behind protein folding and evolution. 3-D structural knowledge is valuable in designing and developing methods for comparison, modelling and prediction of protein structures. These approaches for structure analysis can be directly implicated in studying protein function and for drug design. The backbone of a protein structure favours certain local conformations which include α-helices, β-strands and turns. Libraries of limited number of local conformations (Structural Alphabets) were developed in the past to obtain a useful categorization of backbone conformation. Protein Block (PB) is one such Structural Alphabet that gave a reasonable structure approximation of 0.42 Å. In this study, we use PB description of local structures to analyse conformations that are preferred sites for structural variations and insertions, among group of related folds. This knowledge can be utilized in improving tools for structure comparison that work by analysing local structure similarities. Conformational differences between homologous proteins are known to occur often in the regions comprising turns and loops. Interestingly, these differences are found to have specific preferences depending upon the structural classes of proteins. Such class-specific preferences are mainly seen in the all-β class with changes involving short helical conformations and hairpin turns. A test carried out on a benchmark dataset also indicates that the use of knowledge on the class specific variations can improve the performance of a PB based structure comparison approach. The preference for the indel sites also seem to be confined to a few backbone conformations involving β-turns and helix C-caps. These are mainly associated with short loops joining the regular secondary structures that mediate a reversal in the chain direction. Rare β-turns of type I' and II' are also identified as preferred sites for insertions.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22745680 PMCID: PMC3382195 DOI: 10.1371/journal.pone.0038805
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1PBs series of φ,ψ backbone dihedral angles.
For each PB the series of 8 dihedral angles (ψi−2, φi−1,ψi−1, φi,ψi, φi+1,ψi+1, φi+2), numbered from 1 to 8, are plotted. i indicates the position of an amino acid in the protein.
Figure 2Association examples of PBs with secondary structural elements.
Protein fragments (A-C) were chosen to highlight some frequently occurring PB transitions. These fragments are shown in a cartoon view distinguishing different secondary structure elements as assigned by PyMol [114]. The PB series corresponding to the local conformation of the fragment are labelled.
Association of PB with secondary structures.
| H | G | E | BTI | BTII | BTIV | BTVIII | BTI’ | BTII’ | C | GTINV | AG | AC | |
|
| 25.4 | 14.4 | 17.0 | 2.2 | 1.8 | 29.5 | 1.5 | 4.0 | |||||
|
| 18.1 | 13.2 | 14.6 | 8.7 | 1.2 | 35.8 | 2.3 | 2.0 | |||||
|
| 0.7 | 58.3 | 6.1 | 6.2 | 1.9 | 21.2 | 2.2 | ||||||
|
| 80.4 | 0.8 | 14.4 | 1.2 | |||||||||
|
| 62.5 | 12.5 | 11.3 | 10.3 | |||||||||
|
| 38.0 | 11.6 | 10.3 | 3.6 | 31.2 | 2.3 | |||||||
|
| 6.2 | 12.8 | 13.8 | 17.1 | 10.1 | 16.9 | 3.6 | 16.4 | 1.7 | ||||
|
| 1.6 | 27.2 | 24.4 | 31.7 | 2.1 | 9.8 | |||||||
|
| 7.2 | 35.1 | 38.6 | 15.0 | |||||||||
|
| 8.6 | 2.9 | 10.0 | 3.2 | 3.8 | 22.9 | 9.1 | 32.5 | 1.7 | ||||
|
| 37.1 | 11.1 | 23.5 | 2.3 | 18.0 | 5.5 | |||||||
|
| 49.1 | 13.0 | 13.5 | 2.3 | 14.0 | 1.9 | 4.3 | ||||||
|
| 90.4 | 2.6 | 2.5 | 1.7 | 2.3 | ||||||||
|
| 66.3 | 6.4 | 6.7 | 10.3 | 7.1 | ||||||||
|
| 20.6 | 5.0 | 15.5 | 5.2 | 20.0 | 1.5 | 29.4 | ||||||
|
| 8.3 | 10.8 | 1.4 | 16.7 | 3.1 | 14.7 | 0.8 | 0.9 | 38.5 | 1.2 |
The percentage of different secondary structures (assigned by PROMOTIF) found associated with each PB is given. Only the secondary structures with percentage occurrence greater than 0.5% are given. The PBs are listed in the beginning of each row and the secondary structure type is given as header for each column. Abbreviation of PROMOTIF assignments: BTX – β-turns, X is the type of β-turn, AG – Antiparallel strands, G1 type β-bulge, where the first residue is in the left handed helical conformation (usually Glycine), AC – Antiparallel strands, Classic type beta bulge, one extra residue forms the bulge, GTINV – Inverse γ-turns (φ = −79.0±40,ψ = 69.0±40).
Figure 3PB substitutions.
(A) The variation in substitution score in the PB substitution matrix is highlighted using a colour-code, as shown. (B) The series of dihedral angles (ψi−2, φi−1,ψi−1, φi,ψi, φi+1,ψi+1, φi+2), associated with the PB substitutions (p, g) and (p,i) and (C) (b,i). These represent some of the preferred local conformational changes (D) Hierarchical clustering of PBs based on the similarity of dihedral angles, measured in terms of angular rmsd. The PBs frequently associated with helices are in red, those found often with beta strands are in blue and the rest are in green (E) Clustering of PBs based on the substitution pattern associated with each PB (see ).
Figure 4Clustering PBs based on substitution patterns.
(A) Distribution of accessible and buried PBs classified based on different accessibility cut-offs of 7%,15% and 25%. Ratio of frequency of exposed PBs to that of buried, plotted for each of the 16 PBs (B) Hierarchical clustering of PBs classified as exposed (B) and buried (C) at an accessibility cut-off of 15%. The clustering is based on the correlation of substitution scores.
Figure 5PB relationship in each SCOP class derived based on the substitution pattern.
(A–D) Hierarchical clustering of PBs based on substitution patterns specific for each SCOP class. The clusters correspond to relationships observed in all-α (A), all-β (B), α/β (C) and α+β (D) classes.
Figure 6Comparison of class-specific PB substitution scores with the global distribution (global substitution matrix).
The differences in the PB substitution scores specific for the all-α (A), all-β (B), α/β (C) and α+β (D) classes, with respect to the global matrix, are plotted. The correlation coefficients obtained by performing row-wise comparisons (class-specific PB substitution patterns vs Global) are also indicated adjacent to the difference matrices.
Figure 7PB substitutions highly preferred in certain SCOP classes.
The cases where the class-specific substitution scores associated with each PB (each row in the substitution matrix) has a correlation less than 0.95 when compared to the global matrix, were looked into. The absolute differences (class specific vs Global) of substitution patterns (respective rows) were plotted as a boxplot, to identify outliers. Substitution scores lying outside a 1.5 inter-quartile range (IQR), were considered as outliers or significantly different from the global substitutions. For the all-α class, (A) the plots are generated for PBs a and g. (B) highlights examples of backbone conformations corresponding to substitutions detected as outliers. Similarly, boxplots were generated for the all-β class (C) and the examples of significantly different substitutions are shown (D).
Preferred indel sites in different SCOP classes.
| SCOPClass | InsertLength | Insert sitePBs(i,i+1) | PB series | Promotifassignment | φi,ψi; φi+1,ψi+1 |
|
| 1 | MN | mmMNop (97) | Helix C-cap | −65.54, −38.88; −66.34, −29.51 |
| 2 | CF | mpCFkl (79) | Coil | −106.09, 133.56; −96.68, 140.72 | |
| CC | mpCCdf (98) | Coil | −106.09, 133.56; −106.09, 133.56 | ||
| 4 | MB | moMBdc (27) | BTVIII | −65.54, −38.88; −92.21, −18.06 | |
| 5+ | PA | noPAfk (78) | Helix C-cap | 59.85, 21.51; −99.80, 131.88 | |
|
| 1 | BD | dfBDeh (21) | BTIV | −92.21, −18.06; −114.79, 140.11 |
| PA | koPAcd (52) | BTI, HP3:5, A G | 59.85, 21.51; −99.80, 131.88 | ||
| KO | dfKOpa (98) | BTI, HP3:5, A G | −59.35, −29.23; −87.27, 5.13 | ||
| 2 | JA | ehJAcc (97) | BTII’, HP2:2 | 82.88, 150.05; −99.80, 131.88 | |
| JB | ehJBcc (98) | BTII’ | 82.88, 150.05; −92.21, −18.06 | ||
| 3 | HI | eeHIaf (45) | BTI’,HP2:2 | −67.91, 121.55; 77.85, 10.42 | |
| KO | dfKOpa (93) | BTI, HP3:5, A G | −59.35, −29.23; −87.27, 5.13 | ||
| 4 | HI | eeHIaf (66) | BTI’,HP2:2 | −67.91, 121.55; 77.85, 10.42 | |
| 5+ | HI | eeHIaf (57) | BTI’,HP2:2 | −67.91, 121.55; 77.85, 10.42 | |
| JA | ehJAcf (59) | BTIV, GTCLA, A C, HP2:2I/2:4 | 82.88, 150.05; −99.80, 131.88 | ||
| KB | dfKBcc (93) | BTI | −59.35, −29.23; −92.21, −18.06 | ||
|
| 1 | PA | noPAcd (47) | Helix C-cap | 59.85, 21.51; −99.80, 131.88 |
| NO | mmNOpa (89) | Helix C-cap | −66.34, −29.51; −87.27, 5.13 | ||
| AC | opACdd (89) | Coil | −99.80, 131.88; −106.09, 133.56 | ||
| 2 | PA | noPAcd (55) | Helix C-cap | 59.85, 21.51; −99.80, 131.88 | |
| MB | mmMBcc (81) | BT1 | −65.54, −38.88; −92.21, −18.06 | ||
| 3 | PA | noPAcd (64) | Helix C-cap | 59.85, 21.51; −99.80, 131.88 | |
| 4 | HI | eeHIac (66) | BTI’,HP2:2 | −67.91, 121.55; 77.85, 10.42 | |
| PA | noPAcd (31) | Helix C-cap | 59.85, 21.51; −99.80, 131.88 | ||
| 5+ | HI | eeHIac (57) | BTI’,HP2:2 | −67.91, 121.55; 77.85, 10.42 | |
| PA | noPAcd (34) | Helix C-cap | 59.85, 21.51; −99.80, 131.88 | ||
|
| 1 | OP | mnOPad (30) | Helix C-cap | −87.27, 5.13; 59.85, 21.51 |
| NO | mmNOpa (86) | Helix C-cap | −66.34, −29.51; −87.27, 5.13 | ||
| 2 | PA | noPAcd (65) | Helix C-cap | 59.85, 21.51; −99.80, 131.88 | |
| 3 | PA | noPAfk (82) | Helix C-cap | 59.85, 21.51; −99.80, 131.88 | |
| 4 | KB | dfKBcc (62) | BT1 | −59.35, −29.23; −92.21, −18.06 | |
| HI | eeHIac (67) | BTI’,HP2:2 | −67.91, 121.55; 77.85, 10.42 | ||
| 5+ | HI | eeHIac (52) | BTI’,HP2:2 | −67.91, 121.55; 77.85, 10.42 |
The PB bounds (di-PBs) that act as sites for insertions/deletions of different lengths are listed. To obtain a better picture of the local fold, the two PBs that are seen on both sides of the indel site were also analysed. The most frequent series are listed and their occurrence frequencies are given in parentheses. PROMOTIF [42] was used for assignment of the local fold corresponding to these frequent PB series. Those regions assigned as coils and are usually found as capping motifs, are labelled as ‘caps’. The following are the local fold definitions implied by the PROMOTIF assignment abbreviations: (see also Table 1). HPX:Y – β-hairpins, X and Y indicate the number of residues in loop, based on two different rules [42], GTCLA – Classic γ-turns (φ = 75.0±40,ψ = −64.0±40).
Figure 8Preferred local structure for indel events.
The di-PBs that bind the site of insertions are shown in the context of secondary structure definition. Parts of four domain structures (A–D) are used to highlight the indel sites.
Figure 9Percentage gain in alignments with better rmsd.
Alignment obtained by using class specific PB substitution matrices were compared with that of the global matrix. The percentage of alignments in the dataset with better rmsd is plotted. The performance of each class specific SM in each class is highlighted using different colours.