| Literature DB >> 24565061 |
Bahar Akbal-Delibas, Filip Jagodzinski, Nurit Haspel.
Abstract
BACKGROUND: Certain amino acids in proteins play a critical role in determining their structural stability and function. Examples include flexible regions such as hinges which allow domain motion, and highly conserved residues on functional interfaces which allow interactions with other proteins. Detecting these regions can aid in the analysis and simulation of protein rigidity and conformational changes, and helps characterizing protein binding and docking. We present an analysis of critical residues in proteins using a combination of two complementary techniques. One method performs in-silico mutations and analyzes the protein's rigidity to infer the role of a point substitution to Glycine or Alanine. The other method uses evolutionary conservation to find functional interfaces in proteins.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24565061 PMCID: PMC3952096 DOI: 10.1186/1472-6807-13-S1-S6
Source DB: PubMed Journal: BMC Struct Biol ISSN: 1472-6807
Figure 1The cartoon rendering of the crystal structure of staphylococcal nuclease (PDB ID 1stn), refined at 1.7Å resolution, is shown in (a). KINARI-Web was used to calculate the protein's rigidity properties, visualized in (b); color clusters represent atoms that are rigidly connected (only clusters made up of more than 15 atoms are shown). KINARI uses chemical bonds and stabilizing interactions to identify bodies, which are sets of atoms rigidly attached to each other. The three atoms identified in the purple region in (c) form a rigid body because the covalent bonds and distances imposed by angle constraints (shown as dotted lines) remove all degrees of freedom among the three atoms. The rigid units are used to construct a mechanical model of the molecule, in which two rigid bodies that have a rotatable bond in common are represented as hinges, as shown in yellow in (d). In (d), rigid body 1 corresponds to atoms N1, C1, and C in (c), and rigid body 2 corresponds to atoms C1, C, and O; the two bodies share the covalent bond between C1and C. The pebble game paradigm associates rigid bodies of a mechanical model to a node in a graph, a hinge in the mechanical model as 5 bars between two nodes, and bars, which represent constraints such as hydrogen bonds and hydrophobic interactions in the mechanical model, to single bars among nodes in the graph. The pebble game algorithm is used to analyze the graph, the results of which are used to infer rigid clusters of atoms in the biomolecule. A complete explanation of how modeling is performed by KINARI is described in [16].
Critical residue analysis by both methods, for 42 proteins, with mutations to glycine
| PDB ID | No. residues | % Critical by Conservation | % Critical by Rigidity | % Critical by both methods | % Non-critical by both methods | % Total match | % Critical Only by Conservation | % Critical Only by Rigidity |
|---|---|---|---|---|---|---|---|---|
| 64 | 50.0 | 25.0 | 15.6 | 40.6 | 56.3 | 34.4 | 9.4 | |
| 80 | 52.5 | 16.3 | 13.8 | 42.5 | 56.3 | 38.8 | 2.5 | |
| 63 | 46.0 | 6.3 | 3.2 | 50.8 | 54.0 | 42.9 | 3.2 | |
| 82 | 41.5 | 19.5 | 3.7 | 42.7 | 46.3 | 37.8 | 15.9 | |
| 74 | 52.7 | 8.1 | 5.4 | 44.6 | 50.0 | 47.3 | 2.7 | |
| 62 | 41.9 | 29.0 | 11.3 | 40.3 | 51.6 | 30.6 | 17.7 | |
| 45 | 88.9 | 8.9 | 8.9 | 11.1 | 20.0 | 80.0 | 0.0 | |
| 91 | 64.8 | 18.7 | 16.5 | 33.0 | 49.5 | 48.4 | 2.2 | |
| 90 | 47.8 | 7.8 | 4.4 | 33.3 | 37.8 | 43.3 | 3.3 | |
| 63 | 66.7 | 30.2 | 17.5 | 20.6 | 38.1 | 49.2 | 12.7 | |
| 71 | 56.3 | 15.5 | 9.9 | 38.0 | 47.9 | 46.5 | 5.6 | |
| 80 | 56.3 | 10.0 | 6.3 | 40.0 | 46.3 | 50.0 | 3.8 | |
| 61 | 55.7 | 21.3 | 16.4 | 39.3 | 55.7 | 39.3 | 4.9 | |
| 95 | 53.7 | 5.3 | 2.1 | 43.2 | 45.3 | 51.6 | 3.2 | |
| 97 | 55.7 | 35.1 | 24.7 | 34.0 | 58.8 | 30.9 | 10.3 | |
| 96 | 52.1 | 16.7 | 10.4 | 41.7 | 52.1 | 41.7 | 6.3 | |
| 93 | 48.4 | 4.3 | 1.1 | 48.4 | 49.5 | 47.3 | 3.2 | |
| 64 | 53.1 | 20.3 | 14.1 | 40.6 | 54.7 | 39.1 | 6.3 | |
| 77 | 51.9 | 13.0 | 7.8 | 42.9 | 50.6 | 44.2 | 5.2 | |
| 36 | 63.9 | 22.2 | 19.4 | 33.3 | 52.8 | 44.4 | 2.8 | |
| 46 | 78.3 | 6.5 | 6.5 | 21.7 | 28.3 | 71.7 | 0.0 | |
| 56 | 50.0 | 12.5 | 10.7 | 42.9 | 53.6 | 39.3 | 1.8 | |
| 96 | 55.2 | 13.5 | 9.4 | 40.6 | 50.0 | 45.8 | 4.2 | |
| 58 | 56.9 | 17.2 | 8.6 | 34.5 | 43.1 | 48.3 | 8.6 | |
| 81 | 56.8 | 19.8 | 12.3 | 35.8 | 48.1 | 44.4 | 7.4 | |
| 43 | 53.5 | 27.9 | 16.3 | 34.9 | 51.2 | 37.2 | 11.6 | |
| 64 | 54.7 | 4.7 | 1.6 | 42.2 | 43.8 | 53.1 | 3.1 | |
| 56 | 41.1 | 23.2 | 12.5 | 48.2 | 60.7 | 28.6 | 10.7 | |
| 58 | 50.0 | 29.3 | 22.4 | 43.1 | 65.5 | 27.6 | 6.9 | |
| 60 | 53.3 | 25.0 | 13.3 | 35.0 | 48.3 | 40.0 | 11.7 | |
| 92 | 59.8 | 4.3 | 2.2 | 38.0 | 40.2 | 57.6 | 2.2 | |
| 67 | 56.7 | 6.0 | 6.0 | 43.3 | 49.3 | 50.7 | 0.0 | |
| 66 | 50.0 | 27.3 | 19.7 | 42.4 | 62.1 | 30.3 | 7.6 | |
| 81 | 53.1 | 29.6 | 18.5 | 35.8 | 54.3 | 34.6 | 11.1 | |
| 72 | 44.4 | 20.8 | 9.7 | 43.1 | 52.8 | 34.7 | 11.1 | |
| 94 | 54.3 | 20.2 | 16.0 | 41.5 | 57.4 | 38.3 | 4.3 | |
| 77 | 51.9 | 11.7 | 10.4 | 46.8 | 57.1 | 41.6 | 1.3 | |
| 99 | 44.4 | 12.1 | 5.1 | 48.5 | 53.5 | 39.4 | 7.1 | |
| 78 | 62.8 | 20.5 | 16.7 | 33.3 | 50.0 | 46.2 | 3.8 | |
| 86 | 31.4 | 10.5 | 4.7 | 60.5 | 65.1 | 26.7 | 5.8 | |
| 87 | 47.1 | 16.1 | 5.7 | 41.4 | 47.1 | 41.4 | 10.3 | |
| 99 | 49.5 | 12.1 | 7.1 | 45.5 | 52.5 | 42.4 | 5.1 |
Figure 2The cartoon rendering of Crambin (PDB ID 1crn) is colored in gray. Known critical residues based on experimental data (a) and critical residues detected by conservation analysis (b) are depicted as spheres. Different colors represent different residues.
Figure 3Comparisons of Rigidity Analysis and Conservation Analysis against experimental data for Lysozyme. The protein's 164 residues (divided into 4 subplots for convenience) are indicated on the x-axis. The upper line (labeled 1) designates a residue as critical, and the lower line (labeled 0) designates a residue as non-critical. A red circle is drawn on the upper line to indicate that the residue is experimentally known to be critical, or on the lower line to indicate that it is experimentally known to be not critical. Blue diamonds and ×s indicate whether conservation analysis or rigidity analysis, respectively, identified that residue as critical or non-critical. Residues that have a red circle, blue diamond, and × on the same line are locations for which the conservation analysis and rigidity analysis methods match the experimental data.
Residues that are correctly identified as critical only by the rigidity analysis (top 5) or conservation analysis (bottom 14).
| PDB ID | ΔΔ | Change to LRB upon | ||
|---|---|---|---|---|
| D95 | - | -3.1 | 5 | |
| T9 | very | -2.6 | 7 | |
| S68 | - | -2.4 | 12 | |
| V16 | very | -1.18 | 9 | |
| V8 | very | -1.0 | 6 | |
| L37 | very | -3.9 | 0 | |
| T62 | - | -3.4 | 0 | |
| A276 | slight | -1.5 | 0 | |
| A52 | slight | -2.7 | 0 | |
| A84 | slight | -1.25 | 0 | |
| V45 | very | -1.9 | 0 | |
| L36 | very | -5.4 | 0 | |
| V54 | very | -4.87 | 0 | |
| P93 | - | -2.6 | 0 | |
| P103 | - | -0.1 | 0 | |
| P114 | - | -3.6 | 0 | |
| P71 | - | -1.6 | 0 | |
| P97 | - | -1.2 | 0 | |
| V13 | very | -9.3 | 0 |
The ΔΔG column is the experimental data for physical point mutations to glycine at the specified wild-type residue. LRB=Largest Rigid Body. WT=Wild Type.
Rigidity analysis and conservation score analysis for proteins with residue mutations to glycine.
| PDB ID | ΔΔ | Critical by Conservation Score Analysis | Detected critical by Conservation Score or Rigidity Analysis | No. of Binding Partners | |||
|---|---|---|---|---|---|---|---|
| N43G | 0.0 | -5.7 | 1.39 | Yes | True Positive | 0 | |
| Y35G | 14.74 | -5.0 | 0.0 | Yes | True Positive | 2 | |
| V2G | 191.52 | -2.3 | 0.0 | No | False Negative | 0 | |
| V74G | 156.35 | -0.22 | 0.0 | No | False Negative | 1 | |
| V110G | 181.77 | 0.48 | 1.93 | No | False positive | 0 | |
| P71G | 72.63 | -1.6 | 0.29 | Yes | True positive | 1 | |
| P103G | 146.25 | -0.1 | 0.37 | Yes | True positive | 0 | |
| K95G | 142.44 | 1.7 | 0.0 | No | True Negative | 0 |
LRB=Largest Rigid Body. WT = wild type
Rigidity analysis and conservation score analysis for protein 1bpi with residue mutations to alanine.
| PDB ID | Mutation (WT, residue number, mutant) | WT Residue SASA (Å2) | ΔΔG | % Decrease of LRB upon | Critical by Conservation Score Analysis | Detected critical by Conservation Score or Rigidity Analysis | No. of Binding Partners |
|---|---|---|---|---|---|---|---|
| K46A | 177.11 | 0.1 | 0 | No | False Negative | 2 | |
| R53A | 174.71 | -0.1 | 0 | Yes | True Positive | 2 | |
| T54A | 68.66 | -0.1 | 1.3944223108 | No | True Positive | 2 | |
| T32A | 114.38 | -0.1 | 0 | No | False Negative | 2 | |
| E49A | 116.65 | -0.2 | 0 | No | False Negative | 1 | |
| G56A | 20.42 | -0.2 | 0 | No | False Negative | 2 | |
| G57A | 39.32 | -0.2 | 0 | No | False Negative | 0 | |
| R17A | 211.65 | -0.3 | 0 | No | False Negative | 5 | |
| K15A | 196.87 | -0.4 | 0 | No | False Negative | 5 | |
| K41A | 105.59 | -0.4 | 0 | Yes | True Positive | 2 | |
| D50A | 51.92 | -0.4 | 0 | No | False Negative | 1 | |
| R42A | 167.75 | -0.5 | 3.5856573705 | No | True Positive | 2 | |
| Q31A | 79.04 | -1.0 | 1.9920318725 | No | True Positive | 1 | |
| G28A | 41.29 | -1.0 | 0 | No | False Negative | 1 | |
| Y35A | 14.74 | -1.1 | 0 | Yes | False Negative | 2 | |
| P13A | 70.66 | -1.2 | 0 | Yes | True Positive | 4 | |
| Y10A | 73.8 | -1.2 | 0 | No | False Negative | 1 | |
| V34A | 117.65 | -1.2 | 0 | No | False Negative | 3 | |
| I18A | 98.24 | -1.5 | 0 | No | False Negative | 4 | |
| S47A | 35.24 | -1.6 | 0.796812749 | Yes | True Positive | 1 | |
| M52A | 122.96 | -1.7 | 0 | No | False Negative | 2 | |
| G12A | 16.54 | -1.8 | 0 | No | False Negative | 4 | |
| R20A | 36.99 | -1.8 | 12.9482071713 | Yes | True Positive | 2 | |
| F22A | 21.02 | -2.0 | 2.5896414343 | Yes | True Positive | 0 | |
| G36A | 0.25 | -2.1 | 0 | Yes | True Positive | 4 | |
| I19A | 158 | -2.1 | 0 | No | False Negative | 3 | |
| N24A | 35.71 | -2.2 | 2.7888446215 | Yes | True Positive | 0 | |
| G37A | 36.14 | -2.3 | 0 | Yes | True Positive | 4 | |
| N44A | 19.98 | -3.3 | 3.5856573705 | Yes | True Positive | 2 |
LRB=Largest Rigid Body. WT=wildtype. The table rows are ordered by ΔΔG; the mutations that are least destabilizing are at the top of the table, while the mutations that are most destabilizing are towards the bottom of the table.
Rigidity analysis and conservation score analysis for protein 2abd and 1csp, with residue mutations to alanine.
| PDB ID | Mutation (WT, residue number, mutant) | WT Residue SASA (Å2) | ΔΔ | Critical by Conservation Score Analysis | Detected critical by Conservation Score or Rigidity Analysis | |
|---|---|---|---|---|---|---|
| E67A | 99.97 | -0.36 | 0 | No | False Negative | |
| F17A | 57.18 | -0.81 | 4.33 | No | True Positive | |
| K54A | 49.09 | -0.86 | 1.98 | Yes | True Positive | |
| F27A | 70.65 | -0.89 | 0 | Yes | True Positive | |
| F15A | 50.5 | -0.96 | 2.96 | No | True Positive | |
| K32A | 63.06 | -1.02 | 1.18 | Yes | True Positive | |
| L25A | 15.81 | -1.02 | 3.76 | Yes | True Positive | |
| P44A | 49.62 | -1.04 | 3.06 | Yes | True Positive | |
| P19A | 5.59 | -1.07 | 0 | Yes | True Positive | |
| T35A | 51.06 | -1.09 | 0.69 | Yes | True Positive | |
| V77A | 8.94 | -1.14 | 0.59 | Yes | True Positive | |
| V12A | 8.78 | -1.69 | 2.37 | Yes | True Positive | |
| Y28A | 50.63 | -2.47 | 1.28 | Yes | True Positive | |
| L15A | 0.0 | -3.1 | 1.18 | Yes | True Positive | |
| Q33A | 1.59 | -3.66 | 0.99 | Yes | True Positive | |
| L80A | 3.15 | -3.7 | 3.26 | Yes | True Positive | |
| Y73A | 4.5 | -4.83 | 1.28 | Yes | True Positive |
LRB=Largest Rigid Body. WT=Wild Type. The table rows are ordered by ΔΔG; the mutations that are least destabilizing are at the top of the table, while the mutations that are most destabilizing are towards the bottom of the table.