| Literature DB >> 27077847 |
Marharyta Petukh1, Luogeng Dai2,3, Emil Alexov4.
Abstract
Predicting the effect of amino acid substitutions on protein-protein affinity (typically evaluated via the change of protein binding free energy) is important for both understanding the disease-causing mechanism of missense mutations and guiding protein engineering. In addition, researchers are also interested in understanding which energy components are mostly affected by the mutation and how the mutation affects the overall structure of the corresponding protein. Here we report a webserver, the Single Amino Acid Mutation based change in Binding free Energy (SAAMBE) webserver, which addresses the demand for tools for predicting the change of protein binding free energy. SAAMBE is an easy to use webserver, which only requires that a coordinate file be inputted and the user is provided with various, but easy to navigate, options. The user specifies the mutation position, wild type residue and type of mutation to be made. The server predicts the binding free energy change, the changes of the corresponding energy components and provides the energy minimized 3D structure of the wild type and mutant proteins for download. The SAAMBE protocol performance was tested by benchmarking the predictions against over 1300 experimentally determined changes of binding free energy and a Pearson correlation coefficient of 0.62 was obtained. How the predictions can be used for discriminating disease-causing from harmless mutations is discussed. The webserver can be accessed via http://compbio.clemson.edu/saambe_webserver/.Entities:
Keywords: MM/PBSA method; binding free energy; energy calculation; missense mutations
Mesh:
Substances:
Year: 2016 PMID: 27077847 PMCID: PMC4849003 DOI: 10.3390/ijms17040547
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Pearson correlation coefficient (R), the y-intercept and the slope of linear fit between experimental and predicted binding free energy changes. The number of cases is provided in parentheses. The results are shown for different types of mutations and summarized at the bottom for all mutations. The results of 5-fold cross validation test are shown as well.
| Type of Mutaton and Mutation Site Location | Pearson Correlation Coefficient ( | Slope | |
|---|---|---|---|
| Large-to-Small (173) | 0.489 | 0.328 | 0.692 |
| ALA-scanning (577) | 0.488 | 0.268 | 0.695 |
| COR, SUP (807) | 0.461 | 0.351 | 0.813 |
| RIM, SUR, INT (518) | 0.478 | −0.024 | 1.023 |
| ALL (1326) | 0.624 (0.716 ±2SD, 0.603 CV) | 1 | −2.31 × 10−5 |
±2SD: within two standard deviations; CV: 5-fold cross validation test. ALA: Alanine residue; COR: core; SUP: support; RIM: rim; SUR: surface; and INT: interior type of mutation site.
Figure 1The computational time (mean value) of the Single Amino Acid Mutation based change in Binding free Energy (SAAMBE) algorithm as a function of the number of residues in the protein complex. Both axes are in log scale. WT: wild type complexes.
Figure 2Benchmarking of statistical parameters as a function of various definitions (see Materials and Method section for details).
List of mutations and their positions, the effect provided by ClinVar, calculated ΔΔΔG and the location of mutation site within protein interface (for interfacial regions definition, see Methods). WT: wild type; MT: mutant type complexes.
| WT Residue | Position | MT Residue | Effect | ΔΔΔG | Location |
|---|---|---|---|---|---|
| Q | 36 | L | Disease | 0.00 | SUP |
| Y | 38 | H | Disease | −0.42 | INT |
| K | 41 | E | Disease | 1.04 | SUR |
| K | 41 | I | Disease | 0.29 | SUR |
| D | 46 | G | Disease | −1.25 | SUR |
| S | 52 | N | Disease | 0.00 | RIM-COR |
| R | 55 | K | Disease | 1.29 | COR |
| C | 63 | W | Disease | 0.07 | INT-SUP |
| K | 64 | I | Disease | 4.50 | COR |
| I | 70 | V | Unclassified | 0.42 | INT |
| K | 84 | E | Polymorphism | 0.81 | SUR |
| P | 136 | L | Disease | −0.06 | INT |
| V | 137 | I | Disease | 0.79 | SUP |
| H | 138 | R | Disease | −1.32 | COR |
List of mutations and their positions, the effect provided by ClinVar, calculated ΔΔΔG. ΔΔΔG(A) and ΔΔΔG(B) indicate calculations made with mutation introduced in chain A and B, respectively. ΔΔΔG(A + B) is the sum of both predictions.
| WT Residue | Position | MT Residue | Effect | ΔΔΔG(A) | ΔΔΔG(B) | ΔΔΔG(A + B) |
|---|---|---|---|---|---|---|
| I | 74 | T | Disease | 0.50 | 0.29 | 0.79 |
| R | 134 | S | Polymorphism | 0.32 | 0.64 | 0.96 |
| C | 135 | R | Disease | 0.48 | 0.01 | 0.48 |
| W | 148 | R | Unclassified | 0.13 | 0.74 | 0.88 |
| A | 150 | P | Disease | 0.52 | 0.74 | 1.27 |
| A | 175 | D | Disease | 0.56 | 0.52 | 1.08 |
| C | 178 | R | Disease | 0.46 | 0.00 | 0.46 |
| P | 185 | R | Disease | 0.03 | −0.01 | 0.03 |
| E | 207 | Q | Polymorphism | 0.35 | 0.15 | 0.49 |
| V | 222 | F | Disease | 0.32 | 0.61 | 0.93 |
| L | 229 | P | Disease | 0.86 | 0.85 | 1.72 |
| L | 257 | P | Disease | 0.80 | 0.69 | 1.49 |
| I | 268 | N | Polymorphism | 0.42 | 0.54 | 0.96 |
| L | 284 | P | Disease | 0.85 | 0.85 | 1.69 |
| R | 304 | Q | Disease | 0.39 | 0.22 | 0.61 |
| R | 304 | W | Disease | 0.27 | 0.24 | 0.52 |
| N | 335 | K | Disease | -0.12 | 0.00 | −0.12 |
| A | 338 | V | Disease | 0.20 | 0.07 | 0.27 |
Parameters of the residues location types in protein-protein complex.
| Location | Interface | Solvent Exposure | rSASAm | rSASAc | ΔrSASA |
|---|---|---|---|---|---|
| COR | Yes | No | >25% | <25% | >0 |
| SUP | Yes | No | <25% | <25% | >0 |
| RIM | Yes | Yes | any | >25% | >0 |
| INT | No | No | any | <25% | =0 |
| SUR | No | Yes | any | >25% | =0 |
Conditions used for calculating the number of tp, tn, fp, and fn cases for five scenarios. Here, A corresponds to the value of calculated change of the binding free energy (ΔΔΔGcalc), while B for experimentally determined change of the binding free energy (ΔΔΔGexp).
| ROC Parameters | Scenario 1 | Scenario 2 | Scenario 3 | Scenario 4 | Scenario 5 |
|---|---|---|---|---|---|
| tp | A ≥ x, B ≥ x, sign(A) = sign(B) | A ≥ y, B ≥ y, sign(A) = sign(B) | A ≥ x, B ≥ y, sign(A) = sign(B) | A ≥ x, B ≥ y, sign(A) = sign(B) | A ≥ y, B ≥ x, sign(A) = sign(B) |
| tn | A < x, B < x | A < x, B < x | A < y, B < x | A < z, B < y | A < y, B < z |
| fp | A ≥ x, B < x | A ≥ y, B < x | A ≥ y, B < x | A ≥ x, B < y | A ≥ y, B < z |
| fn | A < x, B ≥ x | A < x, B ≥ y | A < x, B ≥ y | A < z, B ≥ y | A < y, B ≥ x |
| example | 0.5 | 0.5–1 | 0.5/1.5 | 0.5/1/1.5 (ef) | 0.5/1/1.5 (cf) |
| parameter values | x = 0.5 | x = 0.5, y = 1 | x = 0.5, y = 1 | x = 0.5, y = 1, z = 1.5 | x = 0.5, y = 1, z = 1.5 |