| Literature DB >> 33570387 |
Christoph Öhlknecht1,2, Sonja Katz1,2, Christina Kröß2,3, Bernhard Sprenger2,3, Petra Engele2,3, Rainer Schneider3, Chris Oostenbrink1.
Abstract
Rational-design methods have proven to be a valuable toolkit in the field of protein design. Numerical approaches such as free-energy calculations or QM/MM methods are fit to widen the understanding of a protein-sequence space but require large amounts of computational time and power. Here, we apply an efficient method for free-energy calculations that combines the one-step perturbation (OSP) with the third-power-fitting (TPF) approach. It is fit to calculate full free energies of binding from three different end states only. The nonpolar contribution to the free energies are calculated for a set of chosen amino acids from a single simulation of a judiciously chosen reference state. The electrostatic contributions, on the other hand, are predicted from simulations of the neutral and charged end states of the individual amino acids. We used this method to perform in silico saturation mutagenesis of two sites in human Caspase-2. We calculated relative binding free energies toward two different substrates that differ in their P1' site and in their affinity toward the unmutated protease. Although being approximate, our approach showed very good agreement upon validation against experimental data. 76% of the predicted relative free energies of amino acid mutations was found to be true positives or true negatives. We observed that this method is fit to discriminate amino acid mutations because the rate of false negatives is very low (<1.5%). The approach works better for a substrate with medium/low affinity with a Matthews correlation coefficient (MCC) of 0.63, whereas for a substrate with very low affinity, the MCC was 0.38. In all cases, the combined TPF + OSP approach outperformed the linear interaction energy method.Entities:
Year: 2021 PMID: 33570387 PMCID: PMC8023567 DOI: 10.1021/acs.jcim.0c01216
Source DB: PubMed Journal: J Chem Inf Model ISSN: 1549-9596 Impact factor: 4.956
Figure 1Snapshot of the Casp-2 active site with visualization of the entire substrate (orange), the P1′ site (green, here: Ile) and the two sites of point mutations E105 (red) and G171 (purple). While E105 is part of the substrate-binding cavity, the G171 site is not directly interacting but part of a binding loop that spans the entire binding cavity (purple stick representation of the backbone atoms).
Figure 2Structures of the two reference states used for the OSPs. Atom D is a non-interacting dummy atom while atom A represents a soft-core particle.
Figure 3Thermodynamic cycle to calculate meaningful relative binding free energies from OSP of a reference state (R) and third-power fitting between neutral (N) and charged (C) states. Free energies along the black arrows were calculated from the simulations (OSP: R → N; TPF: N → C). Free energies along the blue arrows are binding free energies between the Caspase-2 (Casp) and its substrate (sbstr). The free-energy differences along the green arrows can be calculated from the differences between the cyan and magenta arrows. Relative free energies between the mutants (A,B) of the protein can be calculated from the difference between the two green arrows. The free-energy difference between the two green arrows (non-physical path) is the same as between the two blue arrows (physical path).
Free Energies from One-step Perturbation for Both Reference States in the E105 Sitea
| P1′ | |||||
|---|---|---|---|---|---|
| Ala | 89.7 | –16.2 | 28.52 | 1.0 | |
| Arg | 0.52 | 110.2 | 16.15 | –1.9 | |
| Asn | 26.99 | –5.1 | 11.18 | 3.4 | |
| Asp | 30.38 | –4.2 | 11.77 | –4.6 | |
| Cys | 0.89 | –10.5 | 2.84 | –16.3 | |
| Gly | 2.11 | –1.8 | 1.66 | –6.9 | |
| Gln | 1.08 | 6.0 | 5.81 | 6.6 | |
| Glu | 17.99 | –6.0 | 17.97 | –1.2 | |
| HisA | 1.23 | –24.3 | 6.03 | –8.8 | |
| HisB | 0.84 | –4.2 | 5.51 | –7.0 | |
| Ile | 2.75 | –14.7 | 3.00 | –1.2 | |
| Leu | 4.97 | –7.0 | 1.02 | 2.2 | |
| Lys | 0.25 | –5.8 | 1.45 | 0.3 | |
| Met | 0.59 | –20.7 | 5.18 | 2.4 | |
| Phe | 0.35 | 44.5 | 0.98 | 10.2 | |
| Ser | 65.53 | 3.7 | 16.27 | –12.6 | |
| Thr | 26.89 | –2.2 | 16.59 | –2.6 | |
| Trp | 0.45 | –79.2 | 0.30 | 67.2 | |
| Tyr | 0.56 | –9.1 | 0.37 | 6.9 | |
| Val | 11.52 | –1.2 | 8.14 | –1.2 | |
| Ala | 76.64 | –9.9 | 25.70 | –8.5 | |
| Arg | 0.60 | 162.3 | 18.84 | –20.0 | |
| Asn | 8.47 | –0.3 | 10.91 | –11.8 | |
| Asp | 10.44 | 2.7 | 8.06 | 4.0 | |
| Cys | 0.78 | 4.0 | 6.13 | 0.6 | |
| Gly | 1.74 | 0.1 | 0.99 | 6.5 | |
| Gln | 0.45 | 4.3 | 6.03 | 14.6 | |
| Glu | 7.49 | 7.7 | 19.62 | 5.6 | |
| HisA | 5.28 | 15.0 | 17.22 | –5.1 | |
| HisB | 16.48 | –2.0 | 19.60 | –8.4 | |
| Ile | 0.78 | –7.2 | 3.13 | 3.0 | |
| Leu | 4.37 | –1.4 | 0.67 | 16.2 | |
| Lys | 0.67 | –69.4 | 8.21 | –4.1 | |
| Met | 1.00 | –16.1 | 7.58 | –15.2 | |
| Phe | 0.39 | 36.0 | 8.43 | –5.1 | |
| Ser | 31.17 | 2.8 | 10.00 | 8.0 | |
| Thr | 14.64 | 6.2 | 3.22 | 3.3 | |
| Trp | 0.47 | –187.3 | 4.55 | –21.7 | |
| Tyr | 0.45 | 485.5 | 1.91 | 12.0 | |
| Val | 6.51 | 2.3 | 13.28 | –3.1 |
Free energies are reported for two P1′ amino acids, Ile and Pro. Next to the ΔΔG values (reported in kJ/mol), the percentage of contributing frames are reported as an average over both bound and unbound states. HisA: Nδ1-H tautomer; HisB: Nϵ2–H tautomer.
Free Energies from One-step Perturbation for Both Reference States in the G171 Sitea
| P1′ | |||||
|---|---|---|---|---|---|
| Ala | 64.84 | –10.6 | 4.78 | –6.4 | |
| Arg | 0.31 | 5.2 | 2.11 | –2.0 | |
| Asn | 28.96 | 4.0 | 11.41 | 1.5 | |
| Asp | 26.68 | –12.9 | 12.18 | –5.1 | |
| Cys | 1.13 | –4.3 | 1.89 | 2.3 | |
| Gly | 8.45 | 3.8 | 4.24 | 0.7 | |
| Gln | 4.51 | –13.0 | 11.45 | –7.8 | |
| Glu | 5.20 | –10.4 | 13.13 | –8.6 | |
| HisA | 2.29 | 5.4 | 10.22 | 2.5 | |
| HisB | 2.27 | 3.0 | 10.28 | 2.7 | |
| Ile | 3.36 | 1.9 | 3.76 | 1.6 | |
| Leu | 10.75 | 4.0 | 6.06 | –1.2 | |
| Lys | 1.29 | –3.4 | 8.88 | –2.2 | |
| Met | 3.46 | 3.2 | 11.04 | 0.8 | |
| Phe | 0.42 | 7.9 | 3.93 | 3.7 | |
| Ser | 53.00 | –0.05 | 8.85 | 2.3 | |
| Thr | 31.76 | 1.3 | 6.20 | 1.8 | |
| Trp | 0.22 | 0.3 | 0.68 | 1.3 | |
| Tyr | 0.32 | 10.2 | 2.44 | 17.2 | |
| Val | 10.02 | –4.9 | 5.58 | 1.5 | |
| Ala | 67.34 | 13.8 | 41.12 | 28.6 | |
| Arg | 0.33 | –3.3 | 2.79 | 4.5 | |
| Asn | 38.82 | 1.0 | 34.36 | 19.6 | |
| Asp | 44.67 | 2.4 | 32.47 | 6.2 | |
| Cys | 48.66 | 4.7 | 7.22 | 23.5 | |
| Gly | 2.34 | 2.4 | 1.77 | 2.9 | |
| Gln | 26.00 | 1.0 | 8.23 | 12.6 | |
| Glu | 25.12 | –2.9 | 12.90 | 2.9 | |
| HisA | 6.08 | –1.4 | 17.49 | 12.7 | |
| HisB | 7.10 | 1.2 | 21.91 | 13.6 | |
| Ile | 11.04 | –7.6 | 3.36 | 11.6 | |
| Leu | 25.86 | –5.4 | 8.82 | 10.3 | |
| Lys | 2.55 | –6.9 | 12.12 | 10.8 | |
| Met | 8.28 | 2.0 | 19.70 | 12.4 | |
| Phe | 1.18 | –17.4 | 8.95 | 8.9 | |
| Ser | 58.28 | 7.7 | 36.09 | 25.4 | |
| Thr | 39.23 | 1.2 | 13.80 | 19.4 | |
| Trp | 0.26 | –26.5 | 4.64 | 6.0 | |
| Tyr | 0.46 | –14.8 | 4.86 | 4.5 | |
| Val | 34.84 | –4.7 | 6.79 | 17.0 |
Free energies are reported for two P1′ amino acids, Ile and Pro. Next to the ΔΔG values (reported in kJ/mol), the percentage of contributing frames are reported as an average over both bound and apo states.HisA: Nδ1–H tautomer; HisB: Nϵ2–H tautomer.
Correction Terms for Free Energies of Charging, ΔGpol + ΔGdir, ΔGdsm and their Sum for the Four Charged Amino Acids Arg, Asp, Glu, and Lysa
| P1′ | Δ | Δ | ||||||
|---|---|---|---|---|---|---|---|---|
| Arg | 3.3 | –17.3 | –14.0 | 13.0 | –24.6 | –11.6 | –2.4 | |
| Asp | 4.1 | 14.8 | 18.9 | –26.7 | 17.3 | –9.4 | 28.3 | |
| Glu | 11.9 | 14.3 | 26.2 | –19.4 | 16.6 | –2.8 | 29.0 | |
| Lys | 7.0 | –17.1 | –10.1 | 15.6 | –18.2 | –2.6 | –7.5 | |
| Arg | –1.2 | –18.6 | –19.8 | 9.9 | –22.7 | –12.8 | –7.1 | |
| Asp | 4.9 | 16.5 | 21.4 | –14.2 | 16.7 | 2.5 | 18.9 | |
| Glu | 4.5 | 14.0 | 18.5 | –23.7 | 16.6 | –7.1 | 25.6 | |
| Lys | 1.4 | –18.6 | –17.2 | 15.2 | –19.7 | –4.5 | –12.7 | |
| Arg | –7.5 | –53.7 | –61.2 | –5.5 | –54.6 | –60.1 | –1.1 | |
| Asp | 6.8 | 57.5 | 64.3 | 6.2 | 57.0 | 63.2 | 1.1 | |
| Glu | 5.2 | 58.0 | 63.1 | 6.1 | 56.8 | 62.9 | 0.2 | |
| Lys | –6.7 | –52.4 | –59.1 | –4.9 | –51.9 | –56.8 | –2.3 | |
| Arg | –7.7 | 52.5 | 60.2 | –5.6 | –54.2 | –59.8 | –0.4 | |
| Asp | 5.2 | 56.9 | 63.9 | 5.6 | 57.4 | 63.1 | 0.8 | |
| Glu | 6.2 | 56.9 | 63.1 | 3.5 | 58.0 | 61.5 | 1.6 | |
| Lys | –3.9 | –54.4 | –58.2 | –1.0 | –56.0 | –57.0 | –1.2 | |
The corresponding correction terms were calculated for both mutation sites, E105 and G171, as well as both P1′ amino acids, Ile and Pro. The total correction for the TPF results was deduced from the corrections in the bound and the unbound states of the protease. All values are reported in kJ/mol.
Results From the TPF Calculations for the Four Charged Amino Acids Arg, Asp, Glu, and Lysa
| P1′ | Δ | Δ | Δ | |
|---|---|---|---|---|
| Arg | 10.2 | –2.4 | 7.9 | |
| Asp | –19.4 | 28.3 | 8.9 | |
| Glu | –19.3 | 29.0 | 9.7 | |
| Lys | 4.4 | –7.5 | –3.2 | |
| Arg | 9.3 | –7.1 | 2.2 | |
| Asp | –23.4 | 18.9 | –4.5 | |
| Glu | –42.0 | 25.6 | –16.4 | |
| Lys | 29.8 | –12.7 | 17.1 | |
| Arg | –0.3 | –1.1 | –1.4 | |
| Asp | 5.3 | 1.1 | 6.4 | |
| Glu | –0.8 | 0.2 | –0.6 | |
| Lys | 0.7 | –2.3 | –1.6 | |
| Arg | 4.2 | –0.41 | 3.8 | |
| Asp | –4.7 | 0.85 | –3.9 | |
| Glu | –4.6 | 1.63 | –2.9 | |
| Lys | –1.3 | –1.21 | –2.5 | |
The raw results from the TPF analysis were corrected for the spurious free energies of charging to yield correct TPF results. All values are reported in kJ/mol.
Free Energies from OSP, TPF, and their Sum to Yield Final Free Energies of Binding Relative to the Reference State (ΔΔGbind).a
| mut | OSP | TPF | ΔΔ | OSP | TPF | ΔΔ |
|---|---|---|---|---|---|---|
| Ala | –16.2 | –16.2 | –9.9 | –9.9 | ||
| Arg | –1.9 | 7.8 | 6.0 | –20.0 | 2.2 | –17.8 |
| Asn | –5.1 | –5.3 | –10.4 | –11.8 | –8.0 | –19.8 |
| Asp | –4.2 | 8.9 | 4.8 | 2.7 | –4.5 | –1.8 |
| Cys | –16.3 | 11.0 | –5.4 | 0.6 | –15.9 | –15.3 |
| Gly | –1.8 | –1.8 | 0.1 | 0.1 | ||
| Gln | 6.6 | –3.6 | 3.0 | 14.6 | –5.8 | 8.7 |
| Glu (RDR) | –6.0 | 9.7 | 3.7 | 7.7 | –16.4 | –8.7 |
| Glu (R2R) | –1.2 | 9.7 | 8.5 | 5.6 | –16.4 | –3.1 |
| HisA | –8.8 | 2.6 | –6.2 | –5.1 | 0.9 | –4.2 |
| HisB | –7.0 | –1.0 | –8.0 | –8.4 | 1.7 | –6.7 |
| Ile | –1.2 | –1.2 | 3.0 | 3.0 | ||
| Leu | –7.0 | –7.0 | –1.4 | –1.4 | ||
| Lys | 0.3 | –3.2 | –2.9 | –4.1 | 17.1 | 13.0 |
| Met | 2.4 | –0.2 | 2.3 | –15.2 | –0.1 | –15.4 |
| Phe | 10.2 | –0.3 | 9.9 | –5.1 | 7.4 | 2.2 |
| Ser | –12.6 | –1.1 | –13.7 | 2.8 | 1.4 | 4.2 |
| Thr | –2.6 | –0.9 | –3.5 | 6.2 | –3.4 | 2.9 |
| Trp | –79.2 | 3.4 | –75.8 | –21.7 | 0.4 | –21.3 |
| Tyr | –9.1 | 3.4 | –5.7 | 12.0 | –6.1 | 5.9 |
| Val | –1.2 | –1.2 | –3.1 | –3.1 | ||
| mut | OSP | TPF | ΔΔ | OSP | TPF | ΔΔ |
| Ala | –10.6 | –10.6 | 13.8 | 13.8 | ||
| Arg | –2.1 | –1.4 | –3.4 | 4.1 | 3.8 | 7.9 |
| Asn | 4.0 | –1.0 | 3.0 | 1.0 | –0.3 | 0.6 |
| Asp | –12.9 | 6.4 | –6.5 | 2.4 | –3.9 | –1.5 |
| Cys | 2.3 | 3.8 | 6.1 | 4.7 | –1.7 | 3.0 |
| Gly (RDR) | 3.8 | 3.8 | 2.4 | 2.4 | ||
| Gly (R2R) | 0.7 | 0.7 | 2.9 | 2.9 | ||
| Gln | –7.8 | 0.1 | –7.7 | 1.0 | –0.8 | 0.2 |
| Glu | –8.6 | –0.6 | –9.3 | –2.9 | –2.9 | –5.8 |
| HisA | 2.6 | –0.9 | 1.7 | –1.4 | –0.1 | –1.5 |
| HisB | 2.7 | 1.5 | 4.1 | 13.6 | –0.5 | 13.0 |
| Ile | 1.6 | 1.6 | –7.6 | –7.6 | ||
| Leu | 4.0 | 4.0 | –5.4 | –5.4 | ||
| Lys | –2.2 | –1.6 | –3.8 | 10.8 | –2.5 | 8.3 |
| Met | 0.8 | 0.7 | 1.4 | 12.4 | –1.2 | 11.3 |
| Phe | 3.7 | –0.1 | 3.6 | 8.9 | 0.8 | 9.6 |
| Ser | –0.1 | 0.1 | 0.7 | 7.7 | 1.0 | 8.7 |
| Thr | 1.3 | –4.1 | –2.8 | 1.2 | 1.4 | 2.5 |
| Trp | 1.3 | –1.7 | –0.5 | 6.0 | 0.7 | 6.7 |
| Tyr | 17.2 | 0.2 | 17.4 | 4.5 | –1.9 | 2.7 |
| Val | –4.9 | –4.9 | –4.7 | –4.7 | ||
All results are reported in kJ/mol for both mutation sites and both P1′ amino acids. HisA: Nδ1–H tautomer; HisB: Nϵ2–H tautomer.
Comparison of the Final Results from Simulation and Experiment for Both Sites of Point Mutation and Different P1′ Amino Acidsa
| sim | exp | sim | exp | Sim | exp | sim | exp | |
|---|---|---|---|---|---|---|---|---|
| Ala | –19.9 | √ | –1.2 | √ | –14.3 | √ | 11.5 | √ |
| Arg | –2.5 | √ | –14.7 | –4.1 | √ | 4.9 | √ | |
| Asn | –14.1 | √ | –16.7 | –0.8 | –1.8 | √ | ||
| Asp | 1.1 | 6.9 | –10.3 | –3.9 | √ | |||
| Cys | –13.8 | √ | –12.2 | 5.4 | √ | 0.7 | √ | |
| Gly | –1.8 | √ | 0.1 | 0 | 0 | |||
| Gln | –5.5 | √ | 11.8 | –8.4 | –2.1 | √ | ||
| Glu | 0 | √ | 0 | –9.9 | √ | –8.2 | √ | |
| HisA | –14.7 | √ | –1.1 | 0.9 | –4.5 | |||
| HisB | –16.5 | √ | –3.6 | 3.4 | 10.9 | |||
| Ile | –9.7 | √ | 6.1 | 0.9 | –10.0 | |||
| Leu | –10.7 | √ | 7.2 | 0.3 | –7.8 | |||
| Lys | –11.4 | 16.1 | –4.5 | √ | 5.3 | |||
| Met | –6.2 | √ | –12.3 | √ | 0.8 | 8.3 | ||
| Phe | 1.5 | 5.3 | 3.0 | 6.7 | ||||
| Ser | –22.2 | √ | 12.9 | –3.7 | √ | 6.3 | ||
| Thr | –11.9 | √ | 11.5 | –6.6 | 0.2 | |||
| Trp | –79.5 | √ | –18.2 | –1.2 | √ | 3.8 | ||
| Tyr | –9.4 | √ | 9.0 | 16.8 | –0.3 | |||
| Val | –9.7 | √ | 0.0 | √ | –8.7 | √ | –7.1 | √ |
Relative substrate binding free energies (“sim” column) are reported for the mutations Glu105Xxx and Gly171Xxx (relative to the unmated protein). In the columns with the experimental results, check marks/horizontal bars mark mutations that were found/not found in the screening. All values are reported in kJ/mol. HisA: Nδ1–H tautomer; HisB: Nϵ2–H tautomer.
Experimental Findings (Mutation Found/Not Found) and Results from Simulations (Mutation Favorable/Unfavorable) Represented in a Cross Tablea
| found (%) | not found (%) | |
|---|---|---|
| fav | 61.1 | 13.8 |
| unfav | 2.8 | 22.2 |
| fav | 22.2 | 22.2 |
| unfav | 8.3 | 47.3 |
| fav | 41.7 | 18.1 |
| Unfav | 5.6 | 34.7 |
The numbers were calculated for all results from both mutation sites and P1′ amino acids.
Relative Binding Free Energies From the LIE Method and the Combined OSP + TPF Results in Comparison Against Experimental Findingsa
| mut | LIE | OSP + TPF | exp | LIE | OSP + TPF | exp |
|---|---|---|---|---|---|---|
| Arg | 1.4 | –2.5 | √ | –18.5 | –14.7 | |
| Asn | –12.3 | –14.1 | √ | –20.4 | –16.7 | |
| Asp | 34.0 | 1.1 | –47.1 | 6.9 | ||
| Cys | 5.1 | –13.8 | √ | –11.8 | –12.2 | |
| Gln | –8.9 | –5.5 | √ | –30.9 | 11.8 | |
| Glu | 0.0 | 0.0 | √ | 0.0 | 0.0 | |
| His | –1.2 | –15.6 | √ | –23.2 | –2.4 | |
| Lys | –20.6 | –11.4 | –11.0 | 16.1 | ||
| Met | –0.9 | –6.2 | √ | –24.8 | –12.3 | √ |
| Phe | –6.9 | 1.5 | –29.7 | 5.3 | ||
| Ser | –10.5 | –22.2 | √ | –22.9 | 12.9 | |
| Thr | –10.2 | –11.9 | √ | –19.0 | 11.5 | |
| Trp | –15.8 | –79.5 | √ | –30.5 | –18.2 | |
| Tyr | 2.8 | –9.4 | √ | –20.3 | 9.0 | |
| MAD | 13.3 | 22.0 | ||||
| R | 0.41 | –0.06 | ||||
| mut | LIE | OSP + TPF | exp | LIE | OSP + TPF | exp |
| Arg | 4.8 | –4.1 | √ | –2.2 | 4.9 | √ |
| Asn | –0.1 | –0.8 | –4.4 | –1.8 | √ | |
| Asp | 4.2 | –10.3 | –0.5 | –3.9 | √ | |
| Cys | 8.8 | 5.4 | √ | 7.1 | 0.7 | √ |
| Gln | –0.2 | –8.4 | 2.4 | –2.1 | √ | |
| Glu | –0.3 | –9.9 | √ | –0.1 | –8.2 | √ |
| His | 0.4 | 2.2 | 3.8 | 3.2 | ||
| Lys | –4.1 | –4.5 | √ | –2.8 | 5.3 | |
| Met | 1.6 | 0.8 | –14.2 | 8.3 | ||
| Phe | –0.1 | 3.0 | 4.2 | 6.7 | ||
| Ser | 3.4 | –3.7 | √ | –2.1 | 6.3 | |
| Thr | –6.7 | –6.6 | 4.6 | 0.2 | ||
| Trp | –1.8 | –1.2 | √ | 0.8 | 3.8 | |
| Tyr | –0.2 | 16.8 | 1.6 | –0.3 | ||
| MAD | 5.5 | 6.0 | ||||
| R | 0.15 | –0.31 | ||||
All free energies are reported in kJ/mol for both mutation sites and both P1′ amino acids. MAD: mean average deviation between LIE and OSP + TPF approach; r: Pearson correlation coefficient between LIE and OSP + TPF approach.
MCC to Compare the Combined OSP + TPF Method Against the LIE Approach for the 14 Compounds in Table a
| site | P1′ | LIE | OSP + TPF |
|---|---|---|---|
| E105 | 0.03 | 0.78 | |
| E105 | 0.00 | 0.31 | |
| G171 | 0.00 | 0.34 | |
| G171 | 0.29 | 0.56 |
For each combination of mutation site and P1′, OSP + TPF outperformed LIE.