| Literature DB >> 27173910 |
Daniel F A R Dourado1, Samuel Coulbourn Flores1.
Abstract
It is possible to accurately and economically predict change in protein-protein interaction energy upon mutation (ΔΔG), when a high-resolution structure of the complex is available. This is of growing usefulness for design of high-affinity or otherwise modified binding proteins for therapeutic, diagnostic, industrial, and basic science applications. Recently the field has begun to pursue ΔΔG prediction for homology modeled complexes, but so far this has worked mostly for cases of high sequence identity. If the interacting proteins have been crystallized in free (uncomplexed) form, in a majority of cases it is possible to find a structurally similar complex which can be used as the basis for template-based modeling. We describe how to use MMB to create such models, and then use them to predict ΔΔG, using a dataset consisting of free target structures, co-crystallized template complexes with sequence identify with respect to the targets as low as 44%, and experimental ΔΔG measurements. We obtain similar results by fitting to a low-resolution Cryo-EM density map. Results suggest that other structural constraints may lead to a similar outcome, making the method even more broadly applicable.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27173910 PMCID: PMC4865953 DOI: 10.1038/srep25406
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Validation dataset.
| IgG1-K D44.1 FAB (1MLB) | Hen egg-white lysozyme (1DPX) | 1MLC | IgG1-K D44.1 FAB/Hen egg-white lysozyme | 1MLC:A,B/1MLB:A=100% 1MLC:E/1DPX:A=100% | global = 0.98 chain A = 0.96 chain B = 1.10 chain E = 0.75 | 16(26) |
| IgG1-K D1.3 FV (1VFA) | Hen egg-white lysozyme (1DPX) | 1VFB | IGG1-KAPPA D1.3 FV/Hen egg-white lysozyme | 1VFB:A/1VFA:A=100% 1VFB:C/1DPX:A=100% | global = 0.80 chain A = 0.33 chain B = 0.68 chain C = 1.19 | 42(56) |
| HyHEL-63 FAB (1DQM) | Hen egg-white lysozyme (1DPX) | 1DQJ | HyHEL-63 FAB/Hen egg-white lysozyme | 1DQJ:A/1DQM:A=100% 1DQJ:C/1DPX:A=100% | global = 0.80 chain A = 0.69 chain B = 0.91 chain C = 0.78 | 34(47) |
| TGF-β Type II Receptor (1M9Z) | TGF-β 3 (1TGJ) | 1KTZ | TGF-β Type II Receptor/TGF-β 3 | 1KTZ:A/1TGJ:A=100% 1KTZ:A/1M9Z:A=98.2% | global = 22.84 global (without 40–80) = 0.88 chain A = 21.79 chain A(without 40–80) = 1.21 chain B = 0.63 | 27(27) |
| β-lactamase inhibitor protein-I (3GMU) | TEM-1 β-lactamase (1ZG4) | 1JTG | β-lactamase inhibitor protein-I/TEM-1 β-lactamase | 1JTG:A/:3GMU:A=100% 1JTG:B/1ZG4:A=98.2% | global = 3.42 chain A = 1.94 chain B = 0.64 | 143(307) |
| IgG1 (4DZ8) | Fragment B of protein A (2JWD) | 1FC2 | IgG1/Fragment B of protein A | 1FC2:C/2JWD:A=93.1% 1FC2:D/4DZ8:A=96.4% | global = 1.67 chain C = 0.27 chain D = 1.35 | 9(9) |
| Iso-1-cytochrome C (1NMI) | Cytochrome C peroxidase (3R99) | 2PCC | Iso-1-Cytochrome C/Cytochrome C peroxidase | 2PCC:A/3R99:A=99.3% 2PCC:B/1NMI:A=99.1% | global = 1.21 chain A = 0.40 chain B = 2.18 | 12(18) |
| IgG1 (3DNK) | FcγR II (3RY4) | 3RY6 | IgG1/FcγR II | 3RY6:C/3RY4:A=97.1% 3RY6:A/3DNK:A=98.8% | global = 2.72 chain A = 1.90 chain B = 3.62 chain C = 2.22 | 65(138) |
| IgG1 (3DNK) | FcγR III (1E4J) | 1E4K | IgG1/FcγR III | 1E4K:C/1E4J:A =100% 1E4K:A/3DNK:A=97.2% | global = 1.99 chain A = 1.59 chain B = 2.55 chain C = 1.59 | 95(155) |
| IgG1 (3DNK) | FcγR N (4N0F) | 4N0U | IgG1/FcγR N | 4N0U:A/4N0F:A=100% 4N0U:A/3DNK:A=97.1% | global = 0.71 chain A = 0.31 chain E = 1.01 | 53(53) |
| IgG1 (1E4K) | FcγR I (3RJD) | 1E4K(IgG1/FcγR III) | IgG1/FcγR I | 1E4K:C/3RJD:A=45.2% | global = 2.76 chain A = 2.80 chain B = 2.34 chain C = 2.80 | 66(146) |
| IgG1 (3DNK) | FcγR II (3RY4) | 1E4K(IgG1/FcγR III) | Ig1/FcγR II | 1E4K:C/3RY4:A=44.0% 1E4K:A/3DNK:A=97.2% | global = 3.61 chain A = 3.10 chain B = 4.23 chain C = 3.36 | 65(138) |
| IgG1 (3DNK) | FcγR III (1E4J) | 3RY6(IgG1/FcγR II) | IgG1/FcγR III | 3RY6:C/1E4J:A=50.9% 3RY6:A/3DNK:A=98.8% | global = 4.86 chain A = 4.79 chain B = 4.61 chain C = 4.88 | 95(155) |
| IgG1 (3DNK) | FcγR I (3RJD) | 1E4K(IgG1/FcγR III) | IgG1/FcγR I | 1E4K:C/3RJD:A=45.2% 1E4K:A/3DNK:A=97.2% | global = 3.68 chain A = 3.03 chain B = 2.52 chain C = 4.80 | 62(128) |
| IgG1 (3DNK) | FcγR I (3RJD) | Density map from 1E4K | IgG1/FcγR I | 1E4K:C/3RJD:A=45.2% 1E4K:A/3DNK:A=97.2% | global = 3.37 chain A = 2.71 chain B = 2.56 chain C = 4.30 | 62(128) |
| TOTAL | 846(1531) |
The dataset is divided in two groups. The first is composed of double-free template models, based on self-templates. The second group (bottom of table, separated by a blank row) includes : 1) a single-free template model of IgG1/Fcγ R I, based on IgG1/Fcγ R III crystal, where the structure of IgG1 from the crystallographic complex is kept rather than being replaced; 2) double-free template models of IgG1/Fcγ R I, IgG1/Fcγ R II and IgG1/Fcγ R III, which were modeled from based on homologous templates; 3) A model of IgG1/FcγR I built by fitting to a low-resolution density map synthesized from an IgG1/FcγR III crystallographic complex21 Sequence identity and backbone RMSD of targets vs. templates are shown.
Comparison between Foldx-only and ZEMu performance.
| Dataset | Number of mutants | Models | |||
|---|---|---|---|---|---|
| All mutants | 846 | 1.83 | 0.12 | 1.54 | 0.34 |
| Multiple mutants | 584 | 1.79 | 0.15 | 1.54 | 0.37 |
| Single mutants | 262 | 1.89 | 0.06 | 1.54 | 0.24 |
Figure 1ZEMu versus experimental ΔΔG over the full dataset (846 mutants).
Comparing the performance of ZEMu on modeled vs. crystallographic complexes.
| Number of mutants | Double-free Models | Co-crystals | ||||
|---|---|---|---|---|---|---|
| FoldX-only | ZEMu | ZEMu | ||||
| RMSE (kcal/mol) | Correlation | RMSE (kcal/mol) | Correlation | RMSE (kcal/mol) | Correlation | |
| 558 | 2.00 | 0.17 | 1.76 | 0.38 | 1.58 | 0.61 |
Note that ZEMu performance decreased only moderately (1.76 vs. 1.58 RMSE) for modeled vs. crystallographic complexes.
Figure 2Results of flexibilizing a spatial neighbourhood of the mutation sites for a dataset composed of 687 mutants (not includes the template models of FcγRIII/IgG1 and FcγRII/IgG1 based on a homologous co-crystal complex).
Ordinary ZEMu has only five flexible residues about each mutation site flexibilized. Here we also flexibilize all residues within a radius (0–14 Å) of the mutation sites. Radius of 0 Å corresponds to ordinary ZEMu.
Comparison between Foldx-only and ZEMu performance for FcγRI/IgG1 single-free and double-free template-based and fitted models.
| MODEL | Number of mutants | FoldX-only | ZEMu | ||
|---|---|---|---|---|---|
| RMSE (kcal/mol) | Correlation | RMSE (kcal/mol) | Correlation | ||
| Single-free template-based model | 66 | 2.33 | 0.12 | 0.92 | 0.42 |
| Double-free template-based model | 62 | 0.64 | 0.41 | 0.57 | 0.49 |
| Fitted model | 62 | 0.49 | 0.47 | 0.47 | 0.50 |
Figure 3Creating complexes with MMB template modeling and ICFF.
As an example, we create an FcγRI-IgG1 model based on the experimentally observed FcγRIII-IgG1 complex. (A) A double-free template model is created as follows. We rigidify all chains. The two chains comprising the IgG1 Fc are constrained to each other, for the free (from PDB ID: 3DNK) structure. For the template (1E4K), all chains are constrained to ground. Springs connect the binding domain (D2) of the free FcγRI (3RJD) to the D2 domain of the template FcγRIII (1E4K). The springs connect residues which correspond based on a gapped sequence alignment. Similarly, springs connect the two CH2 domains of the free to the CH2 domains of the template IgG1. Once the free are superimposed on the template proteins, both of the template proteins are deleted. The complex is then declashed (see text), leaving the modeled complex ready for ΔΔG evaluation. The models in our main dataset were prepared in this way, except for the two complexes highlighted in Table 1. (B) A single-free template model is prepared as above, except that we have only one target (FcγRI) and the IgG1 is retained from the template rather than being deleted. (C) A fitted model is created using ICFF as described in21. We require an electron density map, which may be at low resolution (here we created a simulated map at 10 Å resolution from 1E4K using MDFF). We rigidly and approximately fit both protein structures into the available density map. Then, we allow flexibility in hinge residues (to enable domain motions) as well as certain residues in the protein-protein interface of interest (which would otherwise clash). We activate an MD force field in the spatial neighborhood of the flexible residues. We then turn on forces which push atoms along the density gradient, until the flexible fitting has converged21. Note significant domain motions occurred during the fitting process. Fitting is on the basis of all domains which are present in the low-resolution density map (i.e. D3 is not fitted).