| Literature DB >> 34257885 |
Himanshu Goel1, Anthony Hazel1, Vincent D Ustach1, Sunhwan Jo2, Wenbo Yu1, Alexander D MacKerell1,2.
Abstract
Predicting relative protein-ligand binding affinities is a central pillar of lead optimization efforts in structure-based drug design. The site identification by ligand competitive saturation (SILCS) methodology is based on functional group affinity patterns in the form of free energy maps that may be used to compute protein-ligand binding poses and affinities. Presented are results obtained from the SILCS methodology for a set of eight target proteins as reported originally in Wang et al. (J. Am. Chem. Soc., 2015, 137, 2695-2703) using free energy perturbation (FEP) methods in conjunction with enhanced sampling and cycle closure corrections. These eight targets have been subsequently studied by many other authors to compare the efficacy of their method while comparing with the outcomes of Wang et al. In this work, we present results for a total of 407 ligands on the eight targets and include specific analysis on the subset of 199 ligands considered previously. Using the SILCS methodology we can achieve an average accuracy of up to 77% and 74% when considering the eight targets with their 199 and 407 ligands, respectively, for rank-ordering ligand affinities as calculated by the percent correct metric. This accuracy increases to 82% and 80%, respectively, when the SILCS atomic free energy contributions are optimized using a Bayesian Markov-chain Monte Carlo approach. We also report other metrics including Pearson's correlation coefficient, Pearlman's predictive index, mean unsigned error, and root mean square error for both sets of ligands. The results obtained for the 199 ligands are compared with the outcomes of Wang et al. and other published works. Overall, the SILCS methodology yields similar or better-quality predictions without a priori need for known ligand orientations in terms of the different metrics when compared to current FEP approaches with significant computational savings while additionally offering quantitative estimates of individual atomic contributions to binding free energies. These results further validate the SILCS methodology as an accurate, computationally efficient tool to support lead optimization and drug discovery. This journal is © The Royal Society of Chemistry.Entities:
Year: 2021 PMID: 34257885 PMCID: PMC8246086 DOI: 10.1039/d1sc01781k
Source DB: PubMed Journal: Chem Sci ISSN: 2041-6520 Impact factor: 9.825
Fig. 1Computed versus experimental ΔG obtained from SILCS and FEP+/OPLS 2.1 with 199 ligands. “SILCS-Best PC” corresponds to results from Table 2, and “SILCS ML-Optimized” corresponds to the results from Table 3.
Average metrics for all eight proteins with 199 ligands with different ACS and ligand placement radii. MUE/RMSE values are in units of kcal mol−1. The rLP signifies the ligand placement radiia
| Protocol ( | ACS | MUE | RMSE |
| PI | PC |
|---|---|---|---|---|---|---|
| Ex ( | GAS21 | 0.911 | 1.157 | 0.227 | 0.242 | 0.583 |
| Ex ( | GAX21 | 0.943 | 1.193 | 0.266 | 0.260 | 0.587 |
| Ex ( | SS21 | 0.892 | 1.143 | 0.362 | 0.332 | 0.631 |
| Ex ( | SX21 | 0.951 | 1.175 | 0.456 | 0.439 | 0.649 |
| Ex ( | GAS21 | 0.797 | 0.994 | 0.421 | 0.431 | 0.654 |
| Ex ( | GAX21 | 0.824 | 1.034 | 0.504 | 0.517 | 0.684 |
| Ex ( | SS21 | 0.826 | 1.020 | 0.463 | 0.457 | 0.666 |
| Ex ( | SX21 | 0.841 | 1.032 | 0.588 | 0.578 | 0.708 |
| Ex ( | GAS21 | 0.801 | 1.011 | 0.455 | 0.459 | 0.660 |
| Ex ( | GAX21 | 0.828 | 1.037 | 0.533 | 0.530 | 0.683 |
| Ex ( | SS21 | 0.817 | 1.035 | 0.473 | 0.488 | 0.677 |
| Ex ( | SX21 | 0.800 | 1.014 | 0.614 | 0.640 | 0.719 |
| Ex ( | GAS21 | 0.809 | 1.007 | 0.484 | 0.481 | 0.658 |
| Ex ( | GAX21 | 0.803 | 1.019 | 0.552 | 0.563 | 0.698 |
| Ex ( | SS21 | 0.806 | 1.001 | 0.509 | 0.508 | 0.675 |
| Ex ( | SX21 | 0.887 | 1.088 | 0.532 | 0.510 | 0.672 |
Ex: exhaustive, R: Pearson's correlation coefficient, PI: predictive index, PC: percent correct.
Top percent correct scoring protocol (SILCS-Best PC) for the individual protein targets obtained from different ACS and ligand placement radii with 199 ligands. MUE/RMSE and rLP values are in units of kcal mol−1 and Å, respectively
| System | ACS |
| MUE | RMSE |
| PI | PC |
|---|---|---|---|---|---|---|---|
| P38 | SX21 | 1 | 1.368 | 1.675 | 0.631 | 0.653 | 0.742 |
| BACE | SS21 | 5 | 0.501 | 0.648 | 0.541 | 0.549 | 0.681 |
| MCL1 | GAX21 | 5 | 0.788 | 1.015 | 0.566 | 0.561 | 0.704 |
| TYK2 | SX21 | 5 | 0.874 | 1.050 | 0.593 | 0.721 | 0.775 |
| JNK1 | SS21 | 2 | 0.726 | 0.892 | 0.645 | 0.720 | 0.769 |
| Thrombin | GAX21 | 2 | 0.290 | 0.363 | 0.918 | 0.931 | 0.873 |
| CDK2 | SX21 | 1 | 0.670 | 0.874 | 0.668 | 0.690 | 0.767 |
| PTP1B | GAS21 | 5 | 0.642 | 0.910 | 0.781 | 0.863 | 0.828 |
|
|
|
|
|
|
|
Top percent correct scoring protocol (except for CDK2; see text) for the individual protein targets obtained with the PC-based ML optimization procedure from different ACS with radii 5 Å for 199 ligands. MUE/RMSE values are in units of kcal mol−1. The optimized FragMap weights can be found in ESI file si_ML-Opt_weights.xlsx
| System | ACS | MUE | RMSE |
| PI | PC |
|---|---|---|---|---|---|---|
| P38 | SX21*-5 Å | 0.921 | 1.103 | 0.744 | 0.755 | 0.781 |
| BACE | SS21*-5 Å | 0.536 | 0.753 | 0.563 | 0.673 | 0.733 |
| MCL1 | SS21*-5 Å | 0.776 | 0.944 | 0.782 | 0.790 | 0.792 |
| TYK2 | GAX21*-5 Å | 0.807 | 0.928 | 0.684 | 0.743 | 0.742 |
| JNK1 | GAS21*-5 Å | 0.676 | 0.821 | 0.706 | 0.730 | 0.812 |
| Thrombin | SX21*-5 Å | 0.509 | 0.555 | 0.915 | 0.963 | 0.945 |
| CDK2 | GAX21*-5 Å | 0.540 | 0.630 | 0.864 | 0.873 | 0.825 |
| PTP1B | SX21*-5 Å | 0.622 | 0.724 | 0.865 | 0.944 | 0.895 |
|
|
|
|
|
|
|
Comparison of the average metrics of MUE, RMSE, R, PI, and PC with 199 ligands for SILCS with respect to Wang et al. (FEP+/OPLS2.1),[55] Song et al. (AMBER/ff14SB+GAFF1.8),[56] Gapsys et al. (pmx GAFF/CGenFF/Consensus),[88] and Kuhn et al. (Flare FEP/ff14SB+GAFF2.1)[89]
| Method | MUE | RMSE |
| PI | PC |
|---|---|---|---|---|---|
| SILCS-Default SX21-5 Å | 0.800 | 1.014 | 0.614 | 0.640 | 0.719 |
| SILCS-Best PC | 0.732 | 0.928 | 0.668 | 0.711 | 0.767 |
| SILCS ML-Optimized | 0.673 | 0.807 | 0.765 | 0.809 | 0.816 |
| FEP+/OPLS 2.1 | 0.728 | 0.881 | 0.742 | 0.751 | 0.781 |
| AMBER/ff14SB+GAFF1.8 | 0.933 | 1.166 | 0.559 | 0.598 | 0.711 |
| Flare FEP/ff14SB+GAFF2.1 | 0.791 | 1.003 | 0.677 | 0.689 | 0.749 |
| pmx GAFF2.1 | 0.721 | 0.891 | 0.674 | 0.654 | 0.727 |
| pmx CGenFF | 0.835 | 1.088 | 0.562 | 0.591 | 0.716 |
| pmx Consensus | 0.740 | 0.900 | 0.637 | 0.659 | 0.738 |
Fig. 2Percentage of ligands with ΔG unsigned error values (in kcal mol−1) within the specified ranges for SILCS and other published data with 199 ligands.
Average metrics for all eight proteins with 407 ligands with different ACS and ligand placement radii. MUE/RMSE values are in units of kcal mol−1. The rLP signifies the ligand placement radii. The Ex (rLP = 5 Å) for SX21 is the SILCS-Default for 407 ligandsa
| Protocol ( | ACS | MUE | RMSE |
| PI | PC |
|---|---|---|---|---|---|---|
| Ex ( | GAS21 | 1.116 | 1.395 | 0.327 | 0.329 | 0.614 |
| Ex ( | GAX21 | 1.179 | 1.460 | 0.389 | 0.372 | 0.626 |
| Ex ( | SS21 | 1.180 | 1.458 | 0.419 | 0.409 | 0.640 |
| Ex ( | SX21 | 1.218 | 1.494 | 0.477 | 0.478 | 0.662 |
| Ex ( | GAS21 | 0.956 | 1.187 | 0.542 | 0.530 | 0.683 |
| Ex ( | GAX21 | 1.039 | 1.296 | 0.556 | 0.549 | 0.690 |
| Ex ( | SS21 | 1.080 | 1.333 | 0.501 | 0.502 | 0.679 |
| Ex ( | SX21 | 1.106 | 1.357 | 0.586 | 0.586 | 0.704 |
| Ex ( | GAS21 | 0.926 | 1.144 | 0.571 | 0.570 | 0.695 |
| Ex ( | GAX21 | 0.979 | 1.209 | 0.593 | 0.588 | 0.702 |
| Ex ( | SS21 | 0.978 | 1.220 | 0.545 | 0.527 | 0.686 |
| Ex ( | SX21 | 1.008 | 1.251 | 0.599 | 0.612 | 0.710 |
| Ex ( | GAS21 | 0.920 | 1.145 | 0.568 | 0.535 | 0.681 |
| Ex ( | GAX21 | 0.963 | 1.204 | 0.597 | 0.588 | 0.703 |
| Ex ( | SS21 | 0.960 | 1.200 | 0.565 | 0.552 | 0.691 |
| Ex ( | SX21 | 1.039 | 1.283 | 0.588 | 0.594 | 0.706 |
Ex: exhaustive, R: Pearson's correlation coefficient, PI: predictive index, PC: percent correct.
Top percent correct scoring protocol for the individual protein targets obtained from different ACS and radii for 407 ligands. MUE/RMSE and radii values are in units of kcal mol−1 and Å, respectively
| System | ACS |
| MUE | RMSE |
| PI | PC |
|---|---|---|---|---|---|---|---|
| P38 | SX21 | 1 | 1.368 | 1.665 | 0.621 | 0.644 | 0.733 |
| BACE | SS21 | 5 | 0.568 | 0.735 | 0.562 | 0.588 | 0.704 |
| MCL1 | GAS21 | 2 | 1.135 | 1.356 | 0.822 | 0.824 | 0.809 |
| TYK2 | GAS21 | 10 | 1.029 | 1.290 | 0.628 | 0.628 | 0.706 |
| JNK1 | GAX21 | 5 | 0.973 | 1.240 | 0.631 | 0.652 | 0.731 |
| Thrombin | GAX21 | 10 | 0.895 | 1.242 | 0.605 | 0.597 | 0.718 |
| CDK2 | SS21 | 1 | 0.800 | 1.009 | 0.638 | 0.664 | 0.727 |
| PTP1B | GAS21 | 5 | 0.936 | 1.139 | 0.790 | 0.816 | 0.804 |
|
|
|
|
|
|
|
Top percent correct scoring protocol for the individual protein targets obtained with the PC-based ML optimization procedure from different ACS for 407 ligands. MUE/RMSE values are in units of kcal mol−1. The optimized FragMap weights can be found in ESI file si_ML-Opt_weights.xlsx
| System | ACS | MUE | RMSE |
| PI | PC |
|---|---|---|---|---|---|---|
| P38 | SX21*-5 Å | 1.089 | 1.340 | 0.661 | 0.687 | 0.758 |
| BACE | SS21*-5 Å | 0.559 | 0.815 | 0.658 | 0.761 | 0.773 |
| MCL1 | SX21*-5 Å | 0.913 | 1.126 | 0.886 | 0.892 | 0.859 |
| TYK2 | SX21*-5 Å | 0.886 | 1.071 | 0.762 | 0.807 | 0.777 |
| JNK1 | GAX21*-5 Å | 1.076 | 1.408 | 0.731 | 0.741 | 0.782 |
| Thrombin | GAX21*-5 Å | 0.923 | 1.133 | 0.816 | 0.854 | 0.838 |
| CDK2 | GAX21*-5 Å | 0.605 | 0.759 | 0.791 | 0.814 | 0.810 |
| PTP1B | GAS21*-5 Å | 0.903 | 1.067 | 0.815 | 0.831 | 0.810 |
|
|
|
|
|
|
|
Fig. 3Computed versus experimental ΔG obtained from SILCS procedure for 407 ligands of eight protein targets. “SILCS-Best PC” corresponds to the results from Table 6, and “SILCS ML-Optimized” corresponds to the results from Table 7.
Fig. 4Examples of the atomic GFE contributions to the relative binding affinity between selected ligands of the (A) JNK1 (6f, 6g), (B) CDK2 (26, 29) and (C) BACE (13j, 4o) protein targets. Each panel presents minimum LGFE conformations of the ligand along with the FragMaps and corresponding 2D chemical structure with the individual atomic GFE scores as well as summed GFE scores for selected moieties. These GFE values are extracted from the SILCS-Best PC models. The FragMaps colors are (green) GENN or APOLAR, (red) GENA, (blue) GEND, (cyan) MAMN, and (orange) ACEO. All FragMap isocontour surfaces are displayed at a cutoff of −1.2 kcal mol−1. The cyan, blue, red, yellow, orange, pink, and white atom colors represent carbon, nitrogen, oxygen, sulfur, chlorine, fluorine, and hydrogen atoms, respectively.