| Literature DB >> 32252223 |
Shuheng Huang1,2, Linxin Chen1,2, Hu Mei1,2, Duo Zhang2, Tingting Shi2, Zuyin Kuang2, Yu Heng2, Lei Xu2, Xianchao Pan2,3.
Abstract
Accumulated evidence suggests that binding kinetic properties-especially dissociation rate constant or drug-target residence time-are crucial factors affecting drug potency. However, quantitative prediction of kinetic properties has always been a challenging task in drug discovery. In this study, the VolSurf method was successfully applied to quantitatively predict the koff values of the small ligands of heat shock protein 90α (HSP90α), adenosine receptor (AR) and p38 mitogen-activated protein kinase (p38 MAPK). The results showed that few VolSurf descriptors can efficiently capture the key ligand surface properties related to dissociation rate; the resulting models demonstrated to be extremely simple, robust and predictive in comparison with available prediction methods. Therefore, it can be concluded that the VolSurf-based prediction method can be widely applied in the ligand-receptor binding kinetics and de novo drug design researches.Entities:
Keywords: Partial Least Squares; VolSurf; dissociation rate constant; prediction
Mesh:
Substances:
Year: 2020 PMID: 32252223 PMCID: PMC7177943 DOI: 10.3390/ijms21072456
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
The partial least squares (PLS) modeling results of the dissociation rate constants of the Hsp90 inhibitors.
| Model | No. of Variables | Variables Involved in Sequence | No. of PCs |
|
| RMSE | MAPE |
| RMSEP | Equation |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | V-OH2 | 1 | 0.635 | 0.626 | 0.435 | 0.167 | 0.716 | 0.409 | Y = 1.0727X |
| 2 | 2 | D8-DRY | 2 | 0.726 | 0.688 | 0.377 | 0.145 | 0.718 | 0.404 | Y = 1.0685X |
| 3 | 3 | W3-N3+ | 2 | 0.763 | 0.717 | 0.350 | 0.138 | 0.710 | 0.412 | Y = 1.073X |
| 4 | 4 | Emin1-OH2 | 2 | 0.771 | 0.709 | 0.345 | 0.128 | 0.744 | 0.393 | Y = 1.0753X |
| 5 | 5 | D4-DRY | 2 | 0.768 | 0.707 | 0.346 | 0.125 | 0.758 | 0.387 | Y = 1.0774X |
| 6 | 6 | A | 2 | 0.788 | 0.695 | 0.331 | 0.161 | 0.766 | 0.399 | Y = 1.0941X |
| 7 | 7 | IW8-OH2 | 2 | 0.815 | 0.726 | 0.310 | 0.186 | 0.780 | 0.393 | Y = 1.0968X |
| 8 | 8 | W4-N:= | 2 | 0.818 | 0.730 | 0.307 | 0.344 | 0.778 | 0.407 | Y = 1.1073X |
| 9 | 9 | D13-DRY | 2 | 0.819 | 0.658 | 0.306 | 0.143 | 0.730 | 0.437 | Y = 1.1101X |
Optimal PLS model with two descriptors; V-OH2: molecular volume given as the water solvent excluded volume (Å3); D8-DRY: hydrophobic regions generated by the hydrophobic probe at energy level of −1.6 kcal/mol; W3−N3+: hydrophilic regions generated by the sp3 NH3 probe at energy level of −1.0 kcal/mol; Emin1-OH2: local interaction energy minima between the H2O probe and the target molecule; D4-DRY: hydrophobic regions generated by the hydrophobic probe at energy level of −0.8 kcal/mol; A: Amphiphilic moment, defined as a vector pointing from the center of the hydrophobic domain to the center of the hydrophilic domain; IW8-OH2: integy moments generated by the water probe at energy level of −6.0 kcal/mol, represent the unbalance between the center of mass of a molecule and the position of the hydrophilic regions around it; W4-N:=: hydrophilic regions generated by the sp2 N probe at energy level of −2.0 kcal/mol; D13-DRY: hydrophobic local interaction energy minima distances generated by the hydrophobic probe; 5-fold cross validation; RMSE: Root- mean-square error of prediction for training samples; MAPE: Mean absolute percentage error for training samples; RMSEP: RMSE for validation samples.
Figure 1PLS modeling results of the dissociation rate constants of 52 Hsp90 inhibitors. (a) scatter plot of experimental vs. predicted −log(k) of 35 training samples; (b) scatter plot of experimental vs. predicted −log(k) of 17 validation samples; (c) first two principle component scores. The color legend represents the range of −log(k) values; (d) loading plot of the first two principal components.
Figure 2VolSurf properties of representative samples with different molecular skeletons. (a) 1b and 1i; (b) 5× and 5h. The hydrophobic regions at −1.6 kcal/mol energy level; red vectors represent the integy moments joining the center of mass of the molecule to the barycenter of the hydrophobic regions.
Figure 3Results of PLS model validation. (a) R2 and R distributions of 1000-times repeated PLS modeling; (b) 500-times Y random permutation test; (c) scatter plot of experimental vs. predicted −log(k) of 49 independent test samples.
Performance comparison among VolSurf, τ-random acceleration molecular dynamics (τRAMD) and COMBINE models.
| Model | Need to Consider Receptors? | No. of Variables | Total Sample Size | Training/Validation/Test Samples |
|
| MAPE |
|
|
|---|---|---|---|---|---|---|---|---|---|
| VolSurf | No | 2 | 101 | 35/17/49 | 0.73 | 0.69 | 0.15 | 0.72 | 0.56 |
| τRAMD [ | Yes | NA | 70 | 59/0/0 | 0.66 | NA | NA | NA | NA |
| τRAMD [ | Yes | NA | 94 | 80/0/0 | 0.75 | NA | 0.39 | NA | NA |
| COMBINE [ | Yes | 42 | 70 | 53/13/0 | 0.80 | 0.69 | 0.37 | 0.86 | NA |
R2: Coefficient of determination; Q2: the cross-validated R2; R: R2 for validation samples; R: R2 for external test samples; 5-fold cross-validation; leave-one-out cross-validation; 11 samples removed as outliers; 14 samples removed as outliers; 4 samples removed as outliers.
PLS modeling results of the dissociation rate constants of 39 A1AR agonists/antagonists.
| Model | No. of Variables | Variables Involved in Sequence | No. of PCs |
|
| RMSE | MAPE |
| RMSEP | Equation |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | POL | 1 | 0.638 | 0.606 | 0.299 | 0.201 | 0.584 | 0.341 | Y = 0.9691X |
| 2 | 2 | W5-N3+ | 1 | 0.688 | 0.631 | 0.278 | 0.201 | 0.627 | 0.284 | Y = 0.9771X |
| 3 | 3 | W2-O | 2 | 0.644 | 0.549 | 0.297 | 0.197 | 0.581 | 0.290 | Y = 0.9873X |
| 4 | 4 | Emin2-DRY | 2 | 0.678 | 0.599 | 0.282 | 0.193 | 0.489 | 0.311 | Y = 0.9814X |
| 5 | 5 | D13-OH2 | 2 | 0.717 | 0.623 | 0.264 | 0.185 | 0.561 | 0.277 | Y = 0.9912X |
| 6 | 6 | BV21-DRY | 2 | 0.757 | 0.610 | 0.245 | 0.165 | 0.579 | 0.279 | Y = 0.9934X |
| 7 | 7 | ID1-DRY | 2 | 0.762 | 0.600 | 0.242 | 0.150 | 0.647 | 0.257 | Y = 1.0237X |
Optimal PLS model with 2 descriptors; POL: average molecular polarizability; W5-N3+: hydrophilic regions generated by the sp3 NH3 probe at energy level of −3.0 kcal/mol (More details please refer to Table S1); leave-one-out cross-validation.
Figure 4Results of PLS model validations. (a) frequency distribution of R2 and R in 1000-times repeated PLS modeling; (b) 500-times Y random permutation test.
Figure 5Optimal PLS model of the dissociation rate constants of 39 A1AR agonists/antagonists. (a) scatter plot of experimental vs. predicted −log(k) of the 26 training samples; (b) scatter plot of experimental vs. predicted −log(k) of the 13 validation samples; (c) first principle component scores of the 26 training samples; (d) weights of independent variables in the first principle component.
PLS modeling results of the dissociation rate constants of p38 mitogen-activated protein kinase (p38 MAPK) inhibitors.
| Model | No. of Variables | Variables Involved in Sequence | No. of PCs |
|
| RMSE | MAPE |
| RMSEP | Equation |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | V-OH2 | 1 | 0.709 | 0.696 | 0.696 | 0.192 | 0.685 | 0.671 | Y = 0.9862X |
| 2 | 2 | BV21-OH2 | 1 | 0.821 | 0.818 | 0.546 | 0.145 | 0.821 | 0.527 | Y = 0.9574X |
| 3 | 3 | IW3-OH2 | 1 | 0.881 | 0.882 | 0.445 | 0.122 | 0.713 | 0.683 | Y = 0.9344X |
| 4 | 4 | Emin1-OH2 | 1 | 0.877 | 0.856 | 0.453 | 0.159 | 0.604 | 0.765 | Y = 0.9579X |
| 5 | 5 | W8 | 1 | 0.821 | 0.808 | 0.546 | 0.160 | 0.628 | 0.727 | Y = 0.993X |
| 6 | 6 | D7-DRY | 1 | 0.868 | 0.827 | 0.469 | 0.130 | 0.567 | 0.800 | Y = 0.9561X |
| 7 | 7 | D6-DRY | 1 | 0.853 | 0.760 | 0.465 | 0.135 | 0.481 | 0.905 | Y = 0.9224X |
| 8 | 8 | W8-O | 1 | 0.846 | 0.751 | 0.506 | 0.138 | 0.484 | 0.895 | Y = 0.9282X |
| 9 | 9 | IW7-OH2 | 1 | 0.859 | 0.756 | 0.485 | 0.125 | 0.490 | 0.905 | Y = 0.9167X |
Optimal PLS model with two descriptors; V-OH2: molecular volume given as the water solvent excluded volume (Å3); BV21-OH2: the best hydrophilic volumes generated by the water probe at energy levels of −1.0 and −3.0 kcal/mol. (More details please refer to Table S1); 3-fold cross validation.
Performance comparison among VolSurf, position-restrained and biased MD models.
| Model | Need to Consider Receptors? | No. of Variables | Total Sample Size | Training/Validation Samples |
|
|
|
|---|---|---|---|---|---|---|---|
| VolSurf | No | 2 | 28 | 18/10 | 0.82 | 0.82 | 0.82 |
| Position-restrained MD [ | Yes | 3 | 20 | 14/6 | 0.72 | 0.66 | 0.56 |
| Biased MD [ | Yes | NA | 8 | 8/0 | 0.64 | NA | NA |
3-fold cross-validation; leave-one-out cross-validation.
Figure 6Validations of the optimal PLS model: (a) frequency distribution of R2 and R in 1000-times repeated PLS modeling; (b) 500-times Y random permutation test.
Figure 7Optimal PLS model of the dissociation rate constants of 28 p38 MAPK inhibitors. (a) scatter plot of experimental vs. predicted −log(k) of the 18 training samples; (b) scatter plot of experimental vs. predicted −log(k) of the 10 validation samples; (c) first principle component scores of the 18 training samples. (d) weights of the two variables in the first principle component.
Figure 8The procedure of VolSurf description: (1) dividing the molecular space into 3D lattice points; (2) calculating the interaction energies of molecules by specific probes in each lattice point; (3) quantitating the interaction volume and surface information. MIF: molecular interaction fields.