| Literature DB >> 33946965 |
Suman Kumar Mandal1, Parthapratim Munshi1.
Abstract
Optimization of lead structures is crucial for drug discovery. However, the accuracy of such a prediction using the traditional molecular docking approach remains a major concern. Our study demonstrates that the employment of quantum crystallographic approach-counterpoise corrected kernel energy method (KEM-CP) can improve the accuracy by and large. We select human aldose reductase at 0.66 Å, cyclin dependent kinase 2 at 2.0 Å and estrogen receptor β at 2.7 Å resolutions with active site environment ranging from highly hydrophilic to moderate to highly hydrophobic and several of their known ligands. Overall, the use of KEM-CP alongside the GoldScore resulted superior prediction than the GoldScore alone. Unlike GoldScore, the KEM-CP approach is neither environment-specific nor structural resolution dependent, which highlights its versatility. Further, the ranking of the ligands based on the KEM-CP results correlated well with that of the experimental IC50 values. This computationally inexpensive yet simple approach is expected to ease the process of virtual screening of potent ligands, and it would advance the drug discovery research.Entities:
Keywords: kernel energy method; lead structure; molecular docking; protein-ligand interaction; quantum crystallography; scoring function
Mesh:
Substances:
Year: 2021 PMID: 33946965 PMCID: PMC8124947 DOI: 10.3390/molecules26092605
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Trend of publications from docking studies in the last two decades obtained from PubMed (https://www.ncbi.nlm.nih.gov/pmc/, accessed on 25 January 2021).
Details of the target protein structures.
| Protein Complexes | hAR-IDD594 | CDK2-NU6102 | ERβ-4NA |
|---|---|---|---|
| PDB ID | 1US0 | 1H1S | 1YY4 |
| Resolution | 0.66 Å | 2.0 Å | 2.7 Å |
| Hydrophobic | 3.0 | 1.4 | 4.4 |
| Hydrophilic | 0.7 | 1.0 | 0.3 |
| Balance ** | 4.2 | 1.4 | 13.3 |
* Radius of 5.0 Å was used for binding site. ** Balance >6.0 indicates high hydrophobicity and/or likely lipophilicity.
List of ligands of hAR with their average IEs and GoldScore fitness scores for the two types of poses. The most negative IEs and the best fitness scores from GoldScore are highlighted using (values in bold). The reported experimental IC50 [7] values are also listed.
| Ligand # | Experimental IC50 (nM) | Pose Type 1 | Pose Type 2 | ||||
|---|---|---|---|---|---|---|---|
| Avg. IE (kCal·mol−1) | GoldScore | Avg. IE (kCal·mol−1) | GoldScore | ||||
| Fitness Score | Rank | Fitness Score | Rank | ||||
| 10 | 176 |
|
| 1 | 2.35 | 58.51 | 19 |
| 16 | 44 |
|
| 1 | −18.68 | 72.97 | 4 |
| 19 | 30 |
|
| 1 | 18.24 | 76.64 | 2 |
| 24 | 7 |
| 77.90 | 2 | −33.42 |
| 1 |
| 25 | 6 |
| 73.76 | 2 | −19.97 |
| 1 |
Comparison of RMSDs of both types of poses with respect to the crystal geometry of IDD594.
| Ligand # | Pose Type 1 | Pose Type 2 | ||
|---|---|---|---|---|
| Avg. IE (kCal·mol−1) | RMSD Crystal Geometry (Å) | Avg. IE (kCal·mol−1) | RMSD Crystal Geometry (Å) | |
| Predicted pose superimposed on crystal geometry (grey) |
|
| ||
| 10 | −107.67 | 0.223 | 2.35 | 2.042 |
| 16 | −89.93 | 0.263 | −18.68 | 2.563 |
| 19 | −121.11 | 0.137 | 18.24 | 2.128 |
| 24 | −98.39 | 0.943 | −33.42 | 2.342 |
| 25 | −100.45 | 1.180 | −19.97 | 2.603 |
Figure 2Distribution of RMSD (Å) vs. fitting score from GoldScore and vs. KEM-CP based average IE (Table 2 and Table 3) of hAR ligands.
List of ligands of CDK2 with their IC50 values [48] and average IEs and docking score details for the three types of poses. The most negative IEs and the best fitness scores are highlighted (values in bold).
| Ligand # | Experimental IC50 (nM) | Pose Type 1 | Pose Type 2 | Pose Type 3 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Avg. IE (kCal·mol−1) | GoldScore | Avg. IE (kCal·mol−1) | GoldScore | Avg. IE (kCal·mol−1) | GoldScore | |||||
| Fitness Score | Rank | Fitness Score | Rank | Fitness Score | Rank | |||||
| 3 | 5.4 ± 1.0 |
|
| 1 | −422.08 | 63.07 | 2 | −299.76 | 60.32 | 3 |
| 25 | 69 ± 1 |
|
| 1 | −258.26 | 59.74 | 2 | −219.28 | 55.47 | 3 |
| 28 * | 7.0 ± 0.1 |
|
| 1 | - | - | - | −334.16 | 56.93 | 10 |
| 29 | 56 ± 20 |
|
| 1 | −376.89 | 59.73 | 4 | −376.57 | 55.92 | 11 |
| 30 | 63 ± 7 | −398.53 |
| 1 |
| 66.16 | 3 | −362.61 | 54.99 | 12 |
| 33 | 210 ± 40 |
|
| 1 | −414.07 | 63.00 | 4 | −317.21 | 56.66 | 14 |
| 34 | 64 ± 33 |
|
| 1 | −334.12 | 59.39 | 2 | Failed | 55.37 | 6 |
* None of the poses belonged to Type 2.
Comparison of RMSDs of the three types of predicted poses with respect to the crystal geometry of NU6102.
| Ligand # | Pose Type 1 | Pose Type 2 | Pose Type 3 | |||
|---|---|---|---|---|---|---|
| Avg. IE (kCal·mol−1) | RMSD with Crystal | Avg. IE (kCal·mol−1) | RMSD with Crystal | Avg. IE (kCal·mol−1) | RMSD with Crystal | |
| Predicted pose superimposed on crystal geometry (grey) |
|
|
| |||
| 3 | −434.26 | 0.832 | −422.08 | 4.072 | −299.76 | 3.212 |
| 25 | −304.97 | 0.944 | −258.26 | 4.379 | −219.28 | 2.947 |
| 28 | −433.67 | 0.844 | - | - | −334.16 | 3.221 |
| 29 | −434.11 | 0.798 | −376.89 | 3.797 | −376.57 | 2.752 |
| 30 | −398.53 | 0.811 | −498.35 | 3.454 | −362.61 | 2.950 |
| 33 | −432.88 | 0.567 | −414.07 | 3.680 | −317.21 | 2.908 |
| 34 | −418.00 | 0.975 | −334.12 | 3.330 | Failed | 2.875 |
Figure 3Distribution of RMSD (Å) vs. fitting score from GoldScore and vs. KEM-CP based average IE of CDK2 ligands as listed in Table 4 and Table 5.
Figure 4Plot showing IC50 ranking distribution with the KEM-CP IE and GoldScore fitness score rankings for the best predicted poses as listed in Table S15, Supplementary Materials.
List of ligands of ERβ with their IC50 values [49] and average IEs and docking score details for the four types of poses. The most negative IEs and the best fitness scores are highlighted (values in bold).
| Ligand # | Experimental IC50 (nM) | Pose Type 1 | Pose Type 2 | Pose Type 3 | Pose Type 4 | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Avg. IE (kCal·mol−1) | GoldScore | Avg. IE (kCal·mol−1) | GoldScore | Avg. IE (kCal·mol−1) | GoldScore | Avg. IE (kCal·mol−1) | GoldScore | ||||||
| Fitness Score | Rank | Fitness Score | Rank | Fitness Score | Rank | Fitness Score | Rank | ||||||
| 15 | 2.52 ± 1.3 | −40.33 |
| 1 | −30.44 | 52.58 | 3 |
| 55.11 | 2 | −25.86 | 51.85 | 4 |
| 25 | 2.8 ± 0.1 |
| 50.76 | 3 | −54.59 | 49.80 | 6 | −57.96 | 50.73 | 4 | −48.19 |
| 1 |
| 27 | 2.3 ± 0.1 | −49.90 | 49.68 | 7 |
| 50.17 | 3 | −43.79 | 50.02 | 5 | −50.99 |
| 1 |
| 29 | 1.4 ± 0.6 |
| 52.28 | 3 | −37.37 | 51.88 | 5 | −42.65 | 51.99 | 4 | −50.74 |
| 1 |
| 40 | 1.6 ± 0.7 |
|
| 1 | −52.03 | 49.54 | 7 | −49.17 | 52.11 | 3 | −41.80 | 52.68 | 2 |
| 44 | 2.3 ± 1.7 | −53.64 |
| 1 | −42.58 | 49.53 | 5 |
| 54.53 | 3 | −24.37 | 52.34 | 4 |
| 57 | 0.5 ± 0.5 |
|
| 1 | −36.44 | 47.02 | 9 | −51.64 | 52.72 | 3 | −21.68 | 49.41 | 5 |
| 62 | 2.1 ± 0.9 |
|
| 1 | −26.68 | 53.51 | 5 | −40.83 | 55.12 | 3 | −38.80 | 49.52 | 11 |
| 68 | 1.2 ± 0.7 |
|
| 1 | −2.31 | 47.19 | 9 | −39.92 | 55.18 | 3 | −19.58 | 45.38 | 11 |
| 70 | 1.1 ± 1.6 | −47.88 | 53.93 | 2 | −38.95 | 51.50 | 7 |
|
| 1 | −28.36 | 50.32 | 10 |
Comparison of RMSDs of the four types of predicted poses with respect to the crystal geometry of 4NA.
| Ligand # | Pose Type 1 | Pose Type 2 | Pose Type 3 | Pose Type 4 | ||||
|---|---|---|---|---|---|---|---|---|
| Avg. IE (kCal·mol−1) | RMSD (Å) | Avg. IE (kCal·mol−1) | RMSD (Å) | Avg. IE (kCal·mol−1) | RMSD (Å) | Avg. IE (kCal·mol−1) | RMSD (Å) | |
| Predicted pose superimposed on crystal geometry (grey) |
|
|
|
| ||||
| 15 | −40.33 | 0.399 | −30.44 | 3.526 | −41.90 | 1.219 | −25.86 | 3.185 |
| 25 | −64.69 | 0.161 | −54.59 | 3.430 | −57.96 | 1.100 | −48.19 | 3.155 |
| 27 | −49.90 | 0.276 | −56.42 | 3.570 | −43.79 | 1.027 | −50.99 | 3.189 |
| 29 | −58.45 | 0.149 | −37.37 | 3.390 | −42.65 | 0.978 | −50.74 | 3.190 |
| 40 | −52.07 | 0.371 | −52.03 | 3.452 | −49.17 | 1.044 | −41.80 | 3.193 |
| 44 | −53.64 | 0.329 | −42.58 | 3.515 | −54.89 | 1.122 | −24.37 | 3.221 |
| 57 | −69.15 | 0.239 | −36.44 | 3.499 | −51.64 | 1.117 | −21.68 | 3.227 |
| 62 | −56.79 | 0.147 | −26.68 | 3.420 | −40.83 | 1.040 | −38.80 | 3.380 |
| 68 | −47.56 | 0.099 | −2.31 | 3.393 | −39.92 | 0.992 | −19.58 | 3.255 |
| 70 | −47.88 | 0.153 | −38.95 | 3.147 | −63.55 | 1.100 | −28.36 | 3.434 |
Figure 5Distribution of RMSD (Å) vs. fitting score from GoldScore and vs. KEM-CP based average IE of ERβ ligands as listed in Table 6 and Table 7.
Figure 6Plot showing IC50 ranking distribution with the KEM-CP IE and GoldScore Fitness score rankings for the best predicted poses (data from Table S28).
Summary of lead structure prediction by GoldScore and KEM-CP.
| Protein | Active Site Environment | No. of Ligands | No. of Ligands Predicted Correctly By | % of Ligands Predicted Correctly By | ||
|---|---|---|---|---|---|---|
| GoldScore | KEM-CP | GoldScore | KEM-CP | |||
|
| Moderately hydrophobic | 5 | 3 | 5 | 60 | 100 |
|
| Highly hydrophilic | 7 | 7 | 6 | 100 | 86 |
|
| Highly hydrophobic | 10 | 7 | 9 | 70 | 90 |