| Literature DB >> 36188488 |
Trung Hai Nguyen1,2, Nguyen Minh Tam1,2, Mai Van Tuan3, Peng Zhan4, Van V Vu5, Duong Tuan Quang6, Son Tung Ngo1,2.
Abstract
Inhibiting the biological activity of SARS-CoV-2 Mpro can prevent viral replication. In this context, a hybrid approach using knowledge- and physics-based methods was proposed to characterize potential inhibitors for SARS-CoV-2 Mpro. Initially, supervised machine learning (ML) models were trained to predict a ligand-binding affinity of ca. 2 million compounds with the correlation on a test set of R = 0.748 ± 0.044 . Atomistic simulations were then used to refine the outcome of the ML model. Using LIE/FEP calculations, nine compounds from the top 100 ML inhibitors were suggested to bind well to the protease with the domination of van der Waals interactions. Furthermore, the binding affinity of these compounds is also higher than that of nirmatrelvir, which was recently approved by the US FDA to treat COVID-19. In addition, the ligands altered the catalytic triad Cys145 - His41 - Asp187, possibly disturbing the biological activity of SARS-CoV-2.Entities:
Keywords: Docking, Simulation; FEP; FEP, Free Energy Perturbation; LIE; LIE, Linear Interaction Energy; ML, Machine Learning; Machine learning; Mpro, SARS-CoV-2 Mpro; SARS-CoV-2 Mpro; SL, Supervised Learning; Supervised learning
Year: 2022 PMID: 36188488 PMCID: PMC9511900 DOI: 10.1016/j.chemphys.2022.111709
Source DB: PubMed Journal: Chem Phys ISSN: 0301-0104 Impact factor: 2.552
Figure 1Computational strategy. (A) Computational approach utilized to search promising inhibitors for SARS-CoV-2 Mpro by using hybrid approach involving supervised machine learning and atomistic simulations. (B) Ligand-binding pose was preliminarily predicted via AutoDock Vina. (C) Configuration of catalytic triad Cys145 - His41 - Asp187. (D) + (E) Initial conformations of SARS-CoV-2 Mpro + inhibitor and individual inhibitors in solution.
Performance metrics of regression models in predicting binding free energy of 120 tested ligands to SARS-CoV-2 Mpro. Numbers in parentheses are error bars estimated by bootstrapping.
| Linear Regression | 1.299 ± 0.104 | 0.631 ± 0.070 | 0.708 ± 0.053 |
| Random Forest | 1.157 ± 0.093 | 0.737 ± 0.046 | 0.753 ± 0.045 |
| XGBoost | 1.125 ± 0.095 | 0.748 ± 0.044 | 0.765 ± 0.048 |
| GraphConv | 1.161 ± 0.088 | 0.735 ± 0.050 | 0.749 ± 0.043 |
Figure 2Predicted binding free energy versus experiment for 120 test compounds. Prediction was made using XGBoost model.
Calculated binding free energy of top-lead compounds to SARS-CoV-2 Mpro via different approaches.
| N0 | Compound name | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | CHEMBL3815050 | -10.02 | -15.7 | 7.16 | -36.37 | -16.71 ± 0.36 | -2.06 | -10.06 | -12.12 ± 1.12 | |
| 2 | CHEMBL4300604 | -9.98 | -14.7 | 10.66 | -33.23 | -15.97 ± 0.49 | -9.48 | -9.49 | -18.97 ± 3.59 | |
| 3 | CHEMBL3945443 | -9.98 | -15.1 | 9.58 | -30.44 | -15.12 ± 0.03 | -16.92 | -14.73 | -31.65 ± 0.92 | |
| 4 | CHEMBL3678802 | -9.97 | -17.1 | 5.23 | -32.15 | -15.40 ± 1.39 | -9.47 | -8.72 | -18.19 ± 0.69 | |
| 5 | CHEMBL4170638 | -9.91 | -14.0 | 3.62 | -31.78 | -15.21 ± 0.36 | -3.68 | -11.19 | -14.87 ± 0.33 | |
| 6 | CHEMBL4111845 | -9.79 | -15.3 | 3.79 | -35.84 | -16.39 ± 0.58 | -12.95 | -8.46 | -21.41 ± 0.69 | |
| 7 | CHEMBL580289 | -9.75 | -15.0 | 7.88 | -32.08 | -15.51 ± 0.78 | -17.79 | -9.45 | -27.24 ± 2.87 | |
| 8 | CHEMBL3640406 | -9.74 | -15.8 | 15.84 | -29.33 | -15.10 ± 0.14 | 9.97 | -14.96 | -4.99 ± 1.26 | |
| 9 | CHEMBL538763 | -9.74 | -15.1 | 11.20 | -30.55 | -15.23 ± 0.78 | 6.02 | -12.13 | -6.11 ± 3.77 | |
| 10 | CHEMBL176909 | -9.73 | -15.3 | 14.99 | -35.52 | -16.84 ± 0.36 | -6.99 | -12.47 | -19.45 ± 1.07 | |
| 11 | CHEMBL4095929 | -9.72 | -17.2 | 18.81 | -32.15 | -16.06 ± 0.21 | 9.13 | -14.07 | -4.95 ± 0.42 | |
| 12 | CHEMBL3640394 | -9.69 | -14.4 | 15.34 | -29.54 | -15.14 ± 0.30 | 7.79 | -14.77 | -6.98 ± 0.57 | |
| 13 | CHEMBL3657195 | -9.68 | -15.0 | 7.87 | -30.99 | -15.19 ± 0.26 | 5.07 | -8.63 | -3.56 ± 1.93 | |
| 14 | CHEMBL416434 | -9.68 | -15.9 | 11.72 | -32.35 | -15.77 ± 1.97 | -7.06 | -11.11 | -18.17 ± 2.52 | |
| 15 | CHEMBL1471687 | -9.67 | -13.8 | -8.97 | -38.03 | -16.39 ± 0.45 | -7.19 | -13.76 | -20.95 ± 0.08 | |
| 16 | CHEMBL285908 | -9.67 | -11.3 | 11.07 | -33.31 | -16.01 ± 0.99 | 1.60 | -10.40 | -8.80 ± 2.00 | |
| 17 | CHEMBL3673817 | -9.67 | -16.3 | 2.23 | -32.67 | -15.40 ± 0.27 | 3.57 | -12.21 | -8.64 ± 1.74 | |
| 18 | CHEMBL4101092 | -9.66 | -17.1 | 14.50 | -32.10 | -15.83 ± 0.7 | -1.73 | -16.77 | -18.50 ± 1.08 | |
| 19 | CHEMBL20260 | -9.64 | -15.8 | -3.65 | -34.18 | -15.55 ± 1.34 | 1.54 | -12.55 | -11.01 ± 2.88 | |
| 20 | Nirmatrelvir | -9.57 | -13.8 | 2.21 | -29.80 | -14.57 ± 0.06 | -4.56 | -9.78 | -14.35 ± 0.04 | -10.46 |
value obtained based on IC50 value, in which term was approximately equal to inhibition constant and contribution of covalent binding energy is assumed to be small [88]. Unit is kcal mol-1.
Figure 3Probability of NBC and HB contacts between SARS-CoV-2 Mpro individual residues and top-lead compounds. Green rectangles denote residues that formed more than 6% HB and 46% NBC to ligands.
Figure 4Collective-variable FEL of SARS-CoV-2 Mpro in present and absent ligands. Distances and (, which are associated with catalytic triad Cys145 - His41 - Asp187, were utilized as reaction coordinates.
Figure 5Interaction diagram between SARS-CoV-2 Mpro + CHEMBL3945443. Complexed structure was obtained by calculating clustering of all equilibrium snapshots of complex with cutoff of 0.2 nm.