| Literature DB >> 34909066 |
R D Jawarkar1, Ravindrakumar L Bakal1, Magdi E A Zaki2, Sami Al-Hussain2, Arabinda Ghosh3, Ajaykumar Gandhi4, Nobendu Mukerjee5, Abdul Samad6, Vijay H Masand7, Israa Lewaa8.
Abstract
Congruous coronavirus drug targets and analogous lead molecules must be identified as quickly as possible to produce antiviral therapeutics against human coronavirus (HCoV SARS 3CLpro) infections. In the present communication, we bear recognized a HIT candidate for HCoV SARS 3CLpro inhibition. Four Parametric GA-MLR primarily based QSAR model (R2:0.84, R2adj:0.82, Q2loo: 0.78) was once promoted using a dataset over 37 structurally diverse molecules along QSAR based virtual screening (QSAR-VS), molecular docking (MD) then molecular dynamic simulation (MDS) analysis and MMGBSA calculations. The QSAR-based virtual screening was utilized to find novel lead molecules from an in-house database of 100 molecules. The QSAR-vS successfully offered a hit molecule with an improved PEC50 value from 5.88 to 6.08. The benzene ring, phenyl ring, amide oxygen and nitrogen, and other important pharmacophoric sites are revealed via MD and MDS studies. Ile164, Pro188, Leu190, Thr25, His41, Asn46, Thr47, Ser49, Asn189, Gln191, Thr47, and Asn141 are among the key amino acid residues in the S1 and S2 pocket. A stable complex of a lead molecule with the HCoV SARS 3CLpro was discovered using MDS. MM-GBSA calculations resulted from MD simulation results well supported with the binding energies calculated from the docking results. The results of this study can be exploited to develop a novel antiviral target, such as an HCoV SARS 3CLpro Inhibitor.Entities:
Keywords: 3CLpro, 3C like Protease; FDA, Food and Drug Administration; GA-MLR; GA-MLR, Genetic Algorithm Multilinear Regression; HCoV SARS 3CLpro; HCoV-HKU1, Human coronavirus HKU1; HCoV-NL63, Human coronavirus NL63; HCoVs, human coronaviruses; Lead; MD, Molecular Docking; MDS, molecular dynamic simulation; MERS, Middle East Respiratory Syndrome; MMGBSA calculations; MMGBSA, Molecular Mechanics Generalized Born and Surface Area; Molecular docking and MD simulation; OECD, Organization for Economic Corporation and Development; QSAR based virtual screening; QSAR, Quantitative Structure Activity Relationship; RNA, Ribo-nucleic acid; SARS, severe acute respiratory sign; VS, Virtual Screening
Year: 2021 PMID: 34909066 PMCID: PMC8524701 DOI: 10.1016/j.arabjc.2021.103499
Source DB: PubMed Journal: Arab J Chem ISSN: 1878-5352 Impact factor: 6.212
showing Experimental end point (experimental pEC50 value in nm), Predicted fitting (Predicted pEC50 value) and Predicted fit residual value (residual).
| sn | Status | Exp. endpoint | Pred. fitting | Pred.Fit.Res. | Pred. LOO | Pred. LOO Res. |
|---|---|---|---|---|---|---|
| 1 | Training | 6.699 | 6.4234 | −0.2756 | 6.3412 | −0.3578 |
| 2 | Training | 6.301 | 6.4887 | 0.1877 | 6.5641 | 0.2631 |
| 3 | Training | 6.222 | 5.8991 | −0.3229 | 5.8629 | −0.3591 |
| 4 | Training | 5.886 | 5.6832 | −0.2028 | 5.6284 | −0.2576 |
| 5 | Training | 5.824 | 5.5829 | −0.2411 | 5.5531 | −0.2709 |
| 6 | Prediction | 5.745 | 5.475 | −0.27 | PRED | −0.27 |
| 7 | Training | 5.745 | 5.8673 | 0.1223 | 5.9219 | 0.1769 |
| 8 | Prediction | 5.745 | 5.8991 | 0.1541 | PRED | 0.1541 |
| 9 | Training | 5.602 | 5.8991 | 0.2971 | 5.9324 | 0.3304 |
| 10 | Training | 5.284 | 5.0159 | −0.2681 | 4.9946 | −0.2894 |
| 11 | Training | 5.268 | 5.1013 | −0.1667 | 5.0908 | −0.1772 |
| 12 | Training | 5.268 | 5.1013 | −0.1667 | 5.0908 | −0.1772 |
| 13 | Training | 5.268 | 5.2093 | −0.0587 | 5.2047 | −0.0633 |
| 14 | Training | 5.268 | 5.1013 | −0.1667 | 5.0908 | −0.1772 |
| 15 | Training | 5.268 | 5.6909 | 0.4229 | 5.746 | 0.478 |
| 16 | Training | 5.26 | 5.1239 | −0.1361 | 5.1067 | −0.1533 |
| 17 | Training | 5.102 | 5.1293 | 0.0273 | 5.1346 | 0.0326 |
| 18 | Training | 5.102 | 5.1293 | 0.0273 | 5.1346 | 0.0326 |
| 19 | Training | 5.102 | 5.0213 | −0.0807 | 5.0119 | −0.0901 |
| 20 | Prediction | 5.102 | 5.0866 | −0.0154 | PRED | −0.0154 |
| 21 | Training | 5.102 | 5.3523 | 0.2503 | 5.3728 | 0.2708 |
| 22 | Training | 5.073 | 4.9786 | −0.0944 | 4.9689 | −0.1041 |
| 23 | Training | 5.051 | 4.9934 | −0.0576 | 4.985 | −0.066 |
| 24 | Training | 5.051 | 4.765 | −0.286 | 4.7031 | −0.3479 |
| 25 | Prediction | 5.051 | 4.9079 | −0.1431 | PRED | −0.1431 |
| 26 | Training | 5.051 | 5.2093 | 0.1583 | 5.2217 | 0.1707 |
| 27 | Training | 5.051 | 4.9786 | −0.0724 | 4.9712 | −0.0798 |
| 28 | Prediction | 5.048 | 4.7402 | −0.3078 | PRED | −0.3078 |
| 29 | Training | 5.025 | 4.7627 | −0.2623 | 4.6986 | −0.3264 |
| 30 | Prediction | 5.025 | 4.8481 | −0.1769 | PRED | −0.1769 |
| 31 | Training | 4.928 | 5.3321 | 0.4041 | 5.3673 | 0.4393 |
| 32 | Training | 4.923 | 5.0415 | 0.1185 | 5.0826 | 0.1596 |
| 33 | Training | 4.609 | 4.5289 | −0.0801 | 4.5015 | −0.1075 |
| 34 | Training | 4.403 | 4.4822 | 0.0792 | 4.5994 | 0.1964 |
| 35 | Training | 4.357 | 4.7999 | 0.4429 | 4.9401 | 0.5831 |
| 36 | Training | 4.347 | 4.5491 | 0.2021 | 4.5866 | 0.2396 |
| 37 | Training | 4.222 | 4.4209 | 0.1989 | 4.4861 | 0.2641 |
Fig. 1Depiction of 37 dataset molecules used in QSAR study.
Fig. 2Plot of number of descriptors against Coefficient of Determination R2 and Leave-One out Coefficient of Determination Q2 to identify the optimum number of descriptors.
Fig. 7Display of Superimposed structures of Molecule 4 (Green colored) with Molecule TG-0204988 (Cyan Colored) within the binding pocket of SARS-CoV 229e 3CLpro (pdb id-2zu2).
Fig. 3Display of Descriptor fnotringNsp3C3B exclusively for the molecule 1 and 37.
Fig. 4Presentation of the descriptor faccH4B for the molecules 1 and 37 only.
Fig. 5Different graphs associated with the developed Quantitative Structure − Activity Relationship (QSAR) model: (a) experimental vs predicted pEC50 and (b) Williams plot to assess applicability domain of model, and (c) Insubria Plot.
Fig. 6Depiction of Molecule 4 orientation within the binding pocket of SARS-CoV 229e 3CLpro (pdb id-2zu2).
Portrayal of Structures, Docking Score (kcal/mol) and RMSD values for the five most active and five least active dataset molecules.
| Molecule | Structures | Docking Score | RMSD |
|---|---|---|---|
| 1 | −7.1447477 | 2.7707791 | |
| 2 | −7.8803358 | 1.7312964 | |
| 3 | −7.3945093 | 2.4114711 | |
| 4 | −8.4731464 | 1.6090333 | |
| 5 | −9.605979 | 2.440057 | |
| 33 | −7.4059458 | 1.5316099 | |
| 34 | −6.7014847 | 2.1337159 | |
| 35 | −10.147323 | 2.534488 | |
| 36 | −6.8096747 | 2.0087693 | |
| 37 | −7.909008 | 1.5497004 |
Fig. 8Presentation of 2D interaction of molecule 4 with SARS-CoV 3CLpro (pdb id-2zu2).
Presentation of Structures, Docking Score (kcal/mol), RMSD and PEC50 values for the five most active and five least active Hits obtained in QSAR Modeling Based Virtual Screening.
| sn | Molecule | Structure | Docking score | RMSD | PEC50 | status |
|---|---|---|---|---|---|---|
| 1 | 19 | −6.960 | 1.368 | 6.872 | Most active | |
| 2 | 6 | −7.126 | 2.978 | 6.743 | Most active | |
| 3 | 39 | −7.126 | 1.598 | 6.678 | Most active | |
| 4 | 91 | −6.728 | 2.3425 | 6.175 | Most active | |
| 5 | 97 | −8.043 | 1.53257 | 6.089 | Most active | |
| 6 | 38 | −7.335 | 1.666 | 6.025 | Most active | |
| 7 | 4 | −6.627 | 2.1068397 | 3.937 | Least active | |
| 8 | 9 | −7.485 | 1.298 | 3.921 | Least active | |
| 9 | 59 | −6.756 | 1.814 | 3.743 | Least active | |
| 10 | 98 | −6.948 | 0.9944 | 3.657 | Least active | |
| 11 | 94 | −7.533 | 1.638 | 3.635 | Least active | |
| 12 | 70 | −7.042 | 2.106 | 3.592 | Least active |
Fig. 9Presentation of 2D interaction of Hit molecule 97 with SARS-CoV 3CLpro (pdb id-2zu2).
Fig. 10Depiction of Hit Molecule 97 orientation within the binding pocket of SARS-CoV 229e 3CLpro (pdb id-2zu2).
Fig. 11Display of Superimposed structures of Hit Molecule 97 (Green colored) with Molecule TG-0204988 (cyan Colored) within the binding pocket of SARS-CoV 229e 3CLpro (pdb id-2zu2).
Fig. 12Root mean square deviation (RMSD) of C-α backbone of 229e (red) with Hit Molecule 97and Hcov_229e(green) with ligand compound 4 for 100 ns simulation exhibiting a stable configuration of 229e-hit6 & Hcov_229e-compound4.
Fig. 13Root mean square fluctuation of C-α backbone of 229e (red) & Hcov_229e (green) at its respective amino acid residues for 100 ns simulation exhibiting a stable configuration.
Fig. 14The types of bonds and the amino acid residues that participated during 100 ns of simulation; (A)229e- Hit Molecule 97, (B)Hcov_229e-complex molecule 4.
Fig. 152D interaction plots showing ligand interactions of 229e with (A) Hit Molecule 97 & (B) complex molecule 4 with the binding cavity residues of SARS-CoV 3CLpro.
Fig. 16Presentation of 3D and 2D interaction of Hit Molecule 97 in complex with SARS-CoV 3CLpro.
Fig. 17Presentation of 3D and 2D interaction of Molecule 4 in complex with SARS-CoV 3CLpro.
MMGBSA binding energy contribution by non bonded interactions by 229e-complex4, 229e-hit6 and 70-inactive molecules with the target protein.
| Energies (kcal/mol) | 229e-complex4 | 229e-hit97 | 70-inactive |
|---|---|---|---|
| ΔGbind | –32.2 ± 7.6 | −53.81 ± 6.7 | −7.2 ± 3.4 |
| ΔGbindLipo | −13.8 ± 2.9 | −19.5 ± 2.4 | −5.6 ± 1.1 |
| ΔGbindvdW | −38.1 ± 7.7 | −52.2 ± 7.2 | −4.8 ± 6.0 |
| ΔGbindCoulomb | −8.1 ± 7.9 | −14.0 ± 9.1 | −2.8 ± 0.9 |
| ΔGbindHbond | −0.14 ± 0.2 | −0.95 ± 0.1 | −0.49 ± 0.3 |
| ΔGbindSolvGB | 23.6 ± 9.1 | 30.6 ± 5.4 | 2.2 ± 0.7 |
| ΔGbindCovalent | 4.9 ± 2.3 | 2.8 ± 1.9 | 3.1 ± 3.5 |
Fig. 18MMGBSA trajectory (0 ns, before simulation and 100 ns, after simulation) exhibited conformational changes of dataset compound 4(a), most active hit molecule 97(b) and least active hit molecule 70 upon binding with the protein SARS CoV-229E 3CLpro. The arrows indicating the overall positional variation (movement and pose) of dataset compound 4, most active hit molecule 97 and least active hit molecule 70 at the binding site cavity.