Literature DB >> 32960061

Theoretical Insights into the Anti-SARS-CoV-2 Activity of Chloroquine and Its Analogs and In Silico Screening of Main Protease Inhibitors.

A S Achutha1, V L Pushpa1, Surendran Suchitra1.   

Abstract

Corona virus disease (COVID-19) is a dangerous disease rapidly spreading all over the world today. Currently there are no treatment options for it. Drug repurposing studies explored the potency of antimalarial drugs, chloroquine and hydroxychloroquine, against SARS-CoV-2 virus. These drugs can inhibit the viral protease, called chymotrypsin-like cysteine protease, also known as Main protease (3CLpro); hence, we studied the binding efficiencies of 4-aminoquinoline and 8-aminoquinoline analogs of chloroquine. Six compounds furnished better binding energies than chloroquine and hydroxychloroquine. The interactions with the active site residues especially with Cys145 and His41, which are involved in catalytic diad for proteolysis, make these compounds potent main protease inhibitors. A regression model correlating binding energy and the molecular descriptors for chloroquine analogs was generated with R2 = 0.9039 and Q2 = 0.8848. This model was used to screen new analogs of primaquine and molecules from the Asinex compound library. The docking and regression analysis showed these analogs to be more potent inhibitors of 3CLpro than hydroxychloroquine and primaquine. The molecular dynamic simulations of the hits were carried out to determine the binding stabilities. Finally, we propose four compounds that show drug likeness toward SARS-CoV-2 that can be further validated through in vitro and in vivo studies.

Entities:  

Keywords:  3CLpro; COVID-19; SARS-CoV-2; antiviral screening; chloroquine; hydroxychloroquine; molecular docking; molecular dynamics; regression

Mesh:

Substances:

Year:  2020        PMID: 32960061      PMCID: PMC7640984          DOI: 10.1021/acs.jproteome.0c00683

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


Introduction

In December 2019, patients were admitted in hospitals in Wuhan, China due to pneumonia with fever and other symptoms.[1,2] In January 2020, it was discovered that a novel coronavirus, which is similar to SARS-CoV and MERS-CoV genomes, is responsible for this disease.[2] This virus is named as Severe Acute Respiratory Syndrome Corona Virus-2 (SARS-CoV-2) and the disease is called Corona Virus Disease-2019 (COVID-19).[3,4] It was declared as pandemic on March 11 by the World Health Organization (WHO).[5] As of September 1, 2020, there are 25,327,098 confirmed cases with 848,255 deaths worldwide according to WHO reports.[6] Coronavirus is a single-stranded positive sense RNA with a membrane envelope composed of four types of viruses: α, β, γ, and δ coronaviruses. SARS-CoV-2 belongs to β-coronavirus family.[5,7] Its genome encodes for structural proteins mainly spike glycoprotein (S), envelope glycoprotein (E), membrane glycoprotein (M), nucleocapsid protein (N), and nonstructural proteins consisting of chymotrypsin-like cysteine protease also known as Main protease (3CLpro) and papain-like protease (PLpro).[1,8,9] The viral RNA is released to the host cell, which is then translated to polyproteins inside the cell. These viral polyproteins are cleaved to functional proteins by proteolysis, which is catalyzed by 3CLpro and PLpro 1,8,9. Since the main protease or 3CLpro cleaves at 11 sites of the polyprotein to produce smaller proteins for viral replication and because of its nonsimilarity with human proteins, 3CLpro is a potential target in anti-COVID-19 drug design.[8,10] The main protease of SARS-CoV-2 has a Cys-His catalytic diad involving Cys145 and His41 and has four conserved binding subsites S1, S2, S4, and S1′. Main chains of Cys145, Phe140, and Leu141 and side chains of Asn142, Glu166, His163, and His172 residues are present in the S1 subsite. The side chains of His41, Met49, Tyr54, and Met165, along with the alkyl side chain of Asp187 forms the S2 subsite. The S4 subsite consists of Met165, Leu167, Phe185, and Gln192 side chain residues and the main chain of Gln189. S1′ includes Thr24 and Thr25. Therefore, a molecule to show 3CLpro inhibitory property should interact with one or more residues among these amino acids. Also, the inhibitory action can be assumed to enhance if they interact with Cys145 or His41, which is involved in the catalytic diad of Main protease[10] (Figure ). Cysteine acts as a nucleophile and Histidine as an acid/base catalyst in this catalytic reaction.[9,11]
Figure 1

Mechanism of proteolysis by Cys-His diad by hydrolysis of amide substrate. (1) Deprotonation from cysteine by histidine. (2) Nucleophilic attack on the histidine. (3) Release of an amine resulting in the formation of a thioester deprotonation of histidine. (4) Addition of water. (5) Thioester is hydrolyzed and cysteine–substrate bond is broken. (6) Regeneration of enzyme with the elimination of a carboxylic acid molecule.[9,11]

Mechanism of proteolysis by Cys-His diad by hydrolysis of amide substrate. (1) Deprotonation from cysteine by histidine. (2) Nucleophilic attack on the histidine. (3) Release of an amine resulting in the formation of a thioester deprotonation of histidine. (4) Addition of water. (5) Thioester is hydrolyzed and cysteine–substrate bond is broken. (6) Regeneration of enzyme with the elimination of a carboxylic acid molecule.[9,11] Currently there are no specific approved treatment options for COVID-19. Researches are in search of vaccine development, druggable small molecule, monoclonal antibodies and cell-based therapies.[12] Several drug repurposing studies have been conducted so far, and many of them gave positive results. The 4-aminoquinolines, chloroquine (CQ), and its hydroxyl analog hydroxychloroquine (HCQ) have been found to disrupt the viral replication and infectivity.[13−15] CQ [4-N-(7-chloroquinolin-4-yl)-1-N,1-N-diethylpentane-1,4-diamine] and HCQ (2-[4-[(7-chloroquinolin-4-yl)amino]pentyl-ethylamino]ethanol) are used particularly as antimalarial drugs. Their broad spectrum of activity as antibacterial, antiviral, and antifungal infections has also been reported.[15,16] HCQ has been found out to be less toxic than CQ, and it is currently being used to treat COVID-19 patients.[17] The present study deals with theoretical perspectives on the anti-SARS-CoV-2 activity of CQ and HCQ analogs by observing their main protease inhibitory property. Molecular docking studies with the 3CLpro protein were performed to analyze the drug likeness as well as to correlate the binding energy of the docked complex with various physicochemical properties of the active molecules, which will aid in the design of new anti-COVID-19 medicatives.

Methodology

Computational Details

Analogs of CQ and HCQ as well as primaquine derivatives were obtained through literature search, from the database of small molecules Pubchem as well as created manually.[18−22] All the molecules were drawn in Marvin Sketch and minimized their energy using Avogadro software.[23,24] The structure of 3CLpro protein having PDB ID 6LU7 was retrieved from RCSB Protein Data Bank.[10,25] Characterization studies of the protein were done by using Expasy-protparam, an online server, which provides all the details regarding the protein.

Molecular Docking

The energy minimization of the retrieved protein was done by employing Swiss PBD Viewer.[26,27] Molecular docking was done using Autodock 4.2.6.[28] Both protein and ligands were prepared in pdbqt format. Polar hydrogens and Gasteiger charges were added to the receptor. Grid was prepared with 60 × 60 × 60 Å3 with spacing 0.375 Å and centered at −18.617, 13.208, and 60.061 Å along x, y, and z axes, respectively. Grids were prepared for every atom present in the ligand data set. The genetic algorithm was employed as search parameter with 50 runs, 300 and 27,000 population size and number of generations, respectively.

Regression Analysis

A multiple linear regression (MLR) model was generated to find the correlation with binding energy obtained from docking and the physicochemical properties using QSARINS.[29,30] The molecular descriptors for the ligands were calculated utilizing PaDEL-Descriptor.[31] To build the model, 80% of the data set were randomly divided as training set and test set in 80:20 ratio. Thirty-five compounds (28 training and 7 test) including CQ and HCQ were employed for modeling purpose (Supplementary Tables S2 and S3). The models were generated using training set with MLR analysis of 1–5 variables. The generated models were examined by internal validation and external validation. Internal validation was done by cross-validated leave-one-out (LOO) method, which involves iteratively leaving one compound from the training set and generating regression model with the remaining molecules and predicting the value of response for the excluded one. For a good model, the regression coefficient (R2) as well as the cross validated R2, that is, Q2 has to be close and >0.6.[29,32] The model was validated using test set, which was not included in model generation.[29,33] The best model was used to predict the binding energy of novel set of compounds.

Molecular Dynamic Simulations

All the MD simulations were done using the NAMD package (Version 2.14) developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana–Champaign[34] with the CHARMM 36 force field.[35,36] The docked structures were used as the initial coordinates for MD simulations. TIP3P water box was used to solvate the complex with 5 Å padding. The solvated system was minimized for 100 ps with a time step of 2 fs. The simulations were done with Particle mesh Ewald electrostatics and periodic boundary conditions. Atom-based cutoff of 12 Å was applied for nonbonding interactions. The temperature and pressure were kept constant at 310 K and 1 atm, respectively, using Langevin dynamics. The system was equilibrated for 100 ps followed by a production simulation for 20 ns using the NPT ensemble. All the preparation and analysis steps were done using the VMD package 1.9.3 (available at http://www.ks.uiuc.edu/Research/vmd/) (see Figure ).[37]
Figure 2

Flowchart for the methodology used in prediction of main protease inhibitor.[38]

Flowchart for the methodology used in prediction of main protease inhibitor.[38]

Results and Discussion

Protein Characterization

Being an inevitable enzyme for replication and transcription of the virus, the 3-chymotrypsin-like-cysteine protease (3CLpro) having PDB ID 6LU7 was retrieved from RCSB PDB and characterized by primary and secondary structure analysis as shown in Table .
Table 1

Values of Features for Protein Characterization

propertiesvalues
molecular weight333,797.64 kDa
energy–16,473.465 kJ/mol
resolution2.16 Å
theoretical pI5.95
aliphatic index82.12
GRAVY–0.019
instability index27.65
The pI value (isoelectric point) of 5.95 shows the slightly acidic nature of the protein. The aliphatic index (AI) is a measure of the relative volume occupied by the side chains Ala, Leu, Ile, and Val, which are the aliphatic residues in a protein. High AI ranging from 66.5 to 84.33 implies high thermal stability and hydrophobicity, which help it for membrane penetration in biological system.[39,40] The value for grand average of hydropathy (GRAVY) indicates the hydrophilicity of the protein.[40] The GRAVY value of −0.019, which is close to zero, shows that the protein is hydrophobic. Instability index <40 shows that in vivo half-life is greater than 60 h, which indicates the stability of the protein.[41,39] All the parameters show that the 6LU7 is a stable, hydrophobic protein. The 6LU7 protein consists of three domains in which the catalytically active site is present between Domain I and II. Figure S1 (Supporting Information) shows the domains and active site of 6LU7 protein bound with its cocrystallized inhibitor N3.[10,42]

Comparative Docking of CQ and HCQ with 3CLpro and PfDHFR-TS Targets

The antimalarial drugs chloroquine (CQ) and its hydroxy analog hydroxychloroquine (HCQ) are currently used as drugs for the treatment of COVID-19.[13,17] To compare its efficiency as an anti-COVID-19 drug to that of an antimalarial drug, docking studies were carried out for both targets, 3CLpro and PfDHFR-TS. Plasmodium falciparum dihydrofolate reductase thymidylate synthase (PfDHFR-TS) is one of the important targets for antimalarial drugs[43,44] (Supplementary Table S1). It was identified that for both the targets HCQ has lower binding energy owing to its higher binding affinity to the protein. Both the drugs are more active on PfDHFR-TS than 3CLpro. The 2D interaction diagrams and docked poses of CQ and HCQ with the target 3CLpro are given in Figures S2 and S3, respectively. HCQ fits more perfectly to the active site of the target than CQ, and hence, HCQ has high negative binding energy, which imparts more stability to the docked complex (Supplementary Figure S3). CQ forms three hydrogen bonds with Gly143, Cys145, and His164; van der Waals interaction with Asn142 and Gln189; π-alkyl interaction with His163; and two π-donor hydrogen bond interactions with the −SH group of Cys145 at distances 3.78 Å, 5.16 Å, 5.31 Å, 4.30 Å, 5.10 Å, 6.63 Å, 4.93 Å, and 4.95 Å, respectively (Supplementary Figure S2a, Table ). The Cys145-His41 diad is blocked by these interactions. The interaction of alkyl group of the CQ with the π-cloud of imidazole ring of histidine may disturb the delocalization of π electrons. The basicity of the imidazole ring is due to the resonance stabilization of the positive imidazolium ion (Figure ).[45]
Table 2

Binding Energy of Chloroquine Analogs and Their Interactions

  interaction with amino acid residues
lig namebinding energy (kcal/mol)hydrogen bondhydrophobic and other interactions
CQ–6.13Gly143, Cys145, His164His41, Asn142, His163
HCQ–6.58Phe140, Asn142, Ser144, Glu166Phe140, Asn142, Cys145, Glu166, His172
1–6.76Phe140, Glu166Cys145, Ser144
2–5.67Asn142Leu141, Cys145
3–5.9 Val104, Ile106, Phe112, Ile136, Ser158, Cys160, Tyr182,
4–6.55His41, Cys145, Glu166Leu27, Phe140, His163, Met165, His172
5–5.31Ser144, Glu166Cys145, His163
6–5.91Asn142Cys145
7–5.32Thr111Val104, Ile106, Asn151, Thr292, Asp295
8–7.19Cys145, Glu166Met49, Cys145
9–5.29Asn142Phe140, Cys145
10–6.62Glu166His41, Met165, Glu166
11–5.67 Met49, Phe140, Leu141, Met165, Glu166
12–5.23Asn142Asn142, Cys145, Glu166, Leu167, Pro168
13–6Asn142, Ser144, Glu166Asn142, Cys145
14–5.89Asn142, Ser144Asn142, Cys145
15–6Asn142, Ser144Cys145, Asn142
16–5.98Phe140, Asn142, Glu166His41, Ser139, Leu141, Cys145, His172
17–5.9Glu166Phe140, Cys145, Glu166
18–5.98Asn142, Glu166Asn142, Cys145, Met165, Glu166, Pro168,
19–5.81Ser139, Asn142, Ser144Phe140, Leu141 Asn142, Cys145, His172
20–5.81Leu141, Glu166His41, Leu141, Cys145, Met165
21–5.65Asn142, Glu166Asn142, Cys145, Glu166
22–7.21Ser139, Ser144, Asn142Phe140, Leu141, Asn142, Cys145, His172
23–5.8Phe140, Asn142, Glu166His41, Phe140, Leu141, Asn142, Gly143, Cys145, His163, Glu166
24–7.05Gly143, His164Phe140, Cys145, His163, Met165, Glu166
25–7.28Asn142, Ser144, Glu166Phe140, Leu141, Cys145
26–5.25Asn142, Glu166His41, Phe140, Leu141, Cys145, Glu166, Pro168
27–6.15Gly143, His164Cys145, Met165, Glu166, Leu167, Pro168
28–6.47Glu166His41, Phe140, Leu141, Asn142, Cys145, His163, Met165, Glu166
29–7.33Phe140, Gly143, Ser144, Cys145, Glu166Cys145
30–5.49Asn142, Ser144, Glu166Phe140, leu141, Cys145
31–6.03Leu141, Glu166Leu27, His41, Phe140, Gly143, Cys145, Glu166
32–5.97Gly138, Phe140, Asn142Phe140, Asn142, Gly143, Ser144, Cys145, Glu166
33–6.05Phe140, Asn142, Glu166Ser139, Cys145, His163, Met165, Glu166, His172
Figure 3

Resonance stabilization of imidazolium ion.

Resonance stabilization of imidazolium ion. The alkyl−π interaction of the alkyl donor and aromatic acceptor increases the stability of the docked complex as well as reduces the ability of the imidazole ring to accept H+ from the Cys residue, which is the initial step in the catalytic cycle in proteolysis, thereby reducing the activity.[46,11] HCQ forms two hydrogen bonds with Phe140, and one each with Asn142, Ser144, and Glu166 at distances 5.15 Å, 5.0 Å, 3.94 Å, 4.31 Å, and 5.04 Å, respectively. It forms two van der Waals interactions with Phe140, and one each with Glu166 and His172 at distances 6.59 Å, 7.12 Å, 5.30 Å, and 5.63 Å, respectively. It also forms a π–σ interaction with Asn142 and two π–alkyl interactions with Cys145 (Supplementary Figure S2b, Table ). The π–alkyl interaction with Cys145 may disturb the conformation required for the interaction with polypeptide in the catalytic cycle. There have been reports about the entry of SARS coronavirus as acidic pH dependent in which the activation occurs by the fusion of the membranes of virus and cellular endosomes and viral genome enters the cytoplasm.[47,48,15] The presence of amino side chain in CQ and HCQ makes them basic and increases the pH of cell organelles like endosomes and lysosomes, which interferes with the replication and inhibits the viral activation.[17,15]

Molecular Docking Analysis of CQ and HCQ Analogs

To find more potent inhibitors of 3CLpro, analogs of chloroquine were subjected to molecular docking, which were obtained from literature, Pubchem database, by replacement with isosteres and functional group variations[18−20] of chloroquine (Supplementary Tables S2 and S3). The binding energy thus obtained from docking with 6LU7 protein is given in Table . All the chloroquine analogs show some of the key interactions with the active site residues. The negative value shows a release of energy while forming a protein–ligand docked complex, which imparts stability. The more negative the binding energy, the higher will be the stability and binding affinity. The compounds 29, 25, 22, 8, 24, and 10 have more negative binding energy than CQ and HCQ. Their 2D ligand interaction diagrams are shown in Supplementary Figure S4. From the structures of these six compounds, we can infer that if the side chain has an extra −NH group, it can introduce additional hydrogen bonds, which results in a decrease in binding energy. In compound 24, the tail of side chain has become a part of a ring, which can interact with Met165 residue via π–alkyl interaction. Because of the presence of an additional −NH group, the basicity of these compounds is higher than CQ and HCQ, and thus, they can act as better inhibitors against SARS-CoV-2. Compounds 29, 25 22, 8, 24, and 1 form π-donor hydrogen bonds with Cys145, which will make the −SH hydrogen less available for catalytic cycle. Compounds 29, 25, 22, 24, and 1 have a π–alkyl interaction with Cys145. Overall, these interactions may affect the catalysis of proteolysis reaction and hence block viral replication. Compound 29 is primaquine (an 8-aminoquinoline), which is an antimalarial drug.[49,50] The effect of the side chain in the chloroquinoline scaffold was studied by changing the chain length and methyl substitution of the side chain. Compounds 12 and 23 are obtained by the removal of the branching methyl group from CQ and HCQ structures, which results in an increase in the binding energy to −5.23 and −5.80 kcal/mol, respectively. Therefore, it is understood that the branching next to theNH– group in 4-aminopentyl side chain is important. Isosteric substitution of −CH3 group in compounds 21, 22, and 26 with −OH, −NH2, and −CF3 results in binding energies −5.75, −7.21, and −5.25 kcal/mol, respectively. Substitution with NH2 resulted in a decrease in binding energy compared to HCQ by a factor of −0.63 kcal/mol, which means that an extra 0.63 kcal/mol of energy is released while forming a docked complex, which is more stable than that of HCQ. An increase in binding energy due to the variation of side chain length in CQ and HCQ structures of compounds 6, 9, 11, 14, 15, 16, 17, 18, and 19 makes them less stable complexes with the protein than with CQ and HCQ. Therefore, the chain length of 4-aminopentyl group is optimal for showing biological activity. When an additional −OH group is added to the ethyl chain of HCQ resulting in 2-[(4-aminopentyl)(2-hydroxyethyl)amino]ethan-1-ol side chain as in compound 13, as a result, the binding energy is decreased to −6.0 kcal/mol. Therefore, the −OH group at one terminal ethyl chain only is required for their inhibitory activity. Rearrangement of the amino side chain from the fourth position to fifth position of quinoline ring in HCQ (molecule 33) increased the binding energy by 0.53 kcal/mol. Replacement of quinoline scaffold by isoquinoline scaffold also increased the binding energy by an amount of 0.61 kcal/mol.

Regression Analysis

The CQ, HCQ, and its 33 analogs taken for this study were subjected to MLR analysis. Regression models were generated with binding energy as the response and molecular descriptors as the variables. Many models were generated by varying 1–5 variables at a time from a set of 1602 descriptors. The fitness criteria of a model include R2 ≥ 0.6, R2adj ≥ 0.6, Q2loo ≥ 0.6, high value for Fisher ratio (F), low regression standard deviation (s), lower correlation between descriptors (K), positive value for δk, which is the difference in correlation between descriptors and the response (y) value and correlation among descriptors, and smaller root–mean–squared error for training calculation (RMSEtr).[51,52] The parameters of the best five models calculated are given in Table .
Table 3

Parameters of Top Five Regression Models Fitting the Criteria

model no.descriptors usedR2RadjQ2looFsKxxδkRMSEtr
1SHBint4, minsNH2, n6Ring, GraphFP567, KRFP5670.91010.88970.876544.5000.21440.26420.05910.1900
1SCH-7, nHsNH2, minHBint6, FP402, KRFPC4760.90390.88300.848843.28190.23310.23780.10580.2076
2SCH-7, SHBint4, minsNH2, KRFP4340.87920.85990.838445.50690.23920.24150.07930.2184
3SHBint4, nTRing, ExtFP698, GraphFP8930.86120.84220.816245.07980.21400.16720.06950.2129
4Vp-5, maxHBint3, GraphFP409, KRFP607, KRFPC3970.85960.82520.808126.50030.24020.36410.06690.2129
5AlogP, VP-5, minHBint3, KRFP4930.78190.74390.718220.61060.29080.30850.04060.2635
Considering all the models generated, model 1 has the highest fitness score. However, upon inspecting its external validation parameters RMSEext = 0.9003 and R2ext = 0.3245, the values did not meet the cutoff and hence show poor predictability. For a good prediction, RMSEext should not vary much from RMSEtr (difference should be less) and R2ext should be greater than 0.6. Model 2 was selected for further studies since it was the best fit model with next highest R2 and Q2loo, which represents the fitness, robustness of the model, and the proximity of actual and predicted binding energy values. The model was built by incorporating the compounds 2, 5, 11, 21, 26, 28, and 30 in the test set and all others in the training set. Thus, the binding energy, the response value (y), can be expressed in terms of the descriptors considered for the generation of model 2 in the form of a linear relationship y = Σmixi + c where mi is the coefficient obtained for each descriptor in MLR model and xi is the value for the selected descriptor. The internal and external validation parameters determine the feasibility and predictivity of the model[29,32] (Table S4). The linear equation correlating binding energy and descriptors generated by model 2 can be written as shown in eq :The root–mean–square error for cross-validation RMSEcv = 0.2296 is greater than RMSEtr = 0.2008, making the model stable and predictive. The criteria for Y scrambling R2 scr < 0.2, Q2 scr < 0.2, and R2 scr > Q2 scr implies that there is no correlation between descriptors, and hence, the model is not simply correlated by chance. Model 2 was assessed with external validation parameters R2ext > 0.6 and small difference between RMSEext and RMSEtr and was hence acceptable due to its high predictivity. The predicted values of binding energy for both training and test set of molecules are given in Table S5. The plot of observed versus predicted binding energy of training and test set molecules is given in Figure . Substitution of −Cl with −NH2 in quinoline scaffold in compound 30 led to inaccurate prediction and hence was detected as an outlier.
Figure 4

Observed versus predicted binding energies calculated by Model 2 for training and test set.

Observed versus predicted binding energies calculated by Model 2 for training and test set. Model 2 shows that the binding energy is a linear combination of the descriptors SCH-7, nHsNH2, minHBint6, FP402, and KRFP402. SCH-7 is simple seventh order chi chain that is a topological descriptor based on interatomic distances calculated by the bonds between them representing molecular connectivity as a chemical graph like a bond-line formula of chemical structures. It considers the specificity of the structures at a fragment level rather than the whole molecule.[53] The order 7 represents the number of edges in the graph, which indicates the branching.[54,55] The negative value of its coefficient indicates its negative impact on biological activity. Thus, a low value for SCH-7 is required for a compound to show low binding energy, which makes a restriction on branching. nHsNH2 represents the number of NH2 groups and amine hydrogens.[56] The positive coefficient of 1.0801 indicates that as the number of NH2 groups increases, the binding energy decreases, which is in perfect agreement with the docking results calculated. minHBint6 is the minimum E-state descriptor of strength for potential hydrogen bonds of path length 6.[57] It is an atom-type electro-topological state descriptor in which both electronic and topological functions are considered.[53] There is a positive dependence of minHBint6 with the binding energy because as the chances of H-bond increases, the binding energy will be more negative. FP402 and KRFPC476 are fingerprint descriptors, which account for structure, specific properties, connectivity, or pharmacophores.[58] KRFPC476 accounts for the count of substructures also.[56] Both of these fingerprints are inversely related to binding energy. Designing molecules in such a way that having high values for nHsNH2 and minHBint6 and low values for SCH-7, FP402, and KRFPC476 will help to reduce the binding energy considerably.

Binding Energy Prediction and Molecular Docking Analysis of Primaquine Analogs

Compound 29, primaquine, shows the least binding energy among the CQ analogs. It is the only antimalarial drug that can interfere with a majority of the life cycle stages of the malarial protozoan species.[59,60] The combined usage of chloroquine and primaquine may mask chloroquine resistance.[61] However, its toxic effect is mainly due to the hemolytic lesions caused by methemoglobin production.[59,60] It is reported that introduction of a tert-butyl group at C2 position (R1 in scaffold given in Table ) will increase the efficiency and is completely drained of from methemoglobin production as in compound pq1.[59,60]
Table 4

Actual and Predicted Binding Energies of Primaquine Analogs and Their Interactions with Active Site of 6LU7 Protein

      interactions
compoundR1R2R3predicted binding energyactual binding energyhydrogen bondhydrophobic and other interactions
pq1–C(CH3)3HH–7.2615–7.39Phe140, Glu166His41, Met165, Glu166, Leu167
pq2–CH(CH3)2HH–7.3131–7.70Phe140, Gly143, Glu166Cys145, Met165
pq3–OCH3HH–7.8998–7.56Phe140, Gly143, Ser144, Cys145, Glu166His163
pq4–C(CH3)3–CH(CH3)2H–7.1488–7.74Phe140, Glu166His41, Met165, Glu166
pq5–CH(CH3)2–CH(CH3)2H–7.422–7.04Glu166, Leu167His41, Leu141, Met165, Glu166, Leu167
pq6HHCH3–7.2005–7.35Asn142, Gly143, Glu166Cys145
pq7HH–CH(CH3)2–6.0402–7.33Phe140, Gly143, His163, Glu166Cys145, Met165
pq8HH–CH2CH2NH2–7.1204–8.58Phe140, Glu166, His172Cys145
pq9–CH(CH3)2H–CH2CH2NH2–8.2004–8.48Phe140, Glu166, His172His41, Met49, Cys145, Met165
pq10HH–CH2NH2–8.3932–8.32Phe140, Gly143, His163, Glu166Cys145
pq11–CH(CH3)2H–CH2NH2–7.3889–7.98Glu166, Leu167Cys145, His163, His164, Met165, His164

In Vitro Screening of Antiviral Compounds

By using the formulated regression Model 2, we predicted the binding energy of some primaquine analogs obtained from the literature and PubChem database and then carried out their molecular docking studies on 3CLpro target to check the inhibitory potency of the ligands, given in Table . Also screened, 10,584 Asinex compounds having antiviral properties using the regression model 2. It was observed that the predicted binding energy of some compounds obtained from the Asinex database differed largely from its actual value, which was obtained from docking analysis. This may be due to their difference in scaffold from that of the compounds used to build the regression model. Even though the predicted binding energy was less accurate, they have furnished better binding energies on docking with the target protein implying the reliability of the model. Thirty molecules that showed lower binding energies were subjected to molecular docking analysis to identify the potent 3CLpro inhibitors (Supplementary Figure S5). Binding energies obtained through both the methods and their interactions with the receptor protein are given in Table .
Table 5

Binding Energies of Molecules Available in Asinex Compound Library Obtained through Docking Analysis and Predicted by the Model and Their Interactions with Active Site of 6LU7 Protein

   interactions
compoundpredicted binding energyactual binding energyhydrogen bondhydrophobic and other interactions
A1–23.9789–7.14 His41, Cys145, Met165, Glu166
A2–18.4293–5.91Glu166His41, His163, Glu166
A3–14.1327–7.15 Leu27, Cys145, Met165
A4–13.5978–7.48Glu166Leu27, Gly143, Cys145, Glu166
A5–12.7448–7.81Ser144, Cys145, Glu166His41, Ser144, His164, Cys145, Met165, Glu166, Pro168
A6–12.1664–7.60Leu141, Cys145Thr26, Gly143, Cys145, Met165, Glu166
A7–11.5557–7.72Gly143, Ser144, Cys145Phe140, Asn142, Cys145, His163, Met165, Glu166, His172
A8–11.5058–6.33Gly143Leu27, Lys137, Leu141, Asn142, Cys145, Glu166, His172
A9–11.4991–7.17 His41, Phe140, gly143, Cys145,
A10–11.4704–7.05Phe140, Gly143, Glu166His41, Cys145, Lys137, Glu166
A11–11.2018–7.66Cys145, Glu166His41, Leu141, Asn142, Cys145, Met165, Glu166, Pro168
A12–11.1328–8.18Gly143, Cys145, Glu166Leu27, His41, Cys145, Met165, Glu166, Pro168
A13–10.9315–7.72Ser144, Cys145Leu27, Cys145, Glu166, Pro168
A14–10.8543–7.22Cys145, Glu166Leu27, Cys145, His163
A15–10.7756–6.98Cys145Leu27, His41, Asn142, Gly143, Cys145, Met165, Gln189
A16–10.602–6.51Phe140, Gly143, Glu166His41, Lys137, Asn142, Gly143, Cys145, Glu166
A17–10.3794–5.97 Thr26, Leu27, His41, His163, Cys145
A18–10.3451–7.51Thr26Thr25, Leu27, Cys145, Met165, Glu166, Pro168
A19–10.1495–7.57Cys145, Glu166Thr26, Cys145, Met165, Leu167, Pro168
A20–10.1483–7.65Gly143, Glu166Leu141, Asn142, Ser144 Cys145, Glu166, Leu167, Pro168
A21–10.1232–6.89Ser144, His163, Glu166Cys145
A22–10.0867–6.77Phe140Leu27, His41, Cys145, Met165, Glu166
A23–9.34084–6.93Cys145Leu27, His41, Gly143, Cys145, Glu166
A24–9.21909–7.12Glu166His41, Leu141, His163, Met165, Glu166, His172
A25–9.09923–7.85Ser144, Cys145Asn142, Pro168
A26–8.94774–7.17Thr26, Ser144, Cys145Cys145, Glu166
A27–8.88306–7.08Gly143, Glu166Leu27, His41, Asn142, Gly143, Cys145, His163, Glu166
A28–8.51347–6.23Glu166Cys145, His163, Met165, Glu166, Leu167, Pro168
A29–8.06925–7.22 Val104, Ile106, Thr111, Phe112, Met130, Ile136, Cys160, Tyr182
A30–8.05939–7.45Ser144, Cys145, Glu166Cys145, His163, Met165
All the primaquine analogs showed lower binding energies than CQ and HCQ. All the compounds except pq5 furnished binding energies lower than that of primaquine. The binding energies calculated using the regression Model 2 are in good agreement with the value obtained from docking analysis. All the compounds exhibited interactions with some of the active site residues including the catalytic diad residues either Cys145 or His41 or both, which make them potent 3CLpro inhibitors. The compounds pq8, pq9, and pq10 exhibited lower binding energies of −8.58, −8.48, and −8.32 kcal/mol. The 2D interaction diagrams of these ligands with the protein are given in Figure . In accordance with the findings based on docking studies, the regression analysis of CQ analogs reveals the presence of one extra −NH2 bond in their structure.
Figure 5

2D interaction diagrams of pq8, pq9, pq10, and A12 with target 6LU7.

2D interaction diagrams of pq8, pq9, pq10, and A12 with target 6LU7. Inspecting the 2D interaction diagrams of compounds A6, A9, A10, A15, A16, A23, A29, A30 it can be seen that there are unfavorable donor–donor or acceptor-acceptor interactions present between the compounds and target protein which may negatively affect the stability of the docked complexes (Supplementary Figure S6). Hence these can be considered as poor inhibitors of 3CLpro protein even though they have binding energies less than HCQ. From these results it is clear that binding energy as well as interactions play decisive roles in determining the druggability of compounds. Asinex database compounds A12, A5, A13, A11, A20, A19, A18, A4, A14, and A3 obtained lower binding energies among all (Figure and Figure S7). These compounds form interactions with Cys145, His 41, or both along with other active site residues, which are responsible for their inhibitory property. The compounds A12, A5, A13, A11, A19, and A14 form hydrogen bonding interactions with Cys145, which will obstruct the initial deprotonation step in the catalytic cycle of 3CLpro as shown in Figure . The π–π interaction of His 41 with π systems of the compounds A12 and A5 will affect the proton accepting ability of His41. The π–sulfur interactions are present in aromatic rings with −SH group of the compounds A13, A20, A18, A4, A14, and A3 with Cys145 residue. It involves the interaction with the aromatic π orbitals and σ* acceptor orbital of SH bond of Cys145. S–H/π interactions are the important contributors to binding energy.[62] The thiol group of Cys145 is engaged in S–H/π with the aromatic rings of the inhibitors and is barely available for catalyzing the proteolysis. The combined effect of all the interactions with the target protein makes these molecules potent inhibitors of the main protease enzyme of SARS-CoV-2 virus. Comparing the structures of the hit compounds pq8, pq9, pq10, and A12 with the rest, it was clear that the high flexibility of the structures owing to more single bonds and less compact rings makes them able to undergo conformational changes at the active site of the target leading to more interactions with Cys145 and His41 along with other active site residues. Hence, these four compounds were selected for further analysis.

Molecular Dynamics Simulation

Even though docking studies give the best binding poses, they do not account for the conformational changes taking place upon ligand binding as the docking analysis preferably treats protein as rigid. The conformational changes affecting the binding affinity could be analyzed using molecular dynamic simulations.[63] Four top ranked compounds via docking (pq8, pq9, pq10, and A12) were selected for 20 ns molecular dynamics simulation studies to understand the conformational changes in the protein on binding of the ligand. RMSD, RMSF, and Radius of gyration plots were used to evaluate the stability of the complexes. The binding conformations of the compounds at the last frame of simulation are given in Figure .
Figure 6

Final poses (20 ns) of ligand protein-complexes in the MD simulations.

Final poses (20 ns) of ligand protein-complexes in the MD simulations. The root–mean–square deviation (RMSD) plot of apoprotein 3CLpro, pq8, pq9, pq10, and A12 with average RMSDs 1.75, 1.65, 1.93, 2.02, and 1.86 Å, respectively, is given in Figure a. It can be inferred that for the apo structure as well as the docked complexes except pq10, RMSD values rose to 2.54 Å and then formed a plateau after 12 ns of simulation, which indicates that the system has reached stable equilibrium state and has been confined in the active site. The RMSD of pq10 requires an extension of stabilization time to reach equilibrium. The higher RMSD fluctuations for the holo structures compared to the apo structure indicate larger conformational changes followed by repositioning of the ligand in the active site.
Figure 7

(a) RMSD. (b) RMSF. (c) Radius of gyration plots for pq8, pq9, pq10, and A12.

(a) RMSD. (b) RMSF. (c) Radius of gyration plots for pq8, pq9, pq10, and A12. The root–mean–square fluctuation (RMSF) plot gives the fluctuations of individual residues in the protein backbone.[64] Higher RMSF denotes higher flexibility and vice versa.[65] The RMSF plot (Figure b) showed greater residue flexibility for pq9 and pq10 compared to the unbound form. pq8 and A12 have comparable RMSF values with that of the apo form. The Cys145 and His41 residues involved in the catalytic cycle have fluctuations with greater than 0.7 Å in apoprotein. In pq8 and pq9, it fluctuates more than 3 Å, while in pq10 and A12, there is fluctuation less than 0.6 Å. In all the systems, the active site residues from 140 to 190 have the highest fluctuations. The structural flexibility or compactness of the protein molecule is analyzed by the radius of gyration.[66] The protein is supposed to maintain a relatively steady value for Rg. From inspection of Figure c, it can be seen that Rg has almost no significant variance with the average Rg values of 22 Å. The ligand bound form has lower Rg values compared to apoprotein, which indicates that the bound structures are more compact.

Conclusion

Molecular docking analysis of chloroquine and hydroxychloroquine on main protease (3CLpro) protein has shown that HCQ is more efficient than CQ. More potent inhibitors of 3CLpro have been identified through docking studies. The interactions with the active site residues, especially with Cys145 and His41, which are the key residues involved in catalytic diad for proteolysis in the enzyme, are responsible for the activity of these compounds. A regression model with R2 = 0.9039 and Q2 = 0.8848 was developed correlating binding energy with the molecular descriptors. This model was used to predict the binding energies of novel molecules. The primaquine derivatives and antiviral compounds obtained from Asinex compound library were screened using the regression model, and the hit compounds thus obtained were further docked with 3CLpro protein. They showed less binding energies and proper interactions with the target, main protease (3CLpro). Molecular dynamics studies on these top four compounds pq8, pq9, pq10, and A12 revealed their binding stability and conformational changes associated with the protein–ligand complexes. Hence, these four compounds are proposed for validation and further studies as more efficient medication for COVID-19.
  52 in total

1.  CHARMM-GUI: a web-based graphical user interface for CHARMM.

Authors:  Sunhwan Jo; Taehoon Kim; Vidyashankara G Iyer; Wonpil Im
Journal:  J Comput Chem       Date:  2008-08       Impact factor: 3.376

2.  SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling.

Authors:  N Guex; M C Peitsch
Journal:  Electrophoresis       Date:  1997-12       Impact factor: 3.535

3.  VMD: visual molecular dynamics.

Authors:  W Humphrey; A Dalke; K Schulten
Journal:  J Mol Graph       Date:  1996-02

4.  AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility.

Authors:  Garrett M Morris; Ruth Huey; William Lindstrom; Michel F Sanner; Richard K Belew; David S Goodsell; Arthur J Olson
Journal:  J Comput Chem       Date:  2009-12       Impact factor: 3.376

5.  Distinct Roles of Catalytic Cysteine and Histidine in the Protease and Ligase Mechanisms of Human Legumain As Revealed by DFT-Based QM/MM Simulations.

Authors:  Brigitta Elsässer; Florian B Zauner; Johann Messner; Wai Tuck Soh; Elfriede Dall; Hans Brandstetter
Journal:  ACS Catal       Date:  2017-07-14       Impact factor: 13.084

6.  Avogadro: an advanced semantic chemical editor, visualization, and analysis platform.

Authors:  Marcus D Hanwell; Donald E Curtis; David C Lonie; Tim Vandermeersch; Eva Zurek; Geoffrey R Hutchison
Journal:  J Cheminform       Date:  2012-08-13       Impact factor: 5.514

7.  New insights on the antiviral effects of chloroquine against coronavirus: what to expect for COVID-19?

Authors:  Christian A Devaux; Jean-Marc Rolain; Philippe Colson; Didier Raoult
Journal:  Int J Antimicrob Agents       Date:  2020-03-12       Impact factor: 5.283

8.  Review of the 2019 novel coronavirus (SARS-CoV-2) based on current evidence.

Authors:  Lisheng Wang; Yiru Wang; Dawei Ye; Qingquan Liu
Journal:  Int J Antimicrob Agents       Date:  2020-03-19       Impact factor: 5.283

9.  SARS coronavirus entry into host cells through a novel clathrin- and caveolae-independent endocytic pathway.

Authors:  Hongliang Wang; Peng Yang; Kangtai Liu; Feng Guo; Yanli Zhang; Gongyi Zhang; Chengyu Jiang
Journal:  Cell Res       Date:  2008-02       Impact factor: 25.617

10.  Chloroquine and hydroxychloroquine in the treatment of COVID-19 with or without diabetes: A systematic search and a narrative review with a special reference to India and other developing countries.

Authors:  Awadhesh Kumar Singh; Akriti Singh; Altamash Shaikh; Ritu Singh; Anoop Misra
Journal:  Diabetes Metab Syndr       Date:  2020-03-26
View more
  11 in total

Review 1.  Methodology-Centered Review of Molecular Modeling, Simulation, and Prediction of SARS-CoV-2.

Authors:  Kaifu Gao; Rui Wang; Jiahui Chen; Limei Cheng; Jaclyn Frishcosy; Yuta Huzumi; Yuchi Qiu; Tom Schluckbier; Xiaoqi Wei; Guo-Wei Wei
Journal:  Chem Rev       Date:  2022-05-20       Impact factor: 72.087

Review 2.  The SARS-CoV-2 main protease (Mpro): Structure, function, and emerging therapies for COVID-19.

Authors:  Qing Hu; Yuan Xiong; Guang-Hao Zhu; Ya-Ni Zhang; Yi-Wen Zhang; Ping Huang; Guang-Bo Ge
Journal:  MedComm (2020)       Date:  2022-07-14

3.  29th Annual GP2A Medicinal Chemistry Conference.

Authors:  Jean-Jacques Helesbeux; Laura Carro; Florence O McCarthy; Vânia M Moreira; Francesca Giuntini; Niamh O'Boyle; Susan E Matthews; Gülşah Bayraktar; Samuel Bertrand; Christophe Rochais; Pascal Marchand
Journal:  Pharmaceuticals (Basel)       Date:  2021-12-07

Review 4.  Advances in Pathogenesis, Progression, Potential Targets and Targeted Therapeutic Strategies in SARS-CoV-2-Induced COVID-19.

Authors:  Hong Zhou; Wei-Jian Ni; Wei Huang; Zhen Wang; Ming Cai; Yan-Cai Sun
Journal:  Front Immunol       Date:  2022-04-05       Impact factor: 8.786

5.  Crystal structure of SARS-CoV 3C-like protease with baicalein.

Authors:  Jingwen Feng; Dongyang Li; Jin Zhang; Xiushan Yin; Jian Li
Journal:  Biochem Biophys Res Commun       Date:  2022-04-22       Impact factor: 3.322

6.  In silico identification of SARS-CoV-2 spike (S) protein-ACE2 complex inhibitors from eight Tecoma species and cultivars analyzed by LC-MS.

Authors:  Seham S El Hawary; Amira R Khattab; Hanan S Marzouk; Amira S El Senousy; Mariam G A Alex; Omar M Aly; Mohamed Teleb; Usama Ramadan Abdelmohsen
Journal:  RSC Adv       Date:  2020-11-26       Impact factor: 4.036

Review 7.  A proposed insight into the anti-viral potential of metallic nanoparticles against novel coronavirus disease-19 (COVID-19).

Authors:  Ghadha Ibrahim Fouad
Journal:  Bull Natl Res Cent       Date:  2021-02-05

8.  Understanding the role of water on temperature-dependent structural modifications of SARS CoV-2 main protease binding sites.

Authors:  Pushyaraga P Venugopal; Omkar Singh; Debashree Chakraborty
Journal:  J Mol Liq       Date:  2022-07-20       Impact factor: 6.633

9.  Proteomics and Its Application in Pandemic Diseases.

Authors:  Suman S Thakur
Journal:  J Proteome Res       Date:  2020-11-06       Impact factor: 4.466

Review 10.  Molecular Basis of SARS-CoV-2 Infection and Rational Design of Potential Antiviral Agents: Modeling and Simulation Approaches.

Authors:  Antonio Francés-Monerris; Cécilia Hognon; Tom Miclot; Cristina García-Iriepa; Isabel Iriepa; Alessio Terenzi; Stéphanie Grandemange; Giampaolo Barone; Marco Marazzi; Antonio Monari
Journal:  J Proteome Res       Date:  2020-10-29       Impact factor: 4.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.