Literature DB >> 35036042

Impact of Warhead Modulations on the Covalent Inhibition of SARS-CoV-2 M^pro Explored by QM/MM Simulations.

Sergio Martí¹, Kemel Arafet^1,2, Alessio Lodola², Adrian J Mulholland³, Katarzyna Świderek¹, Vicent Moliner¹.

Abstract

The COVID-19 pandemic, caused by the severe acute respiratory syndrome coronavirus-2, SARS-CoV-2, shows the need for effective antiviral treatments. Here, we present a simulation study of the inhibition of the SARS-CoV-2 main protease (Mpro), a cysteine hydrolase essential for the life cycle of the virus. The free energy landscape for the mechanism of the inhibition process is explored by QM/MM umbrella sampling and free energy perturbation simulations at the M06-2X/MM level of theory for two proposed peptidyl covalent inhibitors that share the same recognition motif but feature distinct cysteine-targeting warheads. Regardless of the intrinsic reactivity of the modeled inhibitors, namely a Michael acceptor and a hydroxymethyl ketone activated carbonyl, our results confirm that the inhibitory process takes place by means of a two-step mechanism, in which the formation of an ion pair C145/H41 dyad precedes the protein-inhibitor covalent bond formation. The nature of this second step is strongly dependent on the functional groups in the warhead: while the nucleophilic attack of the C145 sulfur atom on the Cα of the double bond of the Michael acceptor takes place concertedly with the proton transfer from H41 to Cβ, in the compound with an activated carbonyl, the sulfur attacks the carbonyl carbon concomitant with a proton transfer from H41 to the carbonyl oxygen via the hydroxyl group. An analysis of the free energy profiles, structures along the reaction path, and interactions between the inhibitors and the different pockets of the active site on the protein shows a measurable effect of the warhead on the kinetics and thermodynamics of the process. These results and QM/MM methods can be used as a guide to select warheads to design efficient irreversible and reversible inhibitors of SARS-CoV-2 Mpro.

Entities: Chemical

Year: 2021 PMID： 35036042 PMCID： PMC8751016 DOI： 10.1021/acscatal.1c04661

Source DB: PubMed Journal: ACS Catal Impact factor: 13.084

Introduction

Despite the development of efficient vaccines, the effect of the COVID-19 pandemic around the world, produced by the severe acute respiratory syndrome coronavirus-2—SARS-CoV-2—has emphasized the need for antiviral treatments. Moreover, considering the capabilities of the virus to mutate, like any other viruses that contain RNA genetic material such as this or the influenza viruses, the corresponding risk of a decrease in the effectiveness of the vaccines spurs the need for complementary strategies to fight against the pandemic. Many efforts have focused on understanding the life cycle of SARS-CoV-2, to provide information about possible ways of developing drugs.[1−3] Among the proteins involved in the replication of the virus, the main coronavirus protease (SARS-CoV-2 Mpro) is a most attractive target due to its intrinsic features, including its distinguishing ability to cleave proteins after the glutamine residue,[4] catalytic features which make Mpro unique with respect to human proteases. The most effective Mpro inhibitors identified so far, including the clinical candidates PF-00835231, incorporate a glutamine residue or a bioisostere at P1 position (see below) to give potency and selectivity and a peptidomimetic scaffold of moderate size endowed with branched hydrophobic substituents at both the P2 and P3 positions.[5−7] These compounds act by a covalent mechanism, and so a reactive “warhead” is required: i.e., an electrophilic group to form a covalent bond between the active site cysteine residue (C145), previously activated by a histidine residue (H41), and the inhibitor. Warheads that have been traditionally used in cysteine proteases range from classical Michael acceptors (MAs) to activated carbonyl derivatives, including α-ketoamides, aldehydes, and hydroxymethyl ketone (HMK).[4,6,8] Nevertheless, previous investigations on Mpro inhibition by those prototypical warheads did not assess their relative reactivity versus C145 or explore the contribution played by the recognition part to the overall inhibition process. Thus, there is an urgent need to understand the effects of warheads on the reactivity as well as on the chemical stability of the covalent adduct generated. This second aspect is fundamental in the context of drug development, as it affects the (ir)reversibility of inhibition. Irreversible inhibition through a covalent mechanism is generally the most effective strategy to obtain a sustained response in vivo, since release from inhibition requires the resynthesis of the engaged target. However, it can be challenging to identify covalent compounds that selectively react with the specific residue without causing irreversible labeling of other (host) targets that may lead to liver toxicity and/or immune responses. These concerns about off-target modifications are motivating the search for potent covalent and reversible agents. It is also important to be able to explore a range of different pharmacokinetic behaviors, and in particular, there is a need to predict the degree of reversibility of inhibition. It is therefore critical to develop computational protocols able to predict the kinetic behavior of a covalent inhibitor. A wide range of different computational methods has been used since the emergence of COVID-19 for the discovery of small-molecule therapeutics.[7,9] With regard to the inhibition of Mpro, modeling based on classical force fields can contribute to the discovery and optimization of noncovalent inhibitors,[10−12] while the use of methods based on quantum mechanics/molecular mechanics (QM/MM) potentials can assist in the design of covalent inhibitors. We recently studied the mechanism of the covalent inhibition of Mpro by the peptidyl-MA compound N3 designed by Jin and colleagues[6] and by two designed MA compounds (B1 and B2) by QM/MM molecular dynamics (MD) methods.[13] Our results indicated that both designed compounds may be promising candidates as drug leads against COVID-19. Interestingly, according to the computed thermodynamic properties of the full inhibition process, B1 could behave as an irreversible inhibitor while B2 may be a reversible inhibitor. As was previously proposed from structures from X-ray diffraction studies[3] and later supported by computational studies using several different approaches, the chemical reaction step of the Mpro inactivation involves the activation of the SH group of C145 by the imidazole group of H41. Then, the reactive nucleophilic thiolate formed (CysS–) would attack the inhibitor, creating an inhibitor–protein covalent bond.[13−18] According to the recent literature, the equilibrium between the neutral dyad and the CysS–/HisH+ ion pair appears to be tipped in favor of the neutral pair by ligand binding, but this may depend on the electrostatic properties of the ligand itself. Thus, while some simulation studies report a neutral dyad significantly more stable than the ion pair (by ca. 11 kcal mol–1) or identifying the ion pair as an unstable state,[18] others suggest that the ion pair is not so destabilized with respect to the initial neutral dyad[13,14] or is even slightly more stable than the neutral dyad (e.g., our previous study with B1).[13] Our previous study on the proteolysis reaction of SARS-CoV-2 Mpro, using a polypeptide with a fluorescent tag, concluded that this enzyme differs somewhat from other cysteine proteases.[15] Thus, for instance, the initial enzyme:substrate complex would not be the ion pair dyad C145–/H41+ (E:I) but rather the neutral C145/H41 dyad (E:I). This result is in agreement with studies carried out by us, and others, using different inhibitors and substrates (except for B1 mentioned above),[13,16,18] but in contrast with the protonation state of the catalytic dyad suggested from the ligand-free SARS-CoV-2 Mpro recently solved by neutron crystallography at pH 6.6.[19] Nevertheless, as has already been pointed out, questions remain about how pH or the presence of an inhibitor or substrate influence the protonation state of the dyad in SARS-CoV-2 Mpro.[16] Moreover, while the formation of the ion pair dyad C145–/H41+ (E:I) and the nucleophilic attack of sulfur atom of C145 to the carbonyl carbon atom of the peptide take place concertedly in the proteolysis reaction catalyzed by Mpro,[15] in the inhibition reaction by N3, our designed B1 and B2 MA compounds,[13] or the simulation with Mpro–substrate peptide models,[16] the formation of the ion pair dyad and the sulfur–carbon covalent bond formation appear to proceed in a stepwise manner. In any event, the rate-limiting step of the process, in all three studied inhibitors, was the enzyme–inhibitor covalent bond formation, with activation free energies ranging from 11.8 to 9.8 kcal mol–1.[13] An analysis of the QM/MM interaction energies between the substrate (the peptide in the proteolysis reaction or the inhibitor in the case of the inhibitors) and the different binding pockets of Mpro and the peptide (in the study of the proteolysis reaction) or the inhibitor (in the case of the inhibitors) confirms that they are dominated by those in the P1:::S1 site. Thus, our previous results indicate that a low-barrier C145 covalent modification depends on either the warhead or the recognition portion. The recognition portion dictates how the inhibitor is accommodated in the active site, which in turn affects the subsequent chemical reaction step. Consequently, the design of an efficient inhibitor must take into account the reactivity of the warhead and the favorable interactions between the recognition portion and the active site of the enzyme.[20] In all, the experience accumulated on the basis of the results derived from previous studies on this and other cysteine proteases can be used to guide the design of new compounds, and QM/MM simulations can be considered a useful tool to get a detailed description of the chemical steps of the inhibition of protein target covalent inhibitors. Moreover, the obtained activation free energies and reaction energies obtained with these high-level methods can confirm, or not, the viability of the proposed inhibitors. Here, we propose and investigate the inhibition of the SARS-CoV-2 Mpro by two potential covalent (peptidyl) inhibitors endowed with two chemically diverse warheads. Building upon the findings on information derived from our previous studies on the proteolysis of Mpro [15] and on the reaction of the inhibition with several peptidyl irreversible inhibitors,[13] we propose the two compounds B3 and B4 (Scheme ). A methyl oxo-enoate was used in B3, inspired by the dimethyl fumarate structure,[21,22] while a hydroxymethyl ketone (HMK) was used as a warhead in the B4 compound. This reactive group is also present in the structure of PF-00835231 Mpro inhibitor, now in a clinical trial.[5] The recognition part possessed by both B3 and B4 compounds was selected on the basis of QM/MM results obtained with the previously proposed inhibitors B1 and B2 and from an analysis of protein–substrate interactions from QM/MM simulations of the proteolysis reaction.[15] We keep the recognition part as a short peptide-like compound, combining the P1 moiety of B1 and the P2 and P3 moieties of B2. This is a robust and systematic way of deciphering the effect of specific changes in the intrinsic chemical reactivity of the proposed inhibitors on the mechanism (and energetics) of inhibition. As shown in our previous studies, stabilizing ligand:protein interactions were established when this relatively small recognition part was used.[13] Thus, S2 is a small hydrophobic pocket without strong hydrogen-bond interactions with P2. Therefore, an isobutyl group was kept at the P2 site as in B2. The S3 subsite of Mpro is completely exposed to the solvent, and then we keep the shorter P3 of B2. Also, previous studies[6] suggested that different kinds of substituents can be used in P3. It must be kept in mind that, despite the fact that the PF-00835231 Mpro inhibitor shows an interaction between the indole group (removed in our new compounds) and E166, our previous study revealed favorable interactions between P3 and M165 and Q189.[13] Finally, the glutamine present in P1 of B1 was employed in these new compounds due to the favorable interactions observed in our previous study of the proteolysis.[15]

Scheme 1

Chemical Structures of the Proposed (B3 and B4) Inhibitors of SARS-CoV-2 Mpro

Chemical Structures of the Proposed (B3 and B4) Inhibitors of SARS-CoV-2 Mpro

The warheads, P1′, are highlighted in red, while P1–P3 fragments are shown in blue, green, and black, respectively. The subpockets of the active site are labeled with S numbering complementary to fragments of the inhibitor. Asterisks indicate the main reactive centers of the inhibitors. From a mechanistic point of view, the two proposed compounds could potentially react in different manners in the active site of the enzyme, also because their key electrophilic centers not only possess a different chemical environment but also are not topologically equivalent. Thus, as shown in Scheme , after the formation of the ion pair E:I reactant complex with B3, the attack of the sulfur of C145 to the β-carbon of the substrate can take place, followed by a proton transfer from the protonated H41 to the α-carbon, leading to the stable covalent product E-I. Nevertheless, on consideration of the nature of the warhead in B3, the final proton transfer could also take place to the carbonyl oxygen atom (E-I). In the case of the inhibition with B4, this dual possibility of the final proton transfer does not appear after the acylation of the enzyme because the proton from H41 can only be transferred to the carbonyl oxygen atom of the inhibitor (Scheme ).

Scheme 2

Proposed Mechanism of SARS-CoV-2 Mpro Cysteine Protease Inhibition by B3

R1 and R2 are the different substituents, as shown in Scheme .

Scheme 3

Proposed Mechanism of SARS-CoV-2 Mpro Cysteine Protease Inhibition by B4

R1 and R2 are different substituents, as shown in Scheme .

Proposed Mechanism of SARS-CoV-2 Mpro Cysteine Protease Inhibition by B3

R1 and R2 are the different substituents, as shown in Scheme .

Proposed Mechanism of SARS-CoV-2 Mpro Cysteine Protease Inhibition by B4

R1 and R2 are different substituents, as shown in Scheme . The present study is a computational study of the mechanism of inhibition of Mpro by B3 and B4. The reaction mechanisms for each inhibitor were initially explored by nudged elastic band calculations of the minimum energy paths. Then, two free energy based methodologies, such as the umbrella sampling (US) and free energy perturbation (FEP) methods, both at the density functional theory level combined with classical force fields, were employed to explore the full inhibition process.

Computational Methods

The coordinates for the starting point were taken from the X-ray structure of the SARS-CoV-2 Mpro with the PF-00835231 inhibitor in its active site (PDB ID 6XHM).[5] The PF-00835231 inhibitor was then manually modified, leading to the two new enzyme–inhibitor models. The missing force field parameters for each model were generated using the Antechamber program,[23] available in the AmberTools package (see Tables S1 and S2). The protonation states of the titratable amino acids were determined using the empirical program PropKa ver. 3.0.3,[24] while the histidine residues were assigned by a detailed visual inspection (see a list of all the pKa values in Table S3). Each model was neutralized by adding eight sodium counterions that were placed in a box of 92.154 × 102.242 × 97.285 Å3 of TIP3P[25] water molecules. The next step for each model consisted of 105 steps of conjugate-gradient minimization, followed by a series of molecular dynamics (MD) simulations in the NVT ensemble with the AMBER ff03 force field,[26] as implemented in NAMD software.[27] Initially the temperature was raised from 0 to 310 K with a short MD, using a constant increment of 3.1 K/ps. Then 100 ps of MD was performed at 310 K, applying a constant scaling. Finally, an equilibration was carried out by means of 10 ns of NVT MD, using the Langevin thermostat.[28] All simulations made use of the PME algorithm for the electrostatic interactions with a force-switch scheme ranging from 14.5 to 16 Å and a time step of 1 fs. The time evolution of the root-mean-square deviations of the backbone atoms of the protein models, using the cpptraj facility,[29] confirmed that the two models became equilibrated (see Figures S1a and S2a). An additive hybrid QM/MM scheme was selected in the present study for constructing the total Hamiltonian. The QM subset of atoms includes the P1′, P1, and P2 positions of the inhibitor, together with the C145 and H41 residues of the protein. Four link atoms were inserted where the QM/MM boundary intersected covalent bonds in the positions indicated on Figures S1b and S2b. Thus, the QM part consisted of 89 atoms for the inhibitor B3 and 80 for B4. All the calculations were performed with the QMCube suite,[30] in which the combination of the NAMD and Gaussian09[31] programs was used to construct the potential energy function. The AMBER ff03[26] and the TIP3P[25] force fields were selected to describe the MM atoms, and the Minnesota functional M06-2X[32] with the split-valence 6-31+G(d,p) basis set was used to treat the QM subset of atoms. This functional has been tested and shown to be suitable for modeling this type of reactivity.[13,15,16,33,34] The position of any atom over 25 Å from the substrate was fixed to speed up the calculations. Reaction mechanisms for each inhibitor were initially explored using the nudged elastic band (NEB)[35] approach to set up plausible starting geometries for the transition structures. Then, they were localized and characterized by a micromacro[36,37] Hessian-based localization scheme, and minimum energy paths (MEP) were traced toward the corresponding minima. The information obtained inat this stage was used in the fine-tuning of the free energy methodologies, specifically the potential of mean force (PMF) and free energy perturbation (FEP) methods. We applied these two distinct approaches separately to calculate the free energy profiles for the reaction. It is important to point out that herein we directly compute the free energy landscape at a DFT/MM level higher than that in our previous QM/MM studies on the inhibition of Mpro. Thus, the present calculations apply a significantly higher level of theory in sampling, as detailed below. In the case of the FEP method, which has been successfully employed in our laboratory for reactivity studies in various biological systems,[38−41] the reaction path obtained for each of the mechanisms was analyzed to extract those consecutive geometries with an energy difference greater than or equal to 1.5 kJ mol–1 (called “windows”). Then, pure MM MD simulations were performed on each of these windows, keeping frozen the atoms of the QM part. Each MD run was performed in the NVT ensemble with a time step of 1 fs and a total time length of 20 ps. The partial charges of the QM atoms were recalculated every 200 steps using the CHELPG[42] methodology, because the MM region is changing during the sampling at every window and can polarize the QM wave function and consequently propagate to the MM engine (NAMD). The application of this protocol resulted in a total number of 100 structures per window (called “points”) with the same coordinates for the QM atoms, but with a different MM environment configuration. In the following step, the coordinates of the QM atoms for each point of the ith window were replaced by those of the consecutive window (i + 1), and both the perturbed QM and Lennard–Jones energy terms were evaluated. The free energy of the chemical step was then obtained from the cumulative sum of the free energy differences between successive windows, as shown in eq where both Ui and Ui+1 comprise the first three terms of eq (that is, EQM, EQM/MMelect and EQM/MMvdW) and the average was carried out on all points of the ith window. Finally, the free energy profile was obtained from the FEP method, including the corresponding harmonic QM zero-point energy (ZPE) for the stationary points. The potential of mean force for each chemical step was obtained using a combination of the umbrella sampling (US) approach[43] with the weighted histogram analysis method (WHAM).[44] A series of MD simulations were performed on adding a restraint along the collective reaction coordinate s,[45] with an umbrella force constant of 3000 kJ mol–1 Å–2. In every window, QM/MM MD-NVT simulations were performed with a total of 3.25 ps at 310 K with a time step of 0.5 fs (a total of 6500 steps). The definition of the s coordinate depended on the considered step, always reduced to a combination of distances. Thus, for the first step of the inhibitor B3, the following distances were included: d(Sγ,Cβ), d(Sγ,Hγ), and d(Hγ,Nε). In the second step, only the protonation of the double bond was studied because the protonation of the carbonyl group gave a high barrier in FEP calculations; therefore, the distances involved were d(Sγ,Cβ), d(Hγ,Cα), and d(Hγ,Nε). In the case of the B4 inhibitor, the distance d(H*,O) was also accounted for in the first step, the carbon of the carbonyl moiety being the one involved. The second step was followed with a combination of the distances: d(Sγ,C), d(Hγ,Nε), d(Hγ,O*), d(H*,O*), and d(H*,O). All of the information needed to define the equally distributed milestones from which the collective variable s was constructed was obtained from an analysis of the different MEPs previously traced. In addition, the error of the PMFs was evaluated as the standard deviation derived from a total of 1000 randomly bootstrapped PMFs.[46] Finally, the QM/MM interaction energy was computed as a sum of the contribution of each residue of the protein to the total interaction energy:According to eq , the protein–substrate interaction energy can be decomposed by the residue when the polarized wave function (Ψ) that describes the QM subset of atoms is used. Because of the large number of structures that must be evaluated to obtain a representative population, the semiempirical Hamiltonian AM1[47] was used to describe the QM region in these QM/MM MD calculations.

Results and Discussion

The first step in our study was to carry out a deep analysis of the interactions established between the two studied compounds and the active site of Mpro in the initial E:I state. Figure shows a schematic representation of hydrogen bond interactions, while Figure reports the average interaction energies (electrostatic plus Lennard–Jones) between Mpro residues and inhibitor fragments, thus including some interactions with residues that are not necessary at a close distance from the inhibitor. A list of relevant interatomic distances is deposited in Tables S4 and S5. An analysis of the results confirms the formation of a stable reactant Michaelis complex in both cases, with a similar pattern of interactions. Keeping in mind that the difference between B3 and B4 is restricted to the warhead, and in both cases the interactions with S1′ take place through mainly hydrogen bond interactions with the carbonyl oxygen next to P1 that is common in both inhibitors, the results appear as reasonable despite the very different functional groups in the P1′ position. Thus, this carbonyl group is interacting with the oxyanion hole located in S1′ formed by G143, S144, and C145. In addition, some indirect interactions that also stabilize the P1′ fragment, such as L27, N28, G146, and S147, were identified. The specific favorable interactions between the lactam ring on P1 and S1 are almost equivalent in both inhibitors, mainly through interactions with P140, N142, H163, and E166 (see Figure ). The backbone atoms of the residues of the P2 site are responsible for the interactions with Q189, H164, D187, and M165. Finally, the unfavorable interaction between R40 and the warhead of B3 and B4 is worth mentioning. Interestingly, R40 is ca. 8 Å from P1′ and the interaction, established basically between the carbonyl group of the peptide bond of R40 and P1′, corresponds to an electrostatic interaction. Thus, possible strategies to avoid this interaction in future inhibitor designs would require a redesign of the warheads. There are other positively charged side chains around the inhibitors at similar distances, such as K61 and R188, but their contribution is much less than that of R40, especially in the case of B4, due to a greater solvent exposure (see Figure and Figure S3). The conformation adopted by both compounds in the active site of Mpro can be compared with the X-ray crystal structures of related complexes. Thus, the cocrystal structure of the covalent adduct of PF-00835231 bound to SARS-CoV-2 Mpro (PDB code 6XHM)[5] that, as commented above is like B4, shows protein–ligand distances in S1, S2, and S3 equivalent to those shown in Figure b. These similarities are also observed when the distances between the key atoms involved in the inhibition reaction are compared: SγC145–CC=O, SγC145–NεH41, and NεH41–O*OH: 1.86, 3.71, and 3.80 Å, respectively, in the crystal structure and 2.90, 3.13, and 2.34 Å, respectively, in the B4 complex. With regard to B3, despite no X-ray structure being available for SARS-CoV-2 Mpro complexed with a structure comparable to that of B3, there is a cocrystal structure of the covalent adduct 2 of Hoffman and co-workers bound to SARS-CoV-1 Mpro (PDB code 6XHO).[5] An analysis of this structure provides similar conclusions regarding the interactions between the different subsites of the active site of the related proteins, CoV-1 and CoV-2 Mpro, and the corresponding compounds in the X-ray structure of SARS-CoV-1 Mpro and B3 in SARS-CoV-2 Mpro. Obviously, the absence of the carbonyl group at the α position of P1′ in 2 explains the lack of interactions with the oxyanion hole of the S1′ site. Nevertheless, a comparison of the interatomic distances that are related to the inhibition reaction, SγC145–Cβ, SγC145–NεH41, and NεH41–Cα, also shows similar values: 1.76, 3.96, and 3.23 Å, respectively, in the crystal structure and 3.29, 3.29, and 4.09 Å, respectively, in B3. Obviously, this comparison must be done with caution because the X-ray structures correspond to the protein–inhibitor covalent complex (E-I in our Schemes and 3) while the B3 and B4 structures analyzed at this point correspond to the initial reactant complex E:I. Thus, differences observed in the distances defining the attack of the sulfur atom of C145 to the corresponding carbon atom of the inhibitor (CC=O and Cβ for B3 and B4, respectively) are as expected. Anyway, the good overlap of the X-ray structures and the equilibrated E:I reactant complex support the quality of our initial state structures (see Figures S4 and S5).

Figure 1

Schematic representations of the H-bond interactions between the inhibitor and the SARS-CoV-2 Mpro derived from QM/MM MD simulations of B3 (a) and B4 (b) inhibitors in the E:I state.

Figure 2

Average interaction energies (electrostatic plus Lennard–Jones) between amino acids of Chain-A of SARS-CoV-2 Mpro and each fragment of the inhibitor B3 (a) and B4 (b) computed at the E:I state. The values were obtained as an average over 1000 structures from the AM1/MM MD simulations. The red bars correspond to the P1′:::S1′ interactions, the blue bars correspond to the P1:::S1 interactions, and the green bars correspond to the P2:::S2 interactions.

Schematic representations of the H-bond interactions between the inhibitor and the SARS-CoV-2 Mpro derived from QM/MM MD simulations of B3 (a) and B4 (b) inhibitors in the E:I state. Average interaction energies (electrostatic plus Lennard–Jones) between amino acids of Chain-A of SARS-CoV-2 Mpro and each fragment of the inhibitor B3 (a) and B4 (b) computed at the E:I state. The values were obtained as an average over 1000 structures from the AM1/MM MD simulations. The red bars correspond to the P1′:::S1′ interactions, the blue bars correspond to the P1:::S1 interactions, and the green bars correspond to the P2:::S2 interactions. Once it was confirmed that the E:I complex represents a stable reactant complex, in both cases, the inhibition reaction was studied according to the general mechanisms proposed in Schemes and 3 for the reactions with B3 and B4, respectively.

Inhibition of SARS-CoV-2 Mpro with B3

As shown in Scheme , after C145 is activated by a proton transfer to H41, thus forming the ion pair complex E:I, the covalent complex is formed by the nucleophilic attack of the sulfur atom of C145 to the Cβ atom of the B3 inhibitor. Then, the reaction is completed by the transfer of the proton from the protonated H41 to either the Cα atom, to render the final E-I covalent adduct, or to the carbonyl oxygen atom then ending in E-I′. Exploration of both mechanisms by M06-2X/6-31+G(d,p)/MM FEP calculations revealed that the formation of the former (i.e., the direct addition mechanism) is both thermodynamically and kinetically favored with respect to the formation of E-I′ (see Figure S6). Thus, while the reaction that renders the E-I product is strongly exergonic (−16.2 kcal mol–1), the energy of E-I′ product appears to be 15.2 kcal mol–1 higher than the initial reactant state, E:I. These differences in the reaction energies are also associated, as mentioned, with significant differences in activation energies: 13.7 and 21.0 kcal mol–1 to form E-I and E-I′, respectively. Consequently, the much more computationally demanding M06-2X/6-31+G(d,p)/MM US method was applied only to the exploration of the mechanism rendering the E-I final product. The resulting free energy profile for the covalent inhibition of SARS-CoV-2 Mpro with B3 is depicted in Figure , while the evolution of the selected bond distances along the PMF is shown in Figure . Details of the M06-2X/6-31+G(d,p)/MM FESs obtained by means of the US method are given in Figure S7.

Figure 3

Figure 4

Evolution of the selected bond distances along the PMF of the SARS-CoV-2 Mpro inhibition with B3: (a) formation of the ion pair E:I; (b) formation of the final E-I covalent complex. Vertical dashed lines represent the positions of the optimized TS structures.

M06-2X/6-31+G(d,p)/MM free energy profiles obtained with umbrella sampling MD for the inhibition mechanism of SARS-CoV-2 Mpro cysteine protease by B3 (red line) and B4 (blue line) inhibitors at 310 K. Evolution of the selected bond distances along the PMF of the SARS-CoV-2 Mpro inhibition with B3: (a) formation of the ion pair E:I; (b) formation of the final E-I covalent complex. Vertical dashed lines represent the positions of the optimized TS structures. According to our results, the full reaction mechanism of the inhibition of SARS-CoV-2 Mpro cysteine protease by B3 takes place in two steps. First, the proton from C145 is transferred to H41 with an activation energy barrier of 10.2 ± 0.3 kcal mol–1. The resulting ion pair complex, E:I, a zwitterionic species that according to previous studies is well described by the M06-2X functional here,[48] is clearly less stable than the initial complex, in which both residues of the C145/H41 dyad are in their neutral states (by ca. 8 kcal mol–1). This result agrees with our previous computational studies of the proteolysis reaction and the inhibition reaction with different inhibitors.[13,15] Thus, despite the quantitative energetic difference between the ion pair and the neutral form that appears to be dependent on the substrate, the neutral dyad must be considered as the starting state of the reaction catalyzed by Mpro. As shown in Figure a, the proton transfer from C145 to H41 is associated with a slight approach of the sulfur atom of the former to the nucleophilic atom of the substrate (from 3.5 to 2.8 Å). Then, the covalent bond formation between C145 and the Cβ atom of the substrate takes place concertedly with a proton transfer from the protonated H41 to the Cα atom of the substrate to reach the final E-I covalent complex. Interestingly, the barrier for the ion-pair formation is higher than the barrier of the covalent bond formation, if it is measured from the intermediate E:I. Nevertheless, because the first step is endergonic, we measured the activation free energy of the second step from the reactant E:I complex, and consequently, this is the rate-limiting step of the process with a free energy barrier of 13.5 ± 1.2 kcal mol–1, with breaking and forming bonds in a very asynchronous process (see Figure b). The transition state, TS2, defined as the maximum of the PMF but also confirmed by optimizing and characterizing a representative structure at the M06-2X/6-31+G(d,p)/MM level (see Figure and Table S6) as a saddle point of order 1, is characterized by Sγ–Cβ bond formation in a very advanced stage of the process (1.89 Å) but a proton transfer in an early stage of the reaction Hγ–Cα distance of ca. 1.70 Å. This concerted character was also confirmed by tracing the IRC down to the ion pair intermediate and the product from the optimized TS2, which in fact was used to generate the free energy profile with the FEP method described above. From a technical point of view, it is important to note that both methods, US and FEP, provide the same description of the process with only slight energetic differences, with or without addition of the entropic contribution of the QM region (see Figure vs Figure S8). Finally, an analysis of the interaction energies (computed as an average over the sum of electrostatic plus Lennard–Jones terms) between amino acids of Mpro and each fragment of the inhibitor B3 computed at TS2 shows that the pattern of interactions does not significantly change from that obtained in the E:I complex (see Figure a vs Figure S9).

Figure 5

Detail of M06-2X/6-31+G(d,p)/MM optimized structures of selected states located along the inhibition reaction of Mpro by B3. Carbon atoms of the inhibitor are shown in green, and those of the catalytic residues C145 and H41 are shown in cyan. Key distances are in Å.

Inhibition of SARS-CoV-2 Mpro with B4

As shown in Figure and schematically depicted in Scheme , the inhibition process with B4 is equivalent to that obtained with B3. Thus, the generation of the transient ion pair intermediate E:I by a proton transfer from C145 to H41 precedes the formation of the covalent complex between C145 and the carbonyl carbon atom of B4. The first step is virtually the same as in the case of the inhibition with B3, confirming that, once again, the neutral reactant complex is favored with respect to the ion pair dyad.[13−18] The evolution of the key distances with the progression of the reaction shows that the proton transfer from C145 to H41 is also associated with an approach of the former to the carbonyl carbon atom, from 3.0 to 2.2 Å, thus generating a more reactive conformation (Figure a). Then, the second step, which as in the case of B3 represents the rate-limiting step of the full inhibition process, involves the acylation of the protein together with the proton transfer from the protonated H41 to the carbonyl oxygen atom of the inhibitor with an energy barrier of 15.2 ± 1.1 kcal mol–1. This barrier is slightly higher than that obtained for B3, 13.5 kcal mol–1. This difference of ∼2 kcal mol–1 is also observed in the reaction energies, the reaction with B3 being slightly more exergonic (−12.5 ± 1.0 kcal mol–1) than the reaction with B4 (−10.5 ± 0.9 kcal mol–1). A recent computational study of the inhibition mechanism of the cysteine protease rhodesain by a dipeptidyl enoate in our laboratory showed that it can take place through cysteine attack on either the Cβ or the carbonyl carbon atom of the inhibitor, in an exergonic process with a low activation energy barrier.[34] In the current work, as revealed by the evolution of the interatomic distances monitored in Figure b, the proton transfer from the positively charged H41 to the carbonyl oxygen atom does not take place directly but occurs through the hydroxyl group of the substrate. Thus, the proton Hγ is transferred from Nε of H41 to the oxygen atom of the hydroxyl group, O*, simultaneous with the proton transfer, H*, from this hydroxyl oxygen atom to the carbonyl oxygen atom, O, of the substrate. A similar mechanism has been found for the inhibition reaction of Mpro with PF-00835231 using similar QM/MM methods but with the B3LYP functional.[18] Our activation free energy is 4.5 kcal mol–1 lower than the energy obtained in that study, which could be due to chemical and/or methodological differences. In this regard, limitations of B3LYP for describing thio-Michael additions have been previously noted.[48] It is also worth mentioning that our ion pair intermediate is clearly a stable minimum in the free energy surface, while a metastable ion pair catalytic dyad is formed by the proton transfer from C145 to H41 in that work. The transition state of the second step, defined as the maximum of the PMF and also confirmed by optimizing a representative structure at the M06-2X/6-31+G(d,p)/MM level (see Figure and Table S7) and tracing the IRC down to the ion pair and the final product complex, supports the proposed mechanism. The role played by the terminal hydroxyl group of B4 in the proton transfer from H41 to the carbonyl oxygen atom of the inhibitor agrees with the results of Hoffman and co-workers, who found a drop of potency in different HMKs when the terminal hydroxyl group was substituted by other groups.[5]

Figure 6

Figure 7

Detail of M06-2X/6-31+G(d,p)/MM optimized structures of selected states located along the inhibition reaction of Mpro by B4. Carbon atoms of the inhibitor are shown in green, and those of the catalytic residues C145 and H41 are shown in cyan. Key distances are in Å.

Evolution of the selected bond distances along the PMF of the SARS-CoV-2 Mpro inhibition by B4: (a) formation of the ion pair E:I; (b) formation of E-I covalent complex. Vertical dashed lines represent the positions of the optimized TS structures. Detail of M06-2X/6-31+G(d,p)/MM optimized structures of selected states located along the inhibition reaction of Mpro by B4. Carbon atoms of the inhibitor are shown in green, and those of the catalytic residues C145 and H41 are shown in cyan. Key distances are in Å. Finally, as in the case of B3, an analysis of the average interaction energies between residues of Mpro and each fragment of the inhibitor B4 computed at TS2 shows that the pattern of interactions does not significantly change from that obtained in the E:I complex (see Figure b vs Figure S10).

Conclusions

We have reported a detailed computational study of the inhibition of SARS-CoV-2 Mpro with two proposed covalent (peptidyl) inhibitors: B3 and B4. Both inhibitors share the same recognition part, which is equivalent to that proposed in our previous study,[13] but differ in the activated carbonyl warheads. The results provide information on warhead effects in SARS-CoV-2 Mpro covalent inhibitors. The full inhibition processes have been explored with two different DFT/MM methods: first, FEP methods starting from optimized TSs, and second, US relying on the nudged elastic band and the calculation of the minimum energy paths. There is good agreement between the results derived from these different methodologies. Our results show that the inhibition process with both compounds takes place by a two-step mechanism, in which the formation of a high-energy intermediate (the C145–/H41+ ion pair) precedes the protein–inhibitor covalent bond formation. An analysis of the free energy profiles, the geometries of the states appearing along the reaction path, and the interactions between the inhibitors and the different pockets of the active site, confirms a notable effect of the warhead on the kinetics and thermodynamics of the process of the second step. This second step (corresponding to enzyme–inhibitor covalent bond formation) appears to be the rate-limiting step of the process for both inhibitors, with activation free energies of 13.5 ± 1.2 and 15.2 ± 1.1 kcal mol–1 for B3 and B4, respectively. The lower activation free energy of B3, together with a slightly more stable final covalent product (by 2 kcal mol–1), suggests that future designs should be based on the modification over this kind of warhead introduced in B3. In addition, the highly disfavored reverse processes in both cases (26.0 and 25.7 kcal mol–1 for B3 and B4, respectively) suggest a clear irreversible character of both proposed new compounds, with potential corresponding advantages for medicinal applications. It is important to note that our previously proposed peptidyl nitroalkene inhibitor B2, which shares the same recognition moiety as B3 and B4, showed a slightly lower activation energy barrier (9.8 kcal mol–1) and a less exergonic inhibition process (−11.4 kcal mol–1). These differences may be partially due to the different computational strategies, but they could also be advantages to obtain more irreversible-character inhibitors. These results are in good agreement with available experimental data on peptidyl covalent Mpro inhibitors, but there are no biochemical data for a direct comparison with our proposed compounds. The computed QM/MM interaction energies between the inhibitor, decomposed by fragments, and the amino acids of the active site of Mpro confirm that these interactions are dominated in both cases, B3 and B4, by those in the P1:::S1 site, as in our previous studies.[13,15] Finally, the good overlap between the structures of either the reactant complex E:I or the final covalent product E-I and two cocrystal structures of the covalent adduct of similar compounds bound to SARS-CoV-2 Mpro suggests that B3 and B4 can bind well in the active site. The low barrier obtained for B4 suggests that the terminal hydroxyl group is an important structural element in its inhibitory activity. This would also mean that modulation of the pKa of this group could represent an effective strategy to improve the potency of this specific class of HMKs. In summary, our QM/MM study of the inhibition of Mpro by two covalent (peptidyl) inhibitors, B3 and B4, whose design was guided by medicinal chemistry experience and by results derived from our previous computational studies, indicates that both, but particularly B3, could be used as templates to redesign candidates as drug leads against COVID-19. From a drug discovery standpoint, the development of highly selective compounds can benefit from the Mpro specificity in cleaving proteins after the Gln residue, a characteristic not detected in human enzymes.

35 in total

1. Automatic atom type and bond type perception in molecular mechanical calculations.

Authors: Junmei Wang; Wei Wang; Peter A Kollman; David A Case
Journal: J Mol Graph Model Date: 2006-02-03 Impact factor: 2.518

2. Role of solvent on nonenzymatic peptide bond formation mechanisms and kinetic isotope effects.

Authors: Katarzyna Świderek; Iñaki Tuñón; Sergio Martí; Vicent Moliner; Juan Bertrán
Journal: J Am Chem Soc Date: 2013-05-29 Impact factor: 15.419

3. Computational Studies Suggest Promiscuous Candida antarctica Lipase B as an Environmentally Friendly Alternative for the Production of Epoxides.

Authors: Miquel A Galmés; Katarzyna Świderek; Vicent Moliner
Journal: J Chem Inf Model Date: 2021-07-12 Impact factor: 4.956

4. Structure of M^pro from SARS-CoV-2 and discovery of its inhibitors.

Authors: Zhenming Jin; Xiaoyu Du; Yechun Xu; Yongqiang Deng; Meiqin Liu; Yao Zhao; Bing Zhang; Xiaofeng Li; Leike Zhang; Chao Peng; Yinkai Duan; Jing Yu; Lin Wang; Kailin Yang; Fengjiang Liu; Rendi Jiang; Xinglou Yang; Tian You; Xiaoce Liu; Xiuna Yang; Fang Bai; Hong Liu; Xiang Liu; Luke W Guddat; Wenqing Xu; Gengfu Xiao; Chengfeng Qin; Zhengli Shi; Hualiang Jiang; Zihe Rao; Haitao Yang
Journal: Nature Date: 2020-04-09 Impact factor: 49.962

5. Structure-based design of antiviral drug candidates targeting the SARS-CoV-2 main protease.

Authors: Wenhao Dai; Bing Zhang; Xia-Ming Jiang; Haixia Su; Jian Li; Yao Zhao; Xiong Xie; Zhenming Jin; Jingjing Peng; Fengjiang Liu; Chunpu Li; You Li; Fang Bai; Haofeng Wang; Xi Cheng; Xiaobo Cen; Shulei Hu; Xiuna Yang; Jiang Wang; Xiang Liu; Gengfu Xiao; Hualiang Jiang; Zihe Rao; Lei-Ke Zhang; Yechun Xu; Haitao Yang; Hong Liu
Journal: Science Date: 2020-04-22 Impact factor: 47.728

6. Designing of improved drugs for COVID-19: Crystal structure of SARS-CoV-2 main protease M^pro.

Authors: Hylemariam Mihiretie Mengist; Xiaojiao Fan; Tengchuan Jin
Journal: Signal Transduct Target Ther Date: 2020-05-09

7. Unraveling the SARS-CoV-2 Main Protease Mechanism Using Multiscale Methods.

Authors: Carlos A Ramos-Guzmán; J Javier Ruiz-Pernía; Iñaki Tuñón
Journal: ACS Catal Date: 2020-09-28 Impact factor: 13.084

8. Revealing the molecular mechanisms of proteolysis of SARS-CoV-2 M^pro by QM/MM computational methods.

Authors: Katarzyna Świderek; Vicent Moliner
Journal: Chem Sci Date: 2020-06-25 Impact factor: 9.825

9. Inhibition Mechanism of SARS-CoV-2 Main Protease with Ketone-Based Inhibitors Unveiled by Multiscale Simulations: Insights for Improved Designs*.

Authors: Carlos A Ramos-Guzmán; J Javier Ruiz-Pernía; Iñaki Tuñón
Journal: Angew Chem Int Ed Engl Date: 2021-11-03 Impact factor: 15.336

2 in total

1. Testing Affordable Strategies for the Computational Study of Reactivity in Cysteine Proteases: The Case of SARS-CoV-2 3CL Protease Inhibition.

Authors: Carlos A Ramos-Guzmán; José Luis Velázquez-Libera; J Javier Ruiz-Pernía; Iñaki Tuñón
Journal: J Chem Theory Comput Date: 2022-05-13 Impact factor: 6.578

2. Computationally driven discovery of SARS-CoV-2 M^pro inhibitors: from design to experimental validation.

Authors: Léa El Khoury; Zhifeng Jing; Alberto Cuzzolin; Alessandro Deplano; Daniele Loco; Boris Sattarov; Florent Hédin; Sebastian Wendeborn; Chris Ho; Dina El Ahdab; Theo Jaffrelot Inizan; Mattia Sturlese; Alice Sosic; Martina Volpiana; Angela Lugato; Marco Barone; Barbara Gatto; Maria Ludovica Macchia; Massimo Bellanda; Roberto Battistutta; Cristiano Salata; Ivan Kondratov; Rustam Iminov; Andrii Khairulin; Yaroslav Mykhalonok; Anton Pochepko; Volodymyr Chashka-Ratushnyi; Iaroslava Kos; Stefano Moro; Matthieu Montes; Pengyu Ren; Jay W Ponder; Louis Lagardère; Jean-Philip Piquemal; Davide Sabbadin
Journal: Chem Sci Date: 2022-02-10 Impact factor: 9.825

2 in total