Sergio Martí1, Kemel Arafet1,2, Alessio Lodola2, Adrian J Mulholland3, Katarzyna Świderek1, Vicent Moliner1. 1. BioComp Group, Institute of Advanced Materials (INAM), Universitat Jaume I, 12071 Castelló, Spain. 2. Dipartimento di Scienze degli Alimenti e del Farmaco, Università degli Studi di Parma, I-43124 Parma, Italy. 3. Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol BS8 1TS, United Kingdom.
Abstract
The COVID-19 pandemic, caused by the severe acute respiratory syndrome coronavirus-2, SARS-CoV-2, shows the need for effective antiviral treatments. Here, we present a simulation study of the inhibition of the SARS-CoV-2 main protease (Mpro), a cysteine hydrolase essential for the life cycle of the virus. The free energy landscape for the mechanism of the inhibition process is explored by QM/MM umbrella sampling and free energy perturbation simulations at the M06-2X/MM level of theory for two proposed peptidyl covalent inhibitors that share the same recognition motif but feature distinct cysteine-targeting warheads. Regardless of the intrinsic reactivity of the modeled inhibitors, namely a Michael acceptor and a hydroxymethyl ketone activated carbonyl, our results confirm that the inhibitory process takes place by means of a two-step mechanism, in which the formation of an ion pair C145/H41 dyad precedes the protein-inhibitor covalent bond formation. The nature of this second step is strongly dependent on the functional groups in the warhead: while the nucleophilic attack of the C145 sulfur atom on the Cα of the double bond of the Michael acceptor takes place concertedly with the proton transfer from H41 to Cβ, in the compound with an activated carbonyl, the sulfur attacks the carbonyl carbon concomitant with a proton transfer from H41 to the carbonyl oxygen via the hydroxyl group. An analysis of the free energy profiles, structures along the reaction path, and interactions between the inhibitors and the different pockets of the active site on the protein shows a measurable effect of the warhead on the kinetics and thermodynamics of the process. These results and QM/MM methods can be used as a guide to select warheads to design efficient irreversible and reversible inhibitors of SARS-CoV-2 Mpro.
The COVID-19 pandemic, caused by the severe acute respiratory syndrome coronavirus-2, SARS-CoV-2, shows the need for effective antiviral treatments. Here, we present a simulation study of the inhibition of the SARS-CoV-2 main protease (Mpro), a cysteine hydrolase essential for the life cycle of the virus. The free energy landscape for the mechanism of the inhibition process is explored by QM/MM umbrella sampling and free energy perturbation simulations at the M06-2X/MM level of theory for two proposed peptidyl covalent inhibitors that share the same recognition motif but feature distinct cysteine-targeting warheads. Regardless of the intrinsic reactivity of the modeled inhibitors, namely a Michael acceptor and a hydroxymethyl ketone activated carbonyl, our results confirm that the inhibitory process takes place by means of a two-step mechanism, in which the formation of an ion pair C145/H41 dyad precedes the protein-inhibitor covalent bond formation. The nature of this second step is strongly dependent on the functional groups in the warhead: while the nucleophilic attack of the C145 sulfur atom on the Cα of the double bond of the Michael acceptor takes place concertedly with the proton transfer from H41 to Cβ, in the compound with an activated carbonyl, the sulfur attacks the carbonyl carbon concomitant with a proton transfer from H41 to the carbonyl oxygen via the hydroxyl group. An analysis of the free energy profiles, structures along the reaction path, and interactions between the inhibitors and the different pockets of the active site on the protein shows a measurable effect of the warhead on the kinetics and thermodynamics of the process. These results and QM/MM methods can be used as a guide to select warheads to design efficient irreversible and reversible inhibitors of SARS-CoV-2 Mpro.
Despite the development
of efficient vaccines, the effect of the
COVID-19 pandemic around the world, produced by the severe acute respiratory
syndrome coronavirus-2—SARS-CoV-2—has emphasized the
need for antiviral treatments. Moreover, considering the capabilities
of the virus to mutate, like any other viruses that contain RNA genetic
material such as this or the influenza viruses, the corresponding
risk of a decrease in the effectiveness of the vaccines spurs the
need for complementary strategies to fight against the pandemic. Many
efforts have focused on understanding the life cycle of SARS-CoV-2,
to provide information about possible ways of developing drugs.[1−3] Among the proteins involved in the replication of the virus, the
main coronavirus protease (SARS-CoV-2 Mpro) is a most attractive
target due to its intrinsic features, including its distinguishing
ability to cleave proteins after the glutamine residue,[4] catalytic features which make Mpro unique with respect to human proteases. The most effective Mpro inhibitors identified so far, including the clinical candidates
PF-00835231, incorporate a glutamine residue or a bioisostere at P1
position (see below) to give potency and selectivity and a peptidomimetic
scaffold of moderate size endowed with branched hydrophobic substituents
at both the P2 and P3 positions.[5−7] These compounds act by a covalent
mechanism, and so a reactive “warhead” is required:
i.e., an electrophilic group to form a covalent bond between the active
site cysteine residue (C145), previously activated by a histidine
residue (H41), and the inhibitor. Warheads that have been traditionally
used in cysteine proteases range from classical Michael acceptors
(MAs) to activated carbonyl derivatives, including α-ketoamides,
aldehydes, and hydroxymethyl ketone (HMK).[4,6,8] Nevertheless, previous investigations on
Mpro inhibition by those prototypical warheads did not
assess their relative reactivity versus C145 or explore the contribution
played by the recognition part to the overall inhibition process.Thus, there is an urgent need to understand the effects of warheads
on the reactivity as well as on the chemical stability of the covalent
adduct generated. This second aspect is fundamental in the context
of drug development, as it affects the (ir)reversibility of inhibition.
Irreversible inhibition through a covalent mechanism is generally
the most effective strategy to obtain a sustained response in vivo, since release from inhibition requires the resynthesis
of the engaged target. However, it can be challenging to identify
covalent compounds that selectively react with the specific residue
without causing irreversible labeling of other (host) targets that
may lead to liver toxicity and/or immune responses. These concerns
about off-target modifications are motivating the search for potent
covalent and reversible agents. It is also important to be able to
explore a range of different pharmacokinetic behaviors, and in particular,
there is a need to predict the degree of reversibility of inhibition.
It is therefore critical to develop computational protocols able to
predict the kinetic behavior of a covalent inhibitor.A wide
range of different computational methods has been used since
the emergence of COVID-19 for the discovery of small-molecule therapeutics.[7,9] With regard to the inhibition of Mpro, modeling based
on classical force fields can contribute to the discovery and optimization
of noncovalent inhibitors,[10−12] while the use of methods based
on quantum mechanics/molecular mechanics (QM/MM) potentials can assist
in the design of covalent inhibitors. We recently studied the mechanism
of the covalent inhibition of Mpro by the peptidyl-MA compound N3 designed by Jin and colleagues[6] and by two designed MA compounds (B1 and B2) by QM/MM molecular dynamics (MD) methods.[13] Our results indicated that both designed compounds may be promising
candidates as drug leads against COVID-19. Interestingly, according
to the computed thermodynamic properties of the full inhibition process, B1 could behave as an irreversible inhibitor while B2 may be a reversible inhibitor.As was previously proposed
from structures from X-ray diffraction
studies[3] and later supported by computational
studies using several different approaches, the chemical reaction
step of the Mpro inactivation involves the activation of
the SH group of C145 by the imidazole group of H41. Then, the reactive
nucleophilic thiolate formed (CysS–) would attack
the inhibitor, creating an inhibitor–protein covalent bond.[13−18] According to the recent literature, the equilibrium between the
neutral dyad and the CysS–/HisH+ ion
pair appears to be tipped in favor of the neutral pair by ligand binding,
but this may depend on the electrostatic properties of the ligand
itself. Thus, while some simulation studies report a neutral dyad
significantly more stable than the ion pair (by ca. 11 kcal mol–1) or identifying the ion pair as an unstable state,[18] others suggest that the ion pair is not so destabilized
with respect to the initial neutral dyad[13,14] or is even slightly more stable than the neutral dyad (e.g., our
previous study with B1).[13]Our previous study on the proteolysis reaction of SARS-CoV-2
Mpro, using a polypeptide with a fluorescent tag, concluded
that this enzyme differs somewhat from other cysteine proteases.[15] Thus, for instance, the initial enzyme:substrate
complex would not be the ion pair dyad C145–/H41+ (E:I) but rather the neutral C145/H41 dyad (E:I). This result
is in agreement with studies carried out by us, and others, using
different inhibitors and substrates (except for B1 mentioned
above),[13,16,18] but in contrast
with the protonation state of the catalytic dyad suggested from the
ligand-free SARS-CoV-2 Mpro recently solved by neutron
crystallography at pH 6.6.[19] Nevertheless,
as has already been pointed out, questions remain about how pH or
the presence of an inhibitor or substrate influence the protonation
state of the dyad in SARS-CoV-2 Mpro.[16] Moreover, while the formation of the ion pair dyad C145–/H41+ (E:I) and the nucleophilic attack of sulfur atom
of C145 to the carbonyl carbon atom of the peptide take place concertedly
in the proteolysis reaction catalyzed by Mpro,[15] in the inhibition reaction by N3, our designed B1 and B2 MA compounds,[13] or the simulation with Mpro–substrate
peptide models,[16] the formation of the
ion pair dyad and the sulfur–carbon covalent bond formation
appear to proceed in a stepwise manner. In any event, the rate-limiting
step of the process, in all three studied inhibitors, was the enzyme–inhibitor
covalent bond formation, with activation free energies ranging from
11.8 to 9.8 kcal mol–1.[13]An analysis of the QM/MM interaction energies between the
substrate
(the peptide in the proteolysis reaction or the inhibitor in the case
of the inhibitors) and the different binding pockets of Mpro and the peptide (in the study of the proteolysis reaction) or the
inhibitor (in the case of the inhibitors) confirms that they are dominated
by those in the P1:::S1 site. Thus, our previous results indicate
that a low-barrier C145 covalent modification depends on either the
warhead or the recognition portion. The recognition portion dictates
how the inhibitor is accommodated in the active site, which in turn
affects the subsequent chemical reaction step. Consequently, the design
of an efficient inhibitor must take into account the reactivity of
the warhead and the favorable interactions between the recognition
portion and the active site of the enzyme.[20] In all, the experience accumulated on the basis of the results derived
from previous studies on this and other cysteine proteases can be
used to guide the design of new compounds, and QM/MM simulations can
be considered a useful tool to get a detailed description of the chemical
steps of the inhibition of protein target covalent inhibitors. Moreover,
the obtained activation free energies and reaction energies obtained
with these high-level methods can confirm, or not, the viability of
the proposed inhibitors.Here, we propose and investigate the
inhibition of the SARS-CoV-2
Mpro by two potential covalent (peptidyl) inhibitors endowed
with two chemically diverse warheads. Building upon the findings on
information derived from our previous studies on the proteolysis of
Mpro [15] and on the reaction
of the inhibition with several peptidyl irreversible inhibitors,[13] we propose the two compounds B3 and B4 (Scheme ). A methyl oxo-enoate was used in B3, inspired
by the dimethyl fumarate structure,[21,22] while a hydroxymethyl
ketone (HMK) was used as a warhead in the B4 compound.
This reactive group is also present in the structure of PF-00835231
Mpro inhibitor, now in a clinical trial.[5] The recognition part possessed by both B3 and B4 compounds was selected on the basis of QM/MM results obtained
with the previously proposed inhibitors B1 and B2 and from an analysis of protein–substrate interactions
from QM/MM simulations of the proteolysis reaction.[15] We keep the recognition part as a short peptide-like compound,
combining the P1 moiety of B1 and the P2 and P3 moieties
of B2. This is a robust and systematic way of deciphering
the effect of specific changes in the intrinsic chemical reactivity
of the proposed inhibitors on the mechanism (and energetics) of inhibition.
As shown in our previous studies, stabilizing ligand:protein interactions
were established when this relatively small recognition part was used.[13] Thus, S2 is a small hydrophobic pocket without
strong hydrogen-bond interactions with P2. Therefore, an isobutyl
group was kept at the P2 site as in B2. The S3 subsite
of Mpro is completely exposed to the solvent, and then
we keep the shorter P3 of B2. Also, previous studies[6] suggested that different kinds of substituents
can be used in P3. It must be kept in mind that, despite the fact
that the PF-00835231 Mpro inhibitor shows an interaction
between the indole group (removed in our new compounds) and E166,
our previous study revealed favorable interactions between P3 and
M165 and Q189.[13] Finally, the glutamine
present in P1 of B1 was employed in these new compounds
due to the favorable interactions observed in our previous study of
the proteolysis.[15]
Scheme 1
Chemical Structures
of the Proposed (B3 and B4) Inhibitors of
SARS-CoV-2 Mpro
The warheads, P1′,
are highlighted in red, while P1–P3 fragments are shown in
blue, green, and black, respectively. The subpockets of the active
site are labeled with S numbering complementary to fragments of the
inhibitor. Asterisks indicate the main reactive centers of the inhibitors.
Chemical Structures
of the Proposed (B3 and B4) Inhibitors of
SARS-CoV-2 Mpro
The warheads, P1′,
are highlighted in red, while P1–P3 fragments are shown in
blue, green, and black, respectively. The subpockets of the active
site are labeled with S numbering complementary to fragments of the
inhibitor. Asterisks indicate the main reactive centers of the inhibitors.From a mechanistic point of view, the two proposed
compounds could
potentially react in different manners in the active site of the enzyme,
also because their key electrophilic centers not only possess a different
chemical environment but also are not topologically equivalent. Thus,
as shown in Scheme , after the formation of the ion pair E:I reactant complex with B3, the
attack of the sulfur of C145 to the β-carbon of the substrate
can take place, followed by a proton transfer from the protonated
H41 to the α-carbon, leading to the stable covalent product E-I. Nevertheless, on consideration of the nature of the warhead
in B3, the final proton transfer could also take place
to the carbonyl oxygen atom (E-I). In the case of the inhibition with B4, this
dual possibility of the final proton transfer does not appear after
the acylation of the enzyme because the proton from H41 can only be
transferred to the carbonyl oxygen atom of the inhibitor (Scheme ).
Scheme 2
Proposed Mechanism
of SARS-CoV-2 Mpro Cysteine Protease
Inhibition by B3
R1 and R2 are the
different
substituents, as shown in Scheme .
Scheme 3
Proposed Mechanism of SARS-CoV-2 Mpro Cysteine Protease
Inhibition by B4
R1 and R2 are different
substituents,
as shown in Scheme .
Proposed Mechanism
of SARS-CoV-2 Mpro Cysteine Protease
Inhibition by B3
R1 and R2 are the
different
substituents, as shown in Scheme .
Proposed Mechanism of SARS-CoV-2 Mpro Cysteine Protease
Inhibition by B4
R1 and R2 are different
substituents,
as shown in Scheme .The present study is a computational study
of the mechanism of
inhibition of Mpro by B3 and B4. The reaction mechanisms for each inhibitor were initially explored
by nudged elastic band calculations of the minimum energy paths. Then,
two free energy based methodologies, such as the umbrella sampling
(US) and free energy perturbation (FEP) methods, both at the density
functional theory level combined with classical force fields, were
employed to explore the full inhibition process.
Computational Methods
The coordinates for the starting point were taken from the X-ray
structure of the SARS-CoV-2 Mpro with the PF-00835231 inhibitor
in its active site (PDB ID 6XHM).[5] The PF-00835231 inhibitor
was then manually modified, leading to the two new enzyme–inhibitor
models. The missing force field parameters for each model were generated
using the Antechamber program,[23] available
in the AmberTools package (see Tables S1 and S2). The protonation states of the titratable amino acids were determined
using the empirical program PropKa ver. 3.0.3,[24] while the histidine residues were assigned by a detailed
visual inspection (see a list of all the pKa values in Table S3). Each model was neutralized
by adding eight sodium counterions that were placed in a box of 92.154
× 102.242 × 97.285 Å3 of TIP3P[25] water molecules.The next step for each
model consisted of 105 steps
of conjugate-gradient minimization, followed by a series of molecular
dynamics (MD) simulations in the NVT ensemble with the AMBER ff03
force field,[26] as implemented in NAMD software.[27] Initially the temperature was raised from 0
to 310 K with a short MD, using a constant increment of 3.1 K/ps.
Then 100 ps of MD was performed at 310 K, applying a constant scaling.
Finally, an equilibration was carried out by means of 10 ns of NVT
MD, using the Langevin thermostat.[28] All
simulations made use of the PME algorithm for the electrostatic interactions
with a force-switch scheme ranging from 14.5 to 16 Å and a time
step of 1 fs. The time evolution of the root-mean-square deviations
of the backbone atoms of the protein models, using the cpptraj facility,[29] confirmed that the two models became equilibrated
(see Figures S1a and S2a).An additive
hybrid QM/MM scheme was selected in the present study
for constructing the total Hamiltonian. The QM subset of atoms includes
the P1′, P1, and P2 positions of the inhibitor, together with
the C145 and H41 residues of the protein. Four link atoms were inserted
where the QM/MM boundary intersected covalent bonds in the positions
indicated on Figures S1b and S2b. Thus,
the QM part consisted of 89 atoms for the inhibitor B3 and 80 for B4.All the calculations were performed
with the QMCube suite,[30] in which the combination
of the NAMD and Gaussian09[31] programs was
used to construct the potential
energy function. The AMBER ff03[26] and the
TIP3P[25] force fields were selected to describe
the MM atoms, and the Minnesota functional M06-2X[32] with the split-valence 6-31+G(d,p) basis set was used to
treat the QM subset of atoms. This functional has been tested and
shown to be suitable for modeling this type of reactivity.[13,15,16,33,34] The position of any atom over 25 Å
from the substrate was fixed to speed up the calculations.Reaction
mechanisms for each inhibitor were initially explored
using the nudged elastic band (NEB)[35] approach
to set up plausible starting geometries for the transition structures.
Then, they were localized and characterized by a micromacro[36,37] Hessian-based localization scheme, and minimum energy paths (MEP)
were traced toward the corresponding minima. The information obtained
inat this stage was used in the fine-tuning of the free energy methodologies,
specifically the potential of mean force (PMF) and free energy perturbation
(FEP) methods. We applied these two distinct approaches separately
to calculate the free energy profiles for the reaction. It is important
to point out that herein we directly compute the free energy landscape
at a DFT/MM level higher than that in our previous QM/MM studies on
the inhibition of Mpro. Thus, the present calculations
apply a significantly higher level of theory in sampling, as detailed
below.In the case of the FEP method, which has been successfully
employed
in our laboratory for reactivity studies in various biological systems,[38−41] the reaction path obtained for each of the mechanisms was analyzed
to extract those consecutive geometries with an energy difference
greater than or equal to 1.5 kJ mol–1 (called “windows”).
Then, pure MM MD simulations were performed on each of these windows,
keeping frozen the atoms of the QM part. Each MD run was performed
in the NVT ensemble with a time step of 1 fs and a total time length
of 20 ps. The partial charges of the QM atoms were recalculated every
200 steps using the CHELPG[42] methodology,
because the MM region is changing during the sampling at every window
and can polarize the QM wave function and consequently propagate to
the MM engine (NAMD). The application of this protocol resulted in
a total number of 100 structures per window (called “points”)
with the same coordinates for the QM atoms, but with a different MM
environment configuration. In the following step, the coordinates
of the QM atoms for each point of the ith window
were replaced by those of the consecutive window (i + 1), and both the perturbed QM and Lennard–Jones energy
terms were evaluated. The free energy of the chemical step was then
obtained from the cumulative sum of the free energy differences between
successive windows, as shown in eq where both Ui and Ui+1 comprise the first
three terms of eq (that
is, EQM, EQM/MMelect and EQM/MMvdW) and the average was carried out on all points of the ith window. Finally, the free energy profile was obtained from the
FEP method, including the corresponding harmonic QM zero-point energy
(ZPE) for the stationary points.The potential of mean force
for each chemical step was obtained
using a combination of the umbrella sampling (US) approach[43] with the weighted histogram analysis method
(WHAM).[44] A series of MD simulations were
performed on adding a restraint along the collective reaction coordinate s,[45] with an umbrella force constant
of 3000 kJ mol–1 Å–2. In
every window, QM/MM MD-NVT simulations were performed with a total
of 3.25 ps at 310 K with a time step of 0.5 fs (a total of 6500 steps).
The definition of the s coordinate depended on the
considered step, always reduced to a combination of distances. Thus,
for the first step of the inhibitor B3, the following
distances were included: d(Sγ,Cβ), d(Sγ,Hγ), and d(Hγ,Nε). In the second step, only the protonation of the double bond was
studied because the protonation of the carbonyl group gave a high
barrier in FEP calculations; therefore, the distances involved were d(Sγ,Cβ), d(Hγ,Cα), and d(Hγ,Nε). In the case of the B4 inhibitor, the distance d(H*,O) was also
accounted for in the first step, the carbon of the carbonyl moiety
being the one involved. The second step was followed with a combination
of the distances: d(Sγ,C), d(Hγ,Nε), d(Hγ,O*), d(H*,O*), and d(H*,O). All of the information needed to define the equally
distributed milestones from which the collective variable s was constructed was obtained from an analysis of the different
MEPs previously traced. In addition, the error of the PMFs was evaluated
as the standard deviation derived from a total of 1000 randomly bootstrapped
PMFs.[46]Finally, the QM/MM interaction
energy was computed as a sum of
the contribution of each residue of the protein to the total interaction
energy:According to eq , the protein–substrate interaction
energy can be decomposed by the residue when the polarized wave function
(Ψ) that describes the QM subset of atoms is used. Because of
the large number of structures that must be evaluated to obtain a
representative population, the semiempirical Hamiltonian AM1[47] was used to describe the QM region in these
QM/MM MD calculations.
Results and Discussion
The first
step in our study was to carry out a deep analysis of
the interactions established between the two studied compounds and
the active site of Mpro in the initial E:I state. Figure shows
a schematic representation of hydrogen bond interactions, while Figure reports the average
interaction energies (electrostatic plus Lennard–Jones) between
Mpro residues and inhibitor fragments, thus including some
interactions with residues that are not necessary at a close distance
from the inhibitor. A list of relevant interatomic distances is deposited
in Tables S4 and S5. An analysis of the
results confirms the formation of a stable reactant Michaelis complex
in both cases, with a similar pattern of interactions. Keeping in
mind that the difference between B3 and B4 is restricted to the warhead, and in both cases the interactions
with S1′ take place through mainly hydrogen bond interactions
with the carbonyl oxygen next to P1 that is common in both inhibitors,
the results appear as reasonable despite the very different functional
groups in the P1′ position. Thus, this carbonyl group is interacting
with the oxyanion hole located in S1′ formed by G143, S144,
and C145. In addition, some indirect interactions that also stabilize
the P1′ fragment, such as L27, N28, G146, and S147, were identified.
The specific favorable interactions between the lactam ring on P1
and S1 are almost equivalent in both inhibitors, mainly through interactions
with P140, N142, H163, and E166 (see Figure ). The backbone atoms of the residues of
the P2 site are responsible for the interactions with Q189, H164,
D187, and M165. Finally, the unfavorable interaction between R40 and
the warhead of B3 and B4 is worth mentioning.
Interestingly, R40 is ca. 8 Å from P1′ and the interaction,
established basically between the carbonyl group of the peptide bond
of R40 and P1′, corresponds to an electrostatic interaction.
Thus, possible strategies to avoid this interaction in future inhibitor
designs would require a redesign of the warheads. There are other
positively charged side chains around the inhibitors at similar distances,
such as K61 and R188, but their contribution is much less than that
of R40, especially in the case of B4, due to a greater
solvent exposure (see Figure and Figure S3). The conformation
adopted by both compounds in the active site of Mpro can
be compared with the X-ray crystal structures of related complexes.
Thus, the cocrystal structure of the covalent adduct of PF-00835231
bound to SARS-CoV-2 Mpro (PDB code 6XHM)[5] that, as commented above is like B4, shows
protein–ligand distances in S1, S2, and S3 equivalent to those
shown in Figure b.
These similarities are also observed when the distances between the
key atoms involved in the inhibition reaction are compared: SγC145–CC=O, SγC145–NεH41, and NεH41–O*OH: 1.86,
3.71, and 3.80 Å, respectively, in the crystal structure and
2.90, 3.13, and 2.34 Å, respectively, in the B4 complex.
With regard to B3, despite no X-ray structure being available
for SARS-CoV-2 Mpro complexed with a structure comparable
to that of B3, there is a cocrystal structure of the
covalent adduct 2 of Hoffman and co-workers bound to SARS-CoV-1 Mpro (PDB code 6XHO).[5] An analysis of this structure provides
similar conclusions regarding the interactions between the different
subsites of the active site of the related proteins, CoV-1 and CoV-2
Mpro, and the corresponding compounds in the X-ray structure
of SARS-CoV-1 Mpro and B3 in SARS-CoV-2 Mpro. Obviously, the absence of the carbonyl group at the α
position of P1′ in 2 explains the lack of interactions
with the oxyanion hole of the S1′ site. Nevertheless, a comparison
of the interatomic distances that are related to the inhibition reaction,
SγC145–Cβ, SγC145–NεH41, and NεH41–Cα, also shows similar values: 1.76, 3.96, and 3.23 Å, respectively,
in the crystal structure and 3.29, 3.29, and 4.09 Å, respectively,
in B3. Obviously, this comparison must be done with caution
because the X-ray structures correspond to the protein–inhibitor
covalent complex (E-I in our Schemes and 3) while the B3 and B4 structures analyzed at this point correspond
to the initial reactant complex E:I. Thus, differences
observed in the distances defining the attack of the sulfur atom of
C145 to the corresponding carbon atom of the inhibitor (CC=O and Cβ for B3 and B4,
respectively) are as expected. Anyway, the good overlap of the X-ray
structures and the equilibrated E:I reactant complex
support the quality of our initial state structures (see Figures S4 and S5).
Figure 1
Schematic representations
of the H-bond interactions between the
inhibitor and the SARS-CoV-2 Mpro derived from QM/MM MD
simulations of B3 (a) and B4 (b) inhibitors
in the E:I state.
Figure 2
Average
interaction energies (electrostatic plus Lennard–Jones)
between amino acids of Chain-A of SARS-CoV-2 Mpro and each
fragment of the inhibitor B3 (a) and B4 (b)
computed at the E:I state. The values were obtained as
an average over 1000 structures from the AM1/MM MD simulations. The
red bars correspond to the P1′:::S1′ interactions, the
blue bars correspond to the P1:::S1 interactions, and the green bars
correspond to the P2:::S2 interactions.
Schematic representations
of the H-bond interactions between the
inhibitor and the SARS-CoV-2 Mpro derived from QM/MM MD
simulations of B3 (a) and B4 (b) inhibitors
in the E:I state.Average
interaction energies (electrostatic plus Lennard–Jones)
between amino acids of Chain-A of SARS-CoV-2 Mpro and each
fragment of the inhibitor B3 (a) and B4 (b)
computed at the E:I state. The values were obtained as
an average over 1000 structures from the AM1/MM MD simulations. The
red bars correspond to the P1′:::S1′ interactions, the
blue bars correspond to the P1:::S1 interactions, and the green bars
correspond to the P2:::S2 interactions.Once it was confirmed that the E:I complex represents
a stable reactant complex, in both cases, the inhibition reaction
was studied according to the general mechanisms proposed in Schemes and 3 for the reactions with B3 and B4, respectively.
Inhibition of SARS-CoV-2 Mpro with B3
As shown in Scheme , after C145 is activated by a proton transfer to H41,
thus
forming the ion pair complex E:I, the covalent complex is formed by the nucleophilic
attack of the sulfur atom of C145 to the Cβ atom
of the B3 inhibitor. Then, the reaction is completed
by the transfer of the proton from the protonated H41 to either the
Cα atom, to render the final E-I covalent
adduct, or to the carbonyl oxygen atom then ending in E-I′. Exploration of both mechanisms by M06-2X/6-31+G(d,p)/MM FEP calculations
revealed that the formation of the former (i.e., the direct
addition mechanism) is both thermodynamically and kinetically
favored with respect to the formation of E-I′ (see Figure S6). Thus, while the reaction that renders
the E-I product is strongly exergonic (−16.2 kcal
mol–1), the energy of E-I′ product
appears to be 15.2 kcal mol–1 higher than the initial
reactant state, E:I. These differences in the reaction
energies are also associated, as mentioned, with significant differences
in activation energies: 13.7 and 21.0 kcal mol–1 to form E-I and E-I′, respectively.
Consequently, the much more computationally demanding M06-2X/6-31+G(d,p)/MM
US method was applied only to the exploration of the mechanism rendering
the E-I final product. The resulting free energy profile
for the covalent inhibition of SARS-CoV-2 Mpro with B3 is depicted in Figure , while the evolution of the selected bond distances
along the PMF is shown in Figure . Details of the M06-2X/6-31+G(d,p)/MM FESs obtained
by means of the US method are given in Figure S7.
Figure 3
M06-2X/6-31+G(d,p)/MM free energy profiles obtained with umbrella
sampling MD for the inhibition mechanism of SARS-CoV-2 Mpro cysteine protease by B3 (red line) and B4 (blue line) inhibitors at 310 K.
Figure 4
Evolution
of the selected bond distances along the PMF of the SARS-CoV-2
Mpro inhibition with B3: (a) formation of
the ion pair E:I; (b) formation of the final E-I covalent complex. Vertical
dashed lines represent the positions of the optimized TS structures.
M06-2X/6-31+G(d,p)/MM free energy profiles obtained with umbrella
sampling MD for the inhibition mechanism of SARS-CoV-2 Mpro cysteine protease by B3 (red line) and B4 (blue line) inhibitors at 310 K.Evolution
of the selected bond distances along the PMF of the SARS-CoV-2
Mpro inhibition with B3: (a) formation of
the ion pair E:I; (b) formation of the final E-I covalent complex. Vertical
dashed lines represent the positions of the optimized TS structures.According to our results, the full reaction mechanism
of the inhibition
of SARS-CoV-2 Mpro cysteine protease by B3 takes place in two steps. First, the proton from C145 is transferred
to H41 with an activation energy barrier of 10.2 ± 0.3 kcal mol–1. The resulting ion pair complex, E:I, a zwitterionic species
that according to previous studies is well described by the M06-2X
functional here,[48] is clearly less stable
than the initial complex, in which both residues of the C145/H41 dyad
are in their neutral states (by ca. 8 kcal mol–1). This result agrees with our previous computational studies of
the proteolysis reaction and the inhibition reaction with different
inhibitors.[13,15] Thus, despite the quantitative
energetic difference between the ion pair and the neutral form that
appears to be dependent on the substrate, the neutral dyad must be
considered as the starting state of the reaction catalyzed by Mpro. As shown in Figure a, the proton transfer from C145 to H41 is associated with
a slight approach of the sulfur atom of the former to the nucleophilic
atom of the substrate (from 3.5 to 2.8 Å). Then, the covalent
bond formation between C145 and the Cβ atom of the
substrate takes place concertedly with a proton transfer from the
protonated H41 to the Cα atom of the substrate to
reach the final E-I covalent complex. Interestingly,
the barrier for the ion-pair formation is higher than the barrier
of the covalent bond formation, if it is measured from the intermediate E:I. Nevertheless,
because the first step is endergonic, we measured the activation free
energy of the second step from the reactant E:I complex,
and consequently, this is the rate-limiting step of the process with
a free energy barrier of 13.5 ± 1.2 kcal mol–1, with breaking and forming bonds in a very asynchronous process
(see Figure b). The
transition state, TS2, defined as the maximum of the
PMF but also confirmed by optimizing and characterizing a representative
structure at the M06-2X/6-31+G(d,p)/MM level (see Figure and Table S6) as a saddle point of order 1, is characterized by Sγ–Cβ bond formation in a very
advanced stage of the process (1.89 Å) but a proton transfer
in an early stage of the reaction Hγ–Cα distance of ca. 1.70 Å. This concerted character
was also confirmed by tracing the IRC down to the ion pair intermediate
and the product from the optimized TS2, which in fact
was used to generate the free energy profile with the FEP method described
above. From a technical point of view, it is important to note that
both methods, US and FEP, provide the same description of the process
with only slight energetic differences, with or without addition of
the entropic contribution of the QM region (see Figure vs Figure S8).
Finally, an analysis of the interaction energies (computed as an average
over the sum of electrostatic plus Lennard–Jones terms) between
amino acids of Mpro and each fragment of the inhibitor B3 computed at TS2 shows that the pattern of
interactions does not significantly change from that obtained in the E:I complex (see Figure a vs Figure S9).
Figure 5
Detail of M06-2X/6-31+G(d,p)/MM
optimized structures of selected
states located along the inhibition reaction of Mpro by B3. Carbon atoms of the inhibitor are shown in green, and
those of the catalytic residues C145 and H41 are shown in cyan. Key
distances are in Å.
Detail of M06-2X/6-31+G(d,p)/MM
optimized structures of selected
states located along the inhibition reaction of Mpro by B3. Carbon atoms of the inhibitor are shown in green, and
those of the catalytic residues C145 and H41 are shown in cyan. Key
distances are in Å.
Inhibition of SARS-CoV-2
Mpro with B4
As shown in Figure and schematically
depicted in Scheme , the inhibition process with B4 is equivalent to that
obtained with B3. Thus, the generation
of the transient ion pair intermediate E:I by a proton transfer from C145 to H41 precedes
the formation of the covalent complex between C145 and the carbonyl
carbon atom of B4. The first step is virtually the same
as in the case of the inhibition with B3, confirming
that, once again, the neutral reactant complex is favored with respect
to the ion pair dyad.[13−18] The evolution of the key distances with the progression of the reaction
shows that the proton transfer from C145 to H41 is also associated
with an approach of the former to the carbonyl carbon atom, from 3.0
to 2.2 Å, thus generating a more reactive conformation (Figure a). Then, the second
step, which as in the case of B3 represents the rate-limiting
step of the full inhibition process, involves the acylation of the
protein together with the proton transfer from the protonated H41
to the carbonyl oxygen atom of the inhibitor with an energy barrier
of 15.2 ± 1.1 kcal mol–1. This barrier is slightly
higher than that obtained for B3, 13.5 kcal mol–1. This difference of ∼2 kcal mol–1 is also
observed in the reaction energies, the reaction with B3 being slightly more exergonic (−12.5 ± 1.0 kcal mol–1) than the reaction with B4 (−10.5
± 0.9 kcal mol–1). A recent computational study
of the inhibition mechanism of the cysteine protease rhodesain by
a dipeptidyl enoate in our laboratory showed that it can take place
through cysteine attack on either the Cβ or the carbonyl
carbon atom of the inhibitor, in an exergonic process with a low activation
energy barrier.[34] In the current work,
as revealed by the evolution of the interatomic distances monitored
in Figure b, the proton
transfer from the positively charged H41 to the carbonyl oxygen atom
does not take place directly but occurs through the hydroxyl group
of the substrate. Thus, the proton Hγ is transferred
from Nε of H41 to the oxygen atom of the hydroxyl
group, O*, simultaneous with the proton transfer, H*, from this hydroxyl
oxygen atom to the carbonyl oxygen atom, O, of the substrate. A similar
mechanism has been found for the inhibition reaction of Mpro with PF-00835231 using similar QM/MM methods but with the B3LYP
functional.[18] Our activation free energy
is 4.5 kcal mol–1 lower than the energy obtained
in that study, which could be due to chemical and/or methodological
differences. In this regard, limitations of B3LYP for describing thio-Michael
additions have been previously noted.[48] It is also worth mentioning that our ion pair intermediate is clearly
a stable minimum in the free energy surface, while a metastable ion
pair catalytic dyad is formed by the proton transfer from C145 to
H41 in that work. The transition state of the second step, defined
as the maximum of the PMF and also confirmed by optimizing a representative
structure at the M06-2X/6-31+G(d,p)/MM level (see Figure and Table S7) and tracing the IRC down to the ion pair and the final
product complex, supports the proposed mechanism. The role played
by the terminal hydroxyl group of B4 in the proton transfer
from H41 to the carbonyl oxygen atom of the inhibitor agrees with
the results of Hoffman and co-workers, who found a drop of potency
in different HMKs when the terminal hydroxyl group was substituted
by other groups.[5]
Figure 6
Evolution of the selected
bond distances along the PMF of the SARS-CoV-2
Mpro inhibition by B4: (a) formation of the
ion pair E:I; (b) formation of E-I covalent complex. Vertical dashed
lines represent the positions of the optimized TS structures.
Figure 7
Detail of M06-2X/6-31+G(d,p)/MM optimized structures of
selected
states located along the inhibition reaction of Mpro by B4. Carbon atoms of the inhibitor are shown in green, and
those of the catalytic residues C145 and H41 are shown in cyan. Key
distances are in Å.
Evolution of the selected
bond distances along the PMF of the SARS-CoV-2
Mpro inhibition by B4: (a) formation of the
ion pair E:I; (b) formation of E-I covalent complex. Vertical dashed
lines represent the positions of the optimized TS structures.Detail of M06-2X/6-31+G(d,p)/MM optimized structures of
selected
states located along the inhibition reaction of Mpro by B4. Carbon atoms of the inhibitor are shown in green, and
those of the catalytic residues C145 and H41 are shown in cyan. Key
distances are in Å.Finally, as in the case
of B3, an analysis of the
average interaction energies between residues of Mpro and
each fragment of the inhibitor B4 computed at TS2 shows that the pattern of interactions does not significantly change
from that obtained in the E:I complex (see Figure b vs Figure S10).
Conclusions
We have reported a detailed
computational study of the inhibition
of SARS-CoV-2 Mpro with two proposed covalent (peptidyl)
inhibitors: B3 and B4. Both inhibitors share
the same recognition part, which is equivalent to that proposed in
our previous study,[13] but differ in the
activated carbonyl warheads. The results provide information on warhead
effects in SARS-CoV-2 Mpro covalent inhibitors. The full
inhibition processes have been explored with two different DFT/MM
methods: first, FEP methods starting from optimized TSs, and second,
US relying on the nudged elastic band and the calculation of the minimum
energy paths. There is good agreement between the results derived
from these different methodologies.Our results show that the
inhibition process with both compounds
takes place by a two-step mechanism, in which the formation of a high-energy
intermediate (the C145–/H41+ ion pair)
precedes the protein–inhibitor covalent bond formation. An
analysis of the free energy profiles, the geometries of the states
appearing along the reaction path, and the interactions between the
inhibitors and the different pockets of the active site, confirms
a notable effect of the warhead on the kinetics and thermodynamics
of the process of the second step. This second step (corresponding
to enzyme–inhibitor covalent bond formation) appears to be
the rate-limiting step of the process for both inhibitors, with activation
free energies of 13.5 ± 1.2 and 15.2 ± 1.1 kcal mol–1 for B3 and B4, respectively.
The lower activation free energy of B3, together with
a slightly more stable final covalent product (by 2 kcal mol–1), suggests that future designs should be based on the modification
over this kind of warhead introduced in B3. In addition,
the highly disfavored reverse processes in both cases (26.0 and 25.7
kcal mol–1 for B3 and B4, respectively) suggest a clear irreversible character of both proposed
new compounds, with potential corresponding advantages for medicinal
applications.It is important to note that our previously proposed
peptidyl nitroalkene
inhibitor B2, which shares the same recognition moiety
as B3 and B4, showed a slightly lower activation
energy barrier (9.8 kcal mol–1) and a less exergonic
inhibition process (−11.4 kcal mol–1). These
differences may be partially due to the different computational strategies,
but they could also be advantages to obtain more irreversible-character
inhibitors. These results are in good agreement with available experimental
data on peptidyl covalent Mpro inhibitors, but there are
no biochemical data for a direct comparison with our proposed compounds.The computed QM/MM interaction energies between the inhibitor,
decomposed by fragments, and the amino acids of the active site of
Mpro confirm that these interactions are dominated in both
cases, B3 and B4, by those in the P1:::S1
site, as in our previous studies.[13,15] Finally, the
good overlap between the structures of either the reactant complex E:I or the final covalent product E-I and two
cocrystal structures of the covalent adduct of similar compounds bound
to SARS-CoV-2 Mpro suggests that B3 and B4 can bind well in the active site. The low barrier obtained
for B4 suggests that the terminal hydroxyl group is an
important structural element in its inhibitory activity. This would
also mean that modulation of the pKa of
this group could represent an effective strategy to improve the potency
of this specific class of HMKs.In summary, our QM/MM study
of the inhibition of Mpro by two covalent (peptidyl) inhibitors, B3 and B4, whose design was guided by medicinal
chemistry experience
and by results derived from our previous computational studies, indicates
that both, but particularly B3, could be used as templates
to redesign candidates as drug leads against COVID-19. From a drug
discovery standpoint, the development of highly selective compounds
can benefit from the Mpro specificity in cleaving proteins
after the Gln residue, a characteristic not detected in human enzymes.
Authors: Carlos A Ramos-Guzmán; José Luis Velázquez-Libera; J Javier Ruiz-Pernía; Iñaki Tuñón Journal: J Chem Theory Comput Date: 2022-05-13 Impact factor: 6.578
Authors: Léa El Khoury; Zhifeng Jing; Alberto Cuzzolin; Alessandro Deplano; Daniele Loco; Boris Sattarov; Florent Hédin; Sebastian Wendeborn; Chris Ho; Dina El Ahdab; Theo Jaffrelot Inizan; Mattia Sturlese; Alice Sosic; Martina Volpiana; Angela Lugato; Marco Barone; Barbara Gatto; Maria Ludovica Macchia; Massimo Bellanda; Roberto Battistutta; Cristiano Salata; Ivan Kondratov; Rustam Iminov; Andrii Khairulin; Yaroslav Mykhalonok; Anton Pochepko; Volodymyr Chashka-Ratushnyi; Iaroslava Kos; Stefano Moro; Matthieu Montes; Pengyu Ren; Jay W Ponder; Louis Lagardère; Jean-Philip Piquemal; Davide Sabbadin Journal: Chem Sci Date: 2022-02-10 Impact factor: 9.825