Neha Verma1, Jack A Henderson1, Jana Shen1. 1. Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, Baltimore, Maryland 21201, United States.
Abstract
The SARS coronavirus 2 (SARS-CoV-2) main protease (Mpro) is an attractive broad-spectrum antiviral drug target. Despite the enormous progress in structure elucidation, the Mpro's structure-function relationship remains poorly understood. Recently, a peptidomimetic inhibitor has entered clinical trial; however, small-molecule orally available antiviral drugs have yet to be developed. Intrigued by a long-standing controversy regarding the existence of an inactive state, we explored the proton-coupled dynamics of the Mpros of SARS-CoV-2 and the closely related SARS-CoV using a newly developed continuous constant pH molecular dynamics (MD) method and microsecond fixed-charge all-atom MD simulations. Our data supports a general base mechanism for Mpro's proteolytic function. The simulations revealed that protonation of His172 alters a conserved interaction network that upholds the oxyanion loop, leading to a partial collapse of the conserved S1 pocket, consistent with the first and controversial crystal structure of SARS-CoV Mpro determined at pH 6. Interestingly, a natural flavonoid binds SARS-CoV-2 Mpro in the close proximity to a conserved cysteine (Cys44), which is hyper-reactive according to the CpHMD titration. This finding offers an exciting new opportunity for small-molecule targeted covalent inhibitor design. Our work represents a first step toward the mechanistic understanding of the proton-coupled structure-dynamics-function relationship of CoV Mpros; the proposed strategy of designing small-molecule covalent inhibitors may help accelerate the development of orally available broad-spectrum antiviral drugs to stop the current pandemic and prevent future outbreaks.
The SARS coronavirus 2 (SARS-CoV-2) main protease (Mpro) is an attractive broad-spectrum antiviral drug target. Despite the enormous progress in structure elucidation, the Mpro's structure-function relationship remains poorly understood. Recently, a peptidomimetic inhibitor has entered clinical trial; however, small-molecule orally available antiviral drugs have yet to be developed. Intrigued by a long-standing controversy regarding the existence of an inactive state, we explored the proton-coupled dynamics of the Mpros of SARS-CoV-2 and the closely related SARS-CoV using a newly developed continuous constant pH molecular dynamics (MD) method and microsecond fixed-charge all-atom MD simulations. Our data supports a general base mechanism for Mpro's proteolytic function. The simulations revealed that protonation of His172 alters a conserved interaction network that upholds the oxyanion loop, leading to a partial collapse of the conserved S1 pocket, consistent with the first and controversial crystal structure of SARS-CoVMpro determined at pH 6. Interestingly, a natural flavonoid binds SARS-CoV-2Mpro in the close proximity to a conserved cysteine (Cys44), which is hyper-reactive according to the CpHMD titration. This finding offers an exciting new opportunity for small-molecule targeted covalent inhibitor design. Our work represents a first step toward the mechanistic understanding of the proton-coupled structure-dynamics-function relationship of CoV Mpros; the proposed strategy of designing small-molecule covalent inhibitors may help accelerate the development of orally available broad-spectrum antiviral drugs to stop the current pandemic and prevent future outbreaks.
The ongoing COVID-19
pandemic has claimed more than 1.5 million
human lives worldwide, but an effective therapeutic intervention has
yet to be developed. COVID-19 is caused by a new coronavirus called
severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2, GenBank
accession code: MN908947.3[1]). Belonging
to the genus Betacoronavirus, SARS-CoV-2 is closely
related to several bat SARS-like coronaviruses, and SARS-CoV which
caused an outbreak in 2002–2003, as well as MERS-CoV (Middle
East Respiratory Syndrome coronavirus) which caused an outbreak in
2012.[1] The related genus Alphacoronavirus includes humancoronaviruses that cause the common cold.[2] The replicase gene of coronaviruses encodes two
polyproteins (pp1a and pp1ab), which are processed into functional
nonstructural proteins to form the viral replication/transcription
complex by two cysteine proteases:[3] the
main protease (Mpro, also known as 3CLpro due to the similarity to
picornavirus 3C protease) and the papain-like protease (PLpro).[3] The Mpro cleaves the larger polyprotein pp1ab
at 11 sites, with the sequence of (small)-X-(Leu/Phe/Met)-Gln↓(Gly/Ala/Ser), where X denotes any amino acid and ↓ indicates the cleavage site. Note, Gln in the P1
position is absolutely required by the Mpro, and the substrate specificity
pattern is not shared by any human protease.[3] The S1′, S1, and S2 substrate binding pockets are conserved
among the Mpros of SARS-, MERS-, and SARS-like bat-CoVs.[3] Thus, the Mpro is an attractive broad-spectrum
antiviral drug target for combating the COVID-19 pandemic and preventing
future outbreaks through the zoonotic transmission of SARS-like bat
coronaviruses to human.[3,4]SARS-CoV-2/CoV Mpros have
been well characterized by crystallography.
To date, Protein Data Bank (PDB) contains 54 and 188 entries of X-ray
crystal structures for SARS-CoV and SARS-CoV-2 Mpros, respectively,
most of which are inhibitor-bound complexes.[5−10] The rapid structure determination led to the discovery of the first
broad-spectrum lead inhibitor against SARS, MERS, and the related
coronavirus Mpros.[2,5] Most recently, Pfizer initiated
a clinical trial of PF-07304814, a prodrug which metabolizes into
a ketone-based peptidomimetic inhibitor PF-00835231 against the Mpros
of SARS-CoV-2/CoV and other coronaviruses.[11] Despite the progress, the structure–function relationship
and conformational plasticity of the Mpros remain poorly understood.
Such information is urgently needed to advance broad-spectrum antiviral
drug design.SARS-CoV-2/CoV Mpros function as a dimer.[2,3] With
96% overall sequence identity and 100% identity in the active site,
their structures are nearly superimposable.[5] The Mpro protomer is composed of three domains (Figure a). The chymotrypsin-like β-barrel
domains I (residues 10–99) and II (residues 100–182)
embrace the Cys-His catalytic dyad at the interface, while the α-helical
domain III (residues 198–303) is connected to domain II via
a long loop called L3 (residues 183–197). Dimerization of Mpro
is mainly supported by the domain III interactions with domain III
and the N-terminal loop (residues 1–10) of the opposite protomer.[12] In particular, residues 1–7 (called N-finger[13]) form hydrogen bonds and electrostatic interactions
with the S1 pocket (Figure c), and have been considered critical for both dimerization
and active site integrity of SARS-CoVMpro.[13] Deletion or mutation of Arg4[14] and Gly11[15] was shown to completely inactivate SARS-CoVMpro, while deletion of residues 1–3 maintained 76% enzymatic
activity.[14] Deletion of residues 1–5
in the related transmissible gastroenteritis virusMpro nearly abolished
the activity.[16]
Figure 1
Crystal structure and
pH-dependent activity for SARS-CoV-2/CoV
Mpros. (a) X-ray crystal structure of SARS-Cov-2 Mpro dimer (PDB: 6y2g(5)). Protomer A is colored gray, while domains I, II, and
III in protomer B are colored red, green, and cyan, respectively.
The oxyanion loop L1 (residues 138–145) is colored magenta
and the N-finger loop (residues 1–10) as well as G11 are colored
yellow. (b) pH-activity profile of SARS-CoV Mpro determined by the
HPLC-based peptide cleavage assay.[17] (c)
Zoomed-in view of the S1 specificity pocket in protomer A and the
N-finger loop in protomer B. Residues involved in the dimer interface
interactions are explicitly shown (those in protomer B denoted with
an asterisk). The conserved S1 pocket residues Phe140, His163, Glu166,
and His172, and the catalytic dyad Cys145 and His41 are explicitly
shown.
Crystal structure and
pH-dependent activity for SARS-CoV-2/CoV
Mpros. (a) X-ray crystal structure of SARS-Cov-2Mpro dimer (PDB: 6y2g(5)). Protomer A is colored gray, while domains I, II, and
III in protomer B are colored red, green, and cyan, respectively.
The oxyanion loop L1 (residues 138–145) is colored magenta
and the N-finger loop (residues 1–10) as well as G11 are colored
yellow. (b) pH-activity profile of SARS-CoVMpro determined by the
HPLC-based peptide cleavage assay.[17] (c)
Zoomed-in view of the S1 specificity pocket in protomer A and the
N-finger loop in protomer B. Residues involved in the dimer interface
interactions are explicitly shown (those in protomer B denoted with
an asterisk). The conserved S1 pocket residues Phe140, His163, Glu166,
and His172, and the catalytic dyad Cys145 and His41 are explicitly
shown.Mpro’s substrate binding
site is composed of the canonical
S1–S4 pockets, whereby the S1 pocket is formed by the side
chains of His163, Phe140 and the backbones of Met165, Glu166, and
His172, all of which are conserved (Figure c). In the majority of crystal structures
of SARS-CoV-2Mpro, Glu166 forms a hydrogen bond with His172, while
His163 forms an aromatic stacking with Phe140 (Figure c and Figure S1). Additionally, the side chain of Glu166 and backbone of Phe140
form hydrogen bonds with the amino group of Ser1 (amino terminus)
of the opposite protomer (Figure c and Figure S1). The oxyanion
hole of the cysteine protease is formed by the backbone amides of
Gly143, Ser144, and Cys145,[5,13] and the oxyanion loop
(residues 138–145, also called L1) forms a wall of the S1 pocket
(Figure c).The proteolytic activity of SARS-CoVMpro displays a bell-shaped
pH profile, with the peak at pH 7.0[17] (or
7.4[18]) and sharp decreases at lower or
higher pH (Figure b); however, the molecular mechanism of the pH dependence is not
understood. Interestingly, a pH 6 crystal structure of SARS-CoVMpro
(PDB: 1uj1(13)), which captured two independent protomers in
the asymmetric unit, revealed an inactive conformation for protomer
B, in which the oxyanion loop is moved toward the S1 pocket and the
hydrogen bond between Glu166 and His172 is broken. Hilgenfeld et al.
hypothesized that the pH-dependent activity change arises from the
protonation state switches of His163 and His172.[13,17] Based on the crystal structures,[13,17] His163 and
His172 were suggested as neutral and charged at physiological pH,
respectively, and consequently, the protonation of His163 was hypothesized
to induce the aforementioned conformational changes in the S1 pocket.[17] Shortly afterward, the authenticity of the inactive
conformation was questioned, by arguing that a fusion tag added to
the termini from the recombinant protein expression system may have
destabilized the Ser1 interactions with the S1 pocket of the opposite
protomer, causing it to collapse.[19]Recently, an X-ray structure of SARS-CoV-2Mpro dimer (PDB: 6y2g(5)) revived the debate regarding the existence of an inactive
Mpro protomer and its pH dependence. In protomer B of this crystal
structure, the Glu166–His172hydrogen bond is broken, similar
to the inactive protomer of the pH 6 structure of SARS-CoVMpro (PDB: 1UJ1(13)), although the large movement of the oxyanion loop is absent.[5] Testing the hypothesis of an inactive state by
crystallography proves challenging, as most crystal structures of
Mpros were determined in space group C2 which captures only one protomer
in the asymmetric unit which favors a more ordered structure.To shed light on the above controversy, we conducted a computational
study of SARS-CoV-2/CoV Mpros, employing the newly developed GPU-accelerated
implicit-solvent continuous constant pH MD (CpHMD)[20,21] to determine the protonation states and microsecond-long conventional
fixed-charge MD simulations to characterize the detailed conformational
changes. Our data confirms the general base mechanism for Mpro’s
proteolytic function and suggests that protonation of His172 (and
not His163) induces a partial collapse of the S1 pocket, consistent
with the aforementioned pH 6 crystal structure of SARS-CoVMpro. The
simulations also predict an alternative Cys residue for targeted covalent
inhibitor design. Taken together, our work represents a first step
toward a mechanistic understanding of the pH-dependent structure–dynamics–function
relationship of SARS-CoV-2/CoV Mpros and contributes urgently needed
knowledge for broad-spectrum antiviral drug design.
Results and Discussion
Protonation
States of SARS-CoV-2 Mpro
The Mpro protomer
has 12 Cys and 7 His residues, including the catalytic dyad Cys145
and His41 as well as the absolutely conserved His163, His164, and
His172 in the S1 pocket. Because the solution pKa’s of Cys and His (8.5 and 6.5, respectively) are only
1 pH unit away from the physiological pH, it is important to determine
their protonation states in the protein before conducting a fixed-charged
MD study. We performed the pH replica-exchange GBNeck2-CpHMD simulations,[20,21] which have been shown to give accurate pKa estimates, particularly for Cys[22] and
His.[21] The simulations were initiated from
the aforementioned crystal structure of SARS-CoV-2Mpro (PDB: 6y2g(5)), whereby all side chains of Asp/Glu/His/Cys/Lys were allowed
to titrate. Nine pH replicas were used over a pH range of 5–9,
with a total sampling time of 495 ns. The calculated pKa’s were well converged (Figure S2).CpHMD showed that the catalytic dyad residues Cys145
and His41 are both neutral at physiological pH, suggesting that the
cleavage reaction of the Mpro follows a general base and not an ion-pair
mechanism like SARS-CoV-2/CoV PLpros,[23] consistent with the pH-dependent activity data of SARS-CoVMpro.[18] The latter gave the pKa’s of 8.3 and 6.4 for Cys145 and His41, respectively.[18] The calculated pKa of His41 (6.6/6.7 for the two protomers, Table ) is in excellent agreement with experiment,
while the calculated pKa of Cys145 is
overestimated by about 2 units. Trajectory snapshots showed that the
deprotonated form of Cys145 requires stabilization by several hydrogen
bonds that are formed after extensive conformational sampling, consistent
with our previous findings regarding the deprotonation of cysteines.[22,24] We defer the detailed investigation of this topic to a future study.
Table 1
Calculated pKa’s
of Relevant Cys and His Residues in the Two Protomers
of SARS-CoV-2/CoV Mprosa
SARS-CoV-2
SARS-CoV
Residue
A
B
A
B
C22
7.5
6.8
6.8
7.0
C44
7.0
9.2
4.2
5.8
C145
neutral
∼9.4b
neutral
neutral
H41
6.6
6.7
6.2
6.5
H163
neutral
neutral
neutral
neutral
H164
neutral
neutral
neutral
neutral
H172
6.6
6.6
6.6
7.7c
Calculations based on the crystal
structures of SARS-CoV-2 (PDB: 6y2g(5) and SARS-CoV
(PDB: 1uk2(13)) Mpros. A complete list of pKa’s is given in SI Table . For residues that did not titrate in the
simulation pH range, their protonation states are given.
Estimated based on partial deprotonation
in the pH range 7.0–9.0.
See discussion in the main text.
Calculations based on the crystal
structures of SARS-CoV-2 (PDB: 6y2g(5) and SARS-CoV
(PDB: 1uk2(13)) Mpros. A complete list of pKa’s is given in SI Table . For residues that did not titrate in the
simulation pH range, their protonation states are given.Estimated based on partial deprotonation
in the pH range 7.0–9.0.See discussion in the main text.Importantly, CpHMD titration gave
the pKa’s of His163 and His164
below 5, whereas the pKa of His172 is
6.6/6.6 (Table ).
Thus, our data supports the hypothesis
that His163 is neutral, but it contradicts the hypothesis that His172
is charged at physiological pH.[5,13,17] Consequently, our data suggests that the protonation state switch
of His172 (and not the proposed His163[5,13,17]) is responsible for the conformational changes of
the S1 pocket at pH 6.
Conformational Changes in the S1 Pocket of
SARS-CoV-2 Mpro
To test the hypothesis that the protonation
of His172 leads to
a conformational deactivation of SARS-CoV-2Mpro, we carried out two
independent runs of 2 μs fixed-charged MD starting from the
crystal structure (PDB: 6y2g(5)). In run 1, His172 was
neutral in protomer A and charged in protomer B, while in run 2, His172
was charged in protomer A and neutral in protomer B. All other residues
were fixed in the standard protonation states, as determined by the
CpHMD titration for physiological pH (see Table S1). During the simulations, the dimer and each protomer remained
stable with a Cα root-mean-square deviation (RMSD) below 3 Å
(Figure S3 and Figure S4). However, the RMSD of the oxyanion loop in the monomer
with the charged His172 increased to about 2 Å within 500 ns,
whereas that in the monomer with the neutral His172 stayed below 1
Å throughout the 2 μs simulations (Figure S5).To understand the impact of the His172 protonation
state, we first examine its interaction with Glu166 and the latter
interaction with His163 by calculating the probability distributions
of the side chain minimum distances. In both simulation runs, the
charged His172 forms a salt bridge (weak hydrogen bond) with Glu166
(distance of ∼3.5 Å); however, the interaction between
the neutral His172 and Glu166 is flexible, as evident from the distance
distribution showing two peaks at 4.5/6 Å in the first run (Figure a) and 3.5/6 Å
in the second run (Figure S6). Interestingly,
in the presence of the charged His172, Glu166 also forms a weak hydrogen
bond with His163 (distance of ∼3.5 Å), whereas with the
neutral His172, the Glu166–His163 interaction is largely absent,
with the most probable distance of 7 Å in the first run (Figure b) and 6.2 Å
in the second run (Figure S6). Trajectory
snapshots and the χ1 angle distributions revealed
that the side chain of Glu166 adopts different rotamer states dependent
on the protonation state of His172. In the presence of the charged
His172, Glu166 prefers the g– state
(χ1 of 300°), whereas with the neutral His172,
it mainly samples the trans state (χ1 of 180°, similar to the crystal structure value) and occasionally
adopts the g+ state (χ1 of 90°, Figure d and Figure S6). These data suggest that
when His172 is charged, Glu166 is locked in the g– state, interacting with both His172 and His163,
whereas when His172 is neutral, Glu166 side chain is more flexible,
sampling both trans and g+ states and loosely interacting with His172 and His163.
Figure 2
Conformational
changes of the S1 pocket in SARS-CoV-2 Mpro is coupled
to the switch of His172 protonation state. (a,b) Probability distribution
of the minimum distance between the carboxylate oxygens of Glu166
and the imidazole nitrogens of His172 (a) and His163 (b). (d) Distribution
of the χ1 angle of Glu166. (e) Distribution of the
distance between the center of mass of Glu166 (carboxylate oxygens)
and the oxyanion loop (Cα atoms of residues 138–145).
(g) Distribution of the distance between the center of mass of the
aromatic rings of His163 and Phe140. (h) Distribution of the χ1 angle of Phe140. Data for the protomer with neutral and charged
His172 are colored green and orange, respectively. All plots were
based on simulation run 1 (run 2 data given in Figure S6). The black and red dashed lines indicate the corresponding
values from the protomer A of the SARS-CoV-2 Mpro structure (PDB: 6y2g(5)), and the inactive protomer B of the SARS-CoV Mpro structure
(PDB: 1uj1(13)). (c,f,i) Snapshots showing the conformational
differences between the protomers with neutral (green) and charged
(orange) His172. The oxyanion loop is colored magenta.
Conformational
changes of the S1 pocket in SARS-CoV-2Mpro is coupled
to the switch of His172 protonation state. (a,b) Probability distribution
of the minimum distance between the carboxylate oxygens of Glu166
and the imidazolenitrogens of His172 (a) and His163 (b). (d) Distribution
of the χ1 angle of Glu166. (e) Distribution of the
distance between the center of mass of Glu166 (carboxylate oxygens)
and the oxyanion loop (Cα atoms of residues 138–145).
(g) Distribution of the distance between the center of mass of the
aromatic rings of His163 and Phe140. (h) Distribution of the χ1 angle of Phe140. Data for the protomer with neutral and charged
His172 are colored green and orange, respectively. All plots were
based on simulation run 1 (run 2 data given in Figure S6). The black and red dashed lines indicate the corresponding
values from the protomer A of the SARS-CoV-2Mpro structure (PDB: 6y2g(5)), and the inactive protomer B of the SARS-CoVMpro structure
(PDB: 1uj1(13)). (c,f,i) Snapshots showing the conformational
differences between the protomers with neutral (green) and charged
(orange) His172. The oxyanion loop is colored magenta.
Movement of the Oxyanion Loop in SARS-CoV-2 Mpro
The
protonation state of His172 is also coupled to the conformation of
the oxyanion loop L1. The Cα RMSD of L1 increased to 1.5–2
Å in the protomer with charged His172, but it remained below
1 Å in the protomer with neutral His172 (Figure S5). Trajectory snapshots revealed that L1 is moved
closer to the S1 pocket when His172 is charged. In the presence of
the charged His172, the distance between the center of mass of Glu166
and that of L1 is moved to ∼4 Å, whereas with neutral
His172, the distance is ∼6 Å (Figure e and Figure S6). The L1 movement is likely due to the change in the interaction
between Phe140 on L1 and His163. Crystal structures of SARS-Cov-2/CoV
Mpros show an aromatic ring stacking interaction between them, which
is thought to stabilize L1 in the active form.[5,13] Simulations
showed that the stacking interaction is maintained when His172 is
neutral (distance of ∼4 Å) and disrupted when His172 is
charged (distance of ∼8 Å, Figure g, Figure S6).
Interestingly, the loss in the aromatic stacking is correlated with
a change in the side chain conformation of Phe140, which adopts a trans rotamer state (χ1 angle ∼180°)
in the protomers with charged His172 and a g– state (χ1 angle ∼300°)
in the protomers with neutral His172 (Figure h,i and Figure S6).
Comparison to Crystal Structures of Active and Inactive Mpro
Protomers
The conformational changes of the S1 pocket induced
by the protonation of His172 are consistent with the differences between
the crystal structures of the active protomer of SARS-CoV-2Mpro (PDB:6y2g[5]) and the inactive protomer of SARS-CoVMpro (PDB:1uj1,[13] see Figure dashed lines). Two seeming discrepancies are worth noting. While
the Glu166–His172 interaction in the presence of neutral His172
is flexible in our simulations, the aforementioned crystal structure
shows a hydrogen bond. This discrepancy may be due to the crystal
vs solution condition. In fact, about 10 crystal structures of SARS-CoV-2Mpro display a corresponding distance of ∼5 Å (Figure S1), in line with the simulation data
(Figure a and Figure S6). Another seeming discrepancy is in
the rotamer state of Phe140. In the simulations, Phe140 switches to
a trans rotamer (χ1 of 180°)
with the charged His172, whereas in the crystal structure of the inactive
protomer B of SARS-CoVMpro (PDB: 1uj1(13)), the χ1 angle remains similar to that in the active protomer. This
may be explained by the extremely low electron density of the Phe140
side chain atoms in the X-ray structure (B-factor nearly 100).
N-Finger
Interactions and Dimer Stability in SARS-CoV-2 Mpro
Having
tested our hypothesis that protonation of His172 leads to
a partial collapse of the S1 pocket, we turn to its interactions with
the N-finger, which were thought to be disrupted at low pH.[5,13] Note, the positions of the first two N-terminal residues were missing
in the pH 6 crystal structure of SARS-CoVMpro (PDB: 1uj1(13)). In most of the crystal structures of SARS-CoV-2Mpro,
the interactions between Phe140/His172A/B and Ser1B/A are stable, while the Glu166A/B–Ser1B/A distance varies between 2.2 and 5.3 Å (Figure S1).In the simulation run 1, the
interactions between Glu166A/His172A (neutral
His172A) and Ser1B are largely maintained, and
the interaction between Phe140A and Ser1B is
flexible (Figure ).
In contrast, the interactions between Phe140B/Glu166B/His172B (charged His172B) and Ser1A are completely abolished (Figure ). In the simulation run 2, the differences
are more pronounced; all three interactions sample the crystal structure
values when His172 is neutral but become disrupted when His172 is
charged (Figure S7).
Figure 3
Interactions between
the S1 pocket residues and Ser1 of the opposite
protomer in SARS-CoV-2 Mpro are disrupted by the protonation of His172.
(a,b,c) Probability distributions of the Glu166A/Phe140A/His172A–Ser1B distances are
colored green; distributions of the Glu166B/Phe140B/His172B–Ser1A distances are
colored orange. His172 is neutral in protomer A and charged in protomer
B. The Glu166/Phe140–Ser1 distance refers to that between the
N-terminal amino nitrogen and the nearest carboxyl oxygen of Glu166
or the carbonyl oxygen of Phe140. The His172–Ser1 distance
is refers that between the backbone carbonyl oxygen of Ser1 and the
nearest imidazole nitrogen of His172. (d) Zoomed-in view of the interactions
between the S1 pocket (protomer A) and Ser1 (protomer B) in the X-ray
structure of SARS-CoV-2 Mpro (PDB: 6y2g(5)). Hydrogen
bonds are shown in dashed lines.
Interactions between
the S1 pocket residues and Ser1 of the opposite
protomer in SARS-CoV-2Mpro are disrupted by the protonation of His172.
(a,b,c) Probability distributions of the Glu166A/Phe140A/His172A–Ser1B distances are
colored green; distributions of the Glu166B/Phe140B/His172B–Ser1A distances are
colored orange. His172 is neutral in protomer A and charged in protomer
B. The Glu166/Phe140–Ser1 distance refers to that between the
N-terminal amino nitrogen and the nearest carboxyl oxygen of Glu166
or the carbonyl oxygen of Phe140. The His172–Ser1 distance
is refers that between the backbone carbonyl oxygen of Ser1 and the
nearest imidazolenitrogen of His172. (d) Zoomed-in view of the interactions
between the S1 pocket (protomer A) and Ser1 (protomer B) in the X-ray
structure of SARS-CoV-2Mpro (PDB: 6y2g(5)). Hydrogen
bonds are shown in dashed lines.To further investigate the effect of His172 protonation on the
dimerization stability, we examined several hydrogen bond contacts
formed between the two protomers in the crystal structure (PDB: 6y2g(5)). Distance distributions show that all interactions, Gln127A/B/ Glu290A/B–Arg4B/A, Ser10A/B–Ser10B/A, Glu14A/B–Gly11B/A, and Val125A/B–Ala7B/A, remain
stable regardless of the His172 protonation state (Figure S8), which is consistent with the dimer stability in
both simulation runs (Figure S3).
Protonation
States and Proton-Coupled Dynamics of SARS-CoV Mpro
To further
corroborate our findings, we conducted CpHMD titration
and fixed-charged MD simulations of SARS-CoVMpro starting from the
crystal structure (PDB: 1uk2(13), convergence plots are
given in Figure S9). The CpHMD titration
results of SARS-CoVMpro are consistent with those from SARS-CoV-2Mpro (Table and Table S1). The pKa’s of the catalytic His41 and Cys145 remain neutral at physiological
pH (Table ), supporting
a general base mechanism for Mpro’s proteolytic function.[18] Consistently, His163 and His164 remain neutral
above pH 5.0. However, while the pKa of
His172A is identical to that in SARS-CoV-2Mpro, the pKa of His172B is one unit higher.
The difference can be attributed to the loss of the interaction with
Ser1A in the crystal structure (PDB: 1uk2(13)). Due to the charged amino group, the His172B–Ser1A interaction would destabilize the charged
form of His172B and thereby lowering its pKa.To test the hypothesis that protonation of His172
leads to the conformational deactivation of the S1 pocket in SARS-CoVMpro, we performed a 2 μs fixed-charged MD simulation starting
from the aforementioned crystal structure (PDB: 1uk2(13)), in which His172 was fixed in neutral and charged states
in protomers A and B, respectively. The differences in the S1 pocket
interactions between protomer A and B are in complete agreement with
those of SARS-CoV-2Mpro (Figure S10).
We did not test the S1 pocket interactions with Ser1, as the hydrogen
bonds between the S1 pocket residues of protomer B and Ser1 of protomer
A in the X-ray structure of SARS-CoVMpro are absent (PDB: 1uk2(13)).
New Opportunity for Small-Molecule Broad-Spectrum
Covalent Inhibitor
Design
Currently, the majority of potent broad-spectrum antiviral
inhibitors are large substrate mimetics carrying an electrophilic
warhead that covalently binds with the catalytic Cys145 in SARS-CoV-2/CoV
Mpros, e.g., the α-ketoamide based inhibitors from the Hilgenfeld
lab[5] and Pfizer’s ketone-based inhibitor
PF-00835231 that has entered clinical trial.[11] The latter is promising, as it also showed additive/synergistic
effect in combination with remdesivir which targets the RNA polymerase.
However, delivery of large peptidomimetic compounds such as Pfizer’s
inhibitor[11] often requires intravenous
infusion, which necessitates a hospital visit and lowers patient compliance.
Thus, orally available small-molecule antiviral drugs are more desirable.Interestingly, the CpHMD titration revealed that Cys22 and Cys44,
which are conserved among the Mpros of SARS-CoV-2/CoV, MERS-CoV, and
SARS-like bat-CoV, are significantly more nucleophilic than Cys145,
as they are readily deprotonated at physiological pH (Table ). The hyper-reactivity of Cys44
and its promixity to the substrate binding pocket make it a particularly
attractive site of covalent linkage. A recent crystal structure showed
that galangin (3,5,7-trihydroxyflavone), a naturally occurring flavonoid
with antitumor activities,[25,26] is bound to SARS-CoV-2Mpro in a position with a minimum heavy-atom distance of 3.9 Å
to either Cys44 or Cys145 (Figure ). Thus, we envision the design of targeted covalent
inhibitors of Mpros by installing a weakly electrophilic warhead on
ganlangin to enable the covalent bond formation with the Cys44 thiolate.
It is also conceivable that two warheads can be installed on both
ends of ganlangin to bond with both Cys44 and Cys145, allowing superior
potency. These strategies may lead to novel small-molecule broad-spectrum
covalent inhibitors that have not been attempted so far.
Figure 4
Natural flavonoid
compound offers a starting point for designing
small-molecule broad-spectrum targeted covalent inhibitors. X-ray
structure of SARS-CoV-2 Mpro in complex with galangin (PDB: 6m2n) shows that it is
in the proximity of both Cys44 and Cys145.
Natural flavonoid
compound offers a starting point for designing
small-molecule broad-spectrum targeted covalent inhibitors. X-ray
structure of SARS-CoV-2Mpro in complex with galangin (PDB: 6m2n) shows that it is
in the proximity of both Cys44 and Cys145.
Conclusion
Our simulations showed that the protonation state
of His172 profoundly
impacts the conformation of the S1 pocket through an interaction network
involving His172, Glu166, His163, and Phe140. When His172 is neutral,
Glu166 is flexible and does not form a hydrogen bond with His163,
which allows His163 to maintain the aromatic stacking with Phe140
and thereby keeping the oxyanion loop in the active open conformation.
However, when His172 is charged, it forms a salt bridge with Glu166,
which locks Glu166 in the g– rotameric
state, stabilizing the hydrogen bond interaction with His163. The
latter results in a disruption of the stacking interaction between
His163 and Phe140 and consequently a collapse (closing) of the oxyanion
loop toward the S1 pocket. Consistent with the pH 6 crystal structure
of SARS-CoVMpro, our data provides a detailed molecular mechanism
of the conformational deactivation and loss of catalytic activity
of SARS-CoV-2/CoV Mpros with decreased pH. Our future work will address
the mechanism of the Mpro’s activity decrease at high pH. We
speculate that deprotonation of the amino terminus may play a significant
role, as the solution pKa of the α-amino
group is 8.0[27] and in the proximity of
the negatively charged Glu166, the pKa may shift somewhat higher.The present study has several caveats
worth pointing out. In the
CpHMD simulations, the N-terminus was acetylated which may have reduced
the stability of the dimer interface. In the fixed-charge simulations,
only the protonation state of His172 was varied, as it is one of the
two residues suggested by experiment to possibly make an impact on
the S1 pocket; however, His41 may also become protonated at similar
pH (e.g., pH 6), and this effect was neglected in the present study.
With regard to the dimerization stability, the fixed-charge simulations
of SARS-CoV-2Mpro demonstrated that His172 protonation disrupted
the S1 pocket interactions with Ser1 but not other dimer interface
contacts. The latter could be a result of the limited sampling time.While a complete elucidation of the pH-dependent conformational
mechanisms awaits future investigation, the present work offers a
first step in understanding the proton-coupled structure–dynamics–function
relationships of coronavirus Mpros. Our finding of the hyper-reactive
cysteine and the proposed strategies of designing small molecule targeted
covalent inhibitors may help accelerate the development of orally
available broad-spectrum antiviral drugs to stop the current pandemic
and prevent future outbreaks.
Methods and Protocols
System
Preparation for CpHMD Simulations
The coordinates
of the X-ray crystal structures of SARS-CoV-2 (PDB: 6y2g,[5] determined at pH 8.5) and SARS-CoV (PDB 1uk2,[13] determined at pH 8) Mpros were retrieved from the PDB.
Both structures contain two independent protomers in the asymmetric
unit. The N- and C-termini of each protomer were acetylated and amidated,
respectively (free in the fixed-charge simulations). Missing hydrogens
were added using the HBUILD facility in CHARMM (C38b2);[28] dummy hydrogens for titration of Asp/Glu were
added using a custom CHARMM script.[29] A
short energy minimization was performed for the hydrogen positions
using the steepest decent and Newton–Raphson methods for 20
steps with the heavy atoms constrained. The force field parameters
and coordinate files were constructed using the LEAP utility in AMBER.[30] The protein was represented by the ff14sb force
field[31] and solvent was represented by
the GBNeck2 (igb = 8) implicit-solvent model.[32] The default mbondi3 intrinsic Born radii, except for His and Cys,
for which the following radii optimized for CpHMD were used: 1.17
Å for imidazolenitrogen atoms of His[20] and 2.0 Å for sulfur atom of Cys.[33] Following energy minimization using the steepest decent and conjugate-gradient
algorithms for 5000 and 1000 steps, respectively, the system underwent
restrained equilibration at pH 7 in four stages (2000 MD steps each).
The force constant in the heavy-atom restraint was 5, 2, 1, and 0
kcal/mol/Å2 in the four stages. The final structure
was used for the CpHMD titration simulations.
pH Replica-Exchange CpHMD
Simulations
The CpHMD titration
simulations were performed with the GPU-accelerated GBNeck2-CpHMD
method[21] in Amber18.[30] The pH replica-exchange protocol[34] was used to accelerate convergence. To allow for the use of one
or two GPU cards, we applied the newly implemented asynchronous replica-exchange
scheme.[23] For SARS-CoV-2Mpro, 9 replicas
over the pH range 5–9 with an interval of 0.5 pH unit were
used, with an aggregate sampling time of 495 ns. For SARS-CoVMpro,
9 replicas over the pH range 4.5–8.5 were used, with an aggregate
sampling time of 360 ns. All Asp, Glu, His, Cys, and Lys side chains
were allowed to titrate, with the CpHMD default model pKa’s of 3.8, 4.2, 6.5, 8.5, and 10.4, respectively.[21] The simulations were run at 300 K with an ionic
strength of 0.15 M and an effectively infinite cutoff (999 Å)
for nonbonded interactions. The SHAKE algorithm was used to constrain
the bonds involving the hydrogens to allow for a 2 fs time step. For
pKa calculations and analysis, data from
the first 20 ns per replica were discarded.
Conventional Fixed-Charge
All-Atom MD Simulations
The
initial structures for SARS-CoV-2/CoV Mpros (PDB: 6y2g(5) and 1uk2(13)) prepared using the LEAP utility in
Amber.[30] The termini of each protomer were
left in the free form. Hydrogens were added according to the appropriate
protonation states (see below). Sodium and chloride ions were added
to represent a neutral system with a 150 mM physiological salt concentration.
Two 2-μs all-atom fixed-charged MD simulations were performed
for SARS-CoV-2/CoV Mpros (PDB: 6y2g(5)), whereby
the protonation states (except for His172) were fixed to those as
determined by CpHMD for physiological pH, i.e., Asp(−)/Glu(−)/His(0)/Cys(0).
In simulation run 1, His172 was neutral in protomer A and charged
in protomer B, whereas in simulation run 2, His172 was charged in
monomer A and neutral in monomer B. As a comparison, one 2-μs
simulation for SARS-CoVMpro was performed using the same protocol
as run 1 of SARS-CoV-2Mpro.All simulations were performed
with Amber18.[30] The Mpro dimer was placed
in a truncated octahedral water box, with a minimum of 11 Å between
the protein heavy atoms and the edges of the water box. The protein
was represented by the ff14SB force field[31] and water represented by the TIP3P[35] model.
Long-range electrostatic interactions were treated with the particle-mesh
Ewald method.[36] A nonbonded cutoff of 8
Å was used. SHAKE was turned on to allow a 2 fs time step. Initially,
the system was energy minimized by applying 5000 steps of steepest
descent followed by 5000 steps of conjugate gradient minimization
with a force constant of 25 kcal/mol/Å2 to the solute
heavy atoms. The force constant was then reduced to 5 kcal/mol/Å2, and the system was heated from 100 to 300 K in the canonical
ensemble for 50 ps followed by 250 ps restrained and 100 ps unrestrained
equilibration in the isothermal–isobaric ensemble with the
isotropic Berendsen barostat.[37] Finally,
the production runs of 2 μs each were performed, starting from
a different random initial velocity seed for each system. All analysis
was performed with the Amber module CPPTRAJ[38] using the last 1 μs of each trajectory.
Authors: Mohammad Khedri; Reza Maleki; Mohammad Dahri; Mohammad Moein Sadeghi; Sima Rezvantalab; Hélder A Santos; Mohammad-Ali Shahbazi Journal: Drug Deliv Transl Res Date: 2021-09-03 Impact factor: 5.671
Authors: Chunlong Ma; Zilei Xia; Michael Dominic Sacco; Yanmei Hu; Julia Alma Townsend; Xiangzhi Meng; Juliana Choza; Haozhou Tan; Janice Jang; Maura V Gongora; Xiujun Zhang; Fushun Zhang; Yan Xiang; Michael Thomas Marty; Yu Chen; Jun Wang Journal: J Am Chem Soc Date: 2021-12-03 Impact factor: 15.419
Authors: Guillem Macip; Pol Garcia-Segura; Júlia Mestres-Truyol; Bryan Saldivar-Espinoza; María José Ojeda-Montes; Aleix Gimeno; Adrià Cereto-Massagué; Santiago Garcia-Vallvé; Gerard Pujadas Journal: Med Res Rev Date: 2021-10-26 Impact factor: 12.388