Literature DB >> 34705466

Structural, Electronic, and Electrostatic Determinants for Inhibitor Binding to Subsites S1 and S2 in SARS-CoV-2 Main Protease.

Daniel W Kneller1,2, Hui Li2,3, Stephanie Galanie2,4, Gwyndalyn Phillips1,2, Audrey Labbé2,4, Kevin L Weiss1,2, Qiu Zhang1,2, Mark A Arnould2,3, Austin Clyde2,5,6, Heng Ma2,5, Arvind Ramanathan2,5,7, Colleen B Jonsson8, Martha S Head2,9, Leighton Coates2,10, John M Louis11, Peter V Bonnesen2,3, Andrey Kovalevsky1,2.   

Abstract

Creating small-molecule antivirals specific for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) proteins is crucial to battle coronavirus disease 2019 (COVID-19). SARS-CoV-2 main protease (Mpro) is an established drug target for the design of protease inhibitors. We performed a structure-activity relationship (SAR) study of noncovalent compounds that bind in the enzyme's substrate-binding subsites S1 and S2, revealing structural, electronic, and electrostatic determinants of these sites. The study was guided by the X-ray/neutron structure of Mpro complexed with Mcule-5948770040 (compound 1), in which protonation states were directly visualized. Virtual reality-assisted structure analysis and small-molecule building were employed to generate analogues of 1. In vitro enzyme inhibition assays and room-temperature X-ray structures demonstrated the effect of chemical modifications on Mpro inhibition, showing that (1) maintaining correct geometry of an inhibitor's P1 group is essential to preserve the hydrogen bond with the protonated His163; (2) a positively charged linker is preferred; and (3) subsite S2 prefers nonbulky modestly electronegative groups.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34705466      PMCID: PMC8565456          DOI: 10.1021/acs.jmedchem.1c01475

Source DB:  PubMed          Journal:  J Med Chem        ISSN: 0022-2623            Impact factor:   7.446


Introduction

Since the start of the coronavirus disease 2019 (COVID-19) pandemic in early 2020, several preventative and treatment options have been developed, including several vaccines and antiviral therapies.[1−4] The COVID-19 vaccines developed in record time are now potentially saving millions of lives. However, due to vaccine hesitancy, pre-existing health conditions, and vaccine escape variants of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2),[5] a significant proportion of the population will remain at risk, creating an urgent priority to advance existing therapeutics. COVID-19 is caused by a novel coronavirus, SARS-CoV-2, believed to be of zoonotic origin,[6,7] raising concerns that other easily transmissible respiratory viruses will emerge to cause future pandemics. The arsenal of therapeutic intervention options will undoubtedly be expanded by designing multiple small-molecule drugs that inhibit various viral targets disrupting essential steps in the SARS-CoV-2 replication cycle.[8] This bolstered preparedness strategy has the potential to yield broad-spectrum antivirals providing a means of tackling future pathogenic coronaviruses. 3-Chymotrypsin-like protease (3CLpro), also known as the main protease (Mpro), from SARS-CoV-2 is a viral cysteine protease enzyme[9] and an important drug target that has attracted considerable attention from structural and computational biologists and drug designers.[10−15] SARS-CoV-2 is a single-stranded, positive-sense RNA virus with a genome of ∼30k nucleotides resembling mRNA.[16] Host cell ribosomes partially translate the genomic mRNA to generate two polyproteins, pp1a and pp1ab, encoded by the viral replicase gene during the initial steps of the virus replication cycle.[16,17] The proteolytic cleavage of pp1a and pp1ab is vital for SARS-CoV-2 proliferation and liberates 16 individual viral protein components necessary for the viral genome transcription activity. This essential step in the SARS-CoV-2 replication cycle is accomplished by Mpro, and another cysteine protease, the papain-like protease (PLpro), through hydrolyzing peptide bonds within the two polyproteins at specific locations. Small-molecule inhibitors of the viral protease enzymatic activity have strong clinical precedence for blocking virus replication, and hence, the fervent interest of the scientific community to develop coronavirus-specific protease inhibitors. The active site of Mpro is distinct from the known human proteases; thus, off-target binding of specific Mpro inhibitors can be minimized.[10,11] Conversely, conservation of the Mpro active site across various coronaviruses creates an opportunity to design pan-coronavirus antivirals.[18] Significant effort in the design of Mpro inhibitors against SARS-CoV-2[8,19−21] has focused on the reversible and irreversible (suicidal) covalent inhibitors, including compounds initially designed for the inhibition of SARS-CoV Mpro.[22−25] Such inhibitors contain chemical groups, or warheads, that are reactive toward the catalytic cysteine of Mpro, Cys145. The hepatitis C virus clinical protease inhibitor boceprevir and the feline peritonitis virus protease inhibitor GC-376 were initially considered for drug repurposing, leading to the rational design of hybrid inhibitors.[26−28] In addition, noncovalent competitive and allosteric inhibitors have captured interest due to the availability of high-throughput virtual and experimental screening of large compound libraries that leverage new advances in supercomputing and fast X-ray crystallographic screening.[15,29−32] Compounds can be fed into structure-based drug design pipelines and chemically modified to improve their potency to inhibit SARS-CoV-2 Mpro.[33−36] Moreover, noncovalent inhibitors may have a higher selectivity for Mpro compared to covalent compounds that can also target host proteases and can possess elevated cytotoxicity due to binding to other human proteins.[33,36] The active site of Mpro consists of subsites S5–S1′, which can accommodate substrate and inhibitor groups at positions P5–P1′. Subsites S1 and S2 are selective for Gln and a medium-sized hydrophobic residue like Leu or Phe, respectively.[11,37−39] A recent study analyzed the effect of chemical modifications in a noncovalent inhibitor ML188 on its binding to Mpro.[36] Here, we report a structure–activity relationship (SAR) study performed on a competitive noncovalent inhibitor Mcule-5948770040 (compound 1) of a novel scaffold that we recently discovered through a large-scale virtual screening and validated using in vitro enzyme inhibition assays and X-ray crystallography.[40] The aim of our SAR study was to chemically modify compound 1 that binds across the Mpro catalytic site in the substrate-binding subsites S1 and S2 to reveal structural, electronic, and electrostatic determinants of these ligand-binding sites. Compound 1 has a general architecture of P1–linker–P2 (Scheme ). We initiated the study by obtaining a joint X-ray/neutron (XN) structure of the Mpro-1 complex at near-physiological temperature and neutral pH (Figure a). The XN structure permitted us to fully map the hydrogen positions (observed as deuterium atoms) in the Mpro active site and compound 1, accurately determining protonation states of the enzyme amino acid residues and the inhibitor. With this information in hand, we systematically derivatized P1, P2, and linker groups producing a series of compounds; named as the HL-3 series (Scheme ). A virtual reality-assisted structure analysis and small-molecule building were employed to generate derivatives of 1, considering the geometric constraints of the Mpro subsites S1 and S2 and the feasibility of the syntheses. In vitro enzyme inhibition assays demonstrated the effect of chemical modifications on the ability of the modified compounds to inhibit Mpro. In contrast, subsequent X-ray crystallographic analysis at room temperature identified the structural determinants for P1, P2, and linker binding. Moreover, we designed an improved inhibitor, compound HL-3-68, that showed several-fold better inhibition of Mproin vitro.
Scheme 1

Chemical Diagrams of Mcule-5948770040 (Compound 1) and Its Derivatives

For the generalized structure of the HL-3 series not all possible combinations of X, Y, and Z substituents were examined. The full list of HL-3 series of compounds, including their chemical structures, is given in Table S1.

Figure 1

Compound 1 binds to the S1 and S2 subsites of Mpro. (a) Dimer of Mpro-1 is represented with one protomer as cartoon (green) and the other as surface (white). The locations of subsites S1 (pink) and S2 (purple) are highlighted. The S2 helix and S5 loop are labeled in orange. The Cys145 and His41 catalytic dyad are shown as sticks. PDB ID 7N8C. (b) Chemical structure of compound 1 as observed in Mpro-1. (c) Compound 1 from Mpro-1 shown in a ball-and-stick representation (teal carbons). H/D-exchanged D atoms are colored in orange. The electron density 2Fo – Fc map (blue mesh) and nuclear density 2Fo – Fc map (orange mesh) contoured to 1σ.

Compound 1 binds to the S1 and S2 subsites of Mpro. (a) Dimer of Mpro-1 is represented with one protomer as cartoon (green) and the other as surface (white). The locations of subsites S1 (pink) and S2 (purple) are highlighted. The S2 helix and S5 loop are labeled in orange. The Cys145 and His41 catalytic dyad are shown as sticks. PDB ID 7N8C. (b) Chemical structure of compound 1 as observed in Mpro-1. (c) Compound 1 from Mpro-1 shown in a ball-and-stick representation (teal carbons). H/D-exchanged D atoms are colored in orange. The electron density 2Fo – Fc map (blue mesh) and nuclear density 2Fo – Fc map (orange mesh) contoured to 1σ.

Chemical Diagrams of Mcule-5948770040 (Compound 1) and Its Derivatives

For the generalized structure of the HL-3 series not all possible combinations of X, Y, and Z substituents were examined. The full list of HL-3 series of compounds, including their chemical structures, is given in Table S1.

Results

Protonation States in the Mpro-1 Complex

Neutron crystallography of the Mpro-1 complex was employed to accurately resolve the positions of hydrogen atoms in the active site of Mpro and compound 1. Neutron and X-ray diffraction datasets to 2.5 and 2.2 Å resolutions, respectively, were collected at room temperature and neutral pH from a large deuterated protein crystal and then jointly refined to produce accurate positions of both deuterium and heavy atoms (Table S1). Henceforth, the analysis includes comparisons to previously determined neutron structures of ligand-free Mpro[41] and Mpro bound to the covalent α-ketoamide inhibitor telaprevir.[42] Compound 1 (Figure b) was modeled into the electron and nuclear density maps with high confidence (Figure c). For hydrogen bonds, distances between a deuterium (D) atom and the heavy atom are reported henceforth. Direct interactions between Mpro and the P1 and P2 groups of 1 are shown in Figure a with 2Fo – Fc and D-omit Fo – Fc nuclear density maps. The uracil-like P1 group of 1 contains a carbonyl that forms a short 1.7 Å hydrogen bond with a doubly protonated His163. This carbonyl’s second lone pair of electrons makes a weaker unconventional C–H···O hydrogen bond with Cδ2 of His172. The other carbonyl of the P1 group forms a D2O-mediated interaction with Ser1′ of the second Mpro protomer. Another D2O-mediated interaction to Asn142 arises from the amide ND group at the P1 group’s 2 position, whereas the amide ND at the 4-position forms a 2.0 Å hydrogen bond with the carboxylate side chain of Glu166. To facilitate this interaction, the Glu166 carboxylate rotates from its position observed in the ligand-free Mpro structure toward 1, and His163 gains a D atom on Nε2 to become positively charged in Mpro-1 (Figure S1a). Interestingly, the same conformational change of the Glu166 side chain was observed in the telaprevir-bound neutron structure, where His163 was also found in the doubly protonated state, relative to the ligand-free Mpro (Figure S1b). However, telaprevir possesses a short hydrophobic norvaline P1 group that prevents a direct polar interaction with the enzyme; instead, a water molecule is recruited to this position to hydrogen bond with His163. The aromatic dichlorobenzene P2 group orchestrates itself into the hydrophobic S2 subsite by displacing Met49 and S2 helix (residues 46–52) and rearranging His41 and Gln189 to create π–π stacking interactions.[40] P1 and P2 groups of 1 are connected by a saturated heterocyclic piperazine–amide linker that includes a carbonyl aimed toward the oxyanion hole and a potentially ionizable tertiary amine preceding P2. Analysis of the nuclear density demonstrated that the latter amine nitrogen is protonated with the D atom directed away from His41 and into the bulk solvent. As a result, compound 1 is a cation with a +1 positive charge.
Figure 2

Atomic details of the SARS-CoV-2 Mpro complex with compound 1 determined by X-ray/neutron crystallography. (a) Mpro protonation states and molecular interactions with 1. Protein structures are presented as cartoon with side chains and the ligand in ball-and-stick representation. H-bonds are represented as dashes, while π-interactions are represented as blue dash-dots. Distances are in angstrom. The 2Fo – Fc nuclear density map as an orange mesh is contoured at 1σ. Omit maps for D atoms are shown as a purple mesh and contour levels are adjusted for clarity as follows: linker amine D is 2.0σ, P1 amine Ds is 3.0σ, His41 Nε2 D is 2.5σ, His163 Nδ1 D is 4.0σ, and Nε2 D is 3.0σ. (b) H-bond network of the catalytic D2O and His41 side chain. The 2Fo – Fc nuclear density map as an orange mesh is contoured at 1.5σ. The omit map for His164 Nδ1 D is contoured at 3.0σ. (c) Superposition of Mpro-1 (blue carbons) and Mpro ligand-free (light orange carbons, PDB code 7JUN) showing His41 flip and Gln189 shift in the complex. Red arrows indicate conformational shifts from ligand-free to compound 1 complex. Ligand-free is labeled in blue where different. Superposition calculated by least-squares fitting on Cα atoms.

Atomic details of the SARS-CoV-2 Mpro complex with compound 1 determined by X-ray/neutron crystallography. (a) Mpro protonation states and molecular interactions with 1. Protein structures are presented as cartoon with side chains and the ligand in ball-and-stick representation. H-bonds are represented as dashes, while π-interactions are represented as blue dash-dots. Distances are in angstrom. The 2Fo – Fc nuclear density map as an orange mesh is contoured at 1σ. Omit maps for D atoms are shown as a purple mesh and contour levels are adjusted for clarity as follows: linker amine D is 2.0σ, P1 amine Ds is 3.0σ, His41 Nε2 D is 2.5σ, His163 Nδ1 D is 4.0σ, and Nε2 D is 3.0σ. (b) H-bond network of the catalytic D2O and His41 side chain. The 2Fo – Fc nuclear density map as an orange mesh is contoured at 1.5σ. The omit map for His164 Nδ1 D is contoured at 3.0σ. (c) Superposition of Mpro-1 (blue carbons) and Mpro ligand-free (light orange carbons, PDB code 7JUN) showing His41 flip and Gln189 shift in the complex. Red arrows indicate conformational shifts from ligand-free to compound 1 complex. Ligand-free is labeled in blue where different. Superposition calculated by least-squares fitting on Cα atoms. The catalytic site of Mpro is composed of a noncanonical catalytic dyad, Cys145 and His41, thought to be assisted by a highly coordinated conserved water molecule (D2Ocat). The catalytic dyad exists in the zwitterionic state in ligand-free Mpro as discrete anionic thiolate and cationic imidazolium side chains.[41] The hydrogen-bonding landscapes of His41 and D2Ocat in Mpro-1 are presented in Figure b. In this complex, the catalytic dyad is neutralized with Cys145 observed as a thiol (-SD), and His41 singly protonated on Nε2 but not on Nδ1. The Cys145 thiol deuterium was refined to ∼85% D occupancy. Relative to the ligand-free Mpro and Mpro-telaprevir neutron structures, the His41 side chain is flipped about 180°, its position in Mpro-1 being stabilized by a new hydrogen bond formed between Nε2-D and the His164 main chain carbonyl (Figures c and S1b). Consequently, a conserved hydrogen bond with D2Ocat observed in the other two neutron structures is eliminated to create a new hydrogen bond made by the Nδ1 with a D2O molecule recruited from the bulk solvent. D2Ocat in Mpro-1 is oriented by donating its D atoms in hydrogen bonds with the His164 and Asp187 side chains in a rotation pose not seen in either the ligand-free or telaprevir-bound neutron structures (Figure S1a,b). In contrast to ligand-free Mpro in which His164 is doubly protonated, His164 is neutrally charged in Mpro-1, possessing strong nuclear density for Nδ1-D that is hydrogen-bonded to Thr175 (Figure b). Moreover, this neutral protonation state configuration of His164 is different from that observed in the Mpro-telaprevir neutron structure where Nε2 is protonated, and the hydrogen bond to Thr175 is absent. Thus, in Mpro-1, D2Ocat rotates to donate its D atoms in hydrogen bonds with His164 Nε2 and Asp187 carboxylate.

Design of Compound 1 Derivatives: HL-3 Series

The architecture of compound 1 can be divided into three parts: P1 and P2 groups and the linker (Scheme and Figure b). P1 is a uracil-like, 6-pyrimidine-2,4(1H,3H)-dione, substituent capable of direct hydrogen bonding with Mpro residues His163 and Glu166 in the S1 subsite. The linker is a piperazine-1-carbonyl moiety whose carbonyl group is anchored by the oxyanion hole of the catalytic site. P2, a substituted benzene substituent, inserts into the mostly hydrophobic S2 subsite bordered by Met49 from the S2 helix, main chain atoms of Arg188 and Gln189, and side chains of His41, Cys44, Tyr54, and Gln189. The Tyr54 phenolic hydroxyl is the only group that potentially can play a role of a hydrogen-bonding partner with a P2 group of an inhibitor. To improve inhibitor affinity and probe the chemical determinants for ligand binding to S1 and S2, we designed an array of compound 1 derivatives, which we call HL-3 series, to methodically assess how the P1, linker, and P2 groups contribute to Mpro inhibition. Our design strategy of compound 1 derivatives, i.e., HL-3 compounds (Scheme and Table ), included chemical modifications to each of its three parts, taking into consideration specific geometric and hydrogen-bonding constraints of S1 and S2 subsites and the feasibility of syntheses for designed compounds. We employed virtual reality software to immerse into the Mpro structure, to modify the scaffold of compound 1, and to perform structural analysis of the modeled complexes. First, we examined the effect of saturating the olefinic portion of the uracil-like (pyrimidine-2,4(1H,3H)-dione) P1 group that creates a nonplanar sp3-hybridized endocyclic carbon. Both enantiomers were made to mimic the γ-lactam ring commonly used as inhibitor’s P1 (HL-3-51, HL-3-53). Such modification should not eliminate the P1 group’s ability to hydrogen bond with His163, which is a prerequisite for Mpro inhibitors. Next, we examined the effect of removing a positive charge from the linker, where the aniline nitrogen is observed in the protonated quaternary ammonium state in our neutron structure. To achieve this, the aniline nitrogen was replaced with a saturated carbon by substitution of a piperidine moiety for the piperazine in the linker (HL-3-69). The most extensive modifications were made to the P2 group by varying substituents at positions 3, 4, and 5 of the phenyl ring. We examined the effect of removing one Cl from either position 3 or 4 and synthesized singly substituted derivatives at position 4 containing groups such as F, I, CN, CF3, CHO (aldehyde), and CH2OH. In addition, we retained Cl in position 3 and varied substituents in position 4 to include Br, CF3, CHO, and CH2OH. In another compound, Cl in position 4 remained, but position 3 contained a CF3 group. Finally, we investigated derivatives with three substituents in positions 3, 4, and 5. In this series, the 3,5-meta positions had Cl groups, whereas para position 4 consisted of Cl, CF3, or CH3.
Table 1

50% Inhibitory Concentration (IC50) Values for the Inhibition of SARS-CoV-2 Mpro by a Series of HL-3 Compoundsac

X-ray crystallographic statistics for the obtained structures is given in Table S2.

95% confidence interval (CI).

N/D—not determined.

X-ray crystallographic statistics for the obtained structures is given in Table S2. 95% confidence interval (CI). N/D—not determined. Various halides (F, Cl, I), nitrile, and trifluoromethyl were examined at the para position to modulate the electronic properties of the P2 group. In contrast, aldehyde and hydroxymethyl substituents were investigated to determine whether the Tyr54 side chain can act as a hydrogen-bond partner, donor or acceptor. Sterically bulkier substituents were not considered for synthesis due to geometric constraints of the S2 subsite.

Inhibition of Mpro by HL-3 Compounds

To determine the effect of the chemical modifications within the HL-3 series of compounds on their ability to inhibit Mpro compared to compound 1, an initial Mpro activity inhibition screen was performed at a 20 μM inhibitor concentration. Eight compounds showed at least 50% Mpro activity inhibition at 20 μM (Table ) and were further characterized by assaying inhibition across a range of concentrations to determine 50% inhibitory concentration (IC50) values (eq ). The rest of the compounds were excluded from further measurements because their IC50 values were well above 20 μM. Compound 1 produced an IC50 of 0.68 μM in the current experiments (Table ). Replacing P1 uracil-like group with either of dihydropyrimidine-2,4(1H,3H)-dione enantiomers (HL-3-51 and HL-3-53) had a considerable effect, with the IC50 values being above 20 μM for the two derivatives. Therefore, the structurally conserved S1 subsite cannot accommodate the bent structures of these P1 groups. Interestingly, replacing the protonated positively charged aniline nitrogen with carbon to give HL-3-69 also negatively affected the compound’s potency (IC50 = 8.8 μM), even though this nitrogen does not make close contacts with Mpro residues. Hence, it appears that more than an order of magnitude increase in IC50 for HL-3-69 compared to compound 1 can be attributed to the altered electrostatics of the inhibitor. The removal of the Cl substituent at the 3-position of the P2 group to generate the monosubstituted derivative HL-3-45, and then substituting Cl at the 4-position with various groups, such as F, I, CHO, CH2OH, CN, and CF3, leads to dramatic losses of these derivatives’ potencies. A similar result was obtained for compounds HL-3-46 and HL-3-71, in which the chlorine at the 4-position is replaced, respectively, with −H or −CH2OH. These observations clearly indicated that −Cl at the 3-position of the P2 group is essential for a compound to maintain its potency. Indeed, adding Cl back to the 3-position for monosubstituted compounds with CHO (HL-3-44) or CF3 (HL-3-50) at the 4-position significantly restored potency, with HL-3-70 and HL-3-63 possessing IC50 values of of 6.2 and 6.4 μM, respectively. However, the potencies of HL-3-70 and HL-3-63 are an order of magnitude worse than that of compound 1, indicating that the presence of sterically larger and more electronegative substituents than Cl at the 4-position reduces an inhibitor’s potency. Replacing Cl at the 4-position with −Br, whose van der Waals radius of 1.97 Å is slightly larger than that of Cl (1.90 Å) and electronegativity lower than that of Cl, while maintaining Cl at the 3-position, resulted in compound HL-3-78, which demonstrated potency as good as compound 1. Unexpectedly, we determined that when Cl and CF3 in HL-3-63 swap positions to give HL-3-52, the IC50 improves several folds to 1.4 μM, which is only about twice as high as that for compound 1. Adding an extra Cl substituent to HL-3-63 at the 5-position (HL-3-87) caused a similar reduction in IC50. The latter two observations demonstrated that the 3-position of the P2 group could accommodate a bulkier and more electronegative substituent, and addition of Cl to the 5-position on the benzene ring is beneficial for an inhibitor’s potency. With this in mind, we analyzed the potencies of two molecules, HL-3-68 and Mcule-CSR-494190-S1, with −Cl in positions 3 and 5 and either Cl or CH3 at position 4, respectively (Table ). Both showed improved inhibition potency relative to compound 1, indicating the preference of the S2 subsite for less bulky substituents with moderate-to-low electronegativity. We thus determined Ki values for HL-3-68 and Mcule-CSR-494190-S1 from the initial rates determined across a range of inhibitor and substrate concentrations (eq . Ki measurements confirmed that the inhibitor HL-3-68 had the highest affinity, with a sub-μM Ki of 0.89 μM, followed by Mcule-CSR-494190-S1 with a Ki of 1.4 μM. The significant accuracy of Ki measurements indicates that the potencies of HL-3-68 and Mcule-CSR-494190-S1 are 3-fold and 2-fold better than the Ki of 2.9 μM of the previously reported compound 1.[40] Of note, none of the compounds demonstrated antiviral activities against SARS-CoV-2 in cell-based assays (Figure S2).
Table 2

Binding Affinities of Compound 1 and Selected Inhibitors Determined by In Vitro Assays (Ki) and Isothermal Titration Calorimetry (ITC) (Kd and Thermodynamic Parameters ΔH, ΔS, and ΔG of Binding)

Isothermal Titration Calorimetry (ITC)

To directly assess the thermodynamic binding properties of compound 1 and the two most potent inhibitors HL-3-68 and Mcule-CSR-494190-S1, we performed isothermal titration calorimetry (ITC, Table and Figure S3). The Kd values measured by ITC are in good agreement with the Ki values obtained by enzyme kinetics. HL-3-68 demonstrates sub-μM affinity to Mpro, binding ∼2-fold better to the enzyme than the other two compounds. The binding of all three compounds to Mpro is driven primarily by enthalpy. Compound 1 binds to the enzyme with essentially no change in entropy, whereas ΔS of binding for Mcule-CSR-494190-S1, although small (−3.16 cal/(mol·K)), is negative and therefore contributes unfavorably to the binding with −TΔS of +0.95 kcal/mol. Conversely, substitution of the methyl at the 4-position of the P2 group with Cl reverts ΔS to a small positive value of 2.4 cal/(mol·K). Interestingly, ΔH of binding is the most favorable for Mcule-CSR-494190-S1, although its ΔS is the most unfavorable. The interplay of the enthalpy and entropy components results in HL-3-68 possessing the highest affinity for Mpro (Kd = 0.69 μM).

Room-Temperature X-ray Structures of Mpro in Complex with HL-3 Compounds

To shed light on the SAR between Mpro and HL-3 compounds, we obtained nine room-temperature X-ray structures of Mpro co-crystallized with selected HL-3 compounds at resolutions in the range of 1.85–2.10 Å (Table S2 and Figure S4). We did not obtain the crystal structures of Mpro complexes with other compounds because they were either significantly less potent than compound 1 or their complexes did not crystallize. Comparisons of these structures with Mpro-1 provide insights into how substitutions at positions 3, 4, and 5 of the aromatic P2 group alter the binding of HL-3 compounds and correlate with their inhibition potencies. Inhibitors of all determined structures bind in identical fashion (Figure a) anchored to Mpro by invariant hydrogen bonds (2.6 ± 0.1 Å between the heavy atoms) of the uracil P1 group with His163 and the linker’s carbonyl with the main chain amide nitrogen of Gly143. Some shifts in the positions of the aromatic P2 group up to 0.5 Å are observed due to the various substituents in positions 3, 4, and 5.
Figure 3

Mpro inhibition by HL-3 compounds. (a) Binding modes of nine HL-3 compounds co-crystallized with Mpro. (b) HL-3-68 (cyan) bound in the active site of Mpro (pink). Electron density omit Fo – Fc map (gray mesh) is contoured at 3σ. H-bonds are shown as black dashes. Distances are in angstrom. PDB ID 7RLS. (c) Superposition of Mpro-1 X-ray/neutron structure (PDB ID 7N8C) with HL-3-68 (PDB ID 7RLS) and Mcule-CSR-494190-S1 (PDB ID 7RM2) complex X-ray structures showing rotation of Met49 to accommodate the Cl at 5-position of the aromatic P2 group. Van der Waals contacts for improved inhibitors represented as dotted lines. Superposition was calculated by least-squares fitting on Cα atoms.

Mpro inhibition by HL-3 compounds. (a) Binding modes of nine HL-3 compounds co-crystallized with Mpro. (b) HL-3-68 (cyan) bound in the active site of Mpro (pink). Electron density omit Fo – Fc map (gray mesh) is contoured at 3σ. H-bonds are shown as black dashes. Distances are in angstrom. PDB ID 7RLS. (c) Superposition of Mpro-1 X-ray/neutron structure (PDB ID 7N8C) with HL-3-68 (PDB ID 7RLS) and Mcule-CSR-494190-S1 (PDB ID 7RM2) complex X-ray structures showing rotation of Met49 to accommodate the Cl at 5-position of the aromatic P2 group. Van der Waals contacts for improved inhibitors represented as dotted lines. Superposition was calculated by least-squares fitting on Cα atoms. Mpro-HL-3-68 (Figure b) and Mpro-Mcule-CSR-494190-S1 superimpose with the Mpro-1 joint X-ray/neutron structure with Cα RMSD values in the range of 0.2–0.23 Å. Substituting Cl or CH3 at the 4-position of the P2 probes the effect of the chemical group’s hydrophobicity at the S2 site with a small difference in van der Waals radii. Increased hydrophobicity of the CH3 group in Mpro-Mcule-CSR-494190-S1 compared to Cl in Mpro-HL-3-68 translates structurally only into the nearby Pro52 ring that adopts the exo-conformation instead of the endo geometry, respectively. Pro52 terminates the short S2 α-helix spanning residues 46–52, where the Met49 side chain acts to gate the S2 subsite. The effect on the S2 subsite arising from −Cl at the P2 5-position can be observed here through superpositions of the Mpro-1 neutron structure with the Mpro-HL-3-68 and Mpro-Mcule-CSR-494190-S1 complexes (Figure c). In both structures, the Met49 side chain is sterically rotated away from its position in Mpro-1 to accommodate the additional Cl atom, which enables van der Waals contacts with Cys44 but does not lead to an additional shift in the position of the S2 helix. Crystal structures of complexes exhibiting similar or worse potency compared to compound 1 were also analyzed to assess the structural determinants of noncovalent ligand binding to Mpro. HL-3-78 substitutes Br at the P2 4-position probing the effect of a slightly larger van der Waals radius at this position compared to compound 1. The binding of HL-3-78 resulted in no significant changes in the protein structure but comparable inhibition properties. However, bulkier and highly electronegative CF3 groups at the 3- or 4-position (HL-3-52/63) bring about unfavorable distal shifts in the flexible S5 loop (>1.5 Å for Ala191 Cα, Figure S5a). Tyr54’s phenol oxygen faces the S2 subsite and presents an attractive target for direct H-bonding, as tested by substituting CH2OH (HL-3-71) and CHO (HL-3-70) at the 4-position of the P2. Unfortunately, the Mpro-HL-3-70 and Mpro-HL-3-71 crystal structures show the primary alcohol and the aldehyde oxygens are rotated away from the Tyr54 phenol oxygen, which keeps its conserved hydrogen bond with the main chain carbonyl of Asp187 (Figure S5b). Eliminating the cationic potential of the ligand by changing the linker from piperazine to piperidine (HL-3-69) produced no significant structural changes, except for a 0.4 Å shift in the position of the linker and P2 groups away from the S2 helix reducing van der Waals contacts.

Molecular Dynamics (MD) Simulations

MD simulations of Mpro (ligand-free), the Mpro-HL-3-68, and Mpro-1 complexes reveal a consistent picture of how the HL-3-68 ligand is more stable within the primary binding site of Mpro compared to compound 1 reported in our previous study.[40] We quantified the conformational changes using the root-mean-square deviation (RMSD) analysis across each trajectory (Figures and S6). The distribution of the RMSDs determined from at least three replicates of the simulations (shown in Figure A as a histogram of all conformers from MD trajectories) further reveals that HL-3-68 stabilizes the binding pocket of Mpro more than compound 1. For each system, we did observe slightly different fluctuations in chain A and chain B, which agrees with the previous simulation results.[13,40] Across the three simulation systems, the Mpro-HL-3-68 complex had the lowest average RMSD from both chains when compared with Mpro and the Mpro-1 system (Figure S6). Per-residue fluctuations were characterized by calculating the root-mean-square fluctuations (RMSF) of the Cα-atoms using the average conformation of each trajectory as the reference structure. Despite the fluctuation at the C-termini, the fluctuation patterns are largely in agreement, except that the Mpro-HL-3-68 system depicts suppressed fluctuations across the entire protein (Figure ). Lower RMSFs were observed for the primary ligand-binding site of the Mpro-HL-3-68 system, whereas other regions remained largely unaffected by the binding of the ligand(s). In our previous simulations[40] and as demonstrated here, compound 1 can potentially move away from the primary binding site to occupy various novel sites on the surface of Mpro; however, in crystallographic studies, compound 1 has not been observed to bind to other sites. Given that both crystallographic studies and biochemical assays indicate that the HL-3-68 is more stabilizing, our simulations also confirm that over the course of the time-scales of our simulations, it appears that the HL-3-68 stabilizes the primary interactions in S1 and S2 subsites by “locking” in the site, thus considerably reducing the flexibility of the loops surrounding the primary binding site. Thus, our analyses support the observation that HL-3-68 ligand binding stabilizes the Mpro structure, forming stronger interactions than compound 1.
Figure 4

Root-mean-square analysis of MD trajectories. (A) Distribution of the root-mean-square deviations (RMSD) captured as a histogram over the course of MD simulations from the three replicas showing that chain A exhibits higher fluctuations than chain B in the ligand-free state (top panel). In the ligand-bound states, the asymmetric nature of fluctuations is still present; however, Mpro-HL-3-68 (middle panel) exhibits a considerably lower degree of fluctuations compared to Mpro compound 1 (bottom panel) as a consequence of the increased stability of the interactions within the primary binding site. The starting model was PDB ID 7JUN for apo-Mpro simulation, 7RLS for the HL-3-68 complex, and 7N8C for the complex with compound 1. (B) Root-mean-square fluctuations (RMSFs) of the individual chains captured across the three sets of replicas further reinforce the observation that both ligands stabilize the Mpro-binding site. The shaded regions highlight the variation across three independent replicas of the simulations (for each system). (C) Putty-like representation of the RMSF of the ligand-free Mpro dimer simulations depicts large-scale fluctuations along the primary binding site (red tubes). (D, F, H) Depiction highlighting RMSF in one monomer of the protein, ligand-free protein in (D), Mpro-HL-3-68 complex in (F), and Mpro-1 complex in (H). (E) Putty representation of the RMSF of the Mpro dimer complexed with HL-3-68. (G) Putty representations of the RMSF of the Mpro dimer complexed with compound 1. The ligands in each case are highlighted using a ball-and-stick representation.

Root-mean-square analysis of MD trajectories. (A) Distribution of the root-mean-square deviations (RMSD) captured as a histogram over the course of MD simulations from the three replicas showing that chain A exhibits higher fluctuations than chain B in the ligand-free state (top panel). In the ligand-bound states, the asymmetric nature of fluctuations is still present; however, Mpro-HL-3-68 (middle panel) exhibits a considerably lower degree of fluctuations compared to Mpro compound 1 (bottom panel) as a consequence of the increased stability of the interactions within the primary binding site. The starting model was PDB ID 7JUN for apo-Mpro simulation, 7RLS for the HL-3-68 complex, and 7N8C for the complex with compound 1. (B) Root-mean-square fluctuations (RMSFs) of the individual chains captured across the three sets of replicas further reinforce the observation that both ligands stabilize the Mpro-binding site. The shaded regions highlight the variation across three independent replicas of the simulations (for each system). (C) Putty-like representation of the RMSF of the ligand-free Mpro dimer simulations depicts large-scale fluctuations along the primary binding site (red tubes). (D, F, H) Depiction highlighting RMSF in one monomer of the protein, ligand-free protein in (D), Mpro-HL-3-68 complex in (F), and Mpro-1 complex in (H). (E) Putty representation of the RMSF of the Mpro dimer complexed with HL-3-68. (G) Putty representations of the RMSF of the Mpro dimer complexed with compound 1. The ligands in each case are highlighted using a ball-and-stick representation.

Discussion

The design and development of small-molecule therapeutics are crucial components of the ongoing efforts to battle COVID-19 and to prepare for future pandemics.[43] SARS-CoV-2 Mpro is an attractive target for specific protease inhibitors that can be further developed into clinical drugs. Studying the structure, function, and inhibition of the enzyme in detail is important for accelerating this process. Similarly, understanding the SAR profile of the designed compounds is crucial to determine how structural, electronic, and electrostatic properties of certain chemical groups affect inhibitor binding to the Mpro active site. Therefore, our SAR study was guided by the XN structure of the Mpro-1 complex, where hydrogen atom positions, protonation states, and electric charges of Mpro residues and compound 1 were directly determined, providing the most detailed information to date for an Mpro in complex with a noncovalent inhibitor. We observed in the XN structure of Mpro-1 that Mpro adapts protonation states of the active site residues to maintain a net +1 charge within the binding site found in ligand-free[41] and telaprevir-bound[42] Mpro (Figure and Table ). Protonation state modulations occur through His163 in the S1 subsite, Cys145, and the His41–D2Ocat–His164 network. As predicted computationally,[44] His163’s Nδ1 becomes protonated upon ligand binding. The active site electrostatics can thus be tuned to allow the Cys145 side chain to exist as a thiol in Mpro-1 or as a thiolate primed for catalysis as observed in the ligand-free form. The imidazole ring of His41 is not only flipped 180° relative to its conformation in the ligand-free and telaprevir complexes but also neutral. Hence, while the catalytic dyad is zwitterionic in the ligand-free structure, it is neutral in Mpro-1. Whether the protonation states of the Cys–His dyad are interdependent is currently an open question. Each protonation state combination of His41 and His164 has now been captured individually in the three neutron structures, suggesting that D2Ocat-mediated H-bonding between these two side chains is not required for inhibition or a stable active site. Taken together, the protonation states determined from these three neutron structures suggest a mechanism where charges are shuffled between His163, the catalytic dyad, and His164 maintaining an overall +1 charge by active site residues when binding to inhibitors.
Figure 5

Comparison of protonation states of ionizable residues in the active site of SARS-CoV-2 Mpro determined from neutron crystallographic structures. (a) Mpro bound to compound 1. (b) Ligand-free Mpro (PDB code 7JUN). (c) Mpro complexed with covalent α-ketoamide HCV protease inhibitor telaprevir (PDB code 7LB7). Charges are represented with transparent surface colored gray for neutral, red for positively charged, and blue for negatively charged. Nonpolar deuterium atoms are shown in orange. Mpro-1 determined at pD = 7.4 and 7JUN and 7LB7 are at pH = 7.0.

Table 3

Summary of Protonation States and Corresponding Electric Charges of the Ionizable Residues in the SARS-CoV-2 Mpro Active Site Observed in the Neutron Structures of the Ligand-Free Enzyme and in Complex with Compound 1 and Telaprevir

 Mproligand-free (PDB ID 7JUN)
Mpro-Telaprevir (PDB ID 7LB7)
Mpro-1 (PDB ID 7N8C)
residuechargespecieschargespecieschargespecies
Cys145cat–1thiolate (−S)0S-C-OD (hemithioketal)0thiol (−SD)
His41cat+1Nδ1-D, Nε2-D0Nδ1-D0Nε2-D
His1630Nδ1-D+1Nδ1-D, Nε2-D+1Nδ1-D, Nε2-D
His164+1Nδ1-D, Nε2-D0Nδ1-D0Nε2-D
His1720Nε2-D0Nε2-D0Nε2-D
net charge+1 +1 +1 
Comparison of protonation states of ionizable residues in the active site of SARS-CoV-2 Mpro determined from neutron crystallographic structures. (a) Mpro bound to compound 1. (b) Ligand-free Mpro (PDB code 7JUN). (c) Mpro complexed with covalent α-ketoamide HCV protease inhibitor telaprevir (PDB code 7LB7). Charges are represented with transparent surface colored gray for neutral, red for positively charged, and blue for negatively charged. Nonpolar deuterium atoms are shown in orange. Mpro-1 determined at pD = 7.4 and 7JUN and 7LB7 are at pH = 7.0. The hydrogen bond between the P1 group of the HL-3 compounds and the protonated positively charged His163 is essential for binding. Distorting the P1 group planarity by introducing a partial saturation as in compounds HL-3-51/53 appears to disrupt the hydrogen bond leading to a dramatic loss of affinity. The cationic nature of the linker positioned above the neutral catalytic dyad is beneficial for the compound’s potency. However, the antiviral activity of some cationic drugs may be attributed to induced phospholipidosis rather than their specific function;[45] thus, novel Mpro inhibitors should be designed with this knowledge in mind. We determined that the substituents on the aromatic P2 group should have both moderate steric size and electronegativity as the binding is sensitive to small changes in atomic properties. In addition, compounds with only one substituent on the P2 group are poor inhibitors. Highly electronegative substituents such as F or CF3 are disadvantageous, as are less electronegative but sterically larger CHO and CH2OH, which push against the S5 loop and Tyr54, respectively. Adding a third Cl to position 5 of the P2 group in compound 1 to give HL-3-68 improved inhibition by 2–3-fold based on Ki and Kd values, indicating that its proximity to Cys44 and 3.5–4.3 Å contacts is favorable. It is interesting that our ITC measurements of compound 1, Mcule-CSR-494190-S1, and HL-3-68 binding to Mpro demonstrated that these noncovalent inhibitors bind with a limited hydrophobic effect, i.e., the entropy (ΔS) of binding is small. A combination of several opposing factors may result in the measured values of ΔS of binding. First, the ligand-free Mpro has a few water molecules in the active site,[41] whereas several waters are recruited from the bulk solvent when the inhibitors bind. Second, the P2 groups access the S2 subsite by carving out the pocket blocked by Met49 and the S2 helix, limiting the conformational space for favorable binding. Third, a compound would lose some conformational freedom once bound to Mpro. These three factors would contribute unfavorably to the ΔS of binding, while the loss of the compound’s hydration shell when it binds to the enzyme would increase the entropy, contributing favorably to the ΔS of binding. In this way, substituting Cl with CH3 at the 4-position of the P2 produces enough difference in conformational entropy and hydration entropy to elicit significant differences in the ΔS of binding. Changes in the protein dynamics upon inhibitor binding, and specifically, in the vibrational dynamics,[46−48] would also contribute to the ΔS (and ΔH) of binding, although the effect of these changes is not known.

Conclusions

In summary, the current SAR study of Mpro combines neutron and X-ray crystallography, chemical synthesis, in vitro measurements, and molecular dynamics simulations to profile the binding of a noncovalent ligand discovered through a new high-throughput screening approach.[40] Protonation states of critical side chains in the Mpro active site are intrinsically variable, thus hard to predict, a feature that presents challenges for in silico modeling and inhibitor design. The active site and especially the hydrophobic S2 pocket are sensitive to small changes in ligand properties. We show that one atom differences in the studied noncovalent ligand’s P2 group were enough to significantly alter the binding entropy, potency, and complex dynamics. Taken together, these characterization techniques elucidate new details of Mpro as a drug target.

Experimental Section

General Information

Virtual reality-assisted analysis and model building were used to design and visualize derivatives of compound 1 (Mcule-5948770040). Modifications to the ligand scaffold starting from PDB code 7LTJ(40) were modeled and assessed using the MedChem tool in Nanome.[49,50] Nickel-nitrilotriacetic acid (Ni-NTA) columns were purchased from Cytiva (Piscataway, New Jersey). His-tagged human rhinovirus (HRV) 3C protease was purchased from Sigma (MilliporeSigma, St. Louis, MO). Crystallization reagents and supplies were purchased from Hampton Research (Aliso Viejo, California). Crystallographic supplies for crystal mounting and X-ray and neutron diffraction data collection at room temperature were purchased from MiTeGen (Ithaca, New York) and Vitrocom (Mountain Lakes, New Jersey). The FRET substrate DABCYL-KTSAVLQSGFRKM-E(EDANS) trifluoroacetate salt was purchased from Bachem (PN 4045664). NMR spectra were obtained on a Bruker Avance NEO NMR console coupled to an 11.74 T actively shielded magnet (Magnex Scientific/Varian) operating at 499.717 MHz for proton at the Center for Nanophase Materials Sciences. Mcule-5948770040 and 3,5-dichloro,4-methylphenyl derivative (Mcule-CSR-494190-S1) were purchased from Mcule, Inc (Palo Alto, California). All HL-3 compounds were synthesized at the Center for Nanophase Materials Sciences (Oak Ridge National Laboratory). Full details of the syntheses, NMR, and laser desorption ionization-time of flight (LDI-TOF) structural data are provided in the Supporting Information section.

General Synthesis Procedure of HL-3 Compounds

The HL-3 compounds were synthesized using techniques described previously.[51] In a typical procedure, a 20 mL vial was charged with a stir bar, the appropriate aryl piperazine derivative (1 equiv), orotic acid (1 equiv), HOBt·H2O (0.07 equiv), and anhydrous dimethylformamide (DMF) (5 mL) at ambient temperature. The solution was cooled to 0 °C, and N-(3-dimethylaminopropyl)-N′-ethylcarbodiimide hydrochloride (EDC·HCl) (1 equiv) was added. The reaction mixture was stirred at room temperature overnight, and DMF was removed under reduced pressure. The residue was purified by silica gel chromatography (DCM to DCM/CH3OH = 10:1). The removal of the solvents yielded the products as solids in yields >90%. Purity is >95% for all compounds as measured by NMR and mass spectrometry analyses (see the Supporting Information).

Gene Construction, Expression, and Purification of Hydrogenated and Partially Deuterated SARS-CoV-2 Mpro

A codon-optimized gene sequence of Mpro (NSP5) from SARS-CoV-2 was cloned into a plasmid harboring the kanamycin resistance cassette (pD451-SR, Atum, Newark, CA). The Mpro construct is flanked upstream by a gene for maltose-binding protein (MBP) and downstream by a His6 tag.[52] The native N-terminus is achieved during expression through an Mpro autoprocessing site SAVLQ↓SGFRK, where ↓ denotes the cleavage site, corresponding to the cleavage between NSP4 and NSP5 in the viral polyprotein. The native C-terminus is produced through an HRV-3C protease cleavage site (SGVTFQ↓GP). This strategy enables a two-step Ni-affinity chromatography purification. Hydrogenated Mpro was expressed in Escherichia coli and purified according to the established procedures.[52] Partially deuterated Mpro was expressed using a bioreactor and purified, as described recently.[42] Final protein yields for hydrogenated Mpro preparations averaged ∼4 mg per 1 g cells (∼17 mg/L of cell culture), whereas partially deuterated preparations yielded ∼0.8 mg per 1 g of cell paste (∼40 mg/L of cell culture). Similar in strategy to the above, a second construct was also used to express and purify the wild-type Mpro. This construct differs from the first construct by having an additional 36 residue spacer sequence corresponding to the immunoglobulin-binding domain B1 of GB1 inserted between MBP and a 6-amino-acid flanking nsp4 sequence TSAVLQ. Expression and purification were carried out, as described above and in ref (39). Peak fractions were concentrated and stored in aliquots at −20 °C. This second source of Mpro was used to determine the binding constants by ITC independent of the measurement carried out through enzyme kinetic measurements using the first enzyme source.

Crystallization of the Mpro–Inhibitor Complexes

Detailed instructions for crystallizing high-quality Mpro crystals starting from hydrogenated and partially deuterated enzymes are accessible.[42,53] Crystallization conditions for flower-shaped crystal aggregates of Mpro were initially discovered by automated high-throughput screening at the Hauptman-Woodward Medical Research Institute (HWI).[54] Crystal aggregates of apo-Mpro were reproduced locally and converted into microseeds for seeding subsequent crystallization experiments. Protein for co-crystallization was concentrated to 5 mg/mL in 20 mM Tris, 150 mM NaCl, and 1 mM tris(2-carboxyethyl)phosphine (TCEP), pH 8.0, and used fresh or stored at −30 °C for no longer than 2 weeks. Stock derivatives (50 mM) of compound 1 were prepared in 100% dimethyl sulfoxide (DMSO) for crystallization purposes and stored at −30 °C. Mpro was mixed with the ligands at a 1:5 molar ratio and allowed to incubate at room temperature for a minimum of 30 min prior to setting up crystal trays. All crystals grown for room-temperature X-ray diffraction used sitting drop vapor diffusion methodology with 18–21% poly(ethylene glycol) 3350 (PEG3350), 0.1 M Bis–Tris, pH 6.5 or 7.0, as the precipitant solution. Crystallization drop volumes of 20–30 μL at a 1:1 ratio were seeded with 0.2 μL of apo-Mpro microseeds (1:200 dilution). Crystals appeared after 3 days of incubation at 14 °C and continued to grow in volume for additional 7 days. Typical Mpro–ligand complex crystals grew to ∼0.1 mm3 before mounting in MiTeGen (Ithaca, NY) room-temperature capillary setups (Figure S7). The crystal used for joint XN crystallography started from partially deuterated Mpro at 10 mg/mL mixed with compound 1 at a 1:5 molar ratio, incubated at room temperature for 30 min, and filtered through the 0.2 μm centrifugal filter. A Hampton nine-well sandwich box was set up with 220 μL of drops at a 1:1 ratio of protein to 18% PEG3350, 0.1 M Bis–Tris, pH 7.0, reservoir solution and 0.2 μL of microseeds at 1:200 dilution. After 11 days of incubation at 14 °C, the temperature was reduced to 12 °C, and crystals were allowed to grow for 30 more days. This process afforded three protein crystals of >2 mm3 in volume, with the final crystal used for neutron data collection measuring ∼2 × 1.5 × 0.7 mm3 (2.1 mm3) (Figure S8). The crystal was mounted in a fused quartz capillary accompanied by 19% PEG3350 prepared with 100% D2O to allow labile hydrogens to exchange at 18 °C for 2 weeks before starting neutron data collection. The final pH of the crystallization drop at the time of crystal mounting was 7.0 as measured by a microelectrode, corresponding to a final pD of 7.4 (pD = pH + 0.4).

Room-Temperature X-ray Diffraction Data Collection and Structure Refinement

All room-temperature X-ray crystallographic data were collected with a Rigaku HighFlux HomeLab instrument equipped with a MicroMax-007 HF X-ray generator, Osmic VariMax optics, and a DECTRIS Eiger R 4M hybrid photon counting detector. Diffraction data were integrated using the CrysAlis Pro software suite (Rigaku Inc., The Woodlands, TX) and then reduced and scaled using Aimless[55] from the CCP4 suite.[56] Structures were solved by molecular replacement using PDB code 7LTJ(40) with Phaser[57] from CCP4. Each model was iteratively refined using Phenix.refine from PHENIX suite[58] and COOT[59,60] graphics program aided by Molprobity[61] for geometry validation. All ligand restraints were generated with eLBOW[62] using geometry optimized by quantum mechanical calculations in Gaussian 16 at the B3LYP/6-31g(d,p) level of theory.[63] Final data collection and refinement statistics are organized in Table S2.

Neutron Diffraction Data Collection

Room-temperature neutron diffraction data were collected using the Macromolecular Neutron Diffractometer Instrument (MaNDi) at the Spallation Neutron Source of the Oak Ridge National Laboratory.[64−66] The crystal was held stationary at room temperature, while diffraction data were collected for 20 h using all neutrons between 2 and 4.16 Å. Following this, the crystal was rotated by Δϕ = 10°, and a subsequent data frame was collected again for 20 h. A total of 21 data frames were collected in the final neutron data set. Diffraction data were reduced using the Mantid package, with integration carried out using three-dimensional TOF profile fitting.[67] Wavelength normalization of the Laue data was performed using the Lauenorm program from the Lauegen suite.[68] Neutron data collection statistics are shown in Table S1.

Joint X-ray/Neutron (XN) Refinement

Joint XN refinement of the deuterated Mpro-1 complex was performed using nCNS,[69,70] and the structure was manipulated in COOT.[59,60] After initial rigid-body refinement, several cycles of positional, atomic displacement parameter, and occupancy refinement were run. Correctness of side-chain conformations, hydrogen bonding, and orientations of D2O water molecules in the structure was based on the mFo – DFc difference neutron scattering length density maps. The mFo – DFc and mFo – DFc neutron scattering length density maps were then examined to determine the correct orientations of hydroxyl (Ser, Thr, Tyr), thiol (Cys), and ammonium (Lys) groups as well as protonation states of the enzyme residues and compound 1. The protonation states of some disordered side chains on the protein surface could not be obtained directly and remained ambiguous. Water molecules were refined as D2O where water oxygen atoms were centered on their electron density peaks and each molecule was rotated in accordance with the neutron scattering length density maps. Hydrogen positions in the protein were modeled as deuterium atoms because the protein was partially deuterated. Compound 1 is ionizable at the piperazine amine and was modeled as the protonated species with a D atom. Occupancies of D atoms were refined individually within the range of −0.56 (pure H) to 1.00 (pure D) because the neutron scattering length of H is −0.56 times that of D. Before depositing the neutron structure to the PDB, coordinates of a D atom were split into two records corresponding to an H and a D partially occupying the same site, both with positive partial occupancies that add up to unity. The percent D at a specific site is calculated according to the following formula: % D = {occupancy(D) + 0.56}/1.56.

Enzyme Inhibition Assay

Compounds were dissolved in 10 mM DMSO and stored at −20 °C. The initial rates of Mpro were measured, and the data were analyzed using a previously established fluorescence resonance energy transfer (FRET) peptide substrate assay method.[71,72] For the initial inhibition screen, performed in duplicate, final assay concentrations were 250 nM enzyme, 20 μM inhibitor, and 40 μM peptide substrate. Inhibitors with 50% residual activity or less in the initial screen were further characterized across seven inhibitor concentrations in the range of 0.03–100 μM in at least duplicate, and the resulting initial rates were normalized with 0 as 0% residual activity and the average of positive control rates without the inhibitor as 100% residual activity. The [inhibitor] vs normalized response–variable slope equation in GraphPad Prism 9 was fit to the normalized data to determine IC50 valueswhere A is the residual activity, IC50 is the inhibitor concentration at which 50% inhibition is observed, [I] is the inhibitor concentration, and b is the Hill slope. The two inhibitors with the lowest IC50 values were further characterized to determine their Ki values, as previously described for compound 1.[40] Initial rates were measured in triplicate without the inhibitor and with the inhibitor at final concentrations of 2.5, 7.5, and 25 μM, with 150 nM enzyme, and final concentrations of the substrate in the range 20–500 μM. The competitive inhibition equation in GraphPad Prism 9 was fit to the resulting initial rates, v0, to determine the Michaelis–Menten enzyme parameters Vmax and KM and the Ki affinities of the inhibitors.

Isothermal Titration Calorimetry

Purified wild-type Mpro was diluted from a stock solution to 60 μM and dialyzed overnight at 4 °C against 25 mM Tris–HCl, pH 7.2, 20 mM NaCl, and 1 mM TCEP (ITC buffer). The concentration of the enzyme was estimated based on its 280 nm absorbance. Stock solutions of inhibitors were diluted in ITC buffer to 0.3 mM and contained a final concentration of 0.5% DMSO. The protein solution was also adjusted to contain the same concentration of DMSO. Titrations were performed at 28 °C on an iTC200 microcalorimeter (Malvern Instruments Inc., Westborough, Massachusetts). A control titration of buffer with the inhibitor showed a negligible response. Data were processed and plots were generated using the Origin software provided with the instrument. For competitive inhibitors that bind at only one site, the dissociation constant (Kd = 1/Ka) is equivalent to the inhibition constant measured by enzyme kinetics (Ki).

MD Simulations and Analysis

MD simulations were performed for three different systems (ligand-free, or apo-, Mpro; Mpro in complex with compound 1; and Mpro with HL-3-68) to study the protein stability upon binding with different ligands. The simulation runs were carried out with the OpenMM package on Nvidia V100 GPUs. The protein atomic interactions were described with the Amberff14SB force field and tip3p water model. The ligands, compound 1, and HL-3-68 were modeled using the antechamber package with the GAFF force field. Each system was neutralized with counterions. The nonbonded interactions were cut off at 10 Å and long-range interactions were calculated with the particle mesh Ewald method. The simulations were run at 310 K and 2 fs time steps with the Langevin integrator. The chemical bonds with hydrogen atoms were fixed, and the system pressure was fixed at 1 bar with a Monte Carlo barostat. Each of the three systems was equilibrated using a procedure described in the previous work[40] and three replicas (each with a 250 ns production run) were generated; snapshots from the simulation were saved every 50 ps. For all of the trajectories, we calculated the root-mean-square deviations (RMSDs) for the overall protein structure and root-mean-square fluctuation (RMSF) to quantify per-residue fluctuations.

Antiviral Assays

Evaluation of the antiviral activity of compound 1 (Mcule-59487700), HL-3-68, and Mcule-CSR-494190-S1 was carried out in Vero E6 TMPRSSS cells, as described in Bocci et al.[73] using the USA-WA1/2020 (deposited by the Centers for Disease Control and Prevention and obtained through BEI Resources, NIAID, NIH, NR-52281). Compounds were evaluated in a dose response format starting at 33 μM and 6 additional twofold dilutions in duplicate. These compounds demonstrated no antiviral activity.
  6 in total

1.  Discovery of 2-(furan-2-ylmethylene)hydrazine-1-carbothioamide derivatives as novel inhibitors of SARS-CoV-2 main protease.

Authors:  Xiaodong Dou; Qi Sun; Guofeng Xu; Yameng Liu; Caifang Zhang; Bingding Wang; Yangbin Lu; Zheng Guo; Lingyu Su; Tongyu Huo; Xinyi Zhao; Chen Wang; Zhongtian Yu; Song Song; Liangren Zhang; Zhenming Liu; Luhua Lai; Ning Jiao
Journal:  Eur J Med Chem       Date:  2022-06-03       Impact factor: 7.088

2.  Covalent narlaprevir- and boceprevir-derived hybrid inhibitors of SARS-CoV-2 main protease.

Authors:  Daniel W Kneller; Hui Li; Gwyndalyn Phillips; Kevin L Weiss; Qiu Zhang; Mark A Arnould; Colleen B Jonsson; Surekha Surendranathan; Jyothi Parvathareddy; Matthew P Blakeley; Leighton Coates; John M Louis; Peter V Bonnesen; Andrey Kovalevsky
Journal:  Nat Commun       Date:  2022-04-27       Impact factor: 17.694

3.  Covalent narlaprevir- and boceprevir-derived hybrid inhibitors of SARS-CoV-2 main protease: room-temperature X-ray and neutron crystallography, binding thermodynamics, and antiviral activity.

Authors:  Daniel Kneller; Hui Li; Gwyndalyn Phillips; Kevin Weiss; Qiu Zhang; Mark Arnould; Colleen Jonsson; Surekha Surendranathan; Jyothi Parvathareddy; Matthew Blakeley; Leighton Coates; John Louis; Peter Bonnesen; Andrey Kovalevsky
Journal:  Res Sq       Date:  2022-02-11

4.  Hit Expansion of a Noncovalent SARS-CoV-2 Main Protease Inhibitor.

Authors:  Jens Glaser; Ada Sedova; Stephanie Galanie; Daniel W Kneller; Russell B Davidson; Elvis Maradzike; Sara Del Galdo; Audrey Labbé; Darren J Hsu; Rupesh Agarwal; Dmytro Bykov; Arnold Tharrington; Jerry M Parks; Dayle M A Smith; Isabella Daidone; Leighton Coates; Andrey Kovalevsky; Jeremy C Smith
Journal:  ACS Pharmacol Transl Sci       Date:  2022-04-04

5.  X-ray crystallographic characterization of the SARS-CoV-2 main protease polyprotein cleavage sites essential for viral processing and maturation.

Authors:  Jaeyong Lee; Calem Kenward; Liam J Worrall; Marija Vuckovic; Francesco Gentile; Anh-Tien Ton; Myles Ng; Artem Cherkasov; Natalie C J Strynadka; Mark Paetzel
Journal:  Nat Commun       Date:  2022-09-03       Impact factor: 17.694

6.  Autoprocessing and oxyanion loop reorganization upon GC373 and nirmatrelvir binding of monomeric SARS-CoV-2 main protease catalytic domain.

Authors:  Nashaat T Nashed; Daniel W Kneller; Leighton Coates; Rodolfo Ghirlando; Annie Aniana; Andrey Kovalevsky; John M Louis
Journal:  Commun Biol       Date:  2022-09-16
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.