Literature DB >> 21854807

Peptide aldehyde inhibitors challenge the substrate specificity of the SARS-coronavirus main protease.

Lili Zhu1, Shyla George, Marco F Schmidt, Samer I Al-Gharabli, Jörg Rademann, Rolf Hilgenfeld.   

Abstract

pan class="Species">SARS coronavirus main protease (n>n class="Disease">SARS-CoV M(pro)) is essential for the replication of the virus and regarded as a major antiviral drug target. The enzyme is a cysteine protease, with a catalytic dyad (Cys-145/His-41) in the active site. Aldehyde inhibitors can bind reversibly to the active-site sulfhydryl of SARS-CoV M(pro). Previous studies using peptidic substrates and inhibitors showed that the substrate specificity of SARS-CoV M(pro) requires glutamine in the P1 position and a large hydrophobic residue in the P2 position. We determined four crystal structures of SARS-CoV M(pro) in complex with pentapeptide aldehydes (Ac-ESTLQ-H, Ac-NSFSQ-H, Ac-DSFDQ-H, and Ac-NSTSQ-H). Kinetic data showed that all of these aldehydes exhibit inhibitory activity towards SARS-CoV M(pro), with K(i) values in the μM range. Surprisingly, the X-ray structures revealed that the hydrophobic S2 pocket of the enzyme can accommodate serine and even aspartic-acid side-chains in the P2 positions of the inhibitors. Consequently, we reassessed the substrate specificity of the enzyme by testing the cleavage of 20 different tetradecapeptide substrates with varying amino-acid residues in the P2 position. The cleavage efficiency for the substrate with serine in the P2 position was 160-times lower than that for the original substrate (P2=Leu); furthermore, the substrate with aspartic acid in the P2 position was not cleaved at all. We also determined a crystal structure of SARS-CoV M(pro) in complex with aldehyde Cm-FF-H, which has its P1-phenylalanine residue bound to the relatively hydrophilic S1 pocket of the enzyme and yet exhibits a high inhibitory activity against SARS-CoV M(pro), with K(i)=2.24±0.58 μM. These results show that the stringent substrate specificity of the SARS-CoV M(pro) with respect to the P1 and P2 positions can be overruled by the highly electrophilic character of the aldehyde warhead, thereby constituting a deviation from the dogma that peptidic inhibitors need to correspond to the observed cleavage specificity of the target protease.
Copyright © 2011 Elsevier B.V. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21854807      PMCID: PMC7114241          DOI: 10.1016/j.antiviral.2011.08.001

Source DB:  PubMed          Journal:  Antiviral Res        ISSN: 0166-3542            Impact factor:   5.970


Introduction

pan class="Species">SARS coronavirus main protease (n>n class="Disease">SARS-CoV Mpro) is essential for the replication of the virus and regarded as a major antiviral drug target (Anand et al., 2003, Yang et al., 2003, Tan et al., 2005, Steuber and Hilgenfeld, 2010). Peptide aldehyde inhibitors are widely used as research tools to characterize the substrate specificity of serine and cysteine proteases. In complex with the crystalline target enzyme, they provide a wealth of information on the specificity-defining subsites of the substrate-binding site. For this purpose, we have previously described peptide aldehydes as inhibitors of the SARS-coronavirus main protease (SARS-CoV Mpro) (Al-Gharabli et al., 2006). A major advantage of aldehydes over other peptide electrophiles is their reversible binding to the active-site sulfhydryl of cysteine proteases. We have previously used this property to screen a small library of non-peptide nucleophiles (mostly amines) for molecules that would efficiently enhance the inhibition of the aldehyde by formation of a stronger binding aldehyde-nucleophile ligation product. In a second step of this process that we named “Dynamic Ligation Screening” (Schmidt et al., 2008), the most efficient compound in this process had its amino group replaced by an aldehyde moiety and was reacted with the target enzyme, and the resulting covalent adduct was subsequently used to screen the same library of nucleophiles for those that would react with the aldehyde. This way, starting from a substrate-like peptide aldehyde, we obtained a low-μM non-peptidic, reversible inhibitor from two fragments which by themselves had no or only very low inhibitory activity (Schmidt et al., 2008). Interestingly, it turned out that some pan class="Chemical">peptide n>n class="Chemical">aldehydes with an amino-acid sequence deviating from the consensus sequence embracing the cleavage sites of polyprotein substrates were surprisingly efficient as inhibitors. In particular, peptide aldehydes with P2 = Asp or Ser inhibited the main protease with a relatively low IC50 (Al-Gharabli et al., 2006), although it is generally agreed that the S2 specificity subsite of the enzyme has a strong preference for large hydrophobic side-chains such as Leu, Phe, or Met (Fan et al., 2004, Fan et al., 2005, Lai et al., 2006). We therefore assumed that the hydrophilic P2 side-chain of these inhibitors would be oriented towards the solvent, rather than occupy the S2 pocket, and that the hydrophobic P3 residue would be binding to that pocket (Al-Gharabli et al., 2006, Schmidt et al., 2008). This model was based on a crystal structure of a peptidyl chloromethyl ketone bound to the SARS-CoV main protease, in which this arrangement had been observed (Yang et al., 2003). Here we show by X-ray crystallography of a number of peptide aldehyde complexes with the protein that this is not the case. Instead, the hydrophilic Ser or Asp residues in the P2 position bind to the hydrophobic S2 subsite. In order to understand these binding modes, the atomic interactions were analyzed in detail. Also, we report the inhibition kinetics of these compounds and re-evaluate the cleavage specificity of the enzyme as far as the P2 position is concerned. In addition, we determined a crystal structure of SARS-CoV Mpro in complex with the aldehyde Cm-FF-H (cinnamoyl-Phe-Phe-H), which has a phenylalanine residue in the P1 position and exhibits high inhibitory activity against SARS-CoV Mpro, with K  = 2.24 ± 0.58 μM. These results show that the stringent substrate specificity of the SARS-CoV Mpro with respect to the P1 and P2 positions can be overruled by the highly electrophilic character of the aldehyde warhead.

Materials and methods

Synthesis of aldehydes with various P2 residues

Chemical synthesis of pan class="Chemical">peptide n>n class="Chemical">aldehydes Ac-ESTLQ-H, Ac-NSFSQ-H, Ac-DSFDQ-H, and Ac-NSTSQ-H was performed employing a solid-state method (Al-Gharabli et al., 2006). Briefly, protected glutamine aldehyde obtained by racemization-free oxidation of the corresponding amino alcohol with Dess–Martin periodinane was immobilized on a threonyl resin as oxazolidine. Following N-tert-butyl oxycarbonyl (Boc)-protection of the ring nitrogen to yield the N-protected oxazolidine linker, peptide synthesis was performed on the resin.

Synthesis of Cm-FF-H (Scheme 1)

Synthesis of Cm-FF-H was carried out by amidation of pan class="Chemical">Boc-l-phenylalanine with n>n class="Chemical">l-phenylalanine methyl ester followed by deprotection of the Boc group and acylation of the corresponding product with cinnamoyl chloride to provide N-cinnamoyl-l-phenylalanyl-l-phenylalanine methyl ester. Cm-FF-H was then obtained by reduction of the ester with NaBH4/CaCl2 and alcohol oxidation with IBX. Synthesis of Cm-FF-H. Reagents and conditions: (a) EDCI, HOBt, pan class="Chemical">Boc-n>n class="Chemical">Phe-OH, DMF, 0–25 °C, 18 h, 78%; (b) (i) TFA, CH2Cl2, 0–25 °C, 2 h; (ii) cinnamoyl chloride, N-methylmorpholine, THF, DMF, 0–25 °C, 12 h, 50%; (c) NaBH4, CaCl2, EtOH, THF (1:1), 0–25 °C, 14 h, 35%; (d) IBX, DMSO, 0–25 °C, 5 h, 47%.

Enzyme kinetics

The recombinant production and purification of pan class="Disease">SARS-CoV Mpro with authentic N and C termini were performed as described previously (Xue et al., 2007, Verschueren et al., 2008). The substrate Dabcyl-KTSAVLQ↓SGFRKME-(Edans)-n>n class="Chemical">amide (95% purity; Biosyntan GmbH, Berlin, Germany), which contains a main-protease cleavage site (indicated by the arrow), was used as the substrate in the fluorescence resonance energy transfer (FRET)-based cleavage assay. The substrate stock was prepared by dissolving 1 mM of the peptide in DMSO. The dequenching of the Dabcyl fluorescence due to the cleavage of the substrate as catalyzed by the SARS-CoV Mpro was monitored at 490 nm with excitation at 340 nm, using a Cary Eclipse fluorescence spectrophotometer. The experiments were performed in the buffer consisting of 20 mM Tris-HCl (pH 7.3), 100 mM NaCl, and 1 mM EDTA. The reaction was initiated by adding different final concentrations of the FRET peptide (10–50 μM) to a solution containing SARS-CoV Mpro (final concentration 0.5 μM). Kinetic constants (V max and K m) were derived by fitting the data to the Michaelis–Menten equation, V  =  V max  × [S]/(K m  + [S]). Then k cat was calculated according to the equation, k cat  =  V max/[E]. In order to estimate the population of catalytically active Mpro dimers, the respective monomer and dimer concentrations were calculated according to Graziano et al. (2006). The dimerization of the SARS-CoV Mpro follows the scheme The equilibrium dissociation constant, K, is defined bywhere [M] and [D] are the molar concentrations of the monomer and dimer, respectively. The total protein concentration [M] expressed in terms of molar monomer equipan class="Chemical">valents is Sincesubstituting the above equation into the expression for the K yields Solving the above quadratic equation for [M] gives It then follows that

Aldehyde inhibition assay

pan class="Chemical">Aldehyde stock solutions were prepan>red by dissolving the compounds in n>n class="Chemical">DMSO at 1 mM. All aldehydes form covalent bonds between the aldehyde (–CHO) group of the inhibitor and the sulfhydryl (–SH) group of Cys145 of SARS-CoV Mpro. The binding of these inhibitors is reversible (Schmidt et al., 2008), although it is strong. Therefore, they were treated as reversible tight-binding inhibitors. For the measurement of the inhibition constant K, SARS-CoV Mpro was incubated with the aldehyde inhibitor in reaction buffer at room temperature for 30 min. Then the FRET peptide substrate, Dabcyl-KTSAVLQ↓SGFRKME-Edans, was added and the reaction velocity was calculated according to substrate cleavage. Values of the intrinsic () and apparent (, ) catalytic parameters for SARS-CoV Mpro catalyzing the hydrolysis of peptide substrate were determined in the absence and presence of aldehyde, respectively. The apparent inhibition constants () for aldehyde binding to SARS-CoV Mpro were obtained from the dependence of on the inhibitor concentration ([I]) at fixed substrate concentration ([S]), according to the equation (Ascenzi et al., 1987, Copeland, 2000). Values of the intrinsic inhibition constant (K) for aldehyde binding to SARS-CoV Mpro were calculated according to the equation  =  K  × (1 + [S]/K m) (Ascenzi et al., 1987, Copeland, 2000).

Peptide cleavage assay

Twenty pan class="Chemical">peptide substrates harboring alternative amino-pan class="Chemical">acid residues in P2 (=X; SWTSAVXQ↓SGFRKWA) were purchased from GL Biochemistry Ltd. (Shanghai, China). These peptides correspond to the N-terminal autocleavage site of the SARS-CoV Mpro, with the exception of the P7 Ile which had been replaced by Trp, and the P6′ Met which had also been replaced by Trp. All peptide substrate stocks were dissolved in DMSO at 10 mM. To determine the k cat/K m for the substrate, 0.2 mM substrate peptide was incubated with 10 μM SARS-CoV Mpro in 40 mM Tris-HCl buffer, pH 7.3. Aliquots of reactions were removed at different times, stopped by the addition of 1% trichloroacetic acid. Separation of products and substrate was carried out using a reverse-phase (RP) HPLC and a linear gradient (1–90%) of acetonitrile in 0.1% trifluoroacetic acid. The absorbance was determined at 280 nm, and peak areas were calculated by integration. The (k cat/K m)app ratio was determined by plotting the substrate peak area using the equation lnPA =  C  − (k cat/K m)app  ×  C E  ×  t, where PA is the peak area of the substrate peptide, C E is the total concentration of SARS-CoV Mpro, and C is an experimental constant (Fan et al., 2004, Fan et al., 2005).

Crystallization of the complexes

pan class="Disease">SARS-CoV Mpro with authentic chain termini was concentrated to 10 mg/ml and crystallized by vapor diffusion using sitting drops (Xue et al., 2007). The crystals grew overnight at 20 °C by equilibration against a ren>n class="Chemical">servoir containing 6–8% polyethylene glycol (PEG) 6000, 0.1 M MES (pH 6.0), 3% 2-methyl-2,4-pentanediol (MPD), and 3% DMSO. All aldehydes were dissolved in 8% PEG 6000, 0.1 M MES (pH 6.0), 3% MPD, and 10% DMSO to a concentration of 10 mM. Crystals of the aldehyde complexes of SARS-CoV Mpro were obtained either by adding a 4-μl aliquot of aldehyde solution to the drop and soaking of the crystals for 12 h, or by incubating the enzyme for 2 h at 20 °C with a 7-fold excess of the aldehyde solution and subsequent cocrystallizing at 20 °C against a reservoir containing 8% PEG 6000, 0.1 M MES (pH 6.0), 3% MPD, and 3% DMSO. In the latter case, nucleation was initiated by microseeding using crushed monoclinic (space group C2) crystals of the SARS-CoV Mpro. Prior to diffraction data collection, the aldehyde complex crystals obtained from soaking and cocrystallization were transferred for a few seconds to a cryoprotectant solution containing the crystallization ingredients and 20% MPD.

Crystallographic data collection and processing, and structure elucidation and refinement

All diffrpan class="Chemical">action data were collected at 100 K at the Joint EMBL/University of Hamburg/University of Lübeck synchrotron beamline X13 at DESY (Hamburg, Germany), using a 165-mm MAR CCD detector (Mar Research, Hamburg, Germany), or at synchrotron beamline BL14.1 at BESSY (Berlin, Germany), using an MX225 CCD detector (Rayonics, Evanston, IL). Statistics of data collection, processing, and refinement are summarized in Table 1 . Data were processed with MOSFLM (Leslie, 1992), and scaled using the SCALA program from the n>n class="Gene">CCP4 suite (Collaborative Computational Project, 1994, Evans, 2006).
Table 1

Data collection and refinement statistics.

InhibitorAc-ESTLQ-HAc-DSFDQ-HAc-NSTSQ-HAc-NSFSQ-HAc-ESTLQ-HCm-FF-H
Data collection statistics
Wavelenth (Å)0.81230.81230.81230.81230.81230.9184
MethodSoakingSoakingSoakingSoakingCocrystallizationSoaking
Space groupC2C2C2C2P21C2
Unit cell dimensions (Å, °)108.91, 81.36, 53.40β = 104.35108.74, 81.86, 53.21β = 104.26107.83,82.46, 53.32β = 104.65107.87, 82.09, 53.08β = 104.3052.37, 96.79, 68.07β = 102.49107.75, 82.88, 53.50β = 104.63
NPa111121
Resolution range (Å)32.21–2.6032.32–2.4026.58–2.5864.56–3.0527.29–1.8932.82–1.99
Number of reflections13.95716.00913.938864350.90131.388
Redundancyb3.8(3.8)3.4(2.8)3.0(2.7)3.4(3.4)2.8(2.6)3.6(3.4)
Completeness (%)b99.9(100.0)91.3(85.7)96.7(82.5)99.9(100.0)94.5(78.3)99.7(98.1)
Rmerge (%)b10.2(45.6)10.4(25.7)6.0(32.0)8.3(47.2)6.7(33.6)17.0(36.6)
I/σ(I)b8.7(2.4)5.8(2.3)10.1(2.8)10.2(2.3)10.3(2.7)4.6(2.2)



Refinement statistics
R/Rfree (%)18.5/22.521.2/23.919.8/26.019.0/25.518.7/23.821.7/26.9



r.m.s deviation from idea geometry
Bonds (Å)0.0100.0090.0170.0150.0090.018
Angles (°)1.5351.1621.8201.6611.1721.689
Ramachandran plot
Most favored (%)88.090.383.980.690.689.8
Allowed (%)10.98.614.218.38.39.1
Generously allowed (%)0.40.71.10.40.40.8
Disallowed (%)0.70.40.70.80.80.4
PDB ID3SNE3SNB3SNC3SNA3SND3SN8

Number of protein molecules per asymmetric unit.

Numbers in parentheses are for the outermost resolution shell.

Data collection and refinement statistics. Number of protein molecules per asympan class="Chemical">metric unit. Numbers in parentheses are for the outermost resolution shell. Structure elucidation and refinement were carried out using the pan class="Gene">CCP4 suite of programs (Collaborative Computational Project, 1994, Potterton et al., 2003). Crystal structures were determined by molecular repln>n class="Chemical">acement, using the original X-ray structure of the SARS-CoV Mpro with authentic chain termini (PDB ID: 2H2Z) (Xue et al., 2007) as the initial model. REFMAC (Murshudov et al., 1997) was employed for structure refinement. The computer graphics program Coot (Emsley and Cowtan, 2004) was used for interpretation of electron density maps and model building. Gap volumes between inhibitor and protein in the subsites of the enzyme were calculated using UCSF-Chimera (Pettersen et al., 2004). The molecular graphics package PyMOL (DeLano, 2002) was used to generate the figures.

Results and discussion

Inhibition of SARS-CoV Mpro by peptide aldehydes

The kinetic parapan class="Chemical">meters of n>n class="Disease">SARS-CoV Mpro with authentic chain termini were determined using the FRET substrate Dabcyl-KTSAVLQ↓SGFRKME-Edans amide. The K m value for this substrate was determined as 24.5 μM and the apparent k cat/K m value was 2359 M−1·19 s−1. However, the dissociation of the catalytically active SARS-CoV Mpro dimer into inactive monomers (Fan et al., 2004) has to be taken into account. A wide range of K values has been reported for Mpro dimer dissociation (see Grum-Tokars et al. (2008) for an overview). For the enzyme with authentic chain termini, the latter authors reported a K of 0.25 to 1.0 μM. As the Mpro dimer tends to be stabilized by the presence of substrate (Cheng et al., 2010), we used the lower limit of this range for estimation of the necessary corrections and obtained a k cat/K m value of 7863 M−1  s−1. The K m value determined in our study is of the same magnitude as other values derived from data not corrected for dissociation, which are typically in the range from 10 to 50 μM (Grum-Tokars et al., 2008). This suggests that much higher values reported previously (e.g., Verschueren et al., 2008) should probably be reconsidered. Initially, four pentapan class="Chemical">peptide substrate-analogous n>n class="Chemical">aldehydes containing the canonical P1 residue, glutamine, namely Ac-ESTLQ-H, Ac-NSTSQ-H, Ac-DSFDQ-H, and Ac-NSFSQ-H, were tested for inhibition of SARS-CoV Mpro. Overall, the four analogues were found to be reversible tight-binding inhibitors. Harboring the canonical P2 Leu, aldehyde Ac-ESTLQ-H exhibited inhibition with a relatively low K of 8.27 ± 1.52 μM. Aldehydes Ac-NSTSQ-H, Ac-DSFDQ-H, and Ac-NSFSQ-H, all with a non-canonical P2 residue, exhibited moderate inhibition with K values of 40.98 ± 2.63, 41.24 ± 2.25, and 72.73 ± 3.60 μM. Surprisingly, aldehyde CmFF-H, carrying a cinnamoyl group in the P3 and a Phe residue in the P1 position, had an even higher inhibitory activity against SARS-CoV Mpro than the four pentapeptide aldehydes, with a K of 2.24 ± 0.58 μM.

Overall structures of the aldehyde complexes

The pan class="Chemical">aldehydes n>n class="Chemical">Ac-ESTLQ-H, Ac-NSTSQ-H, Ac-DSFDQ-H, Ac-NSFSQ-H, and Cm-FF-H were separately soaked into crystals of SARS-CoV Mpro. The crystals were all of space group C2, which is often observed for SARS-CoV Mpro (Lee et al., 2005, Xue et al., 2007, Verschueren et al., 2008). These crystals contain one SARS-CoV Mpro monomer per asymmetric unit and the dimer (which is the enzymatically active species) is formed through the symmetry of the crystal. The four pentapeptide aldehydes Ac-ESTLQ-H, Ac-NSTSQ-H, Ac-DSFDQ-H, and Ac-NSFSQ-H are bound in extended conformations in the S6–S1 specificity subsites of SARS-CoV Mpro. Cm-FF-H occupies sites S3–S1. Remarkably, the P1 phenylalanine side chain of this inhibitor is bound deeply in the S1 pocket, which is generally considered to be specific for glutamine. 2F o  −  F c electron density maps of these aldehyde inhibitors are shown in Fig. 1 . In all complexes, continuous electron density between the aldehyde carbonyl C-atom of the inhibitor and Cys145-Sγ of SARS-CoV Mpro (Fig. 1) indicates the formation of a thiohemiacetal, as a result of the nucleophilic attack of the catalytic cysteine onto the C-terminal aldehyde of the inhibitor. The main-chain conformations of SARS-CoV Mpro in the five complexes are basically identical, with overall root mean-square deviations (RMSD) of 0.16 − 0.36 Å for Cα atoms. In addition to the crystal soaking experiments, we also tried to cocrystallize SARS-CoV Mpro with the aldehydes. The crystals obtained were predominantly of space group P21, with the exception of the complex with Ac-NSFSQ-H, which still displayed space group C2. However, in most of the P21 crystals, no electron density for the aldehyde could be observed; thus, while the presence of the inhibitors induced a change of space group in most cases, the compounds themselves were not detected in the crystals. An exception was Ac-ESTLQ-H. The crystals of its complex with the Mpro diffracted to 1.89 Å, but in both enzyme monomers in the asymmetric unit, electron density could only be seen for residues P1 (Gln) and P2 (Leu) of the inhibitor (Fig. 1E and F). Crystallographic data and refinement statistics for all six crystal structures are summarized in Table 1.
Fig. 1

Inhibitor binding at the active site of SARS-CoV Mpro. 2Fo − Fc electron-density maps are shown for inhibitors (A) Ac-ESTLQ-H, (B) Ac-NSTSQ-H, (C) Ac-DSFDQ-H, (D) Ac-NSFSQ-H, (E) the visible portion of Ac-ESTLQ-H cocrystallized with the Mpro, molecule A, and (F) molecule B, (G) Cm-FF-H. The maps are contoured at a level of 1σ. The catalytic Cys145 of SARS-CoV Mpro forms a thiohemiacetal with the aldehyde group of the inhibitors.

Inhibitor binding at the pan class="Chemical">active site of n>n class="Disease">SARS-CoV Mpro. 2Fo − Fc electron-density maps are shown for inhibitors (A) Ac-ESTLQ-H, (B) Ac-NSTSQ-H, (C) Ac-DSFDQ-H, (D) Ac-NSFSQ-H, (E) the visible portion of Ac-ESTLQ-H cocrystallized with the Mpro, molecule A, and (F) molecule B, (G) Cm-FF-H. The maps are contoured at a level of 1σ. The catalytic Cys145 of SARS-CoV Mpro forms a thiohemiacetal with the aldehyde group of the inhibitors.

Dual configurations of the thiohemiacetal in the complex with Ac-NSFSQ-H

In the pan class="Disease">SARS-CoV Mpro complex with n>n class="Chemical">aldehyde inhibitor Ac-NSFSQ-H, the thiohemiacetal adopts alternative configurations (Fig. 2 A and B). This is likely the result of nucleophilic attack by Cys145 onto the planar carbonyl from either side. In the (R)-isomer, the thiohemiacetal oxygen forms a hydrogen bond with the imidazole of His41 (3.30 Å) of the catalytic dyad (Fig. 2C), whereas in the (S)-isomer, the oxygen atom points away from His41 and into the oxyanion hole, forming H-bonds with the main-chain amides of Gly143 (2.71 Å) and Cys 145 (3.15 Å) (Fig. 2D). Previous X-ray and NMR studies of aldehyde binding to various cysteine and serine proteases demonstrated that one or both configurations were present in the structures of the complexes (Delbaere and Brayer, 1985, Webber et al., 1998, Robin et al., 2009).
Fig. 2

Dual configurations of the thiohemiacetal in the complex with Ac-NSFSQ-H. (A) and (B): Ac-NSFSQ-H binding at the active site in two alternative configurations. Hydrogen bonding interactions are represented by broken lines. Distances between atoms are shown in Å. (C) and (D): Schematic diagram of the aldehyde inhibitor Ac-NSFSQ-H attacked by the catalytic cysteine 145, leading to the formation of two possible diastereomeric products.

Dual configurations of the pan class="Chemical">thiohemiacetal in the complex with n>n class="Chemical">Ac-NSFSQ-H. (A) and (B): Ac-NSFSQ-H binding at the active site in two alternative configurations. Hydrogen bonding interactions are represented by broken lines. Distances between atoms are shown in Å. (C) and (D): Schematic diagram of the aldehyde inhibitor Ac-NSFSQ-H attacked by the catalytic cysteine 145, leading to the formation of two possible diastereomeric products.

The S2 subsite

The S2 subsite is a large pocket lined by residues pan class="Chemical">His41, n>n class="Chemical">Met49, Cys145, His164, Met165, Asp187, Arg188mc (mc: contribution from main-chain atoms only), and Gln 189 (Fig. 1). At the cleavage sites of the SARS-CoV Mpro in the viral polyproteins, the P2 position is mostly found to be occupied by Leu or Phe. Accordingly, almost all inhibitors designed for the enzyme carry a large hydrophobic group in the P2 position. We were therefore very surprised to find the Ser or Asp side chains in the P2 position of the aldehydes bound to this hydrophobic site (Fig. 1). In the SARS-CoV Mpro complexes with Ac-NSTSQ-H or Ac-NSFSQ-H, there is no hydrogen-bonding interaction between the side chain of P2-Ser and residues of the S2 subsite. As the Ser side chain is small, there is no steric barrier for Ser binding in the large S2 pocket. The serine does not quite reach the “bottom” of the S2 pocket, but the distance between its Oγ atom and the carbonyl oxygen of residue Gln192 is 4.0 Å, leaving no space for a water molecule to locate in between. Accordingly, we did not find any electron density suggesting the presence of a well-ordered water in the S2 pocket, although disordered water is probably present in the pocket on both sides of the serine side-chain, where free spaces with a total volume of >70 Å3 are present between the substrate and the protein. It should be noted that the side-chain of Gln189, which contributes to one wall of the S2 pocket, is quite flexible and adopts different conformations in the various inhibitor complexes, allowing or preventing the access of water to the pocket. In case of the pan class="Disease">SARS-CoV Mpro complex with n>n class="Chemical">Ac-DSFDQ-H, the P2 Asp side-chain is situated between the side chains of Met49 and Met165, which form opposite walls of the S2 pocket. Most interestingly, its carboxylate oxygens undergo close interactions with the sulfur atoms of the two methionines. The distances (r) between the Sδ atoms of Met49 and Met165 on the one hand and the Oδ2 and Oδ1 atoms, respectively, of P2-Asp on the other are 3.36 and 3.45 Å (Fig. 3 ). Then the relative distances (d) can be calculated according to d   =   r  −  vdw(S) −  vdw(O), where values of 1.80 and 1.52 Å are used as van der Waals radii (vdw) for S and O atoms, respectively. With d  = 0.04 and 0.13 Å, the distances between the P2-Asp and the two S2-Met side-chains fulfill the condition of d  ⩽ 0.2 Å (Iwaoka et al., 2002) for a non-bonded S…O contact. These interactions are not in the plane of the methionine sulfides; the θ values (θ is the polar angle between the normal to the sulfide plane and the S…O vector (Pal and Chakrabarti, 2001)) are 30.6° and 52.8°. Similar nonbonded interactions between the methionine sulfur atoms and main-chain carbonyl oxygens or carboxylate side-chains have been detected previously in the hydrophobic cores of proteins and were proposed to stabilize the protein fold (Pal and Chakrabarti, 2001). It has also been suggested that S…O interactions should be taken into account in protein engineering studies (Iwaoka et al., 2002, Pal and Chakrabarti, 2001), but to the best of our knowledge, we provide here the first description of a methionine-carboxylate interaction in a protein-ligand complex. The unexpected finding of Ser and Asp binding in the S2 subsite constitutes a deviation from the dogma that peptide inhibitors of proteases should contain amino-acid residues corresponding to the sequence specificity of the target enzyme.
Fig. 3

Interactions of P2-Asp in the S2 subsite in the complex structure SARS-CoV Mpro: Ac-DSFDQ-H. P2-Asp carboxylate oxygens interact with the sulfur atoms of Met49 and Met165. These non-bonded interactions are represented by dashed lines. Distances between atoms are shown in Å.

Interpan class="Chemical">actions of n>n class="Chemical">P2-Asp in the S2 subsite in the complex structure SARS-CoV Mpro: Ac-DSFDQ-H. P2-Asp carboxylate oxygens interact with the sulfur atoms of Met49 and Met165. These non-bonded interactions are represented by dashed lines. Distances between atoms are shown in Å.

Analysis of peptide substrates harboring different amino-acid residues in P2

The unexpected obpan class="Chemical">servation of the n>n class="Chemical">P2-Asp residue of aldehyde Ac-DSFDQ-H binding to the hydrophobic S2 pocket prompted us to re-determine the cleavage specificity of the SARS-CoV Mpro with respect to the P2 position. Within the context of the peptide substrate SWTSAVQ↓SGFRKWA, all 20 proteinogenic amino acids were tested in position P2 (=X). In the HPLC-based assay, the protease concentration was kept constant at 0.5 μM. The canonical substrate with Leu in the P2 position was completely cleaved within two minutes and was therefore used as the standard to compare the proteolytic activities with other substrates. The results are listed in Fig. 4 . Our study confirmed that the most favored residues in the P2 position of the substrate (after Leu) are the hydrophobic amino acids Phe, Met, Val, and Ile. Substrates with polar amino acids in P2 are poorly cleaved. The cleavage efficiency for the substrate with P2 = Ser was 160-times lower than for the best substrate (P2 = Leu); furthermore, the peptide with P2 = Asp was not cleaved at all after incubation with SARS-CoV Mpro for 16 h.
Fig. 4

Relative cleavage efficiencies of 20 peptide substrates harboring different amino-acid residues in P2.

Relative cleavage efficiencies of 20 pan class="Chemical">peptide substrates harboring different amino-pan class="Chemical">acid residues in P2.

S1 subsite

The S1 subsite is lined by residues pan class="Chemical">Phe140, n>n class="Chemical">Asn142, Gly143, Ser144, Cys145, His163, Glu166, and His172. From the substrate specificity results (Fan et al., 2004, Fan et al., 2005, Lai et al., 2006), P1 has to be Gln. Moreover, in all SARS-CoV Mpro complex structures described so far (Lee et al., 2005, Xue et al., 2007, Yang et al., 2005, Yang et al., 2003, Yin et al., 2007), the S1 subsite is occupied by a polar group, i.e. the side chain of Gln or a five-membered lactam (used as a glutamine surrogate). Shie et al. synthesized a series of α,β-unsaturated esters containing both P1 and P2 phenylalanine residues which showed modest inhibitory activity against the SARS-CoV Mpro (IC50  = 11–39 μM) (Shie et al., 2005). The α,β-unsaturated ethyl ester of 4-(dimethylamino)cinnamoyl-Phe-Phe was reported to be a potent inhibitor, with an inhibition constant of 0.52 μM (Shie et al., 2005). The computer model for the complex of SARS-CoV Mpro with the compound proposed that the P1 and P2 phenyl groups occupy the S2 and S3 pockets, respectively (Shie et al., 2005). However, in the crystal structure of the complex with the aldehyde Cm-FF-H that we describe in this communication, the side chain of P1-Phe is inserted into the S1 subsite (Fig. 1G). In order to understand this unexpected deviation from the commonly observed specificity, we compared the interaction with the S1 subsite observed in our structures of the complexes with Cm-FF-H and Ac-ESTLQ-H, the latter of which corresponds exactly to the described cleavage specificity of the Mpro. The predominant S1 specificity of the enzyme for Gln is determined primarily by His163. In the complex formed with Ac-ESTLQ-H, Nε2 of His163 donates a 2.83-Å hydrogen bond to the side-chain carbonyl oxygen (Oε1) of P1-Gln (Fig. 1). In the complex with Cm-FF-H, the side chain of Asn142 undergoes an 83.8° rotation about its χ1 torsion angle compared to the conformation in the complex with Ac-ESTLQ-H (Fig. 5 ), leading to an opening of the S1 subsite towards the solvent. A similar movement of Asn142 has been observed in the crystal structure of SARS-CoV Mpro in complex with an inhibitor called “N1” (PDB ID: 1WOF), where both the regular conformation of this residue and the one observed in our structure were found (Yang et al., 2005). The P1-benzyl group of Cm-FF-H binds to the S1 subsite by making hydrophobic interactions with Phe140, Leu141, Asn142, and the P3 cinnamoyl group of Cm-FF-H. As the S1 subsite specificity does not seem to be as stringent as previously thought. It may well be possible that other chemical groups can be accommodated here, thereby expanding the opportunities for designing and synthesizing efficient inhibitors of the SARS-CoV Mpro.
Fig. 5

Superimposition of the structures of SARS-CoV Mpro complexes with Cm-FF-H (red) and Ac-ESTLQ-H (green) showing the conformational changes in the S1 subsite. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Superimposition of the structures of pan class="Disease">SARS-CoV Mpro complexes with Cm-FF-H (red) and n>n class="Chemical">Ac-ESTLQ-H (green) showing the conformational changes in the S1 subsite. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Conclusions

In tpan class="Chemical">his study, we report six crystal structures of SARS-Cov Mpro in complex with peptide aldehydes and the inhibition kinetics of these compounds. The crystal structures reveal the mechanism of aldehyde binding to the active site of SARS-Cov Mpro and provide detailed information on the atomic interactions. Since Asp or Ser were found to bind to the hydrophobic S2 subsite, and Phe was located in the hydrophilic S1 subsite of SARS-Cov Mpro, we conclude that the stringent substrate specificity of the SARS-CoV Mpro with respect to the P1 and P2 positions can be overruled by the highly electrophilic character of the aldehyde warhead. This constitutes a deviation from the dogma that peptidic protease inhibitors should comprise an amino-acid sequence corresponding to the cleavage specificity of the target enzyme. The observed non-bonded interaction of the carboxylate oxygen atoms of an aspartate residue of an inhibitor with the thioether moieties of two methionines forming part of an hydrophobic pocket of the target protein is remarkable and suggests that such S…O interactions should be added to the repertoire of computer-aided drug design.

Accession numbers

Atomic coordinates and structure fpan class="Chemical">actors have been deposited in the RCSB Protein Data Bank under ID codes 3SN8, 3SNA, 3SNB, 3SNC, 3SND, and 3SNE (see Table 1).
  29 in total

1.  Refinement of macromolecular structures by the maximum-likelihood method.

Authors:  G N Murshudov; A A Vagin; E J Dodson
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  1997-05-01

2.  Coot: model-building tools for molecular graphics.

Authors:  Paul Emsley; Kevin Cowtan
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2004-11-26

3.  SARS CoV main proteinase: The monomer-dimer equilibrium dissociation constant.

Authors:  Vito Graziano; William J McGrath; Lin Yang; Walter F Mangel
Journal:  Biochemistry       Date:  2006-12-12       Impact factor: 3.162

4.  The crystal structures of severe acute respiratory syndrome virus main protease and its complex with an inhibitor.

Authors:  Haitao Yang; Maojun Yang; Yi Ding; Yiwei Liu; Zhiyong Lou; Zhe Zhou; Lei Sun; Lijuan Mo; Sheng Ye; Hai Pang; George F Gao; Kanchan Anand; Mark Bartlam; Rolf Hilgenfeld; Zihe Rao
Journal:  Proc Natl Acad Sci U S A       Date:  2003-10-29       Impact factor: 11.205

5.  Tripeptide aldehyde inhibitors of human rhinovirus 3C protease: design, synthesis, biological evaluation, and cocrystal structure solution of P1 glutamine isosteric replacements.

Authors:  S E Webber; K Okano; T L Little; S H Reich; Y Xin; S A Fuhrman; D A Matthews; R A Love; T F Hendrickson; A K Patick; J W Meador; R A Ferre; E L Brown; C E Ford; S L Binford; S T Worland
Journal:  J Med Chem       Date:  1998-07-16       Impact factor: 7.446

6.  Statistical and theoretical investigations on the directionality of nonbonded S...O interactions. Implications for molecular design and protein engineering.

Authors:  Michio Iwaoka; Shinya Takemoto; Shuji Tomoda
Journal:  J Am Chem Soc       Date:  2002-09-04       Impact factor: 15.419

Review 7.  Scaling and assessment of data quality.

Authors:  Philip Evans
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2005-12-14

8.  Inhibition of the severe acute respiratory syndrome 3CL protease by peptidomimetic alpha,beta-unsaturated esters.

Authors:  Jiun-Jie Shie; Jim-Min Fang; Tun-Hsun Kuo; Chih-Jung Kuo; Po-Huang Liang; Hung-Jyun Huang; Yin-Ta Wu; Jia-Tsrong Jan; Yih-Shyun E Cheng; Chi-Huey Wong
Journal:  Bioorg Med Chem       Date:  2005-09-01       Impact factor: 3.641

9.  pH-dependent conformational flexibility of the SARS-CoV main proteinase (M(pro)) dimer: molecular dynamics simulations and multiple X-ray structure analyses.

Authors:  Jinzhi Tan; Koen H G Verschueren; Kanchan Anand; Jianhua Shen; Maojun Yang; Yechun Xu; Zihe Rao; Janna Bigalke; Burkhard Heisen; Jeroen R Mesters; Kaixian Chen; Xu Shen; Hualiang Jiang; Rolf Hilgenfeld
Journal:  J Mol Biol       Date:  2005-09-23       Impact factor: 5.469

10.  The substrate specificity of SARS coronavirus 3C-like proteinase.

Authors:  Keqiang Fan; Liang Ma; Xiaofeng Han; Huanhuan Liang; Ping Wei; Ying Liu; Luhua Lai
Journal:  Biochem Biophys Res Commun       Date:  2005-04-15       Impact factor: 3.575

View more
  58 in total

1.  Intrinsic evolutionary constraints on protease structure, enzyme acylation, and the identity of the catalytic triad.

Authors:  Andrew R Buller; Craig A Townsend
Journal:  Proc Natl Acad Sci U S A       Date:  2013-02-04       Impact factor: 11.205

2.  Covalent Antiviral Agents.

Authors:  Sako Mirzaie; Fatemeh Abdi; Amin GhavamiNejad; Brian Lu; Xiao Yu Wu
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 2.622

Review 3.  Review on development of potential inhibitors of SARS-CoV-2 main protease (MPro).

Authors:  Soumya Gulab Katre; Alpana Jagdish Asnani; Kumar Pratyush; Nilima Gangadhar Sakharkar; Ashwini Gajanan Bhope; Kanchan Tekram Sawarkar; Vaibhav Santosh Nimbekar
Journal:  Futur J Pharm Sci       Date:  2022-06-21

4.  From Repurposing to Redesign: Optimization of Boceprevir to Highly Potent Inhibitors of the SARS-CoV-2 Main Protease.

Authors:  Matthias Göhl; Linlin Zhang; Haifa El Kilani; Xinyuanyuan Sun; Kaixuan Zhang; Mark Brönstrup; Rolf Hilgenfeld
Journal:  Molecules       Date:  2022-07-04       Impact factor: 4.927

5.  Profiling of substrate specificities of 3C-like proteases from group 1, 2a, 2b, and 3 coronaviruses.

Authors:  Chi-Pang Chuck; Hak-Fun Chow; David Chi-Cheong Wan; Kam-Bo Wong
Journal:  PLoS One       Date:  2011-11-02       Impact factor: 3.240

Review 6.  A review of the latest research on Mpro targeting SARS-COV inhibitors.

Authors:  Huihui Yang; Jinfei Yang
Journal:  RSC Med Chem       Date:  2021-04-14

7.  Evaluation of potency of the selected bioactive molecules from Indian medicinal plants with MPro of SARS-CoV-2 through in silico analysis.

Authors:  Pinku Halder; Upamanyu Pal; Pranab Paladhi; Saurav Dutta; Pallab Paul; Samudra Pal; Debasmita Das; Agnish Ganguly; Ishita Dutta; Sayarneel Mandal; Anirban Ray; Sujay Ghosh
Journal:  J Ayurveda Integr Med       Date:  2021-05-21

8.  Mechanism of inhibition of SARS-CoV-2 Mpro by N3 peptidyl Michael acceptor explained by QM/MM simulations and design of new derivatives with tunable chemical reactivity.

Authors:  Kemel Arafet; Natalia Serrano-Aparicio; Alessio Lodola; Adrian J Mulholland; Florenci V González; Katarzyna Świderek; Vicent Moliner
Journal:  Chem Sci       Date:  2020-11-27       Impact factor: 9.825

9.  Two Inhibitors Against the 3C-Like Proteases of Swine Coronavirus and Feline Coronavirus.

Authors:  Mengxin Zhou; Yutong Han; Mengxia Li; Gang Ye; Guiqing Peng
Journal:  Virol Sin       Date:  2021-07-06       Impact factor: 4.327

10.  Phycobilins as Potent Food Bioactive Broad-Spectrum Inhibitors Against Proteases of SARS-CoV-2 and Other Coronaviruses: A Preliminary Study.

Authors:  Brahmaiah Pendyala; Ankit Patras; Chandravanu Dash
Journal:  Front Microbiol       Date:  2021-06-10       Impact factor: 5.640

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.