The main proteinase (M(pro)) of the severe acute respiratory syndrome (SARS) coronavirus is a principal target for the design of anticoronaviral compounds. Benzotriazole esters have been reported as potent nonpeptidic inhibitors of the enzyme, but their exact mechanism of action remains unclear. Here we present crystal structures of SARS-CoV M(pro), the active-site cysteine of which has been acylated by benzotriazole esters that act as suicide inhibitors. In one of the structures, the thioester product has been hydrolyzed and benzoic acid is observed to bind to the hydrophobic S2 pocket. This structure also features the enzyme with a shortened N-terminal segment ("amputated N finger"). The results further the understanding of the important role of the N finger for catalysis as well as the design of benzotriazole inhibitors with improved specificity.
The global 2003 outbreak of the highly infectious severe acute respiratory syndrome (SARS) was caused by a new human coronavirus, SARS coronavirus (SARS-CoV) (Ksiazek et al., 2003, Drosten et al., 2003, Peiris et al., 2003). Spreading from southern China, the disease was responsible for the death of about 10% of those infected (Stockman et al., 2006). Since 2003 there have been two isolated mini-outbreaks in China, but they have been contained in their early stages. These cases remind us of the lingering danger to human health represented by SARS-CoV as well as other coronaviruses, and the need for effective anticoronaviral therapeutics. Since the global SARS outbreak, two new coronaviruses infecting humans have been discovered (van der Hoek et al., 2004, Fouchier et al., 2004, Woo et al., 2005). HCoV NL63, a coronavirus associated with croup (Chiu et al., 2005), appears to be the most widespread member of the entire family. It is the causative agent of up to 10% of all respiratory diseases (Pyrc et al., 2006 and references therein). Detected in 13 patients in Hong Kong, HCoV HKU1 is a coronavirus associated with relatively mild, community-acquired pneumonia characterized by fever, running nose, and cough (Lau et al., 2006), although a subtype of HKU1 causing a severe obstructive pneumonia was recently identified (Kupfer et al., 2007). Two other human coronaviruses, HCoV 229E and OC43, have been known for decades, and are believed to cause mild upper respiratory tract illnesses, including the common cold (McIntosh et al., 1967). Furthermore, HCoV OC43 has also been reported to have a high prevalence in the brains of multiple sclerosis patients (Arbour et al., 2000).
Coronaviruses are enveloped, positive-sense RNA viruses. 
Their large single-stranded RNA genome ranges from 27 to 31 kb (29.7 kb for SARS-CoV) and encodes several structural and auxiliary proteins as well as two large overlapping polyproteins, pp1a (replicase 1a, ∼486 kDa for SARS-CoV) and pp1ab (replicase 1ab, ∼790 kDa) (Thiel et al., 2003). These polyproteins are subsequently cleaved to give rise to the individual nonstructural proteins essential for building the viral replicase complex. In most coronaviruses, this maturation process is performed by extensive proteolytic processing by three cysteine peptidases. There are two papain-like proteinases (PLpro) that are responsible for acting on three cleavage sites near the N terminus of the polyproteins (Ziebuhr et al., 1995), and the main proteinase or Mpro (also called “3C-like protease” or 3CLpro), which is necessary for cleaving at no fewer than 11 conserved sites containing a large hydrophobic residue (preferably Leu) in the P2 position, a Gln in the P1 position, and a small aliphatic amino acid residue (Ser, Gly, Ala) in the P1′ position (Ziebuhr et al., 2000, Hegyi and Ziebuhr, 2002, Anand et al., 2003). In contrast to other coronaviruses, the SARS-CoV genome encodes only one PLpro in addition to the Mpro. Besides being involved in the processing of the polyproteins, the PLpro has a second function as a deubiquitinating enzyme (Barretto et al., 2005, Lindner et al., 2005). Because of its pivotal role in viral replication, the 33.8 kDa main proteinase has become the prime target in the development of inhibitors directed at SARS-CoV and other coronaviruses (Anand et al., 2003, Anand et al., 2005).
So far, the crystal structures of the main proteinases of four coronaviruses have been determined: transmissible gastroenteritis virus (TGEV) (Anand et al., 2002), HCoV 229E (Anand et al., 2003), SARS-CoV (Yang et al., 2003, Tan et al., 2005, Xu et al., 2005, Xue et al., 2007), and infectious bronchitis virus (IBV) (Xue et al., 2008). 
Except for IBV, the enzyme is a homodimer, and this is the quaternary structure observed in solution as well (Anand et al., 2002, Fan et al., 2004, Hsu et al., 2005a). In fact, it has been proposed that dimerization is essential for catalytic activity because the very amino terminus (“N finger”) of one subunit is involved in organizing the substrate-binding pocket of the other (Anand et al., 2002, Yang et al., 2003, Tan et al., 2005).
Each monomer of the coronaviral Mpro consists of three domains. Domains I (residues 1–101) and II (residues 102–184) each consist of an antiparallel β barrel; taken together, domains I and II resemble the structure of chymotrypsin-like serine proteinases (Anand et al., 2002, Anand et al., 2003, Yang et al., 2003, Tsukada and Blow, 1985). The active site of the Mpro is located in the interface between the two domains and consists of a Cys-His catalytic dyad, in which the cysteine acts as the nucleophile in the proteolytic cleavage reaction (Anand et al., 2002, Anand et al., 2003, Yang et al., 2003). Connected by a long loop (residues 185–200) to domain II, domain III is all-helical and is involved not only, to some extent, in dimer formation but also in placing the N finger next to the substrate-binding site of the other monomer. In the active conformation of the protein, the individual binding pockets for the respective amino acid residues of the substrate are accessible and the oxyanion loop (residues 138–145) has the correct shape to donate two hydrogen bonds from main-chain amides to stabilize the tetrahedral transition state of the proteolysis reaction. The two monomers in the dimer are arranged perpendicular to each other (Figure 1; Yang et al., 2003).
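The cleavage-site preference described above (Leu in P2, Gln in P1, and Ser, Gly, or Ala in P1′) can be illustrated with a short pattern search. The following is only a minimal sketch: it restricts P2 to Leu, although the text states that other large hydrophobic residues are tolerated, and the function name `find_cleavage_sites` is hypothetical. The test sequence is the pentadecapeptide substrate used in the HPLC-based activity assay of this study.

```python
import re

# Simplified consensus: P2 = Leu, P1 = Gln, followed by P1' = Ser/Gly/Ala.
# The lookahead keeps P1' out of the match, so the match ends at the scissile bond.
CLEAVAGE_MOTIF = re.compile(r"LQ(?=[SGA])")


def find_cleavage_sites(sequence):
    """Return the 0-based index of the P1' residue for every predicted site."""
    return [match.end() for match in CLEAVAGE_MOTIF.finditer(sequence)]


# Pentadecapeptide substrate from the HPLC-based assay (cleaved after Gln).
peptide = "SWTSAVLQSGFRKWA"
sites = find_cleavage_sites(peptide)

for site in sites:
    # Print the two fragments separated by the scissile bond.
    print(peptide[:site], "|", peptide[site:])
```

A pattern of this kind only flags candidate sites by sequence; it says nothing about accessibility or cleavage efficiency.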
Figure 1
Dimer of the SARS CoV Mpro
Ribbon diagram and surface representation of the dimeric SARS-CoV Mpro (protomer A in blue, protomer B in light orange). The catalytic dyad residues (Cys145 and His41) in both monomers are colored by atom (yellow, carbon; red, oxygen; blue, nitrogen; green, sulfur). The N finger (residues 1–7) is colored magenta for monomer A and green for monomer B. The very N terminus is marked by a sphere. It interacts with residue Glu166 of the opposing monomer. The oxyanion loop is colored red for monomer A and dark brown for monomer B.
A number of crystal structures with small-molecule inhibitors bound to the coronavirus Mpro have been reported. Most of these compounds are peptide analogs derived from the substrate cleavage consensus sequence, such as peptidyl chloromethyl ketones (Anand et al., 2003, Yang et al., 2003), vinylogous alkyl ester derivatives serving as Michael acceptors (Yang et al., 2005), aza-peptide epoxides (Lee et al., 2005, Lee et al., 2007), or α,β-epoxyketones (Goetz et al., 2007). Peptidyl aldehydes have been described as reversible inhibitors (Al-Gharabli et al., 2006, Schmidt et al., 2008). Also, structure-based virtual screening approaches have led to the discovery of a number of nonpeptidic inhibitors (Wu et al., 2004, Bacha et al., 2004, Chen et al., 2005b, Kaeppler et al., 2005, Tsai et al., 2006), but until now, only a few crystal structures of such complexes with the protease have been reported (e.g., Lu et al., 2006).
In the early stages of the 2003 SARS outbreak, one of the first drugs tried for the treatment of patients was ribavirin, an antiviral drug commonly used against a number of DNA and RNA viruses (Sidwell et al., 1972), but this proved to be unsuccessful (Stockman et al., 2006, Knowles et al., 2003). Left with few other options, physicians also tested HIV proteinase inhibitors (Stockman et al., 2006), and Wu et al. (2006) found that one of them, lopinavir, targeted SARS-CoV Mpro with an IC50 of ≈50 μM. 
In their search for derivatives of lopinavir with an improved inhibitory action toward Mpro, Wu et al. (2006) noticed by serendipity that the intermediate benzotriazole esters occurring during the synthesis of these derivatives were better inhibitors of the enzyme than the final products (see also Hilgenfeld and Pumpor, 2006). These benzotriazole esters are the result of the activation of carboxylic acids by 2-(1H-benzotriazole-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate (HBTU), which was used as a coupling reagent in the synthesis of the lopinavir-like molecules. Wu et al. (2006) reported Ki = 7.5 nM and kinact (=k3) = 0.0011 s−1 for the best of these inhibitors and suggested, on the basis of mass spectrometric analyses, that the benzotriazole esters irreversibly acylate the active-site Cys145. However, even though these Ki values are among the lowest for any inhibitor of the SARS-CoV Mpro described so far, Wu et al. (2006) also reported that full inhibition of the enzyme could not be achieved. Also, upon replacement of the ester oxygen by a methylene group, the inhibitors bound noncovalently to the enzyme, with Ki values in the micromolar range (Wu et al., 2006). Such a mechanism might also apply, to some extent, to the benzotriazole esters themselves, perhaps explaining the incomplete inhibition reported by Wu et al. (2006). Because of these uncertainties, we wanted to visualize the binding mode of benzotriazole esters to the SARS-CoV main proteinase, and have determined the crystal structures of the enzyme after incubation with two such compounds, 1-(benzoyloxy)-benzotriazole (compound XP-27) and 1-(4-dimethylaminobenzoyloxy)-benzotriazole (compound XP-59) (see Figure 2).
Figure 2
Chemical Formulas and Reaction Mechanism
Formulas for compounds XP-27 (X = H) and XP-59 (X = [CH3]2N) and proposed reaction mechanism (Wu et al., 2006).
Results and Discussion
Chain Termini of the SARS-CoV Mpro Influence Catalytic Activity
Two different gene constructs coding for the SARS-CoV main proteinase were used in this study. One of them ([ΔSer1]/C-His6) has the last two residues at the C terminus of the protein (Phe305 and Gln306) replaced by a hexahistidine tag, while the codon for the N-terminal serine has been replaced by the start codon, AUG. As we can demonstrate by Edman degradation (not shown), the corresponding formylmethionine residue was cleaved off during synthesis in Escherichia coli, most probably by methionyl-aminopeptidase (Hirel et al., 1989), creating Δ(Ser1)/C-His6 Mpro. The other construct (“authentic Mpro”) carries an N-terminal GST-AVLQ tag that is autocleaved by the protease itself, and a C-terminal GP-H6 tag which can be removed by PreScission protease, so that the final proteinase product possesses the authentic N and C termini (Xue et al., 2007). The difference in enzymatic activity and dimer stability between the two constructs illustrates the importance of an intact N terminus (N finger). The kcat/KM of the (ΔSer1)/C-His6 protein, determined by using an HPLC-based assay with the pentadecapeptide SWTSAVLQ↓SGFRKWA as a substrate, was 61.9 (±1.5) mM−1min−1 (Tan et al., 2005). Using a FRET-based assay with a fluorogenic peptide substrate (Dabcyl-KTSAVLQ↓SGFRKME-Edans), this value was 167.9 mM−1min−1. The KD value for the monomer-dimer equilibrium of this enzyme species was estimated at ≈250 nM by using the method of Kuo et al. (2004), much lower than the values derived for a similar construct by Graziano et al. (2006) from small-angle scattering and chemical crosslinking. Accordingly, the fraction of catalytically active (ΔSer1)/C-His6 dimers at the enzyme concentration used in the assays (1.0 μM with respect to monomers) is about 0.8, and the kcat/KM of this enzyme species has to be corrected to ≈209 mM−1min−1. The Mpro with the authentic chain termini, however, is significantly more active under the same circumstances and has a kcat/KM value of 502.3 mM−1min−1. 
For the latter enzyme, the KD value for dimer dissociation was estimated to be significantly lower than 100 nM, in agreement with most reports in the literature (Kuo et al., 2004, Hsu et al., 2005b). This would indicate that >>90% of the authentic Mpro molecules exist as dimers at the concentration of 1.0 μM (with respect to monomers) used in the assays. Occasionally, much higher KD values have been reported for SARS-CoV Mpro dimer dissociation in the literature (e.g., Fan et al., 2004, Chen et al., 2005a), but in most of these cases, the enzyme did not have authentic chain termini.
For the authentic enzyme, we determined KM = 2.2 mM and kcat = 1105 min−1, whereas for (ΔSer1)/C-His6, the respective values were 0.24 mM and 40.3 min−1 (prior to correction for dimer dissociation; see above). Thus, the authentic enzyme, although displaying a 10-fold larger KM value, exhibited a 27-fold higher activity than (ΔSer1)/C-His6. The crystal structures presented below provide an explanation for this observation.
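The relationships between the kinetic constants quoted in this section can be checked with a few lines of arithmetic. The sketch below uses only values stated in the text; the variable and function names are illustrative. The dimer fraction of 0.8 is the rounded figure given above, so the corrected efficiency comes out near 210 mM−1min−1 rather than exactly at the ≈209 mM−1min−1 quoted, which presumably reflects an unrounded fraction.

```python
# Kinetic constants reported in the text (KM in mM, kcat in min^-1).
constructs = {
    "authentic": {"KM_mM": 2.2, "kcat_per_min": 1105.0},
    "dSer1_CHis6": {"KM_mM": 0.24, "kcat_per_min": 40.3},
}


def catalytic_efficiency(kcat_per_min, KM_mM):
    """Return kcat/KM in mM^-1 min^-1."""
    return kcat_per_min / KM_mM


efficiency = {
    name: catalytic_efficiency(v["kcat_per_min"], v["KM_mM"])
    for name, v in constructs.items()
}

# Only dimers are assumed to be active, so the measured efficiency of the
# truncated construct is divided by the dimer fraction (about 0.8 per the text).
dimer_fraction = 0.8
efficiency_corrected = efficiency["dSer1_CHis6"] / dimer_fraction

# Fold differences between the authentic and the truncated enzyme.
kcat_fold = constructs["authentic"]["kcat_per_min"] / constructs["dSer1_CHis6"]["kcat_per_min"]
KM_fold = constructs["authentic"]["KM_mM"] / constructs["dSer1_CHis6"]["KM_mM"]

print(f"kcat/KM authentic:         {efficiency['authentic']:.1f} mM^-1 min^-1")
print(f"kcat/KM truncated:         {efficiency['dSer1_CHis6']:.1f} mM^-1 min^-1")
print(f"kcat/KM truncated (corr.): {efficiency_corrected:.1f} mM^-1 min^-1")
print(f"kcat fold difference:      {kcat_fold:.1f}")
print(f"KM fold difference:        {KM_fold:.1f}")
```

The 27-fold figure refers to the ratio of the kcat values; the roughly 9-fold ratio of the KM values is what the text rounds to "10-fold".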
Kinetic Data for the Benzotriazole Inhibitors
We found the benzotriazole derivatives to be potent inhibitors of SARS-CoV Mpro, capable of inhibiting half of the authentic enzyme activity at low micromolar concentrations (<5 μM and 0.1 μM for XP-27 and XP-59, respectively), as determined using the fluorogenic peptide cleavage assay. Under the assumption that the inhibitors act as suicide substrates by covalently binding to the proteinase, a description of the compounds requires determination of both the equilibrium binding constant Ki and the inactivation rate constant (for covalent bond formation) k3 (=kinact), according to the following scheme (Scheme I, Equation VIII.127 of Dixon and Webb, 1979):
In competition with the fluorescent peptide substrate (S), the inhibitor initially forms a Michaelis complex with the proteinase, which subsequently undergoes a nucleophilic attack by Cys145 onto the ester bond, leading to the formation of a covalent enzyme-inhibitor product (see Figure 2 for chemical formulas and reaction mechanism). The leaving group of the reaction is 1-hydroxybenzotriazole. For the inhibitor XP-59, the Ki and k3 values were determined as 1.38 μM and 0.013 s−1, respectively. At an inhibitor concentration of 1.3 μM, approximately 97% inhibition of the enzyme was achieved. For the inhibitor XP-27, the exact determination of the inhibition constant proved difficult because of the inherent instability of this compound. UV absorption of XP-27 decreased in anhydrous dimethyl sulfoxide (DMSO) over time, even in the absence of enzyme, and this process was significantly enhanced by addition of a few microliters of water. Addition of XP-27 to the enzyme led to rapid inactivation, but after about 3–4 min, the enzyme activity started to recover (Figure 3A). A possible reason for this behavior is that the rather unstable thioester product may have been partially hydrolyzed. As we shall see below, our crystal structure of the enzyme acylated by XP-27 provides an explanation for this observation. In contrast to XP-27 and its thioester product with the enzyme, XP-59 and its covalent Mpro adduct were more stable, because the electron-donating nature of the p-dimethylamino group decreased the electrophilicity of the carbonyl C atom. The inhibition curve (fluorescence versus time) of XP-59 displayed the expected hyperbolic shape (Figure 3B).
Figure 3
Inhibition Kinetics of Benzotriazole Esters
Inhibition kinetics of benzotriazole esters XP-27 (A) and XP-59 (B). The graphs show the fluorescence (in arbitrary units) created by substrate cleavage as a function of time. For inhibitor XP-27, the curve initially reaches a plateau, but after about 3–4 min, enzymatic activity seems to be restored (A). For XP-59, the expected hyperbolic curve is observed (B).
Wu et al. (2006) reported a Ki value of 17.4 nM and a k3 of 0.0013 s−1 for XP-59 (their compound 4). We have no immediate explanation for the discrepancy by a factor of 80 between the Ki value reported by these authors and our findings, nor for the factor of 10 between their and our k3 values. We also do not know whether the kcat/KM value of their enzyme preparation is comparable to ours.
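For a two-step mechanism of the kind described in this section (reversible binding with constant Ki, followed by covalent bond formation with rate constant k3, in competition with substrate), the textbook treatment gives an apparent first-order inactivation rate k_obs = k3[I] / (Ki(1 + [S]/KM) + [I]) and a progress curve that levels off exponentially. The sketch below evaluates these standard expressions with the Ki and k3 values determined here for XP-59. It is not the fitting procedure used in this work, and the substrate concentration is a hypothetical placeholder because the assay concentration is not stated in this section.

```python
import math


def k_obs(inhibitor_uM, Ki_uM, k3_per_s, substrate_uM=0.0, KM_uM=1.0):
    """Apparent first-order inactivation rate (s^-1) with a competing substrate."""
    Ki_apparent = Ki_uM * (1.0 + substrate_uM / KM_uM)
    return k3_per_s * inhibitor_uM / (Ki_apparent + inhibitor_uM)


def product_signal(time_s, initial_rate, kobs_per_s):
    """Product formed by time t while the enzyme is inactivated at rate k_obs."""
    return initial_rate / kobs_per_s * (1.0 - math.exp(-kobs_per_s * time_s))


# Values from the text for XP-59.
KI_UM = 1.38
K3_PER_S = 0.013
KM_UM = 2200.0         # KM of the authentic enzyme, 2.2 mM
# Hypothetical assumption: substrate concentration in the assay.
SUBSTRATE_UM = 20.0

rate = k_obs(1.3, KI_UM, K3_PER_S, SUBSTRATE_UM, KM_UM)
half_life_s = math.log(2) / rate
print(f"k_obs at 1.3 uM inhibitor: {rate:.4f} s^-1")
print(f"half-life of active enzyme: {half_life_s:.0f} s")

# Progress curve in arbitrary units (initial rate set to 1 unit per second).
for t in (0, 60, 120, 300, 600):
    print(t, round(product_signal(t, 1.0, rate), 1))
```

In this model k_obs saturates at k3 for high inhibitor concentrations and equals k3/2 when [I] matches the apparent Ki, which is how the two constants are usually separated experimentally.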
Binding of 1-(Benzoyloxy)-Benzotriazole (Compound XP-27) to SARS-CoV Mpro (Complex 1)
Complex 1 was obtained by soaking crystals of (ΔSer1/C-His6)-SARS-CoV Mpro at pH 6.5 with 5 mM 1-(benzoyloxy)-benzotriazole (XP-27). The plate-like crystals displayed space group P21 (Table 1) and contained one Mpro dimer in the asymmetric unit. One monomer of the dimer (protomer A) was found to be in the active conformation, whereas the other exhibits an inactive conformation in which the loop forming the oxyanion hole (residues 138–145) has a catalytically incompetent shape. This has previously been observed in other SARS-CoV Mpro structures when the crystals had been grown at pH 6.0–6.5, and we have postulated that the inactive form in the dimer arises when a histidine residue (His163) at the bottom of the S1 specificity pocket is protonated at low pH, leading to a reorientation of Glu166 and to the collapse of both the S1 pocket and the oxyanion hole (Yang et al., 2003, Tan et al., 2005, Anand et al., 2005; see, for instance, Protein Data Bank [PDB] ID code: 1UJ1).
Table 1
Statistics of Data Collection and Processing, and Crystallographic Refinement
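| | XP-27 | XP-59 |
|---|---|---|
| Data Collection | | |
| Space group | P21 | C2 |
| Unit cell dimensions | | |
| a (Å) | 52.23 | 108.27 |
| b (Å) | 97.76 | 81.97 |
| c (Å) | 67.71 | 53.42 |
| β (°) | 103.01 | 104.3 |
| Wavelength (Å) | 0.8157 | 0.8080 |
| Resolution range (Å) | 40–2.25 | 65–1.95 |
| Outer shell (Å) | 2.34–2.25 | 2.02–1.95 |
| Unique reflections | 31,491 | 30,270 |
| Mean redundancy (a) | 3.2 (3.2) | 3.4 (3.4) |
| Completeness (%) (a) | 99.7 (100.0) | 97.4 (96.8) |
| Rmerge (%) (a, b) | 7.3 (46.6) | 12.3 (44.5) |
| I/σ(I) | 12.9 (1.91) | 12.8 (2.49) |
| Refinement | | |
| R factor (%) (c) | 18.30 | 16.48 |
| Free R factor (%) (d) | 25.59 | 21.01 |
| Root-mean-square deviations from ideality | | |
| Bonds (Å) | 0.016 | 0.016 |
| Angles (°) | 1.57 | 1.64 |

(a) Values in parentheses are for the highest resolution shell.
(b) $R_{\mathrm{merge}} = \sum_{hkl}\sum_{i} |I_i(hkl) - \langle I(hkl)\rangle| \,/\, \sum_{hkl}\sum_{i} I_i(hkl) \times 100\%$, where $I_i(hkl)$ and $\langle I(hkl)\rangle$ are the ith observed intensity and the average intensity of the reflection hkl, respectively.
(c) $R\ \mathrm{factor} = \sum_{hkl} \bigl||F_o| - |F_c|\bigr| \,/\, \sum_{hkl} |F_o| \times 100\%$.
(d) $\mathrm{Free}\ R\ \mathrm{factor} = \sum_{hkl \subset T} \bigl||F_o| - |F_c|\bigr| \,/\, \sum_{hkl \subset T} |F_o| \times 100\%$, where $hkl \subset T$ represents the test data set of 5% of the diffraction data.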
When we examined the electron density for monomer B, we saw extra density connected to the active-site Cys145, into which we could model a covalently bound benzoyl ester with an occupancy of 70% (Figure 4A). The benzene ring of the inhibitor lies like a lid on top of the entrance to the S1 pocket and pushes away the Glu166 side chain, which is originally (30% occupancy remaining for this conformation) blocking the pocket. The Oɛ2 atom of the reoriented Glu166 now makes a hydrogen bond to the Nɛ2 atom of His172 (2.54 Å), just like in the active conformation (Tan et al., 2005). Presumably, the highly reactive and relatively small inhibitor induces this conformation upon binding, but its steric demands on the oxyanion loop are too limited to force this segment into the active conformation. The benzene ring of the covalently bound inhibitor makes van der Waals contacts with the rim of the collapsed oxyanion loop (Cα atoms of Asn142 and Gly143).
Figure 4
Active-Site Environment of the SARS-CoV Mpro Reacted with 1-(Benzoyloxy)-Benzotriazole
Active-site environment of the SARS-CoV Mpro reacted with 1-(benzoyloxy)-benzotriazole (XP-27), with corresponding 2Fo − Fc electron density map (contoured at 1σ above the mean).
(A) Monomer B with Cys145 acylated by the 1-(benzoyloxy) moiety (70% occupancy; atom colors), which covers the S1 specificity pocket. An Fo − Fc omit map (green), contoured at 2.75σ above the mean, is shown for the inhibitor moiety. Glu166 (red) has a double conformation, one of which exists in the 30% of the molecules that do not have the active-site cysteine acylated. Met49 and Met165 (double conformation) (orange) line the S2 specificity pocket. His163 and His172 are colored in magenta. The catalytic dyad residues (Cys145 and His41) are colored by atom (yellow, carbon; red, oxygen; blue, nitrogen; green, sulfur). Loop 138–145 is in an inactive conformation (Phe140 turned away from His163) and colored gray.
(B) Monomer A represents the structure after hydrolysis of the thioester. The resulting benzoic acid molecule (atom colors; Fo − Fc omit map, contoured at 2.75σ, shown in green) has entered the S2 pocket and is sandwiched between Met49 and Met165 (orange). The latter has two conformations, one of which exists only in the 50% of the molecules that do not have the benzoic acid bound. The oxyanion loop (gray) is in an active conformation, with Phe140 stacking against His163 (magenta). The 2Fo − Fc electron density maps (blue) are contoured at 1σ above the mean.
Surprisingly, the immediate active site of monomer A is empty and contains no electron density for a covalently bound product. However, in the hydrophobic S2 specificity pocket, we found clear difference density (>4σ) for a benzoic acid molecule. The molecule is sandwiched between the side chains of Met49 and Met165. The latter adopts two conformations, one of which (occupancy 50%) is not compatible with the presence of benzoic acid at this site. 
The occupancy of the benzoic acid molecule was fixed at 30%. The observation of this molecule in the S2 site immediately raises the question as to which mechanism was at work here. We assume that the thioester formed between the benzoyl group and Cys145 (with 1-hydroxybenzotriazole being the leaving group) is attacked by one of the numerous water molecules in the substrate-binding site. This results in the production of benzoic acid and restoration of the free active-site cysteine. Because of its overall hydrophobicity, the benzoic acid then binds to the nearby S2 pocket (Figure 4B). This interpretation is in full agreement with the observed biphasic kinetics for XP-27 (Figure 3A). However, even though benzoic acid itself is not an inhibitor of the enzyme up to μM concentrations (data not shown), we cannot exclude that the compound bound to the S2 pocket might originate from degradation of free XP-27 in solution over the time of the crystal-soaking experiment. In any case, our findings help explain the observation of Wu et al. (2006) that their benzotriazole inhibitors, although displaying nanomolar Ki values, did not lead to complete inhibition of the enzyme.
If hydrolysis of the enzyme-bound thioester occurs with XP-27, why then only in the A monomer? Only in this molecule is the oxyanion loop in the correct conformation to stabilize the tetrahedral intermediate of the hydrolysis reaction. In the B monomer, this loop is in the catalytically incompetent conformation and, therefore, the thioester adduct remains relatively stable (even though the occupancy is only 70%).
Amputation of the N Finger in (ΔSer1)/C-His6 Mpro Leads to Local Structural Changes
Apart from some small differences, mostly in flexible loops at the periphery of the protein and at the C terminus, the dimeric structures seen in the enzyme modified by XP-27 and the original 1UJ1 dimer (Yang et al., 2003) overlay quite well. The overall root-mean-square deviation for 577 out of 600 Cα atoms is 0.36 Å. Interestingly, we found electron density for three DMSO molecules, which must originate from the DMSO used to dissolve the inhibitor in the crystal-soaking experiment. One of them fills a hole at the site where the N fingers of the two polypeptide chains approach each other, near the carbonyl groups of the Lys5 residues of both the A and the B monomer, that is, more or less on the noncrystallographic two-fold axis of symmetry (near the position where the magenta and green polypeptide segments meet in Figure 1). This site could be of interest for designing dimerization inhibitors. The two remaining DMSO molecules fill the space between the Mpro dimers adjacent to the crystal contacts, next to the AspB244…His#A134 (DMSO2) and the GlnA256…Trp#A31/Ala#A70 (DMSO3) contacts (# marks a residue in a symmetry-related dimer).
Because of the cloning procedure used in the present study, the N-terminal serine has been replaced by a formylmethionine which, favored by the penultimate residue being a glycine, was cleaved off during synthesis in E. coli (Hirel et al., 1989). This causes the “amputated N finger” (residues 2–7, ΔSer1) to become more flexible. As a result, residues Gly2 and Phe3 in both monomers A and B have flipped their orientation such that the N terminus of Gly2 is now, at least in protomer B, hydrogen bonded to the Sγ atom of Cys300 (domain III) of the parent monomer (at 2.39 Å in protomer B, 4.50 Å in protomer A) and the N finger can no longer stabilize the loop (residues 138–145) forming the oxyanion hole of the other monomer. This is observed in monomer B, where the oxyanion loop is in a collapsed conformation as discussed before. 
Yet, in monomer A, this loop is in the catalytically competent form although the main-chain conformation of Phe140 is no longer stabilized by the N finger of monomer B. But the phenyl ring of this residue is still embedded in a hydrophobic pocket consisting of Val114, Ala116, Tyr126, Tyr161, His163, and His172. The lack of an intact N finger, however, does make the loop 138–145 more flexible. This is reflected in the B factors of the main-chain atoms of the loop in monomer A (44.4–74.0 Å², with an average of 64.3 Å²), which are higher than the average B factor of the entire main chain (40.4 Å²). This situation is comparable to the B factors observed in the loop 138–145 of the B monomer (47.1–77.8 Å², with an average of 67.9 Å²; the average B factor for the entire main chain is 41.7 Å²). The high flexibility of the oxyanion-binding loop explains the fact that kcat for the enzyme with the amputated N finger is significantly lower (40.3 min−1) than for the authentic Mpro (1105 min−1). On the other hand, deletion of the N-terminal serine may create more space in the substrate-binding site, hence the lower KM for the (ΔSer1)/C-His6 enzyme.
We have shown in this work that deletion of Ser1, Phe305, and Gln306, plus the C-terminal attachment of a hexahistidine tag (not seen in the electron density maps), reduces enzymatic activity (kcat/KM) by ≈59%. In another study, Hsu et al. (2005a) have shown that a deletion of the three N-terminal amino acid residues reduces the enzyme's activity by a mere 24%, whereas the fourth residue, arginine, is absolutely essential for catalytic activity and important for the formation of the dimer. In agreement with this, we previously reported almost total inactivation of the homologous main proteinase from TGEV upon removal of five residues from the N terminus (Anand et al., 2002). 
In another report, removal of the seven N-terminal residues abolishes the enzymatic activity of the main proteinase almost completely, but does not seem to have much influence on dimer formation (Chen et al., 2005a) (however, the enzyme used in this study had both N- and C-terminal extensions). It has been suggested by molecular dynamics simulations that a different dimer could form under these conditions, which involves much more elaborate contact between domains III of the monomers (Chen et al., 2005a). This would be in agreement with the observed dimerization of isolated domain III of SARS-CoV Mpro (Shi et al., 2004). Although the N-terminal residues obviously have an effect on the stability of an intact substrate-binding site, the catalytic activity of the Mpro is probably mainly controlled by the protonation states of the two histidine residues, His163 and His172, in the substrate-binding site. Molecular dynamics calculations performed on the Mpro dimer have emphasized the role played by these two histidines in maintaining the correct conformation of the substrate-binding site (Tan et al., 2005).
Binding of 1-(4-Dimethylaminobenzoyloxy)-Benzotriazole (XP-59) to SARS-CoV Mpro (Complex 2)
Complex 2 was obtained by cocrystallization. SARS-CoV Mpro (with authentic chain termini) was incubated for 2 hr at 25°C with a 7-fold excess of 1-(4-dimethylaminobenzoyloxy)-benzotriazole (compound XP-59), and crystal growth was initiated by microseeding using crushed crystals of the free enzyme. Small block-shaped (0.2 mm × 0.1 mm × 0.1 mm) crystals appeared overnight and displayed monoclinic space group C2 (Table 1), isomorphous to the crystals of the free enzyme (Xue et al., 2007). These crystals, grown at pH 6.0, have one monomer in the asymmetric unit, which has the substrate-binding site in the active conformation. Formed by the crystallographic two-fold axis from the monomer, the dimer is very similar to the one in the P21 crystal structures, except for the oxyanion loops of both monomers being in the active conformation. The largest differences occur between the C termini of the monomers and between the oxyanion loops of monomer B, which is in the collapsed conformation in the P21 crystal structure. The C terminus in the C2 crystal structure is completely visible and extends in a direction different from the apparently flexible C terminus in the P21 crystal structure. The root-mean-square deviation for 584 out of 603 Cα atoms with the dimer observed in space group P21 (Yang et al., 2003; PDB ID code: 1UJ1) is 0.83 Å, whereas this value is only 0.23 Å (297 out of 306 Cα pairs) for the comparison with the enzyme with authentic chain termini that was crystallized in space group C2 (Xue et al., 2007).
Upon inspection of the active site of the Mpro-inhibitor complex, clear electron density was observed for the 1-(4-dimethylamino)-benzoyl moiety covalently bound to Cys145 (Figure 5A). Surprisingly, the side chain of the other catalytic dyad residue, His41, has undergone a rotation away from Cys145 (Figure 5B). It is now stacked against the phenyl ring of the inhibitor, making good π-π interactions (distance between the planes is 3.4 Å). The S1-binding pocket is not occupied by the inhibitor. Instead, a chain of three water molecules with well-defined density connects the His163 Nɛ2 atom at the bottom of the S1 specificity site with the side chains of Glu166 (Oɛ1) and Asn142 (Oδ1). The dimethylamino moiety of the inhibitor is situated between the side chains of Met49 and Met165, where it partly occupies the S2 specificity pocket. It is also involved in van der Waals interactions with the main-chain atoms of residues Asp187, Arg188, Gln189, and Pro39. A similar binding mode of the inhibitor to the main proteinase is observed when C2 crystals of the free enzyme are soaked at pH 7.0 with a 20-fold excess of compound XP-59 (data not shown). Binding of a substituted phenyl group to the S2 specificity subsite has also been observed in other studies (e.g., Goetz et al., 2007).
Figure 5
Active-Site Environment of the SARS-CoV Mpro Reacted with 1-(4-Dimethylaminobenzoyloxy)-Benzotriazole
Active-site environment of the SARS-CoV Mpro reacted with 1-(4-dimethylaminobenzoyloxy)-benzotriazole (XP-59), with corresponding 2Fo − Fc electron density map (contoured at 1σ above the mean).
(A) Cys145 is acylated by the 1-(4-dimethylaminobenzoyloxy) moiety (atom colors; color code as in Figure 4), which extends toward the S2 specificity pocket. An Fo − Fc omit map (green), contoured at 2.75σ above the mean, is overlaid onto the inhibitor moiety. Met49 and Met165 (orange) line the S2 specificity pocket. His163 and His172 are colored in magenta. The catalytic dyad residues (Cys145 and His41) are colored by atom. Loop 138–145 is in an active conformation and colored gray. The 2Fo − Fc electron density map is contoured at 1σ above the mean.
(B) Comparison of the active site of the free enzyme (light orange) with the 1-(4-dimethylamino)-benzoyl thioester (gray). 2Fo − Fc electron density (contoured at 1σ above the mean) is shown for the thioester and His41. The latter has rotated in comparison to the free enzyme and is now stacking against the aromatic moiety of the inhibitor.
In the crystal structure of complex 2, the loop 138–145 is being held in the active conformation by the N finger of the symmetry-related molecule (distance SerB1 N–PheA140 O: 2.67 Å) in order to create the perfect environment for the oxyanion transition state. The conformation of this loop is quite stable, as can be deduced from the B factors of the main-chain atoms of the loop compared to the overall B factor of the entire main chain (ranging from 24 to 34 Å² with an average of 28.6 Å², compared to 31.5 Å²). Equally, the amino terminus of the N finger is also hydrogen bonded to Glu166 Oɛ1 (at 2.75 Å), preventing closure of the S1 specificity pocket by the Glu166 side chain. 
This catalytically competent conformation explains the high kcat (1105 min⁻¹) observed for the authentic enzyme, when compared to the situation in the enzyme featuring the amputated N finger ([ΔSer1]/C-His6; kcat = 40.3 min⁻¹).
Comparison of the crystal structures of the two Mpro-inhibitor complexes shows that the covalently bound suicide-inhibitor products are oriented in different directions (cf. Figures 4A and 5A). The benzoyloxy moiety in complex 1 (molecule B) is covering the S1 specificity pocket (Figure 4A), whereas the dimethylamino group in complex 2 is entering the S2-binding site (Figure 5A). In an attempt to explain how the inhibition can proceed, Wu et al. (2006) have modeled a number of their benzotriazole inhibitors, among them XP-59, into the active site of the SARS-CoV Mpro. In their model of the Michaelis complex, the benzotriazole is positioned in the oxyanion hole environment and the remainder of the inhibitor is located in the vicinity of amino acid residues Thr25, Thr26, His41, Thr45, Ala46, and Met49, leaving the carbonyl group of the benzotriazole ester close enough to the Sγ of the active-site Cys145 for a nucleophilic attack to occur. This orientation of the inhibitor is possible, but the resulting covalently bound products would then have to swing at least 90° around the covalent bond to occupy a niche that is probably energetically more favorable, on top of the S1-binding site or in the S2 pocket in the two crystal structures, respectively. Because of the rather small size of the products of suicide inhibition, this should be possible without serious steric clashes. 
That structural rearrangements must have happened in the enzyme after the reaction with the inhibitor is clear in the crystal structure of Mpro complexed with the dimethylaminobenzoyloxy moiety (from compound XP-59) where, first, the oxygen atom proposed to be stabilized in the oxyanion hole (formed by the main-chain amides of Gly143 and Cys145) during the transition state of the reaction is flipped away and makes no specific contacts anymore and, second, the side chain of the active-site residue His41 has moved away from its original position to make the stacking interactions with the inhibitor product as discussed previously (Figure 5B).
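The loop-flexibility arguments in this section rest on comparing the mean main-chain B factor of loop 138–145 with that of the whole chain. A minimal sketch of such a comparison is given below. It assumes a standard fixed-column PDB file; the function names and the synthetic ATOM records (with made-up B factors) are illustrative only and are not the coordinates or the software used in this work.

```python
MAIN_CHAIN_ATOMS = {"N", "CA", "C", "O"}


def mean_main_chain_b(pdb_lines, chain_id, first_res=None, last_res=None):
    """Average B factor of main-chain atoms in one chain.

    Relies on the fixed-column PDB ATOM layout (1-based columns):
    atom name 13-16, chain ID 22, residue number 23-26, B factor 61-66.
    """
    values = []
    for line in pdb_lines:
        if not line.startswith("ATOM"):
            continue
        atom_name = line[12:16].strip()
        chain = line[21]
        res_num = int(line[22:26])
        if chain != chain_id or atom_name not in MAIN_CHAIN_ATOMS:
            continue
        if first_res is not None and res_num < first_res:
            continue
        if last_res is not None and res_num > last_res:
            continue
        values.append(float(line[60:66]))
    if not values:
        raise ValueError("no matching main-chain atoms found")
    return sum(values) / len(values)


def make_atom_line(serial, name, chain, res_num, b_factor):
    """Build a synthetic ATOM record with PDB column positions (demo only)."""
    return (f"ATOM  {serial:5d}  {name:<3s} ALA {chain}{res_num:4d}    "
            f"{0.0:8.3f}{0.0:8.3f}{0.0:8.3f}{1.0:6.2f}{b_factor:6.2f}")


# Synthetic demo data: residues 130-149 of chain A, with an artificially
# mobile loop 138-145. The B factors are invented for illustration.
demo_lines = []
serial = 1
for res_num in range(130, 150):
    b_value = 65.0 if 138 <= res_num <= 145 else 40.0
    for atom in ("N", "CA", "C", "O", "CB"):  # CB is ignored (side chain)
        demo_lines.append(make_atom_line(serial, atom, "A", res_num, b_value))
        serial += 1

loop_mean = mean_main_chain_b(demo_lines, "A", 138, 145)
chain_mean = mean_main_chain_b(demo_lines, "A")
print(f"loop 138-145: {loop_mean:.1f} A^2, whole chain: {chain_mean:.1f} A^2")
```

Applied to a real coordinate file, the same two calls would be made once per monomer (chain IDs A and B), which is the comparison reported for the two complexes.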
Significance
Caused by a newly identified coronavirus, SARS-CoV, severe acute respiratory syndrome (SARS) is an emerging disease that has the capacity to become a global threat to world health because of its rapid transmission. The animal reservoir for the virus (most probably bats) is still existent and a renewed zoonotic transition to man cannot be excluded. Therefore, efficient drugs are needed in order to be prepared for a possible future outbreak. The SARS-CoV main proteinase (MThe structures of complexes of the M
reported that these compounds are not cell toxic, at least not up to a concentration of 100 μM. Starting from these lead structures, the inhibitors can now be modified by substituents occupying the S1 and S2 specificity pockets in order to improve their potency and specificity. The structures presented here also demonstrate that electron-donating substituents in the benzoyl moiety are useful to stabilize the thioester bonds with the active-site cysteine against hydrolysis. Finally, the structure of the M
Experimental Procedures
Expression and Purification
The C-terminally His6-tagged SARS-CoV Mpro was cloned and overexpressed in E. coli BL21 (DE3) cells. Plasmid pET-SCoV-Mpro-Δ305-306(6× His) was kindly provided by Dr. J. Ziebuhr. In this plasmid, the nucleotides coding for the C-terminal residues Phe305 and Gln306 have been replaced by a hexahistidine tail, whereas the codon for the N-terminal Ser1 was replaced by a start codon. The resulting N-formylmethionine was most likely cleaved off posttranslationally by methionine aminopeptidase during expression in E. coli (Hirel et al., 1989). Edman degradation showed the polypeptide chain to begin with residue Gly2. The overexpressed protein was purified by nickel-NTA affinity chromatography followed by a gel-filtration step. Only the middle fractions of the resulting elution peak were pooled and used for crystallization experiments.
The plasmid coding for SARS-CoV Mpro with authentic N and C termini was a gift from Prof. Z. Rao (Xue et al., 2007). The plasmid was transformed into E. coli BL21 (DE3) cells, and the protein was expressed as an N-terminal GST fusion with a C-terminal tag of eight extra residues: a glycine, a proline, and six histidines (GPH6). Because of the construction of the plasmid, the GST tag was cleaved off by autocatalytic processing of the active Mpro. The C-terminally GPH6-tagged product was subjected to nickel-NTA affinity chromatography and concentrated in PreScission cleavage buffer (Walker et al., 1994). The PreScission protease cleaved off the C-terminal GPH6 tag, producing an authentic SARS-CoV Mpro. The wild-type protein was further purified using anion-exchange chromatography (Xue et al., 2007).
Preparation of Benzotriazole Esters
To a stirred solution of 2.5 mmol N-hydroxybenzotriazole and 2.75 mmol triethylamine in 4 ml dichloromethane (DCM) at room temperature, 2.5 mmol benzoyl chloride was slowly added. The reaction mixture was stirred for 2 hr, diluted with DCM (20 ml), washed with saturated NaHCO3 solution (2 × 4 ml) and brine (1 × 5 ml), dried over MgSO4, and filtered. The solvent was removed under reduced pressure and the residue was purified to provide the desired benzotriazole.
1-(Benzoyloxy)-Benzotriazole (XP-27)
After the reaction mixture had been worked up, the crude product was purified by column chromatography using diethyl ether:cyclohexane (2:3). 1-(Benzoyloxy)-benzotriazole was obtained in 91% yield as a white solid. Melting point 74°C; MS (ESI) m/z = 239.0 [M]+, 105.0 [C6H5CO]+.
1-(4-Dimethylaminobenzoyloxy)-Benzotriazole (XP-59)
After the reaction mixture had been worked up, the crude product was recrystallized from hexane:ethyl acetate. 1-(4-Dimethylaminobenzoyloxy)-benzotriazole was obtained in 80% yield as a slightly yellow solid. Melting point 153°C; MS (ESI) m/z = 869.3 [3M + Na]+, 587.2 [2M + Na]+, 305.1 [M + Na]+.
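The reported ESI peaks can be cross-checked against calculated monoisotopic masses. The sketch below does this in Python. The molecular formulas (C13H9N3O2 for XP-27, C15H14N4O2 for XP-59, C7H5O for the benzoyl fragment) are not given in the text and are assumed here from the compound names; the function names are illustrative.

```python
# Monoisotopic masses (u) of the most abundant isotopes, and the electron mass.
MONOISOTOPIC = {"C": 12.0, "H": 1.007825, "N": 14.003074,
                "O": 15.994915, "Na": 22.989770}
ELECTRON = 0.000549

# Assumed formulas, derived from the compound names (not stated in the text).
XP27 = {"C": 13, "H": 9, "N": 3, "O": 2}
XP59 = {"C": 15, "H": 14, "N": 4, "O": 2}
BENZOYL = {"C": 7, "H": 5, "O": 1}


def mono_mass(formula):
    """Monoisotopic mass of a neutral species given as {element: count}."""
    return sum(MONOISOTOPIC[element] * count for element, count in formula.items())


def cation_mz(formula, n_molecules=1, adduct=None):
    """m/z of a singly charged cation: n*M (+ adduct atom) minus one electron."""
    mass = n_molecules * mono_mass(formula)
    if adduct is not None:
        mass += MONOISOTOPIC[adduct]
    return mass - ELECTRON


# (label, calculated m/z, m/z reported in the text)
checks = [
    ("XP-27 [M]+", cation_mz(XP27), 239.0),
    ("[C6H5CO]+", cation_mz(BENZOYL), 105.0),
    ("XP-59 [M + Na]+", cation_mz(XP59, 1, "Na"), 305.1),
    ("XP-59 [2M + Na]+", cation_mz(XP59, 2, "Na"), 587.2),
    ("XP-59 [3M + Na]+", cation_mz(XP59, 3, "Na"), 869.3),
]

for label, calculated, reported in checks:
    print(f"{label:18s} calculated {calculated:9.4f}  reported {reported:6.1f}")
```

With these assumed formulas, every calculated value lies within 0.1 m/z unit of the corresponding reported peak.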
Enzyme Kinetics
Both a fluorescence-based and an HPLC assay were used to assess the activity of the SARS-CoV Mpro. The former made use of the fluorogenic peptide substrate Dabcyl-KTSAVLQ↓SGFRKME-Edans (95% purity; Biosyntan GmbH, Berlin, Germany), which contains the main proteinase cleavage site (indicated by the arrow). The enhanced fluorescence due to the cleavage of this substrate as catalyzed by the enzyme was monitored at 490 nm with excitation at 340 nm, using a Cary Eclipse fluorescence spectrophotometer. The experiments were performed in a buffer consisting of 20 mM Tris-HCl (pH 7.3), 100 mM NaCl, 1 mM EDTA. Kinetic parameters KM and kcat were determined by initial-rate measurements at 25°C. The reaction was initiated by adding proteinase (final concentration 0.5 μM) to a solution containing different final concentrations of the fluorogenic peptide (10–40 μM). The initial rates were converted to enzyme activities (μmol substrate cleaved per s). Kinetic constants were derived by fitting the data to the Michaelis-Menten equation with the nonlinear regression analysis program SigmaPlot (SigmaPlot 2000, version 6.00; SPSS, Chicago, IL, USA).
The HPLC-based approach has been described elsewhere (Tan et al., 2005). Briefly, the substrate used here was the pentadecapeptide SWTSAVLQ↓SGFRKWA, which also resembles the cleavage site at the N terminus of the SARS-CoV Mpro, except for the two tryptophans at either side of the cleavage site replacing the P7 Ile and the P6′ Met residues. The 20 μl reaction mixture contained the inhibitor at concentrations varying from 5 to 500 μM, 1.5 μM Mpro dimer (diluted in 100 mM morpholinoethanesulfonic acid monohydrate [MES] [pH 7.0]), and 250 μM substrate. Inhibitor and Mpro were incubated for 5 min at room temperature before adding the substrate. The reaction was run for 5 min at 25°C, then stopped by adding 5 μl trifluoroacetic acid (0.1%) and put on ice or stored at −20°C. The samples were centrifuged and 80 μl MES buffer (100 mM) was added. 
Products and substrate were separated on a reverse-phase HPLC column (Jupiter 4μ Proteo 90A, Phenomenex, Torrance, CA, USA), using a gradient of acetonitrile in 0.1% trifluoroacetic acid. The product and substrate peaks were detected at 280 nm, and IC50 values were determined.
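The Michaelis-Menten fit described for the fluorescence assay can be illustrated with a small, dependency-free Python sketch. This is a stand-in for the SigmaPlot regression, not a reproduction of it: for each trial KM the least-squares Vmax has a closed form, so only KM is searched (a coarse log-spaced scan followed by a golden-section refinement). The data are synthetic, generated from hypothetical values KM = 50 μM and Vmax = 2.0 μM/min; only the enzyme concentration (0.5 μM) and the substrate range (10–40 μM) are taken from the text.

```python
import math


def best_vmax(km, substrate, rates):
    """Closed-form least-squares Vmax for a fixed trial KM."""
    x = [s / (km + s) for s in substrate]
    return sum(xi * vi for xi, vi in zip(x, rates)) / sum(xi * xi for xi in x)


def sum_squared_error(km, substrate, rates):
    """Residual sum of squares of v = Vmax*[S]/(KM + [S]) at the best Vmax."""
    vmax = best_vmax(km, substrate, rates)
    return sum((v - vmax * s / (km + s)) ** 2 for s, v in zip(substrate, rates))


def fit_michaelis_menten(substrate, rates, km_low=0.01, km_high=1.0e4, grid=400):
    """Return (KM, Vmax) minimising the residual sum of squares."""
    # Coarse scan on a log-spaced KM grid to bracket the minimum.
    log_low, log_high = math.log(km_low), math.log(km_high)
    step = (log_high - log_low) / (grid - 1)
    logs = [log_low + i * step for i in range(grid)]
    errors = [sum_squared_error(math.exp(x), substrate, rates) for x in logs]
    best = errors.index(min(errors))
    a = logs[max(best - 1, 0)]
    b = logs[min(best + 1, grid - 1)]
    # Golden-section refinement of log(KM) inside the bracket.
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    for _ in range(100):
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
        if (sum_squared_error(math.exp(c), substrate, rates)
                < sum_squared_error(math.exp(d), substrate, rates)):
            b = d
        else:
            a = c
    km = math.exp(0.5 * (a + b))
    return km, best_vmax(km, substrate, rates)


# Synthetic, noise-free initial rates (uM/min) from hypothetical parameters.
TRUE_KM, TRUE_VMAX = 50.0, 2.0
ENZYME_CONC = 0.5  # uM, final proteinase concentration from the text
substrate_conc = [10.0, 15.0, 20.0, 30.0, 40.0]  # uM, within the 10-40 uM range
initial_rates = [TRUE_VMAX * s / (TRUE_KM + s) for s in substrate_conc]

km_fit, vmax_fit = fit_michaelis_menten(substrate_conc, initial_rates)
kcat_fit = vmax_fit / ENZYME_CONC  # min^-1 when rates are in uM/min
print(f"KM = {km_fit:.2f} uM, Vmax = {vmax_fit:.3f} uM/min, kcat = {kcat_fit:.2f} min^-1")
```

Because the substrate range in such an experiment may lie below KM, KM and Vmax can be strongly correlated in the fit; with real, noisy data the parameter uncertainties would therefore need to be reported alongside the estimates.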
Inhibition Assay
The fluorescence-based assay was also used for the determination of the inhibitor constants of the two benzotriazole compounds. These tests were performed with the untagged, authentic SARS-CoV Mpro. Values of Ki and k3 (see scheme above) were calculated from plots of 1/kobs versus 1/[I] according to Equation 1 (Equation VIII.129 of Dixon and Webb, 1979):
To obtain the observed first-order inhibition rate constant kobs, time-dependent progress curves were fitted to a first-order exponential (Equation 2; equivalent to Equation A10 of Tian and Tsou, 1982, with the addition of D; see below):
F is the product fluorescence (measured in arbitrary units), v is the initial velocity, t is time, and D is a displacement term to account for the nonzero emission at the start of data collection.
In the experiment, the Ki and k3 values for the irreversible inhibitors were obtained from reactions initiated by addition of the enzyme (final concentration 0.5 μM) to 20 mM Tris-HCl buffer (pH 7.3) containing 100 mM NaCl, 1 mM EDTA, 20 μM fluorogenic substrate, and inhibitor. Four different inhibitor concentrations were tested, in at least 10-fold molar excess over the enzyme in most cases. Data from the continuous assays were analyzed with the nonlinear regression analysis program SigmaPlot.
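Equations 1 and 2 are not reproduced in this text, so the following Python sketch relies on commonly used textbook forms that are assumptions here and may differ in detail from the equations actually used: a progress curve F(t) = (v/kobs)[1 − exp(−kobs·t)] + D, and a double-reciprocal relation 1/kobs = 1/k3 + (Ki/k3)(1/[I]) that neglects any correction for competition by the substrate. The kobs values are synthetic, generated from hypothetical Ki and k3 values.

```python
import math


def progress_curve(t, v, k_obs, d):
    """Assumed first-order progress curve: F = (v/kobs)(1 - exp(-kobs*t)) + D."""
    return (v / k_obs) * (1.0 - math.exp(-k_obs * t)) + d


def linear_fit(x, y):
    """Ordinary least-squares line y = slope*x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def double_reciprocal(inhibitor_conc, k_obs_values):
    """Estimate (Ki, k3) from a plot of 1/kobs versus 1/[I].

    Assumed relation: 1/kobs = 1/k3 + (Ki/k3) * (1/[I]),
    so intercept = 1/k3 and slope = Ki/k3.
    """
    x = [1.0 / i for i in inhibitor_conc]
    y = [1.0 / k for k in k_obs_values]
    slope, intercept = linear_fit(x, y)
    k3 = 1.0 / intercept
    ki = slope * k3
    return ki, k3


# Synthetic demo: four inhibitor concentrations (uM), hypothetical constants.
TRUE_KI, TRUE_K3 = 10.0, 0.05  # uM and s^-1, invented for illustration
inhibitor = [5.0, 10.0, 20.0, 40.0]
k_obs = [TRUE_K3 * i / (TRUE_KI + i) for i in inhibitor]

ki_est, k3_est = double_reciprocal(inhibitor, k_obs)
print(f"Ki = {ki_est:.2f} uM, k3 = {k3_est:.4f} s^-1")
```

In practice each kobs would first be obtained by fitting `progress_curve` to a measured fluorescence trace at one inhibitor concentration; the double-reciprocal step then combines the four kobs values into Ki and k3.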
Crystallization of the Complexes
Before crystallization, the C-terminally His6-tagged SARS-CoV Mpro ([ΔSer1]/C-His6) was dialyzed against 50 mM Tris-CH3COOH (pH 7.5), 1 mM EDTA, 5 mM DTT and concentrated to 10 mg/ml. The protein was crystallized at 18°C by vapor diffusion using hanging drops or sitting drops. The crystallization conditions were 5% PEG 10,000, 0.1 M MES (pH 6.5), 3% ethylene glycol, 50 mM ammonium acetate (Lee et al., 2005). Plate-shaped crystals of monoclinic form appeared after 3 days to a week, sometimes accompanied by needles and irregularly shaped crystals, which turned out to belong to the tetragonal space group (see Tan et al., 2005). One of the monoclinic crystals was soaked for 4 days at 18°C in the crystallization buffer solution containing 5 mM 1-(benzoyloxy)-benzotriazole (XP-27) and 10% DMSO to increase the solubility of the inhibitor.
The untagged SARS-CoV Mpro with authentic chain termini, concentrated to 10 mg/ml in 50 mM Tris-HCl (pH 7.5), 40 mM NaCl, 1 mM EDTA, 5 mM DTT, was incubated for 2 hr at 25°C with a 7-fold excess of the inhibitor 1-(4-dimethylaminobenzoyloxy)-benzotriazole (XP-59) and solubilized in 8% PEG 6000, 0.1 M MES (pH 6.0), 3% 2-methyl-2,4-pentanediol (MPD). Block-shaped crystals grew overnight at 20°C in 6%–8% PEG 6000, 0.1 M MES (pH 6.0), 3% MPD, after initiation of nucleation by microseeding using crushed C2 crystals of the free enzyme.
Prior to data collection, crystals obtained from both procedures were transferred for a couple of seconds to a cryoprotectant solution containing the crystallization conditions and 30% PEG 400.
Crystallographic Data Collection and Processing, and Structure Elucidation and Refinement
Statistics of data collection, processing, and refinement are summarized in Table 1. Diffraction data were collected at 100K using monochromatic synchrotron radiation provided by beamlines X11 and X13 (EMBL and Universities of Hamburg and Lübeck, DESY, Hamburg, Germany; wavelength 0.8157 and 0.8080 Å, respectively). Intensities were measured using a MAR CCD detector. Indexing, scaling, and merging of data sets were performed using DENZO and SCALEPACK (Otwinowski and Minor, 1997). Molecular replacement and refinement were carried out using MOLREP (Vagin and Teplyakov, 1997) and REFMAC (Murshudov et al., 1997), respectively, as implemented in the CCP4 suite (CCP4, 1994). Simple bulk scaling in REFMAC was preferred over Babinet scaling because this procedure better reflected the overall Wilson B factor. The search model used for molecular replacement was either the dimer or monomer A of the crystal structure of PDB ID code 1UJ1 for the data sets of SARS-CoV Mpro complexed with XP-27 or XP-59, respectively. The computer graphics program Coot, implemented in the CCP4 suite, was used for interpretation of the electron density maps and model building (Emsley and Cowtan, 2004). The molecular graphics package PyMOL was used to generate the figures (DeLano, 2002).