The SARS coronavirus main peptidase (SARS-CoV M(pro)) plays an essential role in the life-cycle of the virus and is a primary target for the development of anti-SARS agents. Here, we report the crystal structure of M(pro) at a resolution of 1.82 Angstroms, in space group P2(1) at pH 6.0. In contrast to the previously reported structure of M(pro) in the same space group at the same pH, the active sites and the S1 specificity pockets of both protomers in the structure of M(pro) reported here are in the catalytically competent conformation, suggesting their conformational flexibility. We report two crystal structures of M(pro) having an additional Ala at the N terminus of each protomer (M(+A(-1))(pro)), both at a resolution of 2.00 Angstroms, in space group P4(3)2(1)2: one unbound and one bound by a substrate-like aza-peptide epoxide (APE). In the unbound form, the active sites and the S1 specificity pockets of both protomers of M(+A(-1))(pro) are observed in a collapsed (catalytically incompetent) conformation; whereas they are in an open (catalytically competent) conformation in the APE-bound form. The observed conformational flexibility of the active sites and the S1 specificity pockets suggests that these parts of M(pro) exist in dynamic equilibrium. The structural data further suggest that the binding of APE to M(pro) follows an induced-fit model. The substrate likely also binds in an induced-fit manner in a process that may help drive the catalytic cycle.
The SARS coronavirus main peptidase (SARS-CoV M(pro)) plays an essential role in the life-cycle of the virus and is a primary target for the development of anti-SARS agents. Here, we report the crystal structure of M(pro) at a resolution of 1.82 Angstroms, in space group P2(1) at pH 6.0. In contrast to the previously reported structure of M(pro) in the same space group at the same pH, the active sites and the S1 specificity pockets of both protomers in the structure of M(pro) reported here are in the catalytically competent conformation, suggesting their conformational flexibility. We report two crystal structures of M(pro) having an additional Ala at the N terminus of each protomer (M(+A(-1))(pro)), both at a resolution of 2.00 Angstroms, in space group P4(3)2(1)2: one unbound and one bound by a substrate-like aza-peptide epoxide (APE). In the unbound form, the active sites and the S1 specificity pockets of both protomers of M(+A(-1))(pro) are observed in a collapsed (catalytically incompetent) conformation; whereas they are in an open (catalytically competent) conformation in the APE-bound form. The observed conformational flexibility of the active sites and the S1 specificity pockets suggests that these parts of M(pro) exist in dynamic equilibrium. The structural data further suggest that the binding of APE to M(pro) follows an induced-fit model. The substrate likely also binds in an induced-fit manner in a process that may help drive the catalyticcycle.
Severe acute respiratory syndrome (SARS) is a highly transmissible, infectious and often fatal disease (World Health Organization–Severe acute respiratory syndrome†
). Since its outbreak in 2002 and rapid spread throughout early 2003, efforts in the development of anti-SARS vaccines and drugs have taken on paramount importance.SARS is caused by a coronavirus (CoV);[1], 2., 3. it is an enveloped, positive-sense single-stranded RNA virus. Anti-SARS therapeutics could target any one of several major steps in the viral life-cycle, such as virus–cell interactions, virus entry, intracellular viral replication, virus assembly and exit. Extensive studies have been carried out on the proteins involved in these steps. The intracellular replication of CoV is mediated by a replicase complex derived from two virally coded polyprotein precursors, pp1a (486 kDa) and pp1ab (790 kDa).6., 7. The formation of this replicase complex requires the extensive processing of the two polyproteins by two cysteine peptidases encoded within them; namely, the main peptidase (Mpro), also known as the 3C-like peptidase (3CLpro) because of its similarity to the 3C peptidases of Picornaviridae, and the accessory papain-like peptidase 2 (PL2pro), which cleaves at three sites in the N-proximal regions of the two polyproteins. By contrast, Mprocleaves at 11 sites in the central and C-proximal regions of the two polyproteins, releasing key viral replication proteins, such as an RNA polymerase and a helicase. As an essential protein, Mpro is an attractive molecular target for the development of anti-SARS drugs.SARS-CoV Mpro is catalytically active only in the homodimeric form; each protomer has a molecular mass of 33.8 kDa.[9], 10., 11., 12., 13., [14] Considerable efforts have been directed to the X-ray structural studies of Mpro, resulting in the availability of many crystal structures of Mpro and its variants over a pH range of 5.9–9.0 in a variety of space groups. All of these structures show that the two protomers of Mpro are oriented almost perpendicular to each other. The N-terminal residues 1–7 of each protomer constitute the N-finger, of which Arg4 was shown to be mandatory for the dimerization and the exhibition of the catalytic activity of Mpro. Beyond the N-finger, each protomer consists of three domains. Domain I (residues 8–101) and domain II (residues 102–184) comprise a two-β-barrel fold similar to that of the chymotrypsin-type serine peptidases. Domain III (residues 201–300) has five α-helices and is connected to domain II by a long loop (residues 185–200). Each protomer has its own substrate-binding region situated in the cleft between domains I and II. Recent mutagenesis studies have confirmed that, similar to the main peptidases from human coronavirus strain 229E,15., 16. and porcine transmissible gastroenteritis coronavirus, SARS-CoV Mpro is a cysteine peptidase with a Cys-Hiscatalytic dyad at the active site.16., 18. As suggested by the structure-based sequence alignment of the main peptidases (including their flanking residues in the polyproteins) from SARS-CoV and other coronaviruses, and confirmed by in vitro studies,7., [9] Mprocleaves preferentially at a consensus sequence for the P4 to P1′ residues of substrates (the nomenclature is based on that used by Schechter & Berger with the arrow indicating the cleavage site): (amino acid with a small side-chain)-(any amino acid)-Leu-Gln↓(Ala, Ser, Gly).A number of Mpro inhibitors have been proposed using structure-based discovery20., 21. and experimental screening.22., [23], [24] The crystal structures of Mpro inhibited by some peptidomimeticcompounds have been determined.25., 26., [27] Recently, virtual screening followed by experimental evaluations have identified the old drug cinanserin as a strong inhibitor of the replication of SARS-CoV; cinanserin likely targets Mpro.We have reported on the kinetic and structural characterization of the inhibition of SARS-CoV Mpro by an aza-peptide epoxide (APE, Figure 1
). APEs were synthesized as a new class of inhibitors apparently specific for clan CD cysteine peptidases,30., 31. including the legumains, and the caspases. Each APE has an aza-peptide component, with an epoxide moiety attached to the carbonyl group of the P1 residue. The side-chain of the P1 residue predominantly determines the target-peptidase specificity of an APE. The substituent on the epoxide C2 atom also allows some tuning of both the inhibitory activity and specificity of APE towards a particular target peptidase. In the APE, the Cα atom of the P1 residue is replaced by a nitrogen atom, yielding an aza-amino acid residue. This introduces trigonal planar geometry to the α-atom of the P1 residue and reduces the electrophilicity of the carbonyl C atom of the P1 residue, thereby making the carbonyl group of the P1 residue resistant to nucleophilic attack. It has been proposed that APEs inhibit their target peptidases irreversibly by a mechanism in which the catalyticCys Sγ atom nucleophilically attacks one of the two epoxidecarbon atoms (C2 or C3) of APE.
,
32., 33. This results in the opening of the conformationally strained epoxide ring, with the formation of a covalent bond between the Cys Sγ atom and the attacked APE atom. An APE containing Gln at P1 inhibits Mpro with a k
inact/K
i
= 1900(±400) M−1 s−1. In this reaction, the catalyticCys145 Sγ atom attacks the epoxideC3 atom of the APE.
Figure 1
Inhibition of SARS-CoV Mpro by the aza-peptide epoxides (APEs) synthesized for our study, Cbz-Leu-Phe-AGln-EP-CooEt. The epoxide carbon atoms are numbered and their stereochemistry is omitted for simplicity. The proposed mechanism for the irreversible inhibition of clan CD cysteine peptidases by APEs is indicated by arrows. In the inhibition of Mpro, route I was adopted. Cbz, the benzyloxycarbonyl group; AGln, aza-glutamine; EP, epoxide; COOEt, ethyl ester.
Inhibition of SARS-CoV Mpro by the aza-peptide epoxides (APEs) synthesized for our study, Cbz-Leu-Phe-AGln-EP-CooEt. The epoxidecarbon atoms are numbered and their stereochemistry is omitted for simplicity. The proposed mechanism for the irreversible inhibition of clan CD cysteine peptidases by APEs is indicated by arrows. In the inhibition of Mpro, route I was adopted. Cbz, the benzyloxycarbonyl group; AGln, aza-glutamine; EP, epoxide; COOEt, ethyl ester.
Results and Discussion
Structure determination
The parameters and statistics derived from data processing and structure refinement are summarized in Table 1
. SARS-CoV Mprocrystallizes in space group P21 in conditions slightly different from those reported previously. Each asymmetric unit contains both protomers (A and B) of the physiological dimer. In the electron density maps of the unbound Mpro, residues 1A–44A, 50A–305A and 1B–302B could be identified. In the Ramachandran plot of this structure, Asp33A, Asn84A and Asn84B are in the generously allowed regions, and Asp33B, Glu47B, Tyr154A and Tyr154B are in the disallowed regions. In both independent protomers, the Asp33 N atom forms a hydrogen bond with the Thr98 Oγ1 atom, and the Asp33carbonyl O atom forms hydrogen bonds with the Trp31 Nε1 and the Asn95 N atoms. Also, the Asn84 Nδ2 atom forms a hydrogen bond with the Glu178carbonyl O atom. The poorly defined electron densities of the side-chains of Glu47B, Tyr154A and Tyr154B indicate dynamic disorder.
Table 1
Parameters and statistics derived from X-ray diffraction data processing and structure refinement
Unbound Mpro
Unbound M+A(-1)pro
M+A(-1)pro:APE
A. Data processing
Wavelength (Å)
1.116
1.116
1.116
Resolution limita (Å)
41.25–1.82 (1.89–1.82)
24.85–2.00 (2.07–2.00)
24.78–2.00 (2.07–2.00)
Space group
P21
P43212
P43212
Unit-cell parameters
a (Å)
52.39
70.29
70.09
b (Å)
96.19
70.29
70.09
c (Å)
67.91
102.87
103.86
α (deg.)
90.00
90.00
90.00
β (deg.)
102.91
90.00
90.00
γ (deg.)
90.00
90.00
90.00
Mosaicity (deg.)
0.63
0.65
0.58
No. unique reflections
56,644 (5313)
17,964 (1739)
18,098 (1760)
Redundancy
2.0 (1.8)
6.8 (6.2)
7.1 (6.5)
Completeness (%)
96.4 (90.6)
99.3 (97.8)
99.8 (99.4)
Rsymb (%)
3.6 (20.1)
8.8 (48.8)
7.4 (44.4)
<I/σ(I)>
20.0 (3.4)
19.6 (4.2)
23.7 (4.9)
B. Structure refinement
Resolution range (Å)
41.25–1.82
24.85–2.00
24.78–2.00
Rworkc (%)
17.0
17.5
19.6
Rfreec (%)
21.7
24.1
27.0
Number of non-hydrogen atoms per asymmetric unit (average B-factor, Å2)
Protein
4753 (30.12)
2332 (23.00)
2332 (23.93)
APE
Not applicable
Not applicable
46 (22.69)
Solvent
591 (37.81)
162 (34.38)
127 (32.49)
rms deviation from ideal geometry
Bond lengths (Å)
0.019
0.019
0.022
Bond angles (deg.)
1.778
1.765
2.064
Ramachandran plot
Favored (%)
91.3
90.0
90.0
Allowed (%)
7.3
8.5
8.1
Generously allowed (%)
0.6
0.8
0.8
Disallowed (%)
0.8
0.8
1.2
Numbers in parentheses refer to the highest resolution bins.
Rsym = ∑∑|I − |/ ∑∑I, where I and are the ith observed intensity and average intensity of the reflection hkl, respectively.
Rwork = ∑||Fo| − |Fc||/∑|Fo|, where |Fo| and |Fc| are the observed and calculated structure factor amplitudes of a particular reflection, respectively. The summation is over 95% of the reflections in the specified resolution range. The remaining 5% of the reflections were selected randomly before the structure refinement and are not included in the structure refinement. Rfree was calculated over these reflections with the equation used for Rwork.
Parameters and statistics derived from X-ray diffraction data processing and structure refinementNumbers in parentheses refer to the highest resolution bins.Rsym = ∑∑|I − |/ ∑∑I, where I and are the ith observed intensity and average intensity of the reflection hkl, respectively.Rwork = ∑||Fo| − |Fc||/∑|Fo|, where |Fo| and |Fc| are the observed and calculated structure factor amplitudes of a particular reflection, respectively. The summation is over 95% of the reflections in the specified resolution range. The remaining 5% of the reflections were selected randomly before the structure refinement and are not included in the structure refinement. Rfree was calculated over these reflections with the equation used for Rwork.SARS-CoV M+A(-1)
procrystallized in space group P212121
as well as P43212. Each asymmetric unit of the latter contains only one protomer of the dimer; the two protomers of each dimer are related by the crystallographic 2-fold symmetry axis parallel with the C-face diagonal of the unit cell. In the electron density maps of the unbound M+A(-1)
pro, residues 3–300 of the protomer were clearly identified. In the Ramachandran plot of this structure, Asp33 and Ser139 are in the generously allowed regions, and Asn84 and Tyr154 are in the disallowed regions. The side-chain of Tyr154 makes contact with those of Ile78 and probably Arg76 from a neighboring asymmetric unit. The electron density of Ser139 is not well defined. In the electron density maps of the M+A(-1)
pro:APE complex, residues 2–300 of the protomer were identified. In the Ramachandran plot of this structure, Asp33 and Asn277 are in the generously allowed regions, and Asn84, Tyr154 and Ile286 are in the disallowed regions. The side-chains of Thr285 and Ile286 from opposite protomers of the dimer contact each other. The electron density of Asn277 is not well defined.Superpositions of the crystal structures reported here with all of those previously reported show no significant difference in the protomer orientation and the overall fold.
Active sites and substrate-binding regions of the unbound SARS-CoV Mpro
The previously reported crystal structure of SARS-CoV Mpro in space group P21 at pH 6.0 showed the collapse of the active site and S1 specificity pocket of one of the protomers, whereas the structures in the same space group at pH 7.6 and at pH 8.0 showed all the active sites to be in the catalytically competent conformation (Table 2
). On the basis of this trend, a pH-triggered switch for the catalytic activity of Mpro was proposed. We have now grown crystals of Mpro in the same space group (P21 with the same unit-cell constants) at pH 6.0 under slightly different conditions; the active sites and the S1 specificity pockets of both protomers are in the catalytically competent conformation (Figure 2, Figure 3, Figure 4
). More specifically, superposition of protomers A and B of the resulting Mpro structure (rmsd 0.50 Å for 1070 out of 1172 main-chain atoms) shows good agreement in most atomic positions. In both independent protomers, the catalytic dyad has a distance of 3.6 Å between the His41 Nε2 atom and the Cys145 Sγ atom, and the Cys145 Sγ atom is coplanar with the atoms of the His41 imidazole ring. Residues Gly143 to Cys145 are in the proper conformation to form the oxyanion hole that accommodates the carbonyl O atom of the scissile peptide bond of the substrate; the position to be occupied by the carbonyl O atom is occupied by a water molecule. The O atom of the water molecule forms hydrogen bonds at distances of 3.0 Å with the Gly143 N atom and with the Cys145 N atom (Figure 3(a) and (b)).
Table 2
Crystal structures of SARS-CoV Mpro (wild-type and variants) determined in different conditions
Ref.
Inhibitor
Space group
pH
In the catalytically competent conformation?
Protomer Aa
Protomer Ba
Wild-type SARS-CoV Mpro
b
–
P21
6.0
Yes
Yes
36
–
P43212
5.9
No
No
25
–
P21
6.0
Yes
No
25
CMK
P21
6.0
Yes
No
29
–
C2
6.5
Yes
Yes
29
APE
C2
6.5
Yes
Yes
37
–
P21212
6.5
Yes
Yes
36
–
P21212
6.6
No
No
42
–
P21212
7.0
Yes
Yes
25
–
P21
7.6
Yes
Yes
25
–
P21
8.0
Yes
Yes
SARS-CoV M+A(-1)pro
b
–
P43212
6.5
No
No
b
APE
P43212
6.5
Yes
Yes
29
APE
P212121
6.5
Yes
Yes
SARS-CoV Mprowith additional Ser-Leu at the N termini of both protomers
c
–
P21
6.5
Yes
Yes
SARS-CoV MproC145A variant
42
product
C2
9.0
Yes
Yes
SARS-CoV Mprowith additional Gly-Pro-Leu-Gly-Ser at the N termini of both protomers
27
I2
P21
6.0
Yes
Yes
27
N1
P21
6.0
Yes
Yes
27
N3
P21
6.0
Yes
Yes
27
N9
P21
6.0
Yes
Yes
For any structure whose asymmetric unit contains only one protomer, that protomer represents both protomer A and protomer B.
These are the structures reported here.
PDB accession code 1Q2W; J. B. Bonanno et al., unpublished results.
Figure 2
Electron densities in the Fo–Fc omit maps for residues Lys137 to Ser144 of SARS-CoV Mpro and M+A(−1)pro. (a) Protomer A of the unbound Mpro. (b) Protomer B of the unbound Mpro. (c) Unbound M+A(−1)pro. (d) APE-bound M+A(−1)pro. The residues in (c) are in the catalytically incompetent conformation.
Figure 3
Active sites and S1 specificity pockets of SARS-CoV Mpro and M+A(−1)pro, viewed from the S2 specificity pockets. (a) Protomer A of the unbound Mpro. (b) Protomer B of the unbound Mpro. (c) Unbound M+A(−1)pro. (d) APE-bound M+A(−1)pro (For clarity, the APE is not shown.). Hydrogen bonds are indicated by broken lines. Water molecules are labeled w.
Figure 4
Main-chain conformational angles of residues Thr135 to Gly146 in the crystal structures of SARS-CoV Mpro and M+A(−1)pro reported here. The ϕ and the ψ angles are represented by the yellow and the blue bars, respectively. (a) Protomer A of the unbound Mpro (overall positional uncertainty estimated based on maximum likelihood, σ: 0.15 Å). (b) Protomer B of the unbound Mpro (σ: 0.15 Å). (c) Unbound M+A(−1)pro (σ: 0.23 Å). (d) APE-bound M+A(−1)pro (σ: 0.28 Å).
Crystal structures of SARS-CoV Mpro (wild-type and variants) determined in different conditionsFor any structure whose asymmetric unit contains only one protomer, that protomer represents both protomer A and protomer B.These are the structures reported here.PDB accession code 1Q2W; J. B. Bonanno et al., unpublished results.Electron densities in the Fo–Fc omit maps for residues Lys137 to Ser144 of SARS-CoV Mpro and M+A(−1)pro. (a) Protomer A of the unbound Mpro. (b) Protomer B of the unbound Mpro. (c) Unbound M+A(−1)pro. (d) APE-bound M+A(−1)pro. The residues in (c) are in the catalytically incompetent conformation.Main-chain conformational angles of residues Thr135 to Gly146 in the crystal structures of SARS-CoV Mpro and M+A(−1)pro reported here. The ϕ and the ψ angles are represented by the yellow and the blue bars, respectively. (a) Protomer A of the unbound Mpro (overall positional uncertainty estimated based on maximum likelihood, σ: 0.15 Å). (b) Protomer B of the unbound Mpro (σ: 0.15 Å). (c) Unbound M+A(−1)pro (σ: 0.23 Å). (d) APE-bound M+A(−1)pro (σ: 0.28 Å).Active sites and S1 specificity pockets of SARS-CoV Mpro and M+A(−1)pro, viewed from the S2 specificity pockets. (a) Protomer A of the unbound Mpro. (b) Protomer B of the unbound Mpro. (c) Unbound M+A(−1)pro. (d) APE-bound M+A(−1)pro (For clarity, the APE is not shown.). Hydrogen bonds are indicated by broken lines. Water molecules are labeled w.Previous studies suggest strongly that the predominant S1 specificity of SARS-CoV Mpro for Gln is determined primarily by the conserved residue His163.15., 16., [35] In both independent protomers, the orientation of the imidazole ring of His163 is determined by the hydrogen bonding between its Nδ1 atom and the OH group of Tyr161 (3.0 Å) and by its π-stacking with the phenyl ring of Phe140 (the distance between the geometriccenters of the aromatic rings is 3.9 Å). The position to be occupied by the side-chain carbonyl O atom of P1-Gln of the substrate is occupied by a water molecule, whose O atom forms a hydrogen bond with the His163 Nε2 atom (2.9 Å); thiswater molecule is coplanar with the atoms of the imidazole ring of His163. In both protomers, Phe140 and Glu166 interact with Ser1 of the opposite protomer to form the “floor” of the S1 specificity pocket (Figure 3(a) and (b)). In protomer A, the Glu166A Oε1 atom forms a hydrogen bond (2.8 Å) with the His172A Nε2 atom, thereby constituting a “side-wall” of the S1 specificity pocket similar to that in protomer A of previously reported structures and the molecular-dynamic simulation model of Mpro at pH 6.0 (Figure 3(a)).25., 36. However, in protomer B of the structure of Mpro reported here, the side-chain of Glu166B rotates away and the His172B Nε2 atom forms a hydrogen bond (3.0 Å) with the Ser1A Oγ atom of protomer A instead. The Ser1A Oγ atom also forms a hydrogen bond (2.8 Å) with the Gly170Bcarbonyl O atom (Figure 3(b)). The difference in the side-chain conformation of Glu166 in protomers A and B of the structure of Mpro reported here indicates the weakness of the interaction between the side-chains of Glu166 and His172, even though this interaction may acquire some ioniccharacter by the protonation of the His172 Nδ1 atom. Contrary to those in protomer B of both the previously reported structure and the molecular-dynamic simulation model of Mpro at pH 6.0,25., 36. the side-chains of His163B and Glu166B do not interact in protomer B of the structure of Mpro reported here, probably because His163B is protonated only at its Nε2 atom and carries no charge.As in the previously reported structure of Mpro at pH 6.0, protomers A and B in the current structure show good agreement in the rest of the substrate-binding region, including the S2, S4 and S1′ specificity pockets. Superpositions with the crystal structures of Mpro in the other conditions show that the rest of the substrate-binding regions in the structure of Mpro reported here are in the catalytically competent conformation.
Active sites and substrate-binding regions of the unbound SARS-CoV M+A(-1)pro
We have reported the crystal structure of the SARS-CoV M+A(-1)
pro:APE complex in space group P212121, whose asymmetric unit contains both protomers of the M+A(-1)
pro dimer. In that structure, the active sites and the substrate-binding regions of both protomers are in the catalytically competent conformation (Table 2). In both independent protomers, the additional Ala at the N terminus blocks Ser1 and disrupts its interactions with Phe140 and Glu166 of the opposite protomer; however, the floor of the S1 specificity pocket is only partly disrupted. More importantly, the presence of a ten-residue affinity tag at the N terminus of both independent protomers reduces the specific activity of Mpro by less than an order of magnitude. Similar observations are given by the crystal structure of Mpro with additional residues Ser-Leu at the N terminus of both independent protomers (PDB accession code 1Q2W; J. B. Bonanno et al., unpublished results).Attempts to crystallize M+A(-1)
pro in space group P21 have not been successful. Interestingly, M+A(-1)
procrystallizes in space group P43212 as well as in space group P212121 under the same conditions. In space group P43212, each asymmetric unit has one protomer of M+A(-1)
pro. Superpositions of the resulting structure of the unbound M+A(-1)
pro with the crystal structures of Mpro in the other conditions show no difference in the overall fold or the protomer orientation. Crystal contacts of the unbound M+A(-1)
pro in space group P43212 do not involve any residues forming the active sites and the S1 specificity pockets of the peptidase. The structure of the unbound M+A(-1)
pro in space group P43212 shows good agreement in most atomic positions with the structure of the APE-bound M+A(-1)
pro in space group P212121. In the structure of the unbound M+A(-1)
pro reported here, the catalytic dyad has a distance of 3.9 Å between the His41 Nε2 atom and the Cys145 Sγ atom, and the Cys145 Sγ atom is coplanar with the atoms of the His41 imidazole ring. However, the oxyanion hole and the S1 specificity pocket are distorted (Figure 2, Figure 3). The ϕ and ψ angles of residues Lys137 to Ser144 show dramatic differences compared with those of the previously reported structure of the APE-bound M+A(-1)
pro
(Figure 4(c)). The above-average B-factors of these residues in the unbound M+A(-1)
pro indicate their high mobility relative to the rest of the peptidase (Figure 5(c)). The oxyanion hole is not distorted as much; the N atoms of Gly143 and Cys145 are still oriented to donate hydrogen bonds that would stabilize the negatively charged carbonyl O atom of the scissile peptide bond of the substrate, although no water molecule is found at the position to be occupied by the carbonyl O atom (Figure 3(c)).
Figure 5
The B-factors of residues Thr135–Gly146 in the crystal structures of SARS-CoV Mpro and M+A(−1)pro reported here. The main chain and the side-chain B-factors are represented by the red and the green bars, respectively. The average values of the main chain and the side-chain B-factors are indicated by the red and the green lines, respectively. There is no green bar for Gly138, Gly143 or Gly146, because these residues do not have side-chains. (a) Protomer A of the unbound Mpro. (b) Protomer B of the unbound Mpro. (c) Unbound M+A(−1)pro. (d) APE-bound M+A(−1)pro.
The B-factors of residues Thr135–Gly146 in the crystal structures of SARS-CoV Mpro and M+A(−1)pro reported here. The main chain and the side-chain B-factors are represented by the red and the green bars, respectively. The average values of the main chain and the side-chain B-factors are indicated by the red and the green lines, respectively. There is no green bar for Gly138, Gly143 or Gly146, because these residues do not have side-chains. (a) Protomer A of the unbound Mpro. (b) Protomer B of the unbound Mpro. (c) Unbound M+A(−1)pro. (d) APE-bound M+A(−1)pro.The hydrogen bond between the Nδ1 atom of His163 and the OH group of Tyr161 is preserved (3.1 Å); however, the imidazole ring of His163 is no longer π-stacked with the phenyl ring of Phe140; it makes contacts with the side-chain of Leu141 instead (Figure 3(c)). Here the phenyl ring of Phe140 makes contacts with the side-chains of Val114, Tyr126, Ile136 and His172, and the main-chain atoms of Lys137 and Gly138 from the parent protomer, and with the side-chain of Arg4 from the opposite protomer. The Phe140carbonyl O atom also forms a long hydrogen bond with the OH group of Tyr118 (3.4 Å). The electron density maps show two possible conformers of the side-chain of Glu166; the occupancies of both conformers were fixed at 0.5 without refinement. In conformer 1, the Glu166 Oε1 atom forms a hydrogen bond with the His172 Nδ1 atom (2.3 Å), and the Glu166 Oε2 atom forms a hydrogen bond with the His163 Nε2 atom (2.8 Å); whereas in conformer 2, the side-chain of Glu166protrudes into the solvent (Figure 3(c)). Probably because of their high mobility, the additional Ala at the N terminus, Ser1 and Gly2 of the protomer could not be identified in the electron density maps of the unbound M+A(-1)
pro. Interactions similar to those forming the floors of the S1 specificity pockets of Mpro are not observed in the unbound M+A(-1)
pro.The rest of the substrate-binding region in the structure of the unbound M+A(-1)
pro reported here is in the catalytically competent conformation, in good agreement with those in the previously reported structures of the APE-bound M+A(-1)
pro and of Mpro, except for some side-chains whose conformational rearrangements are necessary in order to accommodate APE or the other inhibitors.
Binding of APE to SARS-CoV M+A(-1)pro
Both the P21crystals of SARS-CoV Mpro and the P43212 crystals of SARS-CoV M+A(-1)
pro were soaked in solutions of the APE synthesized for this study in three stereochemical versions: the two diastereomers Cbz-Leu-Phe-AGln-(2S,3S)EP-COOEt and Cbz-Leu-Phe-AGln-(2R,3R)EP-COOEt, and the racemic mixture of these two diastereomers. Outstanding electron density for APE was observed only in the electron density maps of M+A(-1)
pro; it could be fit only by the 2S,3S diastereomer (Figure 6(a)). The latter is consistent with the results of the studies on the inhibition of Mpro by the APE in these versions. These results can be explained with the models of all four of the possible diastereomers of the APE binding to Mpro. Superpositions of the structures of the unbound and the APE-bound M+A(-1)
pro reported here show that the binding of the APE does not grossly affect the overall fold or the protomer orientation of the M+A(-1)
pro dimer. The same observations are found in the superpositions of the previously reported crystal structures of the unbound and the APE-bound Mpro in space group C2. The structure of the APE-bound M+A(-1)
pro in space group P43212 also agrees well in most atomic positions with that of the APE-bound M+A(-1)
pro in space group P212121.
Figure 6
Interactions of the APE (orange) with SARS-CoV M+A(−1)pro (green). (a) Outstanding density in the Fo – Fc map of the M+A(−1)pro:APE complex. (b) View from the ‘floors’ of the S1 specificity pockets of M+A(−1)pro. (c) A diagram of the interactions. Hydrogen bonds are shown as broken lines, with their distances (in Å) given alongside. The residues of M+A(−1)pro in contact with the APE are shown as arcs.
Interactions of the APE (orange) with SARS-CoV M+A(−1)pro (green). (a) Outstanding density in the Fo – Fc map of the M+A(−1)pro:APE complex. (b) View from the ‘floors’ of the S1 specificity pockets of M+A(−1)pro. (c) A diagram of the interactions. Hydrogen bonds are shown as broken lines, with their distances (in Å) given alongside. The residues of M+A(−1)pro in contact with the APE are shown as arcs.The structural consequences of the binding of the APE to M+A(-1)
pro are very similar to those observed in the crystal structures of the unbound and the APE-bound Mpro in space group C2, and the APE-bound M+A(-1)
pro in space group P212121. A covalent bond forms between the Cys145 Sγ atom of M+A(-1)
pro and the epoxideC3 atom of the APE (2.15 Å; Figure 6(b) and (c)). In the refinement of all of the structures of the APE-bound Mpro and M+A(-1)
pro, a restraint was applied to thisC-S bond on the basis that the length of a C-S single bond is normally about 1.8 Å. Interestingly, the opened epoxide moiety of the APE always tended to be tilted away from Cys145 of Mpro or M+A(-1)
pro, thereby lengthening thisC-S bond by 0.2–0.3 Å, even though the opened epoxide moiety of the APE was manually moved back towards Cys145 of Mpro or M+A(-1)
pro regularly during the refinement process. Although such lengthening is not considered significant as the overall positional uncertainties (based on maximum likelihood) of all these structures are in the range of 0.2–0.3 Å, it does suggest the possibility of thisC-S bond being under strain and vulnerable to rupturecaused by a second nucleophilic attack at the epoxideC3 atom of the APE (say, by an activated water molecule), leading to the speculation that the APE could act as a reversible inhibitor of Mpro.The conformation of the opened epoxide moiety and the main-chain conformation of P1-AGln of the APE in the structure of the M+A(-1)
pro:APE complex reported here are essentially the same as those in the previously reported structures of the Mpro:APE and the M+A(-1)
pro:APE complexes (Figure 7(a)–(d)). Unlike the P1 Cα atom of other inhibitors, the P1-AGln Nα atom of the APE is sp2-hybridized and has a trigonal planar geometry; in order to be accommodated by the S1 specificity pocket of the peptidases, the side-chain of P1-AGln has to adopt a different conformation, in particular the equivalent to χ (N-Nα-Cβ-Cγ). This angle is −123.6° in the structure of the M+A(-1)
pro:APE complex reported here; by contrast, in the crystal structures of the complexes of an Mpro variant with a series of peptidomimetic inhibitors, this angle is in the range of −65° to −80°.
Figure 7
Newman projections of the APE. (a) The equivalent to ϕ of P1-AGln, C(P2-Phe)-N(P1-AGln)-Nα(P1-AGln)-C(P1-AGln); (b) the equivalent to ψ of P1-AGln, N(P1-AGln)-Nα(P1-AGln)-C(P1-AGln)-C3(epoxide); (c) the torsion angle O=C(P1-AGln)-C3(epoxide)-Sγ(Cys145); and (d) the torsion angle C(P1-AGln)-C3(epoxide)-C2(epoxide)-C(ethyl ester carbonyl).
Newman projections of the APE. (a) The equivalent to ϕ of P1-AGln, C(P2-Phe)-N(P1-AGln)-Nα(P1-AGln)-C(P1-AGln); (b) the equivalent to ψ of P1-AGln, N(P1-AGln)-Nα(P1-AGln)-C(P1-AGln)-C3(epoxide); (c) the torsion angle O=C(P1-AGln)-C3(epoxide)-Sγ(Cys145); and (d) the torsion angle C(P1-AGln)-C3(epoxide)-C2(epoxide)-C(ethyl estercarbonyl).Interestingly, the binding of the APE induces the recovery of the catalytically competent conformation of the oxyanion holes and the S1 specificity pockets of M+A(-1)
pro (Figure 2, Figure 3). The P1-AGln main-chain carbonyl O atom of the APE is accommodated in the oxyanion hole of M+A(-1)
pro, forming hydrogen bonds with the N atoms of Gly143 (2.8 Å) and Cys145 (3.2 Å) (Figure 6(b) and (c)). The ϕ and ψ angles of residues Lys137 to Ser144 in the structure of the APE-bound M+A(-1)
pro reported here are essentially the same as those in the previously reported structure of the APE-bound M+A(-1)
pro (Figure 4(d)). In contrast to those in the structure of the unbound M+A(-1)
pro, the B-factors of these residues in the structures of the APE-bound M+A(-1)
pro are close to the averages, indicating that the mobility of these residues is reduced upon the binding of the APE (Figure 5(d)).The hydrogen bond between the Nδ1 atom of His163 and the OH group of Tyr161 is preserved (3.3 Å), and the π-stacking of the imidazole ring of His163 with the phenyl ring of Phe140 is recovered (distances between the geometriccenters of the aromatic rings: 3.7 Å) (Figure 3(d)). The His163 Nε2 atom no longer interacts with the Glu166 Oε2 atom, but forms a hydrogen bond (2.7 Å) with the side-chain carbonyl O atom of P1-AGln of the APE instead (Figure 6(b) and (c)). The side-chains of Glu166 and His172 interact with each other (3.0 Å; Figure 3(d)). The additional Ala at the N terminus and Ser1 of the protomer could not be identified in the electron density maps of the M+A(-1)
pro:APE complex. Interestingly, the side-chain amide N atom of P1-AGln of the APE forms hydrogen bonds, though not in ideal geometry, with the Phe140carbonyl O atom (3.4 Å) and the Glu166 Oε1 and Oε2 atom (3.5 Å and 2.9 Å, respectively) (Figure 6(b)). Therefore, without the participation of Ser1 of the opposite protomer, Phe140 and Glu166 are tied together to form parts of the floors of the S1 specificity pockets of M+A(-1)
pro.Similar to the APE in protomer A of the previously reported structure of the APE-bound M+A(-1)
pro, the benzyloxycarbonyl (Cbz) group of the APE in the structure of the APE-bound M+A(-1)
pro reported here squeezes into and slightly widens the S4 specificity pockets of M+A(-1)
pro and, as a result, making contacts with Leu167, Pro168, Gln192 and Ala193 of M+A(-1)
pro. Otherwise, the interactions of the APE with the rest of the substrate-binding region observed in the structure of the APE-bound M+A(-1)
pro reported here are essentially the same as those observed in the previously reported structures of the APE-bound M+A(-1)
pro and Mpro (Figure 6(b) and (c)).
Dynamic equilibrium for the conformation of the active sites and the S1 specificity pockets of SARS-CoV Mpro
In contrast to the previously reported crystal structure of SARS-CoV Mpro in space group P21 at pH 6.0, the crystal structure of Mpro reported here (in the same space group and at the same pH) shows that the active sites and the S1 specificity pockets of both protomers are in the catalytically competent conformation (Table 2). In the determinations of both structures, the complete wild-type sequence (residues 1–306) of Mpro was over-expressed, purified and crystallized. Although different strategies might have been used in the X-ray diffraction data collection and processing, and in the structure solution and refinement, thiscould not result in the dramatic structural differences observed. The structural differences probably arise from the differences in the conditions of the preparation and the crystallization of Mpro. Similarly, the crystal structure of Mpro in space group P21212 at pH 6.5 shows that both protomers are in the catalytically competent conformation, whereas that in the same space group P21212 and at a slightly higher pH (6.6) shows that both protomers of the latter are in the catalytically incompetent conformation. The conformations of the collapsed active sites and S1 specificity pockets observed in the catalytically incompetent protomers show some variability. This variability indicates that the active site and the S1 specificity pocket of each protomer of Mpro do not adopt a single conformation in solution at pH 5.9–6.6, but instead there is an ensemble of conformations. A particular conformation might be favored by a particular set of crystallization conditions.However, the various conformations of the collapsed active sites and S1 specificity pockets share a common feature: the imidazole ring of His163 is not π-stacked with the phenyl ring of Phe140. π-stacking is an example of aromatic interactions. In the structures of the unbound Mpro, an aromatic interaction is observed between these two rings in an offset-stacked (i.e. π-stacking) or an edge-to-face fashion wherever the protomer is in the catalytically competent conformation (Figure 8
). In both orientations, one or two hydrogen atoms (with partial positive charge) on the phenyl ring of Phe140 are positioned near the central region (with partial negative charge) of the imidazole ring of His163. According to the results of previous studies,39., 40. a single aromatic interaction as such is weak (the interaction energy may be in the range of 1–2 kcal/mol only), in contrast to the clusters of aromatic interactions commonly involved in the stabilization of protein structures. Therefore, this interaction is susceptible to disruptions that could be caused by changes in a number of factors; the formation of this interaction can be viewed as a reversible process in dynamic equilibrium:The position of the equilibrium could be determined, in part, by pH. At low pH (near or below the pK
a1 of His163; Figure 9
), the His163 Nδ1 atom in a significant number of the Mproprotomers is protonated, thereby introducing a positive charge on the imidazole ring of His163. This would disfavor its aromatic interaction with the phenyl ring of Phe140, and the position of the equilibrium would shift to the right. This is consistent with the trend exhibited by the structures of the unbound Mpro at various pH values (Table 2). In the pH range of 7.0–9.0, most of the Mproprotomers have the aromatic interaction and are in the catalytically competent conformation (the left-hand side of the equilibrium predominates); whereas in the pH range of 5.9–6.6, some of the Mproprotomers lose the aromatic interaction and are in the catalytically incompetent conformation (the position of the equilibrium shifts to the right).
Figure 8
Aromatic interactions observed between Phe140 and His163 in the crystal structures of SARS-CoV Mpro (wild type and variants). The broken lines indicate the alignment of the partial positive charges (δ+) of the hydrogen atoms on the phenyl ring of Phe140 with the partial negative charges (δ−) in the central part of the imidazole ring of His163.
Figure 9
Protonation/deprotonation states of His163 hydrogen bonded to Tyr161 in SARS-CoV Mpro (wild-type and variants). Ka1 is the dissociation constant for the protonation of the His163 Nδ1 atom; Ka2 is the dissociation constant for the deprotonation of the His163 Nε2 atom.
Aromatic interactions observed between Phe140 and His163 in the crystal structures of SARS-CoV Mpro (wild type and variants). The broken lines indicate the alignment of the partial positive charges (δ+) of the hydrogen atoms on the phenyl ring of Phe140 with the partial negative charges (δ−) in the central part of the imidazole ring of His163.Protonation/deprotonation states of His163hydrogen bonded to Tyr161 in SARS-CoV Mpro (wild-type and variants). Ka1 is the dissociation constant for the protonation of the His163 Nδ1 atom; Ka2 is the dissociation constant for the deprotonation of the His163 Nε2 atom.A second factor in determining the position of the equilibrium could be the integrity of the interactions among Phe140 and Glu166 of the parent protomer, and Ser1 of the opposite protomer. In the structure of M+A(-1)
pro reported here, the additional Ala at the N terminus of the opposite protomer blocks Ser1 and disrupts its normal interactions with Phe140 and Glu166 of the parent protomer. Thisprobably weakens the conformational anchor of Phe140, as indicated by its above-average B-factors (Figure 5(c)), making the aromatic interaction of its phenyl ring with the imidazole ring of His163 vulnerable to disruption. In all of the crystal structures of the unbound Mpro (both the wild-type and the variants) determined so far, wherever Phe140 and Glu166 of the parent protomer, and Ser1 of the opposite protomer do not interact normally, the B-factors of residues Lys137 to Ser144 of the parent protomer are above the average, even if the parent protomer is in the catalytically competent conformation (e.g. the Mpro structure of Mpro at pH 6.5 in the space group P21212); in contrast, wherever the three residues interact normally, the B-factors of residues Lys137 to Ser144 of the parent protomer are close to the average (e.g. protomer A of the Mpro structure reported here; Figure 5(a)). These observations suggest that the interactions among the three residues can immobilize residues 137 to 144. Apparently, both the ionic interaction between the N terminus of Ser1 and the side-chain of Glu166 of opposite protomers, and the amidehydrogen-carbonyl oxygenhydrogen bonds between Ser1 and Phe140 of opposite protomers contribute to the immobilization of residues Lys137 to Ser144. The Mpro structure at pH 7.0 in space group P21212 and protomer B of the Mpro structure reported here (Figure 5(b)) suggest that the absence of either interaction slightly compromises the immobilizing effect.The increased vulnerability of the aromatic interaction to disruption would shift the position of the equilibrium to the right. Thisprobably explains why the active sites and the S1 specificity pockets of both protomers are collapsed in the crystal structure of M+A(-1)
pro reported here. Mpro with its three N-terminal residues truncated, Mpro
Δ(1-3), exists mainly as a dimer in solution but its activity is 24% lower than that of Mpro. Without the interactions among Phe140 and Glu166 of the parent protomer, and Ser1 of the opposite protomer, the position of the equilibrium would shift to the right, resulting in the decreased availability of the active sites and the S1 specificity pockets in the catalytically competent conformation. The binding of a substrate to Mpro
Δ(1-3) may induce the recovery of the catalytically competent conformation of the active sites and the S1 specificity pockets (to be discussed below), at an energeticcost accounted for by the reduced activity of Mpro
Δ(1-3).
Induced-fit binding of the APE to SARS-CoV Mpro
The crystal structures of the unbound and the APE-bound SARS-CoV M+A(-1)
pro reported here show that the binding of the APE to M+A(-1)
pro follows an induced-fit model, not the lock-and-key model. More particularly, the binding of the APE induces the recovery of the catalytically competent conformation of the oxyanion holes and the S1 specificity pockets of both protomers of M+A(-1)
pro. This induced fit is likely relevant to the binding of APE to Mpro, because of the conformational flexibility of the active sites and the S1 specificity pockets of Mpro discussed above. In all of the crystal structures of the inhibitor-bound Mpro (both the wild-type and the variants) determined so far,25., 26., [27], 29., 42. except for protomer B in the complex of Mpro with a chloromethyl ketone (CMK), the active sites and the S1 specificity pockets are in the catalytically competent conformation (Table 2). The induced-fit binding of the APE to M+A(-1)
proprobably is driven by the formation of the covalent bond between the Cys145 Sγ atom of M+A(-1)
pro and the epoxideC3 atom of the APE. It would be energetically costly to break thisC-S bond in order to repel the APE and preserve the conformational flexibility of the active sites and the S1 specificity pockets of M+A(-1)
pro. Therefore, the active sites and the S1 specificity pockets of M+A(-1)
pro have to adopt the catalytically competent conformation in order to accommodate a substrate mimic like APE. This explanation probably applies to the actual catalyticcycle of Mpro as well: the acylation step (i.e. formation of the covalent acyl-enzyme intermediate) may drive the induced-fit binding of a substrate to Mpro; after the deacylation step (i.e. breakdown of the covalent acyl-enzyme intermediate), the removal of the product may be favored by the recovery of the conformational flexibility of the active sites and the S1 specificity pockets of Mpro.As the side-chain of P1-AGln of the APE occupies the S1 specificity pockets of M+A(-1)
pro, the side-chain amide N atom of P1-AGln of the APE donates hydrogen bonds to the Phe140carbonyl O atom and to the Glu166 Oε1 and Oε2 atoms of M+A(-1)
pro, in addition to the hydrogen bond between the side-chain carbonyl O atom of P1-AGln of the APE and the His163 Nε2 atom of M+A(-1)
pro. Similarly, additional hydrogen bonds are observed in the crystal structures of the M+A(-1)
pro:APE complex in space group P212121, the Mpro:APE complex, protomer A of the Mpro:CMK complex, and the complex of Mpro with a peptidomimetic inhibitor. In the crystal structures of the complexes of an Mpro variant with a series of peptidomimetic inhibitors, these hydrogen bonds seem to be water-mediated. Consequently, even without the participation of Ser1 of the opposite protomer of M+A(-1)
pro, Phe140 and Glu166 are tied together by these hydrogen bonds to form parts of the floors of the S1 specificity pockets. These hydrogen bonds also conformationally anchor Phe140, as indicated by its close-to-average B-factors (Figure 5(d)), and make the π-stacking of its phenyl ring with the imidazole ring of His163 resistant to disruption. As a result, the catalytically competent conformation of the active sites and the S1 specificity pockets of M+A(-1)
pro is rigidified; the position of the equilibrium described by equation (1) would shift to the left. Thisprobably serves as an additional explanation for the predominant S1 specificity of Mpro or M+A(-1)
pro for Gln, but not for Glu. Comparison of the structure of the unbound Mpro with that of the APE-bound Mpro, both in space group C2 at pH 6.5, does not show an obvious reduction in the B-factors of residues Lys137 to Ser144 upon the binding of the APE to Mpro, probably because Phe140 and Glu166 of the parent protomer, and Ser1 of the opposite protomer in the unbound Mpro interact normally and immobilize these residues.In protomer B of the Mpro:CMK complex, the binding of the CMK does not induce the recovery of the catalytically competent conformation of the active site and the S1 specificity pocket of Mpro (Table 2). The side-chain of P1-Gln of the CMK cannot be accommodated by the collapsed S1 specificity pocket, so it protrudes into the solvent. Thiscould be attributed to the covalent bond between the Cys145 Sγ atom of Mpro and the methylene C atom of the CMK, whose rotation can orient the side-chain of P1-Gln of the CMK towards the solvent, thereby avoiding the steric hindrance due to the collapsed S1 specificity pocket. In protomer A of the Mpro:CMK complex, although the active site and the S1 specificity pocket of Mpro are in the catalytically competent conformation (Table 2), allowing the side-chain of P1-Gln of the CMK to occupy the S1 specificity pocket of Mpro, the unoccupied space in the rest of the substrate-binding region of Mpro is large enough for the conformational rearrangements in the rest of the CMK, resulting in an unexpected binding mode of the CMK to protomer A of Mpro as well. These unexpected binding modes are not observed in the crystal structures of the complexes of Mpro (both the wild-type and the variants) with the APE, nor with the peptidomimetic inhibitors,26., [27] probably because their P' ester groups sterically hinder the rotation of the C-Scovalent bond formed between the peptidases and the inhibitors. In the case of the APE, the π-conjugation of the Nα atom and the carbonyl group of P1-AGln restricts the orientation of the side-chain of P1-AGln to point towards the S1 specificity pockets of Mpro or M+A(-1)
pro. In the case of the peptidomimetic inhibitors, the C atom attacked by the Cys145 Sγ atom corresponds to the carbonyl C atom of the scissile peptide bond of a substrate. Therefore, the resulting C-S bond pulls the inhibitor towards the back wall of the substrate-binding region. This reinforces the steric hindrance due to the P' ester group of the inhibitor against the rotation of the C-S bond.Thus, a refined explanation could be provided for the pH-dependence of the activity of Mpro
12., 18., 25., 36.: At low pH, a significant number of the Mproprotomers are in the catalytically incompetent conformation. However, these protomers are conformationally flexible; the recovery of their catalytically competent conformation can be induced by the binding of a substrate. The activity of Mpro is still compromised, because of the energeticcost associated with this induced-fit binding. It would be interesting to determine this energeticcost quantitatively, and compare it with the experimentally determined decrease in the activity of Mpro. Before carrying out such studies, the possibility of a second mechanism, such as the change in the protonation/deprotonation state of the catalytic dyad with pH, being involved in the pH-dependence of the activity of Mpro should not be ruled out.
Domain III of SARS-CoV Mpro
The compositions of the dimer interfaces observed in the structures reported here are essentially identical with those observed in all of the previously reported structures. Extensive studies have been carried out on the role of domain III in the dimerization of Mpro. With domain III truncated, the protomer of Mpro does not dimerize over a wide range of concentrations,10., [14] whereas domain III alone forms a stable dimer even at very low concentrations. Truncation of the final C-terminal helix (residues 293–306) strongly disfavors the dimerization of Mpro;[14], 43. analysis of the dimer interfaces observed in the structures of Mpro found that most of the residues of domain III involved in dimer interactions are parts of the final C-terminal helix. Superpositions of the previously reported structures of Mpro in space group P21 at pH 6.0, 7.6 and 8.0, show that domain III of both independent protomers rotates in a rigid-body manner as the pH increases from 6.0 to 8.0, whereas the remainder of the Mpro dimer remains essentially in the same atomic positions (Figure 10
). Domain III of protomer B rotates through a slightly greater angle than does domain III of protomer A. This rotation was regarded as a consequence of the recovery of the catalytically competent conformation of the active sites and the S1 specificity pockets of Mpro as the pH increased from 6.0 to 8.0. However, the structure of Mpro in space group P21 at pH 6.0 reported here, having the active sites and the S1 specificity pockets of both protomers in the catalytically competent conformation, shows good agreement in the atomic positions of domain III of both independent protomers with the previously reported structure of Mpro in space group P21 at pH 6.0, but not with those at pH 7.6 or pH 8.0. The interactions between the domain-III residues of opposite protomers, and the interactions of the residues of domain III with those of domains I and II of the same protomer have been analyzed, but the mechanics of this rotation is still unclear. The structures of Mpro in the other space groups have not been studied over such a wide range of pH. It would be interesting to explore whether this rotation is an artifact associated with crystal packing in space group P21, or whether similar rotations occur in the structures of Mpro in the other space groups as well, indicating some biochemical significance of this rotation. For example, the rotation may represent a mechanism underlying the pH-dependence of the dimerization behaviors of Mpro.
Figure 10
Cα traces showing the orientations of domain III of the previously reported structures of SARS-CoV Mpro in space group P21 at pH 6.0 (red), 7.6 (yellow) and 8.0 (blue), and the structure of Mpro in space group P21 at pH 6.0 reported here (green). Domain III of both independent protomers rotates as pH increases from 6.0 to 8.0.
Cα traces showing the orientations of domain III of the previously reported structures of SARS-CoV Mpro in space group P21 at pH 6.0 (red), 7.6 (yellow) and 8.0 (blue), and the structure of Mpro in space group P21 at pH 6.0 reported here (green). Domain III of both independent protomers rotates as pH increases from 6.0 to 8.0.
Materials and Methods
Preparation of SARS-CoV Mpro, M+A(-1)pro and APE
SARS-CoV M+A(-1)
pro was cloned, over-expressed and purified as described. A clone expressing Mpro was generated using oligonucleotide-directed evolution to delete the codon corresponding to the N-terminal Ala of M+A(-1)
pro. Using thisclone, Mpro was over-expressed and purified essentially as described for M+A(-1)
pro. Cbz-Leu-Phe-AGln-(S,S)EP-COOEt was synthesized using the methods established to synthesize other APEs30., 32., 33. with minor modifications.
Crystallization, crystal soaking and cryo-protection
Before crystallization, both SARS-CoV Mpro and M+A(-1)
pro were dialyzed against 20 mM Tris–HCl (pH 7.5), 20 mM NaCl and concentrated to 10 mg/ml. All crystals were grown at ambient temperature by the hanging-drop, vapor-diffusion method. For the P21crystals, the reservoir solution contained 0.1 M Mes (pH 6.0), 2% (w/v) polyethylene glycol 20,000, 3% (v/v) dimethyl sulfoxide, and 1 mM dithiothreitol. The final crystallization drop contained equal amounts of the Mpro solution and the reservoir solution. Crystals of a thick-plate habit grew in three to five days to a size of about 0.3 mm × 0.3 mm × 0.1 mm. For the P43212 crystals, the growth conditions were the same as those for the P212121 crystals; the reservoir solution contained 0.1 M Mes (pH 6.5), 50 mM ammonium acetate, 6% (w/v) polyethylene glycol 8000, 3% (v/v) ethylene glycol, and 1 mM dithiothreitol. The drop contained equal amounts of the M+A(-1)
pro solution and the reservoir solution. Bipyrimidal crystals grew in three to five days to a size of about 0.1 mm × 0.1 mm × 0.1 mm. Macroseeding was carried out to improve the quality of the crystals. Crystals of good quality were selected and soaked overnight in drops having the same compositions as their reservoir solutions plus the APE chosen for this study present at 3 mM. Cryo-protectants had essentially the same compositions as reservoir solutions, except for the inclusion of 25% (v/v) ethylene glycol and the exclusion of dimethyl sulfoxide and dithiothreitol. Crystals were soaked for about 10 s and then immediately shock-cooled in liquid nitrogen for storage and shipment to the synchrotron beamline.
Data collection and processing, structure solution, refinement and analysis
The X-ray diffraction data from all crystals were collected at the synchrotron Beamline 8.3.1 (equipped with an ADSC-Q210 CCD detector) at the Advanced Light Source in the Lawrence Berkeley National Laboratory. All data sets were indexed, scaled and merged using DENZO and SCALEPACK. Structure solution and refinement were carried out in CCP4.45., 46. All structures were solved by the molecular replacement method using MOLREP, with the structure of unbound SARS-CoV Mpro in space group C2 (PDB accession code 2A5A) being used as the search model. In the structure of the M+A(-1)
pro:APE complex, APE was located as outstanding electron densities in the substrate-binding region of M+A(-1)
pro in both the |F
o|–|F
c|,αc (contoured at 3σ and 4σ) and the 2|F
o|–|F
c|,αc (contoured at 1σ) maps. A restraint was applied to the covalent bond between the Cys145 Sγ atom of M+A(-1)
pro and the epoxideC3 atom of the APE, on the basis that the length of a C-S single bond is normally about 1.8 Å. All structures were then iteratively refined using REFMAC, and manually adjusted when needed using XtalView/Xfit. The stereochemical qualities of the final structures were assessed using PROCHECK. Graphical representations of the structures were prepared using PyMOL‡. Superimpositions of the various structures were carried out using ALIGN,51., 52. based on the main-chain atoms (amide N, Cα, and carbonyl C and O). The surface areas of the structures were calculated using NACCESS§
Dimer interactions and M+A(-1)
pro:APE interactions were analyzed using DIMPLOT' and LIGPLOT, respectively.
Protein Data Bank accession codes
The atomiccoordinates and the structure factors of all the structures have been deposited in the RCSB Protein Data Bank. The accession code is 2GT7 for the structure of the unbound SARS-CoV Mpro, 2GT8 for the structure of the unbound M+A(-1)
pro, and 2GTB for the structure of the M+A(-1)
pro:APE complex.
Authors: Volker Thiel; Konstantin A Ivanov; Ákos Putics; Tobias Hertzig; Barbara Schelle; Sonja Bayer; Benedikt Weißbrich; Eric J Snijder; Holger Rabenau; Hans Wilhelm Doerr; Alexander E Gorbalenya; John Ziebuhr Journal: J Gen Virol Date: 2003-09 Impact factor: 3.891
Authors: Kanchan Anand; Gottfried J Palm; Jeroen R Mesters; Stuart G Siddell; John Ziebuhr; Rolf Hilgenfeld Journal: EMBO J Date: 2002-07-01 Impact factor: 11.598
Authors: Mark Turlington; Aspen Chun; Sakshi Tomar; Aimee Eggler; Valerie Grum-Tokars; Jon Jacobs; J Scott Daniels; Eric Dawson; Adrian Saldanha; Peter Chase; Yahira M Baez-Santos; Craig W Lindsley; Peter Hodder; Andrew D Mesecar; Shaun R Stauffer Journal: Bioorg Med Chem Lett Date: 2013-09-07 Impact factor: 2.823