Yafang Chen1, Sergey N Savinov2, Anna M Mielech3, Thu Cao1, Susan C Baker3, Andrew D Mesecar4. 1. From the Department of Biological Sciences. 2. the Center for Cancer Research, and. 3. the Department of Microbiology and Immunology, Stritch School of Medicine, Loyola University of Chicago, Maywood, Illinois 60153. 4. From the Department of Biological Sciences, the Center for Cancer Research, and the Department of Biochemistry, Purdue University, West Lafayette, Indiana 47907 and amesecar@purdue.edu.
Abstract
Murine hepatitis virus (MHV) has long served as a model system for the study of coronaviruses. Non-structural protein 3 (nsp3) is the largest nsp in the coronavirus genome, and it contains multiple functional domains that are required for coronavirus replication. Despite the numerous functional studies on MHV and its nsp3 domain, the structure of only one domain in nsp3, the small ubiquitin-like domain 1 (Ubl1), has been determined. We report here the x-ray structure of three tandemly linked domains of MHV nsp3, including the papain-like protease 2 (PLP2) catalytic domain, the ubiquitin-like domain 2 (Ubl2), and a third domain that we call the DPUP (domain preceding Ubl2 and PLP2) domain. DPUP has close structural similarity to the severe acute respiratory syndrome coronavirus unique domain C (SUD-C), suggesting that this domain may not be unique to the severe acute respiratory syndrome coronavirus. The PLP2 catalytic domain was found to have both deubiquitinating and deISGylating isopeptidase activities in addition to proteolytic activity. A computationally derived model of MHV PLP2 bound to ubiquitin was generated, and the potential interactions between ubiquitin and PLP2 were probed by site-directed mutagenesis. These studies extend substantially our structural knowledge of MHV nsp3, providing a platform for further investigation of the role of nsp3 domains in MHV viral replication.
Murine hepatitis virus (MHV) has long served as a model system for the study of coronaviruses. Non-structural protein 3 (nsp3) is the largest nsp in the coronavirus genome, and it contains multiple functional domains that are required for coronavirus replication. Despite the numerous functional studies on MHV and its nsp3 domain, the structure of only one domain in nsp3, the small ubiquitin-like domain 1 (Ubl1), has been determined. We report here the x-ray structure of three tandemly linked domains of MHV nsp3, including the papain-like protease 2 (PLP2) catalytic domain, the ubiquitin-like domain 2 (Ubl2), and a third domain that we call the DPUP (domain preceding Ubl2 and PLP2) domain. DPUP has close structural similarity to the severe acute respiratory syndrome coronavirus unique domain C (SUD-C), suggesting that this domain may not be unique to the severe acute respiratory syndrome coronavirus. The PLP2 catalytic domain was found to have both deubiquitinating and deISGylating isopeptidase activities in addition to proteolytic activity. A computationally derived model of MHV PLP2 bound to ubiquitin was generated, and the potential interactions between ubiquitin and PLP2 were probed by site-directed mutagenesis. These studies extend substantially our structural knowledge of MHV nsp3, providing a platform for further investigation of the role of nsp3 domains in MHV viral replication.
Coronaviruses (CoVs) are enveloped positive-sense RNA viruses, with the largest genomes (25.5–32 kb) among known RNA viruses. CoVs can infect a variety of species, including animals and humans, causing respiratory and gastrointestinal illnesses (1). Physiological consequences of humaninfection by CoVs are usually mild, except in immunocompromised individuals, where the outcome can be much more severe and even lead to death (1). However, emerging CoVs are capable of being extremely pathogenic. The most serious epidemic caused by an emerging human coronaviruses (HCoVs) occurred in 2002–2003, when the global outbreak of severe acute respiratory syndrome (SARS), caused by SARS-CoV, resulted in a case fatality rate of ∼10% (2). Although a decade has passed since the outbreak of SARS, no drugs or vaccines are approved for the treatment of coronaviral infections. The emergence of a novel coronavirus in 2012, the Middle East respiratory syndrome coronavirus (MERS-CoV), which has a case fatality rate of ∼38% (3), highlights the need for the discovery of antiviral drugs and/or vaccines against infections caused by CoVs.Studies of MERS-CoV suggest that, similar to SARS-CoV, the new coronaviral species originates from an animal reservoir (4–6). The zoonotic nature of HCoVs makes future reemergence of coronaviral global epidemics, such as SARS and MERS, a potential threat to human health. In addition, porcine epidemic diarrhea virus is a coronavirus that has devastated the porcine industry worldwide. The outbreak of porcine epidemic diarrhea virus in the United States in 2013/2014 killed over 7 million pigs and resulted in significant economic loss (7). Therefore, a better understanding of the replication and pathogenesis of CoVs is urgently needed to develop effective treatments against coronaviral infections.Murine hepatitis virus (MHV) is a member in the β-coronavirus genogroup 2a, which has long served as a model system for the study of CoVs. The first two-thirds of the giant coronavirus genome contains two overlapping open reading frames, ORF1a and ORF1b. They encode two viral replicase polyproteins, pp1a from ORF1a and pp1ab that results from a −1 frameshift between ORF1a and ORF1b (Fig. 1A). The polyproteins are then processed by viral papain-like proteases (PLPs) and a 3C-like protease (3CLpro or nsp5) into 16 non-structural proteins (nsps) that are involved in the assembly of double-membrane vehicles, essential for viral RNA replication (8–11).
FIGURE 1.
Domain organization of the MHV and SARS-CoV coronavirus polyprotein and nsp3.
A, domain organization of the MHV polyprotein. The locations of the different nsps resulting from the processing of polyproteins are numbered 1–16. The two papain-like protease domains of MHV within nsp3, PLP1 and PLP2, are represented in orange and red, respectively, whereas the 3C-like protease (3CLpro) within nsp5 is represented in blue. The corresponding cleavage sites for each protease (PLP1, PLP2, and 3CLpro) are indicated by arrows and are colored according to the protease that performs the cleavage. B, domain architecture of SARS-CoV nsp3 in comparison with MHV nsp3. Domains with known structures are shown as ribbons, whereas domains with unknown structures are represented as boxes. For SARS-CoV, only one PLpro is encoded in nsp3, and it catalyzes cleavage of the polyprotein at all three cleavage sites. The structures of the three MHV nsp3 domains determined in this study are underscored with dashed lines. PDB codes are as follows: SARS-CoV nsp3, 2IDY, 2ACF, 2W2G, 2KQW, 2FE8, and 2K87; MHV nsp3, 2M0A and 4YPT (from this study). C, a summary of the currently known structures of various nsp3 domains within CoVs studied to date (13–18, 22, 30, 53, 54, 63, 64). The currently known nsp3 domains are indicated at the top and are shaded in different shades of red. The PDB codes for the structures of individual or single domains are listed under the domain acronym. The PDB codes for the structures of domains that have been determined from a single polypeptide chain are connected with dashed lines and are shaded according to the color code of the corresponding domains.
Domain organization of the MHV and SARS-CoVcoronavirus polyprotein and nsp3.
A, domain organization of the MHV polyprotein. The locations of the different nsps resulting from the processing of polyproteins are numbered 1–16. The two papain-like protease domains of MHV within nsp3, PLP1 and PLP2, are represented in orange and red, respectively, whereas the 3C-like protease (3CLpro) within nsp5 is represented in blue. The corresponding cleavage sites for each protease (PLP1, PLP2, and 3CLpro) are indicated by arrows and are colored according to the protease that performs the cleavage. B, domain architecture of SARS-CoV nsp3 in comparison with MHV nsp3. Domains with known structures are shown as ribbons, whereas domains with unknown structures are represented as boxes. For SARS-CoV, only one PLpro is encoded in nsp3, and it catalyzes cleavage of the polyprotein at all three cleavage sites. The structures of the three MHV nsp3 domains determined in this study are underscored with dashed lines. PDB codes are as follows: SARS-CoV nsp3, 2IDY, 2ACF, 2W2G, 2KQW, 2FE8, and 2K87; MHV nsp3, 2M0A and 4YPT (from this study). C, a summary of the currently known structures of various nsp3 domains within CoVs studied to date (13–18, 22, 30, 53, 54, 63, 64). The currently known nsp3 domains are indicated at the top and are shaded in different shades of red. The PDB codes for the structures of individual or single domains are listed under the domain acronym. The PDB codes for the structures of domains that have been determined from a single polypeptide chain are connected with dashed lines and are shaded according to the color code of the corresponding domains.Among the 16 nsps, nsp3 is the largest replicase subunit, and it contains several domains of functional and structural importance for virus replication (12). Nsp3 of SARS-CoV has been well studied, with two-thirds of nsp3 having been characterized structurally by either x-ray crystallography or NMR (Fig. 1, B and C) (13–18). In contrast, there is a paucity of structural information of MHV nsp3. Extensive functional studies of domains within MHV nsp3 have been conducted, but little structural information on these domains is currently available (Fig. 1, B and C) (19–23). In fact, with the exception of SARS-CoV, the structural information of most CoVs′ nsp3 is still missing (Fig. 1C). Interestingly, the SARS-unique domain (SUD), which consists of three domains, the N-terminal domain (SUD-N), the middle domain (SUD-M), and the C-terminal domain (SUD-C), has been identified within SARS-CoV nsp3 (16). It was originally proposed that the SUD domain is unique to only SARS-CoV (24), and subsequently, it was predicted to exist only in SARS and bat CoVs within the β-coronavirus genogroups 2b and 2c (12). However, in this study, we present structural evidence that this is not the case because the SUD-C appears to also exist in MHV.The PLP domains within nsp3 are responsible for proteolytic processing of the N-terminal polyprotein (Fig. 1A). In addition to the proteolytic activity, several coronaviral PLPs have been shown to possess deubiquitinating (DUB) activity (25–31, 65). Moreover, SARS-CoVpapain-like protease (PLpro), MERS-CoV PLpro, and HCoV-NL63 PLP2 have also been shown to have deISGylating activity (25, 29, 32, 33), which removes ISG15 (interferon-stimulated gene 15, a ubiquitin-like molecule) from proteins, possibly counteracting the antiviral state triggered by ISG15 conjugation (34). Because ubiquitination and ISGylation are known to play important roles in the regulation of numerous cellular pathways, including innate immune responses, it has been proposed that the CoVs may employ the DUB and deISGylating activities of PLPs to antagonize host innate immune responses to infection. Indeed, both SARS-CoV PLpro and HCoV-NL63 PLP2 have been shown to interfere with the IRF3 and NF-κB signaling pathways for interferon (IFN) production (33, 35). We also recently demonstrated that infection of α/β interferon receptor knock-out (IFNAR(−/−)) mice with a chimeric Sindbis virus harboring SARS-CoV PLpro and ISG15 expression genes suppresses ISG15-mediated protection during viral infection (36). Most recently, it was discovered that MERS-CoV PLpro displays IFN antagonism activity (25, 29, 37). MHV PLP2 has also been shown to inhibit cellular type I interferon production by deubiquitinating TBK1 (TANK-binding kinase-1) (38, 39). However, the ability of purified MHV PLP2 enzyme to remove ubiquitin and ISG15 from substrates has not yet been demonstrated (see below).Here we disclose the first x-ray crystal structure of the MHV PLP2 catalytic domain and two of its adjacent domains within nsp3. For clarity, this construct of PLP2 will be referred to as PLP2+ throughout this work. Kinetic characterization of recombinant MHVPLP2+ reveals that PLP2+ possesses both deubiquitinating and deISGylating activities. We present a computationally derived model of PLP2 in complex with ubiquitin aldehyde (Ubal), based on our recently reported structure of SARS-CoV PLpro in complex with Ubal (40) to gain insights into the molecular interactions between PLP2 and Ub, and these interactions are probed via mutagenesis and cell culture studies.
Experimental Procedures
Constructs of PLP2+ for Expression
The construct (pEV-L8-PLP2+, polyprotein aa 1525–1911 with the accession number P0C6V0 cloned into pEV-L8 vector) used for the expression of PLP2+ in Escherichia coli cells was the same as described previously (23). Constructs for PLP2+ mutants (D1772R, D1807R, Y1824F, Y1824A, and F1812A) were generated by site-directed mutagenesis following the QuikChange protocol (Stratagene) with minor temperature cycling modifications to maximize the yield of the PCR products. The introduced mutations were then verified by DNA sequencing of the entire insert.
Expression and Purification of PLP2+
The expression and purification of PLP2+ followed the procedure described previously (23). Purified PLP2+ was concentrated to 12 mg/ml for protein crystallization or flash-frozen in the same buffer with the addition of glycerol to a final concentration of 2% using liquid nitrogen. The final frozen enzyme was stored at −80 °C.To express selenomethionine (SeMet)-substituted PLP2+, pEV-L8-PLP2+ was transformed into E. coli strain B834 cells. Two liters of cells were grown in M9 medium (supplemented with thiamine, trace metal mix, l-SeMet, and all amino acids except unlabeled methionine) at 37 °C until the A600 reached 0.6. At this point, protein expression was induced by the addition of isopropyl 1-thio-β-d-galactopyranoside to a final concentration of 1 mm, and the cells were grown overnight at 18 °C with constant shaking. The procedure for purification of SeMetPLP2+ was the same as described previously for wild-type, unlabeled PLP2+ (23).Expression of PLP2+ mutants followed the procedure for WT PLP2+, except the induction with 0.1 mm isopropyl 1-thio-β-d-galactopyranoside was performed at 18 °C for overnight instead of at 25 °C for 6 h. The PLP2+ mutants along with WT PLP2+ as a control were purified through one-step immobilized metal affinity chromatography using separate 20-ml gravity columns (Bio-Rad) packed with 2 ml of nickel-nitrilotriacetic acid resin (Thermo Scientific) for each protein. Briefly, the cell pellet from 500 ml of culture (the weight of cell pellet varies between mutants, ranging from 1.2 to 2 g) was resuspended in 40 ml of Buffer A (25 mm Tris, pH 7.0, 500 mm NaCl, 20 mm imidazole, 5 mm β-mercaptoethanol) supplemented with lysozyme and DNase I. Cells were then lysed by sonication and clarified with centrifugation following the procedures described previously (23). Clarified lysate was next loaded onto the column pre-equilibrated with Buffer A. Unbound proteins were washed with 20 column volumes of Buffer A, and bound proteins were then eluted with 7 ml of 100% Buffer B (25 mm Tris, pH 7.0, 500 mm NaCl, 500 mm imidazole, 5 mm β-mercaptoethanol). Eluted samples were then dialyzed in Buffer C (50 mm HEPES, pH 7.0, 100 mm NaCl, 5 mm β-mercaptoethanol) at 4 °C overnight. The resulting proteins were then used for further activity characterization.Throughout the protein purification, the specific activity of PLP2+ was determined using a 50 μm concentration of the Z-RLRGG-AMC substrate following the procedure described below. The concentration of PLP2+ during the purification was measured through the cuvette-based Bio-Rad Bradford protein assay.
Steady-state Kinetic Studies of PLP2+
The substrate Z-RLRGG-AMC, where Z is the carboxybenzyl protecting group and RLRGG is the C-terminal peptide of Ub and ISG15, was purchased from BaChem, whereas Ub-AMC and ISG15-AMC were purchased from Boston Biochem. In each of these substrates, AMC represents 7-amino-4-methylcoumarin. Ub-AMC and ISG15-AMC assays were performed at 25 °C in triplicate in a final volume of 50 μl, whereas the Z-RLRGG-AMC assay was performed at 25 °C in triplicate in a final volume of 100 μl. Assay buffer contained 50 mm HEPES, pH 7.0, 0.1 mg/ml bovine serum albumin (BSA), and 2 mm DTT. Assays were initiated by the addition of enzymes. The final enzyme concentrations were as follows for each assay: 3 μm for the Z-RLRGG-AMC assay, 5 nm for the Ub-AMC assay, and 10 nm for the ISG15-AMC assay. Rates of hydrolysis of the substrates by PLP2+ were monitored by recording the increase in fluorescence intensity of the released AMC group (excitation λ, 340 ± 8 nm; emission λ, 460 ± 8 nm) as a function of time using an EnVision multimode plate reader from PerkinElmer Life Sciences. The initial slope of the reaction in units of fluorescence per unit time (arbitrary fluorescence units/min) was converted to the amount of product produced per unit time (μm/min) using the extinction coefficient of the product (arbitrary fluorescence units/μm) determined following the procedure as described in (25). For the Ub-AMC substrate, the initial velocities were plotted against Ub-AMC concentrations, and the data were fit to the Michaelis-Menten equation, v/[E] = kcat[S]/(K + [S]), where v is the initial velocity (μm/min), [E] is the enzyme concentration (μm), kcat is the turnover number in reciprocal minutes (min−1) and represents the maximum number of substrate molecules converted to products/enzyme molecule/min, [S] is the substrate concentration, and K is the substrate concentration (μm) at which the reaction rate is half-maximum. For the substrates ISG15-AMC and Z-RLRGG-AMC, where no saturation of the enzyme was observed, plots of initial velocities versus substrate concentrations were fit to the equation, v/[E] = kapp[S], where kapp represents the apparent rate constant (kcat/K) in units of μm−1 min−1. When [S] ≪ K, kapp approximates kcat/K.
Activity Characterization of PLP2+ Mutants
The kinetic activity of each PLP2+ mutant was measured in the presence of 50 μm Z-RLRGG-AMC or 500 nm Ub-AMC following the procedures described above. The initial rates of each reaction were then determined and compared directly to the value for WT PLP2+ at the same substrate concentration.
Ubiquitin Chain Cleavage Assay
A diubiquitin (di-Ub) panel including all eight ubiquitin linkage types (Lys6, Lys11, Lys27, Lys29, Lys33, Lys48, Lys63, and linear), Lys48-linked tetraubiquitin, and Lys63-linked hexaubiquitin were purchased from Boston Biochem. Ubiquitin chain-cleaving reactions were performed by incubating a 2 μm concentration of each ubiquitin chain listed above with a 20 nm concentration of recombinant PLP2+ in buffer (50 mm HEPES, pH 7.0, 100 mm NaCl, 0.1 mg/ml BSA, and 2 mm DTT) at 25 °C for various times ranging from 5 min to 2 h. At the end of each time period, except for the panel of di-Ub substrates, where only one time point was taken at 2 h, the reactions were quenched by the addition of NuPAGE LDS sample buffer (Invitrogen). As a control, the same amount of ubiquitin chains was incubated at 25 °C without any enzyme. Samples were then analyzed by SDS-PAGE using NuPAGE Novex 4–12% BisTris mini gels (Invitrogen).
Protein Crystallization
Initial crystallization conditions for MHVPLP2+ were first identified after performing a high-throughput crystallization screen using a series of commercial crystallization screens from Qiagen. Briefly, 1 μl of purified MHVPLP2+ at three concentrations (6, 12, and 24 mg/ml) was pipetted into the three subwells of a 96-3 well sitting drop plate (Greiner CrystalQuick crystallization plate), and 1 μl of reservoir solution was then added to the drop. The plates were then incubated at 20 °C. An initial crystallization hit was identified after 2 days of growth in a solution containing 100 mm HEPES, pH 7.5, 2% PEG 400, and 2 m (NH4)2SO4. Initial crystals were small, with the longest dimension only 0.05 mm. Further optimization of the crystallization conditions was then carried out manually using 24-well sitting drop plates at 20 °C using a fine grid of 100 mm HEPES between pH 6.8 and pH 8.3 and PLP2+ microseeds of serial dilutions between 104 and 1010. Microseeds of PLP2+ were prepared by adding a crystallization drop (from a plate set up previously without the addition of microseeds) containing an abundance of small PLP2+ crystals to 20 μl of the corresponding reservoir solution followed by serial dilutions in the same reservoir solution. To set up the crystallization plate, 2 μl of purified PLP2+ at 6 mg/ml was pipetted into the subwell of the plate. To the subwell, 0.5 μl of reservoir solution and 0.5 μl of fresh prepared PLP2+ microseeds were then added. Diffraction quality PLP2+ crystals grew within 2 days with dimensions of ∼0.09 by 0.12 mm. The optimal crystallization condition consisted of 100 mm HEPES, pH 8.0, 4% PEG 400, 2 m (NH4)2SO4, and 5 mm DTT. SeMetPLP2+ was crystallized in the same way as unlabeled PLP2+.
Data Collection and Structure Determination
Crystals of MHVPLP2+ were harvested using nylon loops and then placed into reservoir solution supplemented with 20% glycerol for seconds before they were mounted to nylon loops by flash-cooling in liquid nitrogen. Crystals were then stored in a Dewar under liquid nitrogen until data collection. For data collection, cryo-cooled crystals were mounted under a stream of dry N2 at 100 K. One native data set and one SeMet peak data set were collected at the National Institutes of General Medical Sciences and Cancer 23-ID-B and Life Science-Collaborative Access Team 21-ID-D beamlines, respectively, at the Advanced Photon Source Synchrotron, Argonne National Laboratory. Both data sets were processed and scaled using HKL2000 (41). The native data set and the SeMet peak data set were processed and scaled to 2.60 and 2.80 Å, respectively. Initial phases were determined through the single-wavelength anomalous dispersion method using the SeMet peak data set. An initial model of PLP2+ was generated using the AutoSol module in Phenix (42). Phases were then further extended using the 2.60 Å native data set. However, the initial model resulting from AutoSol was incomplete because the first 90 residues at the N terminus were not built by the program. Through the iteration of manual model building in Coot (43), structure refinement with Phenix Refine (44), and automatic model building in ARP/wARP (45), the majority of the missing residues were built. The improved model was then further refined using Phenix Refine. Data collection and refinement statistics are shown in Table 2. Figures were generated using PyMOL (66).
TABLE 2
Data collection and refinement statistics
Values in parentheses are for the last (highest resolution) shell.
SeMet
Native
Data collection
Beamline
21ID-D
23ID-B
Wavelength (Å)
0.98
1.03
Space group
I213
I213
Unit cell dimensions
a, b, c (Å)
a = b = c = 154.9
a = b = c = 155.6
α, β, γ (degrees)
α = β = γ = 90
α = β = γ = 90
Resolution (Å)
50.00–2.80 (2.85–2.80)
50.00–2.60 (2.64–2.60)
No. of reflections observed
472,931
537,164
No. of unique reflections
15,419
19,350
Rmerge (%)a
7.9 (77.8)
5.7 (74.9)
Rpim (%)b
4.6 (27.4)
1.9 (24.6)
I/σI
21.9 (2.4)
38.2 (4.5)
Completeness (%)
98.8 (100.0)
99.7 (100.0)
Redundancy
5.8 (5.8)
10.2 (10.2)
Phasing
Resolution (Å)
49.00–3.10 (3.21–3.10)
Figure of merit
0.43
Sites
5 Se
Refinement
Resolution range (Å)
49–2.60 (2.74–2.60)
No. of reflections in working set
18,388 (2612)
No. of reflections in test set
998 (136)
Rwork (%)c
17.1 (21.0)
Rfree (%)d
22.2 (28.5)
Wilson B factor (Å2)
30.9
Average B factor (Å2)
54.5
Root mean square deviation from ideal geometry
Bond length (Å)
0.008
Bond angle (degrees)
1.138
Ramachandran plot
Most favored (%)
95.0
Allowed (%)
4.2
Disallowed (%)
0.8
merge = ΣΣ|I(hkl) − 〈I(hkl)〉|/ΣΣ(hkl), where I(hkl) is the intensity of a given reflection, and 〈I(hkl)〉 is the mean intensity of symmetry-related reflections.
pim = Σ√(1/n − 1) Σ|I(hkl) − 〈I(hkl)〉|/ΣΣ(hkl), where n is the multiplicity. Rpim is multiplicity-weighted Rmerge.
work = Σ‖Fobs| − Fcalc‖/Σ|Fobs|, where Fobs and Fcalc are the observed and calculated structure factors, respectively.
free was calculated using 5% of the data set chosen at random that was excluded from the refinement.
Generation of a Structure Model for MHV PLP2 Bound with Ubiquitin-Aldehyde
The x-ray crystal structure of SARS-CoV PLpro in complex with ubiquitin-aldehyde (Ubal) (Protein Data Bank (PDB) code 4MM3) was used as a template to generate a model of MHV PLP2-Ub complex (40). Because the Ub structure could not be embedded directly into the ubiquitin binding site of the unliganded structure of MHV PLP2 without severe clashes with the fingertip region (the region in the fingers domain where the zinc atom coordinates to the four cysteine residues), we subjected the crystal structure determined in this study to a 1.2-ns isothermal (300 K) molecular dynamics simulation with explicit water molecules and NaCl ions (150 mm) (Desmond Molecular Dynamics System, Schrödinger, LLC). Remarkably, the trajectory of the simulation indicated an immediate opening of the “collapsed” fingertip region of MHV PLP2 beyond the positions observed in various liganded and unliganded SARS-CoV PLpro structures, as judged by the distance between the zinc atom and Cα carbon of the active site catalytic cysteine. Subsequently, the zinc ion and the associated fingertip oscillated between open and collapsed states with decreasing amplitude. Next, we identified a conformational state that is the most similar to the one found in the SARS-CoV PLpro-Ubal complex and used it to generate a structural model of a MHV PLP2-Ubal complex. The preliminary model was refined further by a secondary molecular dynamics simulation of a non-covalent complex to allow for extra flexibility at the C terminus of Ub, which produced a number of structurally similar conformational states. A state with optimal interaction energetics, characterized by a comparatively high number of hydrogen-bonding, hydrophobic, and favorable electrostatic contacts was used as the complex model thereafter. The persistence of the interactions between PLP2 and Ubal during the simulation was calculated. For comparison, the structure of the SARS-CoV PLpro-Ubal complex was also subject to a 1.2-ns molecular dynamics simulation.
Luciferase Reporter Assay
The luciferase assay followed the procedure described previously (25). HEK293T cells were transfected using 50 ng of Renilla luciferase, 100 ng of IFN-β-luc, and the indicated PLP expression plasmids described previously (23). 50 ng of N-RIG-I/well was transfected for simulation.
Results
Substrate Specificity of MHV PLP2+
The MHVPLP2+ catalytic core (pp1ab residues 1668–1911) with a flanking N-terminal domain (pp1a residues 1525–1607) and the Ubl2 domains (pp1ab residues 1608–1667) was expressed and purified from E. coli. The final purified protein is a 389-amino acid protein (43.5 kDa), with two additional residues at the N terminus resulting from removal of the octahistidine tag after tobacco etch virus protease cleavage. Herein, we refer to this three-domain protein construct from nsp3 as PLP2+ for simplicity.Kinetic characterization of PLP2+ was performed with three different fluorogenic substrates, Ub-AMC, ISG15-AMC, and Z-RLRGG-AMC. These three substrates were used to measure the deubiquitinating, deISGylating, and proteolytic activities of PLP2+. Plots of initial rates versus increasing concentrations of either Z-RLRGG-AMC or ISG15-AMC up to the highest substrate concentrations tested show no signs of substrate saturation (data not shown). Therefore, data were fit to a line to derive the apparent rate constant, kcat/K, from the slopes (Table 1). On the other hand, PLP2+ is readily saturated by the substrate Ub-AMC, so data were fit to the Michaelis-Menten equation, resulting in a K value of 1.3 ± 0.2 μm and a kcat value of 49.8 ± 2.9 min−1. The resulting kinetic parameters for the PLP2+-catalyzed hydrolysis of all three substrates are summarized in Table 1, where they are compared with the kinetic parameters of SARS-CoV PLpro and MERS-CoV PLpro, which were determined previously (25).
TABLE 1
Kinetic parameters of MHV PLP2+ with substrates RLRGG-AMC, Ub-AMC, and ISG15-AMC in comparison with SARS-CoV PLpro and MERS-CoV PLpro
Substrate
RLRGG-AMC
Ub-AMC
ISG15-AMC
MHV PLP2+
kcat/Km (μm−1 min−1)
0.0016a
38.3 ± 6.3
2.3 ± 0.1a
kcat (min−1)
49.8 ± 2.9
Km (μm)
1.3 ± 0.2
SARS-CoV PLprob
kcat/Km (μm−1 min−1)
0.3 ± 0.1a
1.5 ± 0.3
28.9 ± 5.3
kcat (min−1)
75.9 ± 8.1
436 ± 40
Km (μm)
50.6 ± 7.4
15.1 ± 2.4
MERS-CoV PLprob
kcat/Km (μm−1 min−1)
0.003a
1.3 ± 0.2
9.9 ± 1.6
kcat (min−1)
18.8 ± 1.2
32.6 ± 1.8
Km (μm)
14.3 ± 2.0
3.3 ± 0.5
For non-saturating substrates, kapp is calculated to approximate kcat/K.
The kinetic parameters of SARS-CoV PLpro and MERS-CoV PLpro are from Baez-Santos et al. (25).
Kinetic parameters of MHVPLP2+ with substrates RLRGG-AMC, Ub-AMC, and ISG15-AMC in comparison with SARS-CoV PLpro and MERS-CoV PLproFor non-saturating substrates, kapp is calculated to approximate kcat/K.The kinetic parameters of SARS-CoV PLpro and MERS-CoV PLpro are from Baez-Santos et al. (25).Based on the apparent kcat/K values, MHVPLP2+ catalyzes the hydrolysis of Ub-AMC ∼17-fold more efficiently than ISG15-AMC and 24,000-fold more efficiently than Z-RLRGG-AMC. This order of substrate preference for MHVPLP2+ (Ub-AMC > ISG15-AMC ⋙ Z-RLRGG-AMC) stands in contrast to the SARS-CoV and MERS-CoV PLpro enzymes, which prefer ISG15-AMC over Ub-AMC by about 19- and 8-fold, respectively (Table 1). Both SARS-CoV and MERS-CoV PLpro also highly prefer ISG15-AMC by 96- and 3,300-fold over the Z-RLRGG-AMC substrate. The significant difference between the PLP2+- and PLpro-mediated hydrolysis of Ub-AMC or ISG15-AMC and Z-RLRGG-AMC suggests that there are additional and important interactions between PLP2 and Ub outside of the enzyme's catalytic center (i.e. beyond the reactive cysteine and S5–S1 substrate recognition subsites).The ability of PLP2+ to act as deubiquitinating enzyme and catalyze the hydrolysis of isopeptide bonds was explored next by testing its specificity toward ubiquitin chains. Di-Ubs with different isopeptide linkages (Lys6, Lys11, Lys27, Lys29, Lys33, Lys48, or Lys63) or a peptide linkage (linear Ub) were incubated with PLP2+, and the reaction products were analyzed by SDS-PAGE (Fig. 2A). The results show that MHVPLP2+ is capable of cleaving all diubiquitins with isopeptide bonds into monoubiquitins but not linear-Ub, which contains a peptide bond (Fig. 2A). In addition, the efficiency of diubiquitin cleavage by PLP2+ appears to be linkage-dependent because hydrolyses of Lys11-, Lys48- and Lys63-linked di-Ubs are the most efficient. After 2 h, these di-Ubs have been completely processed to mono-Ub, whereas unreacted substrate still remains for Lys6-, Lys27-, Lys29-, and Lys33-linked di-Ubs.
FIGURE 2.
PLP2+-mediated processing of ubiquitin chains.
A, survey of the hydrolysis of di-Ub by PLP2+. 0.5 μg of di-Ub with different linkages (Lys6, Lys11, Lys27, Lys29, Lys33, Lys48, Lys63, and linear) was incubated at 25 °C with 20 nm PLP2+ for 2 h. Di-Ub incubated without PLP2+ serves as the negative control. B and C, time-dependent hydrolysis of Lys48-Ub4 (B) and Lys63-Ub6 (C) by PLP2+. The reactions were incubated at 25 °C, and aliquots were removed at five different time points, quenched with sample buffer, and then analyzed by SDS-PAGE. Polyubiquitin chains incubated without PLP2+ serve as negative control (ctrl). Markers were (from top to bottom) 50, 37, 25, 20, 15, and 10 kDa.
PLP2+-mediated processing of ubiquitin chains.
A, survey of the hydrolysis of di-Ub by PLP2+. 0.5 μg of di-Ub with different linkages (Lys6, Lys11, Lys27, Lys29, Lys33, Lys48, Lys63, and linear) was incubated at 25 °C with 20 nm PLP2+ for 2 h. Di-Ub incubated without PLP2+ serves as the negative control. B and C, time-dependent hydrolysis of Lys48-Ub4 (B) and Lys63-Ub6 (C) by PLP2+. The reactions were incubated at 25 °C, and aliquots were removed at five different time points, quenched with sample buffer, and then analyzed by SDS-PAGE. Polyubiquitin chains incubated without PLP2+ serve as negative control (ctrl). Markers were (from top to bottom) 50, 37, 25, 20, 15, and 10 kDa.Because Lys48- and Lys63-linked polyubiquitin chains are the most characterized chain types and the most ubiquitous in cell signaling pathways, we determined the time course for hydrolysis of Lys48-linked tetraubiquitin (Lys48-Ub4) and Lys63-linked hexaubiquitin (Lys63-Ub6) by MHVPLP2+ (Fig. 2, B and C). The data show that PLP2+ processes Lys48-Ub4 and Lys63-Ub6 with similar catalytic efficiency.
Crystallization and X-ray Structure Determination of MHV PLP2+
Purified MHVPLP2+ was readily crystallized by the microseeding approach from a solution containing ∼6 mg/ml enzyme in crystallization solution (100 mm HEPES, pH 8.0, 4% PEG 400, 2 m (NH4)2SO4, and 5 mm DTT). A complete x-ray data set on a native crystal was collected to 2.60 Å resolution (Table 2). Multiple attempts to determine the structure of MHVPLP2+ through numerous molecular replacement approaches using the available SARS-CoV PLpro structures (PDB codes 2FE8, 3E9S, and 3MJ5) or the TGEVPLP1 structure (3MP2) proved unsuccessful. Therefore, experimental phasing using heavy atom derivatives and SeMet-substituted MHVPLP2+ was pursued. Ultimately, experimental phases were determined via the single-wavelength anomalous dispersion approach from a 2.80 Å data set collected from a SeMet-substituted MHVPLP2+ crystal. Both native and SeMetPLP2+ proteins crystallized in the I213 space group with one PLP2+ monomer in the asymmetric unit. Because PLP2+ was determined to be a monomer in solution based on size exclusion chromatography/multiangle light scattering analysis (data not shown), the asymmetric unit contains the biologically relevant monomer. The final structure was determined and refined against the native data set to a resolution of 2.60 Å with final R-values of Rwork = 17.1% and Rfree = 22.1%. The final x-ray data collection and refinement statistics are summarized in Table 2. The final MHVPLP2+ structural model had observable electron density for residues 1533–1910. The first 10 N-terminal and last C-terminal residues had weak or unobservable electron density.Data collection and refinement statisticsValues in parentheses are for the last (highest resolution) shell.merge = ΣΣ|I(hkl) − 〈I(hkl)〉|/ΣΣ(hkl), where I(hkl) is the intensity of a given reflection, and 〈I(hkl)〉 is the mean intensity of symmetry-related reflections.pim = Σ√(1/n − 1) Σ|I(hkl) − 〈I(hkl)〉|/ΣΣ(hkl), where n is the multiplicity. Rpim is multiplicity-weighted Rmerge.work = Σ‖Fobs| − Fcalc‖/Σ|Fobs|, where Fobs and Fcalc are the observed and calculated structure factors, respectively.free was calculated using 5% of the data set chosen at random that was excluded from the refinement.
The X-ray Structure of MHV PLP2+ Reveals a DPUP Domain in MHV nsp3
The crystal structure of the MHV PLP2 catalytic core and flanking N-terminal and Ubl2 domains consists of five domains: fingers, palm, thumb, Ubl2, and DPUP (Fig. 3A). The thumb, palm, and fingers domains form the catalytic core of PLP2, which has an architecture resembling a right hand. The thumb domain (aa 1668–1786 in the expression construct) is primarily made up of six helices (α4–α9), with two short anti-parallel strands (β11–β12) between helices α5 and α6. The palm domain (aa 1842–1909) is formed by six anti-parallel strands (β18–β23). Finally, the fingers domain contains two short helices, one short strand, and one β-sheet consisting of four anti-parallel twisted strands. At the tip of the fingers domain, four cysteines (Cys1794, Cys1796, Cys1828, and Cys1830) coordinate a metal ion in a tetrahedral geometry. The identity of the metal was determined to be zinc via a fluorescence energy scan at the zinc x-ray absorption edge (data not shown).
FIGURE 3.
X-ray crystal structure of PLP2+.
A, ribbon representation of the overall structure of PLP2+. The structure contains five domains: fingers (green), palm (light blue), thumb (orange), Ubl2 (magenta), and DPUP (wheat). The zinc atom in the fingertip of the fingers domain is shown as a gray sphere. The catalytic triad residues and the four zinc-coordinating cysteines from the fingers domain are represented as sticks. B, structural superposition of the DPUP domain (wheat) with SARS SUD-C domain (cyan; PDB code 2KQW). The root mean square deviation between the two structures is 2.1 Å over 63 aligned Cα atoms at 13% sequence identity. C, electron density maps covering the catalytic triad before (left) and after (right) refinement of the sulfonic group at the catalytic Cys. In the right panel, the occupancies of the oxygen atoms in the sulfonic acid group were set to 0.5. In each panel, the residual electron density in F − F maps are colored green and are contoured at 3σ, whereas the 2F − F maps are colored blue and are contoured at 1σ.
X-ray crystal structure of PLP2+.
A, ribbon representation of the overall structure of PLP2+. The structure contains five domains: fingers (green), palm (light blue), thumb (orange), Ubl2 (magenta), and DPUP (wheat). The zinc atom in the fingertip of the fingers domain is shown as a gray sphere. The catalytic triad residues and the four zinc-coordinating cysteines from the fingers domain are represented as sticks. B, structural superposition of the DPUP domain (wheat) with SARS SUD-C domain (cyan; PDB code 2KQW). The root mean square deviation between the two structures is 2.1 Å over 63 aligned Cα atoms at 13% sequence identity. C, electron density maps covering the catalytic triad before (left) and after (right) refinement of the sulfonic group at the catalytic Cys. In the right panel, the occupancies of the oxygen atoms in the sulfonic acid group were set to 0.5. In each panel, the residual electron density in F − F maps are colored green and are contoured at 3σ, whereas the 2F − F maps are colored blue and are contoured at 1σ.The MHV ubiquitin-like (Ubl2) domain (aa 1608–1667) is composed of a β-grasp fold (46). The Ubl domain was first identified in nsp3 after the determination of the x-ray structure of SARS-CoV PLpro (17). Subsequently, a second Ubl domain was identified in nsp3 after determination of the NMR structure of the first 183 residues of the SARS-CoV nsp3 N terminus (13). The Ubl domain associated with PLpros and PLP2 is now designated as Ubl2, and the N-terminal Ubl domain is now designated as Ubl1 because it comes first in the primary sequence.The x-ray structure of PLP2+ also reveals an unpredicted domain at the N terminus of MHV PLP2 that is formed by 85 residues. The domain consists of two helices (α1 and α2) that are connected by five anti-parallel β-strands (β1–β5). A DALI search (47) of this unpredicted domain against the PDB identified the C domain of the SARS unique domain (SUD-C) as the top match, with a Z-score of 7.3. Structural superposition of this domain with the SUD-C domain, using the DaliLite server (48), shows that the root mean square deviation is 2.1 Å over 63 aligned backbone Cα atoms, whereas the sequence identity is only 13% (Fig. 3B). Although the SUD has been proposed to be unique to SARS-CoV, the presence of this domain in MHV nsp3 calls into question its exclusivity (see “Discussion”). We designate this new domain in nsp3 as the DPUP (domain preceding Ubl2 and PLP2) domain because it does not engender exclusivity, and it allows for DPUP to be identified in other coronavirus nsp3s.The active site of PLP2, which consists of a Cys1716-His1873-Asp1887 catalytic triad, sits between the thumb and palm domains. Cys1716 and Asp1887 are both within hydrogen-bonding distance of His1873 (3.1 and 3.0 Å, respectively; Fig. 3C), indicating that the active site is arranged in a catalytically competent manner, even in the absence of a substrate. Unlike SARS-CoV PLpro, where a tryptophan side chain forms part of the oxyanion hole (17), Gln1711 occupies the same position in PLP2 and appears to be involved in the oxyanion hole stabilization (Fig. 3C). Asn1714, which is conserved among coronaviral PLPs, may also participate in stabilizing the oxyanion hole. Interestingly, after structural refinement of PLP2+, residual electron density surrounding the reactive cysteine (Cys1716) was observed, suggesting that it is partially oxidized to cysteic acid (Fig. 3C, left). X-ray refinement with a partially occupied cysteic acid was able to account for the residual electron density (Fig. 3C, right). Partial or complete oxidation of the catalytic cysteine is commonly observed in crystal structures of cysteine proteases due to both the highly reactive nature of the side chain thiol group and its ability to coordinate thiophilic transition metals capable of oxidation (49). We have observed complete, partial, and no oxidation of the reactive cysteine in SARS-CoV PLpro x-ray structures (17, 50–52).
Structural Comparison of MHV PLP2 to Known Structures of Coronaviral PLPs
Prior to the structure of MHV PLP2, structures of other CoV PLPs, including SARS-CoV PLpro, MERS-CoV PLpro, TGEVPLP1, and IBV PLpro, have been determined (17, 30, 53, 54, 65) (Fig. 1). To help elucidate the function of coronaviral PLPs, the catalytic core of PLP2 was superimposed with the above known CoVPLP structures. The overall architectures of these proteins are similar, with root mean square deviation ranges from 1.7 Å (SARS-CoV PLpro, 218 of 242 aligned residues) to 2.8 Å (TGEVPLP1, 189 of 242 aligned residues). For the sake of clarity, only the structural superpositions of MHV PLP2 on SARS-CoV PLpro and MERS-CoV PLpro are shown (Fig. 4A). The Cys-His-Asp catalytic triads from these PLPs are well aligned (Fig. 4B). However, several differences in the structures are observed from the overlay. First, whereas the local geometry of the zinc-binding site is maintained in all these structures, the fingertip regions have different conformations and differing extents of opening/closure with respect to the palm domains. In contrast with SARS-CoV PLpro and MERS-CoV PLpro, the fingertip in PLP2 collapses toward the palm, rendering the canonical surface for ubiquitin binding too compact to accommodate the typical ubiquitin structure without some opening via conformational changes. Extensive flexibility within the fingers domain has been observed in the x-ray structure of the free enzyme form (i.e. with no substrates or ligands bound) of SARS-CoV PLpro, where an overlay of the three protomers within one asymmetric unit shows significant variations in the positions of the atoms in the fingers domain (up to 4.6 Å for zinc atoms) (17). The conformational flexibility of the fingers domain may regulate substrate binding and, hence, the activity of PLPs. However, given the tight packing of the crystal around the fingers domain (data not shown), there is the possibility that crystal packing may influence the conformation of the collapsed fingers domain in MHV PLP2.
FIGURE 4.
Structural comparison among coronaviral PLPs.
A, structural superposition of unbound PLP2 (light blue), unbound SARS-CoV PLpro (light pink; PDB code 2FE8), and unbound MERS-CoV PLpro (light green; PDB code 4P16). Areas with major differences are indicated with black arrows. B, close-up view of the active site alignments. The Cys-His-Asp catalytic triad is conserved among the three proteins, whereas the oxyanion hole residues are different. The color code is the same as in A. The catalytic cysteine in MERS-CoV PLpro was covalently modified by β-mercaptoethanol. C, conformations of the BL2 loops in different CoV PLPs. The color code is the same as in A with the additional structure of inhibitor-bound SARS-CoV PLpro shown in cyan (PDB code 3E9S). The loop from MERS-CoV PLpro is omitted because it is not visible in the structure. D, sequences of the BL2 loops in different CoV PLPs.
Structural comparison among coronaviral PLPs.
A, structural superposition of unbound PLP2 (light blue), unbound SARS-CoV PLpro (light pink; PDB code 2FE8), and unbound MERS-CoV PLpro (light green; PDB code 4P16). Areas with major differences are indicated with black arrows. B, close-up view of the active site alignments. The Cys-His-Asp catalytic triad is conserved among the three proteins, whereas the oxyanion hole residues are different. The color code is the same as in A. The catalytic cysteine in MERS-CoV PLpro was covalently modified by β-mercaptoethanol. C, conformations of the BL2 loops in different CoV PLPs. The color code is the same as in A with the additional structure of inhibitor-bound SARS-CoV PLpro shown in cyan (PDB code 3E9S). The loop from MERS-CoV PLpro is omitted because it is not visible in the structure. D, sequences of the BL2 loops in different CoV PLPs.Another region with significant differences among the PLPs is the Lβ19-β20 loop in PLP2 (sequence: 1868GGSVG1872) (Fig. 4, A and C), referred to as the BL2 loop in ubiquitin-specific proteases. The length of the substrate loop varies in different CoVs (Figs. 4D and 6). Flanked with glycine residues, this loop is expected to be highly flexible. Indeed, this loop has been observed in different conformations in various structures of coronaviral PLPs (Fig. 4C). In the case of SARS-CoV PLpro, the loop (267GNYQCG272) is found in a closed arrangement when there is a bound inhibitor, whereas an open state is observed when the enzyme is crystallized as the free enzyme form (Fig. 4C). However, the loop in PLP2 is one residue shorter, and it is observed in an extended open conformation. Interestingly, the BL2 loop of PLP2 does not contain any phenylalanine or tyrosine residue. It has been shown for SARS-CoV PLpro and HCoV-NL63 PLP2 that having a phenylalanine or tyrosine on the BL2 loop is important for the binding of a series of potent PLP inhibitors (50, 52). On the other hand, MERS-CoV PLpro, which does not have any phenylalanine or tyrosine residue on the BL2 loop, has no inhibition by these inhibitors (25). And our results have shown that MHV PLP2, which lacks a phenylalanine or tyrosine residue on the BL2 loop, is not affected by this series of inhibitors (data not shown).
FIGURE 6.
Sequence alignment of various coronaviral PLP2s. Group 1, HCoV-229E and HCoV-NL63; group 2, MHV, SARS-CoV, MERS-CoV, HCoV-OC43, and HCoV-HKU1; group 3, IBV. The BL2 loop residues are boxed in green. Purple stars, catalytic triad residues; orange ovals, residues of MHV PLP2 that may be involved in important interactions with Ub. Accession numbers are as follows: MHV PLP2, P0C6V0; SARS-CoV PLpro, AEA10816.1; MERS-CoV PLpro, YP_007188578.1; HCoV-OC43, NP_937947.21; HCoV-HKU1, YP_460024.1; HCoV-229E, AGT21365.1; HCoV-NL63, AFD98832.1; IBV, P0C6V5.1. The alignment was performed using Clustal Omega. The figure was generated using ESPript (67). Same residues are shaded in red, and conserved residues are shown in red.
Potential Interactions between MHV PLP2 and Ubiquitin
To gain insights into potential interactions between PLP2 and ubiquitin, we utilized the structure of the SARS-CoV PLpro-ubiquitin aldehyde (Ubal) complex (PDB code 4MM3) (40) as a template to generate a model of PLP2 in complex with Ubal via molecular dynamics (see “Experimental Procedures” for details). The resulting computational model of the MHV PLP2-Ubal complex reveals that the Ubal molecule is stabilized by numerous interactions at the canonical surface formed by the thumb-palm-fingers domains of PLP2, whereas the C-terminal residues of Ubal extend into the narrow opening of the active site, as expected. Henceforth, to differentiate the enzyme residues from Ub residues, the latter will be shown in italic type, whereas the former will be represented in normal type.Extensive hydrogen-bonding interactions between the C-terminal residues of Ubal and the active site of PLP2 are observed in the model (Fig. 5A). This observation is consistent with the fact that PLP2 can hydrolyze the peptide substrate Z-RLRGG-AMC, albeit more than 20,000 times less efficiently than Ub-AMC. A similar network of hydrogen-bonding interactions has also been discovered in the structure of SARS-CoV PLpro-Ubal complex (Fig. 5B) (40). In SARS-CoV PLpro, the side chain of Tyr265 interacts with the backbone of Arg (Fig. 5B). A similar interaction is absent in PLP2 due to the substitution of Tyr265 with Phe1866. However, the loss of this hydrogen-bonding interaction may not have a significant effect on the enzymatic activity because the Y265F mutant of SARS-CoV PLpro possesses activity comparable with that of WT (55). On the other hand, a potentially stable salt bridge interaction between the side chain of Arg and Glu1766 is observed in the PLP2-Ubal model (Fig. 5A). In the SARS-CoV PLpro-Ubal complex, however, a similar interaction is absent, as suggested by both the crystal structure (40, 55) and the 1.2-ns molecular dynamics analysis of the complex (data not shown). Overall, the differences in the interaction profiles between the C-terminal residues of Ubal and the active sites of MHV PLP2 and SARS-CoV PLpro do not adequately justify why the activity of PLP2 with Z-RLRGG-AMC is significantly lower than that of SARS-CoV PLpro. Perhaps the additional salt bridge between Glu1766 and Arg in the PLP2-Ubal model is detrimental to the activity of PLP2+ with Z-RLRGG-AMC because it could lead to substantial product inhibition. Alternatively, additional interactions that were not predicted by the molecular dynamics run may play an important role in the reduction of MHVPLP2+ activity against Z-RLRGG-AMC when compared with that of SARS-CoV PLpro.
FIGURE 5.
Model of PLP2 in complex with Ubal.
A and B, potential interactions between PLP2 and the C-terminal residues of Ubal (A) in comparison with the identified interactions between SARS-CoV PLpro and the C-terminal residues of Ubal (B) as observed in the structure of SARS-CoV PLpro-Ubal complex (PDB code 4MM3). PLP2 is colored as in Fig. 3: thumb (orange), palm (light blue), and fingers (green). Ubal is shown in yellow, and SARS-CoV PLpro is shown in cyan. Hydrogen bonding and salt bridge interactions are represented as dashed lines. The red dashed line in A indicates that a similar interaction is not observed in B, and vice versa. C, potential interactions between PLP2 and Ubal (yellow). Only the interactions that persist for more than 50% of the simulation time are shown. The magenta dashed line indicates that the interaction persists for more than 50% of the simulation time but is not present in the current analyzed frame.
Model of PLP2 in complex with Ubal.
A and B, potential interactions between PLP2 and the C-terminal residues of Ubal (A) in comparison with the identified interactions between SARS-CoV PLpro and the C-terminal residues of Ubal (B) as observed in the structure of SARS-CoV PLpro-Ubal complex (PDB code 4MM3). PLP2 is colored as in Fig. 3: thumb (orange), palm (light blue), and fingers (green). Ubal is shown in yellow, and SARS-CoV PLpro is shown in cyan. Hydrogen bonding and salt bridge interactions are represented as dashed lines. The red dashed line in A indicates that a similar interaction is not observed in B, and vice versa. C, potential interactions between PLP2 and Ubal (yellow). Only the interactions that persist for more than 50% of the simulation time are shown. The magenta dashed line indicates that the interaction persists for more than 50% of the simulation time but is not present in the current analyzed frame.In addition to the interactions near the active site, extensive contacts are predicted between MHV PLP2 and the Ubal core (residues 1–71) (Fig. 5C). Specifically, side chains of Tyr1824 and Arg1803 interact with the backbone carbonyl of Ala through hydrogen bonding, whereas there is a backbone-backbone interaction between the amide of Phe1812 and the carbonyl of Gly. In addition, the side chain of Phe1812 interacts with the Ile hydrophobic patch of Ub. Moreover, residues Asp1772, Arg1775, and Asp1807 each engage in salt bridge interactions with Arg, Glu, and Lys, respectively. The more complementary charge contacts that exist in the MHV PLP2-Ubal complex but not the SARS-CoV PLpro-Ubal complex may account for the much lower K of PLP2+ with Ub-AMC than that of SARS-CoV PLpro.
Mutagenesis Studies of MHV PLP2+ Reveal Important Residues Involved in Ub Binding and Interferon Antagonism
To probe the importance of identified residues in Ub binding, we performed mutagenesis studies and characterized the activities of the resulting mutants with substrates Z-RLRGG-AMC and Ub-AMC. As summarized in Table 3, the D1772R mutant has only 1% of WT activity with Ub-AMC, whereas 50% of WT activity with Z-RLRGG-AMC is maintained. The same trend is observed with the D1807R mutant. These observations suggest that both Asp1772 and Asp1807 of MHV PLP2 play important roles in binding Ub via regions outside of the Ub C terminus. Asp1807 is poorly conserved among CoVs (Fig. 6), implying that the interaction between Asp1807 and Ub may be unique to MHV PLP2. On the other hand, residues corresponding to Asp1772 in MHV PLP2 are either aspartic or glutamic acids in other CoVs (Fig. 6), except MERS-CoV PLpro (arginine) and IBV PLP2 (tryptophan). In fact, Glu168, which corresponds to Asp1772 in SARS-CoV PLpro has also been shown to be important for interacting with Ub (55). However, the observation that MERS-CoV PLpro also has a comparable level of DUB activity (29) suggests that this enzyme may interact with Ub in a different manner.
TABLE 3
Activity characterization of PLP2+ mutants
Relative activity of WT
50 μm RLRGG-AMC
500 nm Ub-AMC
%
%
WT
100 ± 3
100 ± 2
D1772R
42.1 ± 2.4
1.1 ± 0.1
D1807R
26.2 ± 2.0
0.5 ± 0.1
Y1824F
46.2 ± 0.7
70.3 ± 4.4
Y1824A
0
3.0 ± 0.2
F1812A
41.1 ± 0.9
3.3 ± 0.9
Activity characterization of PLP2+ mutantsSequence alignment of various coronaviral PLP2s. Group 1, HCoV-229E and HCoV-NL63; group 2, MHV, SARS-CoV, MERS-CoV, HCoV-OC43, and HCoV-HKU1; group 3, IBV. The BL2 loop residues are boxed in green. Purple stars, catalytic triad residues; orange ovals, residues of MHV PLP2 that may be involved in important interactions with Ub. Accession numbers are as follows: MHV PLP2, P0C6V0; SARS-CoV PLpro, AEA10816.1; MERS-CoV PLpro, YP_007188578.1; HCoV-OC43, NP_937947.21; HCoV-HKU1, YP_460024.1; HCoV-229E, AGT21365.1; HCoV-NL63, AFD98832.1; IBV, P0C6V5.1. The alignment was performed using Clustal Omega. The figure was generated using ESPript (67). Same residues are shaded in red, and conserved residues are shown in red.Two mutants, Y1824F and Y1824A, were generated to investigate the role of the phenolic side chain in ubiquitin binding. Whereas the Y1824F mutant was found to possess activities comparable with those of WT, the Y1824A mutant is essentially inactive. This indicates that the predicted hydrogen-bonding interaction between Tyr1824 and Ub is not crucial for the processing of Ub. However, the presence of an aromatic ring at this position is important, given the packing around the Tyr1824 residue (Fig. 7A), and its removal may compromise the protein's stability and thus its enzymatic activity (Table 3). Indeed, temperature inactivation experiments show that Y1824A is much more unstable than Y1824F (Fig. 7C). Interestingly, the multiple sequence alignment of various PLPs reveals that residues at this position are conserved as mainly aromatic or in one case a methionine (Fig. 6).
FIGURE 7.
Packing of MHV PLP2 around residues Tyr
A, the packing of the PLP2 fingers domain around Tyr1824 shown in a space-filling view (green, carbon; red, oxygen; blue, nitrogen; gray, hydrogen). Ub is shown as a cartoon in yellow. B, the packing of Phe1812 from PLP2 against the Ile patch from Ub shown in a space-filling view. The color code is the same as in A, except carbon atoms of Ile44 and Val70 are shown in yellow. C, temperature inactivation of PLP2+ mutant Y1824A. The activity of WT PLP2+ and two PLP2+ mutants (Y1824F and Y1824A) were measured after incubation at 30 °C for different time periods and then normalized to the activity at 0 min. Error bars, S.D.
Packing of MHV PLP2 around residues Tyr
A, the packing of the PLP2 fingers domain around Tyr1824 shown in a space-filling view (green, carbon; red, oxygen; blue, nitrogen; gray, hydrogen). Ub is shown as a cartoon in yellow. B, the packing of Phe1812 from PLP2 against the Ile patch from Ub shown in a space-filling view. The color code is the same as in A, except carbon atoms of Ile44 and Val70 are shown in yellow. C, temperature inactivation of PLP2+ mutant Y1824A. The activity of WT PLP2+ and two PLP2+ mutants (Y1824F and Y1824A) were measured after incubation at 30 °C for different time periods and then normalized to the activity at 0 min. Error bars, S.D.Importantly, the F1812A mutant leads to a substantial decrease (3% remaining activity) in its DUB activity, whereas 50% of its activity with Z-RLRGG-AMC is maintained. This indicates that the hydrophobic interaction between Phe1812 and the Ile patch is important in the recognition of Ub. Given the packing of Phe1812 against Ile (Fig. 7B), it is not surprising that the replacement of Phe1812 with alanine has such dire consequences to its DUB activity. Notably, most CoVs have conserved hydrophobic residues at this position (Fig. 6).Finally, to investigate the correlation between the DUB activity of PLP2 and its ability to antagonize IFN production, we performed IFN antagonism assays in cell culture with several selected mutants. In particular, the Y1824F mutant, which displays WT level DUB activity, preserves the ability to antagonize IFN production, whereas the inactive mutant, Y1824A, loses IFN antagonism ability (Fig. 8). In contrast to Y1824F, the D1772R, D1807R, and F1812A mutants, which have moderate activities against Z-RLRGG-AMC but low DUB activities, are unable to block the activation of IFN (Fig. 8). Thus, the combined computational modeling and enzyme kinetic studies aid in the identification of regions and residues important for ubiquitin binding and IFN antagonism.
FIGURE 8.
Interferon antagonism activity of MHV PLP2+ wild type and mutants. HEK293T cells were transfected with plasmids expressing either the WT, the catalytic mutant (CA), or the indicated PLP2+ mutants and plasmids expressing IFNβ-luc, Renilla-luc, and N-RIG-I. At 16 h post-transfection, cells were lysed, and luciferase activity was measured. Experiments were performed in triplicate. Error bars, S.D.
Interferon antagonism activity of MHVPLP2+ wild type and mutants. HEK293T cells were transfected with plasmids expressing either the WT, the catalytic mutant (CA), or the indicated PLP2+ mutants and plasmids expressing IFNβ-luc, Renilla-luc, and N-RIG-I. At 16 h post-transfection, cells were lysed, and luciferase activity was measured. Experiments were performed in triplicate. Error bars, S.D.
Discussion
In this study, we demonstrate that purified MHVPLP2+ has both deubiquitinating and deISGylating activities, with the former activity being the most dominant. A survey of various ubiquitin linkages shows that PLP2+ processes the isopeptide bonds of di-Ubs of all types, with the exception of linear di-Ub, which contains a peptide bond. We also report the first crystal structure of the PLP2 catalytic core together with two of its N-terminal adjacent domains as part of nsp3. Besides the expected thumb-palm-finger architecture of the PLP2 catalytic core and the Ubl2 domain at its N terminus, the x-ray structure reveals a domain preceding the Ubl2 domain in MHV nsp3 that is so far only similar to the SARS-CoV SUD-C domain. We designate this domain as the DPUP (domain preceding Ubl2 and PLP2) domain. Finally, we generated a computational model of PLP2 in complex with Ubal, and supported by mutagenesis studies, we show that the computational model provides insights into the interactions between PLP2 and Ub.Kinetic characterization of MHVPLP2+ revealed significant distinctions in the deubiquitination and deISGylation activities of MHVPLP2+ and SARS-CoV PLpro. The results suggest that these two viral proteins may target different cellular substrates to facilitate viral evasion of the host innate immune responses. Furthermore, the fact that MHVPLP2+ can process di-Ubs of all linkage types, except the linear one, indicates that this enzyme displays promiscuous recognition surrounding the environment of the lysine residue of the proximal ubiquitin molecule in ubiquitin chains. What is not clear is whether or not there are specific substrate recognition mechanisms associated with MHVPLP2+ that are mediated through the interactions with specific ubiquitinated substrates instead of the attached ubiquitin chains. Alternatively, MHVPLP2+ may require external assistance, such as other domains in nsp3, to achieve substrate discrimination, or MHVPLP2+ may indiscriminately deubiquitinate multiple host cell proteins. Moreover, the observation that MHVPLP2+ can process both Lys48- and Lys63-linked polyubiquitin chains to monoubiquitin with nearly equal efficiency differs from that of SARS-CoV PLpro. In the case with SARS-CoV PLpro, Lys48-linked ubiquitin chains are cleaved much more rapidly than the Lys63-linked ones, and Lys48-di-Ub is processed much slower than the longer ubiquitin chains (26). This observed difference indicates that, unlike SARS-CoV PLpro, which has two ubiquitin binding sites (40), MHVPLP2+ most likely has only one ubiquitin binding site.Of significant importance is the discovery of the DPUP domain in MHV nsp3 as revealed by the x-ray structure of PLP2+. The SUD, including SUD-N, SUD-M, and SUD-C, was believed to be unique to SARS-CoV and, perhaps, responsible for its extreme pathogenicity (24). Subsequently, Neuman et al. (12) performed a bioinformatics analysis of nsp3s from multiple coronaviruses and proposed that the SUD-C exists only in the β-coronavirus genogroups 2b, 2c, and 2d and not in genogroup 2a, to which MHV belongs. In addition, the SUD-C is predicted to be absent in the α- and γ-coronaviruses. X-ray and NMR structural studies show that subdomains SUD-N and SUD-M adopt macrodomain folds, whereas SUD-C adopts a frataxin-like fold (15, 16). Interestingly, SUD-NM and its individual subdomains can bind oligonucleotides that form G-quadruplexes (15, 56). SUD-C has also been shown to have RNA-binding ability, and concerted actions by the subdomains of SUD have been proposed to ensure specific RNA binding (16). We have attempted to investigate the ability of the DPUP domain to bind oligonucleotides; however, the results were inconclusive (data not shown). Nonetheless, the presence of the DPUP domain in MHV nsp3 and the bioinformatics prediction suggest that SUD should no longer be considered “unique” to SARS-CoV. Therefore, it is intriguing to investigate the presence of similar domains in other CoVs and their roles in virus infection, replication, and pathogenesis.The computational model of PLP2 in complex with Ubal provides insights into the molecular interactions between the enzyme and its ubiquitin substrates and helps in providing a structural rationale for the results of the kinetic characterization. The extensive interactions predicted between PLP2 and the Ub beyond the -RLRGG C-terminal sequence may account for the significantly higher activity of PLP2+ with Ub-AMC than with the Z-RLRGG pentapeptide substrate. A higher extent of charge complementarity between PLP2 and Ub, compared with that displayed by SARS-CoV PLpro, may explain the much lower K of the former with Ub-AMC. Mutagenesis of PLP2+ supports the model and, at the same time, reveals the importance of residues Asp1772, Asp1807, Tyr1824, and Phe1812 in the recognition of Ub. To evaluate the role of PLP2 DUB activity in immune evasion in the context of virus replication through the approach of protein engineering, it is essential to maintain the proteolytic activity of the mutants. The residues listed above, which are located away from the active site, can serve as candidates for protein engineering of PLP2 in an effort to selectively disrupt its DUB activity and, therefore, its IFN antagonism ability.Coronaviral nsp3 is a large, membrane-associated, multidomain protein. SARS-CoV nsp3 has been shown to play a crucial role in inducing the formation of double-membrane vehicles (57). Furthermore, a proteomics analysis of SARS-CoV virions has described nsp3 as the “hub of connectivity” because it makes the most interactions with other viral proteins, implying a critical role of nsp3 in virion assembly and virus replication (12, 20). Although several domains of nsp3 have been characterized individually, including the ADRP domain (from SARS-CoV and HCoV-229E), the PLP domain (from SARS-CoV, HCoV-NL63, MERS-CoV, TGEV, and porcine epidemic diarrhea virus), and the SARS-CoV SUD domain (15–17, 26, 56, 58), it remains unclear how nsp3 coordinates the multiple domains with different functions to participate in viral RNA synthesis and whether there are any concerted actions among domains. For example, our recent study revealed the critical role of the Ubl2 domain adjacent to PLP2 in maintaining the stability and activity of the protease (23). We found that mutations in the Ubl2 domain resulted in a temperature-sensitive phenotype and that the virus was highly attenuated in animals. In addition, this Ubl2 mutant induced protective immunity and provided a new approach to vaccine development for coronaviruses. Because of the potential cross-talk between the adjacent domains, it would be important to characterize nsp3 kinetically and structurally in its entirety and to evaluate the function of each domain in the context of full nsp3. Toward this goal, we disclose herein the enzymatic and structural characterization of the largest portion of coronaviral nsp3 that has ever been studied in vitro, which consists of the DPUP domain, the Ubl domain, and the PLP2 domain.Although PLP2 has been shown to be capable of deubiquitinating TBK1 in cell-based assays (38), here the DUB activity of PLP2 is explicitly validated through in vitro characterization using purified recombinant proteins. Antagonizing the host innate immune responses through viral deubiquitinating/deISGylating enzymes is a common strategy employed by viruses. Besides coronaviruses, other species, such as adenovirus, nairovirus, arterivirus, and herpesvirus, have also been found to encode proteases that have deubiquitinating/deISGylating activities, and these proteases are proposed to assist in evading ubiquitin- or ISG15-dependent host innate immune responses (59–61). Extensive protein overexpression studies performed on cellular levels have to some extent supported this hypothesis (33, 35, 39). Moreover, studies with equine arteritis virus PLP2 have shown that cells infected with DUB activity-deficient equine arteritis virus display significantly enhanced innate immune responses compared with cells infected with WT virus (62). However, a detailed mechanism of how viral DUBs suppress host immune responses is still not fully clear.The continuous emergence of severe epidemics caused by new CoVs requires better understanding of the infection and pathogenesis mechanisms. MHV is the best-studied coronavirus model, and it has an undeniable advantage of ease in culturing in comparison with HCoVs. The discovery that PLP2 has deubiquitinating/deISGylating activities enables the use of MHV as a model to study the mechanism of coronavirus immune evasion through viral DUBs. More importantly, the structure of PLP2 and the model of PLP2-Ub complex provided by this study bring us closer to this goal.
Author Contributions
Y. C. and A. D. M. conceived and coordinated the study and wrote the paper. Y. C. purified, characterized, and crystallized PLP2+ and determined its x-ray structure. S. N. S. built the computation model of PLP2-Ub complex. Y. C. and S. N. S. designed the mutants. Y. C. and T. C. purified and characterized the mutants. A. M. M. designed, performed, and analyzed the experiments shown in Fig. 8. S. C. B. provided substantial contributions to analysis and interpretation of data. All authors reviewed the results and approved the final version of the manuscript.
Authors: Kumar Singh Saikatendu; Jeremiah S Joseph; Vanitha Subramanian; Tom Clayton; Mark Griffith; Kin Moy; Jeffrey Velasquez; Benjamin W Neuman; Michael J Buchmeier; Raymond C Stevens; Peter Kuhn Journal: Structure Date: 2005-11 Impact factor: 5.006
Authors: Antonina Andreeva; Dave Howorth; Steven E Brenner; Tim J P Hubbard; Cyrus Chothia; Alexey G Murzin Journal: Nucleic Acids Res Date: 2004-01-01 Impact factor: 16.971
Authors: Eric J Snijder; Peter J Bredenbeek; Jessika C Dobbe; Volker Thiel; John Ziebuhr; Leo L M Poon; Yi Guan; Mikhail Rozanov; Willy J M Spaan; Alexander E Gorbalenya Journal: J Mol Biol Date: 2003-08-29 Impact factor: 5.469
Authors: Nerea Irigoyen; Andrew E Firth; Joshua D Jones; Betty Y-W Chung; Stuart G Siddell; Ian Brierley Journal: PLoS Pathog Date: 2016-02-26 Impact factor: 6.823
Authors: Jozlyn R Clasman; Yahira M Báez-Santos; Robert C Mettelman; Amornrat O'Brien; Susan C Baker; Andrew D Mesecar Journal: Sci Rep Date: 2017-01-12 Impact factor: 4.379
Authors: Steve P H Alexander; Jane F Armstrong; Anthony P Davenport; Jamie A Davies; Elena Faccenda; Simon D Harding; Francesca Levi-Schaffer; Janet J Maguire; Adam J Pawson; Christopher Southan; Michael Spedding Journal: Br J Pharmacol Date: 2020-07-19 Impact factor: 8.739
Authors: Brendan T Freitas; Ian A Durie; Jackelyn Murray; Jaron E Longo; Holden C Miller; David Crich; Robert Jeff Hogan; Ralph A Tripp; Scott D Pegan Journal: ACS Infect Dis Date: 2020-06-04 Impact factor: 5.084
Authors: Xufang Deng; Robert C Mettelman; Amornrat O'Brien; John A Thompson; Timothy E O'Brien; Susan C Baker Journal: J Virol Date: 2019-05-29 Impact factor: 5.103
Authors: Xufang Deng; Yafang Chen; Anna M Mielech; Matthew Hackbart; Kristina R Kesely; Robert C Mettelman; Amornrat O'Brien; Mackenzie E Chapman; Andrew D Mesecar; Susan C Baker Journal: J Virol Date: 2020-05-18 Impact factor: 5.103