Qian Chen1,2, Junwei Zhou1,2, Zhixiang Yang1,2, Jiahui Guo1,2, Zimin Liu1,2, Xinyi Sun1,2, Qingshi Jiang1,2, Liurong Fang1,2, Dang Wang1,2, Shaobo Xiao1,2. 1. State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China. 2. Key Laboratory of Preventive Veterinary Medicine in Hubei Province, The Cooperative Innovation Center for Sustainable Pig Production, Wuhan 430070, China.
Abstract
Equine arteritis virus (EAV) and porcine reproductive and respiratory syndrome virus (PRRSV) represent two members of the family Arteriviridae and pose a major threat to the equine- and swine-breeding industries throughout the world. Previously, we and others demonstrated that PRRSV 3C-like protease (3CLpro) had very high glutamic acid (Glu)-specificity at the P1 position (P1-Glu). Comparably, EAV 3CLpro exhibited recognition of both Glu and glutamine (Gln) at the P1 position. However, the underlying mechanisms of the P1 substrate specificity shift of arterivirus 3CLpro remain unclear. We systematically screened the specific amino acids in the S1 subsite of arterivirus 3CLpro using a cyclized luciferase-based biosensor and identified Gly116, His133 and Ser136 (using PRRSV 3CLpro numbering) are important for recognition of P1-Glu, whereas Ser136 is nonessential for recognition of P1-Gln. Molecular dynamics simulations and biochemical experiments highlighted that the PRRSV 3CLpro and EAV 3CLpro formed distinct S1 subsites for the P1 substrate specificity switch. Mechanistically, a specific intermolecular salt bridge between PRRSV 3CLpro and substrate P1-Glu (Lys138/P1-Glu) are invaluable for high Glu-specificity at the P1 position, and the exchange of K138T (salt bridge interruption, from PRRSV to EAV) shifted the specificity of PRRSV 3CLpro toward P1-Gln. In turn, the T139K exchange of EAV 3CLpro showed a noticeable shift in substrate specificity, such that substrates containing P1-Glu are likely to be recognized more efficiently. These findings identify an evolutionarily accessible mechanism for disrupting or reorganizing salt bridge with only a single mutation of arterivirus 3CLpro to trigger a substrate specificity switch.
Equine arteritis virus (EAV) and porcine reproductive and respiratory syndrome virus (PRRSV) represent two members of the family Arteriviridae and pose a major threat to the equine- and swine-breeding industries throughout the world. Previously, we and others demonstrated that PRRSV 3C-like protease (3CLpro) had very high glutamic acid (Glu)-specificity at the P1 position (P1-Glu). Comparably, EAV 3CLpro exhibited recognition of both Glu and glutamine (Gln) at the P1 position. However, the underlying mechanisms of the P1 substrate specificity shift of arterivirus 3CLpro remain unclear. We systematically screened the specific amino acids in the S1 subsite of arterivirus 3CLpro using a cyclized luciferase-based biosensor and identified Gly116, His133 and Ser136 (using PRRSV 3CLpro numbering) are important for recognition of P1-Glu, whereas Ser136 is nonessential for recognition of P1-Gln. Molecular dynamics simulations and biochemical experiments highlighted that the PRRSV 3CLpro and EAV 3CLpro formed distinct S1 subsites for the P1 substrate specificity switch. Mechanistically, a specific intermolecular salt bridge between PRRSV 3CLpro and substrate P1-Glu (Lys138/P1-Glu) are invaluable for high Glu-specificity at the P1 position, and the exchange of K138T (salt bridge interruption, from PRRSV to EAV) shifted the specificity of PRRSV 3CLpro toward P1-Gln. In turn, the T139K exchange of EAV 3CLpro showed a noticeable shift in substrate specificity, such that substrates containing P1-Glu are likely to be recognized more efficiently. These findings identify an evolutionarily accessible mechanism for disrupting or reorganizing salt bridge with only a single mutation of arterivirus 3CLpro to trigger a substrate specificity switch.
The family Arteriviridae, which is a member of the order Nidovirales, includes six subfamilies Crocarterivirinae, Equarterivirinae, Heroarterivirinae, Simarterivirinae, Variarterivirinae and Zealarterivirinae
[1]. Some arteriviruses are known as important veterinary pathogens while others infect certain species of wild rodents or African non-human primates and can only be characterized only by genome sequencing [2], [3]. Of these arteriviruses, porcine reproductive and respiratory syndrome virus (PRRSV, belongs to the subfamily Variarterivirinae) and equine arteritis virus (EAV, which belongs to the subfamily Equarterivirinae) are important etiological agents in the area of veterinary research because of the economic losses that they cause in the horse- and swine-breeding industries worldwide [4], [5], [6]. Arterivirus infections are both cytological and cause outbreaks of either acute or persistent infections in their hosts [7]. Although most EAV infections are subclinical, some virulent strains periodically cause significant diseases outbreaks and may be associated with abortions, neonatal mortality, and establishing persistent infection in stallions [8], [9]. Pigs of all ages in immunologically naïve herds are vulnerable to being infected with PRRSV [10]. Highly virulent PRRSV strains can cause severe clinical diseases in susceptible herds, which are economically important pathogens of pigs [11], [12], [13]. Currently, available drugs and vaccines do not provide adequate protection against these diseases [10], [12], [14], thus it is important to find an appropriate strategy to prevent arterivirus from proliferating in the host.Arteriviruses represent a family of positive-sense, single-stranded, enveloped RNA viruses whose viral RNA genomes range in length from about 12 to 16 kb [15], [16]. Arteriviruses represent a family of positive-sense, single-stranded, enveloped RNA viruses whose viral RNA genomes range in length from about 12 to 16 kb [7], [17]. Three-fourths of the 5′-proximal length of the genome is taken up by two large open reading frames (ORFs), ORF1a and ORF1b, that encode two long nonstructural replicase polyproteins, pp1a and pp1ab [18], [19]. Both polyproteins are autoproteolytically processed by various papain-like cysteine proteases and a 3C‐like serine protease (3CLpro) at conserved sites to generate 13–14 mature nonstructural proteins (nsps) for gene transcription and replication [20], [21], [22], [23], [24].Arterivirus 3CLpro plays an indispensable role in the life cycle of arterivirus that cleaves the polyproteins at nine conserved sites [20], [21], [25]. Considering its essential function in viral replication, arterivirus 3CLpro is a highly prospective target for anti-arterivirus drug design [26], [27], [28], [29]. In addition to viral polyprotein processing, we and others demonstrated that arterivirus 3CLpro, specifically EAV 3CLpro and PRRSV 3CLpro, cleave host NF-κB essential modulator (NEMO) at multiple sites, thereby impairing NF-κB activation and IFN-β production. Importantly, cleavage of NEMO by PRRSV 3CLpro or EAV 3CLpro led to suppression of the type I IFN signaling pathway [30], [31]. Therefore, targeting 3CLpro would hinder viral replication and could also boost host immune response [32].Many 3CLpro inhibitors for COVID-19 treatment have been proposed recently and some new drug candidates have been successful in preclinical studies [33]. Understanding the substrate recognition mechanism of 3CLpro is a key step in establishing substrate-specific inhibitors of 3CLpro. Meanwhile, substrate envelope-guided design has achieved better potency and resistance profile of 3CLpro inhibitors. The 3CLpro families of arteriviruses are highly variable among themselves and are reasonably conserved only in the active site region. Like most chymotrypsin-like serine proteases, PRRSV 3CLpro employs a canonical Ser-His-Asp triad responsible for recognizing cleavage sites with a glutamic acid (Glu, E) at the P1 position to target viral polyprotein [34], [35]. Nevertheless, EAV 3CLpro uses the same catalytic triad but recognizes both Glu and glutamine (Gln, Q) at the P1 position [20], [36]. Similar substrate preferences were obtained when arterivirus 3CLpro cleaved host protein, NEMO. Our previous study demonstrated that both EAV 3CLpro and PRRSV 3CLpro induced NEMO cleavage at the conserved residues E166, E171, and E349 at the P1 position, whereas only EAV 3CLpro could target NEMO at Q205 [30]. The substrate preferences of PRRSV 3CLpro and EAV 3CLpro cleaved host proteins might be consistent with the 3CLpro preferences for viral substrates. However, why the cleavage of NEMO at site Q205 is unique to EAV 3CLpro has not been investigated in previous studies. In this study, we characterized the key amino acids in the S1 subsite of arterivirus 3CLpro for P1 substrate specificity by molecular dynamics (MD) simulations and site-directed mutagenesis. The major differences between the S1 subsites manifest at PRRSV 3CLpro Lys138 residue, where EAV 3CLpro contains Thr139 residue. The K138T mutant of PRRSV 3CLpro converts the P1-Glu Substrate Specificity of this protease to that toward P1-Gln seen in the EAV 3CLpro. Moreover, Thr139 in EAV 3CLpro is responsible for the preference for P1-Gln, since the T139K mutant more efficiently recognizes substrates containing P1-Glu than the wild type (WT) EAV 3CLpro. Hence, we identified a potential intermolecular salt bridge linking substrate binding and P1 substrate specificity shift of arterivirus 3CLpro, which might be important when designing specific inhibitors of arterivirus 3CLpro.
Materials and methods
Sequence analysis of the PRRSV and EAV genomes
The complete PRRSV and EAV genomes were selected from the National Center for Biotechnology Information (NCBI, ) for substrate profiling of their 3CLpro. Briefly, we obtained 1,037 complete PRRSV genome and 96 complete EAV genome (obtained 1 November 2021). Nine putative cleavage sites of PRRSV 3CLpro and EAV 3CLpro were inferred from alignments of pp1ab polyprotein amongst PRRSV and EAV isolates. Details on the data set are summarized in Supporting information
Fig. S1.
Plasmids
The eukaryotic expression plasmids for PRRSV 3CLpro, EAV 3CLpro, and NEMO (E166A-E171A-E349A) were described previously [30]. PRRSV 3CLpro and EAV 3CLpro mutants were constructed by overlap extension PCR using specific mutagenic primers and cloned into the C-terminal hemagglutinin (HA) tag-encoding pCAGGS-HA-C plasmid. The luciferase-based biosensor plasmid 233D, which contained oligonucleotides corresponding to ENLYFQ↓YS (cleaved by TEV 3Cpro), were used as the biosensor control as described previously [37]. To monitor the activity of PRRSV 3CLpro and EAV 3CLpro, oligonucleotides corresponding to the amino acid sequences of substrate peptides were cloned into the 233D biosensor vector, including PRRSV-nsp3/4, EAV-nsp3/4, EAV-nsp10/11, PRRSV-nsp3/4-Q, and NEMO-Q205. All the constructs were to be validated by DNA sequencing.
Dual-luciferase assays
Human embryonic kidney (HEK293T) cells were obtained from the China Typical Culture Collection Center (Wuhan, China) and cultured at 37 °C, 5% CO2 in Dulbecco's modified Eagle medium (DMEM, Invitrogen, Madison, WI, USA), with 10% fetal bovine serum. HEK293T cells in 24-well plates were transfected with indicated luciferase biosensor plasmids and pRL-TK (Promega), along with PRRSV 3CLpro or EAV 3CLpro expression plasmids or the empty control plasmid. The pRL-TK plasmid was used as a control for transfection efficiency. Cells were lysed 28 h post-transfection and firefly luciferase and Renilla luciferase activities were determined using the dual-luciferase reporter assays (Promega, USA). The data represent relative firefly luciferase activity normalized to Renilla luciferase activity and are representative of three independently conducted experiments. All results were presented as means ± standard deviations. The p value of < 0.05 was regarded as significant. All results were analyzed using GraphPad Prism Version 9.3 ().
Western blot
HEK293T cells grown in 6-well plates were transfected with indicated plasmids. After 28 h transfection, cell lysates were prepared in RIPA buffer (Beyotime, China), quantified for protein concentrations, and subject to SDS-PAGE and immunoblotting as previously described [30]. The following antibodies were used to identify the corresponding proteins: anti-β-actin antibody (Antgene, Wuhan, China), anti-Luciferase antibody (Promega), anti-HA antibody (MBL, Nagoya, Japan), and anti-Flag antibody (Macgene, China). Expression of β-actin was detected with an anti-β-actin antibody to confirm loading of equal protein amounts. To confirm the expression levels of HA-tagged WT arterivirus 3CLpro and its mutants, an anti-HA antibody was used for immunoblotting. The expression of luciferase or Flag-tagged NEMO proteins was analyzed using anti-Luciferase antibody or an anti-Flag antibody, respectively. Following incubation with appropriate secondary antibodies, protein bands were imaged with a ChemiDoc Imaging System (Bio-Rad, USA).
Molecular dynamics simulations
Molecular dynamics simulations were performed as described previously [37]. Briefly, the PRRSV 3CLpro (PDB id: 5Y4L) and EAV 3CLpro (PDB id: 1MBM) complexed with different substrate peptides (nsp3/nsp4 auto-cleavage sequence of PRRSV 3CLpro LGSLLE↓GAFRTQ, nsp3/nsp4 auto-cleavage sequence of EAV 3CLpro GGMVFE↓GLFRSP, and nsp10/nsp11 cleavage sequence of EAV 3CLpro CGWEKQ↓SNKISC) were generated utilizing the structure of Glu-SGP in complex with a tetrapeptide (PDB id: 1HPG) as the reference template, because EAV 3CLpro and PRRSV 3CLpro exhibited a high similarity with Glu-SGP structure among all available crystal complex structure. For consistency, these crystal structures were also adopted as the template for the construction of the single mutant complexes using SYBYL-X v.2.0 (v.2.0; https://omictools.com/sybyl-x-tool). MD simulations were conducted using the Amber ff14SB force field (with the TIP3P water model) implemented in the GROMACS 2018 software package as formerly described [38], [39], [40], [41]. The distance between the edge of the water box and protein was at least 1.2 nm. Molecular systems were neutralized by the addition of a certain number of counterions sodium (Na+) or chloride (Cl-), and NaCl at 150 mM was used to mimic physiological conditions [42]. For energy minimization and relaxation of the systems, each system was energy minimized by using a 2-step, extensive energy minimization process based on the steepest descent method followed by the conjugate gradient algorithm. Following minimization, each system was gradually heated from 0 to 310 K in 500 ps, then equilibrated at that temperature for another 500 ps. The standard temperature is kept constant at 310 K, then equilibration was performed under constant pressure for 150 ns with no position restrictions on protein. During the MD simulation, the SHAKE algorithm was applied to constrain all bonds involving hydrogen atoms [43]. The temperature was controlled using the modified Berendsen thermostat (V-rescale algorithm) with a collision frequency of 0.2 ps [44]. The pressure was maintained at 1 bar using the Parrinello-Rahman barostat with a compressibility of 4.5 × 10−5 bar−1 and a coupling constant of 2.0 ps. The non-bonded cutoff distances and van der Waals (vdW) interaction were 15 Å, and the Particle Mesh Ewald (PME) method was utilized to treat long-range electrostatic interactions [45]. A 2 fs step was applied to calculate the motion equations using the Leap-Frog integrator for equilibration steps [43]. The coordinates of the atoms were taken at every 10 ps and used for the final analysis. The backbone root mean square deviations (RMSD) values, root mean square fluctuation (RMSF) values, vdW interactions and electrostatic interactions were calculated using the GROMACS package. In addition, the RMSD values suggest that the different conformations remain stable during 150 ns simulation times (Fig. S2). The 3D-structure of proteins and peptides was visualized using PyMOL software Version 2.5 (Schrödinger, LLC) [46].
Hydrogen bond calculations
Hydrogen bonds (H-Bonds) were identified based on donor–acceptor distance and donor–acceptor angle (X···Y distance < 3.5 Å and X-H···Y angle < 40°), whereas only the pairs of polar atoms belonging to different hydrophobic groups were involved. The H-Bonds Plugin from Visual Molecular Dynamics was used to perform hydrogen bond analyses for each complex. H-bond occupancy indicated the percentage of indicated H-bonds that are maintained throughout the MD simulation. The data represented all H-bonds with occupancy greater than 10% between P1 residue of the substrate peptides and S1 subsite of WT arterivirus 3CLpro.
Clustering of MD trajectories using principal component analysis
Principal component analysis (PCA) was used to determine the conformational changes in protein movement by calculation and diagonalization of the covariance matrix of the C-alpha (Cα) atoms. The calculated orthogonal vectors or eigenvectors with the highest eigenvalues are named principal components (PCs). PCA was performed using the Bio3d package (v2.4.1; ) in R environment (v4.0.4; https://mirror.fcaglp.unlp.edu.ar/CRAN) [47], [48]. Trajectories are obtained from dynamic simulation. The 100 ns MD trajectory consists of 10,000 frames after equilibration. One conformation out of ten is sampled. By sampling each conformation, a single long trajectory is obtained, and thus it represents 10% of the 10,000 frames MD trajectory. To determine the internal motions of a protein, alignment is necessary before building the C-matrix using Cartesian coordinates. Cα atoms were selected for analysis and in each frame were aligned by removing the overall translation and rotation, and then corresponding PCs were calculated under default parameters using the Bio3d package. The predominant movements during the simulation were plotted using these PCs.
Results
Different substrates preference at the P1 position between PRRSV 3CLpro and EAV 3CLpro
To clarify the substrate specificity of arterivirus 3CLpro, we obtained 1,364 complete genome sequences of 25 different species of arteriviruses from the National Center for Biotechnology Information (NCBI). PRRSV and EAV, as two well-characterized arteriviruses, contributed most of the complete genome sequences of arteriviruses (1037 and 96, respectively). Thus, we selected PRRSV and EAV as representatives to investigate the substrates preference of arterivirus 3CLpro. Based on previous studies [[34], [35]], nine putative cleavage sites of PRRSV 3CLpro and EAV 3CLpro were predicted based on alignments of pp1ab polyproteins amongst all PRRSV and EAV isolates (Fig. 1a-b). The sequence logos of PRRSV 3CLpro and EAV 3CLpro substrates show considerable similarities, but there is a subtle difference at the P1 position. Consistent with previous studies, PRRSV 3CLpro confers an extremely high specificity for the P1 residue of Glu in the substrates. Interestingly, while Glu is frequently found specific to the P1 position in EAV 3CLpro, Gln at P1 appears in one of 9 cleavage sites (nsp10/nsp11) [21]. This cleavage site (nsp10/nsp11) with the preference of Gln at the P1 position is well conserved amongst all EAV isolates (Fig. S1). In addition, the P1ʹ position, on the other hand, is mainly occupied by various small-sized residues such as Gly, Ser and Ala, among which Gly is the most frequent one [21], [22], [34]. The residue preference at the remaining Pn's residues of PRRSV 3CLpro and EAV 3CLpro is less demanding. These results revealed that the residue specificity of the P1 position varies in the substrates for PRRSV 3CLpro and EAV 3CLpro.
Fig. 1
Sequence logos of the polyprotein junctions cleaved by PRRSV 3CLpro and EAV 3CLpro from different isolates. (a) Schematic diagrams of the PRRSV and EAV polyproteins. The PRRSV 3CLpro and EAV 3CLpro encoded by nsp4 are marked by orange rectangular; all the 3CLpro-mediated cleavage sites in polyproteins are shown in the orange triangle. (b) Conservation of the polyprotein junctions cleaved by PRRSV 3CLpro from 1037 PRRSV strains and EAV 3CLpro from 96 EAV strains. Amino acid sequence logos of the substrates were created using PSSMSearch (http://slim.icr.ac.uk/pssmsearch/), and the height of the letters represented the relative frequency of the amino acid. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Sequence logos of the polyprotein junctions cleaved by PRRSV 3CLpro and EAV 3CLpro from different isolates. (a) Schematic diagrams of the PRRSV and EAV polyproteins. The PRRSV 3CLpro and EAV 3CLpro encoded by nsp4 are marked by orange rectangular; all the 3CLpro-mediated cleavage sites in polyproteins are shown in the orange triangle. (b) Conservation of the polyprotein junctions cleaved by PRRSV 3CLpro from 1037 PRRSV strains and EAV 3CLpro from 96 EAV strains. Amino acid sequence logos of the substrates were created using PSSMSearch (http://slim.icr.ac.uk/pssmsearch/), and the height of the letters represented the relative frequency of the amino acid. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Structure comparison of S1 subsites in arterivirus 3CLpro using molecular dynamics simulation
Viral 3CLpro is broadly believed to undertake conformational changes after substrate binding. These changes allow them to interact with substrate proteins and to initiate catalytic activity [45]. To further determine whether structural changes in arterivirus 3CLpro in response to substrate binding and hence potentially regulate arterivirus 3CLpro activity, we performed molecular dynamics analysis of apo and substrate-bound arterivirus 3CLpro. As shown in Fig. 2a, the 3D structure of PRRSV 3CLpro is remarkably like that of EAV 3CLpro, with the corresponding RMSD of alignment of 1.653 Å, even though PRRSV 3CLpro has low amino acid sequence homology with EAV 3CLpro (about 37%). Both PRRSV 3CLpro and EAV 3CLpro consists of a chymotrypsin-like domains fold (domains I and II) at the N-terminus, an extra C-terminal domain III, and a lengthy loop linking domains II and III [23], [34]. A shallow-surfaced gap between domains I and II, to which 6 and 7 antiparallel β sheets from two domains contribute, forms the native substrate-binding site including the catalytic Ser-His-Asp triad and the Gly-Ser oxyanion hole [23]. Surprisingly, despite their high structural similarity, the S1 subsites of PRRSV 3CLpro and EAV 3CLpro showed significantly different amino acid residue compositions. In detail, the S1 subsite in the PRRSV 3CLpro is mainly composed of Thr113/Cys115/Gly116/His133/Ser136/Lys138, while that in EAV 3CLpro is constituted by Thr115/Thr116/Ser117/Gly118/His134/Ser137 (Fig. 2b). These residues are highly conserved in PRRSV 3CLpro and EAV 3CLpro, respectively, suggesting these residues might involve the P1 substrate specificity of arterivirus 3CLpro (Fig. 2c). As expected, the P1 residue, Glu or Gln, engaged in a highly similar interaction network with the S1 sites of the 3CLpro active site in PRRSV and EAV, including a fully conserved network of H-bonds between the P1-Glu/Gln and Thr113, Gly116 and His133 of 3CLpro (all amino acid positions are described using PRRSV 3CLpro numbering). However, we observed several significant differences in PRRSV 3CLpro and EAV 3CLpro complexes that highlight active site plasticity. In 3CLpro/P1-Glu complexes, the Ser136 side-chain participates in H-bonds with the P1-Glu. In contrast, the Ser137 side-chain points away from the P1-Gln side-chain in EAV 3CLpro/P1-Gln structure. The second region of variability is at 138 positions. The substrate P1-Glu side chain was stabilized via a salt bridge interaction to the PRRSV 3CLpro Lys138 residue. However, in EAV 3CLpro structures, Thr139 is on the outside of the S1 subsite, regardless of the different P1-Glu and P1-Gln conformations (Fig. 2b).
Fig. 2
Structure comparison of the S1 subsites in PRRSV 3CLpro and EAV 3CLpro. (a) The overall structures of PRRSV 3CLpro (PDB id: 5Y4L) and EAV 3CLpro (PDB id: 1MBM) and details of their catalytic centers. The surface view of PRRSV 3CLpro and EAV 3CLpro comprised three domains (domain I marked in green, domain II marked in blue, and domain III marked in yellow). The canonical Ser-His-Asp triad responsible for recognizing cleavage sites was presented in the middle of the image, which is located in the cleft between domains I and II. (b) The S1 subsites of PRRSV 3CLpro (grey) and EAV 3CLpro (orange) in complex with its substrates are shown as surface and cartoon, respectively. The amino acid residues composited the S1 subsite and the catalytic triad Gly116-His39-Ser118 (PRRSV 3CLpro numbering is used) are displayed as sticks. Enlarged view of residues Ser136 and Lys138 in PRRSV 3CLpro and residues Ser137 and Thr139 in EAV 3CLpro. Based on the structure of arterivirus 3CLpro (PRRSV, PDB id: 5Y4L; EAV, PDB id: 1MBM), the molding structure of the 3CLpro-substrate complex was generated. (c) Sequence alignment for the S1 subsites of the PRRSV 3CLpro and EAV 3CLpro amongst all PRRSV and EAV isolates. Sequence logos were generated using PSSMSearch (). (d) RMSF analysis of loop S136-G140 in apo and substrate-bound arterivirus 3CLpro. Per residue RMSF of Cα atoms in loop S136-G140 of apo PRRSV 3CLpro (black), apo EAV 3CLpro (red), and substrate-bound PRRSV 3CLpro (green) from MD trajectories. The sequences were aligned, and the PRRSV 3CLpro residue numbering is used for the x-axis. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Structure comparison of the S1 subsites in PRRSV 3CLpro and EAV 3CLpro. (a) The overall structures of PRRSV 3CLpro (PDB id: 5Y4L) and EAV 3CLpro (PDB id: 1MBM) and details of their catalytic centers. The surface view of PRRSV 3CLpro and EAV 3CLpro comprised three domains (domain I marked in green, domain II marked in blue, and domain III marked in yellow). The canonical Ser-His-Asp triad responsible for recognizing cleavage sites was presented in the middle of the image, which is located in the cleft between domains I and II. (b) The S1 subsites of PRRSV 3CLpro (grey) and EAV 3CLpro (orange) in complex with its substrates are shown as surface and cartoon, respectively. The amino acid residues composited the S1 subsite and the catalytic triad Gly116-His39-Ser118 (PRRSV 3CLpro numbering is used) are displayed as sticks. Enlarged view of residues Ser136 and Lys138 in PRRSV 3CLpro and residues Ser137 and Thr139 in EAV 3CLpro. Based on the structure of arterivirus 3CLpro (PRRSV, PDB id: 5Y4L; EAV, PDB id: 1MBM), the molding structure of the 3CLpro-substrate complex was generated. (c) Sequence alignment for the S1 subsites of the PRRSV 3CLpro and EAV 3CLpro amongst all PRRSV and EAV isolates. Sequence logos were generated using PSSMSearch (). (d) RMSF analysis of loop S136-G140 in apo and substrate-bound arterivirus 3CLpro. Per residue RMSF of Cα atoms in loop S136-G140 of apo PRRSV 3CLpro (black), apo EAV 3CLpro (red), and substrate-bound PRRSV 3CLpro (green) from MD trajectories. The sequences were aligned, and the PRRSV 3CLpro residue numbering is used for the x-axis. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)To further examine the flexibility of the S1 subsites in 3CLpro, we concentrated on the movements of loops enveloping the S1 subsites by RMSF analysis. We found that one of the analyzed loops of apo PRRSV 3CLpro, loop S136-G140, lined a wall of the S1 subsite and was more flexible than the corresponding loops of apo EAV 3CLpro structure, while the neighbouring loops had little flexibility (Fig. 2d). This might be deduced indirectly from the lack of the S136-G140 loop in the PRRSV 3CLpro crystallographic structure in our and others' previous studies [49]. Surprisingly, such flexibility reduced substrate-bound PRRSV 3CLpro, suggesting that the presence of a substrate could stabilize the loops around the active site (Fig. 2d). B-factors analysis of all deposited arterivirus 3CLpro crystal structures completely verified these conclusions. It is worth adding that the residues on this loop differ greatly in 138 and 139 positions between PRRSV 3CLpro and EAV 3CLpro (Fig. S3), suggesting that these two amino acids in this loop are likely to be involved in the P1 substrate specificity switch of arterivirus 3CLpro.To understand the essential motion changes of apo and substrate-bound conformations of arterivirus 3CLpro, PCA was performed on MD trajectories. For the apo and substrate-bound forms of PRRSV 3CLpro, the first two eigenvectors (EVs) contributed 33.66% and 56.6% of the global motion, respectively (Fig. S4). These results showed that the eigenvalues of the apo conformation were lower than those of the substrate-bound form. Furthermore, the scatter plots generated by the apo and substrate-bound forms of PRRSV 3CLpro indicated a significant difference between the two systems, which also suggested that the apo form of PRRSV 3CLpro showed less dynamic tertiary structure conformation than the substrate-bound form. Similar motion changes were obtained from MD trajectories of the apo and substrate-bound forms of EAV 3CLpro, suggesting that the structural rearrangement by substrate binding might contribute to the increased essential motion in the substrate-bound form of arterivirus 3CLpro.
Establishment of a biosensor assay for evaluating the protease − substrate specificity of arterivirus 3CLproin vivo
The traditional approach to studying protease-substrate specificity is usually performed through in vitro proteolytic reactions with a purified protease and its predicted peptide or protein substrates in a fluid buffer. In vitro characterization is always labour- and time-demanding, and it might not accurately reflect the physiological characteristics of proteases. Here, we have established a fast, reliable, and efficient luciferase-based biosensor to quantify the substrate specificity of arterivirus 3CLpro
in vivo, which has been used in previous studies to identify cleavage sites recognized by viral protease [37], [50], [51], [52]. In the biosensor assay, the P6-P6′ positions of the candidate recognition regions of arterivirus 3CLpro were inserted into the cycling luciferase biosensor (Fig. 3a). The PRRSV 3CLpro N-terminal auto-cleavage sequence (named PRRSV-nsp3/4, with P1-Glu preference), EAV 3CLpro N-terminal auto-cleavage sequence (named EAV-nsp3/4, with P1-Glu preference) and EAV nsp10/nsp11 cleavage sequence (named EAV-nsp10/11, with P1-Gln preference) were tested, while the Tobacco Etch Virus (TEV) protease recognition sequence ENLYFQYS (named 233D) was used as a negative control. In principle, arterivirus 3CLpro effectively cleaves its favored substrates between residues P1 and P1′, leading to an interaction of the two firefly luciferase domains, resulting in an active form of the luciferase. Conversely, the unpreferred substrates will not be efficiently cleaved by arterivirus 3CLpro, resulting in a non-active form of the luciferase. As shown in Fig. 3b, the luciferase activity of PRRSV-nsp3/4 was markedly induced in cells cotransfected with PRRSV 3CLpro, while no activity was detected in negative controls (233D). Moreover, the luciferase activity of both EAV-nsp3/4 and EAV-nsp10/11 was also induced in EAV 3CLpro-cotransfected cells. Interestingly, no activity of EAV-nsp10/11 was detected in PRRSV 3CLpro-cotransfected cells, suggesting that the substrate with P1-Gln was a nonpreferred substrate for PRRSV 3CLpro, which is consistent with previous studies on the P1 substrate specificity of EAV 3CLpro and PRRSV 3CLpro
[21]. Western blotting also showed that PRRSV 3CLpro/EAV 3CLpro could split the recombinant firefly luciferase with the preferred cleavage sequence, resulting in a more rapidly migrating protein band (Fig. 3c). Consistency of cleavage and fold induction of the luciferase supports that there is a correspondence between the luciferase activity assay and cleavage of the biosensor construct by arterivirus 3CLpro, pointing to its possible use in evaluating the protease-substrate specificity of arterivirus 3CLpro
in vivo.
Fig. 3
Exploiting the biosensor assay to evaluate the protease-substrate specificity of arterivirus 3CLproin vivo. (a) Diagram of the generation of luciferase-based biosensors. The purple and the orange structures represent the recombined firefly luciferase. The green rectangle indicates the different recognition sequences that were used to evaluate the activity of PRRSV 3CLpro or EAV 3CLpro. (b and c) HEK293T cells in 24-well plates were transfected with the pRL-TK plasmid, the plasmid encoding PRRSV 3CLproor EAV 3CLpro, and corresponding luciferase-based biosensors plasmids, or their control 233D. After 28 h transfection, cell lysates were prepared and analyzed by dual-luciferase assays (b) and western blotting (c). ns, not significant. ***p < 0.001. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Exploiting the biosensor assay to evaluate the protease-substrate specificity of arterivirus 3CLproin vivo. (a) Diagram of the generation of luciferase-based biosensors. The purple and the orange structures represent the recombined firefly luciferase. The green rectangle indicates the different recognition sequences that were used to evaluate the activity of PRRSV 3CLpro or EAV 3CLpro. (b and c) HEK293T cells in 24-well plates were transfected with the pRL-TK plasmid, the plasmid encoding PRRSV 3CLproor EAV 3CLpro, and corresponding luciferase-based biosensors plasmids, or their control 233D. After 28 h transfection, cell lysates were prepared and analyzed by dual-luciferase assays (b) and western blotting (c). ns, not significant. ***p < 0.001. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Molecular determinants of arterivirus 3CLpro substrate specificity
Arterivirus 3CLpro employs a conserved Ser-His-Asp triad as the principal nucleophile and the general acid-base catalyst [23], [34], [53], which is important for substrate binding and hydrolysis. We considered the possibility that the additional amino acid residues of S1 subsites are also involved in arterivirus 3CLpro activity and P1 substrate preference. To this end, several amino acids in the S1 subsites of PRRSV 3CLpro and EAV 3CLpro were mutated to alanine and assessed the P1 substrate preference of all mutants using three cyclized luciferase-based biosensors (contained Glu or Gln at the P1 position to enable assessment of substrate preference at this position). As shown in Fig. 4a, the putative residues in S1 subsite of PRRSV 3CLpro, Thr113, Cys115, G116, His133, Ser136, and Lys138, were mutated to Ala and tested with a PRRSV-nsp3/4 biosensor (with P1-Glu). In contrast to WT PRRSV 3CLpro, the mutants (T113A, C115A, H133A, K138A) significantly less-efficiently cleaved substrate containing Glu at P1, but the mutants (G116A, S136A) almost completely abolished the ability of PRRSV 3CLpro to cleave substrates having P1-Glu. Using an EAV-nsp3/4 biosensor (with P1-Glu), we found that the mutants of EAV 3CLpro retained (T116A, S117A), significantly reduced (T115A, H134) or eliminated (G118A, S137A) the ability to cleave substrate containing Glu at P1 (Fig. 4b). These data suggest that both G116/H133/S136 of PRRSV 3CLpro and G118/H134/S137 of EAV 3CLpro are important for recognition of P1-Glu. Interestingly, the mutants G118A and H134A of EAV 3CLpro lost P1-Gln cleavage activity, but the mutant S137A retained this ability cleaving P1-Gln substrate as did WT EAV 3CLpro (Fig. 4c), suggesting that although Ser137 is important for its ability to cleave substrates having P1-Glu, this residue is nonessential for recognition of P1-Gln.
Fig. 4
Identifying the amino acid residues involved in the S1 subsites of arterivirus 3CLpro. (a-c) HEK293T cells were cotransfected with the indicated plasmids encoding WT or mutants of arterivirus 3CLpro, pRL-TK and corresponding biosensor plasmids (a. PRRSV-nsp3/4 biosensor (with P1-Glu); b. EAV-nsp3/4 biosensor (with P1-Glu); c. EAV-nsp10/11 biosensor (with P1-Gln)). Dual-luciferase assays and western blotting were performed at 28 h after the transfection. (d-f) Heatmaps of the variation in H-bond occupancy between P1 residue and S1 subsite of WT or mutants of arterivirus 3CLpro. All H-bonds between S1 subsite and P1 residue occurring with an occupancy rate of at least 10% during MD simulations were investigated. Heatmaps were plotted using GraphPad Prism Version 9.3 (). The first one or two characters of the atom name of H-bond comprise the chemical symbol for the atom type. All the atom names beginning with C are carbon atoms, N shows nitrogen and O shows oxygen. The next character is the remoteness indicator code, which follows the Greek alphabet, starting with the alpha carbon (A) and moving on to beta (B), gamma (G), delta (D), epsilon (E), zeta (Z), and eta (H). For example, P1E-OE1 represents the first epsilon oxygen of glutamic acid (E) at the P1 position. Schematic of PRRSV 3CLpro and EAV 3CLpro S1 subsites and their interactions with the preferred P1 residue. PRRSV 3CLpro has a conserved K138 residue at the base of S1 subsite and formed a salt bridge interaction between the substrate P1-E (red). The S136/S137 side-chain participates of PRRSV/EAV 3CLpro in H-bonds with the P1-Glu, whereas the S137 side-chain of EAV 3CLpro points away from the P1-Gln side-chain. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Identifying the amino acid residues involved in the S1 subsites of arterivirus 3CLpro. (a-c) HEK293T cells were cotransfected with the indicated plasmids encoding WT or mutants of arterivirus 3CLpro, pRL-TK and corresponding biosensor plasmids (a. PRRSV-nsp3/4 biosensor (with P1-Glu); b. EAV-nsp3/4 biosensor (with P1-Glu); c. EAV-nsp10/11 biosensor (with P1-Gln)). Dual-luciferase assays and western blotting were performed at 28 h after the transfection. (d-f) Heatmaps of the variation in H-bond occupancy between P1 residue and S1 subsite of WT or mutants of arterivirus 3CLpro. All H-bonds between S1 subsite and P1 residue occurring with an occupancy rate of at least 10% during MD simulations were investigated. Heatmaps were plotted using GraphPad Prism Version 9.3 (). The first one or two characters of the atom name of H-bond comprise the chemical symbol for the atom type. All the atom names beginning with C are carbon atoms, N shows nitrogen and O shows oxygen. The next character is the remoteness indicator code, which follows the Greek alphabet, starting with the alpha carbon (A) and moving on to beta (B), gamma (G), delta (D), epsilon (E), zeta (Z), and eta (H). For example, P1E-OE1 represents the first epsilon oxygen of glutamic acid (E) at the P1 position. Schematic of PRRSV 3CLpro and EAV 3CLpro S1 subsites and their interactions with the preferred P1 residue. PRRSV 3CLpro has a conserved K138 residue at the base of S1 subsite and formed a salt bridge interaction between the substrate P1-E (red). The S136/S137 side-chain participates of PRRSV/EAV 3CLpro in H-bonds with the P1-Glu, whereas the S137 side-chain of EAV 3CLpro points away from the P1-Gln side-chain. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)To further investigate the underlying molecular mechanism of the alanine scan results, we performed MD simulations used by different PRRSV 3CLpro and EAV 3CLpro mutants with substrates. The RMSD values of the Cα atoms for each complex converged and remained stable during 150 ns MD simulations. In the last 100 ns of simulations, we calculated the selected H-bond (occupancy greater than 10%) during MD simulations to better capture the electrostatic interactions between the substrate and the protease. Gly116/Gly118 in PRRSV 3CLpro/EAV 3CLpro mainchain amides constitute the oxyanion hole by providing one H-bond to the cleavable P1-Glu and P1-Gln carbonyl oxygen that stabilizes the resulting negative charge during covalent catalysis. As expected, both G116A mutant in PRRSV 3CLpro and G118A mutant in EAV 3CLpro disrupted the engagement of H-bond network around the active sites (Fig. 4d-f), which also eliminated the ability to cleave the substrate (Fig. 4a-c). Moreover, His133/His134 and Thr113/Thr115 in PRRSV 3CLpro/EAV 3CLpro make H-bonds with the invariant side chain of P1-Glu and P1-Gln. Indeed, we found that alanine substitution of each of these two residues significantly reduced the H-bond occupancy between arterivirus 3CLpro and substrates. These conserved H-bond interactions between arterivirus 3CLpro and substrates provide an explanation for the reduced catalytic efficiency of the mutants (H133A/H134A, T113A/T115A in PRRSV 3CLpro/EAV 3CLpro) for both P1-Glu and P1-Gln substrates. By contrast, alanine mutation of other residues that cannot form stable H-bonds with the P1 substrate, such as C115 in PRRVS 3CLpro and T116/S117 in EAV 3CLpro, had less or no significant effect on P1-Glu/P1-Gln cleavage activity. Importantly, Ser136/Ser137 in PRRSV 3CLpro /EAV 3CLpro contacts P1-Glu, but not P1-Gln, through two conserved H-bonds (S136/S137-OG(Side)…P1E-OE1(Side) and S136/S137-OG (Side)…P1E-OE2 (Side)). S136A mutant in PRRSV 3CLpro and S137A mutant in EAV 3CLpro caused a loss of H-bonds between side chains of S136/S137 and the P1-Glu (Fig. 4d column 1 vs 6, Fig. 4e column 1 vs 7), whereas S137A mutant in EAV 3CLpro had no significant effect on H-bond interactions toward P1-Gln (Fig. 4f column 1 vs 7). Further confirming this notion, the S137A mutant in EAV 3CLpro lost the ability in cleaving substrate containing P1-Glu but had a similar cleavage activity toward P1-Gln as did WT EAV 3CLpro (Fig. 4b-c). These results suggested that substrate interactions with the Ser136/Ser137 side chains in PRRSV 3CLpro /EAV 3CLpro distinguish P1-Glu and P1-Gln recognition.Interestingly, in the flexible loop S136-G140 of PRRSV 3CLpro, a salt bridge Lys138-P1-Glu interaction is observed in PRRSV 3CLpro-P1-Glu complex and contributes to binding P1-Glu of the substrate, which might explain why the K138A mutant lost the ability to cleave substrate with P1-Glu. Surprisingly, neither EAV 3CLpro-P1-Glu nor EAV 3CLpro-P1-Gln complexes show the salt bridge observed with Lys138-P1-Glu in the PRRSV 3CLpro substrate complexes. Considering that PRRSV 3CLpro exhibited P1 preference only toward Glu and EAV had both P1-Glu and P1-Gln preference, we speculate that the salt bridge might lead to a P1substrate specificity shift from P1-Glu/Gln to P1-Glu preference during the evolution of arterivirus 3CLpro.
Identification of an intermolecular salt bridge linking P1 substrate specificity shift of arterivirus 3CLpro
To further test whether the Lys138-P1-Glu salt bridge links the P1 substrate specificity shift of arterivirus 3CLpro, we attempted to convert P1 substrate specificity of PRRSV 3CLpro to that of EAV 3CLpro. The only amino acid distinction in the binding site region between PRRSV 3CLpro and EAV 3CLpro is in position 138 at the loop near the active site, where Lys is found in PRRSV 3CLpro and Thr in EAV 3CLpro. We generated the T139K mutation in EAV 3CLpro, which we predicted would form a salt bridge linking P1-Glu of the substrate. As shown in Fig. 5a, the T139K mutant of EAV 3CLpro revealed a significant shift in substrate specificity, recognizing substrates containing Glu at P1 more efficiently. Gln- containing substrates were still hydrolyzed, but with much less relative efficiency as compared with the WT EAV 3CLpro (Fig. 5b). Next, we tested the opposite mutations in PRRSV 3CLpro to determine whether mutant PRRSV 3CLpro would behave like EAV 3CLpro. The K138T mutation of PRRSV 3CLpro significantly reduced the P1 substrate specificity toward Glu (Fig. 5c). Simultaneously, the mutant gained the ability to cleave Gln-containing substrates efficiently, an activity that is almost absent in the wild type PRRSV 3CLpro. Interestingly, the K138T mutant exhibits higher expression levels compared to wild-type PRRSV 3CLpro (Fig. 5d). To exclude the possibility that the gained cleavage ability is due to higher expression of the K138T mutant, the experiments of dose–response analysis were performed. Dual-luciferase reporter assay and western blot assay demonstrated that the gained cleavage ability of K138T to cleave Gln-containing substrates is not due to high expression levels of the mutant (Fig. S5).
Fig. 5
Identifying an intermolecular salt bridge linking P1 substrate specificity shift of arterivirus 3CLpro. (a-b) HEK293T cells were transfected with plasmid encoding WT or the mutant T139K of EAV 3CLpro (salt bridge formation), pRL-TK and corresponding luciferase-based biosensors plasmids. The cells were harvested after 28 h transfection and analyzed by dual-luciferase assays and western blotting. (c-d) HEK293T cells were cotransfected with a plasmid encoding PRRSV 3CLpro WT or the mutant K138T (salt bridge disruption), pRL-TK and corresponding luciferase-based biosensors plasmids. The cells were harvested after 28 h transfection and analyzed by dual-luciferase assays and western blotting. *p < 0.05; ***p < 0.001
Identifying an intermolecular salt bridge linking P1 substrate specificity shift of arterivirus 3CLpro. (a-b) HEK293T cells were transfected with plasmid encoding WT or the mutant T139K of EAV 3CLpro (salt bridge formation), pRL-TK and corresponding luciferase-based biosensors plasmids. The cells were harvested after 28 h transfection and analyzed by dual-luciferase assays and western blotting. (c-d) HEK293T cells were cotransfected with a plasmid encoding PRRSV 3CLpro WT or the mutant K138T (salt bridge disruption), pRL-TK and corresponding luciferase-based biosensors plasmids. The cells were harvested after 28 h transfection and analyzed by dual-luciferase assays and western blotting. *p < 0.05; ***p < 0.001Considering that Lys is a large amino acid with a positively charged chain, while Thr is a small amino acid with a polar uncharged side chain, we speculate that electrostatic or vdW might play an important role in P1 substrate specificity shift. To this end, we calculated and compared the electrostatic or vdW energies among arterivirus 3CLpro structures with or without the potential intermolecular salt bridge. As shown in Fig. 6, the dominant binding attractions were determined by several residues in S1 subsites. In contrast with vdW, the electrostatic energy was more favorable for binding (Fig. 6a). As expected, the T139K mutant of EAV 3CLpro formed an intermolecular salt bridge with P1-Glu, producing a strong electrostatic interaction (about −82.83 kJ/mol), whereas K138T mutant of PRRSV 3CLpro significantly reduced the electrostatic force due to the loss of the salt bridge between K138 and substrate P1-Glu (Lys138/P1-Glu) (Fig. 6b). Together, molecular dynamics simulations further show that this salt bridge is particularly critical for P1 substrate specificity of arterivirus 3CLpro via electrostatic forces and that the acquisition of this salt bridge leads to a shift of arterivirus 3CLpro from P1-Gln to P1-Glu preference.
Fig. 6
Electrostatic and vdW interaction energies between P1 residue and S1 subsite of WT arterivirus 3CLpro or its mutants. (a) Heatmaps represent the different binding free energies between P1 residue and S1 subsite of WT arterivirus 3CLpro or its mutants, and the contributions from two individual energetic components: electrostatics (E) and van der Waals (E) energy. Cooler (blue) and warmer (red) colors indicate higher or lower energies contributions to the interaction between P1 residue and S1 subsite, respectively. Heatmaps were plotted using GraphPad Prism Version 9.3 (). (b) Comparison of energies surface diagrams of WT arterivirus 3CLpro and its mutants from MD simulations. The surface representation of the S1 subsite is colored according to the energy of the respective residue positions. Energetically favorable residues are shown as red, neutral as yellow, and unfavorable as green. The P6-P6′ positions of substrate are displayed as a white cartoon. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Electrostatic and vdW interaction energies between P1 residue and S1 subsite of WT arterivirus 3CLpro or its mutants. (a) Heatmaps represent the different binding free energies between P1 residue and S1 subsite of WT arterivirus 3CLpro or its mutants, and the contributions from two individual energetic components: electrostatics (E) and van der Waals (E) energy. Cooler (blue) and warmer (red) colors indicate higher or lower energies contributions to the interaction between P1 residue and S1 subsite, respectively. Heatmaps were plotted using GraphPad Prism Version 9.3 (). (b) Comparison of energies surface diagrams of WT arterivirus 3CLpro and its mutants from MD simulations. The surface representation of the S1 subsite is colored according to the energy of the respective residue positions. Energetically favorable residues are shown as red, neutral as yellow, and unfavorable as green. The P6-P6′ positions of substrate are displayed as a white cartoon. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Evolutionary mechanism of salt-bridge-triggered P1 substrate specificity switch of arterivirus 3CLpro
To exclude the effect of different substrates on the above findings, two different substrates with Gln at P1 position were selected and tested whether the salt bridge disruption of PRRSV 3CLpro leads to the P1 substrate specificity switch. First, we constructed a mutant of PRRSV-nsp3/4 biosensor in which the Glu at P1 position was replaced with Gln and examined their cleavage by PRRSV 3CLpro. As expected, the luciferase activity of P1-E → Q substitution of PRRSV-nsp3/4 biosensor was markedly induced in the K138T mutant of PRRSV 3CLpro-cotransfected cells, whereas no activity was detected in wild type of PRRSV 3CLpro-cotransfected cells (Fig. 7a).
Fig. 7
Evolutionary mechanism of salt-bridge-triggered P1 substrate specificity switch of PRRSV 3CLpro. (a) HEK293T cells were cotransfected with PRRSV-nsp3/4-Q biosensor plasmid, pRL-TK and a plasmid encoding PRRSV 3CLpro WT or the mutant K138T. After 28 h transfection, cell lysates were prepared and analyzed by dual-luciferase assays and western blotting. **p < 0.01. (b) HEK293T cells were cotransfected with NEMO-Q205 biosensor plasmid, pRL-TK and a plasmid encoding PRRSV 3CLpro WT or the mutant K138T. Cells lysates were processed as described above for panel A. ***p < 0.001. (c) HEK293T cells were cotransfected with PRRSV 3CLpro WT or the mutant K138T, along with Flag-tagged NEMO (E166A-E171A-E349A). Cell lysates were prepared 30 h post transfection and analyzed by western blotting. (d-e) HEK293T cells were cotransfected with plasmids encoding WT PRRSV 3CLpro or its mutants, pRL-TK and corresponding biosensor plasmid. After 28 h transfection, cell lysates were prepared and analyzed by dual-luciferase assays and western blotting.
Evolutionary mechanism of salt-bridge-triggered P1 substrate specificity switch of PRRSV 3CLpro. (a) HEK293T cells were cotransfected with PRRSV-nsp3/4-Q biosensor plasmid, pRL-TK and a plasmid encoding PRRSV 3CLpro WT or the mutant K138T. After 28 h transfection, cell lysates were prepared and analyzed by dual-luciferase assays and western blotting. **p < 0.01. (b) HEK293T cells were cotransfected with NEMO-Q205 biosensor plasmid, pRL-TK and a plasmid encoding PRRSV 3CLpro WT or the mutant K138T. Cells lysates were processed as described above for panel A. ***p < 0.001. (c) HEK293T cells were cotransfected with PRRSV 3CLpro WT or the mutant K138T, along with Flag-tagged NEMO (E166A-E171A-E349A). Cell lysates were prepared 30 h post transfection and analyzed by western blotting. (d-e) HEK293T cells were cotransfected with plasmids encoding WT PRRSV 3CLpro or its mutants, pRL-TK and corresponding biosensor plasmid. After 28 h transfection, cell lysates were prepared and analyzed by dual-luciferase assays and western blotting.Considering that arterivirus 3CLpro cleaves not only viral polyproteins but also host proteins to facilitate the various steps of the virus infection cycle [30]. Our previous study found that EAV 3CLpro, but not PRRSV 3CLpro, cleaved host protein NEMO at Q205 in the P1 position [30]. We then chose NEMO as a representative host protein to study the effect of salt bridge disruption of PRRSV 3CLpro on the cleavage of the host protein. In the NEMO-Q205 biosensor assay, the P6-P6′ positions of EAV 3CLpro recognition regions around NEMO-Q205 were introduced into the cyclized luciferase biosensor (Fig. 3a). Consistent with our previous findings, wild type of PRRSV 3CLpro did not induce the luciferase activity of NEMO-Q205 biosensor, whereas such activity was significantly induced in the K138T mutant of PRRSV 3CLpro cotransfected cells (Fig. 7b). As further evidence supporting this result, western blotting also revealed that the K138T mutant of PRRSV 3CLpro was able to cleave NEMO at Q205, producing an expected faster migrating protein band that was detected by an anti-Flag antibody and had molecular masses of approximately 22 kDa. Presumably, this band is NEMO cleavage product, NEMO-(1–205), mediated by the K138T mutant. As we predicted, no cleavage products were observed for WT PRRSV 3CLpro (Fig. 7c). These data suggested that the salt bridge disruption of PRRSV 3CLpro led to a substrate preference shift of arterivirus 3CLpro from P1-Glu to P1-Gln.To further investigate the importance of the residue K138 in the salt bridge, two additional mutants (K138L and K138M, mutants with hydrophobic side chain) of PRRSV 3CLpro were constructed. As shown in Fig. 7d-e, these two mutants (K138L and K138M) also gained the ability to cleave the substrate with P1-Gln, while significantly losing the ability to cleave the substrate with P1-Glu, as did the mutants K138A and K138T. Together, these findings identify an evolutionarily accessible mechanism for disrupting or reorganizing salt bridge with only a single mutation of arterivirus 3CLpro to trigger a substrate specificity shift.
Discussion
In most positive-sense, single-stranded RNA viruses, viral 3CLpro or 3C proteases (3Cpro) cleaved the viral polyprotein at multiple conserved sites and are engaged in translational processing of the viral non-structural proteins [54], [55], [56], [57], [58], [59]. The hydrolysis sites of 3Cpro/3CLpro are broadly resembling and usually include a Gln or Glu residue at the P1 position along with a small amino acid residue downstream [34], [60], [61]. For arterivirus 3CLpro, EAV 3CLpro cleave the bonds formed by both P1-Glu and P1-Gln like some of picornaviruses (e.g., Foot-and-mouth disease virus), while PRRSV 3CLpro and Glu-specific protease from S. griseus (Glu-SGP, the first Glu-specific protease whose spatial structure was determined) prefer P1-Glu [60], [62], [63], [64]. Here, we established a rapid, sensitive, and efficient luciferase-based biosensor to monitor the activity of arterivirus 3CLpro
in vivo and identified key amino acids for P1 substrate specificity in arterivirus 3CLpro by site-directed mutagenesis and MD simulations. Specifically, a potential intermolecular salt-bridge in arterivirus 3CLpro linked substrate binding and P1 substrate switch, this finding could augment the discovery of new specific inhibitors against arterivirus 3CLpro
[65], [66], [67].Previous studies have shown that the structures of the S1 subsites of Glu-SGP, picornavirus 3Cpro and arterivirus 3CLpro are very close (Fig. S6) [35], [63], [64], [68]. The S1 subsite comprises the three main structural elements: the structures of the S1 subsites have three main structural elements: a conserved histidine residue (His213 in Glu-SGP, His182 in FMDV 3Cpro, His133 in PRRSV 3CLpro and His134 in EAV 3CLpro) at the base of the S1 subsite, a conserved Ser/Thr residue lining one “wall” of the S1 subsite (Ser190 in Glu-SGP, Thr159 in FMDV 3Cpro, Thr113 in PRRSV 3CLpro, Thr115 in EAV 3CLpro), and a conserved Ser residue (Ser216 in Glu-SGP, Ser136 in PRRSV 3CLpro, Ser137 in EAV 3CLpro, corresponding to Gly185 in FMDV 3Cpro). Previously, the significance of residues His134 and Thr115 for EAV 3CLpro function was supported by site-directed mutagenesis experiments. The mutations His134 also terminated the processing; however, the mutations Thr115 only slightly decreased the processing efficiency. Moreover, our findings highlight the essential significance of His134 and the considerably smaller role of the residues Thr115 for the cleavage of substrates with both P1-Glu and P1-Gln by EAV 3CLpro. Meanwhile, our MD analysis also showed that these two residues are essential for the formation of an appropriate geometry of the S1 subsite for recognizing both P1-Glu and P1-Gln (e.g., EAV 3CLpro, FMDV 3Cpro). The third element of the S1 site in Glu-SGP is Ser216, which is highly conserved in PRRSV 3CLpro and EAV 3CLpro but replaced with Gly or Ala residues in FMDV 3Cpro. It is worth noting that the mutagenesis in the Glu-SGP model indicates that the Ser216Ala/Gly replacement decreased the efficiency in their cleavage activity over P1-Glu substrate but did not make the substrates with P1-Gln the preferred ones [63], [69]. Consistent with these results, the mutant S136A of PRRSV 3CLpro lost the ability to cleave substrate with P1-Glu (Fig. 4a) but did not lead to a P1 substrate specificity shift from P1-Glu to P1-Gln preference (data not shown). This suggests that the S136A substitution in PRRSV 3CLpro with that of picornavirus 3Cpro may not sufficiently induce the corresponding specificity toward P1-Gln. It should be noted that although the complexes of proteases and short peptide substrates have been widely used for MD simulations to study the interactions between 3CLpro and substrates [70], [71], [72], the residues within other region of the substrate have the potential to affect the substrate specificity of arterivirus 3CLpro. Thus, the determination of crystal structures of arterivirus 3CLpro and full-length substrate will provide more details on the molecular determinants of substrate specificities of arterivirus 3CLpro, which deserve further investigation.As demonstrated in substrate profiling for the Zika virus NS2B-NS3 protease, the preference for a lysine residue in the S2-binding subsite is the result of salt bridge formation between the β-carboxyl group of the Asp83 cofactor residue and the ε-amino group of the substrate Lys residue [73], [74]. A negatively charged S2 binding site accounts for the pronounced preference for lysine and arginine at the P2 position of the substrate, as shown in the P2 round of unfolding. As with the Zika virus NS2B-NS3 protease, the substrate charge compensator is anticipated to be the core structural determinant of the specificity of arterivirus 3CLpro recognizing charged P1-Glu residues [75]. Our data demonstrated that Lys at position 138 of PRRSV 3CLpro might be suggested as candidates for this. Since PRRSV 3CLpro plays an indispensable role in the virus life cycle by cleaving the polyprotein precursors, we attempted to investigate whether the intermolecular salt bridge in PRRSV 3CLpro is important for its replication by a PRRSV reverse genetics approach. However, the salt bridge-breaking mutant of PRRSV 3CLpro (PRRSV 3CLpro-K138T) could not be rescued, and one possible reason is that the salt bridge-breaking mutation almost completely lost the ability to cleave the substrate P1-E in the polyprotein of PRRSV (Fig. 5c). These data suggest that an evolutionary mechanism for the intermolecular salt bridge in PRRSV 3CLpro might be required for its replication.
Conclusion
Ensuring P1-Gln substrate specificity in PRRSV 3CLpro required the substitution of Lys138 in the S1 subsite and disruption of a salt bridge between protease and substrate. In turn, the T139K mutant of EAV 3CLpro, building a salt bridge, leads to a shift of EAV 3CLpro from P1-Gln to P1-Glu preference. In conclusion, our findings identified an evolutionarily accessible mechanism for disrupting or reorganizing salt bridge with only a single mutation of viral proteases to trigger a substrate specificity switch.
Author contributions
D.W., Q.C., L.F., S.X. and J.Z. conceptualization; Q.C., J.Z., Z.Y., J.G., Z. L., X. S., and Q. J. methodology; J.Z. and Q.C. data curation; J.Z. and Q.C. software; J.Z. and Q.C. formal analysis; D.W. supervision; Q.C. and D.W. writing–original draft; L.F. and S.X. writing—review & editing; D.W., L.F. and S.X. funding acquisition.
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Authors: James R Birtley; Stephen R Knox; Agnès M Jaulent; Peter Brick; Robin J Leatherbarrow; Stephen Curry Journal: J Biol Chem Date: 2005-01-14 Impact factor: 5.157
Authors: Eric J Neumann; James B Kliebenstein; Colin D Johnson; John W Mabry; Eric J Bush; Ann H Seitzinger; Alice L Green; Jeffrey J Zimmerman Journal: J Am Vet Med Assoc Date: 2005-08-01 Impact factor: 1.936
Authors: Yunjeong Kim; Scott Lovell; Kok-Chuan Tiew; Sivakoteswara Rao Mandadapu; Kevin R Alliston; Kevin P Battaile; William C Groutas; Kyeong-Ok Chang Journal: J Virol Date: 2012-08-22 Impact factor: 5.103