Literature DB >> 32391184

Comprehensive Insights into the Catalytic Mechanism of Middle East Respiratory Syndrome 3C-Like Protease and Severe Acute Respiratory Syndrome 3C-Like Protease.

Hao Wang1,2, Shuai He1,2, Weilong Deng1,2, Ying Zhang3, Guobang Li1,2, Jixue Sun1, Wei Zhao1,2, Yu Guo1,2, Zheng Yin1,2,4, Dongmei Li1, Luqing Shang1,2.   

Abstract

Coronavirus 3C-like protease (3CLPro) is a highly conserved cysteine protease employing a catalytic dyad for its functions. 3CLPro is essential to the viral life cycle and, therefore, is an attractive target for developing antiviral agents. However, the detailed catalytic mechanism of coronavirus 3CLPro remains largely unknown. We took an integrated approach of employing X-ray crystallography, mutational studies, enzyme kinetics study, and inhibitors to gain insights into the mechanism. Such experimental work is supplemented by computational studies, including the prereaction state analysis, the ab initio calculation of the critical catalytic step, and the molecular dynamic simulation of the wild-type and mutant enzymes. Taken together, such studies allowed us to identify a residue pair (Glu-His) and a conserved His as critical for binding; a conserved GSCGS motif as important for the start of catalysis, a partial negative charge cluster (PNCC) formed by Arg-Tyr-Asp as essential for catalysis, and a conserved water molecule mediating the remote interaction between PNCC and catalytic dyad. The data collected and our insights into the detailed mechanism have allowed us to achieve a good understanding of the difference in catalytic efficiency between 3CLPro from SARS and MERS, conduct mutational studies to improve the catalytic activity by 8-fold, optimize existing inhibitors to improve the potency by 4-fold, and identify a potential allosteric site for inhibitor design. All such results reinforce each other to support the overall catalytic mechanism proposed herein.
Copyright © 2020 American Chemical Society.

Entities:  

Year:  2020        PMID: 32391184      PMCID: PMC7202269          DOI: 10.1021/acscatal.0c00110

Source DB:  PubMed          Journal:  ACS Catal            Impact factor:   13.084


Introduction

The COVID-19 (Coronavirus Disease-2019) outbreak is caused by Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) and has become a public health emergency worldwide.[1,2] Relevant to the COVID-19 pandemic, the Severe Acute Respiratory Syndrome (SARS) in 2003 and the Middle East Respiratory Syndrome (MERS) in 2015 also caused tremendous social panic due to their high fatality rates at 10% and 30%, respectively.[3−6] All three syndromes are caused by coronaviruses belonging to the same beta-coronavirus family. 3C-Like protease (3CLPro) is a cysteine protease critical to the life cycles of all three types of coronaviruses. Thus, it is an attractive target for drug design, especially for designing pan-coronavirus antivirals.[7−9] We are interested in examining the mechanistic details of 3CLPro with the hope of aiding in drug design efforts. Specifically, the MERS coronavirus (MERS-CoV) possesses a single-stranded positive-sense RNA genome with 2 open reading frames (ORFs) and encodes two polyprotein precursors,[10−13] which are cleaved by 3CLPro and a papain-like cysteine protease (PLPro) to generate 16 nonstructural proteins (NSP1–16).[14−17] Among them, 3CLPro, as the main protease, is critical for most of the cleavage events during polyprotein processing.[18,19] The catalytic mechanism of MERS-CoV 3CLPro is largely unknown. For SARS-CoV 3CLPro, a Gln in the substrate P1 position and a small amino acid residue, such as Gly, Ser, or Ala, in the P1′ position are proposed to be essential based on an analysis of the active sites sequence.[20−23] In addition, a Cys···His catalytic dyad is reported to complete the proteolytic task through a common nucleophilic-type reaction in SARS-CoV 3CLpro (Figure ).[24,25] Although the substrate specificity and nucleophilic-attack model of the SARS-CoV 3CLpro dyad have been illustrated, the detailed catalytic mechanism of coronavirus 3CLPro is still unclear. This inspires us to work on deciphering the comprehensive mechanism of coronavirus 3CLPro.
Figure 1

General overall scheme of the SARS-CoV 3CLPro catalytic mechanism.

General overall scheme of the SARS-CoV 3CLPro catalytic mechanism. Here, we report the comprehensive molecular catalytic mechanism (binding and catalysis) of MERS-CoV 3CLPro and SARS-CoV 3CLPro. For substrate binding, the protease adopted a residue pair (Glu-His) and a stable hydrogen bond formed by a conserved His and the substrate Gln to recognize the conserved Gln in the P1 position of the substrate. For catalysis, a conserved GSCGS motif was identified and shown to stabilize the active site of the substrate and maintain the mobility of the catalytic cysteine side chain. In addition, a partial negative charge cluster (PNCC) formed by Arg-Tyr-Asp was proven to be essential for catalysis, via a remote interaction mediated by a conserved water molecule. The distinctions between proteases from SARS and MERS were explored in terms of their catalytic efficiency. Furthermore, mutation studies were conducted to improve the enzymatic activity (8-fold), the inhibitor of MERS-CoV 3CLPro was optimized to improve the potency via introduction of a powerful hydrogen bond interaction between the inhibitor and the critical Q195 of MERS-CoV 3CLPro. A potential allosteric inhibitory site was identified. Hence, the results will help in the understanding of the enzymology, as well as de novo protein design and novel inhibitor development.

Materials and Methods

Enzyme Preparation

The wild type (WT) MERS-CoV 3CLPro and SARS-CoV 3CLPro genes were synthesized by Genewiz, Inc. Subsequently, MERS-CoV 3CLPro was cloned into the modified pET-28bs vector (Novagen), and the SARS-CoV 3CLPro was constructed in the pEGX-6P vector (Novagen). The detailed primer information was presented in Table S1 of the Supporting Information (SI). Then, the constructed plasmids were transformed into Escherichia coli BL21 (DE3) cells (TransGen Biotech, Beijing, China), and target protein was induced by 0.25 mM isopropyl β-d-1-thiogalactopyranoside (IPTG) at 16 °C for 18 h. The harvested cells were resuspended into lysis buffer containing 20 mM Tris-HCl (pH 8), 150 mM NaCl, 4 mM MgCl2, 5% glycerol, and homogenized with ultrasonic cell disintegration at low temperature. Following centrifugation at 12 000 rpm for 40 min at 4 °C to remove cell debris, the supernatant was loaded onto the Ni-nitrilotriacetic acid (Ni-NTA) column (GE Healthcare). After washing the resin with the washing buffer containing 20 mM imidazole (pH 8), SUMO protease was added to generate MERS-CoV 3CLPro or washed with buffer containing 200 mM imidazole (pH 8) to separate SARS-CoV 3CLPro. Crude protein was purified by Superdex 75 gel filtration chromatography (GE Healthcare) or Superdex 200 gel filtration chromatography (GE Healthcare) and verified by SDS-PAGE analysis (Figure S1). Finally, the target protein was concentrated into 30 mg/mL and stored at −80 °C.

Protein Mutation

The mutational protein was prepared by using the Fast Mutagenesis System Kit (Transgen Biotech Co. LTD) following the manufacturer’s instructions. The primers of mutates were presented in Table S2. Following mutagenesis, the mutational recombinant plasmid was verified via gene sequencing, and the mutational proteases were expressed in the enzyme preparation method.

Activity Measurements

The FRET-based peptide NMA-TSAVLQSGFRK(DNP)M was synthesized via a solid-phase method and used as a substrate, which turned fluorescent upon cleavage of the Gln-Ser bond by 3CLpro. In brief, 2.0 μM MERS-CoV 3CLPro was incubated with seven different concentrations of the inhibitor (2-fold dilution) including DMSO only as blank control in 50 μL assay buffer (pH = 8.0, 20 mM Tris-HCl, 150 mM NaCl) at 37 °C for 30 min. Subsequently, the reaction was initiated following the addition of 30 μM solution of the substrate (50 μL). The change of relative fluorescence units was obtained by a microplate reader (Thermo Varioskan Flash, U.S.A.) at λex of 340 nm and λem of 440 nm. As a consequence, the IC50 value of the inhibitor was calculated based on the inhibitory curve fitting by GraphPad Prism 7.0. To determine the kinetic parameters of the cleavage reaction, the prerequisite was to calculate the relationship between the relative fluorescence units and the substrate concentration via precalibrating the instrument with the free fluorescent moiety NMA standard. The kinetic parameters (Km and kcat) of enzyme and mutations were calculated via kinetics curve fitting by the GraphPad Prism 7.0.[26]

Chemistry

To study the identifying process of protease, several known inhibitors were synthesized as probe molecules.[27] The synthesis routes were presented in Schemes S1 and S2. In brief, intermediates 4a and 4b were obtained from l-glutamate acid following protection of the amino acid, SN2 substitution reaction, and reduction reaction. Following subsequent condensation reaction, reductive reaction, and oxidative reaction, aldehydes 8a and 8b were obtained.[27] Meanwhile, aldehydes 12a and 12b were synthesized via similar methods.

Crystallization, Data Collection, Structure Determination, and Refinement

Purified SARS-CoV 3CLPro was concentrated to 4 mg/mL in the buffer, which contains 20 mM Tris-HCl, 150 mM NaCl, and a pH of 8.0. The inhibitors were preincubated with SARS-CoV 3CLPro in 5:1 stoichiometric ratio at 4 °C overnight. After iterative rounds of optimization of the crystallization conditions, the crystals of SARS-CoV 3CLPro were suitable to grow in 0.1 M MES, pH 6.0, 2%–10% PEG 8000 buffer condition, and in a hanging-drop, vapor diffusion method at 16 °C. For collecting X-ray diffraction data, the crystals were flash-cooled in liquid nitrogen followed by dragging the crystals through the crystallization solution supplemented with 20% glycerol. The X-ray diffraction data were collected at the Rigaku RU200 X-ray generator at Tsinghua University. The data were processed using the HKL2000 package. The space group was identified as C2, and one molecule was detected per asymmetric unit. The crystal structure of SARS-CoV 3CLPro (PDB code: 1UJ1) was used as the initial searching model to determine the complex structure we obtained. Subsequently, the manual model was refined by performing COOT and PHENIX software through rigid-body refinement, energy minimization, and individual b-factor refinement. Finally, the quality of the final refined model was verified by the program PHENIX validation module and the statistics information were summarized in Table and 2.
Table 1

Data Collection and Refinement Statistics

PDB accession no.6LNQ
data collection statistics
X-ray SourceRigaku RU200 X-ray generator
wavelength (Å)1.5418
space groupC 121
unit cell parameters (Å; °)a = 109.068, b = 80.529, c = 53.354; α = 90.00, β = 104.36, γ = 90.0
resolution range* (Å)50.00–2.28 (2.28–2.24)*
unique reflections21337 (1033)
completeness (%)99.9 (97.8)
redundancy6.5 (5.4)
I/σ(I)16.88 (3.36)
Rmerge (%)12.2 (53.9)
refinement statistics
resolution range (Å)43.55–2.244 (2.325–2.244)
reflections used in refinement21323 (2010)
reflections used for R-free1094 (113)
 19.92 (30.51)
Rwork (%) *
Rfree (%) *23.90 (34.75)
number of non-hydrogen atoms2508
protein2371
ligands37
solvent100
average B-factors30.24
protein29.88
ligands47.04
solvent32.41
r.m.s. deviations 
bond lengths (Å)0.008
bond angles (deg)0.87
Ramachandran 
favored (%)97.04
allowed (%)2.96
outliers (%)0.00
Table 2

Data Collection and Refinement Statistics

PDB accession no.6LNY6LO0
data collection statistics
X-ray sourceRigaku RU200Rigaku RU200
wavelength (Å)1.5401.540
space groupC 121C 121
unit cell parameters (Å; °)a = 109.049, b = 80.67, c = 53.407; α = 90.00, β = 104.491, γ = 90.00a = 108.669, b = 81.544, c = 53.382; α = 90.00, β = 104.349, γ = 90.00
resolution range* (Å)50.00–2.2950.00–2.28
 (2.29–2.25)(2.28–2.24)
unique reflections21307 (1001)21495 (930)
completeness (%)99.5 (95.2)98.7 (87.3)
redundancy6.2 (4.7)6.5(5.5)
I/σ(I)19.21 (2.65)21.68 (6.27)
Rmerge (%) *10.6 (67.3)9 (31.8)
refinement statistics
resolution range (Å)43.71–1.9443.71–1.939
 (2.325–2.245)*(2.008–1.939)*
reflections used in refinement21274 (1995)27854 (867)
reflections used for R-free1997 (187)1412 (66)
Rwork (%) *20.41 (27.72)22.72 (40.90)
Rfree (%) *25.28 (34.41)26.83 (46.91)
number of non-hydrogen atoms25102589
Protein23712371
Ligands3029
solvent109189
average B-factors32.9829.46
protein32.7229.06
ligands40.0637.04
solvent36.7233.32
r.m.s. deviations  
bond lengths (Å)0.0090.007
bond angles (deg)0.970.92
Ramachandran  
favored (%)97.7097.70
allowed (%)2.302.30
outliers (%)0.000.00

MD Simulations

The Amber FF14SB force field was utilized and both of the systems were minimized for 10 000 steps, containing 5000 steps of steepest descent minimization and 5000 steps of conjugate gradient minimization.[28−30] Then the systems were heated from 0 to 300 K in 1 ns constant volume MD simulation. In the heating stage, a force constant of 20 kcal/mol was employed to constrain the complex and the Langevin thermostat was utilized for temperature control.[31] After that, a 100 ns MD simulation was performed for each system without any constraints. In the MD simulation, the cutoff value of the van der Waals interactions was set to 10 Å. The Particle Mesh Ewald (PME) method was applied to calculate the long-range electrostatic interactions, and the SHAKE algorithm was applied to restrain all of the bond lengths that involved hydrogen atoms. Snapshots of the system were saved every 10 ps.[32,33]

CPPTRAJ, Pocket Volume Measurer (POVME) and Molecular Mechanics (MM) Calculation Methods

The CPPTRAJ module was applied to calculate the Root Mean Square Deviations (RMSD), distance, dihedral, angle, and solvent accessible surface area (SASA) of each system in AmberTools15.[34] The redundant volume was calculated by the Pocket Volume Measurer (POVME) script.[35] For calculating the binding free energy between two significant residues, Molecular Mechanics (MM) calculation was introduced. To execute the MM calculation, in brief, a total of 500 snapshots were extracted from the last 5 ns trajectory of each system, and all parameters were used in default values in the calculation.[36−38]

Potential of Mean Force (PMF) Calculation

For generating an energy landscape to further explore the energetic change, the PMF was used and the energy landscape was obtained via the equation: In the equation, kB represents the Boltzmann constant, T is the simulation temperature, and the g(x, y) is the normalized probability distribution. The explicit relative energy bar is presented near the energy landscape.[39]

Prereaction State (PRS) Analysis

The initial protein structure used in the Prereaction State (PRS) analysis was constructed with the thiolate-imidazolium ion pair model. Basically, the first step of nucleophilic reaction was assumed to be critical in the cascaded mechanism.[40,41] Accordingly, the two complexations of peptide substrate and the protease were constructed for MERS-CoV Wild type (exp. Km:23.1 ± 2.1 μM, kcat:0.38 ± 0.02 min–1; kcat/Km: ∼16.4 mM–1 min–1) and mutant M168L/T174 V (exp. Km: 9.2 ± 1.1 μM, kcat: 1.27 ± 0.06 min–1; kcat/Km: ∼137.2 mM–1 min–1) with the mutate module in Discovery Studio software package, using the QM-calculated transition state information. Water molecules were assigned with the TIP3P model, and the ff14SB force field was applied for the classical molecular dynamics simulation. The complexes were placed in a truncated octahedral box of water molecules, extending 10.0 Å along each dimension. A certain number of counterions Na+ were added to neutralize the calculated system. The MD systems were first minimized by the steepest descent minimization of 1000 steps followed the conjugate gradient minimization of 9000 steps, heated up from 0 to 300 K at constant volume in 50 ps, and equilibrated for another 50 ps without any restraints. In the MD simulations, the Particle Mesh Ewald (PME) method was employed for long-range electrostatic interactions. Finally, multiple 10 ns of trajectories (100 000 frames) were collected for the further PRS analysis, similar to our previous studies.[42−44]

Ab Initio Calculations

For each reaction system, a snapshot that was close to the average simulated structure was extracted from the stable MD trajectory as the starting structure for the ab initio calculations. Specifically, we tracked the distance between the sulfur atom of C148 and the carbonyl carbon atom of substrate active site in the attacking state which conformed Burgi–Dunitz criteria of near-attach-conformation parameters (the S---C distance <3.5 Å and attacking angle alpha (100–110°) and SCOCa dihedral (85–95°), i.e. distance D1 in Scheme S3. Within stable MD trajectories, we obtained an average D1 value and chose the snapshot with the D1 value being close to the corresponding average D1 value. Each system was truncated as a model (Scheme S3) to mimic the reaction pathway in the protein environment. During the geometry optimization, the boundary atoms were fixed at the position they were in in the protein environment. This ensured that each reaction moiety stayed in the same orientation as that in the protein environment. Geometry optimization was conducted using the B3LYP functional and a uniform 6-31+G* basis set.[45−47] Frequency analysis was used to calculate the Gibbs free energies at 298.15 K and 1 atm. Single-point energy calculation was carried out on the optimized geometry with the MP2 method and the 6-31+G* basis set. The SMD solvation method with a dielectric constant of ε = 5.6968 (chlorobenzene) was used to model the weak polarization effect of the protein environment. The ab initio calculation was carried out using the Gaussian 09 program.[48]

Results

Overall Investigation of the Structures and the Enzymatic Activity of MERS-CoV 3CLPro and SARS-CoV 3CLPro, as well as the Catalytic Center in MERS-CoV 3CLPro

With the aim of understanding the molecular catalytic pattern of MERS-CoV 3CLPro and SARS-CoV 3CLPro, it is important to examine the experimentally determined structure of the protease with a bound ligand. For MERS-CoV 3CLPro, we started with a reported complex structure (PDB code: 4RSP) for further investigation. To check the catalytic mechanism and unravel the general coronavirus 3CLPro catalytic mechanism, we sought to determine the complex structure of SARS-CoV 3CLPro with a peptidomimetic aldehyde inhibitor (PDB code: 6LNQ; for detailed information, see Table ). Generally, MERS-CoV 3CLPro and SARS-CoV 3CLPro appear as a quasi-ellipsoid and possess three separate portions (domain I, domain II, and domain III), which are connected by flexible loops (Figure A). More narrowly, the structures of the two proteases chiefly comprise 10 helices, α1−α10, and 13 β-sheets, β1−β13 (Figure S2). Therein, domains I and II are reported to execute catalytic activity, while domain III is responsible for protease dimerization.[49] By superimposing the two structures, the root-mean-square deviation (RMSD) value indicated that the catalytic zone (domain I and domain II) presented higher similarity than the dimerization domain (domain III) (Figure A). In addition, similar results emerged among four other coronavirus 3CLPro proteins (human coronavirus HKU1 (HCoV-HKU1), bat coronavirus HKU4 (BCoV-HKU4), human coronavirus NL63 (HCoV-NL63), and human coronavirus 229E (HCoV-229E)), which hinted that various coronaviruses 3CLPro have a conserved proteolysis mechanism (Figure S3).
Figure 2

Structure of MERS-CoV 3CLPro and SARS-CoV 3CLPro, enzymatic assay development and verification of the MERS-CoV 3CLPro catalytic center. (A) The overall structure of MERS-CoV 3CLPro and SARS-CoV 3CLPro. Domain I is presented in orange, Domain II is presented in green, and Domain III is presented in cyan. The RMSD distinction between MERS-CoV 3CLPro and SARS-CoV 3CLPro is shown in the table below. The catalytic centers of the two proteases are shown in the center of the image. The 3CLPro catalytic dyad is displayed as sticks (MERS-CoV 3CLPro in cyan and SARS-CoV 3CLPro in green) and the protease is displayed as a cartoon (MERS-CoV 3CLPro in wheat color and SARS-CoV 3CLPro in white). (B) Bioinformatics analysis of the substrate active site sequence. (C) The structure of the designed substrate and the kinetic parameters of the substrate. (D) Catalytic center verification of MERS-CoV 3CLPro. The structure of MERS-CoV 3CLPro is shown as a b-factor to indicate that the two cysteines are stable in the structure. The two cysteines required to investigate are emphasized, and the frame represents the binding site of the substrate on the left side. Enzymatic assays of the wild type and mutant proteases are shown on the right side. The data presented are the mean values from experiments in triplicate and the error bars indicate the standard deviations.

Structure of MERS-CoV 3CLPro and SARS-CoV 3CLPro, enzymatic assay development and verification of the MERS-CoV 3CLPro catalytic center. (A) The overall structure of MERS-CoV 3CLPro and SARS-CoV 3CLPro. Domain I is presented in orange, Domain II is presented in green, and Domain III is presented in cyan. The RMSD distinction between MERS-CoV 3CLPro and SARS-CoV 3CLPro is shown in the table below. The catalytic centers of the two proteases are shown in the center of the image. The 3CLPro catalytic dyad is displayed as sticks (MERS-CoV 3CLPro in cyan and SARS-CoV 3CLPro in green) and the protease is displayed as a cartoon (MERS-CoV 3CLPro in wheat color and SARS-CoV 3CLPro in white). (B) Bioinformatics analysis of the substrate active site sequence. (C) The structure of the designed substrate and the kinetic parameters of the substrate. (D) Catalytic center verification of MERS-CoV 3CLPro. The structure of MERS-CoV 3CLPro is shown as a b-factor to indicate that the two cysteines are stable in the structure. The two cysteines required to investigate are emphasized, and the frame represents the binding site of the substrate on the left side. Enzymatic assays of the wild type and mutant proteases are shown on the right side. The data presented are the mean values from experiments in triplicate and the error bars indicate the standard deviations. For establishment of the enzymatic assay, the active sites of the polypeptide precursors were presented via bioinformatics analysis, and a FRET-based dodecapeptide as an experimental substrate was constructed following analysis of these active sites (Figures B and S4). For validating the availability of the substrate, MERS-CoV 3CLPro and SARS-CoV 3CLPro were tested, and the kinetic dynamic parameters of the proteases were determined (Figures C and S5). Identification of the catalytic center is essential to provide a foundation for decoding the catalytic mechanism. In MERS-CoV 3CLPro, two cysteines (C145 and C148) are located in the catalytic center (Figure D), whereas SARS-CoV 3CLPro exploits only C145 to execute catalysis. Nevertheless, C148 has been reported to be essential to the protease, although the significance of C145 in MERS-CoV 3CLPro is still unknown. To examine the influence of C145 in MERS-CoV 3CLPro, we mutated both Cys to Ser and Ala. Enzymatic assays using the purified variants C145S and C145A showed that these mutations manifested comparable efficiency to WT MERS-CoV 3CLPro, while the variants C148S and C148A almost eliminated the catalytic efficiency (Figure D). Meanwhile, the mutants H41A and H41L were also incapable of cutting the substrate, which suggested the catalytic dyad H41–C148 in MERS-CoV 3CLPro.

Conserved Residue Pair (E–H) Utilize Steric Effect for 3CLPro Recognition of Glutamine Substrates

For substrate binding, the highly conserved native substrate P1 site residue attracted our attention (Figure S4). We previously developed inhibitors of EV71 3C protease (EV71 3CPro) and determined the complex structure of EV71 3CPro with a peptidomimetic inhibitor (PDB code: 5BPE).[50−53] By comparing the S1 pocket structures of EV71 3CPro and coronavirus 3CLPro, we noticed an extra hydrogen bond interaction between the conserved protease glutamine (E169 of MERS-CoV 3CLPro and E166 of SARS-CoV 3CLPro) and the lactam ring of the inhibitor, which presumably led to the enhanced inhibitor affinity (Figure A). Therefore, to verify this assumption, mutagenesis of MERS-CoV 3CLPro E169 was performed.
Figure 3

Comprehensive investigation of the effect of the conserved Glu-His in MERS-CoV 3CLPro and SARS-CoV 3CLPro. (A) Structural analysis of the S1 pocket from three proteases interacting with the inhibitor P1 site. The results are shown as a two-dimensional structure using the Maestro software. Hydrogen bond interactions are presented as purple arrows (inhibitor P1 site acts as a hydrogen bond donor) and red arrows (inhibitor P1 site act as a hydrogen bond acceptor). (B) Biochemical assay of the wild type and E169 mutant proteases for proteolytic activity. The data presented are the mean values from experiments in triplicate, and the error bars indicate the standard deviations. (C) Structural analysis of the surroundings of E169 in MERS-CoV 3CLPro and SARS-CoV 3CLPro. The proteases are shown as cartoons (MERS-CoV 3CLPro in wheat color and SARS-CoV 3CLPro in white), and the inhibitor is presented as yellow sticks. (D) Catalytic activity analysis of the wild type and H175 mutant proteases. The data presented are the mean values from experiments in triplicate, and the error bars indicate the standard deviations. (E) POVME script calculation for the spare volume enveloping the substrate glutamine, 169 site residue, and 175 site residue. The average structure of the WT and mutant proteases following MD simulation are shown (the protease is presented as a cartoon, the key residue in the protease is shown as cyan sticks, and the substrate glutamine is shown as sticks) and the spare volume is displayed as a violet mesh. Meanwhile, the value of the redundant volume is shown at the bottom.

Comprehensive investigation of the effect of the conserved Glu-His in MERS-CoV 3CLPro and SARS-CoV 3CLPro. (A) Structural analysis of the S1 pocket from three proteases interacting with the inhibitor P1 site. The results are shown as a two-dimensional structure using the Maestro software. Hydrogen bond interactions are presented as purple arrows (inhibitor P1 site acts as a hydrogen bond donor) and red arrows (inhibitor P1 site act as a hydrogen bond acceptor). (B) Biochemical assay of the wild type and E169 mutant proteases for proteolytic activity. The data presented are the mean values from experiments in triplicate, and the error bars indicate the standard deviations. (C) Structural analysis of the surroundings of E169 in MERS-CoV 3CLPro and SARS-CoV 3CLPro. The proteases are shown as cartoons (MERS-CoV 3CLPro in wheat color and SARS-CoV 3CLPro in white), and the inhibitor is presented as yellow sticks. (D) Catalytic activity analysis of the wild type and H175 mutant proteases. The data presented are the mean values from experiments in triplicate, and the error bars indicate the standard deviations. (E) POVME script calculation for the spare volume enveloping the substrate glutamine, 169 site residue, and 175 site residue. The average structure of the WT and mutant proteases following MD simulation are shown (the protease is presented as a cartoon, the key residue in the protease is shown as cyan sticks, and the substrate glutamine is shown as sticks) and the spare volume is displayed as a violet mesh. Meanwhile, the value of the redundant volume is shown at the bottom. In principle, we began mutagenesis via replacement of E169 with Asp that also contains a carboxyl group with negative charge. Following evaluation, compared with the WT protease, the mutant E169D exhibited inferior catalytic efficiency, which suggested that the electrostatic force of E169 was too laborious to affect the substrate binding process (Figure B). Subsequently, enzymatic assays were conducted using purified variants with five neutral amino acid residues (Ala, Gln, Leu, Met, and Val) instead of Glu (Figure B). Comparable enzymatic activity was observed for the E169L and E169Q mutants, which possessed side chains of similar bulk to Glu, while the E169A, E169 V, and E169 M mutants, which had side chains with various volumes at the 169th site, showed decreased catalytic activity compared with WT protease, which indicated that steric effects at E169 were required to fit the substrate (Figure B). However, the flexible side-chain of E169 meant that this amino acid alone could not achieve this effect. When scanning the structure, conserved residue H175 was conventionally ignored owing to its weak interaction with the inhibitor. However, the inextricable interaction between H175 and E169 made it reasonable to speculate that H175 was indispensable for proteolytic bioactivity (Figure C). Subsequently, crippling the impregnable interactions by replacing H175 with Ala, Leu, Glu, and even Asn could remarkably reduce the enzymatic activity, though the mutation of H175 to Gln, which shared a similar hydrogen bond donor feature to His, could retain the protease bioactivity (Figure D). The results indicated that the hydrogen bond interaction between H175 and E169 was essential to the protease, which was in contrast to the previous E169L mutant result (Figure B). Then, a multipoint mutant (E169L/H175L) was created, and this mutant expressed inferior catalytic activity (Figure D). To explain these unexpected results, molecular dynamic (MD) simulations were applied to imitate the binding mode of the protease with the natural substrate, which was construed form the peptidomimetic aldehyde structure and optimized by MD simulation. As consequence, compared with WT, the distance between residues 169 and 175 was remarkably enlarged in the mutant H175L and mutant E169L/H175L, while the mutant E169L displayed the same distance as the WT protease (Figure S6), which hinted that redundant space may strikingly affect protease catalytic activity in the H175L and mutant E169L/H175L. Following POVME script calculation, the results demonstrated that the redundant space interfered with the fit between the protease and substrate, which indicated that E169 mainly interacted with the substrate through steric effects rather than electrostatic interactions (Figure E).

Stable Conserved Histidine Participates in the Recognition of Glutamine Substrates by 3CLPro

Due to the essential fitting model of E169-H175, it led us to question whether a residue with a suitable side chain volume might act similar role with Gln to fill the S1 pocket and start the cleavage event. However, coronavirus 3CLPro preferred Gln to His, which exhibits a similar side chain volume as Gln in the substrate P1 site, invalidating this assumption. To explore the underlying reason, the structures of the two proteases were scrutinized, and a conserved histidine (H166 in MERS-CoV 3CLPro and H164 in SARS-CoV 3CLPro) caught our attention, as it formed robust hydrogen bond interactions with the inhibitor (Figure A). In an effort to validate the effect of the robust hydrogen, mutagenesis study on H166 of MERS-CoV 3CLPro was performed. Mutants H166A and H166L could reduce the proteolytic activity, which clearly demonstrated the significance of the intense hydrogen bond (Figure B). As Gln and Asn have similar hydrogen bond donor and acceptor abilities as His, both of them were suitable replacements for histidine. But mutants H166N and H166Q had significantly decreased activity, implying that other characteristics might define the conserved histidine (Figure B). After examining the structures, due to the inappropriate dihedral (CD2-CG-CB-H) presented by H166, conserved H166 exhibited an eclipsed conformation, leading to the torsional tension and a rotating trend (Figure C). Thus, there were restrictive ingredients that limited the rotation of the H166 side chain and maintained H166 in its energized conformation, which made H166 interact with the substrate Gln in an appropriate angle and distance.
Figure 4

A stable histidine is responsible for the 3CLPro recognition stage for the substrate glutamine. (A) The structural analysis of the area surrounding the substrate glutamine and H166. (B) The catalytic activity evaluation of WT and H166 mutants. The data presented are the mean values from experiments in triplicate, and the error bars indicate the standard deviations. (C) The sketch map of the dihedral (CD2-CG-CB-H) from the conserved histidine (left). The general symbol for atom in the histidine and the value of the dihedral (right). (D) The proteolytic activity analysis of the Y164 and F143 mutants in MERS-CoV 3CLPro. The data presented are the mean values from experiments in triplicate and the error bars indicate the standard deviations. (E) The average structure of the WT and mutant (Y164F, F143L, and F143A) protease following MD simulation. In the structures, the protease is presented as a wheat colored cartoon, and the significant residue is shown as sticks (the protease residue is in cyan, and the substrate glutamine is in yellow).

A stable histidine is responsible for the 3CLPro recognition stage for the substrate glutamine. (A) The structural analysis of the area surrounding the substrate glutamine and H166. (B) The catalytic activity evaluation of WT and H166 mutants. The data presented are the mean values from experiments in triplicate, and the error bars indicate the standard deviations. (C) The sketch map of the dihedral (CD2-CG-CB-H) from the conserved histidine (left). The general symbol for atom in the histidine and the value of the dihedral (right). (D) The proteolytic activity analysis of the Y164 and F143 mutants in MERS-CoV 3CLPro. The data presented are the mean values from experiments in triplicate and the error bars indicate the standard deviations. (E) The average structure of the WT and mutant (Y164F, F143L, and F143A) protease following MD simulation. In the structures, the protease is presented as a wheat colored cartoon, and the significant residue is shown as sticks (the protease residue is in cyan, and the substrate glutamine is in yellow). By scrutinizing the two structures, conserved Y164 was detected as exploiting its phenolic hydroxyl group to interact with H166 (Figure A). In theory, the acidic phenolic hydroxyl group of Y164 can provide a proton or form a stronger hydrogen bond with H166. To explore the significance and acting mode of Y164, we chose mutants Y164A, Y164F, Y164R, Y164 K, Y164E, and Y164D, considering electrical and steric factors. Compared with WT, all of the variants showed depressing catalytic activity. This result suggested that the phenolic hydroxyl group of Y164 interacted with H166 via the hydrogen bond to restrain H166 in a suitable orientation to promote preferable substrate recognition (Figure D). To understand the role of Y164, MD simulations were performed, and the results showed that replacement of Y164 led to the free rotation of H166 and leaving of the substrate Gln from the S1 pocket (Figures E and S7). Meanwhile, F143 is located near H166 and appears to interact with H166 via π–π stacking interactions to hamper the rotation of H166 (Figure A). Nevertheless, two variants containing an aromatic ring, F143Y and F143W, exhibited significantly reduced activities, indicating that the π–π stacking may be not the major factor. In the case of variants F143L, F143M, F143 V, and F143A, F143L unexpectedly displayed excellent performance equal to WT (Figure D). Compared with that of WT and F143L mutant, MD simulation results of the mutant F143A revealed a deficiency in steric restriction, leading to an unstable connection between H166 and Y164 and the substrate glutamine (Figures E, S8, and S9). Therefore, F143 was proven to employ steric effects to restrain the rotation of H166. Similar results involving the S1 pocket were exhibited in SARS-CoV 3CLPro during the substrate binding stage (Figure S10).

Special GSCGS Motif Plays a Significant Role in the Start of Catalysis

With regard to the catalysis, it was inevitable to explore the sequence nearby the active Cys. On the basis of the alignment of the common coronavirus 3CLPro sequence, a conserved GSCGS sequence caught our attention (Figure A). As a familiar protein linker, the GS sequence was frequently applied for protein engineering. However, the GS sequence, especially a double GS sequence, infrequently emerges near the viral cysteine protease catalytic center. Therefore, to verify the significance of the motif, the residues in the GSCGS motif of MERS-CoV 3CLPro were sequentially replaced with Ala and all of the mutants represented reduced activities, which manifested that the GSCGS motif was essential for protease activity (Figures B and S11). Further exploring the structure of the protease, the GSCGS motif exhibited three consecutive turns via obvious hydrogen bonding (Figure A). As Pro is an appropriate residue to form a turn in the protein structure, the mutation of Gly to Pro was performed. The decreased bioactivity of variants G146P and G149P indicated the irreplaceability of the GSCGS motif and the significant role of Gly in the GSCGS motif (Figure B).
Figure 5

Comprehensive research on G146 and G149 in the conserved GSCGS motif. (A) Conservation analysis of six common coronavirus 3C-like proteases. (B) Catalytic activity assays of the WT and mutant proteases in MERS-CoV 3CLPro and SARS-CoV 3CLPro. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (C) The explicit structure of the WT protease revealing that the protease GSCGS motif compactly interacts with the substrate active site after MD simulation. The protease is presented as a wheat colored cartoon and the significant residue is highlighted as sticks. (D) MM calculation of the significant interaction in WT and mutant proteases. (E) The time evolution of the substrate glutamine RMSD value in the WT and mutants. (F) The time evolution of the dihedral of C148 (SG-CB-CA-C) in WT protease (in red) and mutations (G149A in blue and G149P in violet). (G) The model to explain the underlying reason for the restraining effect of G149A toward the free rotation of the C148 thiol. The left part is the time evolution of the φ dihedral (148C-149N-149CA-149C) in the WT and G149A mutant proteases. Following mutagenesis of G149 to A149, the distinction in the φ dihedral between the WT and G149A mutant proteases led to more compact binding between C148-G149 and H166, which hampered the rotation of the C148 side chain in the G149A mutant (right).

Figure 6

Comprehensive research on special structure of the conserved GSCGS motif. (A) Typical structure of the WT MERS-CoV 3CLPro protease following MD simulation to reveal the consecutive three turns constituting the GSCGS motif. The protease is presented as a cartoon, and the significant residue is highlighted as sticks. (B) The time evolution of the distance (C148-substrate glutamine and G146-substrate glutamine) in the WT and mutants. (C) Typical structure of the WT MERS-CoV 3CLPro protease following MD simulation to reveal the formation of turn II in the GSCGS motif. (D) Catalytic activity assays of the WT and mutant proteases in MERS-CoV 3CLPro.The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (E) PMF calculation for the SG atom of C148-substrate glutamine carbonyl group distance vs the RMSD of total system in N28L MERS-CoV 3CLPro. (F) MM calculation of the interaction between G149 and H166 in WT and mutant N28L.

Comprehensive research on G146 and G149 in the conserved GSCGS motif. (A) Conservation analysis of six common coronavirus 3C-like proteases. (B) Catalytic activity assays of the WT and mutant proteases in MERS-CoV 3CLPro and SARS-CoV 3CLPro. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (C) The explicit structure of the WT protease revealing that the protease GSCGS motif compactly interacts with the substrate active site after MD simulation. The protease is presented as a wheat colored cartoon and the significant residue is highlighted as sticks. (D) MM calculation of the significant interaction in WT and mutant proteases. (E) The time evolution of the substrate glutamine RMSD value in the WT and mutants. (F) The time evolution of the dihedral of C148 (SG-CB-CA-C) in WT protease (in red) and mutations (G149A in blue and G149P in violet). (G) The model to explain the underlying reason for the restraining effect of G149A toward the free rotation of the C148 thiol. The left part is the time evolution of the φ dihedral (148C-149N-149CA-149C) in the WT and G149A mutant proteases. Following mutagenesis of G149 to A149, the distinction in the φ dihedral between the WT and G149A mutant proteases led to more compact binding between C148-G149 and H166, which hampered the rotation of the C148 side chain in the G149A mutant (right). Comprehensive research on special structure of the conserved GSCGS motif. (A) Typical structure of the WT MERS-CoV 3CLPro protease following MD simulation to reveal the consecutive three turns constituting the GSCGS motif. The protease is presented as a cartoon, and the significant residue is highlighted as sticks. (B) The time evolution of the distance (C148-substrate glutamine and G146-substrate glutamine) in the WT and mutants. (C) Typical structure of the WT MERS-CoV 3CLPro protease following MD simulation to reveal the formation of turn II in the GSCGS motif. (D) Catalytic activity assays of the WT and mutant proteases in MERS-CoV 3CLPro.The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (E) PMF calculation for the SG atom of C148-substrate glutamine carbonyl group distance vs the RMSD of total system in N28L MERS-CoV 3CLPro. (F) MM calculation of the interaction between G149 and H166 in WT and mutant N28L. For investigating the effect of G146 in the motif, MD simulations were performed, and the results suggested that G146 and C148 could form firm hydrogen bonds with the substrate active site in the WT protease conformation (Figure C). However, the conformation of the G146A and G146P mutants following MD simulation demonstrated that the 146th residue failed to have an optimal distance or interactive angle, and even a deficiency in hydrogen bonding interaction, to bind the substrate active site (Figures D, S12, and S13). As a consequence, compared with WT protease, a modest swing of the substrate Gln residue emerged in the G146A and G146P mutants. This result indicated that G146 could fix the carbonyl group at the substrate active site, decrease the swing at the active site, and facilitate the assault of C148 (Figure E). With analysis on the MD simulation results of variants G149A and G149P, the dihedral constituted from SG-CB-CA-C of C148 presented extreme stability compared with the WT protease, which suggested that the rotation frequency of the C148 side chain had been remarkably decreased (Figure F). In parallel, the PMF calculation result for WT protease manifested two low energy conformations appearing in the trajectory of the protease. However, the results of variants G149A and G149P suggested that only one low energy conformation, where the thiol was far from the active site of the substrate, emerged in the motion of the protease (Figure S14) and manifested the rotation of C148 side chain relied on G149. To unveil the underlying reason, the rotation of the C148 thiol in the WT protease was further investigated and two typical states (resting state and attacking state) were noticed in the MD simulations (Figure S15). In the resting state, the C148 thiol preferred to deviate from the substrate and interact with H166. Conversely, the C148 thiol presented a suitable conformation, close to the substrate in the attacking state. By superimposing the attacking and resting state conformations, the C148-G149 main chain exhibited obvious deflection, while the dihedral φ and ψ angles of C148 were stable (Figures S16 and S17). This result indicated that the translation of the C148-G149 main chain other than rotation might contribute to the rotation of the C148 thiol. In the G149A mutant, the smaller A149 φ dihedral led to a compact conformation of A149 and enhanced the rigid hydrogen bond between A149 and H166, which indicated that the translation of the C148-A149 main chain would demand more energy to break the restriction of H166 to A149 in the G149A mutant (Figure D and 5G). Additionally, due to the deficiency of the hydrogen bond interaction in G149P, the C148 thiol formed a more robust interaction with H166 in compensation, interfering with the rotation of the C148 thiol (Figure S18).

Special Three Consecutive Turns in GSCGS Motif

To smoothly execute the task discussed above, the GSCGS motif must maintain three special and consecutive turns. Turns I and III are guaranteed via two interior hydrogen bonds of the motif mediated by S147 and S150 (Figure A). Mutagenesis of Ser to Ala would seriously enlarge the distance (G146 carbonyl at the active site and C148 carbonyl at the active site) and hamper the anchoring effect of G146 and C148 toward the active site in the MD simulation (Figure B). Interestingly, considering that turn III is dominated by a hydrogen bond between S150 and S147, the substation of S150 with Ala would interfere not only with turn III but also turn I. This result explained the more drastic effect of variant S150A than that of S147A on the distance. Alternatively, turn II is maintained by an exterior hydrogen bond formed by N28 (Figure C). Mutagenesis of N28 to Ala, Leu, Asp, and Gln, as well as His, resulted in the reduced activities of the protease (Figure D). To investigate the concrete effect of N28, MD simulations were performed on the mutant N28L. As a consequence, similar to that in the mutant G149A, the free rotation of the C148 thiol was restricted in mutant N28L, which further proved that the mutagenesis of N28 would deprive the pulling force of N28 to the carbonyl group of C148 and lead to more robust hydrogen bond interaction between the main chain of C148-G149 and H166. Hence, dominated by N28, the formation of turn II contributed to the rotation of the C148 thiol (Figure E and 6F).

Partial Negative Charge Cluster Constituted of Asp190, Arg40, and Tyr54 Exhibited Indispensable Functions during the Catalytic Process

With the development of the catalysis, bearing an analogous chymotrypsin-like structure of 3CLPro, the EV71 3C protease cuts the polyprotein precursors with its catalytic triad (Glu, His, and Cys) (Figure S19). Nevertheless, there is a catalytic dyad (His and Cys) in coronavirus 3CLPro due to the Glu deficiency. After aligning the six coronavirus 3CLPro sequences (Figure S20), the conserved acidic amino acid D190 caught our attention. Interestingly, D190 is located almost 6.7 Å away from H41, which seemed problematic for promoting firm interaction between D190 and the catalytic dyad due to the long distance (Figure A). To discover the significance of D190, D190 was mutated to Leu, Ala, His, and Asn. As a result, the D190A, D190L, D190H, and D190N mutants failed to effectively cleave the substrate, while the D190E mutant retained protease catalytic activity, which implied that the acidic residue was indispensable, despite the far distance from the catalytic dyad (Figure B).
Figure 7

Importance of the partial negative charge cluster (PNCC) constituted by Arg-Tyr-Asp in MERS-CoV 3CLPro and SARS-CoV 3CLPro. (A) The structural analysis of the area surrounding the catalytic center in MERS-CoV 3CLPro (left) and SARS-CoV 3CLPro (right). (B) Catalytic activity assay of the D190 and R40 mutants in MERS-CoV 3CLPro. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (C) The structural analysis of the area surrounding the key Arg-Asp seeking an additional residue participating in the PNCC. (D) Catalytic activity analysis on the Y54 mutants, M85 mutants, and multipoint mutants in MERS-CoV 3CLPro. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (E) Structural scrutiny to understand on the typical structural characteristics for maintaining the compact connection between conserved Arg and Asp in MERS-CoV 3CLPro (left) and SARS-CoV 3CLPro (right). (F) Catalytic activity evaluation of the C38, P39, and Q195 mutants in MERS-CoV 3CLPro. The data present the average results from three independent experiments, and the error bars indicate the standard deviations.

Importance of the partial negative charge cluster (PNCC) constituted by Arg-Tyr-Asp in MERS-CoV 3CLPro and SARS-CoV 3CLPro. (A) The structural analysis of the area surrounding the catalytic center in MERS-CoV 3CLPro (left) and SARS-CoV 3CLPro (right). (B) Catalytic activity assay of the D190 and R40 mutants in MERS-CoV 3CLPro. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (C) The structural analysis of the area surrounding the key Arg-Asp seeking an additional residue participating in the PNCC. (D) Catalytic activity analysis on the Y54 mutants, M85 mutants, and multipoint mutants in MERS-CoV 3CLPro. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (E) Structural scrutiny to understand on the typical structural characteristics for maintaining the compact connection between conserved Arg and Asp in MERS-CoV 3CLPro (left) and SARS-CoV 3CLPro (right). (F) Catalytic activity evaluation of the C38, P39, and Q195 mutants in MERS-CoV 3CLPro. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. In addition to D190, R40 formed a strong electric interaction with D190 in the coronavirus 3CLPro, which might neutralize the negative charge of D190 (Figure A). Following mutagenesis of R40, the catalytic activity of mutants exhibited a remarkable decline (Figure B). Interestingly, when R40 was mutated to an aliphatic residue (Leu or Ala), the density of the D190 negative charge would be enhanced and the proteolytic activity might in theory be improved. However, the mutants R40L and R40A manifested decreased proteolytic activity. Therefore, a partial negative charge cluster constituted from R40-D190 was supposed to play a significant role in the catalysis activity. Meanwhile, the value of the positive charge of the Arg side chain was usually superior to the value of the negative charge manifested by Asp side chain in physical condition. This implied the existence of an extra residue that could alleviate the positive charge of R40 for maintaining the partial negative charge of R40-D190. Investigation of the structures of the two proteases showed that Y54 and M85 were located near R40 and might interact with R40 via a π-cation interaction and an electron atmosphere–cation interaction, respectively (Figure C). Following mutagenesis of Y54 and M85, the results suggested that Y54, rather than M85, could interact with R40 via the π–cation interaction, which potentially supported that Y54 could interact with R40 via the π-cation interaction to alleviate the electrical interference of R40 to D190 and further caused D190 to present partial negative charge (Figure D). Additionally, multipoint mutations were introduced, and the loss of catalytic activities of the variants reiterated the importance of the partial negative charge cluster (PNCC) constituted by D190-R40-Y54 (Figure D). Since the D190-R40-Y54 synergistically took effect on the catalysis, the interior close connection of PNCC required a special structure for support. Investigation of the protease sequence near R40 identified a remarkably conserved sequence containing C38 and P39 (Figure S21). There was a typical β-turn formed by P39–V42 hydrogen bond interactions, which led to R40 protruding from the loop and potentially enhanced the interior interaction of PNCC (Figure E). However, the deficiency of apparent interaction involving C38 suggested that the C38 side chain volume might be responsible for shaping the remarkably conserved short loop. To confirm this hypothesis, we mutated C38 and P39, and the results verified the assumption (Figure F). From the perspective of D190, an atypical turn was formed, and conserved residue Q195 interacted with the main chain carbonyl group of M189 and K191, which caused a protrusion of D190 from the loop and promoted the R40-D190 interaction (Figures S21 and 7E). The decreased catalytic activities of the mutagenesis proved the significance of Q195 (Figure F).

Fixed by Gln, a Conserved Water Mediates the Remote Interaction between PNCC and the Dyad

Owing to the deficiency of intuitive binding, remote interactions are generally neglected. In coronavirus 3CLPro, the location of PNCC is approximately 6–7 Å from the catalytic dyad, which makes it difficult to assume powerful interactions between the dyad and PNCC. However, the investigation of the structures of the two proteases identified a conserved water, which might mediate the remote interaction between these two essential components (Figure A). To verify the necessity of the conserved water, MD simulations were employed to imitate the motion of the conserved water that was extracted from its original location in MERS-CoV 3CLPro. Following PMF calculation, the lower system energy demand compelled the water to penetrate into the protein and locate itself at a suitable site, which proved the essentiality of conserved water (Figure B).
Figure 8

Indispensable roles of the remote interaction mediated by conserved water in catalysis. (A) Structural scrutiny of the space between H41 and the PNCC in MERS-CoV 3CLPro (left) and SARS-CoV 3CLPro (right). (B) PMF calculation for the conserved water–Q167 distance vs conserved water–D190 distance. (C) Catalytic activity assays of the Q167 mutants in MERS-CoV 3CLPro. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (D) POVME calculation of the space volume enveloping 41H, Q167, M168, and D190 in the average structure of Q167A (left), Q167 V (center), and Q167L (right) following MD simulation. The proteases are shown as wheat colored cartoons, the key residues are presented as sticks and the redundant volume is exhibited as a violet mesh. The calculated space volume value (refer to the symbol V) and the contribution to the polar surface area of residue (refer to the symbol C) are shown above. (E) The PRS analysis for the WT and M168VT174 V mutant of MERS-CoV 3CL proteases. (F) Optimized structures and free energy barriers for the first step of catalysis stage.

Indispensable roles of the remote interaction mediated by conserved water in catalysis. (A) Structural scrutiny of the space between H41 and the PNCC in MERS-CoV 3CLPro (left) and SARS-CoV 3CLPro (right). (B) PMF calculation for the conserved water–Q167 distance vs conserved water–D190 distance. (C) Catalytic activity assays of the Q167 mutants in MERS-CoV 3CLPro. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (D) POVME calculation of the space volume enveloping 41H, Q167, M168, and D190 in the average structure of Q167A (left), Q167 V (center), and Q167L (right) following MD simulation. The proteases are shown as wheat colored cartoons, the key residues are presented as sticks and the redundant volume is exhibited as a violet mesh. The calculated space volume value (refer to the symbol V) and the contribution to the polar surface area of residue (refer to the symbol C) are shown above. (E) The PRS analysis for the WT and M168VT174 V mutant of MERS-CoV 3CL proteases. (F) Optimized structures and free energy barriers for the first step of catalysis stage. Meanwhile, the conserved water was noticed to interact with a key residue (Q167 in MERS-CoV 3CLPro and H164 in SARS-CoV 3CLPro), which was considered to hold the conserved water in an appropriate location (Figure A). Thus, the Q167 in MERS-CoV 3CLPro was mutated to aliphatic residues (Figure C). As a consequence, biochemical results and POVME calculation results suggested that the volume and hydrophilicity of the pocket created by H41, Q167, M168, and D190 were critical to the catalytic activity of protease (Figure D). With further investigation, the lower catalytic activity emerged in the variants (Q167S, Q167N, and Q167H) with hydrophilic side chains at 167th residue, which additionally proved that the long hydrophilic side chain of Q167 anchored the conserved water in the suitable location. These results verified the significance of the conserved water. Furthermore, the mutation of Q167 to Glu, with an additional negative center, would decrease the enzymatic activity and further verified the importance of the partial negative charge of PNCC (Figure C). A highly similar result was observed for SARS-CoV 3CLPro (Figure S22). As discussed above, to further explore the concrete effects of the remote interaction composed of PNCC and the conserved water, Prereaction State (PRS) analysis and quantitative calculations were performed. According to the previous QM/MM calculation,[39,40] the first step of MERS-CoV 3CLPro-catalyzed reaction was believed to be the nucleophilic attack of the cysteine residue, and the resulting thioester would be rapidly hydrolyzed by a water molecule under the general base catalysis of His41. Figure E showed the accessibility of the Gln-Ser peptide bond of the substrate to the Cys148 thiolate of the MERS-CoV 3CLPro. It is highly essential for the nucleophilic thiolate and the acceptor carbonyl group to align up in the sp2-sp3 conversion to maximize the orbital interaction between the peptide π* (LUMO) and sulfur lone pair (HOMO) in the nucleophilic addition.[54] Using the Burgi–Dunitz criteria of near-attach-conformation parameters (the S---C distance <3.5 Å and attacking angle alpha (100–110°) and SCOCa dihedral (85–95°)), the calculated PRS indicated a significant population belong to the resting state, where the S–C distance is as long as ∼4.8 Å. It is likely that the 3CLPro is yet poorly evolutionary for this substrate. Therefore, we chose an attacking state conformation on the basis of the criteria described at methods and performed ab initio calculation. Compared with the state excluding the remote interaction, the integral state presented a lower catalytic energy barrier, which suggested that the existence of the remote interaction could extremely accelerate the first step of the nucleophilic reaction (Figure E). Taken together with the partial negative charge of PNCC, it was considered to temporarily stabilize the protonated histidine with conserved water mediating rather than fastening the protonated H41, therefore interfering with the subsequent catalytic process.

Distinction of Catalytic Efficiency between MERS-CoV 3CLPro and SARS-CoV 3CLPro during Substrate Binding Stage

According to the investigations described above, the comprehensive catalytic mechanism of coronavirus 3CLPro was proposed. Insights into the catalytic mechanism would contribute to penetrate into the individuality of coronavirus 3CLPro and decipher the distinction of the catalytic efficiency between MERS-CoV 3CLPro and SARS-CoV 3CLPro. When designing inhibitors against 3CLPro, cyclization of the glutamine side-chain to a (S)-γ-lactam and (S)-δ-lactam was an efficient application of the configuration restriction strategy. Therefore, peptidomimetic aldehydes 8a and 8b were synthesized, and their inhibitory activities were evaluated (Figures A and 9B). Compound 8a showed more preferable inhibitory activities than 8b against the two proteases. We determined two cocrystal structures of each compound bound to SARS-CoV 3CLPro (PDB code: 6LNY, 6LO0; see the detailed information referred to in Table ). Analysis of the two structures detected a slight deflection of E166, which demonstrated the obvious extrusion force existed between the lactam ring and a rigid β-sheet segment (consisting of G170, V171, and H172) and resulted in the enhancement of the fitting interaction between the protease and 8a (Figure C). Interestingly, a similar section presented a turn dominated by the hydrogen bond (N172-T174) and a loose flexible loop in MERS-CoV 3CLPro (Figure D).
Figure 9

A significant distinction between MERS-CoV 3CLPro and SARS-CoV 3CLPro during recognition. (A) The structure of aldehydes 8a and 8b. (B) Inhibitory activity evaluation of the two aldehyde inhibitors against MERS-CoV 3CLPro and SARS-CoV 3CLPro. The IC50 values were calculated and are presented in the table at the bottom. The data presented are the mean values from experiments in triplicate, and the error bars indicate the standard deviations. (C) The structural analysis from the binding model of inhibitor 8a (left) and 8b (right) in the cocrystal structure. The superimposed structure is applied at the center to indicate the major distinction of the two inhibitors’ binding modes with respect to E166. (D) Superimposition of the MERS-CoV 3CLPro (in white) and SARS-CoV 3CLPro (in wheat color) structures. The main structural distinction of two proteases in the S1 pocket is highlighted with a black frame. (E) The detailed distinction in the highlighted secondary structure between SARS-CoV 3CLPro (displayed in gray cartoon and green sticks) and MERS-CoV 3CLPro (displayed in wheat colored cartoon and cyan sticks). (F) Catalytic activity analysis and enzyme kinetic parameters of the T174 V mutant. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. The significant residue is shown as sticks (MERS-CoV 3CLPro residue in cyan, SARS-CoV 3CLPro residue in green, and the substrate glutamine in yellow).

A significant distinction between MERS-CoV 3CLPro and SARS-CoV 3CLPro during recognition. (A) The structure of aldehydes 8a and 8b. (B) Inhibitory activity evaluation of the two aldehyde inhibitors against MERS-CoV 3CLPro and SARS-CoV 3CLPro. The IC50 values were calculated and are presented in the table at the bottom. The data presented are the mean values from experiments in triplicate, and the error bars indicate the standard deviations. (C) The structural analysis from the binding model of inhibitor 8a (left) and 8b (right) in the cocrystal structure. The superimposed structure is applied at the center to indicate the major distinction of the two inhibitors’ binding modes with respect to E166. (D) Superimposition of the MERS-CoV 3CLPro (in white) and SARS-CoV 3CLPro (in wheat color) structures. The main structural distinction of two proteases in the S1 pocket is highlighted with a black frame. (E) The detailed distinction in the highlighted secondary structure between SARS-CoV 3CLPro (displayed in gray cartoon and green sticks) and MERS-CoV 3CLPro (displayed in wheat colored cartoon and cyan sticks). (F) Catalytic activity analysis and enzyme kinetic parameters of the T174 V mutant. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. The significant residue is shown as sticks (MERS-CoV 3CLPro residue in cyan, SARS-CoV 3CLPro residue in green, and the substrate glutamine in yellow). In addition, SARS-CoV 3CLPro displayed more efficient substrate affinity than MERS-CoV 3CLPro (lower Km value) (Figure C). Therefore, we hypothesized that the presence of the turn might transform the secondary structure of the segment and cause the catalytic distinction of the two proteases. The mutant T174 V mutation could increase catalytic activity, which suggested that T174 might dominate the secondary structure of the segment and lead to the significant catalytic distinction of MERS-CoV 3CLPro and SARS-CoV 3CLPro (Figures E and 9F).

Distinction between MERS-CoV 3CLPro and SARS-CoV 3CLPro during the Catalysis

In the catalysis, H164 of SARS-CoV 3CLPro was noticed to replace the corresponding Q167 in MERS-CoV 3CLPro. However, the Q167H variant in MERS-CoV 3CLPro exhibited inferior enzymatic activity (Figure A). It was first considered that the amino acid residues surrounding the Q167 for MERS-CoV 3CLPro and H164 for SARS-CoV 3CLPro might determine the selectivity and further interfere with protease activity. In MERS-CoV 3CLPro, the S178 side chain acted as a hydrogen bond acceptor to fix Q167, while T88 might be responsible for maintaining the suitable space volume among T88, Q167, and S178 (Figure B). With the change of Q167 in MERS-CoV 3CLPro to H164 in SARS-CoV 3CLPro, the T175 side chain acted as a hydrogen bond acceptor to fix H164. In addition, T175 and C85 were responsible to maintain the appropriate space volume among C85, H164, and T175 (Figure B).
Figure 10

Distinction between MERS-CoV 3CLPro and SARS-CoV 3CLPro during the catalysis stage. (A) The distinct structure to fix the conserved water between MERS-CoV 3CLPro (left) and SARS-CoV 3CLPro (right), and the catalytic activity analysis of the Q167H mutant in MERS-CoV 3CLPro. (B) Structural scrutiny of the area surrounding Q167 in MERS-CoV 3CLPro (left) and H164 in SARS-CoV 3CLPro (right). (C) Catalytic activity assays of the T88 and S178 mutants in MERS-CoV 3CLPro. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (D). Catalytic activity assays of the T88C, Q167H, and S178T individual and multipoint mutants from MERS-CoV 3CLPro in a dose-dependent manner (upper figure) and the kinetic parameters for multipoint mutant catalysis (table below). The data present the average results from three independent experiments and the error bars indicate the standard deviations.

Distinction between MERS-CoV 3CLPro and SARS-CoV 3CLPro during the catalysis stage. (A) The distinct structure to fix the conserved water between MERS-CoV 3CLPro (left) and SARS-CoV 3CLPro (right), and the catalytic activity analysis of the Q167H mutant in MERS-CoV 3CLPro. (B) Structural scrutiny of the area surrounding Q167 in MERS-CoV 3CLPro (left) and H164 in SARS-CoV 3CLPro (right). (C) Catalytic activity assays of the T88 and S178 mutants in MERS-CoV 3CLPro. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (D). Catalytic activity assays of the T88C, Q167H, and S178T individual and multipoint mutants from MERS-CoV 3CLPro in a dose-dependent manner (upper figure) and the kinetic parameters for multipoint mutant catalysis (table below). The data present the average results from three independent experiments and the error bars indicate the standard deviations. To verify the analysis on the environment of the residues for anchoring the conserved water, mutations of T88 and S178 in MERS-CoV 3CLPro were performed. The enzymatic results showed that the single point mutation resulted in decreased protease activities, but the multipoint mutation S178T/T88C almost remained in the catalytic activity (Figure C). Meanwhile, the MD simulations results indicated the violent swing of Q167 emerged in mutants S178A and T88A, while stable Q167 appeared in mutant S178T/T88C (Figures S23 and S24). These results verified the similar environment of MERS-CoV 3CLPro Q167 and SARS-CoV 3CLPro H164, and suggested that the key residue to anchor the conserved water rather than the environment might be the main ingredient to cause the distinction of two proteases. Therefore, on the basis of the elimination of environmental impacts, a multipoint mutant (T88C/Q167H/S178T) of MERS-CoV 3CLPro was created. Compared with WT MERS-CoV 3CLPro, the decreased proteolytic ability and remarkably decreased kcat value of the multipoint mutation T88C/Q167H/S178T indicated that the residue, anchoring the conserved water, allowed for the distinction between the catalytic capabilities of the two proteases in the cleavage stage (Figure D).

Two Effective Strategies for Enzyme and Drug Design Based on the Catalytic Mechanism

The high specificity of 3CLpro prompts us to improve the catalytic efficiency for engineering enzyme like Tobacco Etch Virus protease (TEV-P) in intracellular regulation and Trypsin in protein sequence analysis.[55,56] In addition, the significance of 3CLpro prompts us to develop effective antiviral agents. Therefore, the profound investigation of the catalytic mechanism contributes to enzyme design for expanding enzyme applications and antiviral drug design. For the MERS-CoV 3CLPro, as a result of the conserved water being located at the zone enveloped by H41, Q167, M168, and D190 to mediate the remote interaction, restricting the motion of conserved water might be helpful to maintain the remote interaction between the dyad and PNCC. Moreover, in the MD simulations on the catalytic process (H41 formed protonated histidine-HIP and C148 became thiolate following an abstract proton processing of H41), a typical conformation of the conserved water approaching M168 was captured (Figure A). Therefore, mutagenesis of M168 to hydrophobic Leu was investigated to impede the conserved water from approaching L168 and maintain the tight interaction between PNCC and protonated histidine. The enhanced enzymatic activity proved the validity of the mutagenesis strategy, and the underlying mechanism was shown to restrict the motion of the conserved water in compact area via MD simulation, which further proved the temporary stabilizing effect of PNCC via the conserved water (Figures B, 11C, and S25). A highly similar mechanism was also proven for SARS-CoV 3CLPro (Figure S26).
Figure 11

Two effective applications based on the investigated mechanism. (A) Structural analysis of the area surrounding of the conserved water in the cocrystal structure of MERS-CoV 3CLPro (left) and the typical structure following MD simulation (right). (B) Catalytic activity assays of the M168L mutant in a dose-dependent manner (upper figure) and the kinetic parameters for the mutant during catalysis (table below). The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (C) PMF calculation for the conserved water—Q167 distance vs conserved water—D190 distance in WT (left) and M168L mutant (right) proteases. (D) The target to optimize the inhibitor. The importance and the stability of Q195 was reflected by the stable RMSD value, supporting that it become the target to optimize the inhibitor. Meanwhile, the typical structure of MERS-CoV 3CLPro binding with compound 12a following MD simulation showed that the para-position of the aldehyde 12a P4 site was a potential site to be optimized. (E) The strategy to optimize the aldehyde 12a. Introducing a hydrogen bond acceptor to generate aldehyde 12b might promote superior affinity toward MERS-CoV 3CLPro through MD simulation, and the average structure of MERS-CoV 3CLPro binding to 12b following MD simulation is exhibited. (F) The inhibitory activity investigation for 12a and 12b. The data present the average results from three independent experiments, and the error bars indicate the standard deviations.

Two effective applications based on the investigated mechanism. (A) Structural analysis of the area surrounding of the conserved water in the cocrystal structure of MERS-CoV 3CLPro (left) and the typical structure following MD simulation (right). (B) Catalytic activity assays of the M168L mutant in a dose-dependent manner (upper figure) and the kinetic parameters for the mutant during catalysis (table below). The data present the average results from three independent experiments, and the error bars indicate the standard deviations. (C) PMF calculation for the conserved water—Q167 distance vs conserved water—D190 distance in WT (left) and M168L mutant (right) proteases. (D) The target to optimize the inhibitor. The importance and the stability of Q195 was reflected by the stable RMSD value, supporting that it become the target to optimize the inhibitor. Meanwhile, the typical structure of MERS-CoV 3CLPro binding with compound 12a following MD simulation showed that the para-position of the aldehyde 12a P4 site was a potential site to be optimized. (E) The strategy to optimize the aldehyde 12a. Introducing a hydrogen bond acceptor to generate aldehyde 12b might promote superior affinity toward MERS-CoV 3CLPro through MD simulation, and the average structure of MERS-CoV 3CLPro binding to 12b following MD simulation is exhibited. (F) The inhibitory activity investigation for 12a and 12b. The data present the average results from three independent experiments, and the error bars indicate the standard deviations. In combination with the previous study on the distinction of two protease in recognition stage, multipoint mutagenesis was performed. The enzymatic results suggested the multipoint mutation M168L/T174 V of MERS-CoV 3CLPro exhibited an 8-fold increase in catalytic efficiency compared to WT protease. On the basis of the PRS analysis, it was computationally observed that the Cys148 SG atom in the highest-efficient mutant (M168L/T174 V) points to the carbonyl C atom statistically and dynamically better than those in the WT protease. On the basis of the Burgi–Dunitz criteria of near-attach-conformation parameters, the calculated PRS ratio was about 1:1.8 between the WT and mutant, which was conclusive that the distal mutations at the 168th and 174th sites can promote the reactive population of PRS via synergetic effect. Meanwhile, the PRS analysis on the multiple mutant (M168L/T174 V) reiterate the validity of molecular mechanism investigation (Figure E). Insights into the catalytic mechanism are meaningful to guide inhibitor design. According to the explicit investigation, essential Q195 was located at the surface of the substrate binding pocket and presented extreme stability in the MD simulations. This implied that Q195 could become a significant target to optimize inhibitors (Figure D). In a previous investigation, 12a was designed to inhibit MERS-CoV 3CLPro and expressed a specific inhibitory activity. In the study focused on the binding model of 12a, MD simulation results showed that the P4 site of 12a approached the Q195. Therefore, a pyridine group as a hydrogen acceptor was introduced into the inhibitor at P4 to generate 12b, which might smoothly interact with Q195. The results of enzymatic analysis and MD simulations revealed that 12b exhibited an almost 4-fold increase in inhibitory activity against MERS-CoV 3CLPro compared to 12a and formed a tight interaction with Q195 (Figures E, 11F, and S27).

Discussion and Summary

By exploring MERS-CoV 3CLPro and SARS-CoV 3CLPro, the experimental results clearly present a comprehensive molecular catalytic mechanism for both MERS-CoV 3CLPro and SARS-CoV 3CLPro. Taking MERS-CoV 3CLPro as an example, for substrate binding, E169-H175 identifies the substrate Gln via steric effect rather than through direct electrostatic interaction of E169. Meanwhile, the protease utilizes a hydrogen bond formed by Y164 and the steric effect of F143 to stabilize H166 in an eclipsed conformation which is advantageous for H166 to establish the powerful hydrogen bond interaction with the Gln of substrate and enhance the recognition of the substrate Gln by the protease (Figure A). Throughout the investigation, the conserved carbonyl group rather than the side chain N-terminal amide bond in the Gln is considered essential for recognition of 3CLPro. This implies that replacement of the Gln side chain N-terminal amide bond with a suitable volume group, which should be fit for the conserved residue pair (E169-H175). This strategy may extend the variety of the P1 fragment of 3CLPro inhibitor.
Figure 12

Summary of the catalytic mechanism in MERS-CoV 3CLPro and SARS-CoV 3CLPro, and the allosteric inhibitory site in coronavirus 3CLPro. MERS-CoV 3CLPro was applied as an example to illustrate the mechanism. (A) The detailed recognition mechanism with the S1 pocket. (B) The explicit mechanism during the start of catalysis stage. (C) The explicit mechanism during the first step of the catalysis stage. (D) The protein surface of MERS-CoV 3CLPro (upper) and SARS-CoV 3CLPro (bottom), and the allosteric inhibitory site in the two proteases. The zone enveloped in the green frame refers to the substrate binding sites, and the zone surrounded with the orange frame refers to the PNCC. Following structural scrutiny, the PNCC is located at the surface of the protein and expressed as a pocket shape, which infers that the partial negative charge cluster may become the target to inhibit the coronavirus 3CLPro.

Summary of the catalytic mechanism in MERS-CoV 3CLPro and SARS-CoV 3CLPro, and the allosteric inhibitory site in coronavirus 3CLPro. MERS-CoV 3CLPro was applied as an example to illustrate the mechanism. (A) The detailed recognition mechanism with the S1 pocket. (B) The explicit mechanism during the start of catalysis stage. (C) The explicit mechanism during the first step of the catalysis stage. (D) The protein surface of MERS-CoV 3CLPro (upper) and SARS-CoV 3CLPro (bottom), and the allosteric inhibitory site in the two proteases. The zone enveloped in the green frame refers to the substrate binding sites, and the zone surrounded with the orange frame refers to the PNCC. Following structural scrutiny, the PNCC is located at the surface of the protein and expressed as a pocket shape, which infers that the partial negative charge cluster may become the target to inhibit the coronavirus 3CLPro. During the catalysis, a conserved GSCGS motif is identified to form consecutive three turns and plays a crucial role for 3CLPro. In detail, guaranteed by S147 and S150, Turn I and Turn III fix the substrate active site in an appropriate location and abate the vibrating frequency of active site, which contribute to the attack of active C148. Additionally, maintained by an external hydrogen bond interaction between the motif and N28, stable Turn II and G149 are conducive to the free rotation of C148 side chain such that the C148 thiol approached the substrate active site (Figure B). Alternatively, mediated by the conserved water, a remote interaction between dyad and PNCC (R40-Y54-D190) is confirmed to smoothly accelerate the catalysis, as the PNCC can temporarily stabilize the protonated H41 (Figure C). Since the PNCC is located at the surface of the opposite side of the active center, it may be a fascinating allosteric site to design 3CLPro inhibitors via interfering with the partial charge of the PNCC (Figure D). To ensure the accuracy of the comprehensive catalytic mechanism we determined, an analog catalytic mechanism is checked with SARS-CoV 3CLPro. Importantly, the critical residues are highly conserved among diverse coronaviruses. The mechanisms reported here may be common among coronavirus 3CLPro and contribute to provide the foundation for the further investigation of SARS-CoV-2 3CLPro (Figure S20). More significantly, in view of the fact that coronavirus is effortless to evolve, investigation on the general catalytic mechanism of coronavirus 3CLPro is meaningful to face further virus variation. According to the catalytic mechanism, the distinctions of two protease on the catalytic characteristics are explicitly investigated. In brief, the essential secondary structure dominated by T174 of MERS-CoV 3CLPro is confirmed to be responsible for the inferior substrate affinity of MERS-CoV 3CLPro than SARS-CoV 3CLPro. In catalysis, the more stably conserved water anchored by Q167 causes more tight remote interaction and leads to the more efficient excision on the substrate in MERS-CoV 3CLPro. Moreover, the insight into the catalytic mechanism of MERS-CoV 3CLPro and SARS-CoV 3CLPro can effectively guide mutation studies to improve the catalytic potency of the protease. It also meaningfully directs the establishment of powerful hydrogen bond interaction between inhibitor and significant Q195 of MERS-CoV 3CLPro to improve the inhibitory activity. In summary, the cumulative experimental results clearly reveal the comprehensive molecular catalytic mechanism of MERS-CoV 3CLPro and SARS-CoV 3CLPro. On the basis of the comprehensive mechanism, the distinction of two proteases on the catalytic efficiency is investigated and effective applications are energetically explored. The results presented should provide a solid foundation for understanding the enzymatic mechanism, and help efforts in rational de novo protein design and antiviral drug design.
  28 in total

1.  Characterization of Self-Processing Activities and Substrate Specificities of Porcine Torovirus 3C-Like Protease.

Authors:  Shangen Xu; Junwei Zhou; Yingjin Chen; Xue Tong; Zixin Wang; Jiahui Guo; Jiyao Chen; Liurong Fang; Dang Wang; Shaobo Xiao
Journal:  J Virol       Date:  2020-09-29       Impact factor: 5.103

2.  The structure-based design of peptidomimetic inhibitors against SARS-CoV-2 3C like protease as Potent anti-viral drug candidate.

Authors:  Hao Wang; Rongjuan Pei; Xin Li; Weilong Deng; Shuai Xing; Yanan Zhang; Chen Zhang; Shuai He; Hao Sun; Shuqi Xiao; Jin Xiong; Yecheng Zhang; Xinwen Chen; Yaxin Wang; Yu Guo; Bo Zhang; Luqing Shang
Journal:  Eur J Med Chem       Date:  2022-05-13       Impact factor: 7.088

3.  Maturation of the SARS-CoV-2 virus is regulated by dimerization of its main protease.

Authors:  Shreyas Kaptan; Mykhailo Girych; Giray Enkavi; Waldemar Kulig; Vivek Sharma; Joni Vuorio; Tomasz Rog; Ilpo Vattulainen
Journal:  Comput Struct Biotechnol J       Date:  2022-06-14       Impact factor: 6.155

4.  Discovery of 2-thiobenzimidazoles as noncovalent inhibitors of SARS-CoV-2 main protease.

Authors:  Davide Deodato; Nadeem Asad; Timothy M Dore
Journal:  Bioorg Med Chem Lett       Date:  2022-06-24       Impact factor: 2.940

5.  Perspectives on SARS-CoV-2 Main Protease Inhibitors.

Authors:  Kaifu Gao; Rui Wang; Jiahui Chen; Jetze J Tepe; Faqing Huang; Guo-Wei Wei
Journal:  J Med Chem       Date:  2021-11-19       Impact factor: 8.039

6.  Covalent narlaprevir- and boceprevir-derived hybrid inhibitors of SARS-CoV-2 main protease.

Authors:  Daniel W Kneller; Hui Li; Gwyndalyn Phillips; Kevin L Weiss; Qiu Zhang; Mark A Arnould; Colleen B Jonsson; Surekha Surendranathan; Jyothi Parvathareddy; Matthew P Blakeley; Leighton Coates; John M Louis; Peter V Bonnesen; Andrey Kovalevsky
Journal:  Nat Commun       Date:  2022-04-27       Impact factor: 17.694

7.  An insight into the interaction between α-ketoamide- based inhibitor and coronavirus main protease: A detailed in silico study.

Authors:  Snehasis Banerjee
Journal:  Biophys Chem       Date:  2020-11-28       Impact factor: 2.352

8.  Unveiling the molecular mechanism of SARS-CoV-2 main protease inhibition from 137 crystal structures using algebraic topology and deep learning.

Authors:  Duc Duy Nguyen; Kaifu Gao; Jiahui Chen; Rui Wang; Guo-Wei Wei
Journal:  Chem Sci       Date:  2020-09-30       Impact factor: 9.825

9.  Asymmetric dynamics of dimeric SARS-CoV-2 and SARS-CoV main proteases in an apo form: Molecular dynamics study on fluctuations of active site, catalytic dyad, and hydration water.

Authors:  Shinji Iida; Yoshifumi Fukunishi
Journal:  BBA Adv       Date:  2021-06-20

10.  New insights into the catalytic mechanism of the SARS-CoV-2 main protease: an ONIOM QM/MM approach.

Authors:  Henrique S Fernandes; Sérgio F Sousa; Nuno M F S A Cerqueira
Journal:  Mol Divers       Date:  2021-06-24       Impact factor: 3.364

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.