Literature DB >> 34784198

Unraveling the Molecular Mechanism of Recognition of Human Interferon-Stimulated Gene Product 15 by Coronavirus Papain-Like Proteases: A Multiscale Simulation Study.

Rajarshi Roy1, Nisha Amarnath Jonniya1, Sayan Poddar1, Md Fulbabu Sk1, Parimal Kar1.   

Abstract

The papain-like protease (PLpro) of the coronavirus (CoV) family plays an essential role in processing the viral polyprotein and immune evasion. Additional proteolytic activities of PLpro include deubiquitination and deISGylation, which can reverse the post-translational modification of cellular proteins conjugated with ubiquitin or (Ub) or Ub-like interferon-stimulated gene product 15 (ISG15). These activities regulate innate immune responses against viral infection. Thus, PLpro is a potential antiviral target. Here, we have described the structural and energetic basis of recognition of PLpro by the human ISG15 protein (hISG15) using atomistic molecular dynamics simulation across the CoV family, i.e., MERS-CoV (MCoV), SARS-CoV (SCoV), and SARS-CoV-2 (SCoV2). The cumulative simulation length for all trajectories was 32.0 μs. In the absence of the complete crystal structure of complexes, protein-protein docking was used. A mutation (R167E) was introduced across all three PLpro to study the effect of mutation on the protein-protein binding. Our study reveals that the apo-ISG15 protein remains closed while it adopts an open conformation when bound to PLpro, although the degree of openness varies across the CoV family. The binding free energy analysis suggests that hISG15 binds more strongly with SCoV2-PLpro compared to SCoV or MCoV. The intermolecular electrostatic interaction drives the hISG15-PLpro complexation. Our study showed that SCoV or MCoV-PLpro binds more strongly with the C-domain of hISG15, while SCoV2-PLpro binds more favorably the N-domain of hISG15. Overall, our study explains the molecular basis of differential deISGylating activities of PLpro among the CoV family and the specificity of SCoV2-PLpro toward hISG15.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34784198      PMCID: PMC8610008          DOI: 10.1021/acs.jcim.1c00918

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


Introduction

The current worldwide outbreak by the severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2 or SCoV2) caused the spread of the disease (COVID-19) on an unprecedented global scale.[1] It has emerged as a highly contagious novel coronavirus (CoV).[2,3] According to the World Health Organization (WHO), more than 248 million people have been infected worldwide, including more than 4.9 million deaths, as of November 3, 2021. Consequently, WHO declared the situation a global health emergency. The novel CoV is considered far more contagious and catastrophic than other flu viruses, including SARS-CoV (SCoV) and MERS-CoV or MCoV (Middle East respiratory syndrome coronavirus).[4] SCoV and MCoV caused the outbreak in 2003 and 2012, respectively. The scientific community worldwide is making significant efforts to explore diverse mechanisms targeting viral replication via repurposing approved drugs using experimental and computational approaches.[5,6] However, to date, no effective therapeutic agent has been approved against COVID-19 except a few tested drugs like chloroquine derivatives, remdesivir, azithromycin, favipiravir, ivermectin, etc.[7−10] SCoV2 is a positive-stranded RNA Betacoronavirus.[11] Usually, these viruses harvest polypeptides that promote proteolytic cleavage in their life cycle. Two crucial proteases, namely the main protease (Mpro) and papain-like protease (PLpro), make the functional replicase complex, leading to viral spread.[12] Meanwhile, various efforts have been made to explore these proteases in combating this noxious COVID-19.[13−15] Great attention is given to Mpro because of its post-translational modification activity of replicase polyproteins.[14] It comprises three domains and a catalytic dyad (Cys145 and His41). On the other hand, PLpro includes a catalytic triad (C111, H272, D286 in SCoV2). The PLpro of SCoV2 and SCoV share ∼83% similarity, while only ∼31% similarity is observed between PLpro of SCoV2 and MCoV (see Figure A). Apart from cleaving polyprotein, PLpro has additional deubiquitination and deISGylating activities that promote virus replication. Deubiquitinating enzymes (DUBs) can remove the post-translational modification Ub from target proteins.[16] Similarly, deISGylating enzymes reverse the post-translational modification of the Ub-like (UbI) protein interferon-stimulated gene product 15 (ISG15) from cellular proteins.[17] The SCoV2/PLpro subdomains comprise UBI, thumb, finger, and palm (see Figure B). The flexible BL2 loop recognizes the substrate. It undergoes large conformational changes from close (unliganded) to open (liganded).[18] During viral infection, the first line of defense against viral pathogens is the innate response, involving the elevated expression of IFN stimulated genes (ISG15) via IFN-cytokine signaling pathways. PLpro cleaves ubiquitin and ISG15 from the host cell proteins that aid CoVs to circumvent the host innate immune responses.[19] The dual functionality of PLpro makes it an attractive antiviral drug target.[20−22]
Figure 1

(A) Multiple sequence alignment (MSA) of PLpro’s of MERS-CoV, SARS-CoV, and SARS-CoV2 using Clustal Omega. Meaning of different symbols used in the MSA are: “*” indicates perfect alignment, “:” indicates a site belonging to a group exhibiting strong similarity, and “.” indicates a site belonging to a group exhibiting weak similarity; (B) the cartoon representation of the PLpro of SCoV2, showing its different regions; thumb (green), finger (blue), palm (pink), UBI domain (brown), and BL2 loop (red);(C) the cartoon representation of the full-length hISG15, showing its N- (purple) and C-domain (cyan) and the connecting hinge region (gold);(D) the complex structure of the SCoV2-PLpro’s (orange) with the full-length hISG15 (sea green).

(A) Multiple sequence alignment (MSA) of PLpro’s of MERS-CoV, SARS-CoV, and SARS-CoV2 using Clustal Omega. Meaning of different symbols used in the MSA are: “*” indicates perfect alignment, “:” indicates a site belonging to a group exhibiting strong similarity, and “.” indicates a site belonging to a group exhibiting weak similarity; (B) the cartoon representation of the PLpro of SCoV2, showing its different regions; thumb (green), finger (blue), palm (pink), UBI domain (brown), and BL2 loop (red);(C) the cartoon representation of the full-length hISG15, showing its N- (purple) and C-domain (cyan) and the connecting hinge region (gold);(D) the complex structure of the SCoV2-PLpro’s (orange) with the full-length hISG15 (sea green). ISG15 is an interferon-regulated gene that regulates various cellular pathways involved in the host immune responses against different viral infections.[23−25] ISG15 contains two ubiquitin-like domains connected by a hinge or linker, making a linear dimeric protein[26] (see Figure C). Relative to ubiquitin, ISG15 has low cross-species conservation, ranging from 58% to as low as 35% in mammals.[26] Overall, CoVs-PLpro has been reported with robust deubiquitinating activities.[17,27] However, few studies have unveiled its importance in deISGylating activities.[28,29] The crystal structures of PLpro with the C-domain of ISG15 have been reported for all major coronavirues.[30−32] However, it is important to consider the full-length hISG15 since both domains of hISG15 interact with PLpro.[33] Herein, we study the structural dynamics of the full-length hISG15 and its interaction with the PLpro of SCoV, MCoV, and SCoV2 via atomistic molecular dynamics (MD) simulations in conjunction with the molecular mechanics/Poisson-Boltzmann surface area (MM/PBSA) scheme. First, the complex structure of full-length hISG15 with CoVs-PLpro was prepared and analyzed at the molecular level, followed by MD simulations. In addition, to elucidate their molecular recognition and specificity among CoVs, we performed free energy calculations. Overall, the current study provides insights into the molecular basis of CoVs-PLpro deISGylating activities, especially for the novel SARS-CoV-2 specificity toward hISG15 that may be exploited in targeting SARS-CoV2-PLpro to combat COVID-19. To our knowledge, this is the first study where the structural dynamics of the full-length hISG15 protein against all the major CoVs were explored in detail.

Materials and Method

System Preparation

The complex structure of PLpro with the C-terminal domain of hISG15 for MCoV and SCoV was obtained from Protein Data Bank (PDB) with an accession code of 5W8U[30] and 5TL6,[32] respectively. However, the crystal structure of the full-length Apo-hISG15 is already available (PDB ID: 1Z2M[34]), and the same was downloaded. Next, we superimposed the coordinates of the full-length Apo-hISG15 with the available complexes of MCoV and SCoV to obtain the final complex structures. In the case of SCoV2, the structure of PLpro was obtained from the PDB database (PDB ID 7JRN[31]), which was further superimposed with the constructed SCoV-PLpro-hISG15 complex to model the SCoV2-PLpro-hISG15 complex. (Figure D). We also estimated the structural differences between our construct and the available partial crystal structure by superimposition, which yields no significant differences. The root mean square deviation (RMSD) difference is below 0.5 Å, suggesting a good initial complex structure to run the MD simulation. Further, we considered one charge-flip mutation R167E that was reported to be 20 times less efficient in hydrolyzing ISG15 by SCoV-PLpro.[32] Hence, we studied the effect of the same mutation in SCoV2, which closely resembles SCoV. Also, the impact of the mutation was investigated for the diverse family of CoV, i.e., MCoV. In the current study, six complex systems of PLpro-hISG15 were prepared for MD simulations: three wildtypes (WTs) (SCoV, SCoV2, and MCoV) and three mutants (SCoV(MT), SCoV2(MT), and MCoV(MT)). Besides, in order to provide structural dynamics of hISG15 upon complexation with PLpro, we performed MD simulations of Apo-hISG15.

Simulation Protocol

All simulations were conducted using the pmemd.cuda module of AMBER18.[35] Missing hydrogens in the crystal structures were added using the LEaP module of Ambertools19 followed by neutralizing all systems by adding a proper amount of counterions. Prior to the system preparation, protonation states of the charged residues were determined using the Propka 3.1 webserver.[36] All systems were then solvated into periodic octahedron TIP3P water box,[37] keeping a buffering distance of 10 Å from all sides. The Amber ff14SB force field[38] was used for both Apo and complex forms for the simulations. A 2.0 fs time step was fixed for all these seven simulation systems. The SHAKE algorithm[39] was used to constraint the bond, including hydrogen atoms. The particle-mesh Ewald summation (PME)[40] was used to compute long-range interaction with a 10 Å nonbonded cut-off. At first, systems were subjected to energy minimization using 500 steps of steepest descent followed by 500 steps of the conjugant gradient algorithm. Amino acids were kept fixed in this step using the force constant of 2.0 kcal mol–1 Å–2. Secondly, in the absence of restraint, the entire systems were minimized on the solute using the steepest descent algorithm followed by the conjugant gradient algorithm. The systems were then gradually heated from 0 to 300 K in the NVT ensemble, where protein atoms were fixed using a force constant of 2.0 kcal mol–1 Å–2. The temperature was maintained using a Langevin thermostat.[41] The equilibration was conducted for 1 ns in the NPT ensemble without any restraint on the systems. Finally, both wild type and mutant complexes underwent a 2 × 2 μs production run. For the Apo-hISG15 system, the production simulation was conducted for 2 × 4 μs. Overall, 200,000 and 400,000 snapshots were generated for complexes and Apo, respectively. All analyses, such as RMSD, solvent accessible surface area (SASA), the radius of gyration (Rg), and other analyses were performed using the Cpptraj module of AmberTools19.[42] For hydrogen-bond calculations, a distance cut-off ≤3.5 Å and an angle cut-off of ≥120° were used.[43]

Principal Component Analysis

The principal component analysis (PCA) scheme has been applied to identify the lowest energy states in the collected structural and energetics MD trajectory data.[44] It is a dimensional reduction method and we were able to extract the dominant modes from the whole trajectory. In this technique, the covariance matrix obtained from the fluctuation of amino acid residues is diagonalized, and their motion is represented in terms of their eigenvectors and eigenvalues. The eigenvectors represent the direction of motions, while their respective eigenvalues give the amplitude of motions. The original data projected on the eigenvectors yield the principal components (PCs). The covariance matrix was calculated using the eq :where X and X are the Cartesian atomic coordinates of Cα atoms i and j, while and denote the mean of ith or jth atoms over the ensemble. Also, to investigate the dynamics in greater detail, the hinge region was subjected to dihedral PCA (dPCA), which uses the dihedral angles of each peptide bond of a selected region of interest. Further, we also generated a free energy surface using the first to PC components for both the PC analysis, a common technique to identify the potential barrier between the different conformational states. This was calculated using the formula ΔG = kBTln(ρ), where kB is the Boltzmann constant, T is the temperature, and ρx is the probability density of the geometric coordinate x.

Binding Free Energy Analysis

The effective interaction between PLpro and hISG15 in terms of free energy values was calculated using the Molecular Mechanics Poisson-Boltzmann Surface Area (MM/PBSA) scheme.[45−49] The binding free energy (ΔGbind) was estimated using the MMPBSA.py script of the AMBER package. This method is less accurate compared to other computationally expensive methods like free energy perturbation and thermodynamic integration. However, this scheme offers a good compromise between speed and accuracy and yields good qualitative agreements to the experimental observations.[50] The MM-PBSA protocol was already discussed in detail in our previous studies.[51−54] Here we used the same protocol for calculating the binding free energy. The following equations estimate the binding free energy in the MM/PBSA scheme:[55] where ΔEinternal, ΔGsolv, and TΔS represent total internal energy, desolvation free energy, and conformational entropy, respectively. The total internal energy (ΔEinternal) still further consists of ΔEcov (bond, dihedral, and angle), ΔEelec (electrostatic), and ΔE (van der Waals) and is represented by the following equation, while the desolvation free energy composed of polar solvation (ΔG) and the nonpolar solvation energy (ΔG) is represented by, The polar part of the solvation energy ΔGpol was calculated using the linearized Poisson-Boltzmann model with the dielectric constant of 1.0 and 80.0 for solutes and solvent, respectively, while the nonpolar part was computed using the surface area. A total of 10,000 conformations was used to estimate the binding free energy from the last 1 μs timescale in the equal interval. The entropic contribution was not considered due to the high computational cost. Further, at the residual level, the decomposition of the total binding free energy, which comprises van der Waals, electrostatic, polar, and nonpolar terms, was conducted by the Molecular Mechanics Generalized-Born Surface Area method.

Residual Network Analysis of Protein

The protein contact network or residual interaction network can represent the protein structure in the form of networks. In the present study, the network analysis of protein structure (NAPS) server was used to construct protein networks.[56] The last snapshots from the production simulations were used as input. The network can be drawn based on Cα, Cβ, or centroids. Here, nodes in a network denote the Cα atom pairs, and edges between Cα-Cα residue pairs are drawn based on the threshold distance of ∼7 Å. Various network-based hydrophobic, hydrophilic, or charged contacts were analyzed to identify the essential residual interactions in the network. The long-range interaction in a protein network can be found using the shortest-path parameters. The Floyd–Warshall[57] algorithm was used to calculate the shortest path, which defines the minimum number of nodes required to reach from one node to the other.[58,59]

Results and Discussion

Apart from the proteolytic activity, PLpro has deubiquitinating and deISGylating activities that make it a potential antiviral target. Hence to understand and explore the underlying interaction mechanism between PLpro and hISG15, we have provided an atomistic insight into the structural and binding free energy analysis using the large-scale MD and /MM-PBSA scheme. Moreover, in-depth analysis of their molecular recognition will help design inhibitors that could inhibit the interaction between SCoV2-PLpro and hISG15.

Structural Dynamics of Systems

The convergence of simulations can be noted from the time evolution of the RMSDs of protein’s backbone atoms, as shown in Supporting Information Figures S1–S3. Further, to confer the conformational dynamics states of hISG15 in its Apo and complex form, both in the WT and the mutant complexes of CoVs-PLpro, the combined probability distributions of the respective RMSDs are shown in Figure CA. The distinctive peaks from the plots correspond to the most probable conformational states of a system. Figure A reveals that the charge-flip mutation (R167E) systems exhibited many distinct conformations at varying RMSD values than their respective WT, like for the SCoV2(MT) (at ∼5 and ∼9 Å) and MCoV(MT) (at ∼7.3 and ∼8.2 Å) complexes. The SCoV complex showed a quite high stable conformation at ∼5 Å, and its mutant showed nearly two equiprobable states. However, the WT-MCoV complex showed the most conformational variability. Furthermore, to explore where the conformational diversity comes in the complex, the individual RMSD profile of PLpro and hISG15 from each complex was monitored and is shown in Figure B,C. It reveals that the PLpro of each complex remains relatively stable while most of the deviation comes from hISG15.
Figure 2

Distribution of RMSD of (A) complexes; (B) PLpro; and (C) hISG15. Distribution of radius of gyration (RoG) of (D) complexes; (E) PLpro; and (F) hISG15.

Distribution of RMSD of (A) complexes; (B) PLpro; and (C) hISG15. Distribution of radius of gyration (RoG) of (D) complexes; (E) PLpro; and (F) hISG15. The conformational dynamics of hISG15 were explored by comparing Apo-hISG15 with hISG15 from each complex, and the distribution of RMSD is shown in Figure C. It depicts that the Apo-hISG15 is quite dynamic, as seen from the large RMSD value peaks at ∼9 and ∼12.5 Å. In contrast, the highest probable peaks obtained for WT complexes were around 2.5 to 3.0 Å. The SCoV2(MT) and SCoV(MT) showed another likely conformational state at ∼5 Å. However, MCoV(MT) showed the most conformational dynamic states of the hISG15 close to Apo-hISG15, suggesting the more dynamic behavior of MCoV(MT). In addition, the time-series RMSD of Apo-hISG15 for the entire 4 × 4 μs is shown in Figure S3. It depicts that the Apo form of hISG15 is highly dynamic in nature. The individual N-domain of hISG15 exhibits more deviations than the C-domain, as shown in Figure S3B,C, respectively. Overall, the RMSD trajectory analysis reveals that the Apo-ISG15 is quite dynamic, and it gets stabilized upon interactions with the PLpro’s of CoVs. Moreover, the charge-flip mutation in PLpro also has a significant impact that affects these interactions’ stabilization. Overall, different CoV-PLpro behave differently upon interaction with hISG15, and among them, the MCoV-PLpro-hISG15 complex was found to be the most dynamic system. Next, we estimated the radius of gyration (RoG or Rg) to measure the structural compactness of each protein–protein complex. The distribution of RoG for each complex, PLpro, and hISG15 is displayed in Figure D–F. In agreement with the RMSD plot, the most significant dynamic states were observed for MCoV compared to others as the distribution is characterized by multiple peaks (Figure D). The RoG of PLpro from each complex shows similar conformational states between the WT and their mutants for each complex, except for MCoV, which exhibited two equiprobable states. However, the major difference in the angular motion was observed for the hISG15, as shown in Figure F. Compared to others, the lower RoG of Apo-hISG15 suggests that, alone hISG15 contributed to low conformational shifts concerning N- and C-domains. In contrast, complexes of PLpro with hISG15 exhibited a large value of RoG ∼18–19 Å, which could be attributed to conformational shifts resulting in changes in the N- and C-domain orientation of hISG15 upon interaction with PLpro during the MD simulations. We also measured the SASA of all six complexes as well as for Apo-hISG15, and shown in Table S1. The average SASA values varied between 21,721.49 Å2 (SCoV2) and 22,733.31 Å2 (MCoV) for all complexes. The most remarkable change in the SASA value after the mutation was observed in the case of SCoV2 (489.57 Å2). However, in cases of SCoV and MCoV, it decreased by almost 350 Å2. An increase in the SASA value for SCoV2 after mutation indicates the lesser compactness of the structural interface between PLpro and hISG15, which further influences the protein–protein interaction. On the other hand, Apo-hISG15 had a SASA value of 8520.82 Å2, suggesting ∼500–700 Å2 surface area is occupied by PLpro in the process of complexation. To estimate the dynamical behavior of hISG15 in the free and bound forms, the RMSFs of Cα atom for PLpro and hISG15 were calculated over combined trajectories and shown in Figure BA. In the case of PLpro, a similar pattern of fluctuation was observed for all cases except MCoV and SCoV2(MT). A relatively higher RMSF value was observed in the neighboring region of the mutation site (R166E), especially in the case of SCoV2(MT) (Figure A). The palm’s flexibility is heavily increased in the MCoV’s PLpro, which was subsequently reduced after the point mutation. Compared to all other domains, the thumb domain stays relatively less flexible, which is also observed in other studies.[60,61] Both N- and C-domains of hISG15 in the unbound state showed similar fluctuation, which reduced after the complex formation in both SCoVs and SCoV2, showing a compact interaction compared to other cases.
Figure 3

(A) Root-mean-square-fluctuations (RMSF) of PLpro of each complex system; (B) RMSF of hISG15 of each complex system along with the Apo hISG15.

(A) Root-mean-square-fluctuations (RMSF) of PLpro of each complex system; (B) RMSF of hISG15 of each complex system along with the Apo hISG15. Interestingly, both domains of hISG15 in the SCoV2 complex show higher rigidity, which is entirely disturbed by the mutation. Comparing all the complexes, hISG15 with SCoV complexes stays in the most stable form. hISG15 in both MCoV and MCoV(MT) shows a higher degree of fluctuation, and also differential flexibility was observed in terms of both N- and C-domains.

Dynamics of hISG15 in the Free and Complex State

The N- and C-domains of hISG15 induce intrinsic flexibility, and hence a critical inspection of its domain motion with respect to each other is needed. The structural stability of both domains was calculated for Apo and complexes by estimating the RMSD distribution. Both the domain’s RMSD was stable throughout our simulation length (Figures S3B,C, S4, and S5). So the dynamics of hISG15 are mostly contributed by the hinge region and interdomain distance. To estimate the two domains’ motion, we calculated the hinge angle for all complexes and in Apo hISG15 and shown in Figure . For calculating the angle, the center of mass of the N-domain, the center of mass of the median amino acids, and the center of mass of the C-domain were used as shown in the red ball and stick model image mentioned above. The free simulation of Apo-hISG15 yields a large conformational basin, including a closed to opened (high energy) structure. However, the closed structure is favorable in the free state (∼70°) (Figure A). Upon binding with PLpro, the global free energy minimum conformation of hISG15 shifted from closed to semiopened or opened state in all wild type and mutant complexes, and the conformational basin shrunk due to the protein–protein interaction. In SCoV2, a mutation in PLpro leads to broadening of the hinge angle, whereas in SCoV(MT), the opposite effect was observed. The global minimum of hISG15 for the complex of SCoV2 was observed around 97° (Figure C) surrounded by a high energy barrier, which got flattened (Global minimum: 111°) in the case of a mutant variant of PLpro (Figure D). On the contrary, SCoV(MT) moves to a semiopened structure (82°) with a higher deviation from its wild type complex (Figure B). In MCoV, both its WT and mutant complexes showed similar trends as SCoV. Thus, the point mutation in the PLpro structure in all three viruses influences the complex formation with hISG15, which changes the two domains’ dynamics.
Figure 4

Potential of mean force of the hinge angle calculated at 300 K. Angle is constructed between the center of mass of N-domain, the center of the hinge region, and the center of mass of C-domain as shown in red dots in the 3D conformation of hISG15. (A) Closed structure of hISG15 in free simulation. (B) and (C) Semiopened structure of hISG15 in complex simulation with SCoV(MT) and SCoV2 PLpro, respectively. (D) Open structure of hISG15 in SCoV2(MT) simulation.

Potential of mean force of the hinge angle calculated at 300 K. Angle is constructed between the center of mass of N-domain, the center of the hinge region, and the center of mass of C-domain as shown in red dots in the 3D conformation of hISG15. (A) Closed structure of hISG15 in free simulation. (B) and (C) Semiopened structure of hISG15 in complex simulation with SCoV(MT) and SCoV2 PLpro, respectively. (D) Open structure of hISG15 in SCoV2(MT) simulation. Along with the interdomain movement, we also estimate the two domains’ relative motion with respect to PLpro. For both domains, the center of mass distance between PLpro and the respective domain was calculated. The combined probability distributions are shown in Figure , along with several representative structures of the complexes. The C-domain mostly contributes to the binding of hISG15 with the finger and palm region of PLpro of CoV. N-domain, which is hanging through the hinge region, also makes loose contact with the thumb region. By analyzing the respective distance of each domain of hISG15 with PLpro, one may explore the behavior of hISG15 interaction with PLpro. As depicted by the distribution plot (Figure B), for all cases, the hISG15-domain to PLpro distance was decreased by 5 Å in the C-domain compared to N-domain (Figure A), suggesting strong interactions of PLpro via the C-domain of hISG15. Depending upon the types of PLpro and its mutation, the dynamicity of hISG15 was changed. In SCoV2, two similar populations were observed where both configurations were separated by 5 Å (Figure A). However, in the mutation (SCoV2(MT)), two conformations were observed, but both were shifted closer to PLpro. A similar observation was found in cases of SCoV and SCoV(MT). Contrary to the SCoV2’s bimodal distribution, SCoV and MCoV prefer a unimodal distribution with a higher occupancy in the case of mutation (see Figure C,E). In WTs, the C-domain of hISG15 was comparatively stable in most cases except for SCoV2, where three distinct peaks were observed at 22, 24.5, and 28.3 Å. It indicated a high flexibility level, suggesting that the sampled conformations occupied both close and far states from the PLpro (Figure H). However, the mutant SCoV2(MT) showed that the C-domain moved far from the PLpro with a broader distribution minimum of around 27.5 Å. In the case of SCoV, steady-state interactions were observed between both domains and PLpro, which lie parallel with the PLpro (Figure G). Its mutant variant (SCoV(MT)) exhibited broader distribution, suggesting that it moved closer to PLpro. Almost close contact was also observed with the WT-MCoV complex (Figure F) and even in its mutant MCoV complex. It suggests that the dynamic rearrangement of the respective domain of hISG15 occurs upon interaction with the PLpro of CoVs.
Figure 5

(A) Distribution of the center of the mass distance between PLpro and (A) N-domain of hISG15 and (B) C-domain of hISG15. Structures of several distinct peaks are shown in ribbon representation. (C) MCoV(MT); (D) SCoV2; (E) SCoV(MT); (F) MCoV; (G) SCoV; (H) SCoV2.

(A) Distribution of the center of the mass distance between PLpro and (A) N-domain of hISG15 and (B) C-domain of hISG15. Structures of several distinct peaks are shown in ribbon representation. (C) MCoV(MT); (D) SCoV2; (E) SCoV(MT); (F) MCoV; (G) SCoV; (H) SCoV2.

PCA of hISG15 in Apo and Complexes

The trajectories of hISG15 from its free simulation and complexes were used in the PCA by taking the Cα atom into account. The eigenvalues corresponding to eigenvectors were calculated and displayed in decreasing order and shown in the Supporting Information (Figure S6). The first few eigenvectors described the collective motion of significant fluctuations, and movements were not the same for all six complexes and Apo-hISG15. The first three PCs contained 84.30, 90.07, and 76.09% of the overall motion for the complex of MCoV, SCoV, and SCoV2. In contrast, the motions were increased in mutant complexes as calculated to be 92.45, 92.95, and 88.04%, respectively. In the Apo simulation, 86.62% of the fluctuation was estimated by the first three PCs. Subsequently, we have constructed a two-dimensional (2D) free energy landscape (FEL) for each system, taking the first two PCs (PC1, PC2) as collective variables, and shown in Supporting Information Figure S7. In all cases, several minima were observed, which indicates the intrinsic flexibility of hISG15. However, the sampling space was reduced in the complex compared to Apo. Still, several low energetic structures were observed for these complexes. In Apo hISG15, three distinct minima separated by almost 3 kcal/mol were observed (Figure S7A). In SCoV2, upon mutation in PLpro, the sampling space increased, and the global conformation position also shifted (Figure D & G). Only in the case of MCoV(MT), we obtained several minima sets in two groups separated by a high energy barrier (more than 4 kcal/mol). It leads to a highly variable structure comparable to the Apo state. The N-domain and C-domain’s internal structural behavior was almost stable as we already discussed (Figures S3B,C, S4, and S5), so the Cartesian coordinate-based PCA analysis will not be able to capture the entire dynamics. The linker loop between both domains is mostly responsible for the dynamics of hISG15. Hence, we performed dPCA of the hinge region of hISG15 as shown in Figure . This enables us to investigate the dynamics behind the hinge region. The sampled conformational space depicted for Apo hISG15 differs from those of complexes. Also, the difference can be noticed between WT and the corresponding mutant complex. To visualize the structural changes, the k-mean clustering was done corresponding to the hinge region and shown along with the dPCA plot (Figure ). Ramachandran plots were also generated for all significant structures obtained from the dPCA analysis (see Supporting Information Figure S8) to support the structural validity of the same. In the case of hISG15, Figure shows four prominent structural changes in the hinge loop from a bent loop (47.5%) to a linear structure with a lesser probability (11.2%). Compared with the SCoV2(MT) complex dPCA plot, SCoV2 (Figure D) showed higher conformational space variations due to small bend or twist in the loop. In the mutant complex (SCoV2(MT)), hISG15 exhibited two distinct states, mostly by an angular bend near the domain region. This loop structure behavior agrees with the angular distribution of hinge angle (Figure ), where SCoV2 and SCoV2(MT) complexes showed steep and flattened free energy well, respectively. The SCoV complex showed that the two minima conformations were separated by a high energy barrier. At the same time, its mutant form (SCoV(MT)) exhibited that their two conformational states were approachable. A higher degree of variation was observed in the MCoV complex, as agreed with the hinge angle distribution exhibiting broader free energy well. Comparably, MCoV(MT) showed distinct conformations among all other complexes, such as the secondary structure formation in the second-highest conformational cluster (26.3%). Overall, as observed for all complexes, hISG15 exhibits multiple minima, separated by a high energy barrier implying its dynamicity after the complex formation. So, the differential flexibility of hISG15 can be a great measure to estimate the interaction profile and the mechanism of the immune response against a different strain of CoV pathogenesis.
Figure 6

2D FEL generating using the dPCA of the hinge region of hISG15 of each complex obtained from the MD simulations. (A) Apo; (B) MCoV;(C) SCoV; (D) SCoV2; (E) MCoV(MT); (F) SCoV(MT); (G) SCoV2(MT).

2D FEL generating using the dPCA of the hinge region of hISG15 of each complex obtained from the MD simulations. (A) Apo; (B) MCoV;(C) SCoV; (D) SCoV2; (E) MCoV(MT); (F) SCoV(MT); (G) SCoV2(MT).

Binding Free Energy between PLpro and hISG15

To further evaluate the relative binding free energy between PLpro and hISG15 for MCoV, SCoV, and SCoV2, ΔGbind was calculated based on the MM/PBSA scheme. Moreover, the value of their binding free energy of WT was compared with their respective mutants (R167E). As shown in Table , the various components of the total binding free energy indicate that the intermolecular electrostatic interactions (ΔEele), van der Waals interactions (ΔEvdW), and nonpolar solvation free energy (ΔGnp) favor the binding of PLpro with hISG15. In contrast, the polar solvation free energy (ΔGpol) disfavors the complexation.
Table 1

Binding Energy and Its Various Components for the PLpro of MCoV, SCoV, and SCoV2 with the hISG15 Calculated Using the MM/PBSA Schemea

complexΔEvdWΔEelecΔGpolΔGnpΔEMMbΔGsolvcΔGTotald
MCoV–98.19 (0.11)–206.62 (0.51)247.62 (0.46)–12.81 (0.01)–304.81 (0.55)234.81 (0.45)–70.01 (0.15)
MCoV(MT)–113.05 (0.11)–417.66 (0.83)454.26 (0.72)–14.44 (0.01)–530.71 (0.90)501.98 (0.80)–90.97 (0.15)
SCoV–90.56 (0.14)–229.38 (0.61)256.44 (0.59)–12.43 (0.01)–319.86 (0.65)244.01 (0.58)–75.85 (0.17)
SCoV(MT)–130.45 (0.10)–142.29 (0.62)221.68 (0.60)–14.17 (0.01)–272.74 (0.64)207.51 (0.59)–65.22 (0.19)
SCoV2–101.35 (0.16)–231.60 (0.78)263.92 (0.68)–12.09 (0.02)–332.95 (0.82)251.83 (0.67)–81.11 (0.19)
SCoV2(MT)–78.44 (0.18)–415.64 (0.65)432.75 (0.66)–10.04 (0.02)–494.08 (0.72)422.70 (0.65)–71.38 (0.13)

The standard error of means is given in parentheses.

ΔEvdW + ΔEelec.

ΔGnp + ΔGpol.

ΔEvdW + ΔEelec + ΔGnp + ΔGpol.

The standard error of means is given in parentheses. ΔEvdW + ΔEelec. ΔGnp + ΔGpol. ΔEvdW + ΔEelec + ΔGnp + ΔGpol. The SCoV2-PLpro binds more strongly to hISG15 as it exhibited relatively stronger affinity (ΔGbind = −81.1 kcal/mol) than other WT complexes. The predicted ΔGbind for SCoV and MCoV was −75.85 and −70.01 kcal/mol, respectively. It is evident from Table that both ΔEvdW and ΔEelec are more favorable in the case of SCoV2 compared to MCoV or SCoV. This leads to stronger binding between SCoV2-PLpro and hISG15 than the other two variants. Further, the obtained ΔEMM revealed that for each complex, the electrostatic attraction is the main force inducing the molecular complexation between PLpro and hISG15. ΔEelec ranges from −206.62 to −231.60 kcal/mol while ΔEvdW varies between −90.56 and −101.35 kcal/mol for all WT complexes. It suggests that ΔEelec is ∼2–3 times stronger than ΔEvdW in cases of WT complexes. For each mutant complex, ΔEelec is ∼1–5 times more favorable than ΔEvdW. Notably, the high electrostatic contribution toward the complexation arises from the positively charged arginine (R167) of hISG15. Next, we investigated the effect of mutation on the PLpro-hISG15 binding. In cases of SCoV(MT) (−65.22 kcal/mol) and SCoV2(MT) (−71.38 kcal/mol), the binding free energy ΔGbind decreases compared to the respective WT. In contrast, the mutation enhances the complexation between PLpro and hISG15 for MCoV(MT) (ΔGbind = −90.97 kcal/mol). It is to be noted that in the case of SCoV2(MT), ΔEelec (−415.64 kcal/mol) increases significantly, while ΔEvdW (−78.44 kcal/mol) decreases compared to WT (ΔEelec = −231.60 kcal/mol, ΔEvdW = −101.35 kcal/mol). However, the net electrostatic energy (ΔEelec + ΔGpol) is less unfavorable to the complexation in SCoV2(MT) than SCoV2. Consequently, the mutation-induced reduced binding affinity in SCoV2(MT) arises primarily due to weaker van der Waals interaction. On the other hand, in the case of SCoV(MT), the opposite trend is observed where ΔEvdW (−130.45 kcal/mol) increases while ΔEelec (−142.29 kcal/mol) decreases compared to WT (ΔEelec = −229.38 kcal/mol, ΔEvdW = −90.56 kcal/mol). Further, the net electrostatic energy (ΔEelec + ΔGpol) is more unfavorable to SCoV(MT) complexation than SCoV2. This again leads to an overall reduced affinity for SCoV(MT) compared to SCoV, and the weaker electrostatic interaction is mainly responsible for the reduced binding free energy. Finally, in the case of MCoV(MT), both ΔEelec and ΔEvdW contribute more favorably than WT, leading to an increased binding affinity between PLpro and hISG15. Thus, it justifies that mutation significantly impaired some interactions between PLpro and hISG15, affecting the overall binding affinity. This finding is in good agreement with the reported binding of SCoV-PLpro and hISG15.[32] In addition, our predicted results also agree with different host substrate preferences where it was reported that SCoV preferentially cleaves the ubiquitin chains, whereas SCoV2 predominantly targets ISG15, as seen with mISG15.[62]

Hot-Spot Residues of the PLpro and hISG15

To gain insight into the molecular basis of the specificity of SCoV2 with hISG15, we investigated the crucial residues involved in the binding (ΔGbindresidue) using the MM/GBSA method. The per-residue decomposition of binding energy was estimated and is shown in Figure . Here, we highlighted those residues having free energy values of ≥1.5 kcal/mol and listed them in Table S2. The positive and negative values of ΔGbindresidue represent the stabilization and destabilization, respectively. The more the negative value, the more crucial the residue is for the complexation.
Figure 7

Per-residual contribution to free energy of PLpro and hISG15 complexes of the WT and mutants of MCoV, SCoV, and SCoV2, respectively, obtained via MM/GBSA analysis.

Per-residual contribution to free energy of PLpro and hISG15 complexes of the WT and mutants of MCoV, SCoV, and SCoV2, respectively, obtained via MM/GBSA analysis. Notably, the number of critical residues from PLpro of SCoV2 was higher compared to other CoVs. It includes Tyr268, Phe69, Pro223, Thr225, Met208, Asn267, Thr74, Thr75, and Asn128, which showed primary interaction with the N-domain residues of hISG15 (mainly with Arg57 of hISG15). In the case of SCoV, the key residues include Val226, Leu76, Asp230, Arg167, Pro224, and Tyr269. This result agrees with the experimental findings of SCoV interactions with the C-domain hISG15 (chISG15).[1] Moreover, it was reported that SCoV showed preferential ubiquitin activity from its Leu76 mediated hydrophobic interaction with Ile44 on ubiquitin. Similarly, we observed from the MD simulations that Leu76 of SCoV favorably interacts with hISG15. This indicates that the hydrophobic residue at this site is crucial in SCoV for its ubiquitin and hISG15 binding.[62] In contrast, the corresponding Thr75 of SCoV2 exhibited comparatively less contribution toward binding with hISG15. In addition, it was earlier shown that the PLpro of SCoV and SCoV2 displayed a similar binding mode with the C-domain mISG15.[32] However, our simulations result predicted the difference in SCoV and SCoV2-PLpro binding behavior with the full-length hISG15. It was observed that the SCoV-PLpro binds more strongly with the C-domain of hISG15 (Arg153 and Arg87), while SCoV2-PLpro binds more favorably with the N-domain (Arg57). Similarly, the major contributing residues for MCoV are also listed in Table S2. However, interesting results were observed for the mutant SCoV2, where the significantly contributing residues of PLpro lose their interactions with hISG15, leading to an overall decrease in the number of contributing residues compared to SCoV and MCoV. Instead, it is noted that in SCoV2(MT), Arg57 from hISG15 contributes (−11.48 kcal/mol) more than its WT (−6.90 kcal/mol). This illustrates the difference in the interaction profile between PLpro and hISG15 upon mutation. Also, as reported by Daczkowski et al.,[32] the charge-flip mutation of R167E in SCoV-PLpro showed 20 times less efficiency than WT in hydrolyzing ISG15. Similarly, we found that Arg167 contributes to the WT PLpro-hISG15 complexation while replacing the positively charged arginine with negatively charged glutamate abolishes the contribution. It also supported our findings of lower binding affinity observed in SCoV(MT) than SCoV. Similarly, apart from the closely related family of SCoV and SCoV2, the diverse family of MCoV showed that the charge-flip mutation increases the number of different contributing residues, such as Arg87, Arg92 from hISG15 and Arg82 from the PLpro. It may enhance some more interactions leading to its high affinity than its WT.

PLpro-hISG15 Interactions

Since the electrostatic interaction was the main force behind the PLpro-hISG15 complexation, we investigated the intermolecular hydrogen bond (H-bond) formation using the MD trajectories. The H-bond occupancy (%) is recorded in Table S3 and determines the strength and stability of the interaction. Also, the H-bond interaction between the WT or mutant SCoV2 PLpro and hISG15 is shown in the ball and stick representation to understand the key changes due to the point mutation (Figure S8A,B). It is found that hISG15 formed H-bonds with several polar and charged residues of CoV-PLpro. Notably, the number of residues forming a strong H-bond between PLpro and hISG15 (>50% occupancy) was higher in SCoV2 than other CoVs. As seen from Table S3, Arg57 of hISG15 forms strong H-bonds with PLpro residues, including Gln174 (∼51%), Asn128 (∼50%), and Asp179 (∼40%). Similarly, other H-bond interactions include Thr74-Ser21 (∼48%) and Thr225-Pro144 (∼35%). In contrast, in the case of SCoV2(MT), the H-bond interaction decreases as seen with Gln174-Arg57 (∼13%), while Arg57 from hISG15 forms another interaction with Glu166 (two H-bonds, ∼52%) and Glu167 (one H-bond, ∼21%) (see Figure ). Similarly, the main H-bond formed in SCoV was between Arg87 of hISG15 and Asp230 (two: H-bond, ∼41%). The other important H-bond was formed between Thr147@hISG15 and Val226@PLpro (∼34%). However, in the case of SCoV(MT), the primary H-bond between Asp230-Arg87 was not persistent (∼16%). Instead, other H-bonds with significant occupancy were formed, such as Met209-Gly128 (∼54%), Gly228-Thr147 (∼27%). In the case of MCoV, the significant H-bond interactions were formed between Gln227-Leu145 (∼53%), Thr147-Val225 (two: ∼50%, 35%), and Thr147-Tyr224 (∼32%). Upon mutation (MCoV(MT)), the Gln227 residue forms the main H-bond interaction with the Ser88 residue of hISG15 (∼42%). Other interactions include Phe128-Ser26 (38%) and Glu178-Ser31 (∼24%). Together, it reveals that the nitrogen atom of Arg57 from hISG15 forms hydrogen bonding with the oxygen of glutamine (Q174) of PLpro in SCoV2, which upon the charge-flip mutation (R166E), switch to form with the negatively charged glutamate (E166).
Figure 8

H-bonds and hydrophobic interactions between the hISG15 and PLpro using the Ligplot (dimplot) for the WT complexes of (A) MCoV; (B) SCoV and (C) SCoV2; respectively. H-bonds are displayed in green dotted lines, and red spikes represent the hydrophobic interactions. Significant H-bonding residues of hISG15 and PLpro are highlighted in green and blue, respectively.

H-bonds and hydrophobic interactions between the hISG15 and PLpro using the Ligplot (dimplot) for the WT complexes of (A) MCoV; (B) SCoV and (C) SCoV2; respectively. H-bonds are displayed in green dotted lines, and red spikes represent the hydrophobic interactions. Significant H-bonding residues of hISG15 and PLpro are highlighted in green and blue, respectively. In addition to the hydrogen-bond interaction, the hydrophobic interaction pattern governing the complex formation between hISG15 and PLpro is investigated and drawn using the Ligplot, as shown in Figure and Figure S9. The WT SCoV2-PLpro showed many H-bonds and hydrophobic interactions compared to SCoV and MCoV. Hydrophobic interactions are observed, including residues like Arg153, Phe149, Thr147, Ser146, Gly128, Val59, Ser22, Pro130, Pro59, Met23, and Trp3 from hISG15 and residues Gly209, Pro233, Ala176, Cys224, His175, Ile222, Met208, His73, and Phe69 from PLpro. This indicates a strong tendency of binding of SCoV2-PLpro with hISG15. In contrast, the mutant MCoV showed a large number of H-bonds and hydrophobic interactions compared to others. These results also supported the high binding energy for MCoV(MT) toward hISG15 than WT. Overall, our study provides evidence that SCoV2-PLpro shows preferable binding toward hSIG15 than other CoVs.

Residual Network Analysis

The NAPS server provided a visual inspection of the subnetwork based on various physicochemical properties, such as hydrophilic, hydrophobic, and charged interactions. The different complex nodes between PLpro and hISG15 of all systems were explored and are listed in Table S4. The result depicts the significance of the hydrophilic interaction network in WT-SCoV2 compared to other complexes, such as SCoV and MCoV (Figure ). No hydrophobic or charged nodes contact was detected for WT-SCoV2. Instead, the important hydrophilic interaction networks were found between PLpro and hISG15, which includes (Thr225-Thr147), (Gln174-Gln63), (Asn128-Ser62), (Thr74-Ser24, Ser22), and (Thr75-Ser21, Ser22). However, for SCoV2(MT), a lesser number of hydrophilic network nodes was observed. Additionally, some hydrophobic nodes were also observed, mainly (Leu199-Val52, Ala53) (Figure S9). In contrast, WT-SCoV exhibited mainly hydrophobic contact nodes, while upon R167E mutation in SCoV-PLpro, it showed more hydrophilic and charged interaction networks (see Table S4 and Figures S10 and S11). Similarly, for the WT-MCoV complex, mainly hydrophilic network nodes were prominent, while its mutant form displayed an increase in both hydrophobic and hydrophilic contact nodes compared to charge nodes. In general, it can be concluded from residual network analyses that SCoV2-PLpro interacted heavily with the hydrophilic residues of hISG15 resulting in high binding affinity.
Figure 9

Significant hydrophillic nodes (red) obtained from the network analysis between PLpro (blue) and ISG15 (green) of (A) MCoV; (B) MCoV(MT); (C) SCoV; (D) SCoV(MT); (E) SCoV2 and (F) SCoV2 (MT); respectively. Important connections are shown in yellow connective lines.

Significant hydrophillic nodes (red) obtained from the network analysis between PLpro (blue) and ISG15 (green) of (A) MCoV; (B) MCoV(MT); (C) SCoV; (D) SCoV(MT); (E) SCoV2 and (F) SCoV2 (MT); respectively. Important connections are shown in yellow connective lines.

Conclusions

This comprehensive study offers an integrated computational approach to explore the detailed mechanism of all three major CoVs-PLpro’s interaction with the complete human ISG15 for the first time. The PLpro of CoVs forms the replicase complex and antagonizes the innate immune responses via deubiquitinating and deISGylating activities, making it a potential target in treating the COVID-19. In the absence of the complete co-crystallized structure of hISG15 and PLpro, we constructed the same, and MD simulations were performed. Because of the bi-lobular construct of hISG15, each domain’s respective orientation and their differential interaction pattern with the conserved CoVs were investigated. By analyzing the dynamics of the free hISG15 protein and bound with PLpro of CoVs, we have provided insights into the differences in substrate preferences, binding mode, and the intermolecular interactions relevant to the function of SCoV2, SCoV, and MCoV. Despite the same sequence, hISG15’s dynamics of both domains were differently influenced upon binding with PLpro of CoVs. Our binding free energy analysis suggests that hISG15 binds more strongly with SCoV2-PLpro compared to other CoVs, and mainly the electrostatic interaction drives the complexation between hISG15-PLpro. This may explain why SCoV2-PLpro preferentially cleaves hISG15, whereas SCoV-PLpro predominantly targets ubiquitin chains. The C-terminal domain of hISG15 mainly governs all the interactions between PLpro and hISG15, while the N-terminal domain interacts weakly with PLpro. It agrees well with the recent experimental studies.[33] These in-depth analyses would fill the critical gap of understanding how PLpro of the CoV family interacts with hISG15, leading to specificity in SCoV2. This study may help in designing drugs targeting PLpro of SARS-CoV-2 that can suppress the COVID-19 infection and promote antiviral immunity.
  2 in total

Review 1.  Methodology-Centered Review of Molecular Modeling, Simulation, and Prediction of SARS-CoV-2.

Authors:  Kaifu Gao; Rui Wang; Jiahui Chen; Limei Cheng; Jaclyn Frishcosy; Yuta Huzumi; Yuchi Qiu; Tom Schluckbier; Xiaoqi Wei; Guo-Wei Wei
Journal:  Chem Rev       Date:  2022-05-20       Impact factor: 72.087

2.  Comparative Structural Dynamics of Isoforms of Helicobacter pylori Adhesin BabA Bound to Lewis b Hexasaccharide via Multiple Replica Molecular Dynamics Simulations.

Authors:  Rajarshi Roy; Nisha Amarnath Jonniya; Md Fulbabu Sk; Parimal Kar
Journal:  Front Mol Biosci       Date:  2022-05-02
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.