Literature DB >> 34346317

Investigation of nonsynonymous mutations in the spike protein of SARS-CoV-2 and its interaction with the ACE2 receptor by molecular docking and MM/GBSA approach.

Reem Y Aljindan1, Abeer M Al-Subaie2, Ahoud I Al-Ohali3, Thirumal Kumar D4, George Priya Doss C5, Balu Kamaraj6.   

Abstract

COVID-19 is an infectious and pathogenic viral disease caused by SARS-CoV-2 that leads to septic shock, coagulation dysfunction, and acute respiratory distress syndrome. The spreading rate of SARS-CoV-2 is higher than MERS-CoV and SARS-CoV. The receptor-binding domain (RBD) of the Spike-protein (S-protein) interacts with the human cells through the host angiotensin-converting enzyme 2 (ACE2) receptor. However, the molecular mechanism of pathological mutations of S-protein is still unclear. In this perspective, we investigated the impact of mutations in the S-protein and their interaction with the ACE2 receptor for SAR-CoV-2 viral infection. We examined the stability of pathological nonsynonymous mutations in the S-protein, and the binding behavior of the ACE2 receptor with the S-protein upon nonsynonymous mutations using the molecular docking and MM_GBSA approaches. Using the extensive bioinformatics pipeline, we screened the destabilizing (L8V, L8W, L18F, Y145H, M153T, F157S, G476S, L611F, A879S, C1247F, and C1254F) and stabilizing (H49Y, S50L, N501Y, D614G, A845V, and P1143L) nonsynonymous mutations in the S-protein. The docking and binding free energy (ddG) scores revealed that the stabilizing nonsynonymous mutations show increased interaction between the S-protein and the ACE2 receptor compared to native and destabilizing S-proteins and that they may have been responsible for the virulent high level. Further, the molecular dynamics simulation (MDS) approach reveals the structural transition of mutants (N501Y and D614G) S-protein. These insights might help researchers to understand the pathological mechanisms of the S-protein and provide clues regarding mutations in viral infection and disease propagation. Further, it helps researchers to develop an efficient treatment approach against this SARS-CoV-2 pandemic.
Copyright © 2021. Published by Elsevier Ltd.

Entities:  

Keywords:  ACE2 receptor; Binding affinity; Nonsynonymous mutations; SARS-CoV-2; Spike protein; Stability

Mesh:

Substances:

Year:  2021        PMID: 34346317      PMCID: PMC8282961          DOI: 10.1016/j.compbiomed.2021.104654

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


Introduction

The first case of SARS-Cov-2 was reported in Dec 2019 in Wuhan, China. It was primarily called 2019-nCoV and later designated as SARS-Cov-2 because of its taxonomic and genomics relationship with SARS-CoV [1,2]. COVID-19, the exceedingly infectious and pathogenic viral disease that leads to septic shock, coagulation dysfunction, and acute respiratory distress syndrome, is caused by SARS-CoV-2 [3]. The transmission rate of SARS-CoV-2 is higher than MERS-CoV and SARS-CoV [4]. SARS-CoV-2 is a single-stranded RNA genome and has 29,930 nucleotides. It has 14 open reading frames (ORFs) that code for 29 proteins, including four crucial structural proteins: spike (S) proteins, membrane (M), nucleocapsid (N), and envelope (E), as well as nine supporting proteins and 16 non-structural proteins [5,6]. A recent study reported that mutations within the S-protein intercede the viral section that can tweak viral pathogenesis [7]. The S-protein is practically separated into two segments and known as S1 and S2. The S1 segment of the S-protein binds with the host angiotensin-converting enzyme2 (ACE2) receptor. The S2 region, which is isolated from the S1–S2 linker by the protease cleavage sites, aids in virus-host cell fusion [8]. The S1 segment of the S-protein contains a signal peptide (SP), N-terminal domain, and C-terminal domain. The C-terminal domain of the S1 segment contains a receptor-binding motif (RBM) and the receptor-binding domain (RBD), whereas the S2 segment of S-protein contains a heptad repeat (HR1&HR2), transmembrane domain (TM), fusion peptide (FP), and a small cytoplasmic domain (CP). The above features were combined, making the S-protein the critical target for developing antibodies, vaccines, and drugs [8,9]. Furthermore, nonsynonymous mutations found in the Spike-gene may have a vital role in the pathogen's host range and pathogenicity [10]. Moreover, discovering a complete set of nonsynonymous mutations in the S-protein of SARS-CoV-2 and its effect on human ACE2 affinity is required to develop therapeutic remedies. Hence, investigating the S- protein and its progression can improve our insights into host receptor interaction changes and pathogenic levels. We believe that Spike's essential significance, both in terms of antibody target and viral infection, is critical for developing an “early warning” pipeline to assess the pandemic's progression. The GISAID (A global initiative on sharing avian flu data) consortium has collected so far ~10,000 units of viral genome sequences and made them widely accessible to the research society. This effort allows scientists to examine the data to realize genome diversity [11,12], to postulate targetable targets for drug repurposing [13,14], and to build prevention approaches [15]. Mercatelli and Giorgi [16] completed the significant comparative analysis by inspecting more than 10,000 complete SARS-CoV-2 genomes. They reported every nonsynonymous mutation, further underlining the uprising of sub-clads, and genomic high mutation spots and arranged them genetically and geographically. These findings are mainly helpful for designing and measuring program efficacy for restricting the spread of SARS-Cov-2 on a regional basis [16]. Other recent studies have identified 1815 nonsynonymous mutations that belong to 1176 genomes from 29 countries [17,18]. These mutations fall into 62 distinct types, with 29 different amino acid substitutions present in the S1 segment, 28 in the S2 segment, and 5 at the S1–S2 junction of the S-protein. Mutation D614G was the most abundant mutation and showed the highly infectious A2 subtypes of SARS-CoV-2 [17,18]. RBD consists of 223 amino acid lengths in the S1-region of the S-protein that connects SARS-like coronaviruses to various hosts by directly binding to cellular ACE2 [19]. Shijulal and Umashankar et al. found six distinct types of mutations namely, V367F, P384L, S438F, K439N, G476S, and V483A in RBD domain of the S-protein. They further analyzed and identified that only ~2% of the RBD mutations were nonsynonymous reported from the total S-protein [17,18]. Recently, the mutation N501Y was observed in the RBD domain, which has spread rapidly in the UK and other countries [[20], [21], [22], [23], [24]]. The mutation N501Y has augmented the many discussions and questions, but only a small amount of data relating to it is currently available [21]. The mutation D614G, which originated either in China or Europe, and started to spread swiftly first in Europe and then throughout the world, is the focus of the current pandemic in a number of countries [[25], [26], [27]]. In biological studies, in-silico mutational investigations have proven to be a promising alternative to wet-bench techniques [28,29]. Further in-silico studies should play a vital role in developing information on this new virus to implement procedures that suppress its occurrence. We hypothesize that nonsynonymous mutations in the S-protein alter the stability of its structure and interaction with the ACE2 receptor. Therefore, in this study utilizing comprehensive bioinformatics, we detected the highly significant nonsynonymous mutations in the S-protein based on the stability of the S-protein. Further, we predicted the binding behavior of the S-protein upon nonsynonymous mutations with the ACE2 receptor using the molecular docking and MM_GBSA approaches. MD simulation approach revealed the structural changes of mutant (N501Y and D614G) proteins. Fig. 1 depicts the general workflow used in this work. This could help researchers in better understanding the pathogenic mechanism of S-protein and provide insights into the role of mutations in viral infection and disease propagation. Further, it helps researchers develop an efficient treatment approach against this pandemic SARS-CoV-2.
Fig. 1

The workflow applied in this investigation.

The workflow applied in this investigation.

Materials & methods

Dataset

The human S-protein of the SARS-CoV-2 sequence (length: 1273 amino acids) was obtained in FASTA format from the UNIPROT (ID: P0DTC2) database [30]. The 62 nonsynonymous mutations information of S-protein was collected from the recent articles [17,18,21] and displayed in Fig. 2 . The amino acid sequence was further used to construct the three dimensional (3D) protein conformation of native and mutant S-protein. The ACE2 receptor structure (PDB ID: 1R42_A) [31], was collected from the Protein Data Bank (PDB) database [32].
Fig. 2

S-protein domains and their nonsynonymous mutations list.

S-protein domains and their nonsynonymous mutations list.

Modeling the native and mutant S-protein

The available PDB structures of S-proteins show many missing residues in the 3-D structure. Some of the collected SNP residue positions do not present in the existed PDB structures. To fix this issue, we have used the previous research studies as an example and implemented them in this study to construct the three-dimensional (3-D) conformation of the native S-protein [[33], [34], [35], [36], [37], [38]]. Hence, we used the I-TASSER server [39] to model the native S-protein. It is a threading-based structure prediction program which used to generate the three-dimensional structure of proteins from their amino acid sequence. Some recent studies opted for the threading approach to model the protein to check the interaction with other biological molecules [[33], [34], [35], [36], [37], [38]]. I-TASSER produces great quality model predictions of three-dimensional structures from amino acid sequences. It is a very accurate and practical approach. We used the PDB ID: 6XR8_A [40] as a template, and it has shown 100% sequence coverage and similarity in the S-protein query sequence. We acquired the most acceptable modeled structure from I-TASSER, based on the high c-score. The predicted model of native S-protein was further refined by the molecular dynamics simulation (MDS) approach using the GROMACS [41] package. The OPLS-AA force field [42] was used for the refinement. Our MD simulations were carried out according to a procedure that has previously been published [[43], [44], [45]]. The pre-equilibrated protein was applied for MD simulations till 12 nanoseconds (ns). The RMSD value was calculated to examine the protein's structural alteration. To further inspect the effect of nonsynonymous mutations on the S-protein, we generated the nonsynonymous mutations in the predicted S-protein model. Moreover, we used the SwissPDB viewer tool [46] program and performed an energy minimization to create the perfect mutant protein structures. Lastly, the PROCHECK [47] and PROSA [48] tools were applied to assess the reliability of native and mutant S-protein.

Prediction of native and mutant S-protein stability

Sequence-based

The S-protein amino acid sequence was used as an input for estimating protein stability upon nonsynonymous mutations. i-Stable Server [49] was applied to predict the stability alterations of S-protein upon nonsynonymous mutations. It gives results from I-Mutant2.0 [50] and MUpro [51] programs, predicting the meta results. The DDG scores from I-Mutant 2.0 and Conf score from MUpro and i-Stable are considered to predict protein stability. First, I-Mutant-2.0, the DDG score less than 0 is predicted as decreasing stability, whereas DDG scores greater than 0 is predicted as increasing stability. Second, with MUpro, we obtained the Conf Score in the range of −1 to 1. A DDG score higher than 0 predicted increased stability, and a score lower than 0 predicted decreased stability. Last, the i-Stable conf score ranges between 0 and 1, where the higher value exposes higher confidence.

Structure-based

We used the modeled native and mutant S-protein structures as input to estimate how the protein's stability would vary as a result of nonsynonymous mutations. CUPSAT CUPSAT (Cologne University Protein Stability Analysis Tool) [52] was used to analyze fluctuations in the stability of S-protein upon mutation. We uploaded the model structure of the S-protein as input and selected the location of the amino acid residue to be mutated. The stability of native and mutant S-proteins is predicted by calculating the difference in the free energy score (DDG score). Further, it provides information about nonsynonymous mutations, secondary structures, torsion angles, and solvent accessibility affected by the mutation. SDM2 The Site-directed mutator 2 (SDM2) Server was employed to compute the effect of nonsynonymous mutations on the stability of proteins [53]. It is a knowledge‐based tool and applies ESSTs tables to analyze the alterations in protein stability upon mutation. Further, it computes the stability changes score between the native and mutant proteins. We uploaded the 3-D model structures of native and mutant S-proteins and point variant information as an input file. DUET [54] is an integrated computational method used for calculating the impact of nonsynonymous mutations on the stability of the protein. We uploaded the 3-D model structures of native and mutant S-proteins as an input file.

S-protein and ACE2 docking by HADDOCK

The native and mutant S-proteins were docked with the ACE2 receptor molecule using HADDOCK v2. 4 [55]. The modeled native and mutant S-proteins and ACE2 protein (PDB ID: 1R42_A) molecules were used. We collected the interacting (binding) residues between the S-protein and ACE-2 receptor from recent studies [56]. The collected binding residues in S-protein together with the binding residues in the partner ACE-2 receptor were used as active residues, and the neighboring ones were used as passive residues. The default parameters we applied in our previous studies were also considered for the docking studies [[57], [58], [59], [60]]. The HADDOCK scoring function was executed based on the weighted aggregate of the various energy terms de-solvation (Desolv) and restraints energy, van der Waals (vdw), electrostatic (Elec), and buried surface area (BSA).

MM-GBSA calculation by HAWKDOCK

The MM-GBSA free energy decomposition analysis implemented in the S-protein and ACE2 receptor molecule and the binding affinity was estimated by the HawkDock server [61]. HawkDock considers a relatively more minor protein as a flexible receptor and a bigger protein as an inert receptor. The HawkRank scoring function and the ATTRACT docking algorithm with MM/GBSA free energy decomposition analysis were used to calculate the binding free energy between the protein complexes. We applied the haddock output complex structures (native and mutant S-proteins and ACE2 receptor) as an input to compute the binding free energy. Lastly, the best ten models of interacting proteins were re-ranked by MM/GBSA calculation [[62], [63], [64]]. All protein-protein interactions were represented diagrammatically using the LigPlot program [65] and PYMOL software [66].

Native and mutant (N501Y & D614G) S-proteins MD simulation

The native and most prominent stabilizing mutants (N501Y and D614G) of the S-protein were utilized as input for the molecular dynamics simulations. The initial setup of the MD simulation was prepared by following our previously published protocol [[43], [44], [45]]. The CHARMM36 m force field [67] was implemented in this simulation. The minimization, equilibration, and MD simulation procedures were performed per our previously published protocol [[43], [44], [45]]. Lastly, the MD simulation was carried out for 15 ns. XMGRACE [68] tool used to analyze the trajectories. We analyzed the energy plot, RMSD, RMSF, the solvent-accessible surface area (SASA), Principle component analysis (PCA) [69], and the number of hydrogen bonds (NH-bonds), and we made a comparison between native, stabilizing mutants (N501Y and D614G) to examine the structural behavior of the S-protein.

Results

Predicting the 3-D structures of native and mutant S-proteins

Observing the mutational effect on the S-protein and its interaction with the ACE2 receptor is very important for predicting the 3D conformation of the S-protein. Hence, we have used the I-TASSER online server to predict the 3D structure of the S-protein. The PDB ID: 6XR8_A was used as a template for predicting the 3D structure of the native S-protein. Further refinement of predicted model protein, MDS for 12ns performed to evaluate the stability of model protein for subsequent studies. The RMSD of the native S-protein is converged after 7 ns (Fig. 3 ). Further, the average structure of the native S-protein was shown at regular intervals. The best favorable structure was selected at the 12ns MDS, and subsequently used to build the mutant structures.
Fig. 3

The backbone RMSD of the native S-protein versus time at 300 K. The average structure of native S-protein has shown in different timescale.

The backbone RMSD of the native S-protein versus time at 300 K. The average structure of native S-protein has shown in different timescale. Further, the Swiss PDB viewer tool was used to build the mutant structures of the S-protein. Then, PROCHECK and PROSA software used to calculate the consistency of predicted model structures. The native S-protein showed that 99.7% favored and allowed region and z-score of −6.6. Mutant structures showed in the range of 96.4%–99.6% in favored residues and allowed region and z-score values in the range of 3.25–6.54. These predicted scores corroborate the high confidence level. Therefore, the native and mutant modeled S-protein structures were utilized for SNP and protein-protein docking analysis. The modeled structure of the native S-protein and ACE2 receptor was shown in Fig. 4 .
Fig. 4

(a) Modeled Spike (S) - protein, (b) ACE2 protein (PDB ID: 1R42_A). This figure was prepared by PYMOL.

(a) Modeled Spike (S) - protein, (b) ACE2 protein (PDB ID: 1R42_A). This figure was prepared by PYMOL.

Protein stability analysis

To identify the effect of 62 nonsynonymous mutations of different domains (SP, NTD, CTD/RBD, Linker, S1/S2, FP, S2, HR1, HR2, TM, and CT domain) of S-protein, we utilized a broad approach of numerous coherent sequences and structure-based online servers. First, we used the sequence-based i-Stable Server to verify the stability of the S-protein (whether increased or decreased stability) upon mutations. The i-Stable tool collates I‐Mutant 2.0 SEQ, MUpro, and i-Stable to evaluate the stability. I-Mutant 2.0 SEQ predicted 49 of 62 nonsynonymous mutations with decreased stability, and 13 nonsynonymous mutations with increased stability (Table 1 ). On the other hand, Mupro predicted 39 and 23 nonsynonymous mutations as decrease stability and increase stability (Table 1). As a meta result, i-stable predicted, out of 62 nonsynonymous mutations, 40 nonsynonymous mutations (L5F, L8V, L8W, L18F, L54F, T76I, V120I, D138Y, Y145H, M153T, F157S, L176F, G181V, D215H, A262T, V367F, G476S, V483A, L611F, Q675H, A706V, T791I, P809S, A845S, A846V, A879S, V1040F, P1162L, D936H, S939F, T941A, D1163G, I1216T, M1229I, M1237L, M1237I, C1247F, C1254F, D1260 N, and P1263L) with decreased the stability and 22 nonsynonymous mutations (H49Y, S50L, S71F, S221L, S221W, Q239K, S247R, S254F, W258L, S438F, N501Y, D614G, Q675R, A831V, D839Y, A845V, A852V, P1143L, D936Y, S940F, S940T, and S943P) with increased stability (Table 1).
Table 1

The sequence-based classification of nonsynonymous mutations of SPIKE protein (S-protein) as increasing or decreasing in stability by applying iStable Server.

DomainMutationI-Mutant2.0 SEQ
Mupro
Meta Result (iStable)
DDGPredictionScorePredictionScorePrediction
SPL5F−0.63Decrease−0.72Decrease0.71Decrease
L8V−2.82Decrease−0.45Decrease0.83Decrease
L8W−1.20Decrease−0.62Decrease0.82Decrease
NTDL18F−0.82Decrease−1.00Decrease0.86Decrease
H49Y0.69Increase0.98Increase0.84Increase
S50L−0.04Decrease1Increase0.52Increase
L54F−0.88Decrease−1Decrease0.82Decrease
S71F0.41Increase0.02Increase0.82Increase
T76I−0.93Decrease−1Decrease0.60Decrease
V120I−0.57Decrease−0.69Decrease0.78Decrease
D138Y0.08Decrease−0.67Decrease0.57Decrease
Y145H−1.43Decrease−1Decrease0.82Decrease
M153T−1.07Decrease−0.67Decrease0.84Decrease
F157S−1.94Decrease−1Decrease0.89Decrease
L176F−0.88Decrease−1Decrease0.81Decrease
G181V−0.48Decrease−0.15Decrease0.81Decrease
D215H−0.97Decrease−1Decrease0.69Decrease
S221L−0.25Decrease1Increase0.51Increase
S221W0.14Increase0.89Increase0.77Increase
Q239K−0.56Increase−0.01Decrease0.51Increase
S247R0.01Increase0.50Increase0.78Increase
S254F0.21Increase0.44Increase0.82Increase
W258L−0.84Decrease0.11Increase0.62Increase
A262T−0.73Decrease−0.83Decrease0.77Decrease
CTD/RBDV367F−2.07Decrease−0.19Decrease0.79Decrease
S438F0.00Increase0.75Increase0.82Increase
G476S−1.45Decrease−0.45Decrease0.82Decrease
V483A−1.46Decrease−0.51Decrease0.75Decrease
N501Y0.18Increase0.39Increase0.51Increase
Linker(S1/S2)L611F−0.72Decrease−1Decrease0.82Decrease
D614G−1.10Decrease0.19Increase0.68Increase
Q675H−0.64Decrease−0.06Decrease0.80Decrease
Q675R−0.06Decrease0.50Increase0.59Increase
A706V−0.26Decrease0.16Increase0.52Decrease
FPT791I−0.35Decrease−1Decrease0.66Decrease
S2P809S−1.59Decrease−1Decrease0.79Decrease
A831V−0.06Decrease0.85Increase0.56Increase
D839Y0.04Increase0.31Increase0.80Increase
A845S−0.77Decrease−1Decrease0.88Decrease
A845V−0.20Decrease0.09Increase0.51Increase
A846V−0.18Decrease0.89Increase0.54Decrease
A852V−0.14Decrease0.72Increase0.58Increase
A879S−0.62Decrease−0.50Decrease0.86Decrease
V1040F−1.72Decrease−0.92Decrease0.83Decrease
P1143L−0.64Decrease0.16Increase0.55Increase
P1162L−0.94Decrease−0.39Decrease0.81Decrease
HR1D936H−0.94Decrease−0.61Decrease0.76Decrease
D936Y−0.58Decrease0.09Increase0.69Increase
S939F−0.09Increase−0.24Decrease0.52Decrease
S940F0.09Increase0.99Increase0.75Increase
S940T−0.07Increase0.55Increase0.81Increase
T941A−1.04Decrease−0.51Decrease0.72Decrease
S943P−0.21Increase0.47Increase0.81Increase
HR2D1163G−1.78Decrease−0.83Decrease0.75Decrease
TMI1216T−1.96Decrease−1Decrease0.86Decrease
M1229I−0.85Decrease−1Decrease0.71Decrease
M1237L−0.91Decrease−0.25Decrease0.82Decrease
M1237I−0.75Decrease−0.63Decrease0.74Decrease
CTC1247F0.01Decrease−0.42Decrease0.69Decrease
C1254F−0.13Decrease−0.87Decrease0.76Decrease
D1260 N−0.94Decrease−0.57Decrease0.85Decrease
P1263L−0.66Decrease−0.41Decrease0.80Decrease
The sequence-based classification of nonsynonymous mutations of SPIKE protein (S-protein) as increasing or decreasing in stability by applying iStable Server. Further, we have collated the CUPSAT, SDM 2.0, and DUET structure-based online servers to predict S-protein stability upon nonsynonymous mutations. CUPSAT predicted that out of 62 nonsynonymous mutations, 36 and 26 nonsynonymous mutations were destabilizing and stabilizing (Table 2 ). The SDM 2.0 server depicted 28 nonsynonymous mutations with increased stability, and 34 nonsynonymous mutations with decreased stability (Table 2). While the DUET server predicted 46 and 16 nonsynonymous mutations as destabilizing and stabilizing (Table 2). In combination, structure-based online servers predicted 17 nonsynonymous mutations (L8V, L8W, L18F, S71F, Y145H, M153T, F157S, S221L, S221W, S247R, G476S, L611F, A831V, A852V, A879S, C1247F, and C1254F) with decreased stability, and 7 nonsynonymous mutations (H49Y, S50L, D215H, N501Y, D614G, A845V, and P1143L) with increased stability in the S-protein (Table 2). Together, both sequence and structure-based online servers predicted 11 nonsynonymous mutations (L8V, L8W, L18F, Y145H, M153T, F157S, G476S, L611F, A879S, C1247F, and C1254F) exhibiting decreased stability and 6 nonsynonymous mutations (H49Y, S50L, N501Y, D614G, A845V, and P1143L) exhibiting increased stability in the S-protein upon mutations (Table 1, Table 2). These nonsynonymous mutations were separated based on each domain (SP, NTD, CTD/RBD, Linker, S2, and TM) of the S-protein. Further, we examined binding behavior of nonsynonymous mutations in S-protein with the ACE2 receptor using molecular docking and MM_GBSA studies.
Table 2

The structure-based classification of nonsynonymous mutations of SPIKE protein (S-protein) as increasing or decreasing in stability using CUPSAT, SDM 2.0, DUET Server.

DomainMutationCUPSAT
SDM 2.0
DUET
Predicted DDG (kcal/mol)PredictionDDGPredictionDDGPrediction
SPL5F−0.29Destabilizing0.2Increased stability−0.611Destabilizing
L8V−3.18Destabilizing−2.69Reduced stability−2.13Destabilizing
L8W−8.17Destabilizing−1.11Reduced stability−2.003Destabilizing
NTDL18F−2.64Destabilizing−1.11Reduced stability−1.72Destabilizing
H49Y0.96Stabilizing0.78Increased stability1.293Stabilizing
S50L4.18Stabilizing1.2Increased stability0.734Stabilizing
L54F4.41Stabilizing−0.08Reduced stability−0.866Destabilizing
S71F−0.53Destabilizing−0.66Reduced stability−1.772Destabilizing
T76I1.1Stabilizing1.1Increased stability−0.06Destabilizing
V120I0.89Stabilizing−0.65Reduced stability−0.029Destabilizing
D138Y1.71Stabilizing−0.21Reduced stability−1.505Destabilizing
Y145H−1.6Destabilizing−0.29Reduced stability−1.08Destabilizing
M153T−0.58Destabilizing−0.09Reduced stability−0.733Destabilizing
F157S−4.01Destabilizing−0.83Reduced stability−0.977Destabilizing
L176F−3.11Destabilizing0.2Increased stability−0.537Destabilizing
G181V3.27Stabilizing−1.05Reduced stability−0.466Destabilizing
D215H1.17Stabilizing0.85Increased stability1.087Stabilizing
S221L−1.23Destabilizing−0.08Reduced stability0.102Stabilizing
S221W−3.44Destabilizing−0.74Reduced stability−0.793Destabilizing
Q239K6.79Stabilizing−1.01Reduced stability−1.223Destabilizing
S247R−3.25Destabilizing−1.01Reduced stability−1.155Destabilizing
S254F0.11Stabilizing−0.73Reduced stability−1.638Destabilizing
W258L0.71Stabilizing−1.07Reduced stability−1.932Destabilizing
A262T1.51Stabilizing0.13Increased stability−0.865Destabilizing
CTD/RBDV367F−0.11Destabilizing0.14Increased stability−0.636Destabilizing
S438F3.64Stabilizing−0.38Reduced stability−0.485Destabilizing
G476S−0.8Destabilizing−4.37Reduced stability−1.112Destabilizing
V483A−0.13Destabilizing0.12Increased stability−0.317Destabilizing
N501Y0.07Stabilizing0.91Increased stability0.207Stabilizing
LinkerL611F−1.01Destabilizing−0.59Reduced stability−1.308Destabilizing
D614G0.3Stabilizing0.87Increased stability0.173Stabilizing
(S1/S2)Q675H−3.05Destabilizing0.33Increased stability−0.71Destabilizing
Q675R−0.87Destabilizing0.12Increased stability0.106Stabilizing
A706V0.17Stabilizing−0.12Reduced stability−0.104Destabilizing
FPT791I−2.58Destabilizing0.07Increased stability0.257Stabilizing
S2P809S2.16Stabilizing−0.63Reduced stability−0.236Destabilizing
A831V−1.46Destabilizing−0.84Reduced stability−0.127Destabilizing
D839Y−0.98Destabilizing0.67Increased stability−0.362Destabilizing
A845S0.77Stabilizing−0.14Reduced stability−0.328Destabilizing
A845V0.36Stabilizing1.16Increased stability0.119Stabilizing
A846V1.47Stabilizing−0.12Reduced stability−0.018Destabilizing
A852V−0.5Destabilizing−0.84Reduced stability−0.55Destabilizing
A879S−1.52Destabilizing−2.18Reduced stability−1.393Destabilizing
V1040F−1.11Destabilizing0.14Increased stability−0.781Destabilizing
P1143L0.45Stabilizing1.45Increased stability0.358Stabilizing
P1162L0.23Stabilizing−0.58Reduced stability−0.085Destabilizing
HR1D936H−0.91Destabilizing0.13Increased stability−0.684Destabilizing
D936Y−1.69Destabilizing0.77Increased stability−0.434Destabilizing
S939F−0.37Destabilizing0.54Increased stability−1.037Destabilizing
S940F−0.36Destabilizing0.54Increased stability−0.89Destabilizing
S940T0.06Stabilizing−0.14Reduced stability0.342Stabilizing
T941A−1.07Destabilizing1.1Increased stability−0.594Destabilizing
S943P0.75Stabilizing−0.48Reduced stability−0.138Destabilizing
HR2D1163G−1.93Destabilizing0.87Increased stability0.411Stabilizing
TMI1216T1.46Stabilizing−1.06Reduced stability−0.86Destabilizing
M1229I−0.46Destabilizing0.11Increased stability−0.18Destabilizing
M1237L−1.1Destabilizing0.54Increased stability0.498Stabilizing
M1237I−1.75Destabilizing0.11Increased stability0.439Stabilizing
CTC1247F−1.53Destabilizing−0.2Reduced stability−0.369Destabilizing
C1254F−1.24Destabilizing−0.2Reduced stability−0.424Destabilizing
D1260 N0.27Stabilizing−0.11Reduced stability0.616Stabilizing
P1263L−1.83Destabilizing2.24Increased stability0.394Stabilizing
The structure-based classification of nonsynonymous mutations of SPIKE protein (S-protein) as increasing or decreasing in stability using CUPSAT, SDM 2.0, DUET Server.

Docking the S-protein and its predicted mutations with the ACE2 receptor molecule

The HADDOCK tool and HawkDock Server was used to examine the binding energy between the native and mutant S-proteins [destabilizing (L8V, L8W, L18F, Y145H, M153T, F157S, G476S, L611F, A879S, C1247F, and C1254F) and stabilizing (H49Y, S50L, N501Y, D614G, A845V, and P1143L)] with the ACE2 receptor. The HADDOCK score must be computed in order to understand the interaction between biological partners. During docking, each structure is assigned a HADDOCK score, which allows the structures to be classified. The score is a sum of the intermolecular AIR energies, buried surface area (BSA) Electrostatic, van der Waals, and desolvation (Dsolv) energies [[70], [71], [72]]. The Haddock scores of the native S-protein-ACE2 complex, and destabilizing S-proteins of S-protein-ACE2 (L8V-ACE2, L8W-ACE2, L18F-ACE2, Y145H-ACE2, M153T-ACE2, F157S-ACE2, G476S-ACE2, L611F-ACE2, A879S-ACE2, C1247F-ACE2, and C1254F-ACE2) complexes were −123.5 ± 13.2, −118.8 ± 6.1, −120.8 ± 16.0, −116.0 ± 3.2, −119.0 ± 9.9, −115.5 ± 1.2, −118.4 ± 2.8, −119.2 ± 6.5, −117.0 ± 14.4, −116.8 ± 12.7, −117.1 ± 4.0, and −113.4 ± 8.0, respectively, shown in Table 3 a. Whereas the HADDOCK score of stabilizing nonsynonymous mutations of S-protein -ACE2 complexes (H49Y-ACE2, S50L-ACE2, N501Y-ACE2, D614G-ACE2, A845V-ACE2, and P1143L-ACE2) were −136.5 ± 6.7, −131.3 ± 10.3, −136.3 ± 6.3, −128.2 ± 12.0, −127.7 ± 4.7, and −125.9 ± 17.1, respectively, are shown in Table 3 b.
Table 3a

Docking analysis of native and destabilizing nonsynonymous mutants of S-protein with ACE2 receptor using HADDOCK.

Protein TypeDomainHaddock ScoreVan der waal energy (Kcal mol−1)Electrostatic energy (Kcal mol−1)Desolvation energy (Kcal mol−1)Restraints violation energy (Kcal mol−1)Buried surface area (A °2)
Native-ACE2−123.5 ± 13.2−51.8 ± 3.3−374.5 ± 80.4−10.3 ± 3.7134.0 ± 27.62099.9 ± 186.4
L8V-ACE2SP−118.8 ± 6.1−44.1 ± 4.6−345.6 ± 18.0−12.8 ± 5.073.0 ± 3.62012.7 ± 146.2
L8W-ACE2−120.8 ± 16.0−56.9 ± 8.9−320.2 ± 29.3−10.5 ± 2.8106.3 ± 36.12082.0 ± 84.8
L18F-ACE2NTD−116.0 ± 3.2−54.4 ± 11.4−319.1 ± 51.9−8.6 ± 3.3108.3 ± 57.12004.5 ± 81.5
Y145H-ACE2−119.0 ± 9.9−57.7 ± 4.0−294.7 ± 21.8−13.3 ± 2.8108.7 ± 28.41986.3 ± 52.0
M153T-ACE2−115.5 ± 1.2−47.7 ± 9.6−343.4 ± 77.9−7.1 ± 6.180.0 ± 29.11946.9 ± 59.7
F157S-ACE2−118.4 ± 2.8−56.9 ± 1.5−282.9 ± 27.0−13.4 ± 4.685.0 ± 36.91954.0 ± 25.2
G476S-ACE2CTD/RBD−119.2 ± 6.5−50.7 ± 4.5−375.0 ± 33.8−4.9 ± 0.9113.9 ± 60.61910.9 ± 123.7
L611F-ACE2Linker−117.0 ± 14.4−66.2 ± 6.1−242.3 ± 39.5−7.9 ± 2.655.4 ± 16.22019.9 ± 243.6
A879S-ACE2S2−116.8 ± 12.7−48.8 ± 12.0−339.1 ± 44.8−6.9 ± 5.766.8 ± 22.62001.8 ± 193.5
C1247F-ACE2TM−117.1 ± 4.0−51.5 ± 6.4−315.3 ± 19.5−13.3 ± 3.3108.1 ± 28.71977.5 ± 72.2
C1254F-ACE2−113.4 ± 8.0−65.8 ± 9.7−216.8 ± 48.0−11.6 ± 5.373.5 ± 35.52098.8 ± 135.2
Table 3b

Docking analysis of native and nonsynonymous stabilizing mutants of S-protein with ACE2 receptor using HADDOCK.

Protein TypeDomainHaddock ScoreVan der waal energy (Kcal mol−1)Electrostatic energy (Kcal mol−1)Desolvation energy (Kcal mol−1)Restraints violation energy (Kcal mol−1)Buried surface area (A °2)
Native-ACE2−123.5 ± 13.2−51.8 ± 3.3−374.5 ± 80.4−10.3 ± 3.7134.0 ± 27.62099.9 ± 186.4
H49Y-ACE2NTD−136.5 ± 6.7−50.8 ± 5.5−450.8 ± 19.9−6.4 ± 4.2108.8 ± 37.72278.1 ± 100.4
S50L-ACE2−131.3 ± 10.3−55.0 ± 2.2−381.8 ± 44.3−10.8 ± 2.7108.0 ± 22.72232.6 ± 166.2
N501Y-ACE2RBD−136.3 ± 6.3−61.5 ± 4.9−406.1 ± 23.8−2.6 ± 2.490.8 ± 30.62128.1 ± 109.5
D614G-ACE2Linker−128.2 ± 12.0−45.8 ± 7.5−427.7 ± 26.5−6.9 ± 4.199.4 ± 43.22135.1 ± 87.8
A845V-ACE2S2−127.7 ± 4.7−43.9 ± 6.4−434.2 ± 47.7−9.0 ± 5.2120.7 ± 19.42213.1 ± 90.2
P1143L-ACE2−125.9 ± 17.1−47.2 ± 7.5−398.4 ± 82.8−9.5 ± 3.2105.2 ± 47.02119.3 ± 118.2
Docking analysis of native and destabilizing nonsynonymous mutants of S-protein with ACE2 receptor using HADDOCK. Docking analysis of native and nonsynonymous stabilizing mutants of S-protein with ACE2 receptor using HADDOCK. The buried surface area (BSA) is applied to calculate the surface of the protein. The native-complex displays a BSA value of 2099.9 ± 186.4, while the BSA values of the destabilizing nonsynonymous mutations of S-protein – ACE2 complexes are in between the range of 1910.9 ± 123.7 and 2098.8 ± 135.2 (Table 3a), and the stabilizing nonsynonymous mutations of S-protein- ACE2 complexes exhibit a higher BSA score between the range of 2119.3 ± 118.2, and 2278.9 ± 100.4 (Table 3b), compared to the native complex. We applied the MM_GBSA approach to evaluate the binding free energy (ddG) between the native and mutants S-protein and ACE2 receptor molecules to verify this further. The MM_GBSA score (ddg) of native complex and destabilizing mutation complexes were −60.8, −45.35, −48.39, −28.55, −52.57, −32.55, −36.89, −47.3, −30.7, −48.06, −54.27, and −40.78, respectively, as shown in Table 4 a. Whereas the MM_GBSA score of stabilizing nonsynonymous mutations of S-protein (H49Y, S50L, N501Y, D614G, A845V, and P1143L) and ACE2 receptor complexes were −73.06, −80.89, −66.53, −79.81, −63.05, and −68.62, respectively (Table 4 b). Further, the interaction between the native and mutant S-protein and ACE2 receptor molecules is shown in Fig. 5a, Fig. 5b a and b.
Table 4a

MM-GBSA binding free energy (ddG) score using HawkDock, number of Hydrogen bonds and binding site residues of native and destabilizing nonsynonymous mutants of S-protein and ACE2 receptor.

Protein TypeDomainMM-GBSA Binding free energy (kcal/mol)No of HBONDSBinding site residues of S proteinBinding site residues of ACE2 protein
Native-ACE2−60.813Arg403, GLU406, LYS417, VAL445, TYR453, GLU484, GLN493, SER494, GLY496, THR500, ASN501, TYR505GLU23, ASP30, LYS31, HIS34, GLU35, ASP38, GLN42, LYS74, GLU75, GLN76
L8V-ACE2SP−45.359LYS417, TYR449, TYR453, LEU455, GLU484, TYR489, THR500, ASN501SER19, GLN24, LYS31, GLU35, GLN42, LYS68, GLU75, TYR83, LYS353
L8W-ACE2−48.3910ARG403, ASP405, LYS417, GLY446, TYR453, GLY476, ASN487, GLN498, ASN501, GLY502LYS31, GLU35, ASP38, GLN42, ASN61, LYS68, GLU75, GLN325
L18F-ACE2NTD−28.5511LYS417, ARG457, LYS458, GLN474, GLU484, CYS488, TYR489, GLN493, SER494, THR500, GLY502SER19, LYS31, GLU35, ASP38, GLN42, LYS68, GLU75, MET82, LYS353
Y145H-ACE2−52.5711TYR449, TYR453, GLU484, GLY485, ARG457, TYR489, GLN493, TYR505THR27, HIS34, GLU35, ASP38, LYS68, GLU75, GLN76, MET82, LYS353
M153T-ACE2−32.559ARG403, TYR453, LEU455, GLU484, CYS488, GLN493, SER494HIS34, GLU35, ASP38, LYS68, GLU75, GLN76, LYS353
F157S-ACE2−36.898GLY446, TYR453, LEU455, GLU484, CYS488, GLN493, SER494, TYR505GLN24, LYS31, HIS34, GLU35, ASP38, LYS68, GLU75
G476S-ACE2CTD/RBD−47.39GLU406, LYS417, TYR449, TYR453, GLU484, THR500, ASN501, GLY502THR27, GLU35, ASP38, TYR41, LYS353, ASP355, ARG559
L611F-ACE2Linker−30.79ARG403, GLU406, LYS417, TYR453, GLN474, ASN487, TYR489, THR500, TYR505SER19, GLN24, ASP30, ASP38, GLN42, ASN49, LYS453
A879S-ACE2S2−48.069ARG403, LYS417, TYR453, GLU484, ASN501, GLY502, TYR505GLN24, LYS31, GLU35, GLU75, MET82, TYR83, LYS353
C1247F-ACE2TM−54.2713ARG403, GLY446, TYR449, TYR453, GLU484, GLY485, CYS488, GLN493, SER494, THR500, ASN501GLN24, LYS31, HIS34, GLU35, ASP38, GLU75, GLN76, THR78, GLN81, TYR83, LYS353
C1254F-ACE2−40.788ARG403, GLU406, ARG408, LYS417, TYR449, ASN487, ASN501, VAL503GLN24, GLN42, ASN49, GLN325, ASN330, LYS353, ASP355
Table 4b

MM-GBSA binding free energy (ddG) score using HawkDock, number of Hydrogen bonds and binding site residues of native and stabilizing nonsynonymous mutants of S-protein and ACE2 receptor.

Protein TypeDomainMM-GBSA Binding free energy (kcal/mol)No of HBONDSBinding site residues of S proteinBinding site residues of ACE2 protein
Native-ACE2−60.813Arg403, GLU406, LYS417, VAL445, TYR453, GLU484, GLN493, SER494, GLY496, THR500, ASN501, TYR505GLU23, ASP30, LYS31, HIS34, GLU35, ASP38, GLN42, LYS74, GLU75, GLN76
H49Y-ACE2SP−73.0617ARG403, GLU406, LYS417, VAL445, GLY446, TYR453, GLU484, GLN493, SER494, GLY496, GLN498, THR500, ASN501, GLY504ASP30, LYS31, HIS34, GLU35, ASP38, GLN42, LYS68, LYS74, GLU75, GLN76
S50L-ACE2−80.8915ARG403, GLU406, VAL445, GLY446, TYR 453, GLU484, TYR489, GLN493, SER494, GLY496, GLN498, THR500, ASN501ASP30, LYS31, HIS34, GLU35, ASP38, GLN42, LYS68, LYS74, GLU75, GLN76, MET82
N501Y-ACE2RBD−66.5313ARG403, ASP405, LYS417, TYR421, TYR453, LEU455, ALA475, GLU484, PHE486, GLN493, GLY502ASP30, LYS31, GLU35, ASP38, GLN42, TRP48, ASN61, LYS68, GLU75, GLN325, ASN330, LYS353
D614G-ACE2Linker−79.8116ARG403, GLU406, LYS417, TYR421, TYR453, GLU484, GLN493, GLY496, GLN498, THR500, ASN501SER19, GLU23, ASP30, LYS31, HIS34, GLU35, ASP38, LYS74, GLU75, GLN76, LYS353
A845V-ACE2S2−63.0513ARG403, GLU406, LYS417, GLU484, GLN493, SER494, GLY496, THR500, ASN501, TYR505ASP30, LYS31, HIS34, GLU35, ASP38, LYS68, LYS74, GLU75, GLN76
P1143L-ACE2-68.6215ARG403, GLU406, LYS417, TYR449, TYR453, GLU484, GLN493, SER494, GLY496, GLN498, THR500, ASN501ASP30, LYS31, HIS34, GLU35, ASP38, LYS68, LYS74, GLU75, GLN76, LYS353
Fig. 5a

Native and destabilizing nonsynonymous mutations (L8V, L8W, L18F, Y145H, M153T, F157S, G476S, L611F, A879S, C1247F, and C1254F) of S-protein interaction with ACE2 receptor. All the proteins were represented in the surface style. The color coding represents the S-protein in green color, the RBD domain of S-protein in yellow color, and ACE2 protein in cyan color. The image was prepared by PYMOL.

Fig. 5b

Native and nonsynonymous stabilizing mutations (H49Y, S50L, N501Y, D614G, A845V, and P1143L) of S-protein interaction with ACE2 receptor. All the proteins were represented in the surface style. The color coding represents the S-protein in green color, the RBD domain of S-protein in yellow color, and ACE2 protein in cyan color. The image was prepared by PYMOL.

MM-GBSA binding free energy (ddG) score using HawkDock, number of Hydrogen bonds and binding site residues of native and destabilizing nonsynonymous mutants of S-protein and ACE2 receptor. MM-GBSA binding free energy (ddG) score using HawkDock, number of Hydrogen bonds and binding site residues of native and stabilizing nonsynonymous mutants of S-protein and ACE2 receptor. Native and destabilizing nonsynonymous mutations (L8V, L8W, L18F, Y145H, M153T, F157S, G476S, L611F, A879S, C1247F, and C1254F) of S-protein interaction with ACE2 receptor. All the proteins were represented in the surface style. The color coding represents the S-protein in green color, the RBD domain of S-protein in yellow color, and ACE2 protein in cyan color. The image was prepared by PYMOL. Native and nonsynonymous stabilizing mutations (H49Y, S50L, N501Y, D614G, A845V, and P1143L) of S-protein interaction with ACE2 receptor. All the proteins were represented in the surface style. The color coding represents the S-protein in green color, the RBD domain of S-protein in yellow color, and ACE2 protein in cyan color. The image was prepared by PYMOL. Therefore, the no. Of H-bonds was calculated for the native, destabilizing, and stabilizing nonsynonymous mutations of S-protein-ACE2 complexes, and the values are depicted in Fig. 6 and Table 4a, Table 4ba–b. The native S-protein -ACE2 receptor complex displays a total of 13 hydrogen bonds. The destabilizing nonsynonymous mutations (L8V, L8W, L18F, Y145H, M153T, F157S, G476S, L611F, A879S, C1247F, and C1254F) of S-protein and ACE2 complexes exhibits a lesser number of H-bonds compared to the native S-protein _ACE2 complex (Fig. 6).
Fig. 6

The no. H-bonds formed between the native and nonsynonymous mutants spike receptor and ACE2 receptor.

The no. H-bonds formed between the native and nonsynonymous mutants spike receptor and ACE2 receptor. The H-bond interactions between the native complex and destabilizing nonsynonymous mutations are shown in Fig. 7 a and Fig. S1, respectively. The four stabilizing nonsynonymous mutations (H49Y, S50L, D614G, and P1143L) of S-protein_ACE2 complexes show a greater number of h-bonds and other nonsynonymous mutations (N501Y andA845V) shows the same number of h-bonds compared to the native complex. (Fig. 6). The hydrogen bond interactions between the stabilizing nonsynonymous mutations (H49Y, S50L, N501Y, and D614G) of the S-protein with the ACE2 receptor are depicted in Fig. 7b, Fig. 7c, Fig. 7d, Fig. 7e b–e . The other two stabilization nonsynonymous mutations are depicted in Fig. S2. The essential binding site residues of native and mutant S-protein and ACE2 receptor complexes are indicated in Table 4a, Table 4ba–b.
Fig. 7a

Native S-protein residue interaction with ACE-2 receptor prepared by Ligplot. The S-protein is represented by a brown color, while the ACE2 protein is represented by a purple color. The green dashed lines represent hydrogen bonding interactions.

Fig. 7b

H49Y nonsynonymous mutant S-protein residue interaction with ACE-2 receptor prepared by Ligplot. The S-protein is represented by a brown color, while the ACE2 protein is represented by a purple color. The green dashed lines represent hydrogen bonding interactions.

Fig. 7c

S50L nonsynonymous mutant S-protein residue interaction with ACE-2 receptor prepared by Ligplot. The S-protein is represented by a brown color, while the ACE2 protein is represented by a purple color. The green dashed lines represent hydrogen bonding interactions.

Fig. 7d

N501Y nonsynonymous mutant S-protein residue interaction with ACE-2 receptor prepared by Ligplot. The S-protein is represented by a brown color, while the ACE2 protein is represented by a purple color. The green dashed lines represent hydrogen bonding interactions.

Fig. 7e

D614G nonsynonymous Mutant S-protein residue interaction with ACE-2 receptor prepared by Ligplot. The S-protein is represented by a brown color, while the ACE2 protein is represented by a purple color. The green dashed lines represent hydrogen bonding interactions.

Native S-protein residue interaction with ACE-2 receptor prepared by Ligplot. The S-protein is represented by a brown color, while the ACE2 protein is represented by a purple color. The green dashed lines represent hydrogen bonding interactions. H49Y nonsynonymous mutant S-protein residue interaction with ACE-2 receptor prepared by Ligplot. The S-protein is represented by a brown color, while the ACE2 protein is represented by a purple color. The green dashed lines represent hydrogen bonding interactions. S50L nonsynonymous mutant S-protein residue interaction with ACE-2 receptor prepared by Ligplot. The S-protein is represented by a brown color, while the ACE2 protein is represented by a purple color. The green dashed lines represent hydrogen bonding interactions. N501Y nonsynonymous mutant S-protein residue interaction with ACE-2 receptor prepared by Ligplot. The S-protein is represented by a brown color, while the ACE2 protein is represented by a purple color. The green dashed lines represent hydrogen bonding interactions. D614G nonsynonymous Mutant S-protein residue interaction with ACE-2 receptor prepared by Ligplot. The S-protein is represented by a brown color, while the ACE2 protein is represented by a purple color. The green dashed lines represent hydrogen bonding interactions.

MD simulation of native and mutants (N501Y and D614G) S-proteins

The recent studies reported that N501Y and d614G stabilized nonsynonymous mutations are the most promising in S-protein [[73], [74], [75], [76]]. Our analysis confirmed that both the nonsynonymous mutations (N501Y and d614G) stabilized and showed better interaction with ACE2. These results motivated us to observe the structural changes of native and these two advantageous nonsynonymous mutations (N501Y and d614G) at the atomic level. Hence, we implemented the MD simulation approach to investigate how the structural transition in the mutant (N501Y and d614G) S- proteins influencing the interaction with ACE2. We analyzed the total energy, Root mean square deviation (RMSD), Root mean square fluctuation (RMSF), Solvent accessible surface area (SASA), and Hydrogen bond (H-bonds) analysis to investigate the differences in structural variations between the native and mutant (N501Y and D614G) S-proteins. The average RMSD, RMSF, SASA, and H-bond values of the native and mutant (N501Y and D614G) structures are listed in Table 5 .
Table 5

The average values of RMSD matrix, RMSF, SASA, covariance value and NH-bonds value of native and nonsynonymous mutant (N501Y & D614G) spike proteins.

Type of ProteinParameters
RMSD (nm)RMSFSASA (nm2)Covariance value (nm2)NH-bond
Native2.730.81 ± 0.50765.55 ± 4.121149.32849.64 ± 23.80
N501Y2.880.87 ± 0.46773.86 ± 4.041221.26859.02 ± 23.70
D614G3.010.95 ± 0.55768.55 ± 3.841551.80854.31 ± 21.09
The average values of RMSD matrix, RMSF, SASA, covariance value and NH-bonds value of native and nonsynonymous mutant (N501Y & D614G) spike proteins. To investigate the convergence of the native and mutant (N501Y and D614G) protein systems, the total energy was measured and displayed in Fig. 8 a. The native and mutant (N501Y and D614G) S-protein systems show the convergence from the beginning to the end of the simulation. To investigate the stability of the native and mutant (N501Y and D614G) protein system, the RMSD matrix for all Cα-atoms from the initial structure was measured (Fig. 8b). Fig. 8b shows that the RMSD value of the native and mutant structures (N501Y and D614G) is 2.73 nm, 2.88 nm, and 3.01 nm, respectively (Table 5). Furthermore, the RMSF value analysis revealed a significant difference in the fluctuation of residues between the native and mutant (N501Y and D614G) S-proteins (Fig. 8c).
Fig. 8

a–e: The total energy, RMSD matrix, RMSF, SASA, and NH-bonds of native and nonsynonymous mutants (N501Y and D614G) of the S-protein versus time at 300 K.

a–e: The total energy, RMSD matrix, RMSF, SASA, and NH-bonds of native and nonsynonymous mutants (N501Y and D614G) of the S-protein versus time at 300 K. The Mutants N501Y and D614G structures have a greater degree of fluctuation in the residue of 501 and 614 along with neighboring residues than native S-protein throughout the simulation and shown in Fig. 8c. By analyzing the solvent-accessible surface area (SASA) Plots, we determined the geometry and surface of native and mutant (N501Y and D614G) S- proteins. The changes of SASA of native and mutant (N501Y and D614G) S-protein over time are depicted in Fig. 8d. In Fig. 8d, from the beginning to the end of the simulation, the N501Y and D614G mutants have higher fluctuation in the SASA values compared to native structure (Fig. 8d). The protein folding, stability, and function are all dependent on the hydrogen bond. To better understand the stability of native and mutant (N501Y and D614G) S-proteins, we measured the intramolecular H-bond concerning time (Fig. 8e). From the beginning to the end of the simulation, both the mutants show higher numbers of h-bonds than the native structure (Fig. 8e). Further, we performed PCA analysis to attain the motion of S- protein upon mutation. The projection of the first two eigenvectors in the PCA plot (Fig. 9 ) shows that the structures of the mutants cover a broader region of phase space in both PC1 and PC2 planes than the native S-protein, again indicating the expansion in the structures. The covariance value of native and mutant structures is listed in Table 5. This further confirms the overall increased flexibility of the mutants (N501Y and D614G) compared to the native S-protein at 300 K. Overall, the mutant structures (N501Y and D614G) exhibited more motion and flexibility than the native S-protein.
Fig. 9

The projection of native and nonsynonymous mutant spike proteins motion in dimensional space through the initial two significant eigenvectors at 300 K. (a) The native is depicted in black, the N501Y nonsynonymous mutant is depicted in green, and the D614G nonsynonymous mutant is depicted in yellow. Each trajectory is also illustrated independently in (b), (c), and (d) for clarification.

The projection of native and nonsynonymous mutant spike proteins motion in dimensional space through the initial two significant eigenvectors at 300 K. (a) The native is depicted in black, the N501Y nonsynonymous mutant is depicted in green, and the D614G nonsynonymous mutant is depicted in yellow. Each trajectory is also illustrated independently in (b), (c), and (d) for clarification.

Discussion

Nonsynonymous mutations in the S-protein may influence the stability of its structure and interaction with the ACE2 receptor, which provides clues for viral infection and disease spread. Identifying the mutational effect on the S-protein and its interaction with the ACE2 receptor is very important for predicting the 3D conformation of the S-protein. Unfortunately, the entire length of the S-protein structure is not available in the protein data bank. We have used the earlier proved studies [[33], [34], [35], [36], [37], [38]] to fix this issue. Hence, we have preferred the I-TASSER online server to predict the 3D structure of the S-protein. Further refinement of predicted model S-protein, the MDS performed to assess the stability of model structure for further studies. Fig. 3 shows that the RMSD of native S-protein is converged after 7 ns and produces stable conformations. Further, the average structure of the native s-protein was shown at regular intervals (Fig. 3). The most favorable structure of native S-protein was selected at the 12 ns MD simulation, which was then used to build the mutant structures and validated with PROCHECEK and PROSA programs. The sequence and structure-based in-silico tools predict destabilizing and stabilizing nonsynonymous mutations in S-protein associated with COVD-19 infection. The sequence-based approach predicted 40 and 22 nonsynonymous mutations with decreased stability and increased stability. The structure-based approach predicted 36 and 26 nonsynonymous mutations as destabilizing and stabilizing, respectively. Combination of both sequence and structure-based online servers predicted 11 (L8V, L8W, L18F, Y145H, M153T, F157S, G476S, L611F, A879S, C1247F, and C1254F) nonsynonymous mutations with decreased stability and six nonsynonymous mutations (H49Y, S50L, N501Y, D614G, A845V, and P1143L) with increased stability in the S-protein upon mutations (Table 1, Table 2). We considered these 11 destabilizing and six stabilizing nonsynonymous mutations for further docking analysis to investigate the interaction behavior with the ACE2 receptor. The screened nonsynonymous mutations in the S-protein may affect the stability of its structure and interaction with the ACE2 receptor, which provides clues for SARS-CoV-2 infection. Corroborating this further, we employed molecular docking and Molecular Mechanics, and energy merged with Generalized Born and Surface Area (MM/GBSA) computation to evaluate the affinity between the native and mutants (Destabilizing and stabilizing) of S-protein with the ACE-2 receptor. The results of the examination of the top complex are illustrated in Table 3a, Table 3ba–b & Fig. 5a, Fig. 5ba and b. The native complex (native S-protein_ACE2) shows more negative values in the HADDOCK score, which means there is higher binding interaction between the molecules (Table 3a & Fig. 7a). On the other hand, the destabilizing nonsynonymous mutations of S-protein and ACE2 complexes (L8V-ACE2, L8W-ACE2, L18F-ACE2, Y145H-ACE2, M153T-ACE2, F157S-ACE2, G476S-ACE2, L611F-ACE2, A879S-ACE2, C1247F-ACE2, and C1254F-ACE2) show less negative value than the native complex and indicate a lower binding interaction with biological partners (Table 3a & Fig. S1). On the other hand, the stabilizing nonsynonymous mutations of S-protein-ACE2 complexes have greater negative values as a HADDOCK score than the native S-protein-ACE2 complex and illustrate a better interaction between the biological partners (Table 3b Fig. 7b, Fig. 7c, Fig. 7d, Fig. 7eb–e, & Fig. S2). A more excellent BSA value facilitates an immediacy between the two molecules. The BSA, restraint and desolvation energy significantly associate with the HADDOCK docking score [[70], [71], [72]]. The S-protein loses its interaction with the ACE2 receptor upon destabilizing nonsynonymous mutations (L8V, L8W, L18F, Y145H, M153T, F157S, G476S, L611F, A879S, C1247F, and C1254F), whereas the stabilizing nonsynonymous mutations (H49Y, S50L, N501Y, D614G, A845V, and P1143L) in the S-protein shows increased interaction with the ACE2 receptor (Table 3a, Table 3ba–b). We applied the MM_GBSA approach to evaluate the binding free energy (ddG) between the native and mutants S-protein and ACE2 receptor molecules to verify this further. From Table 4a, it is thus again clear that the destabilizing nonsynonymous mutations show less binding free energy than the native complex, and they lose their interaction with the ACE2 receptor. The stabilizing nonsynonymous mutations (H49Y, S50L, N501Y, D614G, A845V, and P1143L) of S-protein and ACE2 complexes show more binding free energy native complex (Table 4b). Hydrogen bonds are the essential interactions in biological identification processes and vital for establishing the binding specificity [[57], [58], [59], [60]]. The intermolecular hydrogen bonds can provide favorable binding energy [77,78]. This principle is the destabilizing nonsynonymous mutations (L8V, L8W, L18F, Y145H, M153T, F157S, G476S, L611F, A879S, C1247F, and C1254F) have lost their binding affinity between the S-protein and the ACE2 receptor molecule, and this affects their function. The stabilizing nonsynonymous mutations (H49Y, S50L, N501Y, D614G, A845V, and P1143L) of the S-protein have a better binding affinity with ACE2 receptor complexes with the native complex (Table 4b). It is confirmed that all six stabilizing nonsynonymous mutations show better affinity with the ACE2 receptor compared to the native S-protein. Further, MDS was performed using several parameters like total energy RMSD matrix, RMSF, SASA, and NH-bond to evaluate the conformational transitions of native and mutant S-proteins. The total energy plot shows the convergence for the native and mutants S-protein system during the simulation and generates stable conformation, thus providing an appropriate foundation for further analysis (Fig. 8a). In the RMSD matrix plot, the mutant (N501Y and D614G) structures showed higher deviation values, whereas the native structure showed lower deviation values (Fig. 8b). It defines that the N501Y and D614 nonsynonymous mutations have structural transitions on the conformational geometry of S-protein. The RMSF values were calculated for native and mutant S-proteins to determine the dynamic behavior of residues (Fig. 8c). During simulation, higher flexibility was observed in both the mutant structures compared to native S-protein. It furthers confirms that mutant protein residues undergo a structural transition as compared to native S-protein. The variation in SASA of the native and mutant S-proteins with time is shown in Fig. 8d. Mutant (N501Y and D614G) S-protein structures showed higher values of SASA with time, whereas native structure shows lower values of SASA with time. It further confirms that mutant S-protein residues were flexible throughout the simulation and attained the extended conformation compared to native S-protein (Table 5, & Fig. 8b–d). For further reinforcement of our results, we performed the PCA analysis better to understand the flexibility behavior of native and mutant S-protein. From, Fig. 9, we observed that the mutation induces the structural transitions and causes the loss of the native conformation of the native S-protein, making it more flexible. As a result, the mutant S-protein structures (N501Y and D614G) had a larger fluctuation in RMSD, RMSF, SASA, and PCA, indicating that the native S-protein is undergoing a major structural shift, which could be the reason for better ACE2 receptor interaction. The number of H-bond (Fig. 8e) analyses confirms it. Due to the structural transitions in mutants S-protein, yield more hydrogen bonds than native S-protein, which could be the reason for better interaction with ACE2 receptors. Docking results well support this. The Mutant (N501Y and D614G) -ACE2 complex structures show similar or more h-bonds than other destabilizing nonsynonymous mutations (Fig. 6 & Table 4a, Table 4ba–b). Overall, the docking results confirm that the ACE2 receptor interacts with the residues of the RBD domain of the S-protein. The destabilizing nonsynonymous mutations alter the binding site residues of the RBD domain of S-protein and decrease the interaction with the ACE2 receptor. However, the stabilizing nonsynonymous mutations increase the interaction with ACE2. The MD simulation results confirm that the two advantageous stabilizing nonsynonymous mutations (N501Y and D614G) undergo the structural transitions and might be the reason for enhancing the interaction with ACE2. This confirms and provides evidence that was stabilizing nonsynonymous mutations have a high affinity with ACE2 and high virulent levels compared to native and destabilizing nonsynonymous mutations of the S-protein. There is some experimental evidence to corroborate our findings, which proves the better association between ACE2 and S-protein upon stabilizing nonsynonymous mutations [[79], [80], [81], [82], [83], [84]]. Many recent studies reported that mainly the stabilizing nonsynonymous mutations N501Y and D614G have a better affinity with the ACE2 receptor [[73], [74], [75], [76],[79], [80], [81], [82], [83], [84]]. This result will help experimental scientists understand the mechanism of the S-protein and its interaction with the ACE2 receptor upon nonsynonymous mutations.

Conclusion

Using the vast bioinformatics approaches, we segregated and screened the destabilizing and stabilizing nonsynonymous mutations in the Spike protein. Further, we examined the interaction between the S-protein and the ACE2 receptor. The docking and free binding energy (ddG) scores exhibited destabilizing nonsynonymous mutations (L8V, L8W, L18F, Y145H, M153T, F157S, G476S, L611F, A879S, C1247F, and C1254F) show lower interaction with the ACE2 receptor compared to native S-protein. On the other hand, the stabilizing nonsynonymous mutations (H49Y, S50L, N501Y, D614G, A845V, and P1143L) show greater interaction with the ACE2 receptor than native and destabilizing nonsynonymous mutations of the S-protein. Thus, the MD simulation of stabilizing nonsynonymous mutations (N501Y and D614G) undergo structural changes and might influence interaction with ACE2. The outcome of this study can support the other scientists, enabling them to comprehend the virulent level of nonsynonymous mutations in the Spike protein and provide clues regarding a mutation's role in infection and disease spread, contributing to the development of effective therapy against SARS-CoV-2.

Declaration of competing interest

We have no conflicts of interest to disclose.
  74 in total

1.  Assessing the performance of the MM/PBSA and MM/GBSA methods. 6. Capability to predict protein-protein binding free energies and re-rank binding poses generated by protein-protein docking.

Authors:  Fu Chen; Hui Liu; Huiyong Sun; Peichen Pan; Youyong Li; Dan Li; Tingjun Hou
Journal:  Phys Chem Chem Phys       Date:  2016-08-10       Impact factor: 3.676

2.  Evaluation of intrinsic binding energy from a hydrogen bonding group in an enzyme inhibitor.

Authors:  P A Bartlett; C K Marlowe
Journal:  Science       Date:  1987-01-30       Impact factor: 47.728

3.  iStable: off-the-shelf predictor integration for predicting protein stability changes.

Authors:  Chi-Wei Chen; Jerome Lin; Yen-Wei Chu
Journal:  BMC Bioinformatics       Date:  2013-01-21       Impact factor: 3.169

4.  A pneumonia outbreak associated with a new coronavirus of probable bat origin.

Authors:  Peng Zhou; Xing-Lou Yang; Xian-Guang Wang; Ben Hu; Lei Zhang; Wei Zhang; Hao-Rui Si; Yan Zhu; Bei Li; Chao-Lin Huang; Hui-Dong Chen; Jing Chen; Yun Luo; Hua Guo; Ren-Di Jiang; Mei-Qin Liu; Ying Chen; Xu-Rui Shen; Xi Wang; Xiao-Shuang Zheng; Kai Zhao; Quan-Jiao Chen; Fei Deng; Lin-Lin Liu; Bing Yan; Fa-Xian Zhan; Yan-Yi Wang; Geng-Fu Xiao; Zheng-Li Shi
Journal:  Nature       Date:  2020-02-03       Impact factor: 69.504

5.  The new SARS-CoV-2 strain shows a stronger binding affinity to ACE2 due to N501Y mutant.

Authors:  Fedaa Ali; Amal Kasry; Muhamed Amin
Journal:  Med Drug Discov       Date:  2021-03-02

6.  ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins.

Authors:  Markus Wiederstein; Manfred J Sippl
Journal:  Nucleic Acids Res       Date:  2007-05-21       Impact factor: 16.971

7.  Bioinformatic prediction of potential T cell epitopes for SARS-Cov-2.

Authors:  Kazuma Kiyotani; Yujiro Toyoshima; Kensaku Nemoto; Yusuke Nakamura
Journal:  J Hum Genet       Date:  2020-05-06       Impact factor: 3.172

8.  Cell entry mechanisms of SARS-CoV-2.

Authors:  Jian Shang; Yushun Wan; Chuming Luo; Gang Ye; Qibin Geng; Ashley Auerbach; Fang Li
Journal:  Proc Natl Acad Sci U S A       Date:  2020-05-06       Impact factor: 11.205

9.  Analysis of therapeutic targets for SARS-CoV-2 and discovery of potential drugs by computational methods.

Authors:  Canrong Wu; Yang Liu; Yueying Yang; Peng Zhang; Wu Zhong; Yali Wang; Qiqi Wang; Yang Xu; Mingxue Li; Xingzhou Li; Mengzhu Zheng; Lixia Chen; Hua Li
Journal:  Acta Pharm Sin B       Date:  2020-02-27       Impact factor: 11.413

10.  A SARS-CoV-2 protein interaction map reveals targets for drug repurposing.

Authors:  David E Gordon; Gwendolyn M Jang; Mehdi Bouhaddou; Jiewei Xu; Kirsten Obernier; Kris M White; Matthew J O'Meara; Veronica V Rezelj; Jeffrey Z Guo; Danielle L Swaney; Tia A Tummino; Ruth Hüttenhain; Robyn M Kaake; Alicia L Richards; Beril Tutuncuoglu; Helene Foussard; Jyoti Batra; Kelsey Haas; Maya Modak; Minkyu Kim; Paige Haas; Benjamin J Polacco; Hannes Braberg; Jacqueline M Fabius; Manon Eckhardt; Margaret Soucheray; Melanie J Bennett; Merve Cakir; Michael J McGregor; Qiongyu Li; Bjoern Meyer; Ferdinand Roesch; Thomas Vallet; Alice Mac Kain; Lisa Miorin; Elena Moreno; Zun Zar Chi Naing; Yuan Zhou; Shiming Peng; Ying Shi; Ziyang Zhang; Wenqi Shen; Ilsa T Kirby; James E Melnyk; John S Chorba; Kevin Lou; Shizhong A Dai; Inigo Barrio-Hernandez; Danish Memon; Claudia Hernandez-Armenta; Jiankun Lyu; Christopher J P Mathy; Tina Perica; Kala Bharath Pilla; Sai J Ganesan; Daniel J Saltzberg; Ramachandran Rakesh; Xi Liu; Sara B Rosenthal; Lorenzo Calviello; Srivats Venkataramanan; Jose Liboy-Lugo; Yizhu Lin; Xi-Ping Huang; YongFeng Liu; Stephanie A Wankowicz; Markus Bohn; Maliheh Safari; Fatima S Ugur; Cassandra Koh; Nastaran Sadat Savar; Quang Dinh Tran; Djoshkun Shengjuler; Sabrina J Fletcher; Michael C O'Neal; Yiming Cai; Jason C J Chang; David J Broadhurst; Saker Klippsten; Phillip P Sharp; Nicole A Wenzell; Duygu Kuzuoglu-Ozturk; Hao-Yuan Wang; Raphael Trenker; Janet M Young; Devin A Cavero; Joseph Hiatt; Theodore L Roth; Ujjwal Rathore; Advait Subramanian; Julia Noack; Mathieu Hubert; Robert M Stroud; Alan D Frankel; Oren S Rosenberg; Kliment A Verba; David A Agard; Melanie Ott; Michael Emerman; Natalia Jura; Mark von Zastrow; Eric Verdin; Alan Ashworth; Olivier Schwartz; Christophe d'Enfert; Shaeri Mukherjee; Matt Jacobson; Harmit S Malik; Danica G Fujimori; Trey Ideker; Charles S Craik; Stephen N Floor; James S Fraser; John D Gross; Andrej Sali; Bryan L Roth; Davide Ruggero; Jack Taunton; Tanja Kortemme; Pedro Beltrao; Marco Vignuzzi; Adolfo García-Sastre; Kevan M Shokat; Brian K Shoichet; Nevan J Krogan
Journal:  Nature       Date:  2020-04-30       Impact factor: 69.504

View more
  5 in total

1.  VOC-alarm: Mutation-based prediction of SARS-CoV-2 variants of concern.

Authors:  Hongyu Zhao; Kun Han; Chao Gao; Vithal Madhira; Umit Topaloglu; Yong Lu; Guangxu Jin
Journal:  Bioinformatics       Date:  2022-05-31       Impact factor: 6.931

Review 2.  Methodology-Centered Review of Molecular Modeling, Simulation, and Prediction of SARS-CoV-2.

Authors:  Kaifu Gao; Rui Wang; Jiahui Chen; Limei Cheng; Jaclyn Frishcosy; Yuta Huzumi; Yuchi Qiu; Tom Schluckbier; Xiaoqi Wei; Guo-Wei Wei
Journal:  Chem Rev       Date:  2022-05-20       Impact factor: 72.087

3.  Spike Mutation Profiles Associated With SARS-CoV-2 Breakthrough Infections in Delta Emerging and Predominant Time Periods in British Columbia, Canada.

Authors:  Chad D Fibke; Yayuk Joffres; John R Tyson; Caroline Colijn; Naveed Z Janjua; Chris Fjell; Natalie Prystajecky; Agatha Jassem; Hind Sbihi
Journal:  Front Public Health       Date:  2022-07-04

Review 4.  Passive Immunotherapy Against SARS-CoV-2: From Plasma-Based Therapy to Single Potent Antibodies in the Race to Stay Ahead of the Variants.

Authors:  William R Strohl; Zhiqiang Ku; Zhiqiang An; Stephen F Carroll; Bruce A Keyt; Lila M Strohl
Journal:  BioDrugs       Date:  2022-04-27       Impact factor: 7.744

5.  Evolutionary progression of collective mutations in Omicron sub-lineages towards efficient RBD-hACE2: Allosteric communications between and within viral and human proteins.

Authors:  Victor Barozi; Adrienne L Edkins; Özlem Tastan Bishop
Journal:  Comput Struct Biotechnol J       Date:  2022-08-17       Impact factor: 6.155

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.