Literature DB >> 34858396

Rapid Assessment of Binding Affinity of SARS-COV-2 Spike Protein to the Human Angiotensin-Converting Enzyme 2 Receptor and to Neutralizing Biomolecules Based on Computer Simulations.

Damiano Buratto1, Abhishek Saxena1, Qun Ji1, Guang Yang1, Sergio Pantano1,2, Francesco Zonta1.   

Abstract

SARS-CoV-2 infects humans and causes Coronavirus disease 2019 (COVID-19). The S1 domain of the spike glycoprotein of SARS-CoV-2 binds to human angiotensin-converting enzyme 2 (hACE2) via its receptor-binding domain, while the S2 domain facilitates fusion between the virus and the host cell membrane for entry. The spike glycoprotein of circulating SARS-CoV-2 genomes is a mutation hotspot. Some mutations may affect the binding affinity for hACE2, while others may modulate S-glycoprotein expression, or they could result in a virus that can escape from antibodies generated by infection with the original variant or by vaccination. Since a large number of variants are emerging, it is of vital importance to be able to rapidly assess their characteristics: while changes of binding affinity alone do not always cause direct advantages for the virus, they still can provide important insights on where the evolutionary pressure is directed. Here, we propose a simple and cost-effective computational protocol based on Molecular Dynamics simulations to rapidly screen the ability of mutated spike protein to bind to the hACE2 receptor and selected neutralizing biomolecules. Our results show that it is possible to achieve rapid and reliable predictions of binding affinities. A similar approach can be used to perform preliminary screenings of the potential effects of S-RBD mutations, helping to prioritize the more time-consuming and expensive experimental work.
Copyright © 2021 Buratto, Saxena, Ji, Yang, Pantano and Zonta.

Entities:  

Keywords:  COVID-19; SARS-CoV-2; Spike-RBD; binding affinity; human ACE2; molecular dynamics; neutralizing antibodies; protein-protein interaction

Mesh:

Substances:

Year:  2021        PMID: 34858396      PMCID: PMC8632240          DOI: 10.3389/fimmu.2021.730099

Source DB:  PubMed          Journal:  Front Immunol        ISSN: 1664-3224            Impact factor:   7.561


Introduction

Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) causes pneumonia/severe respiratory infection in humans called Coronavirus disease 2019 (COVID-19). The first cases of COVID-19 were reported in December 2019 from Wuhan, China (1). At the moment of writing this manuscript, SARS-CoV-2 infection is reported in ~180 million people resulting in ~5 million deaths (2). SARS-CoV-2 is an enveloped virus belonging to a diverse subgenus sarbecovirus within the Betacoronaviruses, a lineage of viruses that use bats as reservoirs and can be transmitted into other mammals (3–8). SARS-CoV-2 is genetically distinct from severe acute respiratory syndrome coronavirus (SARS-CoV) and Middle East respiratory syndrome coronavirus (MERS-CoV). Its closest known relative is Bat CoV RaTG13 (4–6), with 96.3% of gene identity. The single-stranded RNA genome of SARS-CoV-2 encodes Spike (S), Envelope (E), Membrane (M), and Nucleocapsid (N) structural proteins (5). The Spike (S) glycoprotein comprising S1 and S2 subdomains interacts with human angiotensin-converting enzyme 2 (hACE2) present primarily on pneumocytes/lung immune cells for attachment (via S1 c-terminal receptor-binding domain S-RBD), fusion, and virus entry into the host cell (via S2) (9–12). S-RBD also has a role in cross-species transmission and evolution of SARS-CoV-2 (8, 12–15). It is also the major immune determinant of a human neutralizing immune response upon natural infection and vaccination (16–19). Although S-RBD plays a critical role in viral infectivity and transmission, it is highly variable among sarbecoviruses and possibly a hotspot of complex selective pressure which shapes SARS-CoV-2 evolution (8, 20–22). Recombination events in the genome contribute to CoVs evolution, and recombination breakpoints are evident in the SARS-CoV-2 genome at the beginning and end of the S-RBD (6, 9, 23). Mutations within the S-RBD can increase affinity for ACE2, transmissibility, and mediate immune escape (8, 24–28). Computer simulations have been widely used to provide important insight into the role of mutations within the S-RBD. Molecular dynamics (MD) simulations-based analyses show some of the earliest known S-RBD mutations (F342L, N352D/D364Y, V367F, W436R, and V483A) can increase binding affinities and favor hACE2 interaction (29–31). Another study on the B.1.135 (K417N/E484K/N501Y) variant suggests that while N501Y alone could improve the binding affinity, the other two mutations reduced it, possibly causing a non-net change in binding properties (26). In agreement with this hypothesis, experimental mutational scanning suggest that N501Y/N501T slightly increase hACE2 binding while K417N/K417T enhance S-RBD expression, and E484K did not cause any significant phenotypic change (8). In this paper, we report a computational protocol to rapidly assess the binding affinity of the S-RBD to the hACE2 receptor. We applied our method on some early reported mutations (G476S, V483A, H519Q, and H520), the triple mutant B.1.135 K417N/E484K/N501Y, first isolated in South Africa – beta variant, and triple mutant P.1 K417T/E484K/N501Y, first isolated in Brazil – gamma variant, investigating their effect at the molecular level. Our results suggest that none of the mutations causes an essential increase of the K properties of the Spike protein. Still, we observe a significant, albeit small, decrease of binding affinity for the two triple mutants. However, we observe a more marked reduction of the binding of spike protein to an artificial neutralizing nanobody caused by E484K, a mutation found in several variants of concern and which has been reported to produce a virus able to escape neutralizing antibodies (32–35).

Material and Methods

Molecular Modeling and Dynamics

The model of the reference type (RT) variant Covid-19 RBD in complex with hACE2 receptor was derived by the X-RAY crystal structure PDBID 6LZG (36). All the mutants of the Covid-19 S-RBD were created starting from this model using CHIMERA (37). Each model was solvated with TIP3P water, containing Cl- and K+ ions at a concentration of ∼0.15 M to mimic the physiological ionic strength. After solvation, the total number of atoms for each system was around 1.7 × 105. MD simulations were carried on using the Gromacs 2018 package (38) and the Amber14SB force field (39), following simulation protocols similar to those we used in our previous works (40–43). Specifically, after energy minimization, we performed 200 ps of Simulated Annealing to allow side chains to equilibrate after each mutation is introduced. We then performed two short simulations lasting 100 ps, first in the NVT and then in the NPT ensembles, both with positional restraints (being the position restraint constant ) on the heavy atoms of the protein. Finally, we performed equilibrium MD simulation under periodic boundary conditions at constant pressure for 50 ns. Analyses were performed only on the last 25 ns after equilibration, as explained in the Results section. Temperature T and pressure P were kept constant during the equilibrium MD simulation, at 300 K and 1 atm, respectively, using the Berendsen thermostat and barostat (44). Fast smooth Particle–Mesh Ewald summation (45) was used for long-range electrostatic interactions, with a cut-off of 1.0 nm for the direct interactions. Each simulation was performed in five identical replicas: while classical MD simulations are in principle deterministic, parallel computing algorithms currently implemented in MD software can produce different trajectories. Nevertheless, experimental structures represent a thermodynamic average. Hence, replicating the simulations allows to check the results consistency and reduce the risk of being trapped by entropic barriers, thus improving the sampling of the configuration space available.

Binding Free Energy Computations

To produce fast and reliable predictions of the binding free energy, we use the PRODIGY web server (46, 47), which has been designed for this purpose. The results are then compared with those obtained with MM-PBSA (an acronym for Molecular Mechanics – Poisson Boltzmann and Surface Area continuum solvation approximation method), a more standard methodology, widely used in the field (48, 49). Binding free energies are calculated as ensemble averages over the configuration space explored by the five different replicas. To speed up the calculation, while maintaining a meaningful set of configurations for the energy calculations, we clustered the configuration space sampled by the various MD trajectories after equilibration (i.e., the last 25 ns each of the five replicas) according to their root mean square deviation (RMSD), and calculate the binding energy using one representative for 60 bigger clusters, being the clustering distance 1.2 Å. The final result is then obtained as the average of the free energy computed for each of these configurations. The results obtained by the two methods show correlation (R=0.82, ). However, the PRODIGY webserver is considerably faster than the MM-PBSA calculations (~50 times faster than our local MD-dedicated GPU cluster - this figure can be much higher if parallel computational resources are not available). Furthermore, it requires a much easier set-up so that the calculation can be performed by less experienced investigators. From the binding free energy difference, it is possible to estimate the change in binding affinity using thermodynamics theory, according to the expression (50): . The results computed with the PRODIGY webserver show correlation with experimental data ( ).

Recombinant Production of SARS-CoV-2 S-RBD Mutants and hACE2

Reference SARS-CoV-2 S-RBD (nt 22,517 – 23,185; MN908947) and hACE2-ECD (aa19 – aa617; Uniprot Q9BYF1) coding gene fragments were obtained by chemical synthesis. S-RBD reference gene with C-terminus hexahistidine tag was cloned in 5’NotI/3’BamHI restriction sites of pSCSTa plasmid under the control of CMV promoter and used to generate mutant constructs (G476S, V483A, H519Q, and A520S) by oligonucleotide-mediated PCR mutagenesis. All S-RBD proteins were transiently produced in FreeStyle 293f cells (Invitrogen). Culture supernatant containing protein was bound to Ni-NTA resin (Yeason Biotech), eluted with 500 mM imidazole in 20 mM HEPES/500 mM NaCl, buffer exchanged to 1x PBS and further cleaned by size exclusion chromatography using Superdex 200 10/300 column (GE Healthcare) on AKTA Avant150 FPLC system. The Human ACE2 gene was cloned into unique SfiI restriction sites in pFUSE-mIgG2A-Fc2 plasmid (Invivogen) under the control of a hEF1-HTLV-1 promoter and produced like S-RBD proteins. Culture supernatant containing hACE2-ECD-mFc was bound to Mabselect resin (GE Healthcare), eluted by Pierce IgG elution buffer (Thermo Scientific), and buffer exchanged to 1x PBS. All recombinant proteins were resolved on 4 – 12% gradient SDS-PAGE to ascertain purity and correct size.

Experimental Determination of the Binding Affinities Between S-RBD Variants and ACE2 Receptor

Maxisorp ELISA wells were coated with 200 nM S-RBD reference/mutant protein overnight at 4°C, blocked with 2% (w/v) skimmed milk at room temperature for 1 hour (h). To determine EC values, hACE2-ECD dilutions (two-fold; 250 nM – 0.015 nM) were added to the designated wells and incubated at room temperature for 1 h. To determine IC values, hACE2-ECD at predetermined EC concentration (for respective S-RBD reference/mutant) was mixed with the cognate S-RBD protein (two-fold dilution; 1000 – 0.487 nM) and incubated at room temperature for 1 h before being added to the designated S-RBD reference/mutant coated and blocked wells. The binding was detected with 1:1000 diluted anti-mouse IgG (Fc specific) PO labeled secondary antibody (Cell Signalling Technologies). The kinetics of S-RBD – hACE2-ECD interaction was analyzed by biolayer interferometry (BLI) using the Octet Red96 system (PALL ForteBio). HIS1K dip and read optical sensors (PALL ForteBio) were used to detect non-specific binding with the highest concentration of hACE2-ECD used in the assay, and passed sensors were subsequently loaded with 1000 nM S-RBD reference/mutant protein to reach a loading threshold of ~0.5 nm. Human ACE2-ECD dilutions (two-fold; 2500 – 312.5 nM) were used as an analyte to measure KD. Reference sensors with no load and reference well with only 1x kinetics buffer were used as controls.

Statistical Analysis

Statistical analysis of ELISA data was done using Prism software version 8.00 (GraphPad). EC and IC values were determined by nonlinear regression analysis, by fitting log (agonist concentration) vs. response and log (inhibitor concentration) vs. normalized response, respectively. The ICbinding curves (column means) were analyzed by two-way ANOVA with Dunnett’s multiple comparison tests. The biolayer interferometry (BLI) binding curves were generated and K were determined by fitting the curves globally and analyzed by the 1:1 model using Pall Forte Bio Octet Data Analysis Software version 10.0. The K values of three replicates so obtained were compared by two-way ANOVA with Tukey’s multiple comparison tests using Prism software version 8.00 (GraphPad). Binding free energies were computed from the representative configuration of the 60 more populated cluster. Values on , are presented as averages and standard errors of the mean. The errors for the variation of binding affinity were computed with the error propagation formula, p-values were obtained using the Student t-test. Box plots were draw using Python and the Seaborn package.
Table 1

Table of computed ΔG and ΔΔG (ΔGvariant – ΔGRT).

ΔG kcal/molσΔG kcal/molΔΔG kcal/molp-value
Reference Type-12.020.06
G476S-11.400.080.617.8*10-8
V483A-12.150.08-0.130.21
H519Q-12.180.08-0.170.11
A520S-11.880.080.130.19
N501Y E484K K417N-11.110.070.903.7*10-16
K417N-11.970.080.050.62
N501Y E484K K417T-11.030.090.983.3*10-15
K417T-11.840.070.170.08
N501Y-11.630.080.392.5*10-4
E484K-12.100.08-0.090.41

The table reports averages, standard error of the mean and p-values of the difference between the binding affinities of S-RBD and hACE2 receptor. The relative binding affinity ΔΔG use the RT as reference.

Table 2

Table of computed ΔG and ΔΔG (ΔGgamma variant – ΔGRT) to neutralizing proteins.

ΔG kcal/molσΔG kcal/molΔΔG kcal/molp-value
RT  - nanoBody  (7JVB)-10.360.09
Gamma - nanoBody  (7JVB)-9.130.061.233.0*10-17
RT  - miniprotein (7JZM)-9.190.05
Gamma - miniprotein (7JZM)-9.060.050.130.06
RT  - miniprotein (7JZU)-9.920.06
Gamma - miniprotein (7JZU)-9.700.040.231.1*10-3

The table reports averages, error of the mean and p-values of the difference between the binding affinities of gamma variant S-RBD and different neutralizing proteins. The relative binding affinity ΔΔG use the RT as reference.

Table of computed ΔG and ΔΔG (ΔGvariant – ΔGRT). The table reports averages, standard error of the mean and p-values of the difference between the binding affinities of S-RBD and hACE2 receptor. The relative binding affinity ΔΔG use the RT as reference. Table of computed ΔG and ΔΔG (ΔGgamma variant – ΔGRT) to neutralizing proteins. The table reports averages, error of the mean and p-values of the difference between the binding affinities of gamma variant S-RBD and different neutralizing proteins. The relative binding affinity ΔΔG use the RT as reference.

Results

Sampling of the Interaction Between the S-RBD and the hACE2 Receptor

A reliable computation of the binding free energy between two proteins should take into account the possibility of their dynamical rearrangement and extensive sampling (50). X-ray structures can be considered as faithful representations of the energy minima, but cannot take into account a very important contribution to their binding affinity, i.e., the temperature effects on the two interacting proteins. Furthermore, when introducing a mutation in a structural model, it is likely that the local structure will not be well equilibrated. For both these reasons, we performed MD simulations of each possible pairs of spike-hACE2 receptor proteins. Each simulation was repeated in five different replicas to further improve the configurational sampling and reducing the probability of being trapped in local minima (see Methods section). Analysis of the root mean square deviation (RMSD - ) of the various trajectories shows that the S-RBD finds its equilibrium position on average after 25 ns. For this reason, we decided to carry on the following analysis on the second half of each trajectory. From the dynamic point of view, all the different variants behave similarly. Analysis of the contact maps between the two proteins reveals that in the RT variant, the interaction is mainly mediated by residues Lys417, Tyr449, Leu455, Phe456, Ala475, Phe 486, Asn487, Tyr489, Gln493, Gly496, Gln498, Thr500, Asn501, Gly502 and Tyr505 (interaction probability higher than 90% along the trajectory, see , , ). No significant difference is observed between the RT and the single point mutations G476S, V483A, H519Q, A520S. It is worth noticing that only the first two mutants are in proximity of the binding region, while the other two are far from it, and we do not expect to see any effect on the binding to the hACE2 receptor caused by them.
Figure 1

Interactions between the spike protein and the hACE2 receptor. The top panel shows the binding between the S-RBD region of the RT and the hACE2 receptor. The interaction interface is shown in cartoon representation (white for hACE2 and cyan for S-RBD), while the rest of the protein is represented according to its surface (pink for hACE2 and light blue for S-RBD). The bottom panels show the differences between RT (cyan), beta (orange), and gamma (green) variants in correspondence with the position of the three S-RBD mutations (labeled with numbers 1-3 in the top panel). Relevant residues are shown with their side-chain representation in these panels. One can notice how residue Asn501 is at the center of a rich pattern of interactions, which is altered after its mutation to Tyr. On the other hand, Lys417 of the S-RBD interacts only with the Asp30 of the hACE2, and this interaction is broken after its modification to a non-basic amino acid. Finally, Glu484 does not show any critical interactions, and this does not change with the mutants.

Interactions between the spike protein and the hACE2 receptor. The top panel shows the binding between the S-RBD region of the RT and the hACE2 receptor. The interaction interface is shown in cartoon representation (white for hACE2 and cyan for S-RBD), while the rest of the protein is represented according to its surface (pink for hACE2 and light blue for S-RBD). The bottom panels show the differences between RT (cyan), beta (orange), and gamma (green) variants in correspondence with the position of the three S-RBD mutations (labeled with numbers 1-3 in the top panel). Relevant residues are shown with their side-chain representation in these panels. One can notice how residue Asn501 is at the center of a rich pattern of interactions, which is altered after its mutation to Tyr. On the other hand, Lys417 of the S-RBD interacts only with the Asp30 of the hACE2, and this interaction is broken after its modification to a non-basic amino acid. Finally, Glu484 does not show any critical interactions, and this does not change with the mutants. When looking at the two triple mutants, we can observe that mutations of Lys417 and Asp501 are slightly more impactful in affecting the interaction between the spike protein and the hACE2 receptor. Lys417 forms a salt bridge with Asp30 of hACE2, which is abolished upon lysine mutation to asparagine or threonine. The change from asparagine to tyrosine in position 501 forces a different arrangement of the spike protein residues Tyr499 and Gly496, affecting their interactions with Asp38, Gln32, and Lys353 of the hACE2 ( and ). Since Glu48 does not interact with the hACE2 receptor in the RT, its mutation to lysine does not produce critical differences in the contact map. In agreement with these observations, root mean square fluctuations (RMSF) show no evident changes in the dynamical behavior of the complex, especially in the contact region ( ).
Figure 2

Root Mean Square Fluctuation of the S-RBD variants (top panels) and hACE2 receptor (bottom panels) vs. residue index. The graphs are divided into three different panels for better readability. Amino acids belonging to the contact region are indicated by a blue line parallel to the x-axis. A direct comparison between the various trajectories reveals that the dynamic behavior of the complex S-RBD/hACE2 is not significantly affected by these mutations.

Root Mean Square Fluctuation of the S-RBD variants (top panels) and hACE2 receptor (bottom panels) vs. residue index. The graphs are divided into three different panels for better readability. Amino acids belonging to the contact region are indicated by a blue line parallel to the x-axis. A direct comparison between the various trajectories reveals that the dynamic behavior of the complex S-RBD/hACE2 is not significantly affected by these mutations.

Rapid Evaluation of Binding Free Energy Between the S-RBD and the hACE2 Receptor

In principle, binding free energy estimates can be computed from each of the configurations obtained from MD simulations. However, the computational cost for repeating the calculation on all of them would be extremely high, especially if we use standard methodology like MM-PBSA. Moreover, MD trajectories may be highly correlated on the short time scale. Hence, to ensure we are considering a wide variety of configurations, we clustered them using a 0.12 nm RMSD cutoff, and we computed the binding free energy only for one representative in each of the 60 bigger clusters. The final estimate is obtained as the average of the 60 representative configuration. To further reduce the time of computation of the binding free energy, we decided to use the PRODIGY web server (46), which produces results comparable with MM-PBSA calculations, being at the same time much less computationally expensive. Results are reported in and (free energy differences with the RT).
Figure 3

Box plots of the distribution of S-RBD and hACE2 receptor binding free energy. A boxplot is constructed of two parts, a box and a set of whiskers. The box is drawn from the first quartile (Q 1, the median of the lower half of the dataset) to the third quartile (Q 3, the median of the upper half of the dataset) with a horizontal line drawn in the middle to denote the median. The whiskers are drawn from the upper/lower quartile to the largest/lowest data point excluding any outliers. The outliers are shown with black diamonds. Statistically different distributions are indicated with a (*) symbol. G476 mutant, alpha (N501Y), beta, and gamma variants show a worse binding affinity. However, the differences in absolute value are small.

Box plots of the distribution of S-RBD and hACE2 receptor binding free energy. A boxplot is constructed of two parts, a box and a set of whiskers. The box is drawn from the first quartile (Q 1, the median of the lower half of the dataset) to the third quartile (Q 3, the median of the upper half of the dataset) with a horizontal line drawn in the middle to denote the median. The whiskers are drawn from the upper/lower quartile to the largest/lowest data point excluding any outliers. The outliers are shown with black diamonds. Statistically different distributions are indicated with a (*) symbol. G476 mutant, alpha (N501Y), beta, and gamma variants show a worse binding affinity. However, the differences in absolute value are small. None of the mutants show a significantly improved binding affinity, while mutants G476S, N501Y (alpha variant), and the two triple mutants (beta and gamma variants) show a significantly decreased binding affinity (P<<0.05) with a binding energy difference of 0.6 ± 0.8 kcal/mol, 0.4 ± 0.8 kcal/mol, 0.9 ± 0.7 kcal/mol, and 1.0 ± 0.8 kcal/mol respectively. These results are compatible with similar studies on the triple mutants (26, 51). All the other mutants show no significant differences (P>0.08). However, in all the cases the changes are small (less than 1kcal/mol), and the binding affinity changes are predicted to be within a 5-fold range.

Mutation E484K Reduces the Binding Affinity of the S Protein to a Potent Neutralizing Nanobody

To test whether spike mutations can result in a virus that is able to escape immune response, we explored the effect of the mutation present in the gamma variant on the binding affinity of the highly specific nanobody Nb20 (PDB ID 7JVB) reported by Xiang et al. (52). We also tested different kinds of neutralizing molecules, i.e., the highly specific miniproteins LCB1/LCB3 (PDB ID 7JZU and 7JZM respectively) designed by Cao et al. (53), using the same method described in the previous section. Our calculations revealed that the affinity of the nanobody to the gamma variant is significantly decreased when compared to the RT, with a difference in the binding energy of 1.2 ± 0.1 kcal/mol, which translates into approximately a 7.5-fold decrease of binding affinity. This may be in agreement with the significant reduction in binding affinity reported for this nanobody against the E484K mutation (54). The interaction of the miniproteins LCB1 and LCB3 with the gamma variant shows a small increase of 0.13 ± 0.06 kcal/mol (P=0.06) and 0.23 ± 0.07 kcal/mol (P<<0.05) in the binding affinity ( and ).
Figure 4

Box plots of the distribution of S-RBD and neutralizing molecules binding free energy (see for the box plot description). The three plots compare the binding free energy of the RT and the gamma variant to three neutralizing molecules (described in the text). Statistically different distributions are indicated with a (*) symbol.

Box plots of the distribution of S-RBD and neutralizing molecules binding free energy (see for the box plot description). The three plots compare the binding free energy of the RT and the gamma variant to three neutralizing molecules (described in the text). Statistically different distributions are indicated with a (*) symbol. A detailed analysis of the nanobody-S-RBD complex trajectory shows that the change in binding energy is primarily due to mutation E484K. Residue GLU484, indeed, is located inside a positively charged pocket and stably interacts with the side chain of two arginines and a tyrosine (Arg97, Arg31, and Tyr104 – ). These interactions are clearly disrupted by the mutation E484K that inverts the residue charge, forcing it out of the pocket. On the other hand, there are no notable differences in the interaction of the two miniproteins between the RT and gamma variants.
Figure 5

Details of the interaction between the nanobody and the gamma variant. The top panel shows the binding between the S-RBD region of the RT and the neutralizing nanobody Nb20. The interface of interaction is shown in cartoon representation (white for the Nb20 and cyan for S-RBD), while the rest of the protein is represented according to its surface (red for Nb20 and blue for S-RBD). The bottom panels show the differences between RT (cyan) and gamma variant (green) in stereographic representation. The positively charged Arg31 recognizes the negatively charged Glu484 in S-RBD in Nb20. This bond is clearly broken after the E484K mutation.

Details of the interaction between the nanobody and the gamma variant. The top panel shows the binding between the S-RBD region of the RT and the neutralizing nanobody Nb20. The interface of interaction is shown in cartoon representation (white for the Nb20 and cyan for S-RBD), while the rest of the protein is represented according to its surface (red for Nb20 and blue for S-RBD). The bottom panels show the differences between RT (cyan) and gamma variant (green) in stereographic representation. The positively charged Arg31 recognizes the negatively charged Glu484 in S-RBD in Nb20. This bond is clearly broken after the E484K mutation.

Experimental Validation of Computational Results on Single Point Variants

To validate the computational predictions, we performed an experimental comparative binding analysis using direct-binding and competitive ELISA experiments to determine EC values for S-RBD – hACE2-ECD interaction. This analysis is limited to single-point variants. To determine EC values of hACE-ECD binding to S-RBD reference/mutants, respective titration curves were generated using hACE-ECD dilutions on immobilized S-RBD protein ( ). Human ACE-ECD had a higher EC against S-RBD reference (23.75 nM) in comparison to the mutants (11.48 – 19.86 nM); however, this difference was only 1.19 to 2.06-fold. To determine IC of S-RBD – hACE2-ECD interaction, competitive binding reactions were set up by mixing hACE-ECD at a predetermined EC with dilutions of respective S-RBD protein. Human ACE-ECD bound to S-RBD mutants with a slightly higher IC (slightly lower apparent affinity) than S-RBD reference. We observed a statistically significant difference in the binding affinity of G476S (p=0.002) and A520S (p=0.007) S-RBD mutants compared to the reference ( ). However, consistent with the trend observed in EC values, the fold difference in IC for S-RBD reference/mutants was low (1.09 to 1.31). See for a summary of the results.
Figure 6

Interaction profiles of S-RBD reference/mutants and hACE2-ECD. (A) EC values of S-RBD – hACE2-ECD interaction were determined by direct binding ELISA using titration curves with hACE2-ECD dilutions, and (B) IC values of S-RBD – hACE2-ECD interaction were determined by competitive ELISA using titration curves with S-RBD dilutions and hACE2-ECD at a predetermined EC. Biolayer interferometry was used to generate association and dissociation curves of S-RBD – hACE2-ECD interaction for S-RBD reference (C), G476S (D), V483A (E), H519Q (F), and A520S (G); legends represent the nanomolar (nM) concentration of hACE2-ECD and K values are depicted.

Table 3

ELISA binding profiles of S-RBD reference/mutants and hACE2-ECD.

EC50 nMEC50 Error ( ± SD)IC50 nMIC50 Error ( ± SD)
Ref23.71.05123.211.05
G476S19.861.07162.551.04
V483A16.251.07134.581.06
H519Q11.481.07156.311.06
A520S13.391.07161.431.04

Geometric mean EC50 (double replicates; n=3 independent experiments) and IC50 (double replicates; n=2 independent experiments) values of S-RBD reference/mutants and hACE2-ECD interaction are tabulated with standard error of the mean ( ± SD).

Interaction profiles of S-RBD reference/mutants and hACE2-ECD. (A) EC values of S-RBD – hACE2-ECD interaction were determined by direct binding ELISA using titration curves with hACE2-ECD dilutions, and (B) IC values of S-RBD – hACE2-ECD interaction were determined by competitive ELISA using titration curves with S-RBD dilutions and hACE2-ECD at a predetermined EC. Biolayer interferometry was used to generate association and dissociation curves of S-RBD – hACE2-ECD interaction for S-RBD reference (C), G476S (D), V483A (E), H519Q (F), and A520S (G); legends represent the nanomolar (nM) concentration of hACE2-ECD and K values are depicted. ELISA binding profiles of S-RBD reference/mutants and hACE2-ECD. Geometric mean EC50 (double replicates; n=3 independent experiments) and IC50 (double replicates; n=2 independent experiments) values of S-RBD reference/mutants and hACE2-ECD interaction are tabulated with standard error of the mean ( ± SD). To further validate the binding trend obtained by ELISA IC values, biolayer interferometry (BLI) kinetics experiments were performed by loading HIS1K sensors with S-RBD reference/mutants at a concentration of 1000 nM, and hACE-ECD (two-fold dilutions; 2500 – 312.5 nM) was used as analyte. Consistent with ELISA IC trend, the calculated equilibrium dissociation constants (K range for S-RBD – hACE-ECD interaction was narrow (Ref – 46.3 ± 1.28 nM, G476S – 65.1 ± 1.17 nM, V483A 57.7 ± 0.96 nM, H519Q 53.4 ± 0.94 nM, and A520S 55.4 ± 1.05 nM) ( ). A statistically significant difference was observed in the K values of G476S and V483A, compared to reference S-RBD. Furthermore, like EC and IC, the K values differed on an average from 1.11 to 1.39-fold from the S-RBD reference. Further, the on (K 9.86 x 103 to 1.11 x 104 1/Ms) and off-rates (K 4.57 x 10-4 to 6.68 x 10-4 1/s) were largely similar for all S-RBD samples. The binding behavior of selected S-RBD mutants seems less likely to modulate S-RBD – hACE-ECD interaction. The experiments were performed in three replicas and results are summarized in .
Table 4

Kinetics profiles of S-RBD reference/mutants and hACE2-ECD.

KD (M)KD ErrorKON (1/Ms)KON ErrorKDIS (1/s)KDIS ErrorFull R 2
Rep-1
Ref4.63E-081.34E-099.86E+036.19E+014.57E-041.29E-050.9543
G476S6.51E-081.32E-091.03E+046.39E+016.68E-041.29E-050.9565
V483A5.77E-081.11E-091.11E+046.40E+016.39E-041.18E-050.9618
H519Q5.34E-081.05E-091.07E+045.63E+015.71E-041.08E-050.9688
A520S5.54E-081.14E-099.91E+035.24E+015.49E-041.09E-050.9694
Rep-2
Ref4.62E-081.21E-091.10E+047.04E+015.08E-041.29E-050.95
G476S6.32E-081.03E-091.29E+047.96E+018.17E-041.23E-050.9544
V483A4.78E-088.25E-101.39E+047.82E+016.65E-041.09E-050.9606
H519Q4.96E-088.50E-101.30E+046.94E+016.45E-041.05E-050.965
A520S5.42E-089.59E-101.16E+046.15E+016.30E-041.07E-050.9672
Rep-3
Ref4.16E-081.12E-091.21E+048.07E+015.03E-041.32E-050.9428
G476S5.70E-089.72E-101.42E+049.33E+018.11E-041.28E-050.9459
V483A4.50E-087.97E-101.53E+049.34E+016.91E-041.15E-050.9522
H519Q4.58E-088.27E-101.44E+048.49E+016.60E-041.13E-050.9554
A520S4.96E-089.09E-101.29E+047.26E+016.38E-041.11E-050.9607

Experimental KD values in molar (M) concentration, association rate constant [KON (1/Ms)], and dissociation rate constant [KDIS (1/s)] with error generated while fitting the binding curves from three replicates for S-RBD reference/mutants and hACE2-ECD interaction are tabulated.

Kinetics profiles of S-RBD reference/mutants and hACE2-ECD. Experimental KD values in molar (M) concentration, association rate constant [KON (1/Ms)], and dissociation rate constant [KDIS (1/s)] with error generated while fitting the binding curves from three replicates for S-RBD reference/mutants and hACE2-ECD interaction are tabulated. Comparison with computational estimates can be done using the formula (see method Section) where K is the average obtained from the 3 replicas. The correlation between the two set of data is R=0.996 ( ) showing that our method is able to achieve a qualitative correct prediction of the effect of the mutations on the binding affinity.

Discussion

In this paper, we performed an in silico screening of a selected number of SARS-COV2 variants and calculated the binding affinity between S-RBD and the hACE2 receptor. Results of the simulations, in agreement with experimental observations, do not show remarkable differences in the expected K for any of the variants. In some cases, variants show even a worsened affinity. However, variations in K not necessarily translate into a higher infectivity of the virus. Many different effects might be in play, and even mutations far from the binding site can improve the virus’s fitness. The D614G mutation constitutes a clear example. This variant introduced a mutation far away from the ACE2 interaction domain and displaced the original variant isolated in Wuhan worldwide in a couple of months. Similar effects have been recently reported for other mutations away from the recognition domain (55). Notwithstanding, the K remains a key factor to be analyzed. This is the reason why we think our contribution can be helpful to other researchers working in the design and identification of miniproteins and nanobodies against SARS-CoV-2 (52, 54, 56, 57). Having a preliminary screening of the effect of mutations on the binding affinity can help save time and other resources, especially during emergency situations, like the one we are experiencing in the current pandemic. The great diffusion of newer virus variants (58) suggests an evolutionary advantage due to the mutations, even though their affinity is not significantly changed. Several recent studies indicate that these mutations could lead to immune escape (32–35). To have an idea on how immune escape could happen at molecular level, we analyzed MD trajectories and computed the binding free energy of the gamma variant bound to a highly specific nanobody. We found that the affinity is indeed reduced by 1.2 kcal/mol and that this change is due mainly to the E484K mutation. This mutation can be found in several emerging SARS-COV-2 variants, and was shown to affect the binding of antibodies significantly. In other words, the virus can trade its ability to tightly bind to the hACE2 receptor in exchange for becoming more elusive to specific antibodies. While this mechanism cannot be generalized for the whole antibody population, we can see that position 484 is a good mutation spot for the virus, from an evolutionary point of view, since this residue only interacts with neutralizing antibodies and not with the hACE2 receptor. Indeed, other mutations have also been found in this position, such as the E484Q in the kappa variant. Spreading of similar variants could escape the antibody recognition and could require a periodical update of vaccines and monoclonal antibodies used in clinical applications to avoid a potential loss of efficacy (34). On the other hand, synthetic miniproteins that have been designed to mimic the structure of the hACE2 receptor, the natural binder for the S protein, are less affected by the mutation, and they still can work as bait. It is worth to notice that the absolute values of the binding energies calculated may depend on some of the computational details chosen (namely, force field, water models, specific methods to calculate binding energies, etc.). However, our method is able to achieve a qualitatively correct prediction of the effect of mutations at the protein-protein interface. Indeed, we obtained a high correlation with analogous values calculated using the MM-PBSA and with experimental results ( ). In summary, our work demonstrated that molecular simulations can be used to rapidly screen the effect of SARS-COVID-2 mutations, in particular concerning their ability to bind the hACE2 receptor or neutralizing molecules. This kind of analysis could be of primary importance as a preliminary screening and to produce working hypotheses that can help to prioritize the experimental study on the virus mutations.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors upon request, without undue reservation.

Author Contributions

DB built structural models and performed molecular dynamics simulations. QJ produced S-RBD and hACE2-ECD proteins and performed binding experiments by ELISA and BLI. DB, QJ, AS, GY, SP, and FZ analyzed data. DB, AS, and FZ wrote the first draft of the manuscript. FZ and SP wrote the final version of the manuscript. AS and GY supervised wet-lab experiments. FZ and SP supervised computational studies. All authors contributed to editing and approved the final draft.

Funding

This work was supported by the National Science Foundation of China (Grant No. 31770776 to FZ) and by FOCEM (MERCOSUR Structural Convergence Fund, COF 03/11 to SP).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  50 in total

1.  Relative Binding Free Energy Calculations in Drug Discovery: Recent Advances and Practical Considerations.

Authors:  Zoe Cournia; Bryce Allen; Woody Sherman
Journal:  J Chem Inf Model       Date:  2017-12-15       Impact factor: 4.956

2.  N501Y and K417N Mutations in the Spike Protein of SARS-CoV-2 Alter the Interactions with Both hACE2 and Human-Derived Antibody: A Free Energy of Perturbation Retrospective Study.

Authors:  Filip Fratev
Journal:  J Chem Inf Model       Date:  2021-11-22       Impact factor: 4.956

Review 3.  SARS-CoV and emergent coronaviruses: viral determinants of interspecies transmission.

Authors:  Meagan Bolles; Eric Donaldson; Ralph Baric
Journal:  Curr Opin Virol       Date:  2011-12       Impact factor: 7.090

4.  Contacts-based prediction of binding affinity in protein-protein complexes.

Authors:  Anna Vangone; Alexandre Mjj Bonvin
Journal:  Elife       Date:  2015-07-20       Impact factor: 8.140

5.  Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding.

Authors:  Roujian Lu; Xiang Zhao; Juan Li; Peihua Niu; Bo Yang; Honglong Wu; Wenling Wang; Hao Song; Baoying Huang; Na Zhu; Yuhai Bi; Xuejun Ma; Faxian Zhan; Liang Wang; Tao Hu; Hong Zhou; Zhenhong Hu; Weimin Zhou; Li Zhao; Jing Chen; Yao Meng; Ji Wang; Yang Lin; Jianying Yuan; Zhihao Xie; Jinmin Ma; William J Liu; Dayan Wang; Wenbo Xu; Edward C Holmes; George F Gao; Guizhen Wu; Weijun Chen; Weifeng Shi; Wenjie Tan
Journal:  Lancet       Date:  2020-01-30       Impact factor: 79.321

6.  SARS-CoV-2 infection protects against rechallenge in rhesus macaques.

Authors:  Abishek Chandrashekar; Jinyan Liu; Amanda J Martinot; Katherine McMahan; Noe B Mercado; Lauren Peter; Lisa H Tostanoski; Jingyou Yu; Zoltan Maliga; Michael Nekorchuk; Kathleen Busman-Sahay; Margaret Terry; Linda M Wrijil; Sarah Ducat; David R Martinez; Caroline Atyeo; Stephanie Fischinger; John S Burke; Matthew D Slein; Laurent Pessaint; Alex Van Ry; Jack Greenhouse; Tammy Taylor; Kelvin Blade; Anthony Cook; Brad Finneyfrock; Renita Brown; Elyse Teow; Jason Velasco; Roland Zahn; Frank Wegmann; Peter Abbink; Esther A Bondzie; Gabriel Dagotto; Makda S Gebre; Xuan He; Catherine Jacob-Dolan; Nicole Kordana; Zhenfeng Li; Michelle A Lifton; Shant H Mahrokhian; Lori F Maxfield; Ramya Nityanandam; Joseph P Nkolola; Aaron G Schmidt; Andrew D Miller; Ralph S Baric; Galit Alter; Peter K Sorger; Jacob D Estes; Hanne Andersen; Mark G Lewis; Dan H Barouch
Journal:  Science       Date:  2020-05-20       Impact factor: 47.728

7.  In Silico Investigation of the New UK (B.1.1.7) and South African (501Y.V2) SARS-CoV-2 Variants with a Focus at the ACE2-Spike RBD Interface.

Authors:  Bruno O Villoutreix; Vincent Calvez; Anne-Geneviève Marcelin; Abdel-Majid Khatib
Journal:  Int J Mol Sci       Date:  2021-02-08       Impact factor: 5.923

8.  Functional assessment of cell entry and receptor usage for SARS-CoV-2 and other lineage B betacoronaviruses.

Authors:  Michael Letko; Andrea Marzi; Vincent Munster
Journal:  Nat Microbiol       Date:  2020-02-24       Impact factor: 17.745

9.  Versatile and multivalent nanobodies efficiently neutralize SARS-CoV-2.

Authors:  Sham Nambulli; Zhengyun Xiao; Heng Liu; Yufei Xiang; Zhe Sang; W Paul Duprex; Dina Schneidman-Duhovny; Cheng Zhang; Yi Shi
Journal:  Science       Date:  2020-11-05       Impact factor: 47.728

10.  Emergence of SARS-CoV-2 through recombination and strong purifying selection.

Authors:  Xiaojun Li; Elena E Giorgi; Manukumar Honnayakanahalli Marichannegowda; Brian Foley; Chuan Xiao; Xiang-Peng Kong; Yue Chen; S Gnanakaran; Bette Korber; Feng Gao
Journal:  Sci Adv       Date:  2020-07-01       Impact factor: 14.957

View more
  6 in total

1.  In Silico Analyses on the Comparative Potential of Therapeutic Human Monoclonal Antibodies Against Newly Emerged SARS-CoV-2 Variants Bearing Mutant Spike Protein.

Authors:  Nabarun Chandra Das; Pritha Chakraborty; Jagadeesh Bayry; Suprabhat Mukherjee
Journal:  Front Immunol       Date:  2022-01-10       Impact factor: 7.561

2.  Evaluation of protein descriptors in computer-aided rational protein engineering tasks and its application in property prediction in SARS-CoV-2 spike glycoprotein.

Authors:  Hocheol Lim; Hyeon-Nae Jeon; Seungcheol Lim; Yuil Jang; Taehee Kim; Hyein Cho; Jae-Gu Pan; Kyoung Tai No
Journal:  Comput Struct Biotechnol J       Date:  2022-01-31       Impact factor: 7.271

3.  A network biology approach to identify crucial host targets for COVID-19.

Authors:  Ranjan Kumar Barman; Anirban Mukhopadhyay; Ujjwal Maulik; Santasabuj Das
Journal:  Methods       Date:  2022-03-29       Impact factor: 4.647

4.  Design and selection of peptides to block the SARS-CoV-2 receptor binding domain by molecular docking.

Authors:  Kendra Ramirez-Acosta; Ivan A Rosales-Fuerte; J Eduardo Perez-Sanchez; Alfredo Nuñez-Rivera; Josue Juarez; Ruben D Cadena-Nava
Journal:  Beilstein J Nanotechnol       Date:  2022-07-22       Impact factor: 3.272

5.  Evolution of Stronger SARS-CoV-2 Variants as Revealed Through the Lens of Molecular Dynamics Simulations.

Authors:  Alec J Wozney; Macey A Smith; Mobeen Abdrabbo; Cole M Birch; Kelsey A Cicigoi; Connor C Dolan; Audrey E L Gerzema; Abby Hansen; Ethan J Henseler; Ben LaBerge; Caterra M Leavens; Christine N Le; Allison C Lindquist; Rikaela K Ludwig; Maggie G O'Reilly; Jacob H Reynolds; Brandon A Sherman; Hunter W Sillman; Michael A Smith; Marissa J Snortheim; Levi M Svaren; Emily C Vanderpas; Aidan Voon; Miles J Wackett; Moriah M Weiss; Sanchita Hati; Sudeep Bhattacharyya
Journal:  Protein J       Date:  2022-08-01       Impact factor: 4.000

6.  In Silico Maturation of a Nanomolar Antibody against the Human CXCR2.

Authors:  Damiano Buratto; Yue Wan; Xiaojie Shi; Guang Yang; Francesco Zonta
Journal:  Biomolecules       Date:  2022-09-13
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.