Literature DB >> 33689337

Structure, Dynamics, Receptor Binding, and Antibody Binding of the Fully Glycosylated Full-Length SARS-CoV-2 Spike Protein in a Viral Membrane.

Yeol Kyo Choi¹, Yiwei Cao¹, Martin Frank², Hyeonuk Woo³, Sang-Jun Park¹, Min Sun Yeom⁴, Tristan I Croll⁵, Chaok Seok³, Wonpil Im¹.

Abstract

The spike (S) protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) mediates host cell entry by binding to angiotensin-converting enzyme 2 (ACE2) and is considered the major target for drug and vaccine development. We previously built fully glycosylated full-length SARS-CoV-2 S protein models in a viral membrane including both open and closed conformations of the receptor-binding domain (RBD) and different templates for the stalk region. In this work, multiple μs-long all-atom molecular dynamics simulations were performed to provide deeper insights into the structure and dynamics of S protein and glycan functions. Our simulations reveal that the highly flexible stalk is composed of two independent joints and most probable S protein orientations are competent for ACE2 binding. We identify multiple glycans stabilizing the open and/or closed states of the RBD and demonstrate that the exposure of antibody epitopes can be captured by detailed antibody-glycan clash analysis instead of commonly used accessible surface area analysis that tends to overestimate the impact of glycan shielding and neglect possible detailed interactions between glycan and antibodies. Overall, our observations offer structural and dynamic insights into the SARS-CoV-2 S protein and potentialize for guiding the design of effective antiviral therapeutics.

Entities: Chemical Disease Gene Mutation Species

Mesh：

Substances：

Year: 2021 PMID： 33689337 PMCID： PMC8047829 DOI： 10.1021/acs.jctc.0c01144

Source DB: PubMed Journal: J Chem Theory Comput ISSN： 1549-9618 Impact factor: 6.006

Introduction

The outbreak of Coronavirus disease 2019 (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) presents a tremendous threat to global public health. It caused over 35 million confirmed cases and more than 1 million deaths as of October, 2020. Due to the unavailability of antiviral medicines or approved vaccines, the current treatment strategy is supportive care to relieve symptoms and isolation of infected individuals to reduce transmission, which has placed a huge burden on the public healthcare system and led to massive social and economic distress. SARS-CoV-2 is an enveloped virus with a positive-sense single-stranded RNA genome.[1] The spike (S) protein anchored in the viral envelope is a class I fusion protein that mediates receptor binding and host cell entry by interacting with human angiotensin-converting enzyme 2 (ACE2),[2−4] and it is also the target of a variety of neutralizing antibodies.[5−8] The S protein is a homotrimer and each monomer has two subunits (S1 and S2) separated by a cleavage site that is recognized by host proteases.[9] A number of recently published structural studies using cryogenic electron microscopy (cryo-EM) have provided a good understanding of the S protein structure at near-atomic resolution.[4,10−12] The S1 subunit responsible for receptor binding is composed of the signal peptide (SP), N terminal domain (NTD), and receptor-binding domain (RBD), and the S2 subunit responsible for membrane fusion is composed of the fusion peptide (FP), two heptad repeats (HR1 and HR2), transmembrane domain (TM), and cytoplasmic domain (CP). The three RBDs on the top of the S protein head are conformationally variable. In closed conformations, all three RBDs lay flat with the receptor-binding motif occluded by the RBDs on the neighboring monomers. In open conformations, one or multiple RBDs lift up and expose the receptor-binding motif(s). Although the cryo-EM structures of the S protein have provided crucial information about its overall structure, highly flexible protein regions such as loops and stalk still remain unresolved. Molecular dynamics (MD) simulations provide molecular-level insights into the underlying mechanisms of biological functions that are difficult to elucidate only with experiments. Recently, cryo-electron tomography (cryo-ET) and MD simulations have been used to explore the conformational variability of the S protein stalk that gives the head orientational freedom and allows the S protein to scan the host cell surface.[13,14] However, it still remains unclear what structural freedoms in the stalk portion are determinant to the overall shape of the S protein and its orientation, and how they affect the binding to ACE2. In addition, MD simulations along with accessible surface area (ASA) calculations have been used to estimate the impact of glycan shielding on the exposure of antibody epitopes.[15] Mutations of two glycosylation sites have been performed to study the role of two N-linked glycans in stabilizing an RBD open conformation.[16] Further investigation is still required to evaluate whether the ASA difference between glycosylated and non-glycosylated structures truly reflects the impact of glycan shielding on antibody binding and whether glycans have more functional roles than stabilizing the open-state RBD. In this work, we present all-atom MD simulations of the fully glycosylated full-length S protein in a viral bilayer, and multiple μs-long trajectories were generated for the RBD in open and closed states and the S stalk built from different models. We also performed multiple μs-long simulations of non-glycosylated S head-only systems. Our results provide deeper insights into the functional roles of glycans that not only provide shielding for immune evasion but also contribute to the trimer stability and transition of RBD open and close states. Moreover, our simulations give insights into the essential structural roles of highly flexible stalk conformations in the S protein binding to ACE2.

Methods

Model Structures of the Fully Glycosylated Full-Length SARS-CoV-2 S Protein

An illustrative snapshot of a fully glycosylated full-length S protein structure in a viral membrane is shown in Figure . When there are many missing residues and domains, the initial models for MD simulations need to be carefully built and validated against available experimental data. We have built the models using GALAXY protein modeling suite[17−19] for missing residues and domains, ISOLDE[20] for refining initial models against experimental density maps, and CHARMM-GUI Glycan Reader and Modeler,[21−23] and Membrane Builder[24−26] for glycan and membrane building. The head of the S trimer was built based on cryo-EM structures (PDB ids: 6VSB(4) and 6VXX(12)). All three chains of 6VXX have the RBD in a closed conformation. One chain of 6VSB (A chain) has the RBD in an open conformation and the other two chains have the RBD in a closed conformation. Two models were selected for each of HR2 linker, HR2-TM, and CP, resulting in a total of 16 structures after the domain by domain assembly. The glycan sequences selected for 22 N-linked and 1 O-linked glycosylation sites of each monomer were based on the mass spectrometry data.[27,28] The detailed model generation is described in ref (29). The model name follows the model numbers used for HR2 linker, HR2-TM, and CP structures. For example, “6VSB_1_2_1” represents a model based on 6VSB with HR2 linker model 1 (M1), HR2-TM model 2 (M2), and CP model 1 (M1). All 16 S protein simulation systems and trajectories are available in CHARMM-GUI COVID-19 archive (http://www.charmm-gui.org/docs/archive/covid19).

Figure 1

Model structure of the fully glycosylated full-length SARS-CoV-2 S protein in a viral membrane. A model structure of the SARS-CoV-2 S protein is shown on the left panel. Two models for RBD/NTD, HR2 linker, HR2-TM, and CP are enlarged on the right panel. The three individual chains of the S protein are colored in yellow, gray, and white, respectively, while glycans are represented as red sticks. The palmitoylation sites of the S protein are highlighted in cyan. The phosphate, carbon, and hydrogen atoms of the viral membrane are colored in green, gray, and white, respectively. For clarity, water molecules and ions are omitted. All illustrations were created using visual molecular dynamics (VMD).[30]

Simulation Details

In this study, the CHARMM36(m) force field was used for proteins,[31] lipids,[32,33] and carbohydrates.[34−36] The TIP3P water model[37] was utilized along with a 0.15 M KCl solution. The total number of atoms is 2 343 394 (6VSB_1_1_1: 668 899 water molecules, 2128 K+, and 1857 Cl–); see CHARMM-GUI COVID-19 archive for other system information. The van der Waals interactions were smoothly switched off over 10–12 Å by a force-based switching function[38] and the long-range electrostatic interactions were calculated by the particle-mesh Ewald method[39] with a mesh size of ∼1 Å. All simulations were performed using the inputs generated by CHARMM-GUI[40] and GROMACS 2018.6[41] for both equilibration and production with the LINCS algorithm.[42] The temperature was maintained using a Nosé–Hoover temperature coupling method[43,44] with a τt of 1 ps; for pressure coupling (1 bar), semi-isotropic Parrinello–Rahman method[45,46] with a τp of 5 ps and a compressibility of 4.5 ×10–5 bar–1 was used. During the equilibration run, NVT (constant particle number, volume, and temperature) dynamics was first applied with a 1 fs time step for 250 ps. Subsequently, the NPT (constant particle number, pressure, and temperature) ensemble was applied with a 1 fs time step (for 2 ns) and with a 2 fs time step (for 18 ns). During the equilibration, positional and dihedral restraint potentials were applied, and their force constants were gradually reduced. The production run was performed with a 4 fs time step using the hydrogen mass repartitioning technique[47] without any restraint potential. Each system ran about 20 ns/day with 1024 CPU cores on NURION in the Korea Institute of Science and Technology Information.

Results and Discussion

Stalk of the S Protein Consists of Two Highly Flexible Linkers

We have performed 1.25 μs all-atom MD simulations of each of 16 models (i.e., a total of 20 μs) each containing about 2.3 million atoms (see Methods). Conformational analysis tools (CAT, http://www.md-simulations.de/CAT/) were used for high-throughput analysis of all simulation trajectories. The root-mean-square deviation (RMSD) time series in Figure S1 shows that the head region of the S protein (residue 1-1140) remains stable during the simulations with RMSD values of 4–5 Å in most systems. The stalk region, however, exhibits highly flexible motions at the HR2 linker and HR2-TM (see Movies S1–S2), which is consistent with S protein structures observed in cryo-ET.[13] To further understand the flexible stalk motion, the bending characteristics of the two linker regions, defined as θ1 and θ2 (Figure A), were quantified. Figure B shows the distributions of θ1 and θ2 for each model. Both M1 and M2 of θ1 show similar angle distributions centered at 150° (±15°) and 155° (±12°), respectively. The HR2-TM region, however, exhibited different bending motions. The M1 of HR2-TM shows a narrow distribution centered at 172° (±4°), whereas the M2 of HR2-TM shows a wide distribution centered at 155° (±14°). Twisting motions were also dependent on the HR2-TM model (Figure S5A). While both M1 and M2 of the HR2 linker show similar twist angle distributions (ϕ) centered at 66° (±46°) and 68° (±49°), respectively, the M1 of HR2-TM shows a narrow distribution centered at 99° (±18°) and the M2 of HR2-TM shows a wide distribution centered at 98° (±71°).

Figure 2

Bending motions of the S protein in a viral membrane. (A) Illustrative snapshot of the S protein and definition of angles/dihedrals measured to characterize the stalk motion. (B) Probability distribution of the bending angle for each HR2 linker and HR2-TM linker model. (C) Probability distributions of tilt angles for the resampled S protein structures compared to the experimental observation.[13] The tilting angle is defined by the principal axis of the S protein head and the membrane normal. These bending and twisting characteristics are consistent with the secondary structure analysis. The secondary structures of HR2 linker M1 and M2 models are mostly in coil conformations during the simulation although local folding and unfolding occur in both models (Figure S2). The secondary structure of the initial HR2-TM M1 model mainly consists of helical structures that are mostly retained during the simulation time. On the other hand, the secondary structure of M2 initially modeled with turns and bends shows low helicity in the range of L1200–K1215 (Figure S3). This indicates that the flexible motions of the HR2-TM linker are strongly influenced by the secondary structure and initial model (within the current simulation time). Although the secondary structures of CP domains are different in the two models, they have no significant effect on the motions of the stalk (Figure S4). To further characterize the bending motions, the Pearson correlation coefficient (r) was calculated for all combinations of the HR2 linker and HR2-TM models (Figure S5B). The r values of all cases range from −0.16 to 0.13, indicating that there is no correlation between the bending motions of the HR2 linker and HR2-TM; thus, each linker acts as an independent hinge. Although 16 × 1.25 μs MD simulations were performed, they do not cover all possible configurations of the S protein, especially with such flexible two linkers. To increase sampling, utilizing the independent θ1 and θ2 characteristics, S protein orientation was resampled based on three regions: head-HR1, HR1-HR2, and HR2-TM. First, 30 HR2-TM conformations were randomly extracted from each trajectory (excluding HR2-TM M1 models), and their TM domain was superimposed to the TM domain of the initial model to resample HR2 domain motions. Second, 30 HR1-HR2 conformations were randomly extracted and superimposed to each of the resampled HR2 domains. Finally, 30 head-HR1 conformations were randomly extracted and superimposed to the resampled HR1 domains. In summary, 27 000 configurations of the S protein on a viral membrane were generated. Figure C shows the tilt angle of the resampled configurations of the S protein using M1 and M2 for the HR2 linker and only M2 for HR-TM. The S protein can tilt by up to 90° toward the membrane and tilt angles around 48° are most probable. This tilt angle distribution agrees well with the experimental observation.[11,13] However, if M1 was used for HR2-TM, the tilt angle distribution of the S protein becomes narrow (Figure S6). This indicates that both M1 and M2 of the HR2 linker are reliable models, but for HR2-TM, M2 is more appropriate to represent S protein configurations. To further understand the contribution of each independent hinge motion on the tilt angle, the S protein was resampled separately with HR2-TM only and with HR1-HR2 only. In both cases, the resampled S protein shows a narrow-angle distribution compared to the experimental observation (Figure C), indicating that both linkers are necessary for the full tilting motions of the S protein observed in the experiment.

Flexibility of the Stalk May Facilitate S Protein Binding to ACE2

To explore the effect of flexible stalk motion on ACE2 binding, we performed the structural alignment of the S protein to ACE2. The RBD in the complex with full-length human ACE2 in the presence of the neutral amino acid transporter B0AT1 (PDB: 6M1710) was used for alignment. Fully independent bending and twisting motions of two stalk linkers allow us to increase the number of S protein samples. 125 head-HR1, HR1-HR2, and HR2-TM-CP conformations were separately extracted from each trajectory with a 10 ns interval. Each RBD of head-HR1 conformations was first superimposed to the RBD-ACE2-B0AT1 complex. Then, the HR1-HR2 conformations were superimposed to each of HR1 from the previous step. Finally, the HR2-TM-CP conformations were superimposed to each of HR2 from the previous step. Figure A shows one of the most probable configurations of the S protein–ACE2 complex. The tilting angle (θ) is defined in Figure , and the distance (d) is defined by an arc length between the centers of mass (COMs) of two TM domains. As shown in Figure B, d ranges from 240 to 350 Å and θ ranges from 30 to 60°. At the most probable configuration, d and θ are about 290 Å and 46°, respectively. Note that there is approximately one S protein per 1000 nm2 (316 Å × 316 Å) on the viral surface.[14] This sparse distribution of the S protein suggests that receptor binding can be promoted by enough space to have orientational degrees of freedom for the RBD. Moreover, it is reported that the most probable tilting angle of the prefusion state is about 40–50°[11,13] (also see Figure ). This tilting angle appears to maximize the accessibility of the receptor-binding motif to ACE2 (when the RBD is in an open conformation), which could account for the high infection rate of SARS-CoV-2.

Figure 3

S protein configurations competent for ACE2 binding. (A) Illustrative snapshot of the S protein–ACE2-B0AT1 complex. Three individual chains of the S protein are colored in yellow, gray, and white, and ACE2 and B0AT1 are represented as red and pink, respectively. (B) Distribution of the tilting angle (θ) as a function of the arc length (d) between the centers of mass (COMs) of TM domains.

Glycans Influence RBD and NTD Motions and Contribute to S Trimer Stability

To explore RBD and NTD motions, we measured two structural features (Figure A): the RBD–NTD distance (d) defined by the minimum distance between RBD and NTD, and the RBD orientation angle (θ) defined by two points at each end of the RBD and the third point on the center axis of the S trimer. One RBD (in both open and closed states) forms a U-shaped pocket with the NTD in the same monomer, which is occupied by the neighboring RBD in a closed state. Therefore, d estimates the RBD–NTD pocket size and θ estimates the extent of RBD opening. The time series of d and θ in 6VSB and 6VXX are shown in Figure B and S7, respectively. In cryo-EM structures available in the PDB, θ in the open-state RBD ranges from 134 to 153°, and θ in the close-state RBD ranges from 88 to 93°. The trajectories of fully glycosylated models completely cover the RBD orientation angles observed in cryo-EM structures and explore a wider range of the conformational space. In particular, θ of 6VSB_A ranges from 120 to 170°, indicating that the open-state RBD is much more flexible than the closed-state RBD, and it is consistent with the RMSD and root-mean-square fluctuation (RMSF) results shown in Figure S1. For both 6VSB_A and 6VSB_C, the simulations cover the NTD–RBD distances observed in the PDB cryo-EM structures. In 6VSB_B, the pocket formed by NTD and RBD (chain B) is empty due to the opening of the neighboring RBD (chain A), and consequently, the NTD moved close to the RBD.

Figure 4

Motions of the RBD and the NTD in fully glycosylated systems and non-glycosylated systems. (A) Illustration of the NTD–RBD distance (d) and the RBD orientation angle (θ). (B) Time series of d and θ in three chains of 6VSB. d is defined by the minimum distance between the RBD (N334 to P527) and the NTD (C15 to S305). θ is defined by three points corresponding to the (i) COM of L452 and L492, (ii) COM of N334, and (iii) COM of S1030. The ranges of d and θ observed in the available PDB S protein structures are shaded by gray regions. To investigate the impact of glycans on the transition between RBD open and closed states, we built and simulated non-glycosylated head-only systems (three replicates for 6VSB and 6VXX) by removing all N-/O-linked glycans and truncating the stalk. It is worth noting that the RBD in non-glycosylated 6VSB_A started to close at the very beginning of trajectories in all three replicas (Figure B). In the trajectory (colored in brown), θ decreases to 110°, which is about in the middle of open and closed states, and in the trajectory (colored in purple), the RBD reverted to opening at around 0.25 μs. Since the transition between RBD open and closed states is a complicated process involving the motions of multiple domains and attached glycans, it may require a much longer simulation time to capture the conformation in which the geometries of both the protein and glycan satisfy the requirement for the transition to occur. Nonetheless, dramatic transitions from RBD open to more closed states in non-glycosylated 6VSB_A indicate that the open-state RBD is unstable when the glycans are removed. This implies that glycosylation plays critical roles in the viral entry as the RBD needs to be open to interact with ACE2. We identified three N-glycans that have important roles in the conformational change of the RBD. They are attached to N165 and N234 on the NTD and N343 on the RBD. When RBD is open (6VSB_A), N165 and N234 glycans on the NTD of the neighboring chain (6VSB_B) are both located below the open-state RBD (Figure A), which holds the open-state RBD. Although both are near the open-state RBD, only N165 glycan has frequent contacts with the RBD (>85% of snapshots) (Figure C), and N234 glycan interacts with the open-state RBD occasionally (<5% of snapshots), which is different from the findings from Casalino et al’s study.[16] Such a difference could attribute to the initial models and simulation lengths. When the RBD is closed (all except 6VSB_A), the RBD forms a sandwich-like arrangement with two glycans. N165 glycan is located above the RBD and N234 glycan is located below the RBD (Figure B). Both glycans frequently interact with the closed-state RBD (Figure C), which makes the transition to an open conformation hard. The glycan attached to N343 on the RBD orients toward the solvent and hardly interacts with other domains when the RBD is open (6VSB_A) (Figure D). When the RBD is closed (6VSB_B) but the neighboring RBD (6VSB_A) is open, N343 glycan orients toward the pocket between the NTD and the RBD and interacts with N165 glycan, which makes the open to closed state transition of the neighboring RBD difficult. When the RBD is closed (all expect 6VSB_A and 6VSB_B) and the neighboring RBD is also closed, N343 glycan makes extensive interactions with the neighboring RBD (Figure E), which contributes to the stability of both RBDs in a closed conformation. The atom contacts between RBD N343 glycan and the neighboring RBD exist in more than 95% of snapshots when both RBDs are closed (Figure F). Given that the RBD is shown to constantly transit between open and closed states in the experiment, we propose that glycans serve as a clutch that temporarily holds the RBD in an open or closed conformation, which modulates the lifetime of both open and closed states.

Figure 5

Critical roles of N165, N234, and N343 glycans in the transition between RBD open and closed conformations. (A) The open-state RBD (6VSB_A, pale yellow) is above N165 (red) and N234 (cyan) glycans on the neighboring NTD. (B) The closed-state RBD (6VSB_B) is between N165 and N234 glycans. (C) Frequency of contacts between the RBD and two glycans (N165 and N234). (D) N343 (magenta) glycan on the open-state RBD (6VSB_A) is free. (E) N343 glycan on the closed-state RBD (6VSB_C) interacts with the neighboring closed-state RBD (6VSB_C). (F) Frequency of contacts between N343 glycan and the neighboring RBD. In addition, we calculated the accessible surface area (ASA) reduced due to the formation of the S trimer. For example, the ASA reduction for chain A was calculated by SA + SBC – SABC, where SA, SBC, and SABC are the ASA of chain A, chains BC-only complex, and chains ABC complex (i.e., S trimer), respectively. The ASA reduction due to trimer formation was split into the portion from protein only and the portion involving glycans. The trimer interface interaction involving glycans is about 30% when the entire S1 subunit is considered, and it increases to about 40% when only the RBD and the NTD are considered (Figure S8). This suggests that glycans make significant contributions to the stability of the S trimer, which is different from the common belief that protein–protein interactions are the only dominating factor to the stability of a protein multimer.

Impact of Glycan Shielding on Antibody Binding is Overestimated

Viruses evolve to minimize the immunogenicity by coating the exposed viral proteins with non-immunogenic or weakly immunogenic glycans. It is commonly believed that the glycans on the viral envelope shield viruses from the host immune system. To get an impression of such glycan shielding, we aligned the S head in each trajectory snapshot and the glycan distributions are shown in Figure A. Most glycans are very flexible and they move around in a wide range of space, which covers most of the trimer surface. However, the role of glycan is not limited to shielding. During the past decade, many glycan-dependent HIV neutralizing antibodies have been discovered and extensively studied, which target both the envelope protein and glycans.[48−50] In the cryo-EM structure of the S trimer in the complex with the S309 antibody (PDB ids: 6WPS and 6WPT(51)), S309 interacts with the glycan attached to N343.

Figure 6

Clash between glycans and superimposed antibodies. (A) Distribution of glycans when S head structures in multiple snapshots are aligned (left). Four epitopes targeted by neutralizing antibodies are shown in different colors (right). (B) H11-D4 epitope (green) in the RBD (left: open, right: closed), N165 glycan (red) on the neighboring NTD, and N343 glycan (violet) on the neighboring RBD. (C) Distributions of glycan heavy atom numbers in clash (NCLASH) with the superposed H11-D4 antibody. (D) S309 epitope (yellow) and N331 (orange) and N343 (violet) glycans on the RBD (left: open, right: closed), N122 (blue) and N165 (red) glycans on the neighboring NTD. (E) Distributions of NCLASH with the superimposed S309 antibody. To explore the role of glycans in antibody binding, we used TM-align[52] to superimpose the RBD in RBD–antibody complex structures in the PDB onto the RBD in each simulation snapshot and calculated the number of glycan heavy atoms that have clashes with the antibodies (NCLASH, Figure S9). In this work, we discuss four RBD-targeting antibodies, namely, B38, CR3022, H11-D4, and S309 (Figure A). The epitopes of B38 and CR3022 are irrelevant to glycans. B38 binds to the same region of RBD as ACE2 does. This epitope is fully exposed in the open-state RBD, but when the RBD is closed, the B38 epitope is masked by the neighboring RBD in either the open or closed state (Figure S10A). The epitope of CR3022 is on the inner surface of the RBD, and it is only accessible when all three RBD are open (Figure S10B). The epitope of H11-D4 is next to the epitope of B38, and it is also fully exposed when the RBD is open. When the RBD is closed, the glycans attached to N165 and N343 on the neighboring chain are located near this epitope (Figure B and S10C). As shown in the distribution of NCLASH with H11-D4 (Figure C), N343 glycan rarely interferes with the antibody, but N165 glycan has high probabilities to make severe clashes with the antibody when the RBD is closed. For comparison, we also aligned a nanobody to the RBD in each simulation snapshot. Though N165 glycan still frequently clashes with the nanobody, the frequency of severe clashes is much lower (Figure S11); thus, there is a chance that a nanobody can bind to this epitope as shown in PDB id 6Z43. The epitope of S309 is surrounded by four glycans. Two of them are attached to N331 and N343 on the same RBD, and the other two are attached to N122 and N165 on the neighboring NTD (Figure D and S10D). The distributions of NCLASH with S309 are shown in Figure E. N165 and N331 glycans rarely interfere with the S309 antibody in both open- and closed-state RBD. N343 glycan has minor clashes with the antibody in most snapshots, and such minor clashes are not sufficient to block antibody binding, as these clashes can be easily removed with small changes in glycan conformation and orientation. The antibody–glycan interactions can also contribute to antibody binding, which is observed in the cryo-EM structures. In more than half of all snapshots, N122 glycan has severe clashes with the antibody when the RBD is open, but it moves away from the superposition of the antibody in the remaining snapshots. This suggests that the S309 epitope in the open-state RBD is blocked by N122 glycan in more than half of the simulation time, but it is still accessible to the antibody. As a comparison, we calculated the ASA of S309 and H11-D4 epitopes using a probe radius of 7.2 Å that is commonly used to approximate the size of hypervariable loops of the antibody and compared the epitope ASA with and without glycans (Figure S12). For H11-D4, we observed significant decreases in the epitope ASA in all chains except 6VSB_A and 6VSB_C, which is generally consistent with the frequency of clashes between glycans and antibodies. However, for S309, the epitope ASA decreases significantly in all chains when glycans are present. This is contradictory to the result that N343 glycans occasionally have only slight clashes with the superimposed antibody when the RBD is closed (all except 6VSB_A). In addition, the PDB structures of the S trimer in the complex with S309 show that N343 glycan interacts with the antibody. In the calculation of ASA, a point on the surface is considered inaccessible even if the probe sphere has a very tiny clash with the molecule. However, the shape of the antibody is not a sphere and it can have narrow-shaped regions that extend deeply into the pocket in the epitope. A glycan like the one attached to N343 can reduce the epitope ASA even though it may contribute to antibody binding. Therefore, in some cases, a simple comparison of protein ASA with and without glycans is likely to overestimate the impact of glycan shielding.

Conclusions

In this work, we present multiple μs-long all-atom MD simulations of the fully glycosylated full-length S protein in a viral membrane. Our MD simulations reveal the overall shape of the S protein, and its orientation on the membrane surface is determined by a highly flexible stalk composed of two independent joints. Importantly, S protein models from our simulations allow us to predict the possible configurations of the S protein–ACE2-B0AT1 complex with allowable orientations and distances between two S proteins on the membrane surface. The simulation here also provides insights into how glycans influence the open/closed state change of the RBD and the antibody binding to RBD epitopes. We identify glycans attached to multiple glycosylation sites that stabilize the open and/or closed states of the RBD by making a high energetic barrier between the open–closed transition. The simulation of non-glycosylated systems shows that the open-state RBD becomes unstable when glycans are removed and the transition to the closed state occurred at the early stage of the simulation. By aligning the RBD–antibody complex structures to the simulation trajectories, we reveal that the impact of glycan shielding is overestimated by a simple ASA analysis. More importantly, glycans not only serve as shields for immune evasion but also contributes to antibody binding. Our work sheds light on the full structure and dynamics of the S protein and we hope our work to be useful for the development of vaccines and antiviral drugs.

45 in total

1. Glycan Reader: automated sugar identification and simulation preparation for carbohydrates and glycoproteins.

Authors: Sunhwan Jo; Kevin C Song; Heather Desaire; Alexander D MacKerell; Wonpil Im
Journal: J Comput Chem Date: 2011-08-03 Impact factor: 3.376

2. Improving the CHARMM force field for polyunsaturated fatty acid chains.

Authors: Jeffery B Klauda; Viviana Monje; Taehoon Kim; Wonpil Im
Journal: J Phys Chem B Date: 2012-07-03 Impact factor: 2.991

3. Long-Time-Step Molecular Dynamics through Hydrogen Mass Repartitioning.

Authors: Chad W Hopkins; Scott Le Grand; Ross C Walker; Adrian E Roitberg
Journal: J Chem Theory Comput Date: 2015-03-30 Impact factor: 6.006

4. Glycan Reader is improved to recognize most sugar types and chemical modifications in the Protein Data Bank.

Authors: Sang-Jun Park; Jumin Lee; Dhilon S Patel; Hongjing Ma; Hui Sun Lee; Sunhwan Jo; Wonpil Im
Journal: Bioinformatics Date: 2017-10-01 Impact factor: 6.937

5. ISOLDE: a physically realistic environment for model building into low-resolution electron-density maps.

Authors: Tristan Ian Croll
Journal: Acta Crystallogr D Struct Biol Date: 2018-04-11 Impact factor: 7.652

6. Deducing the N- and O-glycosylation profile of the spike protein of novel coronavirus SARS-CoV-2.

Authors: Asif Shajahan; Nitin T Supekar; Anne S Gleinich; Parastoo Azadi
Journal: Glycobiology Date: 2020-12-09 Impact factor: 4.313

7. Structural basis for the recognition of SARS-CoV-2 by full-length human ACE2.

Authors: Yuanyuan Zhang; Yaning Li; Renhong Yan; Lu Xia; Yingying Guo; Qiang Zhou
Journal: Science Date: 2020-03-04 Impact factor: 47.728

8. Potent Neutralizing Antibodies against SARS-CoV-2 Identified by High-Throughput Single-Cell Sequencing of Convalescent Patients' B Cells.

Authors: Yunlong Cao; Bin Su; Xianghua Guo; Wenjie Sun; Yongqiang Deng; Linlin Bao; Qinyu Zhu; Xu Zhang; Yinghui Zheng; Chenyang Geng; Xiaoran Chai; Runsheng He; Xiaofeng Li; Qi Lv; Hua Zhu; Wei Deng; Yanfeng Xu; Yanjun Wang; Luxin Qiao; Yafang Tan; Liyang Song; Guopeng Wang; Xiaoxia Du; Ning Gao; Jiangning Liu; Junyu Xiao; Xiao-Dong Su; Zongmin Du; Yingmei Feng; Chuan Qin; Chengfeng Qin; Ronghua Jin; X Sunney Xie
Journal: Cell Date: 2020-05-18 Impact factor: 41.582

9. A highly conserved cryptic epitope in the receptor binding domains of SARS-CoV-2 and SARS-CoV.

Authors: Meng Yuan; Nicholas C Wu; Xueyong Zhu; Chang-Chun D Lee; Ray T Y So; Huibin Lv; Chris K P Mok; Ian A Wilson
Journal: Science Date: 2020-04-03 Impact factor: 47.728

10. Beyond Shielding: The Roles of Glycans in the SARS-CoV-2 Spike Protein.

Authors: Lorenzo Casalino; Zied Gaieb; Jory A Goldsmith; Christy K Hjorth; Abigail C Dommer; Aoife M Harbison; Carl A Fogarty; Emilia P Barros; Bryn C Taylor; Jason S McLellan; Elisa Fadda; Rommie E Amaro
Journal: ACS Cent Sci Date: 2020-09-23 Impact factor: 14.553

17 in total

1. Inhibition of Protein N-Glycosylation Blocks SARS-CoV-2 Infection.

Authors: Aitor Casas-Sanchez; Alessandra Romero-Ramirez; Eleanor Hargreaves; Cameron C Ellis; Brian I Grajeda; Igor L Estevao; Edward I Patterson; Grant L Hughes; Igor C Almeida; Tobias Zech; Álvaro Acosta-Serrano
Journal: mBio Date: 2022-02-15 Impact factor: 7.867

2. Posttranslational modifications optimize the ability of SARS-CoV-2 spike for effective interaction with host cell receptors.

Authors: Karan Kapoor; Tianle Chen; Emad Tajkhorshid
Journal: Proc Natl Acad Sci U S A Date: 2022-06-23 Impact factor: 12.779

3. Peptide mediated colorimetric detection of SARS-CoV-2 using gold nanoparticles: a molecular dynamics simulation study.

Authors: Nitu Verma; Yogesh Badhe; Rakesh Gupta; Auhin Kumar Maparu; Beena Rai
Journal: J Mol Model Date: 2022-06-24 Impact factor: 2.172

4. Unique Aggregation of Retroviral Particles Pseudotyped with the Delta Variant SARS-CoV-2 Spike Protein.

Authors: Jennifer D Petersen; Jianming Lu; Wendy Fitzgerald; Fei Zhou; Paul S Blank; Doreen Matthies; Joshua Zimmerberg
Journal: Viruses Date: 2022-05-11 Impact factor: 5.818

5. An Electrostatically-steered Conformational Selection Mechanism Promotes SARS-CoV-2 Spike Protein Variation.

Authors: Marija Sorokina; Jaydeep Belapure; Christian Tüting; Reinhard Paschke; Ioannis Papasotiriou; João P G L M Rodrigues; Panagiotis L Kastritis
Journal: J Mol Biol Date: 2022-05-17 Impact factor: 6.151

6. The inherent flexibility of receptor binding domains in SARS-CoV-2 spike protein.

Authors: Hisham M Dokainish; Suyong Re; Takaharu Mori; Chigusa Kobayashi; Jaewoon Jung; Yuji Sugita
Journal: Elife Date: 2022-03-24 Impact factor: 8.140

7. Fine-tuning the spike: role of the nature and topology of the glycan shield in the structure and dynamics of the SARS-CoV-2 S.

Authors: Aoife M Harbison; Carl A Fogarty; Toan K Phung; Akash Satheesan; Benjamin L Schulz; Elisa Fadda
Journal: Chem Sci Date: 2021-11-25 Impact factor: 9.825

Review 8. A Deadly Embrace: Hemagglutination Mediated by SARS-CoV-2 Spike Protein at Its 22 N-Glycosylation Sites, Red Blood Cell Surface Sialoglycoproteins, and Antibody.

Authors: David E Scheim
Journal: Int J Mol Sci Date: 2022-02-25 Impact factor: 5.923

9. 35B5 antibody potently neutralizes SARS-CoV-2 Omicron by disrupting the N-glycan switch via a conserved spike epitope.

Authors: Xiaofei Wang; Xiangyu Chen; Jiaxing Tan; Shuai Yue; Runhong Zhou; Yan Xu; Yao Lin; Yang Yang; Yan Zhou; Kai Deng; Zhiwei Chen; Lilin Ye; Yongqun Zhu
Journal: Cell Host Microbe Date: 2022-03-29 Impact factor: 31.316

10. The Delta variant SARS-CoV-2 spike protein uniquely promotes aggregation of pseudotyped viral particles.

Authors: Jennifer D Petersen; Jianming Lu; Wendy Fitzgerald; Fei Zhou; Paul S Blank; Doreen Matthies; Joshua Zimmerberg
Journal: bioRxiv Date: 2022-04-08