Yeol Kyo Choi1, Yiwei Cao1, Martin Frank2, Hyeonuk Woo3, Sang-Jun Park1, Min Sun Yeom4, Tristan I Croll5, Chaok Seok3, Wonpil Im1. 1. Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States. 2. Biognos AB, Box 8963, 40274 Göteborg, Sweden. 3. Department of Chemistry, Seoul National University, Seoul 08826, Republic of Korea. 4. Korean Institute of Science and Technology Information, Daejeon 34141, Republic of Korea. 5. Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge CB2 0XY, U.K.
Abstract
The spike (S) protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) mediates host cell entry by binding to angiotensin-converting enzyme 2 (ACE2) and is considered the major target for drug and vaccine development. We previously built fully glycosylated full-length SARS-CoV-2 S protein models in a viral membrane including both open and closed conformations of the receptor-binding domain (RBD) and different templates for the stalk region. In this work, multiple μs-long all-atom molecular dynamics simulations were performed to provide deeper insights into the structure and dynamics of S protein and glycan functions. Our simulations reveal that the highly flexible stalk is composed of two independent joints and most probable S protein orientations are competent for ACE2 binding. We identify multiple glycans stabilizing the open and/or closed states of the RBD and demonstrate that the exposure of antibody epitopes can be captured by detailed antibody-glycan clash analysis instead of commonly used accessible surface area analysis that tends to overestimate the impact of glycan shielding and neglect possible detailed interactions between glycan and antibodies. Overall, our observations offer structural and dynamic insights into the SARS-CoV-2 S protein and potentialize for guiding the design of effective antiviral therapeutics.
The spike (S) protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) mediates host cell entry by binding to angiotensin-converting enzyme 2 (ACE2) and is considered the major target for drug and vaccine development. We previously built fully glycosylated full-length SARS-CoV-2 S protein models in a viral membrane including both open and closed conformations of the receptor-binding domain (RBD) and different templates for the stalk region. In this work, multiple μs-long all-atom molecular dynamics simulations were performed to provide deeper insights into the structure and dynamics of S protein and glycan functions. Our simulations reveal that the highly flexible stalk is composed of two independent joints and most probable S protein orientations are competent for ACE2 binding. We identify multiple glycans stabilizing the open and/or closed states of the RBD and demonstrate that the exposure of antibody epitopes can be captured by detailed antibody-glycan clash analysis instead of commonly used accessible surface area analysis that tends to overestimate the impact of glycan shielding and neglect possible detailed interactions between glycan and antibodies. Overall, our observations offer structural and dynamic insights into the SARS-CoV-2 S protein and potentialize for guiding the design of effective antiviral therapeutics.
The
outbreak of Coronavirus disease 2019 (COVID-19) caused by severe
acute respiratory syndrome coronavirus 2 (SARS-CoV-2) presents a tremendous
threat to global public health. It caused over 35 million confirmed
cases and more than 1 million deaths as of October, 2020. Due to the
unavailability of antiviral medicines or approved vaccines, the current
treatment strategy is supportive care to relieve symptoms and isolation
of infected individuals to reduce transmission, which has placed a
huge burden on the public healthcare system and led to massive social
and economic distress.SARS-CoV-2 is an enveloped virus with
a positive-sense single-stranded
RNA genome.[1] The spike (S) protein anchored
in the viral envelope is a class I fusion protein that mediates receptor
binding and host cell entry by interacting with human angiotensin-converting
enzyme 2 (ACE2),[2−4] and it is also the target of a variety of neutralizing
antibodies.[5−8] The S protein is a homotrimer and each monomer has two subunits
(S1 and S2) separated by a cleavage site that is recognized by host
proteases.[9] A number of recently published
structural studies using cryogenic electron microscopy (cryo-EM) have
provided a good understanding of the S protein structure at near-atomic
resolution.[4,10−12] The S1 subunit
responsible for receptor binding is composed of the signal peptide
(SP), N terminal domain (NTD), and receptor-binding domain (RBD),
and the S2 subunit responsible for membrane fusion is composed of
the fusion peptide (FP), two heptad repeats (HR1 and HR2), transmembrane
domain (TM), and cytoplasmic domain (CP). The three RBDs on the top
of the S protein head are conformationally variable. In closed conformations,
all three RBDs lay flat with the receptor-binding motif occluded by
the RBDs on the neighboring monomers. In open conformations, one or
multiple RBDs lift up and expose the receptor-binding motif(s).Although the cryo-EM structures of the S protein have provided
crucial information about its overall structure, highly flexible protein
regions such as loops and stalk still remain unresolved. Molecular
dynamics (MD) simulations provide molecular-level insights into the
underlying mechanisms of biological functions that are difficult to
elucidate only with experiments. Recently, cryo-electron tomography
(cryo-ET) and MD simulations have been used to explore the conformational
variability of the S protein stalk that gives the head orientational
freedom and allows the S protein to scan the host cell surface.[13,14] However, it still remains unclear what structural freedoms in the
stalk portion are determinant to the overall shape of the S protein
and its orientation, and how they affect the binding to ACE2. In addition,
MD simulations along with accessible surface area (ASA) calculations
have been used to estimate the impact of glycan shielding on the exposure
of antibody epitopes.[15] Mutations of two
glycosylation sites have been performed to study the role of two N-linked
glycans in stabilizing an RBD open conformation.[16] Further investigation is still required to evaluate whether
the ASA difference between glycosylated and non-glycosylated structures
truly reflects the impact of glycan shielding on antibody binding
and whether glycans have more functional roles than stabilizing the
open-state RBD.In this work, we present all-atom MD simulations
of the fully glycosylated
full-length S protein in a viral bilayer, and multiple μs-long
trajectories were generated for the RBD in open and closed states
and the S stalk built from different models. We also performed multiple
μs-long simulations of non-glycosylated S head-only systems.
Our results provide deeper insights into the functional roles of glycans
that not only provide shielding for immune evasion but also contribute
to the trimer stability and transition of RBD open and close states.
Moreover, our simulations give insights into the essential structural
roles of highly flexible stalk conformations in the S protein binding
to ACE2.
Methods
Model Structures of the
Fully Glycosylated
Full-Length SARS-CoV-2 S Protein
An illustrative snapshot
of a fully glycosylated full-length S protein structure in a viral
membrane is shown in Figure . When there are many missing residues and domains, the initial
models for MD simulations need to be carefully built and validated
against available experimental data. We have built the models using
GALAXY protein modeling suite[17−19] for missing residues and domains,
ISOLDE[20] for refining initial models against
experimental density maps, and CHARMM-GUI Glycan Reader and Modeler,[21−23] and Membrane Builder[24−26] for glycan and membrane building. The head of the
S trimer was built based on cryo-EM structures (PDB ids: 6VSB(4) and 6VXX(12)). All three chains of 6VXX have the
RBD in a closed conformation. One chain of 6VSB (A chain) has the
RBD in an open conformation and the other two chains have the RBD
in a closed conformation. Two models were selected for each of HR2
linker, HR2-TM, and CP, resulting in a total of 16 structures after
the domain by domain assembly. The glycan sequences selected for 22
N-linked and 1 O-linked glycosylation sites of each monomer were based
on the mass spectrometry data.[27,28] The detailed model
generation is described in ref (29). The model name follows the model numbers used for HR2
linker, HR2-TM, and CP structures. For example, “6VSB_1_2_1”
represents a model based on 6VSB with HR2 linker model 1 (M1), HR2-TM
model 2 (M2), and CP model 1 (M1). All 16 S protein simulation systems
and trajectories are available in CHARMM-GUI COVID-19 archive (http://www.charmm-gui.org/docs/archive/covid19).
Figure 1
Model structure of the fully glycosylated full-length SARS-CoV-2
S protein in a viral membrane. A model structure of the SARS-CoV-2
S protein is shown on the left panel. Two models for RBD/NTD, HR2
linker, HR2-TM, and CP are enlarged on the right panel. The three
individual chains of the S protein are colored in yellow, gray, and
white, respectively, while glycans are represented as red sticks.
The palmitoylation sites of the S protein are highlighted in cyan.
The phosphate, carbon, and hydrogen atoms of the viral membrane are
colored in green, gray, and white, respectively. For clarity, water
molecules and ions are omitted. All illustrations were created using
visual molecular dynamics (VMD).[30]
Model structure of the fully glycosylated full-length SARS-CoV-2
S protein in a viral membrane. A model structure of the SARS-CoV-2
S protein is shown on the left panel. Two models for RBD/NTD, HR2
linker, HR2-TM, and CP are enlarged on the right panel. The three
individual chains of the S protein are colored in yellow, gray, and
white, respectively, while glycans are represented as red sticks.
The palmitoylation sites of the S protein are highlighted in cyan.
The phosphate, carbon, and hydrogen atoms of the viral membrane are
colored in green, gray, and white, respectively. For clarity, water
molecules and ions are omitted. All illustrations were created using
visual molecular dynamics (VMD).[30]
Simulation Details
In this study,
the CHARMM36(m) force field was used for proteins,[31] lipids,[32,33] and carbohydrates.[34−36] The TIP3P water model[37] was utilized
along with a 0.15 M KCl solution. The total number of atoms is 2 343 394
(6VSB_1_1_1: 668 899 water molecules, 2128 K+, and
1857 Cl–); see CHARMM-GUI COVID-19 archive for other
system information. The van der Waals interactions were smoothly switched
off over 10–12 Å by a force-based switching function[38] and the long-range electrostatic interactions
were calculated by the particle-mesh Ewald method[39] with a mesh size of ∼1 Å.All simulations
were performed using the inputs generated by CHARMM-GUI[40] and GROMACS 2018.6[41] for both equilibration and production with the LINCS algorithm.[42] The temperature was maintained using a Nosé–Hoover
temperature coupling method[43,44] with a τt of 1 ps; for pressure coupling (1 bar), semi-isotropic Parrinello–Rahman
method[45,46] with a τp of 5 ps and a
compressibility of 4.5 ×10–5 bar–1 was used. During the equilibration run, NVT (constant particle number,
volume, and temperature) dynamics was first applied with a 1 fs time
step for 250 ps. Subsequently, the NPT (constant particle number,
pressure, and temperature) ensemble was applied with a 1 fs time step
(for 2 ns) and with a 2 fs time step (for 18 ns). During the equilibration,
positional and dihedral restraint potentials were applied, and their
force constants were gradually reduced. The production run was performed
with a 4 fs time step using the hydrogen mass repartitioning technique[47] without any restraint potential. Each system
ran about 20 ns/day with 1024 CPU cores on NURION in the Korea Institute
of Science and Technology Information.
Results
and Discussion
Stalk of the S Protein
Consists of Two Highly
Flexible Linkers
We have performed 1.25 μs all-atom
MD simulations of each of 16 models (i.e., a total of 20 μs)
each containing about 2.3 million atoms (see Methods). Conformational analysis tools (CAT, http://www.md-simulations.de/CAT/) were used for high-throughput analysis of all simulation trajectories.
The root-mean-square deviation (RMSD) time series in Figure S1 shows that the head region of the S protein (residue
1-1140) remains stable during the simulations with RMSD values of
4–5 Å in most systems. The stalk region, however, exhibits
highly flexible motions at the HR2 linker and HR2-TM (see Movies S1–S2), which is consistent with
S protein structures observed in cryo-ET.[13] To further understand the flexible stalk motion, the bending characteristics
of the two linker regions, defined as θ1 and θ2 (Figure A),
were quantified. Figure B shows the distributions of θ1 and θ2 for each model. Both M1 and M2 of θ1 show
similar angle distributions centered at 150° (±15°)
and 155° (±12°), respectively. The HR2-TM region, however,
exhibited different bending motions. The M1 of HR2-TM shows a narrow
distribution centered at 172° (±4°), whereas the M2
of HR2-TM shows a wide distribution centered at 155° (±14°).
Twisting motions were also dependent on the HR2-TM model (Figure S5A). While both M1 and M2 of the HR2
linker show similar twist angle distributions (ϕ) centered at
66° (±46°) and 68° (±49°), respectively,
the M1 of HR2-TM shows a narrow distribution centered at 99°
(±18°) and the M2 of HR2-TM shows a wide distribution centered
at 98° (±71°).
Figure 2
Bending motions of the S protein in a viral
membrane. (A) Illustrative
snapshot of the S protein and definition of angles/dihedrals measured
to characterize the stalk motion. (B) Probability distribution of
the bending angle for each HR2 linker and HR2-TM linker model. (C)
Probability distributions of tilt angles for the resampled S protein
structures compared to the experimental observation.[13] The tilting angle is defined by the principal axis of the
S protein head and the membrane normal.
Bending motions of the S protein in a viral
membrane. (A) Illustrative
snapshot of the S protein and definition of angles/dihedrals measured
to characterize the stalk motion. (B) Probability distribution of
the bending angle for each HR2 linker and HR2-TM linker model. (C)
Probability distributions of tilt angles for the resampled S protein
structures compared to the experimental observation.[13] The tilting angle is defined by the principal axis of the
S protein head and the membrane normal.These bending and twisting characteristics are consistent with
the secondary structure analysis. The secondary structures of HR2
linker M1 and M2 models are mostly in coil conformations during the
simulation although local folding and unfolding occur in both models
(Figure S2). The secondary structure of
the initial HR2-TM M1 model mainly consists of helical structures
that are mostly retained during the simulation time. On the other
hand, the secondary structure of M2 initially modeled with turns and
bends shows low helicity in the range of L1200–K1215 (Figure S3). This indicates that the flexible
motions of the HR2-TM linker are strongly influenced by the secondary
structure and initial model (within the current simulation time).
Although the secondary structures of CP domains are different in the
two models, they have no significant effect on the motions of the
stalk (Figure S4). To further characterize
the bending motions, the Pearson correlation coefficient (r) was calculated
for all combinations of the HR2 linker and HR2-TM models (Figure S5B). The r values of all cases range
from −0.16 to 0.13, indicating that there is no correlation
between the bending motions of the HR2 linker and HR2-TM; thus, each
linker acts as an independent hinge.Although 16 × 1.25
μs MD simulations were performed,
they do not cover all possible configurations of the S protein, especially
with such flexible two linkers. To increase sampling, utilizing the
independent θ1 and θ2 characteristics,
S protein orientation was resampled based on three regions: head-HR1,
HR1-HR2, and HR2-TM. First, 30 HR2-TM conformations were randomly
extracted from each trajectory (excluding HR2-TM M1 models), and their
TM domain was superimposed to the TM domain of the initial model to
resample HR2 domain motions. Second, 30 HR1-HR2 conformations were
randomly extracted and superimposed to each of the resampled HR2 domains.
Finally, 30 head-HR1 conformations were randomly extracted and superimposed
to the resampled HR1 domains. In summary, 27 000 configurations
of the S protein on a viral membrane were generated. Figure C shows the tilt angle of the
resampled configurations of the S protein using M1 and M2 for the
HR2 linker and only M2 for HR-TM. The S protein can tilt by up to
90° toward the membrane and tilt angles around 48° are most
probable. This tilt angle distribution agrees well with the experimental
observation.[11,13] However, if M1 was used for HR2-TM,
the tilt angle distribution of the S protein becomes narrow (Figure S6). This indicates that both M1 and M2
of the HR2 linker are reliable models, but for HR2-TM, M2 is more
appropriate to represent S protein configurations. To further understand
the contribution of each independent hinge motion on the tilt angle,
the S protein was resampled separately with HR2-TM only and with HR1-HR2
only. In both cases, the resampled S protein shows a narrow-angle
distribution compared to the experimental observation (Figure C), indicating that both linkers
are necessary for the full tilting motions of the S protein observed
in the experiment.
Flexibility of the Stalk
May Facilitate S
Protein Binding to ACE2
To explore the effect of flexible
stalk motion on ACE2 binding, we performed the structural alignment
of the S protein to ACE2. The RBD in the complex with full-length
humanACE2 in the presence of the neutral amino acid transporter B0AT1 (PDB: 6M1710) was used for alignment. Fully independent bending
and twisting motions of two stalk linkers allow us to increase the
number of S protein samples. 125 head-HR1, HR1-HR2, and HR2-TM-CP
conformations were separately extracted from each trajectory with
a 10 ns interval. Each RBD of head-HR1 conformations was first superimposed
to the RBD-ACE2-B0AT1 complex. Then, the HR1-HR2 conformations
were superimposed to each of HR1 from the previous step. Finally,
the HR2-TM-CP conformations were superimposed to each of HR2 from
the previous step. Figure A shows one of the most probable configurations of the S protein–ACE2
complex. The tilting angle (θ) is defined in Figure , and the distance (d) is defined by an arc length between the centers of mass
(COMs) of two TM domains. As shown in Figure B, d ranges from 240 to
350 Å and θ ranges from 30 to 60°. At the most probable
configuration, d and θ are about 290 Å
and 46°, respectively. Note that there is approximately one S
protein per 1000 nm2 (316 Å × 316 Å) on
the viral surface.[14] This sparse distribution
of the S protein suggests that receptor binding can be promoted by
enough space to have orientational degrees of freedom for the RBD.
Moreover, it is reported that the most probable tilting angle of the
prefusion state is about 40–50°[11,13] (also see Figure ). This tilting angle appears to maximize the accessibility of the
receptor-binding motif to ACE2 (when the RBD is in an open conformation),
which could account for the high infection rate of SARS-CoV-2.
Figure 3
S protein configurations
competent for ACE2 binding. (A) Illustrative
snapshot of the S protein–ACE2-B0AT1 complex. Three
individual chains of the S protein are colored in yellow, gray, and
white, and ACE2 and B0AT1 are represented as red and pink,
respectively. (B) Distribution of the tilting angle (θ) as a
function of the arc length (d) between the centers
of mass (COMs) of TM domains.
S protein configurations
competent for ACE2 binding. (A) Illustrative
snapshot of the S protein–ACE2-B0AT1 complex. Three
individual chains of the S protein are colored in yellow, gray, and
white, and ACE2 and B0AT1 are represented as red and pink,
respectively. (B) Distribution of the tilting angle (θ) as a
function of the arc length (d) between the centers
of mass (COMs) of TM domains.
Glycans Influence RBD and NTD Motions and
Contribute to S Trimer Stability
To explore RBD and NTD motions,
we measured two structural features (Figure A): the RBD–NTD distance (d) defined by the minimum distance between RBD and NTD,
and the RBD orientation angle (θ) defined by two points at each
end of the RBD and the third point on the center axis of the S trimer.
One RBD (in both open and closed states) forms a U-shaped pocket with
the NTD in the same monomer, which is occupied by the neighboring
RBD in a closed state. Therefore, d estimates the
RBD–NTD pocket size and θ estimates the extent of RBD
opening. The time series of d and θ in 6VSB
and 6VXX are shown in Figure B and S7, respectively. In cryo-EM
structures available in the PDB, θ in the open-state RBD ranges
from 134 to 153°, and θ in the close-state RBD ranges from
88 to 93°. The trajectories of fully glycosylated models completely
cover the RBD orientation angles observed in cryo-EM structures and
explore a wider range of the conformational space. In particular,
θ of 6VSB_A ranges from 120 to 170°, indicating that the
open-state RBD is much more flexible than the closed-state RBD, and
it is consistent with the RMSD and root-mean-square fluctuation (RMSF)
results shown in Figure S1. For both 6VSB_A
and 6VSB_C, the simulations cover the NTD–RBD distances observed
in the PDB cryo-EM structures. In 6VSB_B, the pocket formed by NTD
and RBD (chain B) is empty due to the opening of the neighboring RBD
(chain A), and consequently, the NTD moved close to the RBD.
Figure 4
Motions of
the RBD and the NTD in fully glycosylated systems and
non-glycosylated systems. (A) Illustration of the NTD–RBD distance
(d) and the RBD orientation angle (θ). (B)
Time series of d and θ in three chains of 6VSB. d is defined by the minimum distance between the RBD (N334
to P527) and the NTD (C15 to S305). θ is defined by three points
corresponding to the (i) COM of L452 and L492, (ii) COM of N334, and
(iii) COM of S1030. The ranges of d and θ observed
in the available PDB S protein structures are shaded by gray regions.
Motions of
the RBD and the NTD in fully glycosylated systems and
non-glycosylated systems. (A) Illustration of the NTD–RBD distance
(d) and the RBD orientation angle (θ). (B)
Time series of d and θ in three chains of 6VSB. d is defined by the minimum distance between the RBD (N334
to P527) and the NTD (C15 to S305). θ is defined by three points
corresponding to the (i) COM of L452 and L492, (ii) COM of N334, and
(iii) COM of S1030. The ranges of d and θ observed
in the available PDB S protein structures are shaded by gray regions.To investigate the impact of glycans on the transition
between
RBD open and closed states, we built and simulated non-glycosylated
head-only systems (three replicates for 6VSB and 6VXX) by removing
all N-/O-linked glycans and truncating the stalk. It is worth noting
that the RBD in non-glycosylated 6VSB_A started to close at the very
beginning of trajectories in all three replicas (Figure B). In the trajectory (colored
in brown), θ decreases to 110°, which is about in the middle
of open and closed states, and in the trajectory (colored in purple),
the RBD reverted to opening at around 0.25 μs. Since the transition
between RBD open and closed states is a complicated process involving
the motions of multiple domains and attached glycans, it may require
a much longer simulation time to capture the conformation in which
the geometries of both the protein and glycansatisfy the requirement
for the transition to occur. Nonetheless, dramatic transitions from
RBD open to more closed states in non-glycosylated 6VSB_A indicate
that the open-state RBD is unstable when the glycans are removed.
This implies that glycosylation plays critical roles in the viral
entry as the RBD needs to be open to interact with ACE2.We identified three N-glycans that have important
roles in the conformational change of the RBD. They are attached to
N165 and N234 on the NTD and N343 on the RBD. When RBD is open (6VSB_A),
N165 and N234 glycans on the NTD of the neighboring chain (6VSB_B)
are both located below the open-state RBD (Figure A), which holds the open-state RBD. Although
both are near the open-state RBD, only N165glycan has frequent contacts
with the RBD (>85% of snapshots) (Figure C), and N234 glycan interacts with the open-state
RBD occasionally (<5% of snapshots), which is different from the
findings from Casalino et al’s study.[16] Such a difference could attribute to the initial models and simulation
lengths. When the RBD is closed (all except 6VSB_A), the RBD forms
a sandwich-like arrangement with two glycans. N165glycan is located
above the RBD and N234 glycan is located below the RBD (Figure B). Both glycans frequently
interact with the closed-state RBD (Figure C), which makes the transition to an open
conformation hard. The glycan attached to N343 on the RBD orients
toward the solvent and hardly interacts with other domains when the
RBD is open (6VSB_A) (Figure D). When the RBD is closed (6VSB_B) but the neighboring RBD
(6VSB_A) is open, N343 glycan orients toward the pocket between the
NTD and the RBD and interacts with N165glycan, which makes the open
to closed state transition of the neighboring RBD difficult. When
the RBD is closed (all expect 6VSB_A and 6VSB_B) and the neighboring
RBD is also closed, N343 glycan makes extensive interactions with
the neighboring RBD (Figure E), which contributes to the stability of both RBDs in a closed
conformation. The atom contacts between RBD N343 glycan and the neighboring
RBD exist in more than 95% of snapshots when both RBDs are closed
(Figure F). Given
that the RBD is shown to constantly transit between open and closed
states in the experiment, we propose that glycans serve as a clutch
that temporarily holds the RBD in an open or closed conformation,
which modulates the lifetime of both open and closed states.
Figure 5
Critical roles
of N165, N234, and N343 glycans in the transition
between RBD open and closed conformations. (A) The open-state RBD
(6VSB_A, pale yellow) is above N165 (red) and N234 (cyan) glycans
on the neighboring NTD. (B) The closed-state RBD (6VSB_B) is between
N165 and N234 glycans. (C) Frequency of contacts between the RBD and
two glycans (N165 and N234). (D) N343 (magenta) glycan on the open-state
RBD (6VSB_A) is free. (E) N343 glycan on the closed-state RBD (6VSB_C)
interacts with the neighboring closed-state RBD (6VSB_C). (F) Frequency
of contacts between N343 glycan and the neighboring RBD.
Critical roles
of N165, N234, and N343 glycans in the transition
between RBD open and closed conformations. (A) The open-state RBD
(6VSB_A, pale yellow) is above N165 (red) and N234 (cyan) glycans
on the neighboring NTD. (B) The closed-state RBD (6VSB_B) is between
N165 and N234 glycans. (C) Frequency of contacts between the RBD and
two glycans (N165 and N234). (D) N343 (magenta) glycan on the open-state
RBD (6VSB_A) is free. (E) N343 glycan on the closed-state RBD (6VSB_C)
interacts with the neighboring closed-state RBD (6VSB_C). (F) Frequency
of contacts between N343 glycan and the neighboring RBD.In addition, we calculated the accessible surface area (ASA)
reduced
due to the formation of the S trimer. For example, the ASA reduction
for chain A was calculated by SA + SBC –
SABC, where SA, SBC, and SABC are the ASA of chain A, chains BC-only complex, and chains ABC complex
(i.e., S trimer), respectively. The ASA reduction due to trimer formation
was split into the portion from protein only and the portion involving
glycans. The trimer interface interaction involving glycans is about
30% when the entire S1 subunit is considered, and it increases to
about 40% when only the RBD and the NTD are considered (Figure S8). This suggests that glycans make significant
contributions to the stability of the S trimer, which is different
from the common belief that protein–protein interactions are
the only dominating factor to the stability of a protein multimer.
Impact of Glycan Shielding on Antibody Binding
is Overestimated
Viruses evolve to minimize the immunogenicity
by coating the exposed viral proteins with non-immunogenic or weakly
immunogenic glycans. It is commonly believed that the glycans on the
viral envelope shield viruses from the host immune system. To get
an impression of such glycan shielding, we aligned the S head in each
trajectory snapshot and the glycan distributions are shown in Figure A. Most glycans are
very flexible and they move around in a wide range of space, which
covers most of the trimer surface. However, the role of glycan is
not limited to shielding. During the past decade, many glycan-dependent
HIV neutralizing antibodies have been discovered and extensively studied,
which target both the envelope protein and glycans.[48−50] In the cryo-EM
structure of the S trimer in the complex with the S309 antibody (PDB
ids: 6WPS and 6WPT(51)), S309 interacts with the glycan attached to N343.
Figure 6
Clash between
glycans and superimposed antibodies. (A) Distribution
of glycans when S head structures in multiple snapshots are aligned
(left). Four epitopes targeted by neutralizing antibodies are shown
in different colors (right). (B) H11-D4 epitope (green) in the RBD
(left: open, right: closed), N165 glycan (red) on the neighboring
NTD, and N343 glycan (violet) on the neighboring RBD. (C) Distributions
of glycan heavy atom numbers in clash (NCLASH) with the superposed H11-D4 antibody. (D) S309 epitope (yellow)
and N331 (orange) and N343 (violet) glycans on the RBD (left: open,
right: closed), N122 (blue) and N165 (red) glycans on the neighboring
NTD. (E) Distributions of NCLASH with
the superimposed S309 antibody.
Clash between
glycans and superimposed antibodies. (A) Distribution
of glycans when S head structures in multiple snapshots are aligned
(left). Four epitopes targeted by neutralizing antibodies are shown
in different colors (right). (B) H11-D4 epitope (green) in the RBD
(left: open, right: closed), N165glycan (red) on the neighboring
NTD, and N343 glycan (violet) on the neighboring RBD. (C) Distributions
of glycan heavy atom numbers in clash (NCLASH) with the superposed H11-D4 antibody. (D) S309 epitope (yellow)
and N331 (orange) and N343 (violet) glycans on the RBD (left: open,
right: closed), N122 (blue) and N165 (red) glycans on the neighboring
NTD. (E) Distributions of NCLASH with
the superimposed S309 antibody.To explore the role of glycans in antibody binding, we used TM-align[52] to superimpose the RBD in RBD–antibody
complex structures in the PDB onto the RBD in each simulation snapshot
and calculated the number of glycan heavy atoms that have clashes
with the antibodies (NCLASH, Figure S9). In this work, we discuss four RBD-targeting
antibodies, namely, B38, CR3022, H11-D4, and S309 (Figure A). The epitopes of B38 and
CR3022 are irrelevant to glycans. B38 binds to the same region of
RBD as ACE2 does. This epitope is fully exposed in the open-state
RBD, but when the RBD is closed, the B38 epitope is masked by the
neighboring RBD in either the open or closed state (Figure S10A). The epitope of CR3022 is on the inner surface
of the RBD, and it is only accessible when all three RBD are open
(Figure S10B).The epitope of H11-D4
is next to the epitope of B38, and it is
also fully exposed when the RBD is open. When the RBD is closed, the
glycans attached to N165 and N343 on the neighboring chain are located
near this epitope (Figure B and S10C). As shown in the distribution
of NCLASH with H11-D4 (Figure C), N343 glycan rarely interferes
with the antibody, but N165glycan has high probabilities to make
severe clashes with the antibody when the RBD is closed. For comparison,
we also aligned a nanobody to the RBD in each simulation snapshot.
Though N165glycan still frequently clashes with the nanobody, the
frequency of severe clashes is much lower (Figure S11); thus, there is a chance that a nanobody can bind to this
epitope as shown in PDB id 6Z43.The epitope of S309 is surrounded by four glycans.
Two of them
are attached to N331 and N343 on the same RBD, and the other two are
attached to N122 and N165 on the neighboring NTD (Figure D and S10D). The distributions of NCLASH with S309
are shown in Figure E. N165 and N331glycans rarely interfere with the S309 antibody
in both open- and closed-state RBD. N343 glycan has minor clashes
with the antibody in most snapshots, and such minor clashes are not
sufficient to block antibody binding, as these clashes can be easily
removed with small changes in glycan conformation and orientation.
The antibody–glycan interactions can also contribute to antibody
binding, which is observed in the cryo-EM structures. In more than
half of all snapshots, N122 glycan has severe clashes with the antibody
when the RBD is open, but it moves away from the superposition of
the antibody in the remaining snapshots. This suggests that the S309
epitope in the open-state RBD is blocked by N122 glycan in more than
half of the simulation time, but it is still accessible to the antibody.As a comparison, we calculated the ASA of S309 and H11-D4 epitopes
using a probe radius of 7.2 Å that is commonly used to approximate
the size of hypervariable loops of the antibody and compared the epitope
ASA with and without glycans (Figure S12). For H11-D4, we observed significant decreases in the epitope ASA
in all chains except 6VSB_A and 6VSB_C, which is generally consistent
with the frequency of clashes between glycans and antibodies. However,
for S309, the epitope ASA decreases significantly in all chains when
glycans are present. This is contradictory to the result that N343glycans occasionally have only slight clashes with the superimposed
antibody when the RBD is closed (all except 6VSB_A). In addition,
the PDB structures of the S trimer in the complex with S309 show that
N343 glycan interacts with the antibody. In the calculation of ASA,
a point on the surface is considered inaccessible even if the probe
sphere has a very tiny clash with the molecule. However, the shape
of the antibody is not a sphere and it can have narrow-shaped regions
that extend deeply into the pocket in the epitope. A glycan like the
one attached to N343 can reduce the epitope ASA even though it may
contribute to antibody binding. Therefore, in some cases, a simple
comparison of protein ASA with and without glycans is likely to overestimate
the impact of glycan shielding.
Conclusions
In this work, we present multiple μs-long all-atom MD simulations
of the fully glycosylated full-length S protein in a viral membrane.
Our MD simulations reveal the overall shape of the S protein, and
its orientation on the membrane surface is determined by a highly
flexible stalk composed of two independent joints. Importantly, S
protein models from our simulations allow us to predict the possible
configurations of the S protein–ACE2-B0AT1 complex
with allowable orientations and distances between two S proteins on
the membrane surface. The simulation here also provides insights into
how glycans influence the open/closed state change of the RBD and
the antibody binding to RBD epitopes. We identify glycans attached
to multiple glycosylation sites that stabilize the open and/or closed
states of the RBD by making a high energetic barrier between the open–closed
transition. The simulation of non-glycosylated systems shows that
the open-state RBD becomes unstable when glycans are removed and the
transition to the closed state occurred at the early stage of the
simulation. By aligning the RBD–antibody complex structures
to the simulation trajectories, we reveal that the impact of glycan
shielding is overestimated by a simple ASA analysis. More importantly, glycans not only serve as
shields for immune evasion but also contributes to antibody binding.
Our work sheds light on the full structure and dynamics of the S protein
and we hope our
work to be useful for the development of vaccines and antiviral drugs.
Authors: Meng Yuan; Nicholas C Wu; Xueyong Zhu; Chang-Chun D Lee; Ray T Y So; Huibin Lv; Chris K P Mok; Ian A Wilson Journal: Science Date: 2020-04-03 Impact factor: 47.728
Authors: Lorenzo Casalino; Zied Gaieb; Jory A Goldsmith; Christy K Hjorth; Abigail C Dommer; Aoife M Harbison; Carl A Fogarty; Emilia P Barros; Bryn C Taylor; Jason S McLellan; Elisa Fadda; Rommie E Amaro Journal: ACS Cent Sci Date: 2020-09-23 Impact factor: 14.553
Authors: Aitor Casas-Sanchez; Alessandra Romero-Ramirez; Eleanor Hargreaves; Cameron C Ellis; Brian I Grajeda; Igor L Estevao; Edward I Patterson; Grant L Hughes; Igor C Almeida; Tobias Zech; Álvaro Acosta-Serrano Journal: mBio Date: 2022-02-15 Impact factor: 7.867
Authors: Marija Sorokina; Jaydeep Belapure; Christian Tüting; Reinhard Paschke; Ioannis Papasotiriou; João P G L M Rodrigues; Panagiotis L Kastritis Journal: J Mol Biol Date: 2022-05-17 Impact factor: 6.151
Authors: Aoife M Harbison; Carl A Fogarty; Toan K Phung; Akash Satheesan; Benjamin L Schulz; Elisa Fadda Journal: Chem Sci Date: 2021-11-25 Impact factor: 9.825