Xiaocong Wang1, Lihua Bie1, Jun Gao1. 1. Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, Hubei, China.
Abstract
The viral entry process of the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) requires heparin and heparan sulfates from the cell surface, functioning as a cofactor for human angiotensin-converting enzyme 2 (ACE2) for recognizing the receptor-binding domain (RBD) of the spike (S) protein on the surface of the virion. In the present study, the binding poses of an oligosaccharide with four repeating units of GlcNS6S-IdoA2S (octa) predicted by Vina-Carb in the RBD binding site were employed in molecular dynamics (MD) simulations to provide atomic details for studying the cofactor mechanism. The molecular model in the MD simulations reproduced the length- and sequence-dependent behavior observed from the microarray experiments and revealed an important planar U-turn shape for HP/HS binding to RBD. The model for octa with this shape in the ACE2-RBD complex enhanced the interactions in the binding interface. The comparisons with the ACE2-RBD complex suggested that the presence of octa in the RBD binding site blocked the movements in a loop region at the distal end of the RBD binding interface and promoted the contacts of this loop region with the ACE2 N-terminus helix. This study shed light on the atomic and dynamic details for HP/HS interacting with RBD and provided insights into their cofactor role in the ACE2-RBD interactions.
The viral entry process of the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) requires heparin and heparan sulfates from the cell surface, functioning as a cofactor for human angiotensin-converting enzyme 2 (ACE2) for recognizing the receptor-binding domain (RBD) of the spike (S) protein on the surface of the virion. In the present study, the binding poses of an oligosaccharide with four repeating units of GlcNS6S-IdoA2S (octa) predicted by Vina-Carb in the RBD binding site were employed in molecular dynamics (MD) simulations to provide atomic details for studying the cofactor mechanism. The molecular model in the MD simulations reproduced the length- and sequence-dependent behavior observed from the microarray experiments and revealed an important planar U-turn shape for HP/HS binding to RBD. The model for octa with this shape in the ACE2-RBD complex enhanced the interactions in the binding interface. The comparisons with the ACE2-RBD complex suggested that the presence of octa in the RBD binding site blocked the movements in a loop region at the distal end of the RBD binding interface and promoted the contacts of this loop region with the ACE2 N-terminus helix. This study shed light on the atomic and dynamic details for HP/HS interacting with RBD and provided insights into their cofactor role in the ACE2-RBD interactions.
The ongoing pandemic of novel coronavirus 2019 (COVID-19) has caused serious public health
crises and enormous economic damages around the globe.[1] Significant
efforts have been devoted to studying the novel severe acute respiratory syndrome
coronavirus 2 (SARS-CoV-2), a new virus belonging to the β coronavirus family that has
caused this pandemic.[2,3]
SARS-CoV-2 is a single-stranded RNA-envelop virus,[4] which utilizes the
receptor-binding domain (RBD) of its highly glycosylated trimeric spike (S) protein on the
surface of the virion for the recruitment to the host receptor, human angiotensin-converting
enzyme II (ACE2), to facilitate its entry via the fusion between the viral membrane and the
host target membrane.[5−7] Due to the pivotal role of
the S protein in viral recognition and entry, as well as their locations on the viral
surface, S proteins have been employed as immunogens for generating neutralizing vaccines
and targets for developing therapeutic drug molecules.[8,9] Similarly, ACE2 has also been studied as a
potential target for interfering with the ACE2-targeting coronavirus
infections.[10,11]
Thus, a detailed understanding of the binding interactions between SARS-CoV-2 and ACE2 is
essential for elucidating the binding and entry of this new virus and its enhanced
infectivity,[6,12−15] as well as rational design for effective
therapeutics.[16]Previous studies have revealed that after the S protein undergoes a hinge-like
conformational change that exposes the RBD,[5−7,17] two short β strands with connecting loops and a flexible loop at
the distal end in RBD of the SARS-CoV-2 S protein were recognized by the N-terminal helix of
ACE2.[12,18] This
recognition and binding processes engaged a network of hydrophilic interactions formed by
multiple hydrogen bonding interactions and salt bridges.[7,12] Further studies identified significant
involvements of glycans in this process[19−21] after the
structural analysis showed that both SARS-CoV-2 S protein and ACE2 are heavily glycosylated.
For example, the glycans at N165 and N234 in RBD of the SARS-CoV-2 S protein play roles in
the conformational plasticity of RBD, where they stabilize the open state of
RBD;[22,23] the glycans
at N331 and N343 are shown to be critical for immune recognition.[23,24] In the meantime, the glycans at N53
in ACE2 facilitate the stabilization of the ACE2–RBD binding surface;[21] the glycans at N90 have been theoretically shown to enhance the
ACE2–RBD binding.[25−27]The exterior glycans that are not conjugated with the S protein or ACE2 have also been
shown to modulate their binding interactions. A recent seminal study by Clausen et
al.[28] revealed that heparin (HP)/heparan sulfate (HS) functions as a
necessary cofactor to facilitate the recruitment of RBD in the S protein of SARS-CoV-2 to
ACE2 by interacting with residues that are adjacent to the ACE2-binding site. As the binding
sites for ACE2 and HP/HS in the RBD of the SARS-CoV-2 S protein are
adjacent,[28,29]
understanding the mechanism of RBD–HP/HS interactions and their relationship with the
ACE2–RBD binding interactions could exceedingly advance our knowledge of the
SARS-CoV-2 infections and provide insights into a promising way for mitigating the
pandemic.[30−32]HP/HS is a major class of unbranched and highly negatively charged glycocalyx
polysaccharide, namely glycosaminoglycans (GAGs), which are composed of sulfated repeating
[d-glucuronic acid (GlcA) or l-iduronic acid (IdoA)] and
d-glucosamine (GlcN) disaccharide units and regulates a wide range of activities,
including cell dynamics, inflammation, signaling pathways, etc., through their interactions
with different proteins.[33−35] Liu et al.[36] performed microarray binding experiments with an extensive HP/HS
oligosaccharide library and revealed that the bindings between the RBD in the SARS-CoV-2 S
protein and HP/HS depend on the length and sequence of the oligosaccharides, in which HP
with four GlcNS6S-IdoA2S repeating units showed stronger
bindings than those with three repeating units and removing the sulfate moieties
significantly reduced their binding intensities. However, the lack of the crystallographic
structures of the RBD–HP/HS complexes obstructs the understanding of their
interactions at the atomic level. Given that HP/HS is a linear unbranched polysaccharide
that could undergo large internal motions to adopt multiple energy-favored conformations, it
is also essential to study their interactions with accurate predictions of their 3D shapes
and comprehensive representations for their dynamical motions in binding
sites[37−39]In the present study, a model with four repeating unit of
GlcNS6S-IdoA2S was built and docked to the putative
binding site in the RBD of the SARS-CoV-2 S protein. Molecular dynamic (MD) simulations were
performed for the complex models that were built with proper force fields. Careful
validations of the RBD complex model in MD simulations were carried out by reproducing their
length- and sequence-dependent behavior observed in the microarray experimental results. The
oligosaccharide in the validated binding pose presented a U-shape, which permits multiple
residues at the turn forming stable interactions with previously identified key arginine
residues simultaneously, as well as those at the reducing and nonreducing ends. This U-shape
is speculated to be a general structural feature for HP/HS in the RBD binding sites, as the
energy-favored glycosidic linkage conformations in HP/HS allows five consecutive
monosaccharides in the same plane, which parallels with the side chains of arginine residues
in the binding site and maximizes the possibilities of forming stable interactions.The validated binding pose for the oligosaccharide was merged into the model of the
ACE2–RBD complex to study the mechanism of HP promoting the ACE2–RBD binding
interactions. The essential stable intermolecular interactions in the concave
ACE2–RBD binding interface, observed in the crystal structure of the ACE2–RBD
complex, were reproduced by MD simulations with and without bound HP. But, the presence of
HP would block the motion of a distal end loop region that resides between the binding sites
of ACE2 and HP. This blockage reduced the freedom of the loop region and promoted its
interactions with residues at the N-terminus of ACE2, which agreed with the observation that
HP/HS functions as a cofactor for the recruitment of SARS-CoV-2 by the ACE2 at the host cell
membranes.The present theoretical study provided structural insights into the poses of HP in the RBD
binding site and the detailed atomic mechanism of the HP promoting the ACE2–RBD
interactions.
Methods
Structure Preparation
The initial coordinates for the S protein in SARS-CoV-2 were obtained from the Protein
Data Bank (PDB entry code: 6VYB(5)). The RBD domain was extracted from its chain B
(residues 334–527). Missing residues, 455–461, 467–490, and
516–521 were built in the SWISS-MODEL webserver[40] with the
template from the RBD in 6VW1.[12] The initial coordinates for the ACE2–RBD
complex were also obtained from the Protein Data Bank (PDB entry code: 6VW1(12)). For
consistency, the index of residues in this study followed those in 6VW1. The initial structure for octa was
built with the GAG builder on www.glycam.org. The
ring conformations of GlcNS6S and IdoA2S were set to
4C1 and
1C4 prior to docking, respectively. Docking was
performed with Vina-Carb[41] with the center of a docking region of 60
× 60 × 60 Å3 at the geometry center for the side chains of R346,
R355, and R466, and the top 6 ranked poses were extracted. Default Vina-Carb options
(chi_coeff = 1 and chi_cutoff = 2) were used. Force
field parameters for carbohydrate molecules were taken from GLYCAM06 (version j),[42] and those for proteins were taken from AMBER18
(ff99sb).[43,44]
Counterions (Na+) were added to neutralize each protein complex using the tLEaP
module[44] before they were solvated in a truncated octahedral box (8
Å buffer with TIP3P water model).
Simulation Setup
A two-step energy minimization of the solvated complexes was performed under the NVT
conditions: first, only the water molecules and counterions were subjected to energy
minimization (500 steps SD followed by 24 500 steps CG) and the atoms in solute
were restrained (100 kcal/mol·Å2); in the second step, the energy
minimization circle was repeated with the restraints applied only to the Cα atoms on
the protein backbone and ring atoms in carbohydrate molecules. Then, the solvated
complexes were heated to 300 K within 50 ps under the NVT conditions with a restraint (10
kcal/mol·Å2) applied only to the Cα atoms in the protein.
Prior to data collection, the solvated complexes were equilibrated at 300 K under NPT
conditions with a Berendsen thermostat[45] for 10 ns, during which all
restraints were removed. MD simulations for the data collection of each solvated complex
were performed for 500 ns with the GPU implementation[46] of PMEMD using
the AMBER18 software package.[44] Covalent bonds involving hydrogen atoms
were constrained using the SHAKE algorithm[47] in all MD simulations,
which allows a simulation time step of 2 fs. A nonbonded cutoff of 8 Å was applied to
van der Waals interaction energy calculations, and the particle mesh Ewald approximation
was applied to the long-range electrostatic interaction energy calculations. Standard
1–4 nonbonded scaling factors for proteins (2.0/1.2) and carbohydrate molecules
(1.0/1.0) were employed.[42]
Representative Conformation of Octa and Model Generations for hexa-1, hexa-2, and
ACE2–RBD-octa
The conformation of octa that was most similar to its average shape in the RBD complex
acquired from the MD simulations was extracted and presented as its representative
conformation. The GlcNS6S-IdoA2S segment was removed
from the reducing and nonreducing ends of octa in the RBD complex to generate the
molecular models for RBD-hexa-1 and RBD-hexa-2 complexes, respectively. The molecular
model for ACE2–RBD-octa was created by adding octa to the model for the
ACE2–RBD complex after superimposing the Cα atoms of the stable secondary
structures (residues 353–359, 375–381, 393–403, 430–438,
507–517) in the RBD of the ACE2–RBD complex and the representative
conformation of the RBD-octa complex.
Interaction Energy Calculations
Interaction energies in the complexes were calculated using the molecular
mechanics-generalized Born solvent-accessible surface area (MM-GBSA)[48,49] under the single trajectory
methodology with the MMPBSA.py.MPI module in AMBER on 10 000 snapshots extracted
evenly from each MD simulation. The GB1OBC model (igb = 2)[50] and internal dielectric constant (εint) of 4.0 were
applied in all MM-GBSA calculations.[51,52]
Results and Discussion
Predicting HP Binding Poses in RBD
HP is an unbranched oligosaccharide that could undergo large conformational changes by
altering the conformations of glycosidic linkages to adopt multiple energy-favored
conformations. Thus, predicting the pose of HP in the RBD binding site requires not only
approximations of binding interaction contributions from individual monosaccharides but
also estimations of the penalties from different glycosidic conformations that determine
the overall shape of the binding pose. Thus, Vina-Carb[41] was employed
for this purpose, as it was designed to include the glycosidic linkage conformations in
its scoring function and has been proven to be effective in predicting the binding poses
for flexible oligosaccharides in protein complexes.[53,54] The center for the docking region was selected
as the geometric center of the atoms in R346, R355, and R466, which were shown to be
critical for interacting with HP in previous studies[28] and were located
in a cavity next to the ACE2-binding site.A recent microarray study on the RBD of the SARS-CoV-2 S protein using an HP/HS library
suggested the length-dependent behavior of the bindings between HP/HS oligomers and
RBD.[36] An octasaccharide with four repeating units of
GlcNS6S-IdoA2S displayed the strongest binding
affinity, followed by a hexasaccharide with three repeating units. Further reducing the
length of the oligosaccharide almost abolished the binding interactions. According to this
finding, a molecular model for this octasaccharide (octa) was built and docked in the RBD
binding site with Vina-Carb. The top six poses with the predicted binding energies more
negative than −4.0 kcal/mol were extracted to form RBD-octa complexes for further
MD simulation studies (Table S1). All six selected poses of octa (Figures and S1) have shown to interact with at least two of the three key residues in
its binding site. Obviously, octa in poses 2, 3, and 6 shared a similar U-shape in the
binding site, which allows the residues in the turn form interactions with R355 and R466
simultaneously, and those at the reducing and nonreducing ends to interact with R346.
Although the docking region was set to cover the entire binding site, R346, R355, and R466
appeared to be essential for the predicted octa poses, as all 20 poses showed close
proximity to these residues. It is worth noting that these three selected poses were not
shown to interact with K444, a key residue in RBD for interacting with HP reported in the
pioneering experimental studies. Longer oligosaccharides with this U-shape are highly
likely to interact with K444 through the terminal residues. The U-shape was observed in
the other three poses, as well as 11 out of the remaining 14 docked poses (Figure S2 and Table S1), yet their orientations and locations differed. In
the MD simulations of RBD-octa complexes built with octa in poses 2, 3, and 6, the
oligosaccharide displayed positional stabilities. The total RMSDs for the ring atoms in
octa were less than 12 Å (Figure ).
However, octa failed to maintain stable binding interactions in the RBD complexes built
with octa in the other three poses, where the RMSDs reached up to 30 Å in the MD
simulations (Figure S1).
Figure 1
(A) Schematic of the structure of HS with four repeating units of
GlcNS6S-IdoA2S (octa); (B) Poses 2, 3, and 6 (red,
magenta, and yellow, respectively) of octa predicted by Vina-Carb in the binding site
of RBD for the SARS-CoV-2 S protein. The oligosaccharide is shown in the stick model
with the reducing end GlcNS6S and nonreducing end
IdoA2S labeled. The protein solvent-accessible surface is shown in
gray and those for the key residues in RBD interacting with HS are shown in blue and
labeled. (C) The representative conformations of octa from the MD simulations started
with poses 2, 3, and 6 (red, magenta, and yellow, respectively). The oligosaccharide
is shown in a stick model with the reducing end GlcNS6S and
nonreducing end IdoA2S labeled. The protein solvent-accessible
surface is shown in gray and those for the key residues in RBD interacting with HS are
shown in blue and labeled. (D–F) 2-D positional RMSD plots for the ring atoms
in octa from the MD simulations started with octa in poses 2 (D), 3 (E), and 6 (F).
Structures were evenly extracted every 0.05 ns from each MD simulation.
(A) Schematic of the structure of HS with four repeating units of
GlcNS6S-IdoA2S (octa); (B) Poses 2, 3, and 6 (red,
magenta, and yellow, respectively) of octa predicted by Vina-Carb in the binding site
of RBD for the SARS-CoV-2 S protein. The oligosaccharide is shown in the stick model
with the reducing end GlcNS6S and nonreducing end
IdoA2S labeled. The protein solvent-accessible surface is shown in
gray and those for the key residues in RBD interacting with HS are shown in blue and
labeled. (C) The representative conformations of octa from the MD simulations started
with poses 2, 3, and 6 (red, magenta, and yellow, respectively). The oligosaccharide
is shown in a stick model with the reducing end GlcNS6S and
nonreducing end IdoA2S labeled. The protein solvent-accessible
surface is shown in gray and those for the key residues in RBD interacting with HS are
shown in blue and labeled. (D–F) 2-D positional RMSD plots for the ring atoms
in octa from the MD simulations started with octa in poses 2 (D), 3 (E), and 6 (F).
Structures were evenly extracted every 0.05 ns from each MD simulation.Having predicted a similar pose for octa in the RBD binding site from three different
Vina-Carb results that could maintain stable interactions with key residues during MD
simulations provided a solid starting geometry for further studies. Thus, the
representative conformation of RBD-octa from these three MD simulations was extracted for
further validating this novel binding pose and understanding the length- and
sequence-dependent behavior of HP binding to RBD. All MD simulations associated with this
representative conformation of octa in the following study, including the RBD complexes
with hexa and tetrasaccharides, and ACE2–RBD-octa complexes were determined with
three replicas for establishing the reproducibility of the observed phenomena in
simulations.
Modeling the Length- and Sequence-Dependent Behavior
The predicted pose for octa in the RBD binding site needs to replicate the length- and
sequence-dependent behavior observed in the microarray results using the HP/HS library
before it can be used to provide insights into the RBD–HS interactions and its
cofactor function for the recruitment of SARS-CoV-2 S protein by ACE2. Therefore, models
for RBD complexes with three repeating units of
GlcNS6S-IdoA2S, RBD-hexa-1, and RBD-hexa-2, were
generated by removing the GlcNS6S-IdoA2S unit at the
reducing or the nonreducing end of octa, respectively, from the representative
conformation of the RBD-octa complex, and subjected to MD simulations. For a fair
comparison, an MD simulation starting from the representative conformation of RBD-octa was
also performed. The sequence-dependent behavior could be verified by careful comparisons
of the binding contributions from different moieties in the oligosaccharide obtained by
hydrogen bond analyses and MM-GBSA energy calculations to the microarray results using the
HP/HS library.All three oligosaccharides, octa, hexa-1, and hexa-2, maintained stable bindings to RBD
during their MD simulations. The positional RMSDs for the ring atoms reference to the
starting geometry for MD simulations in all of these oligosaccharides in their three
replicas of the MD simulations were generally less than 10 Å (Figures S3–S5). For the RBD-octa complex, the low RMSD values
suggested that the binding pose originally predicted by Vina-Carb was maintained in the
new MD simulation. For the RBD-hexa-1 and RBD-hexa-2 complexes, the low RMSD values
indicated that hexa-1 and hexa-2 in MD simulations maintained the same predicted binding
pose as octa, since their starting geometries were generated by removing a disaccharide
from the reducing or nonreducing ends, respectively. As seen in Figure
, the representative conformation of hexa-1 in the RBD
complex from the MD simulation displayed a similar shape as the six residues in the
representative conformation for octa from the nonreducing end, while that for hexa-2
matched the six residues from the reducing end. The interaction similarities among the
three oligosaccharides with their counterparts in RBD corresponded to their spatial
similarities in the binding site. The stable intermolecular hydrogen bond interactions
formed by hexa-1 and hexa-2 were similar to those formed by the corresponding parts in
octa (Tables and S2–S4). The middle sections of hexa-1 and octa both interacted with
R355 and R466, while their nonreducing ends interacted with R346. In the meantime, the
nonreducing end of hexa-2, which matched the middle section of octa, interacted with R355
and R466, while its reducing end interacted with R346. Thus, the spatial and interaction
similarities of these oligosaccharides in the binding site exemplified the robustness of
the binding pose originally predicted by Vina-Carb and provided further confidence.
Figure 2
Representative conformations of octa (red), hexa-1 (green, left panel), and hexa-2
(green, right panel) in the binding site of RBD from each MD simulation.
Monosaccharides are shown in licorice representation and their identities are shown
with the 3D-SNFG nomenclature (GlcNS6S, blue/white cube;
IdoA2S, yellow/white diamond) inside each ring.[55] The protein solvent-accessible surface is shown in gray and those for the key
residues in RBD interacting with HP/HS are shown in blue and labeled.
Table 1
Stable Intermolecular Hydrogen Bond Pairs and Salt Bridges Observed in The Three
Replicas of MD Simulations for RBD-octa, RBD-hexa-1, and RBD-hexa-2
monosaccharide
residues in RBD interacted with octa
residues in RBD interacted with hexa-1
residues in RBD interacted with hexa-2
(1) GlcNS6S
SO3 (2)
Y351
–a
T345, R346, N354, K356
SO3 (6)
R346, N354
–
R346
(2) IdoA2S
SO3
–
–
R346, S469
(3) GlcNS6S
SO3 (6)
–
S469
(4) IdoA2S
SO3
R466, T470
S469, T470
R355, R466, S469
(5) GlcNS6S
N2
R466
R466
R466
O3
R355
R355
SO3 (2)
R355, K129, R466
R355, R466
R355, R466
SO3 (6)
R357
R357
R357
(6) IdoA2S
O3
R355
R355
R355
SO3
R346, N354, R466
N354, R466
R346, N354, R355, R466
(7) GlcNS6S
N2
–
R355
–
O3
–
N354, R355, K356
–
O4
–
N354
–
SO3 (2)
K356
K356, R357
–
SO3 (6)
–
R346
–
(8) IdoA2S
O4
R346
R346
–
O5
–
N354
–
O6
T345, K356
T345, R346
–
SO3
T345, R346
R346
–
No stable interactions observed.
Representative conformations of octa (red), hexa-1 (green, left panel), and hexa-2
(green, right panel) in the binding site of RBD from each MD simulation.
Monosaccharides are shown in licorice representation and their identities are shown
with the 3D-SNFG nomenclature (GlcNS6S, blue/white cube;
IdoA2S, yellow/white diamond) inside each ring.[55] The protein solvent-accessible surface is shown in gray and those for the key
residues in RBD interacting with HP/HS are shown in blue and labeled.No stable interactions observed.The microarray results for RBD using the HP/HS library showed that the oligosaccharide
with four repeating units of GlcNS6S-IdoA2S was 15%
stronger in binding than that with three repeating units. As seen in Table , the total MM-GBSA energies for octa were more negative
than those for hexa-1 and hexa-2 (6 and 9%, respectively). The trend in the binding energy
difference suggested that the sequence-dependent binding behavior was reproduced in the
models of RBD complexes. The hydrogen bond analysis showed that
2-O-sulfate of IdoA2S at the nonreducing end of octa,
hexa-1, and hexa-2 formed stable interactions with the residues in RBD, including R346,
which was previously identified as essential (Table ). Losing these interactions would lead to reductions of the binding strengths
for the oligosaccharide, which agreed to the over 60% binding strength reduction for the
hexasaccharide in the microarray experiments after replacing the nonreducing end
IdoA2S with a GlcA residue.[36] The stable hydrogen
bond interactions formed by the 2-O-sulfate moieties of other
IdoA2S residues in hexa-1 and hexa-2 also agreed with the elimination
of binding interactions of the hexasaccharide after further replacements of
IdoA2S with GlcA residues. Similarly, the hydrogen bond interactions
between 6-O-sulfate in GlcNS6S residues and RBD residues
in the binding site corresponded to weaker binding strength for hexasaccharide after
replacing GlcNS6S with GlcNS residues in the microarray experiments.
Energetically, the decomposition of MM-GBSA energies showed that
2-O-sulfates in IdoA2S and 6-O-sulfate
in GlcNS6S residues all contributed to the binding interactions (Table ), confirming their involvements in the
RBD–HP interactions. Therefore, the binding pose predicted by Vina-Carb for octa
reproduced the length- and sequence-dependent behavior of HP/HS interacting with RBD.
Table 2
Per-Residue MM-GBSA Interaction Energies (kcal/mol) of octa, hexa-1, and hexa-2
in the Binding Site of RBD
octa
hexa-1
hexa-2
sugar
sulfatea
subtotal
sugar
sulfate
subtotal
sugar
sulfate
subtotal
(1) GlcNS6S
–4.4 ± 1.1
–0.8 ± 0.2
–5.2 ± 1.2
–b
–
–
–8.8 ± 1.1
–0.7 ± 0.1
–9.5 ± 1.2
(2) IdoA2S
–0.7 ± 0.3
–0.3 ± 0.1
–1.0 ± 0.4
–
–
–
–2.4 ± 1.2
–0.3 ± 0.2
–2.7 ± 1.0
(3) GlcNS6S
–1.3 ± 1.0
–0.2 ± 0.0
–1.5 ± 1.0
–3.1 ± 1.1
–0.6 ± 0.4
–3.7 ± 1.5
–3.0 ± 1.2
–0.2 ± 0.0
–3.3 ± 1.1
(4) IdoA2S
–1.2 ± 0.5
–0.6 ± 0.4
–1.8 ± 0.8
–1.8 ± 0.1
–1.1 ± 0.2
–2.9 ± 0.3
–1.4 ± 0.3
–1.0 ± 0.3
–2.4 ± 0.5
(5) GlcNS6S
–5.0 ± 0.7
–0.6 ± 0.1
–5.6 ± 0.8
–4.3 ± 0.2
–0.5 ± 0.2
–4.8 ± 0.4
–4.1 ± 1.1
–0.6 ± 0.2
–4.8 ± 1.3
(6) Idoa2S
–3.7 ± 0.2
–2.5 ± 0.7
–6.2 ± 0.8
–3.4 ± 0.3
–2.7 ± 0.1
–6.1 ± 0.4
–2.9 ± 0.9
–2.2 ± 0.9
–5.1 ± 1.6
(7) GlcNS6S
–4.3 ± 0.7
–0.2 ± 0.1
–4.5 ± 0.7
–5.3 ± 0.9
–0.4 ± 0.1
–5.7 ± 0.7
–
–
–
(8) IdoA2S
–5.4 ± 1.3
–1.0 ± 0.2
–6.4 ± 1.5
–6.2 ± 0.3
–1.2 ± 0.1
–7.4 ± 0.3
–
–
–
total
–32.1 ± 2.4
–30.7 ± 0.5
–27.7 ± 2.2
Energy contributions for the sulfates at 6-position of GlcNS6S and
2-position of IdoA2S were separate from their attached
monosaccharides.
Residue not present.
Energy contributions for the sulfates at 6-position of GlcNS6S and
2-position of IdoA2S were separate from their attached
monosaccharides.Residue not present.
Hypothetical General Shape of HP in RBD
In addition to the theoretical reproductions of the length- and sequence-dependent
interaction behavior of RBD and HS, understanding the U-shape of octa in the RBD binding
site could provide further insights into the RBD–HS interactions and advanced
knowledge of rational drug designs.It is of interest to determine the energetic preferences toward the planar U-turn shape
in the oligosaccharide composed of repeating units of
GlcNS6S-IdoA2S. First, an oligosaccharide model with 8
repeating units of GlcNS6S-IdoA2S, whose glycosidic
linkages were in the energy-favored ϕ/ψ conformations suggested in the CHI
energy study,[41,56]
appeared to be a helical structure (Figure A,B).
In the helices, five consecutive residues in the oligosaccharide could generate a planar
U-turn geometry. As shown in Figures S6–S8, the glycosidic linkages in the oligosaccharide
appeared to maintain the initial conformations suggested in the CHI energy study,[41] suggesting that such a helical structure is energetically stable in
explicit solvents. Finally, the glycosidic linkage conformations were altered manually to
make the conformation of this oligosaccharide linear (Figures S9–S11), which was later subjected to MD simulations.
Interestingly, the oligosaccharide spontaneously formed a helical conformation, although
not as ordered as seen in the previous MD simulations. Therefore, theoretical models of an
oligosaccharide with repeating units of GlcNS6S-IdoA2S confirmed that a helical
conformation could be energy-favored in solution and potentially facilitate its binding to
the RBD. In the planar U-turn shape, the sulfate moieties extended vertically to the plane
(Figure B). Both the planar shape of the
oligosaccharide and extended exo-cyclic moieties favor the formation of
intermolecular hydrogen bonds and salt bridges between HP/HS and the receptor. The planar
shape was observed between 2nd and 4th GlcNS6S of octa in the
representative conformation of the RBD-octa complex (Figure C). The sulfate moieties also appeared to be vertical to the
oligosaccharide plane and formed multiple intermolecular interactions with residues in RBD
simultaneously (Tables S2–S4). It is worth noting that octa in the binding site of
RBD was not in the helical form as shown in the ideal model in Figure
A,B. The glycosidic linkage distributions for octa during MD
simulations showed that the terminal residues in octa could adopt multiple stable
positions, as their glycosidic linkages adopted multiple conformations. On the other hand,
the glycosidic linkages for most of the monosaccharides in the same plane adopted only the
energy-favored conformations. As the shapes of hexa-1 and hexa-2 related to the
corresponding sections in octa, both of their poses contained a planar U-turn. The absence
of a repeating unit from each end of octa caused only a slight reduction in the binding
affinity instead of a significant reduction or elimination.
Figure 3
Demonstrations of the planar U-turn in octa. (A) Representation of the surface of the
helical conformation of the oligosaccharide with eight
GlcNS6S-IdoA2S repeating units. (B) 3D-SNFG
representations (GlcNS6S, blue/white cube; IdoA2S,
yellow/white diamond) of the oligosaccharide with eight
GlcNS6S-IdoA2S repeating units.[55] The sulfate moieties are shown in the stick model. The residues in the middle
planar U-turn are drawn solid and the other residues are drawn transparent. (C)
3D-SNFG representations[55] (GlcNS6S, blue/white
cube; IdoA2S, yellow/white diamond) of octa in the representative
conformation of the RBD-octa complex by the MD simulations. The sulfate moieties are
shown in the stick model. The residues in the planar U-turn were labeled and drawn
solid; the other residues are drawn transparent. (D) Distributions of glycosidic
linkage conformations (ϕ: C4–O4–C1′–C2′ and
ψ: C3–C4–O4–C1′) in octa during the MD
simulation.
Demonstrations of the planar U-turn in octa. (A) Representation of the surface of the
helical conformation of the oligosaccharide with eight
GlcNS6S-IdoA2S repeating units. (B) 3D-SNFG
representations (GlcNS6S, blue/white cube; IdoA2S,
yellow/white diamond) of the oligosaccharide with eight
GlcNS6S-IdoA2S repeating units.[55] The sulfate moieties are shown in the stick model. The residues in the middle
planar U-turn are drawn solid and the other residues are drawn transparent. (C)
3D-SNFG representations[55] (GlcNS6S, blue/white
cube; IdoA2S, yellow/white diamond) of octa in the representative
conformation of the RBD-octa complex by the MD simulations. The sulfate moieties are
shown in the stick model. The residues in the planar U-turn were labeled and drawn
solid; the other residues are drawn transparent. (D) Distributions of glycosidic
linkage conformations (ϕ: C4–O4–C1′–C2′ and
ψ: C3–C4–O4–C1′) in octa during the MD
simulation.These observations of oligosaccharide conformations and changes of binding affinities
suggested that this planar U-turn in five consecutive residues is important for the
binding of HP/HS to RBD. Furthermore, the microarray study showed that the binding
strength of a tetrasaccharide with only two repeating units of
GlcNS6S-IdoA2S was under detection limits. For
comparisons, three tetrasaccharide models were generated by removing two repeating units
from the reducing end of octa (tetra-1) or two repeating units from the nonreducing end
(tetra-2), or one repeating unit from both ends (tetra-3). MD simulations of their
complexes with the RBD all displayed larger positional variations (Figure S12) compared to those observed in the RBD-octa complexes. Both the
experimental and modeling studies suggest that two repeating units of
GlcNS6S-IdoA2S with an incomplete planar U-turn would
significantly harm the binding strength. Furthermore, the binding strength for a
tetrasaccharide with only two repeating units of
GlcNS6S-IdoA2S with RBD was under the detection limit
in the microarray study, suggesting that an incomplete planar U-turn would significantly
harm the binding strength. The U-turn shape was also observed in other studies, both free
form in the solution phase and the protein complex[57,58] (Figure S13). Therefore, one can speculate that this planar U-shape with five
residues may be a general structural form for the HP/HS binding in the RBD binding
site.Collectively, the model of the octasaccharide was validated for representing the dynamic
and energetic features of the RBD–HS complex through the reproduction of the
length- and sequence-dependent binding behavior by MD simulations starting from the
predicted pose in the RBD binding site. It would be further employed in generating the
molecular model for the ACE2–RBD–HS complex, as the binding sites for HS and
ACE2 in RBD are separated.
Mechanism of the Cofactor Role for HP in ACE2–RBD Binding Interactions
Thus, having confirmed the validity of the molecular model for the RBD-octa complex, the
representative conformation for octa in the RBD binding site was utilized to study the
cofactor mechanism of HP/HS for the recognition of the SARS-CoV-2 S protein by ACE2. Prior
to studying the mechanism, the molecular model of ACE2–RBD-octa was examined by its
structural stabilities and ability to replicate the experimentally observed key
interactions at the ACE2–RBD binding interface in MD simulations.The overall RMSDs for Cα atoms in ACE2 and RBD were less than 10 Å, suggesting
that proteins in the ACE2–RBD-octa complex were structurally stable during the MD
simulation. In addition, similar to the RBD-octa complex, octa in the ACE2–RBD-octa
complex also displayed low structural variations along the course of MD simulations
(Figure S14). The RBD in the SARS-CoV-2 S protein binds to ACE2 through a
network of hydrophilic interactions of multiple hydrogen bond interactions and salt
bridges[7] in their binding interface that is formed between two short
β strands with connecting loops in RBD and an N-terminal helix in ACE2. Two hotspots
in the binding interface of the ACE2–RBD complex, hotspots 31 and 353, centered
with K31 and K353 on the ACE2-binding ridge were previously identified by crystallographic
measurements.[7] The hydrogen bond interactions between K31 and Q493 in
RBD in the crystal structure of the ACE2–RBD complex[12] were
observed in the MD simulation of the ACE2–RBD-octa complex (Table
). Although the key interactions between K353 and the
backbone oxygen atom of G496 in the crystal structure were not found to be stable in the
MD simulation, K353 formed stable hydrogen bond interactions with Y495 and Q498 that are
adjacent to G496 (Table and Figure ). In addition to the essential interactions formed by
the two key hotspot residues, more hydrogen bond interactions were observed within the
binding interface (Table and Figure ). Both Y41 and D355 in ACE2 formed stable interactions
with the side chain of T500 in RBD in the MD simulation; in the crystal structure of the
ACE2–RBD complex, Y41 and D355 were found to be within 4 Å of the side chain
of T500. Similarly, stable interactions were seen between the side chain of N330 in ACE2
and the backbone of T500 in RBD, in which the distances were less than 4 Å in the
crystal structure. Additional hydrogen bond interactions were observed between Q24 in the
ACE2-binding ridge and N481 in the loop region of the RBD binding interface, which could
potentially enhance the contacts between two regions. In the crystal structure, although
the loop region presented a different conformation that separated two residues, the
interactions between these two regions were still observed as F486 formed hydrophobic
contacts with the ACE2 N-terminus.
Table 3
Stable Intermolecular Hydrogen Bond Pairs in the Binding Interface of
ACE2–RBD in the Three Replicas of the MD Simulations for the
ACE2–RBD-octa and ACE2–RBD Complexes
ACE2
RBD-octa
RBD
residue
atom
residue – atom
residue – atom
Protein–Protein
Interactions
Q24
Nε
N481-O
K31
Nζ
Q493-Oε
Q493-Oε
H34
Nδ
Y453-Oη
E37
Oε
Y505-Oη
Y505-Oη
D38
Oδ
Y449-Oη
Y449-Oη
Y41
Oη
T500-Oγ
T500-Oγ
Q42
Nε
Q498-Oε
G446-O, Y449-Oη, Q498-Oε
Q325
T500-O
N330
Nδ
T500-O
T500-Oγ
K353
Nζ
Y495-O, G496-O, Q498-Oε
Y495-O, G496-O, Q498-Oε
D355
Oδ
T500-Oγ
T500-Oγ
Glycan–Protein
Interactions
N90-GlcNAcβ-GlcNAcβ-Manβ
O3
K417-Nζ
K417-Nζ
N90-GlcNAcβ-GlcNAcβ-Manβ
O2
K417-Nζ, Y421-Oη
K417-Nζ, Y421-Oη
O3
D420-Oδ
T415-Oγ, D420-Oδ
O4
D420-Oδ
T415-O, D420-Oδ
Figure 4
Demonstration of the binding interface of ACE2–RBD by the MD simulations of
the complexes of ACE2–RBD-octa (A) and ACE2–RBD (B) during the MD
simulations.
Demonstration of the binding interface of ACE2–RBD by the MD simulations of
the complexes of ACE2–RBD-octa (A) and ACE2–RBD (B) during the MD
simulations.For comparison purposes, the model of the ACE2–RBD complex (without octa) was also
built and subjected to MD simulations. Similar to the ACE2–RBD-octa complex, the
overall RMSDs for Cα atoms in the ACE2–RBD complex were less than 10 Å
(Figure S14), showing similar structural stability to those observed with the
presence of octa in the RBD binding site. Furthermore, the intermolecular hydrogen bond
interactions in the ACE2–RBD binding interface without octa in the complex were
also similar to those previously observed in the ACE2–RBD-octa complex (Tables and S5 and Figure ). The key
interaction pairs involving hotspot residues, K31–Q493 and K353–Q498, were
observed. Instead of the backbone of Y495, K353 formed interactions with the backbone of
G496, as seen in the crystal structure. In addition to the hotspot residues, D355 was also
observed to interact with T500. The pairing of N330–T500 at the distal end of the
binding interface was not observed as stable interactions in the MD simulation of the
ACE2–RBD complex, yet, the interactions between Q42 and Q498 were seen as stable.
Collectively, the models for the ACE2–RBD complex with and without the presence of
octa in the binding site of RBD maintained structural stabilities and reproduced essential
intermolecular hydrogen bond interactions observed in the ACE2–RBD binding
interface. Therefore, the dynamic and thermodynamic differences of these two complexes
were employed for understanding the cofactor mechanism of HP/HS in the recognition of RBD
in the SARS-CoV-2 S protein by ACE2.The stable hydrogen bond pair of Q24–N481 was observed when octa was bound to RBD,
but not when octa was absent. The loop region at the distal end of the binding interface
that contained the N756 residue showed distinct conformations in the representative
conformations for both complexes. Without the presence of octa, this loop region was seen
to be further away from the ACE2 N-terminus helix in the binding interface compared to
that in the complex with the presence of octa (Figure ). The positions of this loop region in the static representative conformation
in the ACE2–RBD complex extracted from the MD simulation appeared to be different
from that observed in the crystal structure.[7] Thus, to confirm the
dynamic behavior of the loop region beyond the static representations, it is necessary to
examine the positional variations of the loop region during the course of the MD
simulations in ACE2–RBD complexes with and without the presence of octa in the RBD
binding site. The loop region showed larger positional variations in the ACE2–RBD
complex (as high as 30 Å) than those in the ACE2–RBD-octa complex (under 10
Å) as seen from their positional RMSD values (Figure S15). Other theoretical studies also showed that this loop region
displayed higher dynamical motions without the presence of the oligosaccharide.[59] It can be speculated that the dynamic motions in the loop region were
significantly reduced in the ACE2–RBD complex with the presence of octa in the RBD
binding site.The principal component analysis (PCA) for the motions of the loop region was performed
to further study the correlations between the dynamic motion changes for the loop region
and the presence of octa in the RBD binding site. The PCA results agreed with the
positional variations observed through RMSD calculations (Figure ). The motions of the loop were stronger as indicated by the longer
arrows when octa was absent than those when octa was present. More importantly, the top
motion component of the loop in RBD without octa was toward the binding site of octa,
suggesting that the vacancy of the binding site for HP/HS in RBD offered more freedom for
the loop region. Conversely, the spatial freedom of the loop would be reduced when octa is
present in the binding site, and so that the motion. Given that the position of the loop
region in RBD is between the binding sites for ACE2 and HP/HS, the presence of octa in the
RBD binding site would block the movements of the loop and reduce its motions, which in
turn would promote its contact and binding interactions with the N-terminal helix of ACE2,
as observed in the MD simulation of the ACE2–RBD-octa complex.
Figure 5
Principle component analysis for the loop region (residue D467 to C488) in RBD in the
ACE2–RBD-octa (left panel) and ACE2–RBD (right panel) complexes along
the course of the MD simulations. ACE2 and RBD are shown in cyan and gray,
respectively. The top three components for the motion of the loop region are
represented with arrows: first – green, second – red, and third –
blue. Monosaccharides in octa are drawn in the 3D-SNFG nomenclature[55] (GlcNS6S, blue/white cube; IdoA2S, yellow/white
diamond).
Principle component analysis for the loop region (residue D467 to C488) in RBD in the
ACE2–RBD-octa (left panel) and ACE2–RBD (right panel) complexes along
the course of the MD simulations. ACE2 and RBD are shown in cyan and gray,
respectively. The top three components for the motion of the loop region are
represented with arrows: first – green, second – red, and third –
blue. Monosaccharides in octa are drawn in the 3D-SNFG nomenclature[55] (GlcNS6S, blue/white cube; IdoA2S, yellow/white
diamond).The total binding energy contribution for RBD residues in the ACE2–RBD binding
interface (Table S6), ones within 10 Å of ACE2, were −49.2 ± 3.2
kcal/mol with the presence of octa, which was 15% more negative than that without the
presence of octa (−42.9 ± 3.1 kcal/mol). This enhanced binding interaction
agreed energetically with the cofactor role of octa in the recognition of SARS-CoV-2 S
protein by ACE2. There were 11 residues (T478 to C488) in the loop region, which were
included in the binding energy calculations. The difference of the total contributions
from these 11 residues in the ACE2–RBD-octa (−4.6 ± 1.8 kcal/mol) and
ACE2–RBD (0.7 ± 0.7 kcal/mol) complexes were −5.3 kcal/mol, while the
difference of the contributions from the other binding residues, including those
interacting with the hotspot residues, was only −1.0 kcal/mol. The disproportion of
the binding contribution differences caused by the presence/absence of octa between the
loop region at the distal end and the main binding interface suggested that the loop
region is a key variable for the ACE2–RBD binding strength.In summary, the presence of octa in the RBD binding site blocked the movements of the
loop region at the distal end of the ACE2–RBD binding interface, which in turn
promoted the contacts of the loop region with the ACE2 N-terminus helix and enhanced the
binding interactions of the ACE2–RBD complex.
Conclusions
Glycosaminoglycans involve in many aspects of different viral activities. In the infections
of SARS-CoV-2, HP/HS has shown to be a cofactor for the recognition of SARS-CoV-2 S protein
by ACE2. Thus, understanding the mechanisms for HP/HS interacting with RBD and promoting
ACE2–RBD interactions could provide useful theoretical insights into the therapeutics
against COVID-19. The microarray study for the RBD of SARS-CoV-2 S protein using the HP/HS
library has revealed that the binding of HP/HS to RBD is length- and sequence-dependent;
nevertheless, the lack of crystallographic data for the RBD–HP/HS complexes has been
an obstacle for further studying the mechanisms at the atomic level.In the present study, putative poses for octa, an oligosaccharide with four repeating units
of GlcNS6S-IdoA2S, in the binding site of RBD of the
SARS-CoV-2 S protein were generated by Vina-Carb that includes penalties for energetic
unfavored glycosidic linkage conformations in the docking scoring functions. Only three of
the top six ranked poses maintained their initial docked poses and bound stably to RBD along
the course of their MD simulations. Coincidently, they shared similar conformational and
spatial characters. The representative conformation of octa by MD simulations starting from
these three docked poses was employed in building models for RBD complexing with
hexasaccharides, RBD-hexa-1 and RBD-hexa-2, to reproduce the previously observed length- and
sequence-dependent behavior before further mechanism studies. Hexa-1 and hexa-2 displayed
weaker binding strengths than octa, which agreed with the length-dependent behavior. The
2-O-sulfate and 6-O-sulfate groups of all three
IdoA2S and GlcNS6S residues, respectively, in hexa-1 and
hexa-2 formed stable intermolecular hydrogen bond interactions and contributed to the
binding interaction energies, which agreed with the sequence-dependent behavior of the
IdoA2S residues. A planar U-turn shape was observed in the five
consecutive residues in octa, and such a shape was in favor of the HP/HS binding to RBD. The
oligosaccharides with four and three repeating units of
GlcNS6S-IdoA2S showed binding to RBD experimentally, but
not the tetrasaccharide with two repeating units, which suggested that this planar U-turn
may be a general structural form for HP/HS binding to RBD. These agreements suggested that
the docked pose for octa in the RBD possessed the binding characters of HP/HS in RBD in the
natural cells, if the actual binding pose is not reproduced.The ACE2–RBD complex with octa in its validated representative conformation and
without octa in the RBD binding site both reproduced the experimentally observed
intermolecular interactions between the residues in the ACE2-binding ridge and those in the
corresponding RBD binding surface. In addition to the key hydrogen bond interaction pairs
from hotspot 31 and 353, several other stable hydrogen bond interactions were also observed
in its MD simulation. The total binding energy contributions from the RBD residues in the
ACE2–RBD binding surface in the ACE2–RBD-octa complex were greater than those
in the ACE2–RBD complex, agreeing to the experimental observations that the presence
of HS would promote the binding between ACE2 and RBD. The interactions between N481 from the
loop in RBD and Q24 in the ACE2 N-terminus were observed in the ACE2–RBD-octa
complex, but not in the ACE2–RBD complex in the MD simulations, which possibly caused
the binding energy differences between these two complexes. Further analyses showed that the
loop containing N481 in RBD displayed more positional variations in the ACE2–RBD
complex than the ACE2–RBD-octa complex. The decompositions of the motions showed that
the loop carried movements toward the HS binding site in the ACE2–RBD complex.
Therefore, given that the loop is adjacent to both octa and ACE2, it is reasonable to
speculate that octa in the RBD binding site blocked such motion and reduced the freedom of
the loop, which promoted the loop to interact with ACE2 on the other side.