Rommie E Amaro1, Pek U Ieong1, Gary Huber1, Abigail Dommer1, Alasdair C Steven2, Robin M Bush3, Jacob D Durrant4, Lane W Votapka5. 1. Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California, United States. 2. Structural Biology Laboratory, National Institutes of Health, Bethesda, Maryland, United States. 3. Department of Ecology and Evolutionary Biology, University of California, Irvine, Irvine, California, United States. 4. Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States. 5. Department of Chemistry, Point Loma Nazarene University, San Diego, California, United States.
Abstract
Studies of pathogen-host specificity, virulence, and transmissibility are critical for basic research as well as for assessing the pandemic potential of emerging infectious diseases. This is especially true for viruses such as influenza, which continue to affect millions of people annually through both seasonal and occasional pandemic events. Although the influenza virus has been fairly well studied for decades, our understanding of host-cell binding and its relation to viral transmissibility and infection is still incomplete. Assessing the binding mechanisms of complex biological systems with atomic-scale detail is challenging given current experimental limitations. Much remains to be learned, for example, about how the terminal residue of influenza-binding host-cell receptors (sialic acid) interacts with the viral surface. Here, we present an integrative structural-modeling and physics-based computational assay that reveals the sialic acid association rate constants (k on) to three influenza sites: the hemagglutinin (HA), neuraminidase (NA) active, and NA secondary binding sites. We developed a series of highly detailed (atomic-resolution) structural models of fully intact influenza viral envelopes. Brownian dynamics simulations of these systems showed how structural properties, such as stalk height and secondary-site binding, affect sialic acid k on values. Comparing the k on values of the three sialic acid binding sites across different viral strains suggests a detailed model of encounter-complex formation and indicates that the secondary NA binding site may play a compensatory role in host-cell receptor binding. Our method elucidates the competition among these sites, all present on the same virion, and provides a new technology for directly studying the functional balance between HA and NA.
Studies of pathogen-host specificity, virulence, and transmissibility are critical for basic research as well as for assessing the pandemic potential of emerging infectious diseases. This is especially true for viruses such as influenza, which continue to affect millions of people annually through both seasonal and occasional pandemic events. Although the influenza virus has been fairly well studied for decades, our understanding of host-cell binding and its relation to viral transmissibility and infection is still incomplete. Assessing the binding mechanisms of complex biological systems with atomic-scale detail is challenging given current experimental limitations. Much remains to be learned, for example, about how the terminal residue of influenza-binding host-cell receptors (sialic acid) interacts with the viral surface. Here, we present an integrative structural-modeling and physics-based computational assay that reveals the sialic acid association rate constants (k on) to three influenza sites: the hemagglutinin (HA), neuraminidase (NA) active, and NA secondary binding sites. We developed a series of highly detailed (atomic-resolution) structural models of fully intact influenza viral envelopes. Brownian dynamics simulations of these systems showed how structural properties, such as stalk height and secondary-site binding, affect sialic acid k on values. Comparing the k on values of the three sialic acid binding sites across different viral strains suggests a detailed model of encounter-complex formation and indicates that the secondary NA binding site may play a compensatory role in host-cell receptor binding. Our method elucidates the competition among these sites, all present on the same virion, and provides a new technology for directly studying the functional balance between HA and NA.
The
influenza virus is a negatively stranded RNA virus that encapsulates
its cargo with a host-derived lipid bilayer. The two surface glycoproteins
embedded in this bilayer, hemagglutinin (HA) and neuraminidase (NA),
have complementary functions. According to the accepted paradigm,
HA facilitates viral entry, and NA promotes viral release by cleaving
host-cell receptors.[1,2] Both NA and HA bind to sialic
acid (SIA), a common physiological monosaccharide with a nine-carbon
backbone[3,4] and the terminal residue on the host-cell
receptors to which the influenza virus binds. These two key glycoproteins
and their host-cell receptors are known to play roles in influenza
virulence and transmissibility. For example, HA from humaninfluenza
strains prefers binding to glycan receptors with a terminal α2,6
SIA linkage; in contrast, HA from the avian strains preferentially
recognizes the α2,3 SIA linkage.[5] Sialoglycans attached to the glycoproteins also help evade antibody
detection by masking epitopes.[6] The functional
equilibrium between HA and NA has also long been known to impact the
fitness, transmissibility, infectivity, and virulence of the virus.[5] Wagner et al. showed that an increase of HA affinity
for SIA must be counterbalanced with an increase in NA activity to
maintain effective replication.[7] A proper
HA/NA functional balance is also critical for human-to-human transmission
and for transmission across the species barrier.[8]Little is known about a secondary hemagglutination
(SIA-binding)
site adjacent to the primary NA sialidase (enzymatic) site,[9−11] though that site is likely to impact the HA/NA functional balance.
Previously, Sung et al. investigated the NA secondary site using Brownian
dynamics (BD) simulations of SIA binding to individual (isolated)
NA glycoproteins.[12] Typically BD simulations
treat solutes as rigid bodies that are free to tumble and translate
(diffuse) through a continuum solvent, driven by electrostatic and
stochastic forces that represent the interactions with the solvent
molecules. Because most of the details of the solvent and solute are
greatly simplified, simulations can be performed much more quickly
while still capturing important features of the receptor–ligand
interaction. These simulations can reveal second-order complex-encounter
association constants via the Northrup-Allison-McCammon formalism,[13] in which the ligand is started on a sphere surrounding
the receptor, and a trajectory is generated and terminated upon binding
or escaping to a larger surrounding sphere. The proportion of trajectories
that bind rather than escape can be used to compute the kon. Knowledge of the kon is
important for understanding the kinetics of molecular-recognition
events;[14] in the case of influenza, characterizing
how SIA encounters or is driven toward the various NA binding sites
gives insight into how the virus forms precatalytic “encounter”
complexes, potentially providing new opportunities for drug development.Sung et al. predicted that (1) SIA binding to the NA secondary
site in the circulating pandemic H1N1 strain is possible, despite
a bioinformatics analysis that speculated otherwise,[15] and (2) SIA association with the H1N1 strain is slower
than to the highly pathogenic H5N1 avian strain. Both predictions
were later confirmed experimentally by Lai et al.[16] through saturation transfer difference (STD) nuclear magnetic
resonance (NMR) on isolated NAs. Together, these single-glycoprotein
studies yield new insights into the association of sialic acid with
its various NA sites, with the simulations offering mechanistic details
not possible with experiments alone. However, these studies did not
include HA, which could potentially compete for drug or host-cell–receptor
binding. Thus, they provided limited insight into in vivo SIA binding,
which involves both HA and NA active sites in the context of a whole
viral surface.Assessing the HA/NA functional balance in physical
viruses with
atomic-level detail is currently experimentally intractable. Furthermore,
experimental assays typically test HA or NA in isolation, and their
results cannot be directly correlated with each other, because different
methods are used for the individual studies.[17] On the one hand, ethical concerns or the possibility of viral escape
also limit some experiments.[18] “Computational
assays,” on the other hand, are not subject to the same limitations.[19] With sufficient experimental integration and
validation, computational assays can extend and enrich our understanding
of complex phenomena beyond what is possible experimentally.[20] In this work, we devise a computational assay
that allows the time-dependent study of structurally realistic viral
surfaces and their interactions with small molecules.We sought
to mimic a series of studies that have investigated the
HA/NA functional balance and its impact on transmissibility. We created
two in silico influenza-coat models based on known clinical isolates
(Figure ). The VIET04
model represents the highly pathogenic avian influenza (A/Vietnam/1203/2004),
which has low transmissibility.[21] The CALI09
model represents the 2009 pandemic flu (A/California/04/2009) that
resulted in ∼60.8 million cases.[22]
Figure 1
An
illustration of the atomic-resolution viral-surface models,
shown with reduced resolution to illustrate differences in the NA
stalk heights. 09H1 and H5 are colored in blue and green, respectively.
09N1 and avian N1 are colored in white and yellow, respectively. (A)
The wild-type H1N1 virus (long-stalk NA). (B) The H5N1 virus with
a short-stalk H1N1 NA. (C) The reassortant H5N1 virus studied by Imai
et al.[25] (D) The reassortant H1N1 virus
studied by Blumenkrantz et al.[24]
An
illustration of the atomic-resolution viral-surface models,
shown with reduced resolution to illustrate differences in the NA
stalk heights. 09H1 and H5 are colored in blue and green, respectively.
09N1 and avian N1 are colored in white and yellow, respectively. (A)
The wild-type H1N1 virus (long-stalk NA). (B) The H5N1 virus with
a short-stalk H1N1NA. (C) The reassortant H5N1 virus studied by Imai
et al.[25] (D) The reassortant H1N1 virus
studied by Blumenkrantz et al.[24]We also created two in silico
models based on lab-created strains
(Figure ). Blumenkrantz
et al. used reverse genetics to replace the NA of A/California/04/2009
(CALI09) with the NA of A/Vietnam/1203/2004 (VIET04), which includes
a 20-amino acid stalk deletion correlated with limited mammalian viral
replication[23] and transmission.[24] Blumenkrantz et al. showed that their construct
has a reduced ability to cleave complex sialoglycan substrates and
that it exhibits reduced viral transmissibility among ferrets, the
mammalian model for humaninfluenza infection. We call our model of
this construct “BLUMEN” after the group that performed
the NA replacement. Imai et al. also built a reassortant flu virus
by swapping the H1 gene in the CALI09 virus with the H5 gene from
H5N1.[25] This construct, which we call “IMAI,”
could have been a potential public health hazard, but it was found
to be not transmissible.[25]We created
four atomic-resolution models of these constructs using
an integrative modeling approach. To build individual models of membrane-bound
NA and HA glycoproteins,[26] we used published
crystallographic models,[27−29] homology modeling,[30,31] and protein–protein docking.[32−35] To build models of entire influenza
viral surfaces, we used these single-glycoprotein models together
with new atomistic membrane-building tools[36] and cryoelectron tomography (cryoET) of a whole influenza virion[37] (Figure , Methods). The resulting atomically detailed, three-dimensional
models of the viral coat contain HA, NA, and M2 ion channels positioned
with realistic density and spatial patterning. To test the sensitivity
of our method to the relative orientation and proximity of the glycoproteins
on the viral surface, we also created two in silico strains with randomly
placed (“homogenized”) distributions of NA and HA on
the virion. Each virion contains 236 HA trimers (708 monomers) and
30 NA tetramers (120 monomers). The viruses are ∼120 nm in
diameter, though the whole structure is slightly aspherical, per the
cryoET image used for modeling. Each virion model contains ∼14.5
million atoms, including hydrogen atoms.
Figure 2
Virion reconstruction.
(A) The asymmetrical virus membrane represented
as points (from the tomographic data). (B) The proteins within the
tomographic map are represented as vectors. (C) HAs are positioned
according to their vectors. (D) NAs are similarly positioned. (E)
The lipid envelope is constructed through surface meshing via tessellation.
(F) The M2 ion channels are inserted randomly into the membrane construct.
(G) All-atom lipid molecules are added to the individual triangles
that represent the membrane. (H) The full influenza viral surface.
Virion reconstruction.
(A) The asymmetrical virus membrane represented
as points (from the tomographic data). (B) The proteins within the
tomographic map are represented as vectors. (C) HAs are positioned
according to their vectors. (D) NAs are similarly positioned. (E)
The lipid envelope is constructed through surface meshing via tessellation.
(F) The M2 ion channels are inserted randomly into the membrane construct.
(G) All-atom lipid molecules are added to the individual triangles
that represent the membrane. (H) The full influenza viral surface.We then performed atomic-scale
BD simulations[38] on each of the structural
models to study how the HA, NA
active, and NA secondary sites compete for SIA association (Figure ). The framework
presented here significantly extends a previous computational assay
developed by Sung et al.,[12] which focused
on single glycoproteins in isolation. Our new models include the entire
viral coat, allowing us to shift away from a single-protein paradigm
in favor of a more accurate and complex whole-virus understanding.
The BD simulations of our viral-surface constructs allow us to more
rigorously study encounter-complex formation and relative rates of
SIA association to the different viral sites. It has not been previously
possible to quantifiably explore these important determinants of viral
replication and transmissibility.
Figure 3
An atomic-resolution virtual model of
the viral envelope derived
from integrative modeling. (A) The whole viral envelope is shown,
with HAs in blue, NAs in white, and membrane in pink. Glowing green
points on glycoproteins indicate SIA binding sites. SIA is shown diffusing
from the b-surface (starting coordinate) toward the virus or escaping
to the q-surface. If SIA escapes, the simulation ends. (B) Close-up
of HA (blue) and NA (white), in more detail. SIA binding sites are
shown with glowing red and blue points. (C) The encounter-complex
definitions of SIA (shown in sticks, illustrated with polar hydrogens)
bound to the NA-secondary (blue shading on white surface) and NA-active
(red shading on white surface) sites. Atom–atom interactions
between the glycoprotein and SIA are visible. (D) The encounter-complex
definition of SIA bound to the HA receptor binding domain (blue).
An atomic-resolution virtual model of
the viral envelope derived
from integrative modeling. (A) The whole viral envelope is shown,
with HAs in blue, NAs in white, and membrane in pink. Glowing green
points on glycoproteins indicate SIA binding sites. SIA is shown diffusing
from the b-surface (starting coordinate) toward the virus or escaping
to the q-surface. If SIA escapes, the simulation ends. (B) Close-up
of HA (blue) and NA (white), in more detail. SIA binding sites are
shown with glowing red and blue points. (C) The encounter-complex
definitions of SIA (shown in sticks, illustrated with polar hydrogens)
bound to the NA-secondary (blue shading on white surface) and NA-active
(red shading on white surface) sites. Atom–atom interactions
between the glycoprotein and SIA are visible. (D) The encounter-complex
definition of SIA bound to the HA receptor binding domain (blue).
Results and Discussion
BD simulations
allow us to assess the probability of an “encounter
complex” forming between two species at a fixed distance from
a binding site (i.e., using the “reaction” criteria).
We defined separate reaction criteria for the three SIA-binding sites.
Three sets of atomic pairs were defined for each site. The reaction
criteria for each given site were satisfied (i.e., successful complex
formation was “counted”) when the distance between any
three of the associated atomic pairs fell within 7.5 Å (Table S1). We selected a distance of 7.5 Å,
because that cutoff reproduced the experimentally observed kon for SIA binding to the NA active site[39] (Supporting Information). No experimental kon information is
available for the binding of SIA to HA, so we used the same distance
criteria (7.5 Å) and selected residue pairs by examining a crystal
structure of SIA-bound HA. The second-order association rate constant
of SIA to a predefined binding site allows us to quantify binding
rate constants and enables us to assess how molecular recognition
events differ between the HA and NASIA binding sites among the four
strains. Noting that binding and activity in these systems has been
experimentally correlated,[40] we here use
the rate of encounter-complex formation, kon, as an initial proxy for activity.Though there are many more
HA binding sites on the influenza surface
available for SIA binding, the HA kon is
significantly slower than the NA kon in
all four constructs. The avian H5 HA had the lowest kon values. Avidity, or the binding of complex sialoglycans
to multiple HA sites simultaneously, may compensate for the slow rate
of association to HA, though this scenario is not tested here. Nevertheless,
our finding that the SIA association rates are faster to both the
NA sites compared to the HA sites supports published reports that
NA also contributes to host-cell receptor binding, either as a result
of specific NA mutations[23,41] or binding to the NA
secondary site.[4,16,42]In all four simulated constructs, the rates of SIA encounter-complex
formation were faster to both the NA active and secondary sites than
to the HA binding site (Figure ). The systems with the avian short-stalk NA (VIET04 and BLUMEN)
had NA-active-site kon values that were
slightly slower than the systems with long-stem NA (CALI09 and IMAI),
though the differences were within an order of magnitude. At the same
time, the two NA-short-stalk systems had NA-secondary-site kon values that were 2 orders of magnitude faster
than the kon to the NA active site. These
results suggest that the NA secondary site may compensate for the
slower rate to the NA active site in the short-stalk NA. The secondary
site may act as a general basin of attraction by electrostatically
steering SIA to the area of the NA active site, and the increased
binding propensity of SIA to an area adjacent to the NA active site
area may ultimately promote cleavage.
Figure 4
Association rates (log-scale) of SIA to
the NA active site, NA
secondary site, and HA.
Association rates (log-scale) of SIA to
the NA active site, NA
secondary site, and HA.The VIET04, BLUMEN, and IMAI constructs have low or absent
transmissibility,
while CALI09, the strain that caused the last pandemic, has high transmissibility.[21,24,25,29] Xu et al. found that NA and HA activity must be functionally matched
for the emergence of a pandemic (highly transmissible) strain.[8] In the present study, the most transmissible
influenza strain, CALI09, has the most similar (i.e., most balanced)
association rates to all three SIA sites, with minor differences of
1500 μM–1·s–1 and within
1 order of magnitude (Figure Table S2). In contrast, our studies
suggest that the VIET04 and IMAI HA kon values are comparatively low and that the VIET04 and BLUMAN NA-secondary-site kon values are comparatively high. In these three
nontransmissible strains, the kon values
among the three SIA-binding sites span 2 to 4 orders of magnitude
(Figure ). These results
indicate a large discrepancy in the rates of encounter-complex formation
among the three sites. We hypothesize that this imbalance between
the SIA binding rates among the three sites may be linked to the low
transmissibility of these strains.Imai et al. found that HPAI
H5 had to undergo additional mutations
(mainly in the HA receptor-binding domain, or RBD) before transmission
in respiratory droplets occurred. Our predicted kon values may reveal the molecular mechanism responsible
for this requirement. We hypothesize that the observed HA mutations
increase SIA binding to the HA RBD or otherwise compensate for the
kinetic imbalance among the three SIA sites present in this strain
(e.g., through additional mutations in NA). Our work supports the
conclusion of Imai et al. and suggests a molecular-level understanding
of how virus mutation and/or reassortment may be linked to transmissibility.
In cases where there are substantial differences in the encounter-complex
formation rates among the NA active, NA secondary, and HA sites, additional
mutations must be acquired to bring the binding sites back into a
balanced state.Our computational assay also allows us to explore
open questions
about the observed structural organization of NA glycoproteins on
the virion surface.[43−49] Clustering (or clumping) of NAs on the surface was observed using
anti-NA antibodies first by Compans et al.[50] and again later by Murti and Webster,[51] who reported finding between one and three NA patches per virion.
Harris et al.,[37] using the same cryoET
data that we use here to construct the experimentally patterned atomic
constructs, found both single NA spikes surrounded by HA and multiple
patches of NAs. This finding has more recently been confirmed by Chandla
et al.,[52] also via cryoET. Calder et al.[53] and Wasilewski et al.,[54] also using cryoET, reported that the ribonucleoprotein particles
(RNP) formed a tapered assembly at one end of the interior of the
virus, with the NA glycoproteins occurring in clusters at the end
of the virion opposite the RNP attachment. They suggested that NA
clusters play a role in viral release from the infected cell by destroying
the HA binding to receptors on the cell surface.Originally,
we hypothesized that the rate of sialic acid binding
to the available NA sites may be faster to NAs in patches. We tested
this hypothesis by creating new strains in silico with randomly placed
(“homogenized”) distributions of NA and HA and rerunning
binding simulations. Our results (Table S4) indicate that the homogenization causes no significant effect on
the binding rates between the virus and free floating sialic acid.
Further, an analysis of the binding patterns of SIA to the glycoproteins
on the experimentally patterned virion (Figure S2) indicates that SIA binds to many different NAs and that
once SIA comes within a certain proximity of the NA glycoprotein,
it is driven toward binding in a funnel-like fashion (Figures S2 and S3). An analysis of SIA binding
with the electrostatics turned off (i.e., electrostatics set to zero
in our BD simulations) indicates that the binding of SIA to NA is
primarily driven by electrostatics, whereas binding to HA is not (Table S2). Altogether, our results indicate that
clustering of NAs does not increase the rate of sialic acid binding
to the NA active or secondary sites. Instead, our work effectively
substantiates the hypothesis originally put forth by Calder et al.[53] that NA patches likely evolved to enable budding
of the virus from the host-cell membrane.A height analysis
of the virion clearly indicates that the regions
of NA clusters in the H5N1 (short stalk) viral strain exhibit a depressed
area on the viral surface, whereas in the H1N1 (normal NA stalk length),
the height differences are significantly less exaggerated (Figure S4). Relatedly, curvature analyses of
the whole-virion models similarly indicate that NA clusters in the
H5N1 short-stalk strain reduce the curvature of the viral particle
in those regions, relative to the H1N1 strain (Figures and S5). Our
SIA-binding results indicate that the local surface depressions caused
by clusters of NA stalk-deletion glycoproteins do not alter the rates
of SIA binding to the available sites, yet it seems plausible that
deformations of the virion structure due to NA clusters may affect
how well the virus binds larger and more structurally complex host
cells. Similar to a flattened soccer ball, areas of reduced curvature
in the locations of the NA clusters would provide a larger contact
area for viral particles to bind to host cells. It seems possible,
then, that viral strains with NA stalk deletions may have a slight
advantage during the initial binding event to host cells (a scenario
not tested here).
Figure 5
(A, B) H1N1 and H5N1, respectively, plotted by surface
protein
type and location. Hemagglutinins (circles) and neuraminidases (triangles)
are colored by their absolute curvature. Curvature is calculated from
fitting a sphere to the surrounding neighborhood of proteins within
a 440 Å radius. (C, D) Distributions of curvature values associated
with hemagglutinins (top, royal blue) and neuraminidases (bottom,
sky blue). H5N1 shows a more significant shift to lower curvatures
(flatter surfaces) for neuraminidase compared to the total protein
shell than those of H1N1.
(A, B) H1N1 and H5N1, respectively, plotted by surface
protein
type and location. Hemagglutinins (circles) and neuraminidases (triangles)
are colored by their absolute curvature. Curvature is calculated from
fitting a sphere to the surrounding neighborhood of proteins within
a 440 Å radius. (C, D) Distributions of curvature values associated
with hemagglutinins (top, royal blue) and neuraminidases (bottom,
sky blue). H5N1 shows a more significant shift to lower curvatures
(flatter surfaces) for neuraminidase compared to the total protein
shell than those of H1N1.
Conclusions
Overall our work suggests a novel pharmaceutical
strategy: disrupting
the balance between HA and NA activity by targeting the NA secondary
site with small-molecule ligands. While inhibitors of NA cleavage
(e.g., oseltamivir) will continue to be important, therapeutics targeting
the NA secondary site may provide an alternate, potentially less strain-specific
approach for combating new influenza variants. Molecular dynamics
simulations of the same N1 studied here, combined with computational
solvent mapping, have already found regions near the NA secondary
site that may be druggable,[26] and Lai et
al. have used STD NMR to confirm that SIA analogues bind the NA secondary
site.[16] Targeting the secondary site therefore
presents a promising unexplored route for disrupting the HA/NA functional
balance.More generally, our work demonstrates the utility of
coupling integrative
structural models, developed across multiple scales of resolution,
to BD simulations. Through the close integration of experiment and
simulation, computational assays provide insight into the molecular
mechanisms underlying the transmissibility of emerging strains. Our
computational assays also enable the systematic interrogation of the
structural biology of the influenza virus and the effects that glycoprotein
reorganization may have on substrate binding. As our experimental
design included both an SIA ligand as well as multiple viral-surface
binding sites, it enables a comprehensive and detailed understanding
of the competition among multiple sites on the same virion. In this
way, large-scale structure-based modeling and physics-based simulation
can serve as a powerful tool for investigating the functional balance
between HA and NA.
Authors: Rui Xu; Xueyong Zhu; Ryan McBride; Corwin M Nycholat; Wenli Yu; James C Paulson; Ian A Wilson Journal: J Virol Date: 2012-06-20 Impact factor: 5.103
Authors: Jean-Michel Garcia; Jimmy C C Lai; Thomas Haselhorst; Ka Tim Choy; Hui-Ling Yen; Joseph S M Peiris; Mark von Itzstein; John M Nicholls Journal: Influenza Other Respir Viruses Date: 2013-09-30 Impact factor: 4.380
Authors: Martina Maritan; Ludovic Autin; Jonathan Karr; Markus W Covert; Arthur J Olson; David S Goodsell Journal: J Mol Biol Date: 2021-11-10 Impact factor: 5.469
Authors: Peter Eugene Jones; Carolina Pérez-Segura; Alexander J Bryer; Juan R Perilla; Jodi A Hadden-Perilla Journal: Curr Opin Virol Date: 2021-08-28 Impact factor: 7.121
Authors: Christian Seitz; Lorenzo Casalino; Robert Konecny; Gary Huber; Rommie E Amaro; J Andrew McCammon Journal: Biophys J Date: 2020-10-31 Impact factor: 4.033