Nicole Balasco1, Josephine Alba2, Marco D'Abramo2, Luigi Vitagliano1. 1. Institute of Biostructures and Bioimaging, CNR, Via Mezzocannone 16, 80134 Naples, Italy. 2. Department of Chemistry, University of Rome Sapienza, P.le A.Moro 5, 00185 Rome, Italy.
Abstract
Human hemoglobin (HbA) is one of the prototypal systems used to investigate structure-function relationships in proteins. Indeed, HbA has been used to develop the basic concepts of protein allostery, although the atomic-level mechanism underlying the HbA functionality is still highly debated. This is due to the fact that most of the three-dimensional structural information collected over the decades refers to the endpoints of HbA functional transition with little data available for the intermediate states. Here, we report molecular dynamics (MD) simulations by focusing on the relevance of the intermediate states of the protein functional transition unraveled by the crystallographic studies carried out on vertebrate Hbs. Fully atomistic simulations of the HbA T-state indicate that the protein undergoes a spontaneous transition toward the R-state. The inspection of the trajectory structures indicates that the protein significantly populates the intermediate HL-(C) state previously unraveled by crystallography. In the structural transition, it also assumes the intermediate states crystallographically detected in Antarctic fish Hbs. This finding suggests that HbA and Antarctic fish Hbs, in addition to the endpoints of the transitions, also share a similar deoxygenation pathway despite a distace of hundreds of millions of years in the evolution scale. Finally, using the essential dynamic sampling methodology, we gained some insights into the reverse R to T transition that is not spontaneously observed in classic MD simulations.
Human hemoglobin (HbA) is one of the prototypal systems used to investigate structure-function relationships in proteins. Indeed, HbA has been used to develop the basic concepts of protein allostery, although the atomic-level mechanism underlying the HbA functionality is still highly debated. This is due to the fact that most of the three-dimensional structural information collected over the decades refers to the endpoints of HbA functional transition with little data available for the intermediate states. Here, we report molecular dynamics (MD) simulations by focusing on the relevance of the intermediate states of the protein functional transition unraveled by the crystallographic studies carried out on vertebrate Hbs. Fully atomistic simulations of the HbA T-state indicate that the protein undergoes a spontaneous transition toward the R-state. The inspection of the trajectory structures indicates that the protein significantly populates the intermediate HL-(C) state previously unraveled by crystallography. In the structural transition, it also assumes the intermediate states crystallographically detected in Antarctic fish Hbs. This finding suggests that HbA and Antarctic fish Hbs, in addition to the endpoints of the transitions, also share a similar deoxygenation pathway despite a distace of hundreds of millions of years in the evolution scale. Finally, using the essential dynamic sampling methodology, we gained some insights into the reverse R to T transition that is not spontaneously observed in classic MD simulations.
Human hemoglobin (HbA)
deserves a special position in structural
biology since it has been the model used to develop the fundaments
of protein crystallography.[1−5] Moreover, myoglobin has been the first protein whose structure has
been determined at an atomic level.[6] Even
more significantly, HbA has been, and it still is, the prototypal
system used to investigate structure–function relationships
in proteins. Indeed, the structural characterization of the different
functional states of HbA has provided fundamental insights for developing
the basic concepts of protein allostery, although the atomic-level
mechanism underlying the HbA functionality is still highly debated.[1,5,7−10] Due to the seminal work of Perutz,
the structural features of the endpoints of Hb structural transition
have been elucidated for more than half a century.[11] These pioneering studies have unveiled that the ligand-bound
forms are associated with a rather flexible structure denoted as the
R (relaxed) state. On the other hand, the unliganded HbA structure
is characterized by a rather rigid, tense, T-state.Since then,
hundreds of HbA structures have been reported in the
Protein Data Bank (PDB).[7,12] These studies have
significantly improved the structural sampling of the HbA bound states
by showing that the liganded HbA may manifest, in addition to the
R-state, in a variety of other relaxed states (R2, R3, RR2) that fall
outside the T–R pathway (Table S1).[7] These findings have initiated an intense
debate on the real extension of the Hb functional transition that
may go well beyond the T–R pathway and include these other
relaxed states.[7,13] On the other hand, the atomic-level
characterizations of intermediate R–T states have proven much
more difficult as HbA structures exhibiting intermediate R–T
features at the quaternary structure level have been rarely described.
The first example has been reported by Schumacher et al.,[14] who generated intermediate R–T states
through the crosslinking of the β chains of the HbA tetramer.
Interesting information on HbA function–structure relationships
were also provided by crystallographic structures showing remarkable
tertiary structure variations, although confined either to the R-
or the T-state.[7] More recently, a unique
crystal form of HbA characterized by the presence at the crystalline
state of three different quaternary structures has been reported.[15,16] One of these states, denoted as HL-(C), presents intriguing intermediate
features, although it is much closer to the R-state rather than to
the T-state (Table S1). In this scenario,
although insightful information on the R–T transition has been
obtained using other experimental and computational techniques,[7,16] the lack of detailed experimental information on intermediate T–R
states is one of the most important factors that has so far prevented
a full understanding of the HbA functional transition.The situation
is significantly different for the tetrameric Hbs
isolated from Antarctic fish (AntHbs) that share with HbA several
functional/structural features despite the fact that AntHbs operates
in organisms living in rather extreme conditions.[17−25] The intrinsic flexibility of AntHbs when studied at temperatures
significantly higher than the physiological ones has allowed the visualization
of states that exhibit unusual structural properties and/or peculiar
oxidation states. Specifically, the structures of Hbs extracted from Trematomus newnesi and Trematomus
bernacchii belonging to the Nototheniidae family and
from the sub-Antarctic fish Eleginops maclovinus have shown a variety of oxidation (hemichrome, aquomet, pentacoordinated
oxidized, etc.) often associated with noncanonical tertiary and quaternary
organizations.[23,26−29] Altogether, these structures
have provided some interesting insights into the possible structural
features of the intermediate states (see for example ref (30)). Interestingly, for the T. newnesi Hb (HbTn), three distinct intermediate
structures (tetramer A, TnA; tetramer B, TnB; and tetramer H, TnH),[26,30] in addition to the canonical T and R states, have been reported
(Figure and Table S1). For HbTn, the overall rotation of
one αβ dimer when the other is superimposed is about 11°
(15° in the case of HbA). Therefore, taking into account the
presence of the three intermediate states, the overall pathway may
be dissected into four subtransitions that are separated by approximately
3° rotation of one αβ dimer with respect to the other
(Figure ). In this
scenario, to gain further insights into this long-standing issue,
we here performed extensive fully atomistic molecular dynamics (MD)
simulations on HbA by also checking the relevance of the intermediate
species identified for AntHbs to the human counterpart.
Figure 1
Stepwise R–T
transition of HbTn as highlighted by crystallographic
investigations carried out on the protein. Superimposition of the
α1β1 dimer of different HbTn structural states: deoxygenated
T-state (TnT, black, PDB ID: 3NFE), TnA (green, PDB ID: 5LFG), TnB (magenta, PDB ID: 5LFG), TnH (cyan, PDB
ID: 3D1K), and
the canonical R-state (TnR, red, PDB ID: 1T1N). Magnifications of the helices A (residue
3–18) and G (residues 99–117) of the β2 subunit
are shown to highlight the transition from the T- to the R-state.
Stepwise R–T
transition of HbTn as highlighted by crystallographic
investigations carried out on the protein. Superimposition of the
α1β1 dimer of different HbTn structural states: deoxygenated
T-state (TnT, black, PDB ID: 3NFE), TnA (green, PDB ID: 5LFG), TnB (magenta, PDB ID: 5LFG), TnH (cyan, PDB
ID: 3D1K), and
the canonical R-state (TnR, red, PDB ID: 1T1N). Magnifications of the helices A (residue
3–18) and G (residues 99–117) of the β2 subunit
are shown to highlight the transition from the T- to the R-state.
Results
MD Simulations of the HbA
T-State: A Global Analysis of the
Trajectory Structures
The T–R transition was investigated
by MD simulations carried out using the fully unligated T-state of
HbA as the starting model (T0 simulation). Taking into account the
indications that emerged in previous analyses,[8,31] the
terminal His (His146) of the β chains was kept uncharged to
favor the transition toward the R-state. Indeed, the electrostatic
interaction between the side chain of the terminal His146β and
Asp94β is important for the stabilization of the T-state.[11] As shown in Figure , a clear transition is observed within the
first 100 ns. Indeed, the analysis of the root-mean-square deviation
(RMSD) values of the MD structures against either the R- or the T-state
indicates that they suddenly become closer to the R-state while diverging
from the T-state (Figure a). To gain further insights into the evolution of HbA throughout
the simulation, we computed the RMSD values of the trajectory structures
with respect to additional, well-characterized, structural states
of the protein. Indeed, the RMSD values calculated against the HL-(C)
state[14] follow the trend observed for those
computed against the R-state (Figure a). It is worth mentioning, however, that in the initial
stage of the trajectory, the RMSD values obtained versus the HL-(C)
state are significantly lower than those obtained versus the R-state,
as expected for this R-like HbA intermediate state. A closer inspection
of Figure a also indicates
that the RMSD values computed against the HL-(C) state present a local
minimum at ∼80 ns, when the trajectory structures have almost
completed their evolution toward the R-state. Once again, these observations
indicate that the HL-(C) state is a real global intermediate state
somehow close, however, to the R-state. Then, we evaluated the similarity/dissimilarity
of the trajectory structures with respect to the HbA states that are
off from the T–R pathway (i.e., R2, RR2, and R3 states). As
shown in Figure b,
the starting RMSD values calculated against these structures are larger
than those detected against the R-state. This observation is not surprising
considering the position of R2, RR2, and R3 in the T–R pathway.
Moreover, since these states are closer to the R-state rather than
the T-state, these RMSD values also decrease upon the transition observed
in the MD simulation. Notably, after the transition, the RMSD values
versus the RR2 state are similar to those displayed versus the R-state.
This observation indicates that at the end of the MD transition, the
conformational ensemble of the trajectory structures generated by
the simulation, although R-like, presents some features of the RR2
state. Similar transitions have been observed in three other independent
MD simulation runs (T0b, T0c, and T0d) conducted on the T-state of
HbA (Figures S1–S3).
Figure 2
(a) RMSD values (computed
on the Cα atoms) of
the T0 trajectory structures versus the starting T model (black, PDB
ID: 2DN2), the
R-state (red, PDB ID: 2DN1), and the intermediate HL-(C) state (violet, PDB ID: 4N7P). (b) RMSD values
computed against the off-pathway structures: R2 (blue, PDB ID: 1BBB), RR2 (yellow, PDB
ID: 1MKO), and
R3 (orange, PDB ID: 4NI0). (c) RMSD values computed against the intermediate states identified
for HbTn: TnA (green, PDB ID: 5LFG), TnB (magenta, PDB ID: 5LFG), and TnH (cyan,
PDB ID: 3D1K) (C). (d) RMSD values computed against the T-state (black) and the
R-state (red) of HbA and against TnH (cyan) in the time interval of
60–100 ns. RMSD values refer to the productive run without
the equilibration steps producing the initial drift.
(a) RMSD values (computed
on the Cα atoms) of
the T0 trajectory structures versus the starting T model (black, PDB
ID: 2DN2), the
R-state (red, PDB ID: 2DN1), and the intermediate HL-(C) state (violet, PDB ID: 4N7P). (b) RMSD values
computed against the off-pathway structures: R2 (blue, PDB ID: 1BBB), RR2 (yellow, PDB
ID: 1MKO), and
R3 (orange, PDB ID: 4NI0). (c) RMSD values computed against the intermediate states identified
for HbTn: TnA (green, PDB ID: 5LFG), TnB (magenta, PDB ID: 5LFG), and TnH (cyan,
PDB ID: 3D1K) (C). (d) RMSD values computed against the T-state (black) and the
R-state (red) of HbA and against TnH (cyan) in the time interval of
60–100 ns. RMSD values refer to the productive run without
the equilibration steps producing the initial drift.We also compared the T0 trajectory structures with the intermediate
states detected in the crystallographic characterizations of T. newnesi Hb. Despite the differences in the sequences
between HbA and HbTn, similar trends of the RMSD values are observed
(Figure c). The evolution
of the RMSD values computed against these intermediates shows remarkable
variations in the correspondence of the T to R transition of HbA that
occurs at ∼ 75 ns. The starting values of the RMSD observed
for the tetramers A, B, and H of HbTn are intermediate between those
observed for the T- and the R-state of HbA. For the tetramers A and
B, which display some similarity with the T-state, the RMSD values
increase upon the transition in the last part of the trajectory where
MD structures assume R-like states. A deep inspection of Figure c also indicates
that the trajectory structures closest to A and B tetramers are located
in the initial stage of the simulations. The observation suggests
that these tetramers fall in the conformational basin accessible to
the T-state.For the tetramer H, which is rather distinct from
both the T and
the R-state, we observe that trajectory structures present the lowest
RMSD values in the time interval that corresponds to the T–R
transition (Figure d).
MD Simulations of the HbA T-State: Essential Dynamics
To gain further insights into the T–R transition detected
in the T0 simulation, the trajectory was also analyzed by means of
the essential dynamics (ED) method, in which the principal motion
directions of large systems are represented by a set of eigenvectors
(see Materials and Methods for further details).
The eigenvectors were calculated from the entire ensemble of the
trajectory frames obtained from the T0 simulation, and ranked according
to their eigenvalues (Figure S4). The structures
derived from this simulation and some representative crystallographic
models have been projected along with the first principal component
(eigenvector) that accounts for ∼60% of the whole protein fluctuation
(Figure ). In line
with the expectations, the R2, RR2, and R3 crystallographic structures
of HbA are distributed outside of the T–R pathway. In line
with the hypothesis that the HbA functional transition may also include
these states, structures that emerged from the T0 MD simulation frequently
go beyond the R-state.
Figure 3
Projection on the first eigenvector of the T0 trajectory
structures.
The vertical solid lines correspond to the projections of the crystallographic
structures of human HbA states: T (black, PDB ID: 2DN2), R (red, PDB ID: 2DN1), intermediate HL-(C)
(violet, PDB ID: 4N7P), R2 (blue, PDB ID: 1BBB), RR2 (yellow, PDB ID: 1MKO), and R3 (orange, PDB ID: 4NI0) and of the HbTn
intermediates: TnA (green, PDB ID: 5LFG), TnB (magenta, PDB ID: 5LFG), and TnH (cyan,
PDB ID: 3D1K). Dashed lines correspond to the projections of crystallographic
structures showing remarkable tertiary structure variations as oxy
HbA T-state (black, PDB ID: 1GZX) and high-salt carbonmonoxy HbA (red, PDB ID: 1LJW) and of the crystallographic
structures of cross-linked carbonmonoxy HbAs (violet, PDB IDs: 1SDK and 1SDL).
Projection on the first eigenvector of the T0 trajectory
structures.
The vertical solid lines correspond to the projections of the crystallographic
structures of human HbA states: T (black, PDB ID: 2DN2), R (red, PDB ID: 2DN1), intermediate HL-(C)
(violet, PDB ID: 4N7P), R2 (blue, PDB ID: 1BBB), RR2 (yellow, PDB ID: 1MKO), and R3 (orange, PDB ID: 4NI0) and of the HbTn
intermediates: TnA (green, PDB ID: 5LFG), TnB (magenta, PDB ID: 5LFG), and TnH (cyan,
PDB ID: 3D1K). Dashed lines correspond to the projections of crystallographic
structures showing remarkable tertiary structure variations as oxy
HbA T-state (black, PDB ID: 1GZX) and high-salt carbonmonoxy HbA (red, PDB ID: 1LJW) and of the crystallographic
structures of cross-linked carbonmonoxy HbAs (violet, PDB IDs: 1SDK and 1SDL).In this representation, some of the Hb structures showing
remarkable
tertiary structure variations are close to either the R- or the T-state.
On the other hand, the HbA intermediate HL-(C) is located along the
pathway, although closer to the R- than to the T-state. In this projection,
the cross-linked intermediates identified by Schumacher et al.[14] lie very close to the HL-(C) state. Interestingly,
all of the intermediate states identified in the crystallographic
structures of HbTn also fall in the T–R pathway. These observations
clearly suggest that the principal motion direction, represented by
the first eigenvector, well reproduces the T–R functional transitions.
A closer inspection of the distribution also demonstrates that the
T0 trajectory structures well populate the states corresponding to
the intermediates HL-(C), TnA, and TnB that are not very far from
the canonical T or R states. On the other hand, the state TnH, which
lies in the middle of the transition, is very poorly populated. The
TnH state corresponds to the maximum of the free energy along the
trajectory as highlighted by the free energy difference between the
sampled states along with the T–R transition that was calculated
as the ratio between the H state population and a reference (most-populated)
state, ΔG = −RT ln (pTnH/pref) (Figure S6). This observation is not surprising
as HbA undergoes a very sudden T–R transition and, also, might
explain why intermediate states for HbA have been so elusive despite
the large amount of structural data collected for this protein.Similar results, in terms of the population of the states lying
within the transition, of the structural versatility of the R-state
and of the prevalent role played by the first eigenvector have been
obtained in the independent T0b, T0c, and T0d MD simulation runs (Figures S1–S6).
Evolution of the Structural
Probes that Characterize the Endpoints
of the T–R Transition in the MD Simulation
The analysis
of the trajectory structures performed at the global level in the
previous section, i.e., by principal component analysis, was extended
considering specific structural probes that undergo large variations
upon the transition. To this end, suitable structural probes have
been initially identified. Then, in the subspace defined by these
probes, we located representative crystallographic structures of both
HbA and HbTn and then the T0 trajectory structures.Following
the vast HbA literature data and the indications that emerged from
a comparative analysis of the T- and the R-state of the protein, we
identified local descriptors that assume specific distinct values
for each structural state of HbA (Tables S2 and S3). These include (a) the distance between groups/atoms whose
interactions specifically stabilize the T-state (K40α1 side
chain–H146β2 COOH terminal group and Y42α1–D99β2
side chains), (b) the distance between the residues located in the
switch region α1CD−β2FG at the interface between
the dimers α1β1 and α2β2 (Cα Pro44α1–Cα His97β2),[32] (c) the distance between the Cα atoms of the terminal His residues (His146) of the two β chains
(Cα His146β1–Cα His146β2),
and (d) some structural features of the heme pocket (Cα–Cα distance between the proximal and distal
His residues and the rotameric state of the distal His side chain).As shown in Table S2 and Figure , most of these structural
parameters, which present striking differences between the T- and
the R-state, well discriminate these two conformational states. Moreover,
the analysis of the values adopted by these parameters in the R2,
RR2, and R3 states clearly indicates that they fall outside the T–R
pathway. Notably, the inspection of the values of these probes in
the HbA and HbTn intermediate states provides interesting insights
into their location along the T–R pathway. The half-liganded
state HL-(C) of HbA lies in the T–R pathway, although it is
located in the proximity of the R-state (Figure ). This observation suggests that this structure
is representative of the early structural events that characterize
the transition from the R- to the T-state. Consequently, the atoms
involved in the interactions that strongly stabilize the T-state (Oη Tyr42α1–Oδ Asp99β2
and Nξ Lys40α1–OT His146β2) are
far apart in this R-like state. Interestingly, the extension of this
analysis to the putative intermediate states detected in the crystallographic
studies on HbTn suggests that TnA, TnB, and TnH are structurally located
in the functional T–R pathway of HbA also when these local
probes are analyzed (Figure ). Moreover, Figure also indicates that TnA, TnB, and TnH are located in different
phases of this transition. Indeed, TnA is essentially a T-like state
as the values adopted by these structural parameters do not significantly
differ from those exhibited by the T form. For TnB and TnH, these
probes display values that are generally rather distinct from those
exhibited by both the R and the T-state. TnB assumes either T-like
or R-like values for the distances Oη Tyr42α1–Oδ Asp99β2 (Figure b) or Cα Pro44α1–Cα His97β2 (Figure c), respectively. Notably, TnH appears to be essentially
equidistant from the endpoints of the transition as it presents, for
most of the parameters (the distances Nζ Lys40α1–OT
His146β2 and Oη Tyr42α1–Oδ Asp99β2) values that are in-between when compared
to those exhibited by the T- and the R-state (Figure a,b). On the other hand, the Cα–Cα distances between Pro44α1–His97β2
and His146β1-His146β2 of TnH present T-like and R-like
values, respectively (Figure c,d).
Figure 4
Time evolution in the T0 simulation of the structural
probes that
are characteristic of the different HbA states. Specifically, the
distances (a) Nζ Lys40α1–OT His146β2,
(b) Oη Tyr42α1–Oδ Asp99β2,
(c) Cα Pro44α1–Cα His97β2,
and (d) Cα His146β1–Cα His146β2 are monitored.
Time evolution in the T0 simulation of the structural
probes that
are characteristic of the different HbA states. Specifically, the
distances (a) Nζ Lys40α1–OT His146β2,
(b) Oη Tyr42α1–Oδ Asp99β2,
(c) Cα Pro44α1–Cα His97β2,
and (d) Cα His146β1–Cα His146β2 are monitored.The analysis of time evolution of these structural parameters in
the T0 MD simulation indicates that all of these probes suddenly change
when the overall structural T–R transition, detected by monitoring
the RMSD values (Figure a), occurs. Indeed, the H-bonding interactions Nζ Lys40α1–OT His146β2 and Oη Tyr42α1–Oδ Asp99β2 that strongly stabilize the T-state are
rapidly lost at ∼ 75 ns (Figure a,b). Concurrently, the distance between the residues
located in the switch region α1C−β2FG at the interface
between the dimers α1β1 and α2β2 (Cα Pro44α1–Cα His97β2) starts to
increase, adopting R-like values (Figure c). Accordingly, also the distance between
the Cα atoms of the terminal His146 residues of the
two β chains decreases, although the values adopted by the post-transition
trajectory frames are higher than those exhibited by the crystallographic
R-structure (Figure d).The characterization of the intermediate states of AntHb
has frequently
highlighted the concomitance of the quaternary structure modifications
with a significant deformation of the heme pocket.[26,29,30] In particular, these crystallographic analyses
have indicated a clear scissoring motion of the EF corner that allocates
the heme prosthetic group and a swing-out movement of the distal histidine
side chains that protrude from the heme pocket toward the solvent.
The scissoring motion was evaluated here by considering the distance
between the Cα atoms of the proximal His (located
in the helix F) and the distal His (located in the helix E). The evolution
of this Cα–Cα distance indicates
that no major deformation of the heme pocket occurs (Figure S7). Indeed, the values obtained are similar to those
observed in canonical Hbs.[26,29,30] The swing-out motion of the distal histidines was monitored by checking
the χ1 and χ2 dihedral angles of
these residues throughout the simulation. As shown in Figure S8, the distal histidines essentially
assume a single rotameric state that corresponds to their canonical
position within the heme pocket.
Essential Dynamics Sampling
The HbA structural transition
was also explored using a different approach based on the essential
dynamics sampling (EDS). Using this methodology, we were able to explore,
in addition to the T–R pathway, also the R–T transition
that did not emerge from the classical MD of the R-state both in our
(data not shown) and in literature simulations.[33]EDS was applied by preliminarily generating an ensemble
of T- and R-like structures necessary to extract the eigenvectors
used to apply the EDS procedure. The structures present in the first
ns of the T0 simulation, which present RMSDs against the crystallographic
T-state in the range of 0.5–1.0 Å, constituted the basin
of the T-like structures. A reliable basin of the R-state was generated
by performing a short (1 ns) MD simulation, using the same protocol
applied for the T0 simulation, starting from the fully ligated R-state
(R4) (PDB ID: 2DN1). Also, in this case, the models of the R-like basin present RMSDs
<1.0 Å when compared to the canonical crystallographic R-structure.
Using the two sets of MD structures, we computed the eigenvectors
that describe the overall protein transition. The movements along
the eigenvector with the highest eigenvalue accounted for 82.9% of
the protein total motion. We, then, projected in the space defined
by the first eigenvector the ensembles of the structures corresponding
to the two basins as well as the relevant structural states of HbA
and HbTn considered throughout this work (Figure ). As expected, the structures of the two
basins are very close to the canonical R- and T-states. In this representation,
the structures of the intermediates are located between the structures
of the R and T basins, whereas the R2, RR2, and R3 fall outside this
interval on the R side. This observation indicates that the first
eigenvector is able to provide a reliable description of only the
Hb functional transition. Upon eigenvector calculation, EDS was performed
by randomly selecting five structures of the T-like basin and five
structures of the R-like basin as starting points of the simulations
of the T–R and the R–T pathway, respectively. As shown
in Figure S9, both the T–R and the
R–T transitions occur within a limited number of steps. The
analysis of the plateau observed after the transition indicates that
the final structures present RMSD values of ∼ 2.5 Å compared
to the starting structures in line with the results obtained with
the classical MD (Figure ). The evolution of the structural probes indicates that the
interactions Nζ Lys40α1–OT His146β2
that stabilizes the T-state are rapidly lost in the simulation of
the T-state (Figure a). Almost simultaneously, the interaction between Oη Tyr42α1 and Oδ Asp99β2 also is essentially
lost (Figure b). The
Cα–Cα distances between Pro44α1–His97β2
and His146β1–His146β2 also assume R-like values
in the post-transition structures of the T-state simulation (Figure c,d). Almost inverse
trends are observed when these probes are monitored in the R–T
EDS trajectories. In these simulations, the atoms involved in the
Nζ Lys40α1–OT His146β2 and Oη Tyr42α1–Oδ Asp99β2
H-bond/electrostatic interactions come rather close, although the
H-bond is sporadically observed (Figure a,b). This is probably due to the fact that
the EDS is dictated by the overall motions of the protein along the
eigenvectors and not by the formation of local, specific interactions.
It is worth noting that in general, the EDS structures pass through
the crystallographic intermediate states and that the Oη Tyr42α1–Oδ Asp99β2 distance
observed in several post-transition structures of the R-state simulations
closely resembles that observed in the TnH state.
Figure 5
Projection on the first
eigenvector of the trajectory structures
extracted from the first 1 ns of the T0 (gray) and R4 (orange) simulations.
The vertical lines correspond to the projections of the crystallographic
structures of human HbA states—T (black, PDB ID: 2DN2), R (red, PDB ID: 2DN1), intermediate HL-(C)
(violet, PDB ID: 4N7P), R2 (blue, PDB ID: 1BBB), RR2 (yellow, PDB ID: 1MKO), and R3 (orange, PDB ID: 4NI0)—and that
of HbTn intermediates—TnA (green, PDB ID: 5LFG), TnB (magenta,
PDB ID: 5LFG), and TnH (cyan, PDB ID: 3D1K).
Figure 6
Evolution in the EDS
of the structural probes that are characteristic
of the different HbA states. Specifically, the distances (a) Nζ Lys40α1–OT His146β2, (b) Oη Tyr42α1–Oδ Asp99β2, (c) Cα Pro44α1–Cα His97β2,
and (d) Cα His146β1–Cα His146β2 are monitored in the T–R (green) and R–T
(orange) trajectory structures.
Projection on the first
eigenvector of the trajectory structures
extracted from the first 1 ns of the T0 (gray) and R4 (orange) simulations.
The vertical lines correspond to the projections of the crystallographic
structures of human HbA states—T (black, PDB ID: 2DN2), R (red, PDB ID: 2DN1), intermediate HL-(C)
(violet, PDB ID: 4N7P), R2 (blue, PDB ID: 1BBB), RR2 (yellow, PDB ID: 1MKO), and R3 (orange, PDB ID: 4NI0)—and that
of HbTn intermediates—TnA (green, PDB ID: 5LFG), TnB (magenta,
PDB ID: 5LFG), and TnH (cyan, PDB ID: 3D1K).Evolution in the EDS
of the structural probes that are characteristic
of the different HbA states. Specifically, the distances (a) Nζ Lys40α1–OT His146β2, (b) Oη Tyr42α1–Oδ Asp99β2, (c) Cα Pro44α1–Cα His97β2,
and (d) Cα His146β1–Cα His146β2 are monitored in the T–R (green) and R–T
(orange) trajectory structures.Finally, we monitored the coevolution of pairs of structural probes
to check whether they changed independently or in a simultaneous way.
From the analysis of Figure , it is evident that these parameters evolve in the T–R
transition in a similar way in both the classical MD and the EDS.
Moreover, the Nζ Lys40α1–OT His146β2
distance changes concomitantly with the distance Cα His146β1–Cα His146β2 (Figure a,b). On the other
hand, the variations of the distances between Oη Tyr42α1–Oδ Asp99β2 and Cα Pro44α1–Cα His97β2 are essentially independent. The EDS
procedure, which provides information on both R–T and T–R
transitions, clearly indicates that modifications at the α1CD−β2FG
interaction site (Cα Pro44α1–Cα His97β2 distance) precede the break/formation of the T-state
stabilizing the locking of Oη Tyr42α1–Oδ Asp99β2 interaction (Figure c,d). Fluctuations of the Cα Pro44α–Cα His97β2 distance are
allowed in both the locked and unlocked states. In this scenario,
the three intermediate states detected in the crystallographic analyses
of HbTn perfectly fit into the T to R pathway. As shown in Figure d, the EDS simulations
also indicate in the R–T transition the occurrence of a highly
populated state, specific to TnH, that has not been experimentally
detected yet.
Figure 7
Projections of the T0 trajectory structures (a, c) and
of the T–R
(green) and R–T (orange) trajectory structures generated by
the EDS analysis (b, d) in the space defined by some structural probes
that are characteristic of the various HbA states. The points corresponding
to the structures detected in the pretransition (<75 ns), transition
(75–83 ns), and post-transition (>83 ns) time interval of
the
T0 simulation are colored in gray, cyan, and red, respectively. The
representative crystallographic structures of HbA (black circles)
and HbTn (black triangles) are also reported. TnB is not reported
in panels (a) and (b) as the C-terminal residue of the β-chains
(His146) is missing.
Projections of the T0 trajectory structures (a, c) and
of the T–R
(green) and R–T (orange) trajectory structures generated by
the EDS analysis (b, d) in the space defined by some structural probes
that are characteristic of the various HbA states. The points corresponding
to the structures detected in the pretransition (<75 ns), transition
(75–83 ns), and post-transition (>83 ns) time interval of
the
T0 simulation are colored in gray, cyan, and red, respectively. The
representative crystallographic structures of HbA (black circles)
and HbTn (black triangles) are also reported. TnB is not reported
in panels (a) and (b) as the C-terminal residue of the β-chains
(His146) is missing.
Discussion
The
structural organization of tetrameric Hbs represents an admirable
example of how the juxtaposition of two paralog globin chains generates
an efficient system that is highly responsive to external stimuli
and to specific heterotrophic factors. Very recently, using the ancestral
reconstruction protein approach,[34] it has
been shown that modern Hb evolved from an ancient monomer that initially
evolved by forming a noncooperative homodimer with high oxygen affinity.[35] Intriguingly, in the ancestral progenitors of
tetrameric Hb, co-operativity was acquired by reducing the oxygen
affinity as a direct link between the oxygen binding site and the
oligomerization interface seems to be a primordial property of this
protein. Then, the appearance of the α2β2 tetramer provided
a considerable enhancement of functional capabilities[36] that made this protein crucial for the evolution of virtually
all vertebrate species. The relative easiness of purifying and characterizing
Hbs from different species has made these proteins prototypal systems
for unraveling structure–function relationships. However, the
significant complexity of these Hbs formed by two chains with different
oxygen affinities that also strongly depend on the tetramer binding
state and on external factors (pH and heterotrophic effectors) has
made functional studies particularly difficult and the related results
the subject of intense debates and controversies.[7,37−39] A similar scenario emerges from the inspection of
the structural side of the relationship. Indeed, despite the incredible
amount of data accumulated over the decades, the definition of the
structural bases of many issues related to either the general aspects
of Hbs functionality (co-operativity and modulator of effectors) or
the species-specific aspects (Bohr and Root effects) has not reached
a general consensus. The difficulties related to the elucidation of
the structural bases of Hb co-operativity are related to the paucity
of structural information available for the intermediate states of
the protein R–T functional transition. Indeed, crystallographic
analyses carried out on HbA have provided a clear picture only of
the endpoints of the transition with extremely limited information
about the intermediate states. One of the most striking differences
that emerges from a comparative analysis of the structural information
of Hbs isolated from different species is related to the detection
of crystallographic structures with intermediate R–T states.
Somehow surprising, the characterization of Antarctic and sub-Antarctic
fish Hbs, despite the close structural similarity of the R- and the
T-state shared with HbA, has frequently highlighted states characterized
by quaternary structures lying in the R–T pathway. Here, we
report MD simulations, which were performed starting from the T-state
of HbA, by focusing on the relevance of the intermediate states unraveled
by the crystallographic studies of the protein functional transition.In line with previous observations,[31,33] the simulation
of the HbA T-state indicates that the protein undergoes a spontaneous
transition toward the R-state. An essential dynamics analysis of the
trajectory indicates that the first eigenvalue accounts for a large
portion of the overall protein mobility. The projection of the canonical
T and R Hbs structures on this eigenvalue also indicates that it perfectly
represents the motions associated with the T–R transition.
These observations suggest that HbA possesses a limited repertoire
of global motions. This is in line with the observation that strong
perturbations of the heme pocket, such as oxidation and hemichrome
formation, in Antarctic fish Hbs lead to quaternary structure states
that fall in the T–R pathway.[26,28] Since hemichromes
are preunfolded states, it is likely that the early motions associated
with the unfolding process of tetrameric Hbs follow the ones associated
with the physiological functionality of the protein.The sudden
HbA T–R transition (Figure ) explains why the crystallographic detection
of HbA R–T intermediates has been so elusive despite the large
amount of structural data collected for this protein. Interestingly,
the projection of the HL-(C) state and of the trajectory structures
on the first eigenvector (Figure ) indicates that this state is significantly populated.
This observation indicates that although close to the R-state, it
represents a genuine intermediate along the trajectory. The projection
of the HbTn crystallographic intermediate states onto the same eigenvector
indicates that the A and B tetramers essentially fall in the conformational
ensemble that characterizes the T-state. Notably, the H state is almost
equidistant from the R- and the T-state of HbA. However, the low population
of this state in the MD suggests that it is a high-energy structure
lying in the functional pathway of human hemoglobin (Figure S5). This overall picture is corroborated by the analysis
of the evolution of the structural probes that distinguish the T-
and the R-state of HbA.The general structural similarity of
the HbA trajectory structures
lying in the R–T trajectory with those that emerged from crystallographic
studies on Antarctic fish Hbs indicates that in addition to the endpoints,
these Hbs also share a similar deoxygenation pathway despite a distance
of hundreds of millions of years in the evolution scale.The
HbA functional transition was also studied using the essential
dynamics sampling (EDS) approach with a twofold purpose. On one side,
the T–R transition, which could be seen in the classical MD,
was used to validate this methodology. On the other hand, the EDS
methodology was exploited to gain insights into the R–T transition
that is not spontaneous in classical MD simulations. The exploration
of the HbA T–R pathway by EDS highlighted its strengths and
limitations compared to classical MD simulations. Although the EDS
approach well reproduced the free MD trajectories, the analysis of
the EDS trajectories indicates that, these simulations being driven
by global motions as described by the Cα atoms, some
local probes were occasionally not precisely reproduced. Nevertheless,
by such an approach, it was feasible to monitor the reverse R–T
transition that is not spontaneous in free MD simulations.[33] The inspection of the R–T pathway indicates
that it is specific to the T–R one. Indeed, the modifications
at the CDα1–FGβ2 interaction site precede either
the break or the formation of the T-state stabilizing interactions
in the T–R and T–R transitions. On the other hand, the
variations of the distances between Oη Tyr42α1–Oδ Asp99β2 and Cα Pro44α1–Cα His97β2 are essentially independent. The EDS
analysis, which provides information on both R–T and T–R
transitions, clearly indicates that modifications at the α1CD−β2FG
interaction site (Cα Pro44α1–Cα His97β2) precede the break/formation of the T-state, stabilizing
the Oη Tyr42α1–Oδ Asp99β2
interaction (Figure c,d). In this scenario, the three intermediate states detected in
the crystallographic analyses of HbTn perfectly fit into the T to
R pathway.In conclusion, although the dynamic analysis of the
HbA functional
transition is a subject of long-standing investigations, the present
study provides a significant contribution to this debated field as
it provides a reliable three-dimensional description of the T–R
HbA transition at both a global and a local level. The analogies that
trajectory structures share with the intermediate states of Antarctic
fish Hbs indicate that vertebrate Hbs share a similar functional pathway
despite an evolutionary distance of hundreds of millions of years.
Moreover, the detection of a population of states that resembles the
HL-(C) structure recently reported for HbA corroborates the concept
recently described by Shibayama[16] that
the HbA transition, traditionally seen as a simple two-state switch,
actually occurs through population shifts among multiple quaternary
states. The successful description of the T–R transition (classical
MD and EDS approaches) and of the R–T transition (EDS approach)
reported here paves the way for future MD studies addressing the fine
HbA regulation, as the pH does in Bohr and Root effects.[40] Finally, the atomic-level description of the
intermediate states along the pathway may be also useful for the application
of sophisticated approaches, like QM//MM,[41] addressing the coupling between global HbA structural transition
and oxygen binding/release.
Materials and Methods
Structural Models
Fully atomistic MD simulations were
carried out on the T-state of HbA. In detail, the T–R transition
was followed using the high-resolution crystallographic structure
of the tetrameric HbA in the deoxy form (PDB ID: 2DN2) as the starting
model. Several literature studies have been devoted to pKa measurements and calculations.[42−46] Since the main goal of this study is the atomic-level
description of the T to R transition at neutral pH, Glu/Asp and Arg/Lys
residues were considered negatively and positively charged, respectively.
The protonation states of the histidine residues were chosen according
to Zheng et al.[47] Following indications
that emerged from previous MD simulations,[31] the terminal His146 of the β chains was deprotonated to favor
the transition. The protein structural transition was also explored
using the essential dynamics sampling (EDS).[48] To this aim (see below), an additional 1 ns-long classical MD simulation
on the fully ligated R-state (R4) was carried out using the high-resolution
crystal structure of the oxy form of the protein (PDB ID: 2DN1). The overall and
local structural features of the trajectory models were analyzed by
also considering a number of canonical states (R2, R3, and RR2) and
the putative T–R intermediate half-liganded HL-(C) state of
HbA (PDB ID: 4N7P). Moreover, we also compared the trajectory structures with the
following states identified for HbTn: T-state (TnT, PDB ID: 3NFE), R-state (TnR,
PDB ID: 1T1N), tetramers A and B (TnA and TnB, PDB ID: 5LFG), and tetramer H
(TnH, PDB ID: 3D1K). Details about all of these crystallographic models are reported
in Table S1.
Protocol
Fully
atomistic MD simulations of the HbA
T-state were performed using the GROMACS package[49] and the CHARMM-36 all-atom force field. The protein model
was solvated with water molecules (the TIP3P model was used) in a
cubic box of size 150 Å. The system was neutralized with sodium
and chloride counterions achieving a salt concentration of 0.15 m/L.
Electrostatic interactions were treated by means of the particle-mesh
Ewald (PME) method[50] with a grid spacing
of 1 Å, a relative tolerance of 10–6, together
with a 10 Å switching for the Lennard–Jones (LJ) interactions.
The LINCS algorithm was used for constraining bond lengths.[51] The system was first energy minimized using
the steepest descent (50 000 steps) and then equilibrated in
two steps. In the first phase, the system was heated to 300 K temperature
(increments of 10 K) for 300 ps (NVT). Equilibration of pressure at
1 atm was then conducted for 500 ps (NpT). The velocity rescaling
and Parrinello–Rahman algorithms were applied for temperature
and pressure control, respectively.[52,53] Four production
runs (T0, T0b, T0c, and T0d simulations) were performed at constant
temperature and pressure (NpT) at 300 K and 1 atm with a time step
of 2 fs. A principal component analysis was performed using the essential
dynamics technique.[54] In detail, starting
from the MD simulations, we built the covariance matrix of the protein
Cα atomic positions. The diagonalization of such
a matrix allowed us to obtain a set of eigenvectors with their associated
eigenvalues. These eigenvectors represent the principal protein motions
used to describe the “essential” protein modes, which
frequently correspond to the functional ones. This approach made it
possible to represent the protein overall dynamics in a reduced essential
subspace confined within the first eigenvectors defined as principal
components.The sampling method based on the essential dynamics
(EDS) was applied by preliminarily generating a basin of T- and R-like
models to calculate the eigenvectors used to perform the simulations.
Briefly, the EDS algorithm rejects the MD step when it brings the
system farther from the target and, in this case, the structure is
projected on the hypersphere defined by the set of eigenvectors so
that the system distance from the target is unchanged and a new MD
step can be performed. On the other hand, when the MD step brings
the system closer to the target, the step is accepted and a normal
“free” MD move is performed. When this algorithm is
applied an appropriate number of times, a MD trajectory bringing the
system from the starting geometry to the target one is obtained. This
approach has been successfully applied to the modeling of large protein
conformational rearrangements.[48,55,56] The ensemble of T-like structures contains the trajectory frames
of the first 1 ns of the classical simulation conducted on the T-state.
A reliable basin of the R-state was generated by performing a short
1 ns-long classical MD simulation starting from the fully ligated
R-state (PDB ID: 2DN1), applying the same protocol used in the simulation of the T-state.
Eigenvectors that describe the overall protein transition between
the T and R states were then computed using these two structural ensembles.
Upon eigenvector calculation, the EDS method was applied by randomly
selecting five structures from both basins used as starting points
in simulations of the T–R and R–T pathways and leaving
the system to move along such a set of eigenvectors. Also, the EDS
simulations were performed with CHARMM-36 all-atom force field and
the TIP3P water model using the GROMACS package.[49] The gmx covar and gmx anaeig tools were used to build the
covariance matrix and to calculate the projection with respect to
the first eigenvector.
Authors: David Van Der Spoel; Erik Lindahl; Berk Hess; Gerrit Groenhof; Alan E Mark; Herman J C Berendsen Journal: J Comput Chem Date: 2005-12 Impact factor: 3.376
Authors: L Camardella; C Caruso; R D'Avino; G di Prisco; B Rutigliano; M Tamburrini; G Fermi; M F Perutz Journal: J Mol Biol Date: 1992-03-20 Impact factor: 5.469
Authors: Rok Borštnar; Matej Repič; Shina Caroline Lynn Kamerlin; Robert Vianello; Janez Mavri Journal: J Chem Theory Comput Date: 2012-04-25 Impact factor: 6.006
Authors: R D'Avino; C Caruso; M Tamburrini; M Romano; B Rutigliano; P Polverino de Laureto; L Camardella; V Carratore; G di Prisco Journal: J Biol Chem Date: 1994-04-01 Impact factor: 5.157
Authors: Shane A Chandler; Yang Liu; Anthony V Signore; Arvind S Pillai; Carlos R Cortez-Romero; Justin L P Benesch; Arthur Laganowsky; Jay F Storz; Georg K A Hochberg; Joseph W Thornton Journal: Nature Date: 2020-05-20 Impact factor: 49.962