In biomolecules, bifurcated H-bonds typically involve the interaction of two donor protons with the two lone pairs of oxygen. Here, we present direct NMR evidence for a bifurcated H-bonding arrangement involving nitrogen as the acceptor atom. Specifically, the H-bond network comprises the Nδ1 atom of histidine and both the backbone N-H and side-chain Oγ-H of threonine within the conserved TXXH motif of ankyrin repeat (AR) proteins. Identification of the H-bonding partners is achieved via solution NMR H-bond scalar coupling (HBC) and H/D isotope shift experiments. Quantitative determination of (2h)J(NN) HBCs supports that Thr N-H···Nδ1 His H-bonds within internal repeats are stronger (∼4 Hz) than in the solvent exposed C-terminal AR (∼2 Hz). In agreement, pKa values for the buried histidines bridging internal ARs are several units lower than those of the C-terminus. Quantum chemical calculations show that the relevant (2h)J and (1h)J couplings are dominated by the Fermi contact interaction. Finally, a Thr-to-Val replacement, which eliminates the Thr Oγ-H···Nδ1 His H-bond and decreases protein stability, results in a 25% increase in (2h)J(NN), attributed to optimization of the Val N-H···Nδ1 His H-bond. Overall, the results provide new insights into the H-bonding properties of histidine, a refined structural rationalization for the folding cooperativity of AR proteins, and a challenging benchmark for the calculation of HBCs.
In biomolecules, bifurcated H-bonds typically involve the interaction of two donor protons with the two lone pairs of oxygen. Here, we present direct NMR evidence for a bifurcated H-bonding arrangement involving nitrogen as the acceptor atom. Specifically, the H-bond network comprises the Nδ1 atom of histidine and both the backbone N-H and side-chain Oγ-H of threonine within the conserved TXXH motif of ankyrin repeat (AR) proteins. Identification of the H-bonding partners is achieved via solution NMR H-bond scalar coupling (HBC) and H/D isotope shift experiments. Quantitative determination of (2h)J(NN) HBCs supports that Thr N-H···Nδ1 His H-bonds within internal repeats are stronger (∼4 Hz) than in the solvent exposed C-terminal AR (∼2 Hz). In agreement, pKa values for the buried histidines bridging internal ARs are several units lower than those of the C-terminus. Quantum chemical calculations show that the relevant (2h)J and (1h)J couplings are dominated by the Fermi contact interaction. Finally, a Thr-to-Val replacement, which eliminates the Thr Oγ-H···Nδ1 His H-bond and decreases protein stability, results in a 25% increase in (2h)J(NN), attributed to optimization of the Val N-H···Nδ1 His H-bond. Overall, the results provide new insights into the H-bonding properties of histidine, a refined structural rationalization for the folding cooperativity of AR proteins, and a challenging benchmark for the calculation of HBCs.
Hydrogen bonds (H-bonds) are essential
structural elements in the self-assembly, stability, and remarkable
catalytic properties of biomolecules. These low energy interactions
participate in processes essential to life either singly or as intricate
networks conveying structural and thermodynamic cooperativity. Yet,
the presence and configuration of intramolecular H-bonds, along with
their relative strength, are difficult to establish with direct experimental
methods. By default, H-bonds are often modeled in crystallographic
structures using theoretical idealized geometry, therefore leaving
the precise configuration unknown and important instances of strained
H-bonds unnoticed.Ankyrin repeat (AR) proteins have highly
cooperative folding–unfolding transitions.[1] Most of the available X-ray models show a conspicuous array
of H-bonds extending from repeat to repeat.[2,3] This
array involves conserved TXXH motifs initiating the first α-helix
of each repeat (Figure 1).[2−4] Histidine plays
an essential role within and between adjacent TXXH motifs. Specifically,
a side-chain/main-chain interaction involving the His Nδ1 and
Thr NH caps the N-terminus of the α-helix.[5] In addition, successive ARs pack against each other by
using the Nε2H of the TXXH histidine as an H-bond donor to the
carbonyl group of the residue preceding the TXXH threonine of the
next repeat.[4,6] To produce the detailed description
of the H-bond network necessary to rationalize thermodynamic properties,
we are pursuing solution NMR studies of consensus AR proteins. Here,
we provide direct evidence for stable bifurcated H-bonds in TXXH helix
capping motifs and demonstrate the importance of the Thr hydroxyl
group for the stability of the AR fold.
Figure 1
Ribbon diagram of the
first four ARs in E3_19 (PDB: 2BKG) as a model for the NRRC protein discussed
in this work (residue numbers correspond to NRRC). Each AR is an ∼33-residue
helix-turn-helix module followed by an extended β-hairpin loop.
The Thr and His of interest are shown in ball-and-stick representation
(green, 2nd repeat T44–H47; magenta, 3rd repeat T77–H80;
cyan, C-terminal repeat T110–H113). NRRC contains an additional
N-terminal TXXH motif (T11–H14).
Ribbon diagram of the
first four ARs in E3_19 (PDB: 2BKG) as a model for the NRRC protein discussed
in this work (residue numbers correspond to NRRC). Each AR is an ∼33-residue
helix-turn-helix module followed by an extended β-hairpin loop.
The Thr and His of interest are shown in ball-and-stick representation
(green, 2nd repeat T44–H47; magenta, 3rd repeat T77–H80;
cyan, C-terminal repeat T110–H113). NRRC contains an additional
N-terminal TXXH motif (T11–H14).Amide H/D exchange rates and the response of amide1H chemical shifts to temperature are routinely used to infer the
presence of H-bonds in proteins.[7] These
approaches are experimentally straightforward, but they do not identify
acceptor atoms or report on local structure. In contrast, scalar couplings
across H-bonds (HBCs)[8−13] are challenging to measure, but provide information not otherwise
experimentally accessible. In particular, the magnitude of the HBC
(hJ) is exquisitely sensitive to changes
in bonding geometry,[14−18] which makes this parameter well suited for interrogation of H-bond
strain and relaxation.[19]The most
common H-bond in proteins is of the N–H···O=C
type and has |3hJNC′| and |2hJHC′| HBCs
below 1 Hz.[10,13,20,21] The small hJ values and the yet smaller variations caused by different structural
contexts often limit the feasibility and utility of these measurements
to relatively small proteins. However, the 2hJNN coupling constants in H-bonds of the N–H···N
type can be as large as 11 Hz.[22] We therefore
focused on the N–H···Nδ1 interaction.
A survey of the Protein Data Bank indicates that the TXXH cap is found
in several non-AR protein structures (Table S1). These will offer interesting opportunities for comparative studies.In prior structural work, we assigned the backbone 1H, 15N, and 13C NMR signals of the three-repeat
consensus AR protein NRC, where N refers to the N-terminal AR, R to
an internal AR, and C to the C-terminal AR.[1] Here, we extend our study to the four-repeat AR protein NRRC, which
contains an additional TXXH motif. From 1H–15N LR HMQC spectra (Figure S1B,C) we deduce that each capping histidine adopts the Nε2H tautomer[23] in agreement with data published on the naturally
occurring AR protein gankyrin.[4] Intra-
and inter-repeat NOEs (Figure S2A) orient
the imidazole rings as in Figure 1.In
NRC, a soft HNN-COSY experiment detects three Thr(i) N–H···Nδ1
His(i+3) H-bonds through 2hJNN-mediated cross peaks between the NH of T11 (T44, T77) and Nδ1
of H14 (H47, H80) (Figure S3). In NRRC,
there are four detectable signals (Figure 2a, black peaks). A complete NMR connectivity map demonstrating the
helix capping Thr N–H···Nδ1 His H-bonds
in NRRC is presented in Figure S1.
Figure 2
(a) Overlay
of 600 MHz soft HNN-COSY spectra collected on 15N-labeled
consensus AR proteins for detection of backbone N–H···Nδ1
histidine H-bonds: NRRC (pH 6.6, 298 K, black) and T44V NRRC (pH 7.5,
308 K, red). Labels identify NH:Nδ1 bonding partners. (b) Downfield
region of quantitative 2hJNN-modulation 1-D HSQC spectra collected on NRRC. (c) Intensity-normalized
peak heights from (b) plotted as a function of the modulation period,
τ. Solid lines represent the best fit of the data to a cosine
wave (see text).
(a) Overlay
of 600 MHz soft HNN-COSY spectra collected on 15N-labeled
consensus AR proteins for detection of backbone N–H···Nδ1
histidine H-bonds: NRRC (pH 6.6, 298 K, black) and T44V NRRC (pH 7.5,
308 K, red). Labels identify NH:Nδ1 bonding partners. (b) Downfield
region of quantitative 2hJNN-modulation 1-D HSQC spectra collected on NRRC. (c) Intensity-normalized
peak heights from (b) plotted as a function of the modulation period,
τ. Solid lines represent the best fit of the data to a cosine
wave (see text).To investigate how the
attributes of Thr N–H···Nδ1 His H-bonds
vary from repeat to repeat (e.g., R1 to R2 to C in NRRC), we measured
the magnitude of 2hJNN by a
quantitative (Q) spin–echo difference method.[24,25] Figure 2b shows portions of Q-2hJNN 1-D HSQC spectra collected on NRRC.
The NH signals of T44, T77, and T110 undergo 2hJNN modulation with frequencies
reporting on each coupling constant. Peak heights were determined
as a function of the JNN modulation period
(τ) and fitted according to the relationship[25]I(τ) = A cos (π J τ) to extract 2hJNN values. In NRRC (Figure 2c), the
internal TPLH H-bonds (T44–H47, T77–H80) have 2hJNN values of ∼3.9–4.0
Hz, whereas the C-terminal TPEH H-bond has an attenuated value (T110–H113,
∼2.1 Hz). Because of 1H overlap, only an upper bound
for the N-terminal T11-H14HBC was determined (2hJNN < 4.1 Hz, not shown). The varying magnitude
of 2hJNN from internal to terminal
repeat (R1 = R2 > C) results from differences in H-bond geometry,
time-averaged populations, or both. The C-terminal T110–H113
motif has high solvent exposure and no carbonyl acceptor for H113
Nε2-H, which likely destabilizes the N–H···Nδ1
bond relative to those in internal repeats. Similar differences between
internal (T44–H47, 2hJNN ∼4.1 Hz) and C-terminal (T77–H80, 2hJNN ∼1.8 Hz) H-bonds are observed in NRC
(Table 1, Figure S4A).
Table 1
Chemical Shifts, 2hJNN Coupling Constants, and 2hΔ 1H Isotope Shifts in AR TXXH Motifs (298 K, pH 6.6)
Protein and proton
H-bond partners
δ (ppm)
|2hJNN| (Hz)
2hΔ 1H (ppb)
NRC NH
T44-H47
9.98
4.1 ± 0.2
ndb
NRC NH
T77-H80
9.18
1.8 ± 0.2
ndb
NRRC NH
T44-H47
9.79
4.0 ± 0.2
–53 ± 5
NRRC OγH
T44-H47
6.36
–47 ± 5
NRRC NH
T77-H80
9.58
3.9 ± 0.2
–53 ± 5
NRRC OγH
T77-H80
6.75
ndb
NRRC NH
T110-H113
9.13
2.1 ± 0.1
ndb
T44V NH
V44-H47
11.27a
5.2 ± 0.7a
0
T44V NH
T77-H80
9.57a
3.4 ± 0.5a
–55 ± 5
T44V OγH
T77-H80
6.70a
ndb
Data collected
at 308 K, pH 7.5. Most N–H···N type H-bonds
undergo thermal expansion (longer distance, lower 2hJNN) with increasing temperature.[32]
Not
determined.
Data collected
at 308 K, pH 7.5. Most N–H···N type H-bonds
undergo thermal expansion (longer distance, lower 2hJNN) with increasing temperature.[32]Not
determined.The trend in 2hJNN values is mirrored in other
physicochemical properties. For example, we measured the apparent
pKa of each TXXH histidine within NRR,
a protein containing identical repeats at internal (R1) and C-terminal
(R2) positions, by following the His Hε1, Hδ2, and Nδ1
resonances as a function of pH (Figure S5). H47, within R1, remains in the neutral state at pH values below
∼3, until the protein begins to undergo global acid unfolding.
H14 of the N-terminal AR is buried and shows similar behavior. In
contrast, the ionization midpoint of H80 (within R2) is only moderately
depressed (apparent pKa = 5.7). Protonation
of histidine necessarily breaks the N–H···Nδ1
H-bond, and greater pKa depression should
in part result from stronger H-bonds. Therefore, the data are consistent
with the use of the 2hJNNHBC
as a proxy for relative H-bond strength. The difference between buried
and solvent exposed N–H···Nδ1 H-bonds
in AR repeats is analogous to the difference observed between the
middle and ends of nucleic acid secondary structures, where fraying
leads to smaller 2hJNN values.[16]We next sought to detect Thr N–H···Nδ1
His1hJHN HBCs in NRRC by using
a high-sensitivity 1H–15N LR HMQC approach.[23] These 1hJHN were not observed, but surprisingly, the experiment yielded J-correlations between T44 (T77) Oγ–H and H47
(H80) Nδ1 (Figure 3a,b). LR HSQC modulation
experiments (Figure S6) confirmed the buildup
of the weak T44 Oγ–H···Nδ1 H47 and
T77 Oγ–H···Nδ1 H80 cross peaks.
From these spectra, a 3-Hz upper limit for 1hJHN was obtained, which is similar to the measured 1hJHN values of N···H–N
and N···H–O H-bonds in nucleic acids.[16,21,26−28] The observation
of 2hJNN and 1hJHN HBCs strongly suggests that H47 (H80) Nδ1
serves as a bifurcated H-bond acceptor to T44 (T77) N–H and
Oγ–H.
Figure 3
(a) Upfield and (b) downfield regions of the 1H–15N LR HMQC spectrum of NRRC at pH 6.6, 298 K.
Strong 2JNH Nε2–Hε1,
Nε2–Hδ2, Nδ1–Hε1, and weak 3JNH Nδ1–Hδ2
intra-imidazole correlations
are indicated. The 1hJHN His
Nδ1–H–Oγ Thr H-bond correlations are labeled
in red.
(a) Upfield and (b) downfield regions of the 1H–15N LR HMQC spectrum of NRRC at pH 6.6, 298 K.
Strong 2JNH Nε2–Hε1,
Nε2–Hδ2, Nδ1–Hε1, and weak 3JNH Nδ1–Hδ2
intra-imidazole correlations
are indicated. The 1hJHNHis
Nδ1–H–Oγ Thr H-bond correlations are labeled
in red.An H/D exchange experiment was
performed to test the proposed bifurcated H-bond scheme. When an 15N-labeled NRRC NMR sample was diluted into a 50:50 H2O/D2O solvent mixture, we observed H/D equilibration
of Thr hydroxyl groups, as illustrated by the reduction in intensity
of the resolved T44 Oγ–H signal (Figure 4a,b). Interestingly, equilibration was accompanied by splitting
of T44 and T77 N–H signals, confirmed with 1H–15N HSQC spectra (Figure S7A–B). We hypothesize that the population of T44 Oγ–H and
Oγ–D species and relatively slow H/D exchange (<5
s–1; see Figure S7) cause
a two-bond isotope effect, 2hΔ1H = δ1H(H) – δ1H(D) = −53 ppb, on
the corresponding amide NH, communicated via shared interaction with
H47 Nδ1 (Figure 4f). An identical effect
on the NH of T77 (2hΔ1H = −53 ppb)
is attributed to the population and slow exchange of T77 Oγ–H/D
species. H/D isotope shifts were not observed for any other backbone
amide. However, a reciprocal isotope shift (splitting of Thr Oγ–H
signal caused by mixed H/D occupancy at the Thr amide) is detected
after ∼24 h (Figure 4c). The 2hΔ1H value is −47 ppb for T44 OγH. Further
variation in the solvent composition confirms the assignment of each
isotopomer (Figure 4d–e). The N–H···Nδ1···H–Oγ two-bond isotope effects observed for
consensus ARs are of greater magnitude than those reported for protein
N–H···O···H–N H-bonds (2hΔ1H
= −18 to +23 ppb),[29] but are significantly
smaller than those detected for O–H···O···H–O H-bonds in the oxyanion hole of ketosteroid
isomerase (2hΔ1H = −250 to −170
ppb).[30] The two-bond isotope shifts (2hΔ1H) and HBCs (2hJNN and 1hJHN) in
NRRC provide independent evidence for bifurcated H-bonding in the
α-helix N-cap TXXH motif.
Figure 4
(a–e) 15N-decoupled 1H 1-D spectra of NRRC in H2O/D2O mixtures.
(a) 90:10; (b) 50:50, 1.5 h incubation; (c) sample (b) after 24 h;
(d) dilution of sample (c) to achieve a 66:34 mixture, 40 min; (e)
sample (d) after 21 h. Peaks marked with * result from the presence
of a D nucleus at the adjacent H-bond. (f) Proposed origin for 2hΔ H/D isotope effect: a bridging His acts as acceptor
to both Thr amide and hydroxyl hydrogens.
(a–e) 15N-decoupled 1H 1-D spectra of NRRC in H2O/D2O mixtures.
(a) 90:10; (b) 50:50, 1.5 h incubation; (c) sample (b) after 24 h;
(d) dilution of sample (c) to achieve a 66:34 mixture, 40 min; (e)
sample (d) after 21 h. Peaks marked with * result from the presence
of a D nucleus at the adjacent H-bond. (f) Proposed origin for 2hΔ H/D isotope effect: a bridging His acts as acceptor
to both Thr amide and hydroxyl hydrogens.The consequences of Thr Oγ–H···Nδ1
His H-bond deletion were explored with the isosteric T44V replacement
in NRRC. We reasoned that elimination of the bifurcated N–H···Nδ1···H–Oγ
interaction would perturb the remaining N–H···Nδ1
His H-bond. Denaturation experiments conducted at pH 8.0 indicate
that T44V NRRC is destabilized by ∼2.6 kcal/mol relative to
the consensus protein (Figure S8). Figure 2a shows the HNN-COSY spectrum of the variant (red
peaks). H-bond detection in T44V NRRC is difficult because of protein
aggregation at concentrations above ∼100 μM; nevertheless,
the weak T77 N–H···Nδ1 H80 correlation
is observable and overlays well with the reference NRRC signal. In
contrast, the V44 N–H···Nδ1 H47 cross
peak has increased intensity compared to the reference T44 N–H···Nδ1
H47 cross peak. Also remarkable are the large downfield shifts of
both V44 amide1H (∼1.5 ppm) and H47 15Nδ1 (∼10 ppm) (Figures 2a and S9), signifying N–H···N
H-bond reconfiguration and a decrease in bond length.[14,16,27,29,30]Measurement of 2hJNN due to the V44-H47 (∼5.2 Hz) and T77-H80
(∼3.4 Hz) H-bonds provides insight into the repercussions of
the T44V replacement relative to the TPLH helix caps in NRRC. In the
T44-H47 and T77-H80 bifurcated interactions, the 2hJNN, 1hJHN, and 2hΔ1H isotope shift data support
that the His ring orients its Nδ1 atom between Thr NH and OγH, adopting a nonlinear
geometry for both H-bonds and leading to relatively small T44-H47
and T77-H80 N–H···Nδ1 2hJNN couplings (∼4 Hz). Upon T44V replacement,
the observed ∼25% increase in 2hJNN (Figure S4B,C) suggests
a straightening (and concomitant shortening) of the Val N–H···Nδ1
His H-bond.To gain insight into the nature of the N–H···Nδ1···H–Oγ
H-bond network, quantum chemical calculations (described in the Supporting Information) were performed using
Gaussian 09[31] on fragments of 2BKG mimicking
an internal repeat. Energy minimization from multiple starting geometries
(Figure S10) validated the use of the X-ray
coordinates for the calculations. The computed HBCs (Figure S11, Table S2) are 2hJNN = +3.0 Hz, 1hJHN =
+2.0 Hz (N–H···Nδ1), and 1hJHN = +2.2 Hz (Nδ1···H–Oγ),
all dominated by the Fermi contact (FC) contribution. These values
are in reasonable agreement with the experimental numbers. Along with
the geometry of the TXXH unit, the HBCs suggest that the His Nδ1
sp2 lone pair is the major contributor to both hydrogen
bonds. H-bonding may also be augmented by interaction with the His
π-system, as in a cation−π interaction.[33] The larger 2hJNN in T44V NRRC is consistent with a repositioning of the histidine
ring that improves the orbital overlap and enhances the FC effect[34] (Figure S12, Table S2).Collectively, the data illuminate a relationship between
the magnitude of 2hJNN scalar
couplings and H-bond sharing. The N–H···N HBCs
measured for bifurcated H-bonds in buried consensus ARs (∼4
Hz) are significantly smaller than those for the few other reported
protein N–HN HBCs (∼6–11
Hz),[22,35,36] an indication
of weaker bonds and nonideal geometry in the former. Stability compensation
is likely provided by the bifurcated arrangement. Importantly, our
description of the H-bond network clarifies the role of the threonine
hydroxyl group and contributes a comparative view of the TXXH motif
within individual repeats. Further work will extend to longer AR proteins
in order to explore the generality of the bifurcated N–H···Nδ1···H–Oγ
H-bond and determine the factors controlling its formation.
Authors: Andreas Kohl; H Kaspar Binz; Patrik Forrer; Michael T Stumpp; Andreas Plückthun; Markus G Grütter Journal: Proc Natl Acad Sci U S A Date: 2003-02-03 Impact factor: 11.205
Authors: Alexander Eletsky; Tim Heinz; Osvaldo Moreira; Alexander Kienhöfer; Donald Hilvert; Konstantin Pervushi Journal: J Biomol NMR Date: 2002-09 Impact factor: 2.835
Authors: Jacob D Marold; Kevin Sforza; Kathryn Geiger-Schuller; Tural Aksel; Sean Klein; Mark Petersen; Ekaterina Poliakova-Georgantas; Doug Barrick Journal: Protein Sci Date: 2020-11-02 Impact factor: 6.725