Literature DB >> 26523621

Helix-Capping Histidines: Diversity of N-H···N Hydrogen Bond Strength Revealed by (2h)JNN Scalar Couplings.

Matthew R Preimesberger¹, Ananya Majumdar², Selena L Rice¹, Lauren Que¹, Juliette T J Lecomte¹.

Abstract

In addition to its well-known roles as an electrophile and general acid, the side chain of histidine often serves as a hydrogen bond (H-bond) acceptor. These H-bonds provide a convenient pH-dependent switch for local structure and functional motifs. In hundreds of instances, a histidine caps the N-terminus of α- and 310-helices by forming a backbone NH···Nδ1 H-bond. To characterize the resilience and dynamics of the histidine cap, we measured the trans H-bond scalar coupling constant, (2h)JNN, in several forms of Group 1 truncated hemoglobins and cytochrome b5. The set of 19 measured (2h)JNN values were between 4.0 and 5.4 Hz, generally smaller than in nucleic acids (~6-10 Hz) and indicative of longer, weaker bonds in the studied proteins. A positive linear correlation between (2h)JNN and the difference in imidazole ring (15)N chemical shift (Δ(15)N = |δ(15)Nδ1 - δ(15)Nε2|) was found to be consistent with variable H-bond length and variable cap population related to the ionization of histidine in the capping and noncapping states. The relative ease of (2h)JNN detection suggests that this parameter can become part of the standard arsenal for describing histidines in helix caps and other key structural and catalytic elements involving NH···N H-bonds. The combined nucleic acid and protein data extend the utility of (2h)JNN as a sensitive marker of local structural, dynamic, and thermodynamic properties in biomolecules.

Entities: CellLine Chemical Disease Gene Mutation Species

Mesh：

Substances：

Year: 2015 PMID： 26523621 PMCID： PMC4660981 DOI： 10.1021/acs.biochem.5b01002

Source DB: PubMed Journal: Biochemistry ISSN： 0006-2960 Impact factor: 3.162

The importance of hydrogen bonds (H-bonds) in the folding and structural organization of biomacromolecules has been recognized for decades. “Hydrogen Bonding in Biological Structures”[1] presents an excellent historical perspective extending to the late 1980s. Since then, nuclear magnetic resonance (NMR) spectroscopy has emerged as an essential tool in the characterization of hydrogen bonding in both proteins and nucleic acids. Indeed, numerous NMR observables are available for assessing the presence, strength, and other fundamental characteristics of H-bonds. They include chemical shifts, fractionation factors, temperature coefficients, and hydrogen exchange rates. In favorable instances, these indirect manifestations can be complemented with direct detection of the H-bonding interaction. The parameter of interest in this work is the scalar coupling that connects NMR-active nuclei across the H-bond.[2−5] H-Bond scalar couplings (HBCs) have the advantage of identifying donor and acceptor atoms and are extraordinarily sensitive to geometric properties such as H-bond length.[5−9] In addition, because the magnitude of an HBC is affected by time and ensemble averaging, HBCs can also inform on the local dynamics and energetics of H-bonds within biomacromolecules.[8−10] The predominant H-bond of proteins is the backbone–backbone N–H···O=C′ sustaining regular secondary structure. The 15N–13C three-bond (3hJNC′) and 1H–13C two-bond (2hJHC′) couplings associated with these and similar interactions have been measured in several proteins under various conditions.[2,8,11−15] These couplings are invariably small (<1 Hz), which has focused application to diamagnetic proteins of fewer than 100 residues because of their favorable T1 and T2 relaxation times. In contrast, N–H···N H-bonds are relatively rare but exhibit 15N–15N two-bond couplings (2hJNN) between 2 and 11 Hz,[16−18] an experimentally accessible range for proteins with less-than-optimal relaxation properties. Protein N–H···N H-bonds frequently involve a histidine as the acceptor and another histidine or a backbone amide as the donor. A perusal of X-ray structures suggests an average of one bond of the latter type in every three deposited structures and fewer yet of the side chain to side chain type. However, such rarity does not imply that protein N–H···N H-bonds are insignificant; participation in active sites and critical elements of structure, for example, N-terminal caps of helices,[19] along with unique ionization and tautomeric properties make histidines and their H-bonds especially interesting to study. In its helix N-capping role, the histidine uses the Nδ1 atom as an H-bond acceptor and adopts the Nε2-H tautomeric state. Two capping configurations predominate, which we denote as i-to-i+3 and i-to-i–2, where i is the donor amide (Figure ). Recently, we have reported the detection of i-to-i+3 N–H···Nδ1 H-bonds within the α-helix N-caps of ankyrin repeat (AR) proteins.[18] The consensus AR N-cap has the sequence TXXH and is unusual in that the histidine Nδ1 atom acts as a bifurcated H-bond acceptor to both threonine amide and hydroxyl hydrogens (Figure B) in an arrangement that conditions inter-repeat packing and the geometry of the first turn of helix. In the study presented here, we expanded the AR protein experiments to a greater set of histidine N-caps and chose those caps in heme proteins (truncated hemoglobins and cytochrome b5) so that changes in heme placement, oxidation state, ligation state, or post-translational modification (PTM) could be exploited as perturbations to the N–H···Nδ1 bond.

Figure 1

Helix capping by histidine. (A) i-to-i+3 N–H···Nδ1 H-bond in Synechococcus GlbN (PDB entry 4MAX).[20] (B) Bifurcated i-to-i+3 N–H···Nδ1···H–Oγ interaction in the consensus AR (PDB entry 2BKG).[21] (C) i-to-i–2 N–H···Nδ1 H-bond in rat mitochondrial cytochrome b5 (PDB entry 4HIL). Truncated hemoglobins (TrHbs) share a basic architecture composed of seven helices labeled A–C and E–H by analogy to the canonical globins.[22] In one group of TrHbs (TrHb1s), α-helix G is initiated by a strong start signal[19] of the i-to-i+3 variety. The motif uses a histidine at the i+3 position in hundreds of TrHb1s, including those from the unicellular eukaryotes Chlamydomonas eugametos (CtrHb) and Chlamydomonas reinhardtii (THB1), and from the cyanobacteria Synechocystis sp. PCC 6803 (Synechocystis GlbN) and Synechococcus sp. PCC 7002 (Synechococcus GlbN). These homologous TrHb1s (∼40–50% identical) differ in the sequence of the cap and the residues immediately preceding and following it. The TrHb1 cap is close to the heme group and on the same side as the “proximal” histidine (Figure S1A), a conserved ligand to the iron in the ferric (oxidized) and ferrous (reduced) states. THB1 and both GlbNs are “hexacoordinate” hemoglobins. They ligate the heme iron with the proximal histidine and a distal lysine (THB1)[23] or histidine (GlbNs).[24,25] Exogenous ligands such as O2 can displace the distal residue and force a conformational rearrangement bringing a tyrosine from the B helix (Tyr B10) and glutamine(s) from the E helix into the distal heme pocket.[20,23,26] The B and E helices affected by the ligand replacement are remote from the helix-capping H-bond (Figure S1A). Comparing HBCs with and without exogenous ligand therefore offers an opportunity to assess long-range coupling between the distal and proximal sides of the heme. In addition to hexacoordination, the two GlbNs have the ability to react with the heme group (Figure ). The irreversible post-translational modification (PTM) involves His117, a noncoordinating histidine located on the H helix near the heme 2-vinyl group.[27] Heme reduction in the absence of oxygen causes spontaneous formation of the His117 Nε2–2-Cα heme linkage (Figure B)[28,29] and generates “GlbN-A”. Recent work has demonstrated that analogous histidine–heme modifications can be engineered at the 4-vinyl group with the Leu79His replacement.[30] In the absence of the native (2-vinyl) cross-link, the 4-vinyl-reacted protein is denoted as GlbN-B (Figure C), whereas the doubly cross-linked material is termed GlbN-AB (Figure D). The “B” cross-link was also implanted into CtrHb with the Leu75His replacement (CtrHb-B).[31] Panels C and D of Figure emphasize the proximity of the engineered modification to the α-helix N-cap. For our purposes, the artificial linkages serve to probe the response of the G helix N-cap to a nearby structural perturbation.

Figure 2

Heme b and its modifications in TrHb1s. (A) Heme b. (B) Native cross-link in wild-type Synechococcus and Synechocystis GlbN (GlbN-A). (C) Cross-link engineered in GlbN by placing a histidine at position 79 (GlbN-B). (D) Double cross-link engineered in GlbN (GlbN-AB). Amino acid labeling corresponds to that of Synechocystis GlbN. The i-to-i–2 motif often has a proline at the i–1 position and forms a tight turn adequate for capping 310-helices. This second type of N-capping N–H···Nδ1 H-bond is found in the electron transfer protein cytochrome b5, where it initiates the C-terminal helix of the soluble heme domain. The H-bond is formed between His80 Nδ1 and Asp82 N–H and is more than 20 Å from the heme binding pocket (Figure S1B). The cap is preserved in the apoprotein,[32] although an ∼1% capless population can be detected in the NMR data.[32,33]Figure C shows the structure of the 310 histidine N-cap. The goals of this work were 2-fold. First, we wanted to explore the use of 2hJNN as a semiquantitative reporter of stress in helix caps formed with a histidine side chain. The HBC values collected on TrHb1s and cytochrome b5 in different states would inform on long-range transmission of perturbation. Second, we sought to add to the HBCs acquired on the helix caps of AR proteins[18] and compare the NMR properties of protein N–H···N H-bonds with those of nucleic acids.[3,34,35] These goals required the accurate measurement of 2hJNN HBCs in multiple proteins. As will be demonstrated, the data set uncovers trends in NMR parameters and extends the utility of 2hJNN HBCs for biophysical characterization in proteins. It also provides insights into the distinctive properties of HBCs involving an acceptor histidine.

Materials and Methods

Protein Expression and Purification

Overexpression and purification of uniformly 15N-labeled recombinant ferric TrHb1s and cytochrome b5 were achieved as described previously.[23,25,36,37] A summary of the published procedures is provided in the Supporting Information. Covalently modified hemoglobins (GlbN-A, GlbN-B, GlbN-AB, and CtrHb-B) were produced by reduction of the ferric or cyanomet holoprotein with a 2–5-fold molar excess of dithionite (DT, >85%, Alfa-Aesar) for at least 15 min, followed by oxidation and purification.[30]

NMR Sample Preparation

Lyophilized ferric recombinant protein was dissolved in NMR buffer [0–125 mM sodium/potassium phosphate (pH 7.0–7.5) and 10% D2O]. Cyanide-bound ferric hemoglobins were prepared by addition of a 2–5-fold molar excess of KCN (J. T. Baker). Ferrous cytochrome b5 was produced by DT reduction of an ∼3 mM ferric protein (1.5-fold molar excess of DT to protein) sample. To prevent evaporation or oxidation, protein samples were transferred to Shigemi tubes and sealed with Parafilm M prior to the collection of NMR data. Protein concentrations ranged from 500 μM to 5 mM, although the concentrations of most samples were from ∼1.0 to 2.5 mM. Under the chosen conditions, each protein is monomeric. Specific NMR sample conditions are listed in Table S1.

NMR Data Acquisition

NMR spectroscopy was conducted using 600.13 MHz Bruker Avance or 600.53 MHz Avance-II spectrometers, each equipped with a cryogenic probe. 1H–15N HSQC, histidine-selective 1H–15N long-range (LR) HMQC,[38] soft 1H–(N)–15N COSY,[39] and quantitative 2hJNN constant-time spin–echo (CTSE) difference 1D/2D HSQC spectra[39] were acquired as detailed elsewhere.[18] A typical 2hJNN CTSE series consisted of 12–20 1D or 2D spectra using different τ modulation periods and two or three duplicate values. For a representative 1–2 mM protein sample, a single 2hJNN CTSE 1D spectrum was acquired with 512–1024 transients, a constant-time relaxation delay of ∼150 ms, and a recycle delay of ∼1.0 s and therefore required ∼10–20 min per data point. All ferric proteins in this study are paramagnetic (S = 1/2). Unless otherwise noted, NMR data were collected at 313 K to improve H-bond detection. 1H chemical shifts were referenced with respect to the water line (4.58 ppm at 313 K and 4.76 ppm at 298 K); 15N chemical shifts were referenced indirectly using the Ξ ratio.[40]

NMR Data Processing, Analysis, and Curve Fitting

NMR data were processed with NMRPipe[41] or Topspin 2.1 (Bruker BioSpin). Spectra were analyzed using Sparky 3.[42] For 2hJNN modulation data, 1D peak intensities were obtained using the deconvolution (mixed Lorentzian/Gaussian) routine of Topspin 2.1. 2D peak volumes were calculated by peak integration (Sparky 3). Peak intensities were tabulated and plotted as a function of the 2hJNN modulation time, τ. The fit to the data was performed with Kaleidagraph (Synergy Software) and the equation I(τ) = A cos(πJτ) to extract the initial amplitude (A) and J coupling magnitude (|2hJNN|).[43] For some 2hJNN modulation curves, peak heights were substituted for intensities to yield better signal-to-noise ratios. However, the choice of heights or intensities did not significantly affect the best fit 2hJNN values. In most instances, the fitting error for 2hJNN was well below 0.1 Hz. The experimental error was based on peak intensities (or heights) in duplicate spectra collected at the beginning and end of an experimental series. The combined error estimate reported in Table was typically <5%. The 2hJNN modulation curves plotted in Figure S3 have been normalized by their individually fitted amplitudes to facilitate comparison of peaks with different intensities.

Table 1

2hJNN Values, Amide Donor 1H Chemical Shifts, and Histidine 15Nδ1 and 15Nε2 Chemical Shifts Observed for N–H···Nδ1 Helix-Capping H-Bondsa

labelb	protein	distalc	\|^2hJ_NN\| (Hz)	¹H (ppm)	δ¹⁵Nε2 (ppm)	δ¹⁵Nδ1 (ppm)
	CtrHb, Md^,e	CN	5.3 ± 0.2	11.41	ndf	254.9
A	CtrHb, md	CN	5.2 ± 0.2	11.39	166.6	254.9
B	CtrHb, M	CN	5.0 ± 0.1	11.17	166.5	255.1
C	CtrHb-B	CN	4.0 ± 0.3	10.45	168.6	249.6
D	THB1, Mg	CN	4.8 ± 0.1	11.18	166.4	256.6
E	THB1, mg	CN	4.6 ± 0.3	11.28	166.9	256.9
F	Syn7002h GlbN	His	4.7 ± 0.2	10.77	166.9	256.4
G	Syn7002 GlbN-A	His	4.8 ± 0.2	10.97	167.3	256.6
H	Syn6803i GlbNg	CN	5.0 ± 0.2	11.34	165.9	257.7
I	Syn6803 GlbN-Ag	CN	4.8 ± 0.2	11.40	166.1	258.0
J	Syn6803 GlbN	His	4.9 ± 0.2	11.14	165.8	257.4
K	Syn6803 GlbN-A	His	4.9 ± 0.2	11.28	166.0	257.5
L	Syn6803 GlbN-B	His	4.5 ± 0.2	10.93	166.9	255.3
M	Syn6803 GlbN-AB	His	4.3 ± 0.2	10.87	167.6	254.5
N	Syn6803 GlbN-B	CN	5.4 ± 0.1	11.40	166.0	256.9
O	Syn6803 GlbN-AB	CN	5.0 ± 0.1	11.29	166.4	257.1
P	ferric cyt b₅	His	5.0 ± 0.1	10.88	165.0	250.5
Q	ferrous cyt b₅	His	5.0 ± 0.1	10.94	165.3	250.6
R	apocyt b₅j	−	5.3 ± 0.1	11.15	165.0	250.3

Measured at 313 K, 10% D2O, and pH 7.0–7.2 on proteins containing ferric heme iron unless otherwise noted.

As annotated in Figure .

Distal ligand to the heme iron.

M, major heme isomer; m, minor heme isomer.

Measured at 283 K.

Not determined.

Measured at pH 7.3–7.5.

Syn7002, Synechococcus sp. PCC 7002.

Syn6803, Synechocystis sp. PCC 6803.

Measured at 298 K.

Simulation of the pH Response

The influence of N–H···Nδ1 H-bonding on the histidine ionization equilibrium, observed 2hJNN couplings, and imidazole 15N chemical shifts was explored using simulations performed with Scilab 5.4.1 (Scilab Entreprises S.A.S.). The thermodynamic model assumed the existence of four states: (1) a capped configuration in which the N–H···Nδ1 hydrogen bond is intact, coupling is active [2hJNN = 2hJNN(capped)], and the histidine adopts the Nε2-H neutral tautomer, (2) an open state in which the hydrogen bond is broken (2hJNN = 0) and the histidine occurs as the Nε2-H neutral tautomer, (3) an open state in which the hydrogen bond is broken (2hJNN = 0) and the histidine occurs as the Nδ1-H neutral tautomer, and (4) an open imidazolium form (2hJNN = 0). The cis state of the cytochrome Xxx–Pro bond was not detected, and the Xxx–Pro cis–trans equilibrium was not considered in the modeling. The input parameters included the microscopic pKa for the capped state and both open states. For the open state, we chose the values of Nα-acetyl-histidine methylamide to eliminate terminal charge and intramolecular H-bonding effects.[44] As determined by Tanokura,[45] the imidazolium form dissociates to the Nε2H tautomer (open, Nε2H) with a pKa of 6.53 and to the Nδ1H tautomer (open, Nδ1H) with a pKa of 6.92. The Hill coefficients (nH) characterizing proton uptake[46] were all held at unity. The fractional populations of the four forms (capped, Nε2H; open, Nε2H; open, Nδ1H; and open, +) were evaluated as a function of pH using eqs –5.where Q is the partition function using the open protonated state as the reference. The predicted histidine 15N chemical shifts were calculated assuming fast exchange on the 15N chemical shift time scale using eqs and 7: The coefficients in eqs and 7 correspond to the limiting 15N chemical shift values of Vila (neutral open tautomers)[47] and Pelton and co-workers (imidazolium);[38] the limiting 15N chemical shift values for the “capped, Nε2H” state were chosen to encompass the experimentally determined heme protein 15N chemical shifts. We note that the open state pKas were determined at 310 K.[45] No correction was applied to these values when considering experimental data at 313 K. Open state pKas at 298 K were adjusted assuming a histidine enthalpy of ionization of +30 kJ/mol.[48]

Database Analysis

The structures of proteins determined by X-ray crystallography with better than 1.5 Å resolution were collected from the Protein Data Bank (July 30, 2015) and culled at 70% identity. Each of the 3253 structures was examined for the presence of amide–histidine N–Nδ1 pairs having an interatomic distance of <3.2 Å. Hydrogens were ignored, and a small number of structures having short N–Cδ2 distances were not assessed for the plausibility of alternative histidine χ2 angles. A representative set of helix caps (one chain per structure) was extracted from the data to obtain inter-nitrogen distance distributions for the two types of caps. The histogram in Figure was optimized for bin size with a published method.[49] In addition, 77 myoglobin structures (resolution better than 1.5 Å) containing the His24–His119 pair were collected and the Nε2–Nε2 distances measured for comparison with the N-capping H-bonds.

Figure 11

Dependence of 2hJNN(capped) on inter-nitrogen distance as approximated by the Fermi contact contribution to coupling.[7] The equation is 2hJNN (Hz) = 795579 exp(−3.9868 d), where d is the inter-nitrogen distance in angstroms.[7] The red dashed lines bracket the values measured in this work. The green dashed line indicates the 2hJNN(capped) predicted for the solvent-exposed cap of AR proteins. The blue dashed line indicates the value reported for the His24–His119 pair of apomyoglobin.[17] The optimized histogram[49] (right ordinate) is for 803 nonredundant i-to-i–2 and i-to-i+3 distances of <3.2 Å obtained from the PDB survey.

Results

N–H···N H-Bonding in Cyanomet CtrHb

We first present the data collected with one of the globins, namely CtrHb, in the ferric state with cyanide as the distal ligand [cyanomet CtrHb, or CtrHb-CN (PDB entry 1DLY)].[50] The putative i-to-i+3 interaction in CtrHb involves Ser76 and His79 and is a convenient subject for illustrating the spectroscopic manifestations of the capping H-bonds along with the utility of solution phase characterization. The nuclei of interest are the backbone amide 15N and 1H nuclei and the histidine 15Nδ1, 1Hε1, 15Nε2, and 1Hδ2 nuclei. These are the targets of J-correlated experiments designed to establish unambiguously the existence of the N–H···Nδ1 H-bond. The amide 1H–15N HSQC spectrum of CtrHb-CN shows two sets of cross-peaks (Figure A) corresponding to two heme orientations in the heme pocket. The major and minor isomers, related by an ∼180° rotation of the heme about its α–γ meso axis, are in slow exchange on the chemical shift time scale and occur in a 7:3 ratio in this protein.[31] Sequence specific assignments identify the amide proton signals of Ser76 at downfield shifts of ∼11.2 ppm (major isomer) and ∼11.4 ppm (minor isomer). Notably, high-resolution HSQC data show that both Ser76 cross-peaks appear as 5 Hz doublets in the 15N dimension (Figure S2A,B), a fine structure that is absent from all other 15N–1H cross-peaks.

Figure 3

Identification of the N–H···Nδ1 H-bond in CtrHb-CN. (A) 1H–15N HSQC (downfield amide 1H region). The minor heme isomer is denoted with a prime. (B) 1H–15N LR-HMQC (upfield 15N region). (C) 1H–15N LR-HMQC (downfield 15N region). (D) 1H–(N)–15Nδ1 COSY. (E) 1H-coupled 1Hε1–(Nδ1)–15N–(H) LR-COSY. (F) Ser76 N–H···Nδ1 His79 helix-capping scheme in C. eugametos CtrHb-CN. Correlated nuclei determined by the NMR data shown in panels D and E are highlighted with red and blue circles, respectively.

Histidine-selective 1H–15N LR-HMQC spectra obtained with CtrHb-CN were analyzed in previous work.[31] They show that major His79 and minor His79 are deprotonated at neutral pH, adopt the Nε2H tautomer, and have Nδ1 signals at ∼255 ppm (Figure B,C and Table ). Backbone amide 1H–15N HSQC spectra acquired with 15N decoupling centered at the His79 Nδ1 frequency during the evolution period contain collapsed Ser76 doublets (Figure S2C). This observation is in agreement with a ∼ 5 Hz 2hJNN caused by the Ser76–His79 H-bond. In addition, a weak cross peak between His79 15Nδ1 and the amide proton of Ser76 (major isomer) is detected in the high sensitivity 1H–15N LR-HMQC data (Figure S2D,E). Because these J-correlated nuclei are separated by 14 covalent bonds, the observed connectivity must arise via the one-bond trans HBC, 1hJHN. Identification of the N–H···Nδ1 H-bond in CtrHb-CN. (A) 1H–15N HSQC (downfield amide 1H region). The minor heme isomer is denoted with a prime. (B) 1H–15N LR-HMQC (upfield 15N region). (C) 1H–15N LR-HMQC (downfield 15N region). (D) 1H–(N)–15Nδ1 COSY. (E) 1H-coupled 1Hε1–(Nδ1)–15N–(H) LR-COSY. (F) Ser76 N–H···Nδ1 His79 helix-capping scheme in C. eugametos CtrHb-CN. Correlated nuclei determined by the NMR data shown in panels D and E are highlighted with red and blue circles, respectively. Measured at 313 K, 10% D2O, and pH 7.0–7.2 on proteins containing ferric heme iron unless otherwise noted. As annotated in Figure .

Figure 5

Overlay of HNN-COSY spectra. Data collected on hemoglobins and holocytochrome b5 at 313 K and apocytochrome b5 at 298 K (peak R). Each cross-peak corresponds to the amide 1H and His 15Nδ1 of an N–H···Nδ1 H-bond, correlated by 2hJNN. The peak labels are defined in the first column of Table .

Distal ligand to the heme iron. M, major heme isomer; m, minor heme isomer. Measured at 283 K. Not determined. Measured at pH 7.3–7.5. Syn7002, Synechococcus sp. PCC 7002. Syn6803, Synechocystis sp. PCC 6803. Measured at 298 K. To document further the presence of the N–H···Nδ1 H-bond, a soft 1H–(N)–15N COSY experiment[18,39] was used. Unlike the LR-HMQC, the HNN-COSY pulse sequence utilizes the two-bond homonuclear HBC 2hJNN for N–H···N H-bond identification. This approach is advantageous because the 2hJNN coupling is often considerably larger than the 1hJHN coupling[4] and therefore leads to a gain in sensitivity. Figure D illustrates the results of such an experiment. Two peaks are present in the spectrum and identify Ser76 amide 1H and His79 15Nδ1 within the N–H···N H-bonds (major and minor isomers). In agreement, the 1H-coupled 1H–(N)–15N–(H) LR COSY data[51] shown in Figure E also display two 2hJNN-mediated peaks and correlate His79 1Hε1 with Ser76 15N, the latter split by its directly attached proton (|1JNH| ∼ 90 Hz). The NMR connectivity pattern shown in Figure is unequivocal evidence of the N–H···Nδ1 H-bond between Ser76 and His79 in CtrHb-CN. To determine if the 2hJNN coupling constants involving a histidine ring may be practical reporters for H-bond properties, we measured the magnitude of the 2hJNN couplings using a high-precision quantitative CTSE difference HSQC experiment.[43] With this method, amide NH groups within N–H···Nδ1 H-bonds undergo 2hJNN modulation according to the timing (τ) of a histidine-selective 15Nδ1 inversion pulse. Repeating the experiment for different values of τ yields a modulation curve from which the magnitude of 2hJNN can be accurately extracted. Figure A presents the downfield region of 1D data collected for CtrHb-CN. The intensities of the resolved Ser76 NH protons were plotted as a function of the 2hJNN modulation time τ and fit according to the relationship I(τ) = A cos(πJτ).[43]

Figure 4

Measurement of the 2hJNN magnitude in CtrHb-CN. (A) Stack plot showing the resolved amide proton resonances of Ser76 (major and minor isomers) as a function of the 2hJNN modulation time, τ. (B) Normalized Ser76 NH peak intensities (major isomer, black circles) as a function of τ. The horizontal black line indicates zero intensity, and the black dashed vertical line corresponds to the null time for 2hJNN = 5 Hz (100 ms). Simulated 2hJNN curves (black lines) between 4 and 6 Hz in 0.2 Hz increments are included for comparison. The data in Figure B yield a well-defined value (|2hJNN| = 5.0 ± 0.1 Hz, major isomer) for the Ser76–His79 N–H···Nδ1 interaction, in agreement with the directly observed splitting (Figure S2A,B). The minor heme isomer has the same coupling constant within error [|2hJNN| = 5.2 ± 0.2 Hz (Figure S3A)]. Decreasing the temperature from 313 to 283 K yielded a relatively constant |2hJNN| [5.3 ± 0.2 Hz, major isomer (Figure S3S)]. The magnitude of the CtrHb-CN 2hJNN couplings is within the range expected from the few previously reported instances of amide N–H···N His H-bonds (∼2–6 Hz).[16,18]

N–H···N H-Bonding in THB1-CN and Synechocystis GlbN-CN

We collected HSQC, LR-HMQC, HNN-COSY, and 2hJNN CTSE difference spectra to characterize the N–H···N H-bonds in THB1-CN (Asn87–His90) and Synechocystis GlbN-CN (Asn80–His83). These proteins yielded HNN-COSY signals (Figure ) and 2hJNN values (Table and Figure S3D,E,H) similar to those of CtrHb-CN, in a suggestion that sequence context and the identity of the amide donor (Ser in CtrHb and Asn in Synechocystis GlbN and THB1) have minimal effects on the helix-capping H-bond. GlbN-CN with native heme PTM (GlbN-A-CN) yielded a 2hJNN value [4.8 ± 0.2 Hz (Figure S3I)] close to that in its unmodified form. Thus, the native His117–2-Cα cross-link appears to have little influence on the N–H···Nδ1 H-bond in the cyanomet complexes. Overlay of HNN-COSY spectra. Data collected on hemoglobins and holocytochrome b5 at 313 K and apocytochrome b5 at 298 K (peak R). Each cross-peak corresponds to the amide 1H and His 15Nδ1 of an N–H···Nδ1 H-bond, correlated by 2hJNN. The peak labels are defined in the first column of Table .

N–H···N H-Bonding in Bis-histidine Synechocystis and Synechococcus GlbNs

To examine how binding of cyanide to the distal heme site and the consequent conformational rearrangement alter the N–H···N N-cap, we prepared Synechococcus and Synechocystis GlbNs with and without PTM in their ferric bis-histidine state. All four species yielded observable HNN-COSY cross-peaks (Figure ) and 2hJNN values of 4.7–4.9 Hz (Table and Figure S3F,G,J,K). These results indicate insensitivity to the distal ligand and associated structural differences in the wild-type GlbNs, regardless of heme modification status. In the case of Synechococcus GlbN, which contains a TXXH motif as in the previously studied AR proteins,[18] no evidence of bifurcated N–H···Nδ1···H–Oγ H-bonding was observed. In addition, the relatively constant magnitude of 2hJNN detected for wild-type TrHb1s from C. eugametos, C. reinhardtii, Synechococcus, and Synechocystis (4.6–5.2 Hz) was greater than the 2hJNN values obtained for the bifurcated H-bonds of ARs (1.8–4.1 Hz),[18] in support of a shorter N···Nδ1 distance in the former proteins.

Influence of Engineered His-Heme PTM on the Helix-Capping N–H···N H-Bond

Non-native heme modifications can be introduced into Synechocystis GlbN as depicted in panels C and D of Figure . Synechocystis GlbN-B and GlbN-AB exhibit bis-histidine coordination as in the wild-type protein.[30]Figure shows the correlations between Asn80 backbone amide 1H and His83 15Nδ1 and therefore the presence of the helix-capping N–H···N H-bonds. However, the data demonstrate systematic upfield shifts for Asn80 1H and His83 15Nδ1 nuclei relative to GlbN-A. Measurement of the 2hJNN couplings (Table and Figure S3L,M) also suggests H-bond lengthening (or weakening) relative to the bis-histidine wild-type reference. Upon replacement of the distal histidine with cyanide in GlbN-B and GlbN-AB, the H-bonds persist (Figure ). In contrast to observations in the bis-histidine state, Asn80 1H and His83 15Nδ1 resonances have large downfield shifts. Furthermore, a significant (∼16–20%) increase in the 2hJNN coupling constants is measured in both GlbN-AB-CN (+0.7 Hz; 2hJNN = 5.0 Hz) and GlbN-B-CN (+0.9 Hz; 2hJNN = 5.4 Hz) (Table and Figure S3O,N). This is presumably due to H-bond relaxation relative to the strained bis-histidine complexes. Thus, formation of the engineered linkage to the FG turn, adjacent to the helix N-cap, couples distal cyanide binding to the proximal side of the heme pocket in a manner not observed in the wild-type proteins. A non-native histidine–heme covalent modification analogous to that in Synechocystis GlbN-B can be engineered in CtrHb.[31] HNN-COSY (Figure ) and 2hJNN modulation experiments performed with CtrHb-B-CN (Figure S3C) demonstrate that the N–H···Nδ1 H-bond is considerably perturbed (2hJNN = 4.0 Hz) relative to its wild-type reference [2hJNN = 5.0 Hz (Table )]. These results suggest that the formation of the engineered cross-link to the FG turn in CtrHb-B-CN generates strain in the adjacent helix-capping H-bond. The ∼20% decrease in 2hJNN observed for CtrHb-B-CN compared to wild-type CtrHb-CN is in contrast to that of the GlbN-B-CN and GlbN-CN pair [8% increase by cross-linking (Table )]. The opposite effect of the engineered cross-link in CtrHb-CN and GlbN-CN on the G helix N-cap implicates a difference in the conformational plasticity of neighboring elements of structure. It is worth noting that the F-helix/FG loop sequence preceding the cap in CtrHb (VPHL) is shorter by two residues than in GlbN (VENHGL).

N–H···N H-Bonding in the His-Pro-Asp Motif of Cytochrome b5

To study the i-to-i–2 motif and its response to iron redox changes and heme binding, we prepared the ferric, ferrous, and apoprotein states of cytochrome b5. As in the hemoglobins, the N–H donor proton resonates downfield from its random shift (HSQC data on the ferric form shown in Figure S4A). Also in agreement with previous work,[36] histidine-selective LR-HMQC spectra confirm that His80 is neutral and that its Nδ1 atom is deprotonated (Figure S4B). HNN-COSY data recorded for the ferric protein (Figure S4C), the ferrous protein, and the apoprotein (Figure ) support the presence of the N–H···Nδ1 H-bond (Figure C) in all three forms of the cytochrome. Measurement of 2hJNN returned values of 5.0, 5.0, and 5.3 Hz for the ferric form (313 K), ferrous form (313 K), and apoprotein form (298 K, for stability reasons), respectively (Table and Figure S3P–R). Thus, the His-Pro-Asp N–H···N H-bond of cytochrome b5 depends little on the iron redox state or the presence of the heme. Additionally, the cytochrome b5i-to-i–2 H-bond appears to be similar to those detected within the TrHb1 i-to-i+3 motifs.

Discussion

N–H···N H-Bonding, 2hJNN, and Histidine pKa

The 2hJNN data obtained with the hemoglobins and cytochrome suggest that this parameter can register long-range influences and be a reliable indicator of the state of an N-capping H-bond. However, to interpret the 2hJNN values, it is essential to consider that the observables may be ensemble averages over the H-bonded state having a limiting 2hJNN equal to 2hJNN(capped) and one or more nonbonded states having a 2hJNN equal to 0. We will assume that nonbonded states expose the histidine to solvent, in which case there are three possible open species: the imidazolium (protonated) state, the Nε2-H tautomer (imidazolium pKa ∼ 6.53), and the Nδ1-H tautomer (imidazolium pKa ∼ 6.92).[44,45,52] The two tautomers partition in an ∼5:2 ratio dictated by the pKa values. The set of relevant equilibria is presented in Figure . Although the details of this scheme are expected to differ from protein to protein and conditions to conditions, the thermodynamic analysis provides a framework with which to scrutinize the significance of the 2hJNN measurements.

Figure 6

Scheme used to assess the effect of fast exchange averaging on histidine–amide H-bonding and histidine 15N chemical shifts. The histidine can adopt an H-bonded state in which 2hJNN is active. The open states are characterized by a 2hJNN of 0 and partition according to sample pH and microscopic pKa values. Limiting 15Nδ1 and 15Nε2 chemical shifts[38,47] are listed in parentheses. The limiting 15N chemical shifts for the H-bonded state are estimates based on globins with large Δ15N values and low apparent histidine pKa values. To perform the simulation shown in Figure A, the microscopic pKa of the histidine in the H-bonded state was set to 4, 5, or 6. For each pKa value, the fractional population of all four states was calculated as a function of pH using eqs –5. As the histidine pKa in the capped state approaches that of the open state, the maximal attainable fraction of the H-bonded form drops below 1. Under such conditions, the observed 2hJNN value does not reach 2hJNN(capped) and the observed histidine chemical shifts do not correspond to the pure state limiting chemical shifts. Capping histidine pKas have been published[32,36,53] or were estimated in this work (Figures S6–S8 and Table S2); they suggest that in most cases, the pKa is sufficiently low (<4.5) that a direct measurement of 2hJNN(capped) is possible. Open/closed averaging may be influential for the solvent-exposed cap of AR proteins,[18] CtrHb-B-CN, and cytochrome b5, and in those instances, extrapolation of the observed 2hJNN is necessary to estimate 2hJNN(capped). In this framework, 2hJNN(capped) is a parameter determined primarily by bond length,[54] and Kclose (Figure ) is a measure of bond strength directly related to the free energy of H-bond formation, ΔGf°.[55]

Figure 7

Effect of capping histidine pKa on (A) microstate fractional populations and (B) histidine 15Nδ1 and 15Nε2 chemical shifts. Fast exchange averaging among imidazolium (red trace), capped Nε2-H (black trace), and open Nδ1-H and Nε2-H histidine tautomers (cyan and magenta traces, respectively) was assumed. In panel B, the limiting 15N chemical shift values were as defined in Figure . In both simulations, the capping histidine pKa was set to 4, 5, or 6 as indicated on the traces. As the capping histidine pKa nears the model compound values, a significant proportion of open states persists at neutral and higher pH, which reduces the maximal observable Δ15N. For the i-to-i+3 N–H···N N-cap of consensus AR proteins, we found that buried H-bonds have larger observed couplings (2hJNN ∼ 4–5 Hz) than H-bonds exposed to solvent (2hJNN ∼ 2 Hz). We also note that H-bonds with larger 2hJNN values are exhibited by histidines having a depressed ionization constant (pKa < 3), whereas those with smaller values have an only moderately depressed pKa (∼5.7). Assuming a macroscopic pKa of 5.7 (Figure S6), a simulation similar to that shown in Figure leads to a closed state population of 77% at the pH of the AR 2hJNN measurement (6.6). The model indicates that increasing the pH by one unit leads to a closed state population of 86% and an increase in 2hJNN (+0.23 Hz), in good agreement with pH-dependent NMR data (Figure S6). Extrapolation to a vanishing population of open states predicts a 2hJNN(capped) of ∼2.7 Hz for the solvent-exposed H-bond, which can be meaningfully compared to the 2hJNN(capped) of buried H-bonds [where 2hJNN ∼ 2hJNN(capped) ∼ 4 Hz]. The same analysis was applied to CtrHb-CN. NMR data demonstrate that His79 in WT CtrHb-CN remains neutral until global acid unfolding, suggesting a pKa of <3.5 (Figure S7A,B). In contrast, His79 in L75H CtrHb-B-CN undergoes partial protonation at pH 5.5 in support of a pKa near 5 (Figure S7C,D). Thus, at pH 7.2, the wild-type N–H···Nδ1 H-bond is formed ∼100% of the time and the L75H variant H-bond only ∼95% of the time. A hypothetical fully formed H-bond in L75H CtrHb-B-CN would therefore give rise to a 2hJNN(capped) value of ∼4.2 Hz, lower than the wild-type value of 5.0 Hz. The parameters used in the simulations are listed in Table S3. In both examples, the low 2hJNN(capped) and low observed 2hJNN values are likely manifestations of H-bond lengthening and concomitant increased sampling of open states, respectively. The contribution from open/closed averaging is also possible in cytochrome b5. When the estimated apoprotein pKas of the H-bonded (∼5.0)[36] and open (∼6.9)[32] states were used in the simulation, 2hJNN(capped) was evaluated at ∼5.5 Hz. In general, the modeling can be adjusted to account for deviations of the open state pKa from the fully exposed residue and guide the choice of experimental conditions under which a satisfactory estimate of 2hJNN(capped) can be obtained.

Fast Exchange Averaging and 2hJNN–Chemical Shift Correlations

Histidine nitrogen shifts have been the subject of multiple studies. 14N NMR studies of azoles[56] and extension to 15N NMR of free histidine in solution,[44] model imidazoles,[57] and histidine within the catalytic triad of α-lytic protease[58] define the ring nitrogens as one of two types: the pyrrole α-type, which is protonated and has an upfield 15N shift (167.5 ppm in imidazole and 176.5 ppm in imidazolium),[38] and the pyridine β-type, which is deprotonated and resonates downfield (15N = 249.5 ppm).[38] The canonical limiting values for neutral histidine were recently revised with density functional theory (DFT) calculations,[47] which return shifts of 167.5 ppm (Nε2) and 261.5 ppm (Nδ1) for the Nε2-H tautomer and shifts of 183.5 ppm (Nδ1) and 266.5 ppm (Nε2) for the Nδ1-H tautomer (Figure ). According to the revised values, the theoretical maximal separation between the Nδ1 and Nε2 signals (Δ15N = |δ15Nδ1 – δ15Nε2|) is 94 ppm for the Nε2-H tautomer and 83 ppm for the Nδ1-H tautomer. Under fast exchange on the chemical shift time scale, the 15N chemical shifts of the neutral ring reflect the ∼5:2 Nε2-H:Nδ1-H ratio, and Δ15N is smaller than for the pure tautomeric states. As the pH is lowered, protonation of the ring occurs, which changes the β-nitrogen to an α-like nitrogen and also reduces Δ15N. The N-capping interaction favors the Nε2-H tautomer and contributes primarily to a large Δ15N, but secondarily, H-bonding tends to shift the acceptor Nδ1 β-type nitrogen upfield of its maximal value,[59] thereby leading to a small reduction in Δ15N. These competing influences and local context typically render histidine 15N chemical shifts difficult to interpret. A plot of 2hJNN versus Δ15N is shown in Figure for the globins, cytochrome b5, and AR proteins. The observed 15N separations are below the DFT-derived maximum value of 94 ppm.

Figure 8

Plot of 2hJNN vs histidine Δ15N for globins (black fill), cytochrome b5 (orange fill), and AR proteins (○). Data for WT CtrHb-CN (for the major isomer, 2hJNN = 5.0 Hz and Δ15N = 88.3 ppm) and L75H CtrHb-B-CN (2hJNN = 4.0 Hz, and Δ15N = 81 ppm) are colored blue. Linear regression of the full data set returns a slope of 0.10 Hz/ppm and an x-intercept (Δ15N for 2hJNN = 0 Hz) of 43.3 ppm (r2 = 0.88). Linear regression of the AR protein data alone yields a slope of 0.11 Hz/ppm and an x-intercept of 45.4 ppm (r2 = 0.98), whereas the heme protein data alone (omitting apo cytochrome b5) return a slope of 0.06 Hz/ppm and an x-intercept of 10.5 ppm (r2 = 0.29). Although significant scatter is present in the heme protein data, the composite Δ15N values are positively correlated to 2hJNN such that larger couplings are associated with larger Δ15N. This behavior is characterized by large variations in the acceptor (15Nδ1) chemical shift [larger coupling, larger shift (Table )]. The other ring nitrogen (15Nε2H) displays modest variations that are negatively correlated to 2hJNN (larger coupling, smaller shift). The AR protein data span the greatest range of 2hJNN (1.8–5.2 Hz), Δ15N (61.5–92.0 ppm), and pKa (<3.0–5.7) values; in addition, their 2hJNN versus Δ15N correlation is the strongest. In contrast, the heme protein data cluster around similar 2hJNN (4.0–5.4 Hz), Δ15N (81.0–91.9 ppm), and pKa (<4.0–5.0) values and show only a weak positive correlation between 2hJNN and Δ15N. The positive correlation between the histidine Nδ1 chemical shift and 2hJNN is noteworthy. For a pyridine β-type acceptor that is H-bonded 100% of the time, as H-bond length decreases, we expect the magnitude of 2hJNN to increase and the 15Nδ1 shift to decrease (i.e., move upfield) as the nitrogen nucleus assumes greater α character.[9] The reverse trend is observed. To determine if open/closed averaging may be responsible, we simulated the pH profile of the 15N chemical shifts as a function of histidine pKa predicted by the model shown in Figure . The imidazole 15N shifts, calculated with eqs and 7, are presented in Figure B. Interestingly, the model predicts a linear relationship between Δ15N and the fractional population of the capped form; thus, a linear correlation between Δ15N and the magnitude of 2hJNN may also be expected. Assuming pH 7.2 and 2hJNN(capped) = 5 Hz, the model predicts a 0.09 Hz/ppm line with an x-intercept (Δ15N for 2hJNN = 0 Hz) of 37 ppm, values similar to those obtained from linear regression of the experimental data (Figure ). The 2hJNN values and chemical shifts for heme proteins (Table ) and AR proteins[18] were combined in a plot of 2hJNN coupling versus amide 1H chemical shifts (Figure ). In this plot, the shifts were corrected for random coil values and the effect of nearest neighbors in the primary structure.[60] Paramagnetic effects, though estimated to be small at the cap site (∼0.2 ppm), are likely to contribute to the scatter of the heme protein data along with secondary and tertiary structure differences. Despite this referencing problem, a positive correlation emerges by which greater 1H deshielding accompanies greater 2hJNN coupling, as anticipated.[61]

Figure 9

Plot of 2hJNN vs the amide 1H-corrected chemical shift in heme proteins (fill colors correspond to Figure S5) and AR proteins (◇).[18] Linear regression of the entire data set gives a slope of 1.13 Hz/ppm and a y-intercept of 1.62 Hz [r2 = 0.84 (—)]. The non-zero y-intercept is attributed to the lack of data at low J values. Fixing the y-intercept to zero yields a best fit slope of 1.72 Hz/ppm [r2 = 0.59 (−–−)]. Linear regression of the heme protein data alone returns a slope of 1.13 Hz/ppm and a y-intercept of 1.56 Hz [r2 = 0.55 (not shown)]. The chemical shift trend in Figure is reminiscent of the trend observed for Hoogsteen and Watson–Crick base pairs in triplex DNA[34] and prompts a comparison of the new protein data to published nucleic acid information. In nucleic acids, 2hJNN has been especially useful for inspecting the diversity of H-bond geometries. For example, ultra-high-resolution crystallographic structures of model U-A and G-C base pairs support the possibility that the former have slightly shorter N–N H-bonding distances (Δd ∼ 0.05 Å).[62,63] In agreement, the central imino N1–H···N3 H-bond within an RNA G-C base pair typically displays 2hJNN values between 5 and 7 Hz, on average smaller than those of the imino N3–H···N1 H-bond found in RNA (DNA) U-A (T-A) pairs (6–8 Hz).[3,4,64] Similarly, cooperative networks of amino N2–H···N7 H-bonds found in G-quadruplex DNA display 2hJNN values of 6–8 Hz.[35] Hoogsteen C-G base pairs involving protonated cytidine contain short, electrostatically stabilized H-bonds (N3–H···N7) and display the largest 2hJNN values yet measured in a biomacromolecule (10–11 Hz).[34] On the other hand, A-A mismatches, which have amino N6–H···N7-type H-bonds, tend to display low 2hJNN values (2–3 Hz), in support of their nonideal geometry.[39] In addition, a positive correlation between the imino donor 1H chemical shift and the magnitude of 2hJNN, which encompasses C+(N3)–G(N7), T(N3)–A(N1), T(N3)–A(N7), and G(N1)–C(N3) bond types within triplex DNA, has been reported.[34] Similar observations have been made between the amide donor 1H shift and |3hJNC′| within N–H···O=C′ protein H-bonds.[11] The parallel between protein and nucleic acid N–H···N data is apparent in a plot of 2hJNN couplings versus raw 1H chemical shifts (Figure ). The different absolute magnitude of protein and nucleic acid 2hJNN values supports the idea that in solution, both Watson–Crick and Hoogsteen N–H···N H-bonds in the DNA triplex are shorter (and possibly stronger) than the helix-capping amide N–H···Nδ1 His H-bond in proteins. This observation may reflect the fact that the difference in pKa between the donor and acceptor groups in the nucleic acids is significantly smaller than that of an amide–histidine pair.[7,65] The analysis of combined raw protein amide and nucleic acid imino 1H chemical shifts is particularly susceptible to the referencing problem mentioned above, but remarkably, the aggregate data describe a relatively constant slope and consistent linear behavior.

Figure 10

Plot of 2hJNN vs donor 1H chemical shift for globin and cytochrome b5 (this work), AR protein,[18] and nucleic acid[34] N–H···N H-bonds. A linear regression of the combined data returns a slope of 1.13 Hz/ppm and an x-intercept (1H chemical shift for 2hJNN = 0) of 6.83 ppm (r2 = 0.94).

2hJNN and Internuclear Distance

As per Figure , capping histidines with a pKa near or below 4 and open state pKas close to model compound values are expected to have a neutral-pH 2hJNN equal to 2hJNN(capped) and be directly related to the geometry of the H-bond. Del Bene and co-workers have demonstrated that the coupling originates principally from the Fermi contact interaction and that, accounting for differences in equilibrium bond length, the coupling is not particularly sensitive to the hybridization or charge state of the donor and acceptor groups.[6,7,54] The inter-nitrogen distance is the main determinant of the coupling magnitude, with angular dependence becoming steep only at large deviations from ideal geometry. The distance dependence is captured by an exponential decay[7] that can be used to rationalize our results. Figure places the estimated 2hJNN(capped) in this context. The range covered by the hemoglobin and cytochrome values is framed by red dotted lines; the corresponding N–N atomic separations are between ∼2.98 and 3.04 Å. The lowest predicted 2hJNN(capped) of 2.7 Hz is associated with a separation of 3.16 Å. Dependence of 2hJNN(capped) on inter-nitrogen distance as approximated by the Fermi contact contribution to coupling.[7] The equation is 2hJNN (Hz) = 795579 exp(−3.9868 d), where d is the inter-nitrogen distance in angstroms.[7] The red dashed lines bracket the values measured in this work. The green dashed line indicates the 2hJNN(capped) predicted for the solvent-exposed cap of AR proteins. The blue dashed line indicates the value reported for the His24–His119 pair of apomyoglobin.[17] The optimized histogram[49] (right ordinate) is for 803 nonredundant i-to-i–2 and i-to-i+3 distances of <3.2 Å obtained from the PDB survey. The accuracy of J-derived distances can be cross-validated with a survey of N-capping characteristics in available protein crystallographic structures. For the i-to-i–2 H-bonds with an intervening Pro, the mean inter-nitrogen distance is 3.00 ± 0.08 Å (n = 298). For the i-to-i+3 H-bonds, the distribution has a similar mean of 3.00 ± 0.09 Å (n = 70). These values are in good agreement with the predicted distances obtained from our solution 2hJNN data. Interestingly, the His–His Nε2-H···Nε2 bond in apomyglobin has a significantly larger 2hJNN (∼11 Hz).[17] Inspection of multiple holomyoglobin structures reveals a short inter-nitrogen distance [2.77 ± 0.10 Å (n = 77)]. The histidine pair has the same properties in apo- and holomyoglobin (buried, low acceptor pKa),[66] and it is therefore reasonable to expect that the H-bond characteristics are maintained. Again using the Fermi contact curve of Figure ,[7] a remarkable correspondence between the measured J and N–N distance is obtained (blue dashed line). It is difficult to associate a free energy with the variation in 2hJNN detected here, in part because of uncertainty in the determination of Kclose in Figure . However, the work of Del Bene suggests that in the relevant range of distances, a 1 Hz difference in 2hJNN(capped) corresponds to several kilojoules per mole in binding energy.[54] In agreement, the 2 unit increase in histidine pKa for CtrHb-B compared to that of CtrHb (Figure S7) corresponds to a 100-fold change in Kclose (assuming a common model pKa for the open states) and gives rise to a 0.8 Hz difference in 2hJNN(capped). Thus, qualitatively, the perturbations caused by heme modification do appear to have a significant local effect on H-bond geometry and, consequently, on the open/closed equilibrium of the helix cap. A full quantitative analysis would require knowledge of the pH dependence of protein stability and measurement of the histidine “open” state pKas.

Conclusion

In this work, we demonstrated that helix-capping N–H···N H-bonds can be routinely detected in 15N-labeled proteins using hydrogen bond scalar coupling experiments. Direct assignment of H-bonding nuclei was achieved by tailoring HNN-COSY and CTSE difference experiments for protein amide 15N–1H and histidine 15Nδ1 nuclei. Evaluation of the 2hJNN coupling constants, along with knowledge of the histidine pKa, provides a convenient metric for the length and relative strength of N–H···N H-bonds. In addition, a linear correlation between 2hJNN couplings and donor 1H chemical shift emerges as observed for N–H···N H-bonds in Watson–Crick and Hoogsteen DNA base pairs. The absolute magnitude of the protein couplings tends to be smaller than those in nucleic acids and hints that in solution, DNA/RNA N–H···N H-bond lengths are likely shorter (∼2.8–2.9 Å) than those in the studied proteins (∼3.0–3.1 Å). Other varieties of N–H···N H-bonds, for example, those using tryptophan as a donor or deprotonated lysine as an acceptor, are feasible as well. Indeed, the LR-HNN-COSY experiments[51] (Figure E) could be used to overcome instances in which rapid hydrogen exchange precludes detection of the shared proton. The open/closed capping model (Figure ) suggests that longer, weaker H-bonds will lead to decreased 2hJNN (and decreased amide 1H and imidazole Δ15N chemical shifts) through two primary mechanisms: (1) increased N–N distance leading to decreased n → σ* donation and weaker Fermi contact interaction and (2) enhanced sampling of the open non-H-bonded states (each with 2hJNN = 0). Thus, only for strong H-bonds [∼100% closed, where 2hJNN = 2hJNN(capped)] can the relative magnitude of 2hJNN be interpreted purely in terms of H-bond geometry. For weaker H-bonds, the expected distance and angular dependencies are obscured; specifically, open/closed averaging will always lead to a decrease in the observed 2hJNN value and therefore a population-dependent overestimate of H-bond length. We envision that 2hJNN measurements can be routinely extended to resolve persistent questions of enzyme mechanism. Perturbation of individual H-bonds could be probed in the presence or absence of inhibitors, within a series of homologous enzymes or variants, and under different environmental conditions. The sensitivity of 2hJNN to bonding geometry and time-averaged population could reveal information not otherwise accessible by other solution methods or inspection of crystallographic structures. Several proteins for which NMR data have been published can be used as examples. (1) His95, in the active site of triosephosphate isomerase (PDB entry 1TIM),[67] has a depressed pKa that allows this residue to be an efficient electrophile over a broad pH range.[68] H-Bond formation with the backbone amide of Glu97 is thought to be responsible at least in part for the low pKa.[68,69] (2) Likewise, His187, in the active site of uracil DNA glycosylase, functions in the neutral state, and its low pKa is attributed to a capping interaction.[70,71] (3) In E2 ligases, the histidine of the conserved His-Pro-Asn motif has been proposed to play a catalytic role, but there is also support for a strictly structural role (histidine locked in the cap position).[72] (4) Tautomeric state switching in the dual-histidine motif of peptidyl prolyl isomerase has been proposed as an integral feature of the allosteric mechanism.[73] In each of these instances, measurements of 2hJNN HBCs would settle controversies or provide a robust indicator of H-bond presence and strength in the solution state.

55 in total

1. Cloning, expression, purification, and preliminary characterization of a putative hemoglobin from the cyanobacterium Synechocystis sp. PCC 6803.

Authors: N L Scott; J T Lecomte
Journal: Protein Sci Date: 2000-03 Impact factor: 6.725

2. Ligand-induced strain in hydrogen bonds of the c-Src SH3 domain detected by NMR.

Authors: F Cordier; C Wang; S Grzesiek; L K Nicholson
Journal: J Mol Biol Date: 2000-12-08 Impact factor: 5.469

3. A DFT study of the interresidue dependencies of scalar J-coupling and magnetic shielding in the hydrogen-bonding regions of a DNA triplex.

Authors: M Barfield; A J Dingley; J Feigon; S Grzesiek
Journal: J Am Chem Soc Date: 2001-05-02 Impact factor: 15.419

4. Sequence-dependent correction of random coil NMR chemical shifts.

Authors: S Schwarzinger; G J Kroon; T R Foss; J Chung; P E Wright; H J Dyson
Journal: J Am Chem Soc Date: 2001-04-04 Impact factor: 15.419

5. A novel two-over-two alpha-helical sandwich fold is characteristic of the truncated hemoglobin family.

Authors: A Pesce; M Couture; S Dewilde; M Guertin; K Yamauchi; P Ascenzi; L Moens; M Bolognesi
Journal: EMBO J Date: 2000-06-01 Impact factor: 11.598

6. Truncated hemoglobin from the cyanobacterium Synechococcus sp. PCC 7002: evidence for hexacoordination and covalent adduct formation in the ferric recombinant protein.

Authors: Nancy L Scott; Christopher J Falzone; David A Vuletich; Jindong Zhao; Donald A Bryant; Juliette T J Lecomte
Journal: Biochemistry Date: 2002-06-04 Impact factor: 3.162

4. Characterization of Ionizable Groups' Environments in Proteins and Protein-Ligand Complexes through a Statistical Analysis of the Protein Data Bank.

Authors: Alexandre Borrel; Anne-Claude Camproux; Henri Xhaard
Journal: ACS Omega Date: 2017-10-30

5. Site-selective ¹³C labeling of histidine and tryptophan using ribose.

Authors: Ulrich Weininger
Journal: J Biomol NMR Date: 2017-08-30 Impact factor: 2.835

5 in total