Alexander Klein1,2, Petra Rovó1, Varun V Sakhrani3, Yangyang Wang3, Jacob B Holmes3, Viktoriia Liu3, Patricia Skowronek1, Laura Kukuk2, Suresh K Vasa1,2, Peter Güntert4,5,6, Leonard J Mueller3, Rasmus Linser7,2. 1. Department of Chemistry and Pharmacy, Ludwig Maximilians University, 81377 Munich, Germany. 2. Department of Chemistry and Chemical Biology, TU Dortmund University, 44227 Dortmund, Germany. 3. Department of Chemistry, University of California, Riverside, CA 92521. 4. Institute of Biophysical Chemistry, Goethe University, 60438 Frankfurt am Main, Germany. 5. Laboratory of Physical Chemistry, Eidgenössische Technische Hochschule (ETH) Zürich, 8093 Zürich, Switzerland. 6. Department of Chemistry, Tokyo Metropolitan University, Tokyo 192-0397, Japan. 7. Department of Chemistry and Pharmacy, Ludwig Maximilians University, 81377 Munich, Germany; rasmus.linser@tu-dortmund.de.
Abstract
NMR chemical shifts provide detailed information on the chemical properties of molecules, thereby complementing structural data from techniques like X-ray crystallography and electron microscopy. Detailed analysis of protein NMR data, however, often hinges on comprehensive, site-specific assignment of backbone resonances, which becomes a bottleneck for molecular weights beyond 40 to 45 kDa. Here, we show that assignments for the (2x)72-kDa protein tryptophan synthase (665 amino acids per asymmetric unit) can be achieved via higher-dimensional, proton-detected, solid-state NMR using a single, 1-mg, uniformly labeled, microcrystalline sample. This framework grants access to atom-specific characterization of chemical properties and relaxation for the backbone and side chains, including those residues important for the catalytic turnover. Combined with first-principles calculations, the chemical shifts in the β-subunit active site suggest a connection between active-site chemistry, the electrostatic environment, and catalytically important dynamics of the portal to the β-subunit from solution.
NMR chemical shifts provide detailed information on the chemical properties of molecules, thereby complementing structural data from techniques like X-ray crystallography and electron microscopy. Detailed analysis of protein NMR data, however, often hinges on comprehensive, site-specific assignment of backbone resonances, which becomes a bottleneck for molecular weights beyond 40 to 45 kDa. Here, we show that assignments for the (2x)72-kDa protein tryptophan synthase (665 amino acids per asymmetric unit) can be achieved via higher-dimensional, proton-detected, solid-state NMR using a single, 1-mg, uniformly labeled, microcrystalline sample. This framework grants access to atom-specific characterization of chemical properties and relaxation for the backbone and side chains, including those residues important for the catalytic turnover. Combined with first-principles calculations, the chemical shifts in the β-subunit active site suggest a connection between active-site chemistry, the electrostatic environment, and catalytically important dynamics of the portal to the β-subunit from solution.
The family of pyridoxal-5′-phosphate (PLP)–dependent enzymes catalyze a wide variety of chemical transformations including transamination, racemization, decarboxylation, elimination, and substitution (1, 2). The large number of PLP enzymes and their crucial metabolic functions make them drug targets for the treatment of diseases including tuberculosis, epilepsy, and Parkinson’s disease (3, 4). Fig. 1 depicts the crystal structure of Salmonella typhimurium tryptophan synthase (TS) (5). TS itself is both an important drug target in the context of continuously emerging bacterial antibiotics resistance (6) and of great interest in biotechnology (7) as an enantiospecific source of a large variety of unnatural amino acids and their derivatives (Fig. 1) (8, 9). Wild-type TS catalyzes the final two steps in tryptophan biosynthesis: production of indole from indole-3-glycerol phosphate (IGP) and its subsequent condensation reaction with L-serine to give L-tryptophan (5, 10, 11). As for many other enzymes, X-ray structural data are abundant, but the rational design of therapeutic agents and the understanding and engineering of catalysis, in particular regarding the β-subunit enzymatic reaction, hinge on the availability of detailed knowledge of the chemical and electrostatic properties of the active site. Fig. 1 shows the initial steps of the β-subunit reaction, which acts as a pivot for the overall reaction and selectivity of the catalytic cycle. Nucleophilic attack of the PLP cofactor in the β-subunit active site is thought to involve activation of C4′ (Fig. 1) by protonation of βLys87 Nζ. However, the thermodynamic and kinetic details of potential tautomeric exchange are currently missing (11–13). Such features, in particular protonation, hybridization, and tautomeric states of the active-site side chains and substrates, cannot be directly determined by protein crystallography or cryoelectron microscopy (cryo-EM) but are accessible from NMR chemical shifts.
Fig. 1.
TS is an αββα heterodimer with an asymmetric unit of 72 kDa. (A) Topology for the internal aldimine resting state (PDB ID: 4HT3) (5). (B) The natural product tryptophan (black) and a selection of additional substrates and products of engineered TS enzymes for biotechnological applications (gray). The lower right compound represents thaxtomin A, a natural product synthesized from 4-nitroTrp (Lower Left) (8, 9). (C) Initial step of the β-subunit catalytic cycle, drawn with a protonated βK87 Schiff base (red box).
TS is an αββα heterodimer with an asymmetric unit of 72 kDa. (A) Topology for the internal aldimine resting state (PDB ID: 4HT3) (5). (B) The natural product tryptophan (black) and a selection of additional substrates and products of engineered TS enzymes for biotechnological applications (gray). The lower right compound represents thaxtomin A, a natural product synthesized from 4-nitroTrp (Lower Left) (8, 9). (C) Initial step of the β-subunit catalytic cycle, drawn with a protonated βK87 Schiff base (red box).NMR spectroscopy has been invaluable for addressing the chemical features and dynamics of molecules across disciplines. Mechanistic studies of enzymatic catalysis by NMR have been indispensable for complementing the insights from crystallography, cryo-EM, optical spectroscopy, and computational simulation (14–16). In addition to atomic-resolution access to protein dynamics and domain motion, the individual chemical shifts themselves are a prime source of information. Even though bulk properties from relaxation or diffusion measurements can sometimes be sufficient to address specific biological questions, the site-specific assignment of chemical shifts is a common prerequisite for elucidation of structure, dynamics, and interactions in more detail. For proteins exceeding a monomer molecular weight of 40 to 45 kDa, resonance assignments become a major bottleneck (17), as witnessed by the scarcity of proteins with substantial backbone assignments in this range (). Site-specific amino acid labeling of canonical nuclei (18), the introduction of noncanonical probes such as 19F (19), and the use of different types of methyl labeling (20) are examples of approaches used to strongly reduce the otherwise excessive spectral overlap in large proteins. Whereas such a reduction of complexity can be very potent to answer important biological questions even for extremely large systems, a wealth of common and versatile NMR approaches are tied to resonance assignment of the protein backbone, including backbone relaxation and relaxation dispersion, secondary structural propensities, H-N residual dipolar couplings, and H/D exchange. Other applications, such as high-resolution structure calculations, even rely on (close-to) complete resonance assignments of both backbone and side chains. In order to facilitate assignments, particularly as the size of the system increases, the resonances are usually dispersed by appending additional dimensions to multidimensional NMR experiments (21, 22).Solid-state NMR (ssNMR) has been established as an atomic-level probe capable of providing insights into structure, intermolecular interactions, and dynamics in increasingly complex targets of higher effective (oligomeric) molecular weight (23–27). In particular, detailed insights have been obtained for supramolecular assemblies like virus capsids or large-scale cellular architectures (28), fibrillar proteins, including those associated with neurodegenerative diseases (29, 30), and membrane proteins within a lipid bilayer (31). Recently, innovations in sample preparation, most notably various deuteration strategies (32–35), paramagnetic doping (36–38), and hardware for increasingly fast magic-angle-spinning (MAS) (39–41), have led to a large pool of proton-detected ssNMR methodology. Combined with a series of smart spectroscopic approaches (42–46), this framework has been facilitating access to atom-specific chemical-shift assignments in increasingly challenging target proteins (25–27, 47, 48).As the size of the target protein system increases, the number of molecules in the MAS rotor decreases. The corresponding decrease in signal intensity can, in principle, be compensated for by increased measurement times and the use of higher magnetic fields; even still, the increasing extent of resonance overlap and the resulting ambiguities in the assignment of proteins of high complexity remains a main limiting factor. Higher-dimensional ssNMR experiments have specifically been developed to ameliorate resonance overlap in spectral assignments (24, 45, 49–53), structure calculation (43, 54–56), and characterization of protein dynamics (53). Still, a wealth of proteins of medical, biological, or biotechnological interest remain significantly more complex than those accessible to NMR assignment to date, which calls for further methodological developments for characterization of high-molecular-weight targets.As TS is an almost completely α-helical enzyme and has an asymmetric unit of 665 individual amino acids (72 kDa) and a molecular weight of 144 kDa for the full αββα complex, chemical-shift assignments have been available only for specifically labeled cofactor, substrates, and individually labeled residues (5, 11, 13, 57–59). To enable chemical-shift assignments for access to site-specific chemical properties and other downstream NMR analyses of TS, we introduce a higher-dimensionality ssNMR approach based on proton-detected, fast-MAS ssNMR spectroscopy. Focusing specifically on the β-subunit active site, this strategy reveals important features of residue βK87, which holds the PLP cofactor. In particular, the active site’s chemical nature is characterized by the Schiff base comprising a fast tautomeric exchange between the protonated and unprotonated forms (red box in Fig. 1). This tautomeric equilibrium, moreover, appears to be coupled to variations of the pocket architecture on an intermediate timescale consistent with substrate transport and trapping.
Results
Access to Complex Target Proteins via 1H-Detected, Fast-MAS ssNMR.
Higher-dimensionality (>3D [three-dimensional]) experiments are a direct approach to increasing the effective resolution of NMR correlation experiments. For example, Fig. 2 demonstrates the increase in dispersion for backbone experiments from 3D to 5D for TS, which comprises strong overlap in the 2D H/N plane (Fig. 3). However, sensitivity typically suffers from the multitude of transfer steps and evolution periods required when going to higher dimensionality. Compared to the exponentially decreasing transfer efficiency with molecular size in solution, however, ssNMR polarization transfer efficiency is independent of molecular weight (45, 60). This, in conjunction with the associated long coherence lifetimes and the absence of high-power decoupling, makes proton-detected, fast-MAS ssNMR approaches well suited when complex (and, in particular, higher-dimensionality) experiments are desired (Fig. 2). For NMR experiments exceeding three dimensions, nonuniform sampling (NUS; used here with down to <0.01% sampling density) and spectral reconstruction are commonly used to accelerate data acquisition (24, 48, 50, 52, 54, 55), allowing the experimental time to be determined by sensitivity instead of resolution. The approach of automated projection spectroscopy, which is compatible with the same pulse sequences as shown here, shares a similar goal and has been shown to facilitate assignment in cases in which peak picking in 2D source spectra is possible (45, 61).
Fig. 2.
Strategies employed for site-specific resonance assignment of TS. (A) Magnetization pathways for the 4D hCACBcaNH/hCACBcacoNH and 4D hCACONH/hCOCANH pairs, the 5D HNcoCANH, and the 4D S2B [side chain-to-backbone] experiment (with a 4D hCCNH pathway). All experiments are cross polarization (CP) based; only for Cα/Cβ transfers INEPTs (Insensitive Nuclei Enhanced by Polarization Transfers) were employed. For the S2B experiment, MOCCA was used for mixing, and both proton and carbon polarization was enCOPORADEd (COmbined POlarization from long-Range transfer And Direct Excitation) (72, 73). (See all acquisition and processing parameters in .) (B) The gain in dispersion for higher dimensionality witnessed for αE135 as a representative residue in the overlapping region between 8 and 10 ppm, comparing three (Top, 3D hCANH in gray), four (Center, 4D hCOCANH in cyan and 4D hCACBcaNH in yellow), and five dimensions (Bottom, 5D HNcoCANH in blue) (noncrossed peaks: maxima in clearly different planes). All experiments were recorded on a deuterated and 1H-back-exchanged, 13C/15N-labeled sample at 700 MHz proton Larmor frequency at 55 kHz MAS and 30 °C in a 1.3-mm rotor. (C) Generic assessment of the growing benefits of ssNMR for increasingly complex experiments (in particular, increasing dimensionality) as a function of molecular weight, relative to simple 2D correlations for a 10-kDa protein in the respective aggregation state. (Details of the parameters used are in .) Gray arrow: trend for molecular weights of break even; numbers on the right: trends for sensitivity advantages over NMR in solution at 100 kDa (monomeric protein), increasing with experiment complexity.
Fig. 3.
Higher-dimensionality ssNMR for assignment of TS. (A) The 2D H/N correlation of TS (proton–back-exchanged, perdeuterated at 55 kHz MAS and 700 MHz 1H Larmor frequency). (B) Backbone walk via a 5D HNcoCANH (blue), shown via gray arrows for βG84 to βH86 in their respective 2D Hi+1/Ni+1 slices (taken from Hi/Ni/Cαi coordinates as depicted in the upper right corner), overlayed on the H/N plane. The full set of experiments recorded is shown for these residues in . (C) Acquisition and processing of the 5D HNcoCANH, acquired as a sparse NUS dataset and reconstructed via SSA (89) in conjunction with a 3D hCANH experiment that shares the Cαi, 15Ni, and 1Hi dimensions with the 5D. Fourier transformation is performed by SMFT (68), generating two-dimensional 1Hi+1 (F1)/15Ni+1 (F2) planes at F3/F4/F5 i positions derived from the 3D peak list (also ). (D) Occurrence of assignment ambiguity within all of three carbon chemical shifts (as used in common, 13C-match-making assignment experiments) for TS, only considering assigned shifts. (E) Ambiguity upon linking experiments that provide residue type information (e.g., hCACBcaNH or hCCNH) with either H/N only (dark red) or H/N/Cα (gray) shifts, which are the shift combinations available from sequential linking via either 4D or 5D amide-to-amide correlations, respectively. See details in the caption. In D and E, the y-axes are normalized to the overall number of residues assigned sufficiently.
Strategies employed for site-specific resonance assignment of TS. (A) Magnetization pathways for the 4D hCACBcaNH/hCACBcacoNH and 4D hCACONH/hCOCANH pairs, the 5D HNcoCANH, and the 4D S2B [side chain-to-backbone] experiment (with a 4D hCCNH pathway). All experiments are cross polarization (CP) based; only for Cα/Cβ transfers INEPTs (Insensitive Nuclei Enhanced by Polarization Transfers) were employed. For the S2B experiment, MOCCA was used for mixing, and both proton and carbon polarization was enCOPORADEd (COmbined POlarization from long-Range transfer And Direct Excitation) (72, 73). (See all acquisition and processing parameters in .) (B) The gain in dispersion for higher dimensionality witnessed for αE135 as a representative residue in the overlapping region between 8 and 10 ppm, comparing three (Top, 3D hCANH in gray), four (Center, 4D hCOCANH in cyan and 4D hCACBcaNH in yellow), and five dimensions (Bottom, 5D HNcoCANH in blue) (noncrossed peaks: maxima in clearly different planes). All experiments were recorded on a deuterated and 1H-back-exchanged, 13C/15N-labeled sample at 700 MHz proton Larmor frequency at 55 kHz MAS and 30 °C in a 1.3-mm rotor. (C) Generic assessment of the growing benefits of ssNMR for increasingly complex experiments (in particular, increasing dimensionality) as a function of molecular weight, relative to simple 2D correlations for a 10-kDa protein in the respective aggregation state. (Details of the parameters used are in .) Gray arrow: trend for molecular weights of break even; numbers on the right: trends for sensitivity advantages over NMR in solution at 100 kDa (monomeric protein), increasing with experiment complexity.Higher-dimensionality ssNMR for assignment of TS. (A) The 2D H/N correlation of TS (proton–back-exchanged, perdeuterated at 55 kHz MAS and 700 MHz 1H Larmor frequency). (B) Backbone walk via a 5D HNcoCANH (blue), shown via gray arrows for βG84 to βH86 in their respective 2D Hi+1/Ni+1 slices (taken from Hi/Ni/Cαi coordinates as depicted in the upper right corner), overlayed on the H/N plane. The full set of experiments recorded is shown for these residues in . (C) Acquisition and processing of the 5D HNcoCANH, acquired as a sparse NUS dataset and reconstructed via SSA (89) in conjunction with a 3D hCANH experiment that shares the Cαi, 15Ni, and 1Hi dimensions with the 5D. Fourier transformation is performed by SMFT (68), generating two-dimensional 1Hi+1 (F1)/15Ni+1 (F2) planes at F3/F4/F5 i positions derived from the 3D peak list (also ). (D) Occurrence of assignment ambiguity within all of three carbon chemical shifts (as used in common, 13C-match-making assignment experiments) for TS, only considering assigned shifts. (E) Ambiguity upon linking experiments that provide residue type information (e.g., hCACBcaNH or hCCNH) with either H/N only (dark red) or H/N/Cα (gray) shifts, which are the shift combinations available from sequential linking via either 4D or 5D amide-to-amide correlations, respectively. See details in the caption. In D and E, the y-axes are normalized to the overall number of residues assigned sufficiently.
Experimental Strategies.
For assignment and downstream analysis in TS, we used a triple-labeled, proton back-exchanged, and Cu-edta–doped microcrystalline sample (Materials and Methods). For residue linking, we first employed 4D experiments for carbon match making [hCACONH and hCOCANH (24, 50, 51), green/cyan in Fig. 2, and hCACBcaNH and hCACBcacoNH, yellow/red, constructed from existing 3D schemes (62–64) via additional chemical-shift evolution on Cα and full magnetization transfer between Cα and Cβ]. Such carbon-based approaches (64, 65) usually involve 1) finding a given CC combination from a first experiment within a second one and 2) finding the H/N coordinates associated there again in the first experiment. Hence, the ambiguity for identifying the next H/N corresponds to the overlap within a 3D Cα/Cβ/CO, and the correct selection of the next CCC combination is ruled by 2D H/N dispersion.We (and, simultaneously, the Pintacuda laboratory) previously developed sensitive amide-to-amide correlation experiments (HNCOCANH-type experiments) in three and four dimensions (53, 66, 67), which have also been exploited for projection spectroscopy in the solid state (45). The direct linking between amides in such experiments circumvents any ambiguity of CC matching steps, which is an important aspect in TS, given that here, even Cα/CO/Cβ triple overlap occurs for 65% of the residues (Fig. 3 and ) and that each ambiguity scales the number of possibilities in an exponential fashion. In the case of overlapping H/N signals, however, ambiguity remains that can only be resolved in a combinatorial way in conjunction with the residue type information associated with 13C shifts. These are, however, absent in 4D HNcocaNH-type experiments, such that the connection of sequential linking and residue type is again diffused by the level of H/N overlap. Therefore, we expanded HNCOCANH-type experiments to NUS 5D experiments. Like their 3D/4D counterparts, the 5D HNcoCANH (Fig. 2, blue)—or, likewise, the (inverted) HNcaCONH—each allow for a backbone walk based on amide-to-amide connectivities individually (Fig. 3), however, now identifying a source residue by H/N/C shifts (i.e., three rather than two dimensions) and identifying its neighbor by H/N shifts (two further dimensions). This renders one out of the two connected residues (the source residue) rather unambiguously characterized, as the H/N/Cα triple facilitates correlating it with the set of side chain shifts and thus residue type. (The overall extent of H/N/CA overlap in TS is 3× lower than H/N overlap; see Fig. 3.) This combination between sharp sequential connectivities and the residue type–specific knowledge for 13C shifts with intrinsically correct referencing enables short strips of sequential connections obtained from these five-dimensional experiments to be mapped onto the known primary sequence and makes this experiment more potent than the respective 4D version. HNcoCANH and HNcaCONH pathways can be set up with a high bulk sensitivity of ∼8% and ∼5%, respectively, relative to an hNH (compare ). The 5D NUS data were treated as established for solution NMR (brief description in ) using sparse multidimensional Fourier transformation (SMFT) (68) (Fig. 3). shows the setup of both sequences for the case of the SH3 domain of chicken α-spectrin, including a high-quality 5D dataset obtainable (for this 7.2-kDa protein) in only 1.5 d ().Whereas backbone assignments have been playing a major role for proton-detected ssNMR, side chain nuclei exceeding the usual Cβ have mostly been ignored in recent methodological efforts. This is despite their obvious significance for structure calculation (41, 69–71), as a reporter on protein chemical features and interactions, and their value for residue type information. Here, we generated a 4D hCCNH version of the side chain-to-backbone (S2B) experiment (42, 53, 72) based on modified phase-cycled Carr–Purcell (MOCCA) mixing (72, 73) (beige in Fig. 2), which we had proposed in a 3D fashion originally. The 4D hCCNH experiment yields the set of side chain carbons (in one dimension) dispersed by their H/N/Cα shifts in one additional dimension each. Finally, 2D H/C correlations and variable-temperature H/N spectra were acquired. Further material on assignment strategies, all pulse schemes and acquisition parameters, and a comparison of bulk signal intensities for the individual experiments are shown in , respectively. gives examples for the inverted HNcaCONH pulse sequence. In addition, we used pseudo-4D, R1ρ-edited hCONH experiments to warrant reasonable dispersion in a first assessment of TS relaxation. (See considerations on dimensionality in assignment versus relaxation experiments in .) As the assignment process of TS was still a large effort, even with the multitude of data sets available, systematic evaluation of the quantitative benefits of each individual dataset could only be performed with respect to the combinatorial/statistical assessments presented.
Assignment of TS.
Assignments were supported by state-of-the-art computational capability via FLYA (74). Modifications (magnetization pathways, tolerances for chemical-shift matching, validation criteria, etc.) are described in detail in . This computationally aided strategy enables a residue-specific, quantitative assessment of assignment quality in an iterative way. Reliable assignment benefits from the high redundancy introduced by the combination of multiple, mutually consistent higher-dimensionality approaches. ( shows a set of spectral excerpts for an exemplary stretch of residues.) In total, chemical-shift assignments were obtained that enable potential downstream analyses for up to 498 residues (74.8% of the 665 total residues or 79.0% of the 630 nonproline residues). Fig. 4 depicts the backbone chemical-shift assignments of TS that satisfy highly conservative validation with stringent exclusion criteria in FLYA with respect to next-neighbor assignments (details in ). Missing residues, other than prolines, derive from assignment ambiguity or insufficient intensity upon reconstruction due to exchange broadening or H/D back-exchange. Chemical-shift tables can be found in as well as under Biological Magnetic Resonance Data Bank (BMRB) entry 51166 (75) .
Fig. 4.
Chemical-shift assignment in TS from ssNMR. (A) Assigned residues for the α- (Top) and β-subunit (Bottom) and analysis with respect to the secondary structure predicted by TALOS-N (90). Cyan to green colors denote high-confidence assignment in modified FLYA quality assessment, requiring all of HN, N, Cα, CO, and Cβ of a residue to be individually assigned as “strong,” which rating itself is defined very conservatively as detailed in . Gray tones denote that a subset of the five nuclei within a residue are designated as “strong” assignments, whereas the remaining nuclei of the residue have shifts that are likely correct also but do not reach the same confidence level. TALOS predictions are compared with secondary structure found in crystals structure 4HT3 (5) (Top). Boxes highlight domains considered to undergo major conformational changes upon ligand binding at different states of the catalytic cycle (5), also bearing low assignment coverage. Mismatches between TALOS predictions and the crystal structure are expected for residues with incompletely assigned shifts, at the edges of assigned regions, and for short stretches (e.g., β233 to β243 or β261 to β265), while secondary chemical shifts () are usually consistent. (B) Residues with strong assignments for all of the above-mentioned nuclei marked on the asymmetric unit (630 nonproline residues) of the crystal structure (green, PDB ID: 4HT3).
Chemical-shift assignment in TS from ssNMR. (A) Assigned residues for the α- (Top) and β-subunit (Bottom) and analysis with respect to the secondary structure predicted by TALOS-N (90). Cyan to green colors denote high-confidence assignment in modified FLYA quality assessment, requiring all of HN, N, Cα, CO, and Cβ of a residue to be individually assigned as “strong,” which rating itself is defined very conservatively as detailed in . Gray tones denote that a subset of the five nuclei within a residue are designated as “strong” assignments, whereas the remaining nuclei of the residue have shifts that are likely correct also but do not reach the same confidence level. TALOS predictions are compared with secondary structure found in crystals structure 4HT3 (5) (Top). Boxes highlight domains considered to undergo major conformational changes upon ligand binding at different states of the catalytic cycle (5), also bearing low assignment coverage. Mismatches between TALOS predictions and the crystal structure are expected for residues with incompletely assigned shifts, at the edges of assigned regions, and for short stretches (e.g., β233 to β243 or β261 to β265), while secondary chemical shifts () are usually consistent. (B) Residues with strong assignments for all of the above-mentioned nuclei marked on the asymmetric unit (630 nonproline residues) of the crystal structure (green, PDB ID: 4HT3).NMR chemical shifts are direct reporters on the chemical and electrostatic properties of individual sites within the protein. The chemical-shift values of side chain moieties in particular are shaped by their individual protonation, tautomerization, and H-bonding properties. Conversely, such features can be inferred when shifts have been determined. Chemical-shift data represent the foundation for investigating the catalytic mechanism, often with assistance from crystallography and first-principles calculations (5, 11, 57, 58, 59). Here, we exploit the availability of TS chemical-shift assignments to advance insights into the β-subunit active site, in particular residue βK87, at physiological temperatures. βK87 is a key residue of the β-subunit catalytic pocket, initially holding the PLP cofactor and later serving as the acid–base catalyst (10). Full βK87 chemical-shift assignments began with the proton-detected 4D and 5D sequential backbone experiments, and these in turn enabled side chain carbon assignments (from Cα through Cε) via the 4D hCCNH, which link to Nζ (the Schiff base nitrogen) and the Hζ proton in the H/N via a long-range H/C correlation (Fig. 5 and Table 1). The entirety of the βK87 shifts give direct experimental access to the question of the linking Schiff-base equilibrium protonation state and the associated energies (10).
Fig. 5.
NMR assessment of the catalytically important residue βK87. (A) Backbone and side chain carbon assignment, respectively, via 5D HNcoCANH (green) and 4D hCCNH (yellow). (B) The 2D H/N correlation (blue) and 2D long-range H/C correlation (magenta). The cross-section along the 13C axis (dotted line) is plotted in magenta in A. (C) Temperature-dependent H/N correlations of the Schiff base, achieved by measurements of the same (deuterated) protein sample in a 0.7-mm rotor at 55 kHz MAS. (D) Cross-sections for the 15N of the Schiff base (Nζ, Left) and its proton (Right), spectrum with 50 Hz exponential decay and Lorentzian as solid and dashed lines, respectively for line shape analysis/determination of exchange rates. The plane from the 5D HNcoCANH depicted in A is extracted at βH86 H,N,Cα shifts 9.44/117.1/58.9, with the 1H coordinates shown folded in from 6.3 ppm (indirect HN).
Table 1.
Experimental and first-principles predicted chemical shifts (in ppm) for the PSB, phenolic, and their best-fit two-site exchange (61% PSB, 39% Phen) models at 30 °C
Atom
PSB
Phen
Two-site exchange
Experimental
PLP
N1
304.4
302.7
303.7
294.7
C3
173.9
158.8
166.0
168.3
C2
162.0
153.3
158.6
159.6
C2′
23.3
22.2
22.9
20.4
βLys87
Nζ/SBN
166.8
319.3
226.6
227.3
Cγ
25.8
26.3
26.0
25.9
Cδ
34.0
35.6
34.7
32.7
Cε
50.7
57.6
53.4
53.3
Red-χ2
28.4
66.6
1.3
—*
*Red-χ2 is the reduced χ2 value between the set of experimental chemical shifts (rightmost column) and the set of shifts calculated for each model.
NMR assessment of the catalytically important residue βK87. (A) Backbone and side chain carbon assignment, respectively, via 5D HNcoCANH (green) and 4D hCCNH (yellow). (B) The 2D H/N correlation (blue) and 2D long-range H/C correlation (magenta). The cross-section along the 13C axis (dotted line) is plotted in magenta in A. (C) Temperature-dependent H/N correlations of the Schiff base, achieved by measurements of the same (deuterated) protein sample in a 0.7-mm rotor at 55 kHz MAS. (D) Cross-sections for the 15N of the Schiff base (Nζ, Left) and its proton (Right), spectrum with 50 Hz exponential decay and Lorentzian as solid and dashed lines, respectively for line shape analysis/determination of exchange rates. The plane from the 5D HNcoCANH depicted in A is extracted at βH86 H,N,Cα shifts 9.44/117.1/58.9, with the 1H coordinates shown folded in from 6.3 ppm (indirect HN).Experimental and first-principles predicted chemical shifts (in ppm) for the PSB, phenolic, and their best-fit two-site exchange (61% PSB, 39% Phen) models at 30 °C*Red-χ2 is the reduced χ2 value between the set of experimental chemical shifts (rightmost column) and the set of shifts calculated for each model.
The Protonation State of βK87 Nζ.
Protonation of the Schiff-base nitrogen, Nζ, has been proposed to activate the cofactor C4′ carbon for nucleophilic attack by the incoming substrate, serine (Fig. 1) (11–13). How this activation might be coupled to larger conformational motions responsible for substrate trapping and allosteric signaling remains an intriguing mechanistic question. Here, the intermediate value of 227.3 ppm found for Nζ at 30 °C suggests a dynamic tautomeric exchange between protonated and neutral Schiff-base forms (11). To quantify the exchange and the identity of the chemical structure of the exchanging partners, we turned to NMR-assisted crystallography—the integrated application of ssNMR, X-ray crystallography, and first-principles computational chemistry (11, 26, 44, 57, 59, 76–81).Our approach follows that of Caulkins et al. (11). Starting with the crystal structure of the TS internal-aldimine form (Protein Data Bank identifier [PDB ID]: 4HT3), a cluster model of the active site was constructed that included all residues within 7 Å of the PLP cofactor. Five models of the active-site chemistry were generated by varying the protonation states of the pyridine ring nitrogen, pyridoxal phenolic oxygen, and Nζ of βLys87 (). Each of these candidate structures was geometry optimized using density functional theory (DFT), with the exterior residues of the cluster fixed at their crystallographic positions. NMR chemical shieldings were calculated using a locally dense basis approach and converted to chemical shifts (82) (). Finally, the structural models were ranked based on the agreement between their first-principles predicted chemical shifts and the ssNMR assignments for the βLys87 side chain and PLP cofactor using the reduced-χ2 statistic, the weighted deviation of the model from experimental shifts.Of the candidate structures, none was found to show acceptable agreement between the predicted and experimental chemical shifts, with the lowest reduced χ2 = 28.4 (). Based on the temperature dependence and large line width of the Schiff-base nitrogen (Fig. 5 and A Modulated Tautomeric Exchange), fast-exchange equilibrium models were considered next, in which the effective chemical shifts were given as the population-weighted average of the shifts for the individual structures. All models were paired and their populations optimized for best agreement with the experimental chemical shifts. The best-fit fast exchange, with a reduced χ2 of 1.3, was found to be between the protonated Schiff base (PSB; ketoenamine) and phenolic (Phen; enolimine) forms (Fig. 6). Table 1 summarizes select experimental and first-principles predicted shifts for this model compared with its parent states. The next-best exchange model had a reduced χ2 of 4.9, making it statistically unlikely (). Bayesian probability analysis (83) confirmed that the best-fit exchange model is the most probable experimental state with 91% confidence (Fig. 6 and ).
Fig. 6.
Analysis of the tautomeric equilibrium at the βK87 Schiff base in the active site of the β-subunit. (A) Shifts obtained from first-principles calculations for the active site with PSB, protonated phenolic oxygen (Phen), and tautomeric exchange between the two with a 61:39 ratio. Numbers denote differences with respect to experimental chemical shifts in parts per million (ppm). (B) Bayesian probability for the best-fitting combinations of different two-site, fast-exchange equilibrium models, comparing experimental and first-principles chemical shifts, with protonation being denoted by “1” or “0”, respectively, in order (pyridine nitrogen), (phenolic oxygen), (Schiff base nitrogen). (C) Best fit of temperature-dependent populations to the apparent enthalpy and entropy difference for the entire tautomeric exchange process within the enzyme. The dashed curve shows the dependence consistent with ΔHapp. = 9.6 ± 1 kJ/mol and ΔSapp. = 28.6 ± 3 J/(mol K). (Populations are derived from temperature-dependent 15Nζ chemical shifts measured in a 2D fashion but with universal carbon shifts.) (D) Effective energy profile of the tautomeric exchange of the Schiff base at 30 °C, reflecting both the equilibrium with the PSB slightly dominating, as well as the exchange rate and apparent activation energy of the process. Both the temperature dependence of populations and tautomeric exchange rate hint to association of the protonation/deprotonation of the Schiff base with changes in its environment in the pocket.
Analysis of the tautomeric equilibrium at the βK87 Schiff base in the active site of the β-subunit. (A) Shifts obtained from first-principles calculations for the active site with PSB, protonated phenolic oxygen (Phen), and tautomeric exchange between the two with a 61:39 ratio. Numbers denote differences with respect to experimental chemical shifts in parts per million (ppm). (B) Bayesian probability for the best-fitting combinations of different two-site, fast-exchange equilibrium models, comparing experimental and first-principles chemical shifts, with protonation being denoted by “1” or “0”, respectively, in order (pyridine nitrogen), (phenolic oxygen), (Schiff base nitrogen). (C) Best fit of temperature-dependent populations to the apparent enthalpy and entropy difference for the entire tautomeric exchange process within the enzyme. The dashed curve shows the dependence consistent with ΔHapp. = 9.6 ± 1 kJ/mol and ΔSapp. = 28.6 ± 3 J/(mol K). (Populations are derived from temperature-dependent 15Nζ chemical shifts measured in a 2D fashion but with universal carbon shifts.) (D) Effective energy profile of the tautomeric exchange of the Schiff base at 30 °C, reflecting both the equilibrium with the PSB slightly dominating, as well as the exchange rate and apparent activation energy of the process. Both the temperature dependence of populations and tautomeric exchange rate hint to association of the protonation/deprotonation of the Schiff base with changes in its environment in the pocket.NMR-assisted crystallography of the TS internal-aldimine state reveals a fast-exchange equilibrium between the PSB and phenolic forms. At 30 °C, the equilibrium populations are 61% and 39%, respectively, indicating an effective free-energy difference of only 1.2 kJ/mol between the tautomers and demonstrating the PSB to be the dominant, but not exclusive, form at physiologically relevant temperatures.
A Modulated Tautomeric Exchange.
The populations of PSB and phenolic tautomers are strongly temperature dependent (Fig. 5), which is unexpected for a simple two-state model for the exchanging proton and requires a significant entropy term to accommodate. (See the fit of enthalpy/entropy contributions to the free-energy difference in Fig. 6 and .) The temperature dependence of the populations is, however, consistent with larger-scale processes in the surrounding active site being coupled to this exchange. Crystal structures of TS show both open and closed conformations of the β-subunit for various intermediates in the catalytic cycle () (2). The open conformation is necessary for the free diffusion of substrate into the active site. It also establishes an aqueous environment proximal to the cofactor that favors the Zwitterionic, PSB form (11). Closed conformations largely exclude water from the active site, favoring the neutral Schiff base, phenolic form. The open and closed states of the active site remain in equilibrium, with a switch between the predominant form for the various intermediates (5). An entire conformational exchange through the crystallographic conformations is unlikely in the absence of a substrate. However, it is noteworthy that already within a single (cryogenic) X-ray structure (PDB ID: 4HT3), conformational plasticity is seen in the entry portal for serine in the β-subunit active site (), which involves interactions between residues in the carboxy-terminal α-helix and the loop holding the cofactor (). Likewise, an isolated tautomeric-exchange process would be associated with a low effective activation barrier and an expected timescale in the picosecond-to-nanosecond regime (84, 85). To assess the effective timescale of the tautomeric exchange in the enzyme and whether contributions from conformational motion in the surrounding could play a role, we conducted line shape analysis of the Schiff-base nitrogen based on the limiting chemical shifts given by the computational modeling as well as R1ρ analysis of the protein backbone via the pseudo-4D, relaxation-edited hCONH experiments (details in ). Line shape analysis gives experimental access to the apparent rates of the tautomeric exchange and hence, via the Eyring equation, the effective free energy of activation. Whereas homogeneous nitrogen line widths in the sample in the absence of exchange (including the βK87 backbone amide) generally amount to only around 20 Hz, linewidths for the Schiff-base nitrogen are on the order of 270 Hz (Fig. 5 and ). Equally, the Schiff-base proton has a line width of 120 Hz compared to amide HN widths of, generally, around 50 Hz. Assuming a two-state exchange, the exchange-broadened lines suggest a tautomeric turnover on the microsecond motional timescale, with a forward rate of around 2.4 × 106 s−1, a regime much slower (or a ΔG‡ larger) than expected for an isolated proton-exchange process (84, 85). Even though the ssNMR line width does not purely reflect the exchange contribution as in solution NMR and would have to be scaled down somewhat (to account for residual anisotropic interactions, sample inhomogeneity, and anisotropic bulk magnetic susceptibility) (86), the high apparent effective activation barrier is in line with the Nζ temperature dependence and consistent with a linkage of tautomerism with variations in the surrounding electrostatic environment. Details of conformational exchange dynamics can be provided by relaxation dispersion experiments, which have remained exceedingly demanding for TS so far. However, the simple R1ρ experiments at least reveal moderately elevated rates (5 to 9 s−1, compared to rates of 1 to 2 s−1 for inconspicuous residues) at ∼30 °C for entry portal residues like D649 and G352 (rates in , and relaxation decays in ). The increased rates, and likely also the low degree of assignments in the adjacent communication domain residues, are in agreement with the exchange-broadened character of the Schiff-base line shapes. Whereas details of the modulation of the exchange by the environment remain elusive, the data are consistent with a connection between tautomerism and a variable pocket environment, which suggests a modulation of Schiff-base protonation by the surrounding architecture. Possible scenarios in the functional enzyme would be a conformation-dependent fast-exchange equilibrium of the Schiff base directly driven by electrostatics or changes in the water network that differentially stabilize neutral and Zwitterionic forms dependent on pocket conformations and thus indirectly tune the Schiff base properties. These provide one possible mechanism for coupling global structural changes with chemical reactivity as part of the allosteric regulation in TS.
Discussion
The chemical shift in NMR is a sensitive probe of the individual chemical environment of a given atom, reporting on protonation state, hybridization state, hydrogen bonding interactions, and the surrounding electrostatic environment. With structure determination more and more facilitated by automation and computation, the addition of chemical shifts will become increasingly interesting for targets in various scientific disciplines. Chemical-shift measurements in enzymes have been the preeminent tool for characterizing the chemical structures of intermediates throughout the catalytic cycle, highlighting the protonation states and associated tautomeric equilibria for the cofactor, substrates, and active-site residues (10, 11, 13, 57, 59, 87). For enzymes as complex as TS, this approach has been limited to distinct, selective incorporation of 13C and 15N spin labels. Whereas individual shifts from specific labeling have often been sufficient for important insights, access to all shifts in (almost) the entire protein from a single preparation can yield a large number of individual insights (regarding interactions, residue-specific structural, motional, and chemical properties) at once, which can be beneficial from a biological as well as from a preparative perspective.As the use of chemical shifts as restraints in first-principles computational refinement hinges on priors from real, experimentally obtained shifts, a high level of comprehensiveness for the chemical-shift data obtained through extensive assignments on U-13C,15N-labeled samples ushers in far-reaching possibilities for NMR-assisted crystallography that we expect will allow it to expand beyond the active site to include extended hydrogen bonding and proton transfer pathways, conformational dynamics, and allostery. In addition to simultaneous characterization of multiple shifts combined with complementary data on dynamics as becoming possible here, comprehensive assignments will also allow access to interactions with/distances to small molecules, substrates, and water, to other relaxation parameters, and to conformational assessment of individual residues or atoms. We anticipate that the analysis pursued for the TS resting internal-aldimine state studied here can be readily extended to other stable intermediates along the catalytic pathway, facilitated by the fact that the additional intermediates would not largely alter the chemical shifts for the majority of the enzyme and the bulk of the information can be utilized.The assignments in TS are paradigmatic and foreshadow the value of ssNMR data for other high-molecular-weight targets in biology, pharmacological research, and biotechnology for which (micro)crystalline samples can be obtained. For example, common drug targets, including many kinases, phosphatases, nuclear receptors, and many membrane proteins like surface receptors, channels, and transporters, are often in a molecular-weight range of 60 to 80 kDa. Similarly, the monomeric molecular mass of many biocatalysts in industrial applications like dehydrogenases, lipases, and esterases often fall into this molecular-weight regime. With a moderate magnetic field of 16.5 T (700 MHz) used in this study, the assessment of TS has been a challenge, but decreasing measurement times by several fold would each apply for higher magnetic fields or with emerging MAS cryoprobes (88), which, coupled together, could shorten measurement times by up to an order of magnitude. As a drawback in comparison to carbon detection methods, prolines (and also side chain nuclei of aromatic residues) escape all of the assessments. In addition, incomplete back-exchange into the deuterated sample is a common drawback for proton- (HN-)detected ssNMR both here as well as for other large proteins in which unfolding/refolding protocols fail. shows first spectra and sensitivities obtained for a nondeuterated TS sample spun at 111 kHz MAS in a 0.7-mm rotor, a framework that can circumvent both of these problems. In fact, very similar experiments are possible as for deuterated samples, as exemplified by a 4D hCOCANH experiment recorded for comparison (). Lower sensitivity due to reduced sample volume (∼0.5 µL) and transfer efficiencies is noted but will benefit from the same advances in field strength and cryoprobes mentioned above.
Conclusion
Here, we have shown a protein ssNMR study enabled by higher-dimensionality (4D and 5D) shift assignments in the 144-kDa TS bienzyme complex with an asymmetric unit of 72 kDa. The benefits of higher dimensionality required for the 665-residue asymmetric unit, in particular low-ambiguity sequential correlations directly concatenated with side chain shifts and residue type data, are enabled by proton-detected fast-MAS ssNMR. The success of this approach is owed to high transfer efficiencies that are independent of molecular weight; thus the concatenation of many transfer steps and evolution periods within complex experiments at low duty cycles becomes possible. In combination with state-of-the-art computational approaches, the chemical shifts provide access to chemical, thermodynamic, and kinetic parameters for active-site species and give experimental insight into the interplay between plasticity, essential for substrate trapping and product release, and chemical properties within the pocket. The data reveal the dominance of a protonated Schiff-base species under physiological temperatures, with a tautomeric dynamic equilibrium that is linked to the electrostatic environment of the pocket architecture. This study demonstrates the feasibility of NMR assignment and assessment of dynamics and chemical properties in highly complex targets with minimal amounts of uniformly labeled protein. Facilitated access to NMR data in this molecular-weight regime will unlock an atomic-level understanding of reaction thermodynamics and kinetics widely sought for biological, medical, and industrial applications.
Materials and Methods
Salmonella typhimurium TS was expressed and purified as described in detail in . In brief, Escherichia coli CB149 in M9 minimal medium was used with a pEBA-10 plasmid, and the protein was purified via a crystallization and recrystallization procedure in the presence of Cs+, polyethylen glycol-8000, and spermine in Tris buffer at pH 7.8. The SH3 domain of chicken α-spectrin was expressed and purified as described before (36). NMR spectra for SH3 and TS were acquired each on a single microcrystalline sample of a Cu-edta–doped, uniformly 2H/13C/15N triple-labeled and 100% exchangeable-proton back-exchanged preparation. NMR spectra were recorded using a 1.3-mm probe at an MAS frequency of 55 kHz MAS at ∼30 °C effective temperature, using recycle delays of 0.6 s on a Bruker NEO spectrometer with a proton Larmor frequency of 700 MHz. All assignment experiments were performed as NUS experiments. All new pulse sequences for the 4D and 5D backbone and side chain assignment experiments, a list of acquisition and processing parameters, and practical considerations for setting up and processing NUS 5D ssNMR are found in .First-principles geometry optimization and chemical-shift calculations for the TS resting state (internal aldimine) β-subunit active site were conducted using a DFT cluster-based approach following that of Caulkins et al. (11) as detailed in . All chemical-shift assignments can be found under BMRB entry 51166.
Authors: Dimitri Niks; Eduardo Hilario; Adam Dierkers; Huu Ngo; Dan Borchardt; Thomas J Neubauer; Li Fan; Leonard J Mueller; Michael F Dunn Journal: Biochemistry Date: 2013-09-06 Impact factor: 3.162
Authors: Yu-Ming M Huang; Wanli You; Bethany G Caulkins; Michael F Dunn; Leonard J Mueller; Chia-En A Chang Journal: Protein Sci Date: 2015-09-22 Impact factor: 6.725
Authors: Donghua H Zhou; Gautam Shah; Mircea Cormos; Charles Mullen; Dennis Sandoz; Chad M Rienstra Journal: J Am Chem Soc Date: 2007-08-29 Impact factor: 15.419
Authors: Holger Haas; Sahand Tabatabaei; William Rose; Pardis Sahafi; Michèle Piscitelli; Andrew Jordan; Pritam Priyadarsi; Namanish Singh; Ben Yager; Philip J Poole; Dan Dalacu; Raffi Budakian Journal: Proc Natl Acad Sci U S A Date: 2022-09-26 Impact factor: 12.779
Authors: Manuel Cordova; Edgar A Engel; Artur Stefaniuk; Federico Paruzzo; Albert Hofstetter; Michele Ceriotti; Lyndon Emsley Journal: J Phys Chem C Nanomater Interfaces Date: 2022-09-23 Impact factor: 4.177