Literature DB >> 24597914

Experimental and computational mutagenesis to investigate the positioning of a general base within an enzyme active site.

Jason P Schwans¹, Philip Hanoian, Benjamin J Lengerich, Fanny Sunden, Ana Gonzalez, Yingssu Tsai, Sharon Hammes-Schiffer, Daniel Herschlag.

Abstract

The positioning of catalytic groups within proteins plays an important role in enzyme catalysis, and here we investigate the positioning of the general base in the enzyme ketosteroid isomerase (KSI). The oxygen atoms of Asp38, the general base in KSI, were previously shown to be involved in anion-aromatic interactions with two neighboring Phe residues. Here we ask whether those interactions are sufficient, within the overall protein architecture, to position Asp38 for catalysis or whether the side chains that pack against Asp38 and/or the residues of the structured loop that is capped by Asp38 are necessary to achieve optimal positioning for catalysis. To test positioning, we mutated each of the aforementioned residues, alone and in combinations, in a background with the native Asp general base and in a D38E mutant background, as Glu at position 38 was previously shown to be mispositioned for general base catalysis. These double-mutant cycles reveal positioning effects as large as 10(3)-fold, indicating that structural features in addition to the overall protein architecture and the Phe residues neighboring the carboxylate oxygen atoms play roles in positioning. X-ray crystallography and molecular dynamics simulations suggest that the functional effects arise from both restricting dynamic fluctuations and disfavoring potential mispositioned states. Whereas it may have been anticipated that multiple interactions would be necessary for optimal general base positioning, the energetic contributions from positioning and the nonadditive nature of these interactions are not revealed by structural inspection and require functional dissection. Recognizing the extent, type, and energetic interconnectivity of interactions that contribute to positioning catalytic groups has implications for enzyme evolution and may help reveal the nature and extent of interactions required to design enzymes that rival those found in biology.

Entities: CellLine Chemical Disease Gene Mutation Species

Mesh：

Substances：

Year: 2014 PMID： 24597914 PMCID： PMC4004248 DOI： 10.1021/bi401671t

Source DB: PubMed Journal: Biochemistry ISSN： 0006-2960 Impact factor: 3.162

A central goal of biology is to understand the factors responsible for the enormous rate enhancements provided by enzymes. A distinguishing feature of enzymes relative to small-molecule catalysts is the use of noncovalent interactions to position substrates and functional groups relative to one another, and many X-ray structures of enzymes reveal apparent networks of interactions. However, understanding how these structural features position active site residues, how important different features are in this positioning, and how they operate together requires extensive functional tests. Three catalytic strategies have been identified for the enzyme ketosteroid isomerase (KSI): stabilization of the incipient oxyanion via an oxyanion hole, facilitated proton abstraction via a general base (Figure 1), and localization and positioning of the substrate via binding interactions.[1−4] In particular, the nature of the hydrogen bonding interactions in the oxyanion hole has been extensively investigated using experimental[5−12] and theoretical[13−ref21] approaches. Recently, the oxyanion hole has been fully ablated, yet 109-fold of the overall 1012-fold catalysis remains, indicating that other features—those noted above or hitherto unidentified catalytic sources—are responsible for the remaining billion-fold catalysis.[21] We previously showed that anion–aromatic interactions with Phe54 and Phe116 help position the general base carboxylate via favorable interactions between the carboxylate and the partial positive charges situated on the hydrogen atoms on the aromatic rings.[22] Here we investigate the roles of more distal active site interactions to better understand how the general base is positioned.

Figure 1

Steroid isomerization reaction of (A) 5(10)-estrene-3,17-dione [5(10)-EST] and (B) 5-androstene-3,17-dione (5-AND) catalyzed by the general base Asp38 in ketosteroid isomerase in Comamonas testosteroni. The isomerization process involves two sequential proton transfer reactions: transfer of a proton from C4 of the steroid to Asp38, followed by transfer of a proton from Asp38 to C6 of the steroid in 5-AND and C10 of the steroid in 5(10)-EST. Prior studies have suggested that residues not directly participating in the chemical transformation can play important roles in enzymatic catalysis. For example, the activity of a catalytic antibody was increased by 102-fold through mutation of only non-active site residues, an effect that was attributed to repositioning active site residues for optimal geometries.[23] Mutation of loop residues ∼10 Å from the active site and mutation of residues directly within the trypsin binding pocket were required to change the substrate specificity of trypsin to that of chymotrypsin.[24] In each study, the residues identified were not obvious a priori from visual inspection of the enzyme’s structure. More generally, the overall folded structure of an enzyme is necessary to establish the active site architecture and to position the residues within it.[25−30] Motivated by these and other prior results and the current challenge to understand the positioning of catalytic groups, we probed nearby residues that might contribute to the positioning and thus the function of the catalytic base Asp38 in KSI. Our results highlight the extended nature of catalytic sites and the need to understand effects extending beyond readily identifiable catalytic residues. The insights provided by this and related studies provide crucial information toward the development of predictive models to describe the roles of distal residues and active site features in catalysis.

Methods

All buffers were prepared with reagent grade materials or better. All reagents were of the highest purity commercially available (≥97%), except for the substrate 5(10)estrene-3,17-dione [5(10)-EST] (Steraloids, Inc.). Solutions of the steroid substrate slowly converted to 4-estrene-3,17-dione (product) as determined by thin layer chromatography. The concentrations of substrate stock solutions were determined by measuring the absorbance change at 248 nm for the acetate-catalyzed conversion of substrate to product and using the molar absorptivity of the product (14800 M–1 cm–1).[8] Independent measurements with ≥93% pure substrate and ≥99% pure substrate agreed within experimental error, indicating ≤7% product in the substrate stock did not affect kinetics measurements. All substrate stocks contained ≤7% product contamination.

KSI Mutagenesis, Expression, and Purification

QuikChange (Stratagene) site-directed mutagenesis was used to introduce the mutations into KSI genes from C. testosteroni encoded on pKK22-3 plasmids or pET21c plasmids. Mutations were confirmed by sequencing mini-prep DNA from DH5α cells. Proteins were expressed and purified as previously described.[9] The final purity was >99% as estimated on the basis of a Coomassie-stained sodium dodecyl sulfate–polyacrylamide gel electrophoresis gel. The protein concentration was determined using the calculated molar extinction coefficient in 6 M guanidinium hydrochloride according to the method of Gill et al.[31]

Kinetics Experiments

Reactions with 5(10)-EST were conducted at 25 °C in 10 mM buffer with 2% DMSO (v/v) added as a cosolvent for substrate solubility, 1 mM sodium EDTA, and 2 mM DTT and were monitored continuously at 248 nm in a PerkinElmer Lambda 25 spectrophotometer. A constant ionic strength of 100 mM was maintained in all samples using NaCl. The following buffers were used: sodium citrate, pH 4.3–5.9; sodium phosphate, pH 6.1–8.0; sodium 3-(N-morpholino)propanesulfonate (MOPS), pH 6.3–8.3. The molar absorptivity for the product of 14800 M–1 cm–1 was previously determined using commercially available 4-estrene-3,17-dione.[8] The values of kcat, KM, and kcat/KM were determined by fitting the initial rates as a function of substrate concentration (typically eight concentrations that varied from 2 to 600 μM) to the Michaelis–Menten equation. At least three determinations at differing enzyme concentrations (varying at least 4-fold overall) were averaged.

X-ray Crystallography

Single-crystal diffraction data were collected at SSRL beamline BL9-1 using a wavelength of 0.98 Å.[32] The reflections were indexed and integrated with XDS;[33] the intensities were scaled, merged, and converted to amplitudes with SCALA and TRUNCATE.(34) Molecular replacement was performed with AMoRe[35] using the 3CPO structure, and refinement was performed with REFMAC5.(36,37) Manual model building was conducted with COOT.(38)

Molecular Dynamics Simulations

Classical MD simulations of wild-type (WT) and mutant KSIs were initiated from relevant crystal structures. For WT KSI and the A114G, S42G, and V40G/S42G mutants, chains C and D of the crystal structure of KSI D38N with bound equilenin [Protein Data Bank (PDB) entry 1QJG][39] were used as an initial structure. For the D38E and D38E/A114G mutants, chains A and B of unliganded D38E KSI (PDB entry 4L7K) were used.[22] For the P39G and D38E/P39G mutants, the unliganded structure of KSI P39A (PDB entry 3MHE) determined herein was used. For the P39G/V40G/S42G mutants, chains A and B of KSI P39G/V40G/S42G with bound equilenin (PDB entry 3OV4) were used. For D38E/P39G/V40G/S42G, the crystal structure of unliganded D38E/P39G/V40G/S42G (PDB entry 3NM2) determined herein was used. KSI was simulated as a dimer with bound 17-oxo-andro-3,5-dien-3-olate in all cases. This ligand corresponds to the intermediate in the steroid isomerization reaction of 5-androstene-3,17-dione (5-AND), while the kinetics experiments were performed for the isomerization reaction of the 5(10)-EST substrate (Figure 1). Changes in rates due to mutation have been shown to be qualitatively consistent between these two substrates.[40] Furthermore, prior simulations using the intermediates of the isomerization reactions of 5-AND and 5(10)-EST were found to be qualitatively similar.[41] The crystal structures were modified as needed to obtain a complete dimeric structure of the protein. For simulations initiated from the P39A crystal structure (PDB entry 3NM2), the complete chain in the asymmetric unit (chain B) was aligned by root-mean-square deviation (rmsd) minimization of Cα atoms with respect to the 1QJG crystal structure to obtain a dimer. Similarly, for D38E/P39G/V40G/S42G, the single monomeric unit in the structure (PDB entry 3NM2) was aligned by rmsd minimization with respect to 1QJG to obtain a dimer. Note that the initial conformation of the catalytic base, residue 38, in the P39G/V40G/S42G and D38E/P39G/V40G/S42G mutants differs significantly from that of the WT and is directed more toward the solvent. This difference persists throughout the simulations and is addressed in Results and Discussion. In the 3NM2 and 3OV4 crystal structures, residues Pro39Gly, Val40Gly, Gly41, and Ser42Gly were not present and were added using Profix,[42] with peptide bonds in the trans conformation, which is consistent with that observed in the D38E/P39G/V40G/S42G structure (PDB entry 3NM2). In cases where Ala125 was absent (PDB entries 4L7K and 3OV4), this C-terminal residue was also added using Profix. The structures were further modified to incorporate the ligand, add protons as needed, and solvate the system. In cases where a bound equilenin ligand was present (PDB entries 1QJG and 3OV4), an energy-minimized structure of the steroid intermediate (17-oxo-andro-3,5-dien-3-olate) was aligned with equilenin by minimizing the rmsd of the C3, O3, C4, C17, and O17 atoms in the steroid and the corresponding atoms in equilenin. For cases where equilenin was absent (PDB entries 4L7K and 3NM2), the steroid intermediate was aligned by minimizing the rmsd of the Cα atoms for each monomer with respect to the equilenin-bound 1QJG crystal structure. Aspartic acid residues 38 and 99 were considered to be protonated, and the steroid intermediate was considered to be deprotonated. Histidines 6 and 100 were protonated at the ε-position, and histidine 122 was doubly protonated, based on protonation states determined using H++ [43] and inferable hydrogen bonds observed in the 1QJG crystal structure. Each system was immersed in a truncated octahedral box with 12900 explicit water molecules and four sodium counterions to ensure a neutral system. The AMBER99SB force field[44,45] was employed for the protein, and the rigid TIP3P water model[46] was used for the water molecules. Charges for the intermediate state of the steroid were determined using the restrained electrostatic potential (RESP) method[47] and are the same as those reported previously.[41] Missing steroid force field parameters were obtained from the Generalized Amber Force Field (GAFF).[48] A 10 Å real space cutoff was used for nonbonded interactions, and long-range electrostatics were treated with the particle mesh Ewald method.[49,50] Bonds involving hydrogen were constrained to their equilibrium bond lengths using LINCS.[51] A multistep equilibration procedure was performed prior to data collection for each system. First, the positions of the solvent and ions were optimized using the steepest descent energy minimization algorithm; subsequently, the solvent and ions were equilibrated at constant NVT for 500 ps at 300 K using the Nosé-Hoover thermostat.[52,53] Then the positions of the solvent and ions were optimized again by energy minimization, followed by optimization of the full system. The system was then annealed from 50 to 300 K, heating in increments of 50 K over 10 ps and maintaining the temperature during each interval for 50 ps at constant NPT at a pressure of 1 atm using the Nosé-Hoover thermostat and Parrinello-Rahman barostat.[54] The system was then simulated at constant NPT for 2 ns. A configuration consistent with the equilibrated average volume from this simulation was extracted and further equilibrated at constant NVT for 2 ns. These simulations were extended for additional production for 20 ns, and configurations were saved every 1 ps. A time step of 1 fs was used in all simulations. All simulations were performed using Gromacs.[55] Two independent trajectories were performed for each dimeric system, resulting in data for four active sites that were averaged to obtain the data presented in the main text; data for individual trajectories are presented in the Supporting Information.

Results and Discussion

The general base of KSI, Asp38, is situated at the end of a β-sheet and the beginning of a loop (Figure 2A), with its side chain proximal to those of Phe54, Met112, Ala114, and Phe116 (Figure 2B). Prior studies have suggested the presence of favorable anion–aromatic interactions between the carboxylate oxygens of Asp38 and the hydrogen atoms on the aromatic rings of Phe54 and Phe116.[22] A structured loop connecting two stands of an antiparallel β-sheet via a β-turn[56] begins with the general base Asp38 and contains Pro39, Val40, Gly41, Ser42, Glu43, and Pro44, with a cis peptide linkage between Asp38 and Pro39 (Figure 2B). Our very simple expectation was that the structure of this loop and packing interactions with the nearby hydrophobic side chains would help to orient Asp38 to act as a general base. However, the underlying energetic contributions to catalysis of the structured loop and packing interactions cannot be ascertained through structural inspection alone; in particular, the structure does not indicate which residues are needed for optimal general base catalysis, which have larger or smaller contributions, or whether the entire loop or portions of the loop function as cooperative units.

Figure 2

General base, Asp38, positioned in the KSI active site. (A) Superposition of the WT unbound structure (PDB entry 8CHO) and the equilenin-bound D38N structure (PDB entry 1QJG), where only equilenin (EQU) is shown from the equilenin-bound structure. The Asp general base (orange) is located at the end of a β-strand (gray) and at the beginning of a loop (green), and it is situated near Phe54 and Phe116 (cyan) and Met112 and Ala114 (magenta). The bound equilenin ligand (blue-violet) is situated at the location of the bound substrate during the isomerization reaction. (B) Detailed view of the general base and surrounding residues with labels. Colors are as in panel A. Prior kinetic studies showed that the D38E mutation leads to a 240-fold decrease in kcat,[57] and a crystal structure of this mutant showed that the mutation perturbs the position of the general base by ∼1 Å without introducing additional structural rearrangements.[22] These results suggested that the rate reduction arises predominantly from local mispositioning of the general base upon lengthening its side chain by one methylene group and allowed us to use the D38E mutation in double-mutant systems to probe the effects of other mutations on general base positioning.[22] In the simplest model, if a residue is important for positioning the general base, mutation of that residue will lead to a less deleterious effect in a KSI mutant with an already mispositioned general base (i.e., in the D38E mutant background) than in WT KSI with a well-positioned general base (i.e., in the Asp38 background). We previously applied this double-mutant cycle approach to assess the roles of Phe54 and Phe116, which directly interact with the Asp38 carboxylate atoms and apparently contribute to the positioning of the Asp38 carboxylate group via the anion–aromatic interactions noted above.[22] To further investigate the effects of the D38E mutation, we compared molecular dynamics (MD) simulations of WT and D38E KSI (Table 1 and Figure 3). Because we have more precise structural information for bound equilenin, an intermediate analogue, than we do for the reaction substrate and product, we simulated the dynamics of the KSI·intermediate complex (see Methods). Mixed quantum mechanical/molecular mechanical (QM/MM) calculations of the catalyzed reaction could provide further insight into the impact of these mutations, but the large number of mutations in this study and the long simulation times required to achieve convergence precluded the use of these more expensive methods.

Table 1

Average Proton Donor–Acceptor Distances (angstroms) from MD Simulationsa

enzyme	step 1	step 2
WT	2.97 (0.01)	3.66 (0.04)
A114G	2.93 (0.01)	3.82 (0.12)
P39G	4.79 (1.22)	3.92 (0.86)
P39G/V40G/S42G	5.67 (1.32)	4.54 (1.15)
S42G	2.97 (0.04)	3.70 (0.15)
V40G/S42G	3.02 (0.07)	3.53 (0.25)
D38E	4.22 (0.34)	5.29 (0.48)
D38E/A114G	3.67 (0.35)	4.03 (0.67)
D38E/P39G	5.02 (1.07)	4.14 (1.01)
D38E/P39G/V40G/S42G	6.57 (0.72)	5.35 (0.45)

Distances are calculated between the protonated oxygen atom of the general base carboxylic acid and C4 of the steroid for step 1 and C6 of the steroid for step 2 as shown in Figure 1B. The values were averaged over two 20 ns MD trajectories and two active sites. Simulations were performed for the intermediate state of the isomerization reaction of 5-AND. Standard deviations among the four data sets are given in parentheses.

Figure 3

Atomic isodensity surfaces of the protonated catalytic base carboxylate group from the MD simulations. Oxygen density is colored red and hydrogen density white. In each case, the greatest 90% of the oxygen or hydrogen density is shown as a transparent isosurface, and the greatest 50% of the oxygen or hydrogen density is shown as an opaque isosurface. The MD average structures of the heavy atoms of the steroid intermediate (5-AND), the catalytic base Asp38(Glu), and the active site residues Tyr14, Tyr55, and Asp99 are shown. Distances are calculated between the protonated oxygen atom of the general base carboxylic acid and C4 of the steroid for step 1 and C6 of the steroid for step 2 as shown in Figure 1B. The values were averaged over two 20 ns MD trajectories and two active sites. Simulations were performed for the intermediate state of the isomerization reaction of 5-AND. Standard deviations among the four data sets are given in parentheses. Consistent with the X-ray structures, the carboxylic acid moiety of the Glu at position 38 is displaced, with an average donor–acceptor distance >1 Å larger than that with Asp at this position (Table 1). Furthermore, the general base in the D38E mutant occupies several conformations with different donor–acceptor distances that are visible in the probability distributions of these distances (Figures S1 and S2 of the Supporting Information). Root-mean-square fluctuations (rmsfs) of the side chain carbon atoms (Table 2) and atomic isodensity surfaces of the general base carboxylate (Figure 3) also illustrate more mobility of the general base than in WT. The MD results are consistent with mispositioning and a greater degree of motion of the mutant Glu carboxylate group than of the WT Asp group and support the use of this comparison in double-mutant cycles designed to probe the effects of packing and loop residues on the positioning of Asp38.

Table 2

Root-Mean-Square Fluctuations (angstroms) for Carbon Atoms in the Catalytic Base Side Chaina

enzyme	C_α	C_β	C_γ
WT	0.48 (0.02)	0.52 (0.03)	0.53 (0.03)
A114G	0.51 (0.04)	0.56 (0.04)	0.58 (0.04)
P39G	0.66 (0.10)	0.81 (0.17)	1.01 (0.28)
P39G/V40G/S42G	0.53 (0.14)	0.66 (0.18)	0.92 (0.38)
S42G	0.48 (0.02)	0.52 (0.03)	0.54 (0.04)
V40G/S42G	0.51 (0.03)	0.57 (0.04)	0.62 (0.07)

Results obtained by averaging over two 20 ns MD trajectories and two active sites. Standard deviations among the four data sets are shown in parentheses. Note Cδ is only present in species containing the D38E mutation.

Hydrophobic Packing with the General Base Side Chain

Crystal structures of KSI show that the hydrophobic side chains of Met112 and Ala114 are situated within 4 Å of the general base side chain (Figure 2B), suggesting that hydrophobic packing with these side chains may restrict the conformational freedom of the general base and position it within the active site. To functionally probe this hypothesis, we used double-mutant cycles, illustrated schematically in Figure 4A for Ala114, with the M112A and A114G mutations to reduce the size of each hydrophobic side chain. The M112A and A114G mutations gave 33- and 225-fold decreases in kcat, respectively (Table 3, Figure 4B,C). Throughout this study, we observe similar reductions in kcat and kcat/KM (Figure S6 of the Supporting Information) and will discuss only rate reductions in kcat for the remainder of the paper. We have utilized a substrate known to be limited by a chemical rather than physical step;[4,58] nevertheless, there are two chemical steps in the KSI reaction (Figure 1), and it is possible that the rate-limiting step changes with different mutations,[59] an interesting and potentially informative possibility that is not addressed in this study.

Figure 4

Table 3

Effects of Mutation of Met112 and Ala114 on KSI Kinetic Parameters with an Asp (WT) and Glu (D38E) General Basea

enzyme	k_cat (s^–1)	K_M (μM)	k_cat/K_M (M^–1 s^–1)	k_cat ratio (WT/mutant)b	K_M ratio (WT/mutant)b	k_cat/K_M ratio (WT/mutant)b
WT	36 ± 2	50 ± 4	(7.2 ± 0.3) × 10⁵	(1)	(1)	(1)
M112A	1.1 ± 0.1	28 ± 6	(3.9 ± 0.5) × 10⁴	33	1.8	18
A114G	(1.6 ± 0.2) × 10^–1	35 ± 4	(4.6 ± 0.3) × 10³	225	1.4	160

Kinetics experiments were performed for the 5(10)-EST substrate for which the chemical steps of the KSI reaction are rate-limiting.[4,58]

By definition, the ratio for WT KSI is 1, as represented by the values in parentheses.

By definition, the ratio for D38E KSI is 1, as represented by the values in parentheses.

Double-mutant cycle designed to test the roles of Met112 and Ala114 in positioning the general base. (A) Mutation of Asp38 to the “mispositioned” Glu general base allows it to populate alternate conformations (vertical arrows). Mutation of Ala114 affects the positioning of the general base starting from an enzyme with a positioned general base (top horizontal arrow) but has a smaller effect for an enzyme with a mispositioned general base (bottom horizontal arrow). For the sake of clarity, only the double-mutant cycle for Ala114 is shown. Effects on kcat for mutation of (B) Met112 to Ala and (C) Ala114 to Gly in KSI with a positioned Asp general base and a mispositioned D38E mutant general base. Values are from Table 3. The A114G mutational effect was greatly reduced in the D38E mutant background, from 225- to 15-fold, whereas the M112A effect was within 3-fold in both general base backgrounds. The larger effect of the Ala114 mutation with Asp than with Glu as the general base suggests that it helps to position the general base, whereas the similar effect from mutation of Met112 in both backgrounds suggests that other interactions are sufficient for near-optimal positioning of the general base. Met112 might instead contribute to catalysis via interactions with the oxyanion hole, a model that can be tested via future double-mutant cycles with oxyanion hole mutants, or to the overall stability of the active site.a In MD simulations of the A114G mutant, we observed a nearly negligible decrease in the distance between the catalytic base hydroxyl oxygen and C4 of the steroid and a small yet potentially significant increase in the distance between the catalytic base hydroxyl oxygen and C6 of the steroid (Table 1). In addition to this change, the active site of the A114G mutant exhibits a small conformational rearrangement, relative to the WT, with the catalytic base shifted slightly in the direction of the absent methyl group in Ala114Gly (Figure 5). The remainder of the enzyme’s structure is unaffected by the A114G mutation (Figure S3 of the Supporting Information), and the oxyanion hole hydrogen bonds are maintained (Table S3 of the Supporting Information). Thus, the MD results support a decrease in the level of general base function and overall catalysis because of a reorientation of the carboxylate toward the cavity created by the A114G mutation (Figure 6B) and provide no indication of an effect due to broader conformational sampling of the carboxylate group (Figure 6C). We nevertheless caution that the proton transfer reaction is likely to occur in conformers with shorter distances between the proton donor and acceptor[20] that are minimally sampled by equilibrium MD simulations (Figures S1 and S2 of the Supporting Information).

Figure 5

Figure 6

Models illustrating possible effects of mutations on the positioning of the catalytic base carboxylate group (red) in the KSI active site. (A) In WT KSI, packing interactions with Ala114 position the Asp carboxylate relative to the substrate. The cavity created by the Ala114Gly mutation allows the carboxylate to populate a single mispositioned conformation (B) or occupy multiple conformations within the active site with a subset of conformations that are mispositioned (C).

Average structures of WT KSI and the A114G, V40G/S42G, and P39G mutants obtained from MD trajectories. The average structures of the backbone amide nitrogen, α-carbon, and carbonyl carbon of residues 37–42 and the catalytic base side chain (Asp38) are depicted as color-coded balls and sticks with the corresponding atoms in the D38N-equilenin crystal structure (PDB entry 1QJG) colored red for reference. The average structure of the steroid intermediate from the MD trajectories is shown as balls and sticks, as well. In addition, the average structure of Ala114(Gly) is shown for the A114G and V40G/S42G mutants. A representative data set is shown in each case, with the remaining data sets provided in the Supporting Information. Models illustrating possible effects of mutations on the positioning of the catalytic base carboxylate group (red) in the KSI active site. (A) In WT KSI, packing interactions with Ala114 position the Asp carboxylate relative to the substrate. The cavity created by the Ala114Gly mutation allows the carboxylate to populate a single mispositioned conformation (B) or occupy multiple conformations within the active site with a subset of conformations that are mispositioned (C).

Role of the Catalytic Base Loop in General Base Positioning

As described above, the general base Asp38 in KSI is situated at the beginning of a loop that is composed of Pro39, Val40, Gly41, Ser42, Glu43, and Pro44 (Figure 2). This loop connects two strands of an antiparallel β-sheet and meets the geometrical definition of a type II β-turn.[56] A role for Pro39 in positioning the general base was suggested previously by Choi et al. on the basis of the deleterious rate effects from mutations of Pro39 to alanine or glycine, as well as a crystal structure of the P39A mutant that showed the general base carboxylate displaced by ∼3 Å relative to the WT.[60] We mutated each loop residue in both the Asp38 (WT) and D38E mutant backgrounds to functionally test each for a role in general base positioning, and we made several combined mutations to probe for cooperative and collective effects. Mutations of the loop residues Pro39 and Val40 to glycine led to 51- and 71-fold decreases in kcat, respectively. Smaller rate reductions of 2–20-fold were observed for mutations of the remaining residues in the loop, G41V, S42G, E43G, and P44G (Figure 7 and Figure S7 of the Supporting Information). In the D38E background, mutation of residues 42 and 44 led to 2–3-fold rate reductions, slightly smaller than in the WT background, suggesting only a small role of these residues in general base positioning. In contrast, mutation of Glu43 to glycine in the mispositioned D38E background gave effects that were ∼2-fold larger than in the native Asp38 background, suggesting that Glu43 plays a slightly larger role in catalysis with the mutant Glu than with the native Asp general base.

Figure 7

Double-mutant cycles designed to test the role of the loop residues Pro39–Pro44 in positioning the general base. Effects on kcat for the mutation of loop residues in KSI with a positioned Asp38 general base and a mispositioned D38E mutant general base are shown. Values are plotted relative to WT KSI on the left-hand side and relative to the D38E mutant on the right-hand side. Data are from Table 4.

Table 4

Effects of Individual Mutations of Catalytic Base Loop Residues on KSI Kinetic Parameters with an Asp (WT) and Glu (D38E) General Basea

enzyme	k_cat (s^–1)	K_M (μM)	k_cat/K_M (M^–1 s^–1)	k_cat ratio (WT/mutant)b	K_M ratio (WT/mutant)b	k_cat/K_M ratio (WT/mutant)b
WT	36 ± 2	50 ± 4	(7.2 ± 0.3) × 10⁵	(1)	(1)	(1)
P39G	(7.1 ± 0.2) × 10^–1	100 ± 2	(7.1 ± 0.2) × 10³	51	0.5	100
V40G	(5.1 ± 0.2) × 10^–1	40 ± 2	(1.3 ± 0.3) × 10⁴	71	1.3	57
G41A	2.6 ± 0.2	50 ± 3	(5.2 ± 0.2) × 10⁴	14	1.0	14
G41V	1.8 ± 0.4	40 ± 3	(4.5 ± 0.4) × 10⁴	20	1.3	16
S42G	6.3 ± 0.2	34 ± 3	(1.9 ± 0.4) × 10⁵	5.7	1.5	3.9
E43G	16 ± 0.2	25 ± 4	(6.4 ± 0.4) × 10⁵	2.3	2.0	1.1
P44G	3.4 ± 0.3	50 ± 5	(6.8 ± 0.2) × 10⁴	11	1.0	11

Kinetics experiments were performed for the 5(10)-EST substrate for which the chemical steps of the KSI reaction are rate-limiting.[4,58]

By definition, the ratio for WT KSI is 1, as represented by the values in parentheses.

By definition, the ratio for D38E KSI is 1, as represented by the values in parentheses.

Turning to the residues giving the largest effects, Pro39, Val40, and Gly41, we found that the rate reduction due to mutation of each of these residues was considerably smaller in the D38E background than with the WT Asp general base (Figure 7 and Figure S7 of the Supporting Information), suggesting that each of these residues plays a role in positioning the general base for catalysis. Mutation of Pro39 to glycine in the D38E background retained a 13-fold rate decrease, apparently consistent with Pro39 also providing some positioning for the mutant Glu general base. These kinetics results for each loop residue mirror the conservation of each position. A Basic Local Alignment Search Tool (BLAST)[61,62] sequence search for KSI gave 419 sequences with an Asp residue at the position equivalent to Asp38. All of these sequences had proline at position 39 and glycine at position 41; position 40 was highly conserved as a hydrophobic residue (313 Val, 51 Ile, 47 Leu, and 7 Ala residues and 1 Phe residue). Conversely, the amino acids present at positions 42–44, which had the smallest functional impacts, were variable. These observations are broadly consistent with homology studies of β-turns in proteins, which have shown that the analogues of positions 39 and 40 exhibit slight preferences for proline, while the analogues of positions 41 and 42 exhibit slight preferences for glycine.[56,63] Overall, these results lead to the general conclusion that the loop aids in positioning the general base; i.e., the anion–aromatic interactions with Phe54 and Phe116 and the side chain packing with Ala114 are not alone sufficient to favor an optimally positioned general base carboxylate group. These results further suggest that the loop does not act as a single cooperative unit: there are different effects at different positions, and there are different levels of importance of individual residues, relative to the alternative mutated residues, in positioning the general base Asp38. Below we further analyze the effects of each mutation and the role of each loop residue.

Pro39

The effect of Pro39 mutations in both the WT and D38E backgrounds could result from restriction of the conformational freedom of the backbone[64,65] and/or from accessing conformations favored by a cis peptide bond. Proline is unique among the amino acids in its ability to adopt a stable cis isomer of its peptide bond, and the cis conformation is conserved in the two homologous KSIs for which structures are available.[7,66] In contrast, the trans isomer is observed in crystal structures of Pro39 mutants, suggesting that the cis conformation may be important for KSI activity.[60] To learn more about the role of Pro39, we turned to MD simulations. Comparison of the MD simulations of WT KSI and the P39G mutant reveals a greatly perturbed average structure of the loop in the P39G mutant (Figure 5), suggesting that the remainder of the loop must compensate for the trans conformation at the mutant position to bring the loop back to the core β-strands that it connects. Furthermore, the larger rmsfs of the Cα atoms of each loop residue in the P39G mutant, relative to WT (Table 5), and the broader distributions of dihedral angles for the other loop residues for the P39G mutant, relative to WT (Figure 8), are indicative of increased loop flexibility.

Table 5

Root-Mean-Square Fluctuations (angstroms) of Cα Atoms for the Catalytic Base Loop Residues 38–42 Obtained from MD Simulationsa

enzyme	Asp38(Glu)	Pro39(Gly)	Val40(Gly)	Gly41	Ser42(Gly)
WT	0.48 (0.02)	0.58 (0.04)	0.54 (0.05)	1.00 (0.08)	0.72 (0.02)
A114G	0.50 (0.04)	0.60 (0.04)	0.55 (0.03)	1.01 (0.03)	0.76 (0.02)
P39G	0.66 (0.09)	1.00 (0.24)	1.60 (0.13)	2.85 (0.35)	2.04 (0.24)
P39G/V40G/S42G	0.53 (0.12)	0.85 (0.30)	1.20 (0.52)	1.72 (0.70)	1.59 (0.59)
S42G	0.48 (0.02)	0.55 (0.02)	0.53 (0.03)	0.90 (0.04)	1.26 (0.43)
V40G/S42G	0.51 (0.03)	0.75 (0.10)	0.97 (0.23)	1.28 (0.20)	1.51 (0.28)
D38E	0.69 (0.18)	0.75 (0.25)	0.62 (0.11)	0.98 (0.11)	0.90 (0.10)
D38E/A114G	0.53 (0.08)	0.67 (0.08)	0.66 (0.07)	1.18 (0.09)	0.85 (0.10)
D38E/P39G	0.49 (0.03)	0.79 (0.22)	0.90 (0.14)	1.61 (0.21)	1.28 (0.10)
D38E/P39G/V40G/S42G	0.68 (0.05)	1.04 (0.23)	1.88 (0.31)	2.33 (0.46)	1.94 (0.18)

Results obtained by averaging over two 20 ns MD trajectories and two active sites. Standard deviations among the four data sets are shown in parentheses.

Figure 8

Ramachandran plots of the catalytic base loop residues Asp38(Glu), Pro39(Gly), Val40(Gly), Gly41, and Ser42(Gly) obtained from MD trajectories by calculating the probability distribution function averaged over all data sets. The x-axis corresponds to angle φ (degrees), and the y-axis corresponds to angle ψ (degrees). Each row corresponds to the specified mutant, and each column corresponds to the specified residue in the catalytic base loop.

Kinetics experiments were performed for the 5(10)-EST substrate for which the chemical steps of the KSI reaction are rate-limiting.[4,58] By definition, the ratio for WT KSI is 1, as represented by the values in parentheses. By definition, the ratio for D38E KSI is 1, as represented by the values in parentheses. Ramachandran plots of the catalytic base loop residues Asp38(Glu), Pro39(Gly), Val40(Gly), Gly41, and Ser42(Gly) obtained from MD trajectories by calculating the probability distribution function averaged over all data sets. The x-axis corresponds to angle φ (degrees), and the y-axis corresponds to angle ψ (degrees). Each row corresponds to the specified mutant, and each column corresponds to the specified residue in the catalytic base loop. Kinetics experiments were performed for the 5(10)-EST substrate for which the chemical steps of the KSI reaction are rate-limiting.[4,58] By definition, the ratio for WT KSI is 1, as represented by the values in parentheses. By definition, the ratio for D38E KSI is 1, as represented by the values in parentheses. Results obtained by averaging over two 20 ns MD trajectories and two active sites. Standard deviations among the four data sets are shown in parentheses. These results suggest that the P39G mutation increases the conformational flexibility of the backbone in the loop that bears the general base, but a catalytic effect would arise only if the base itself were mispositioned or exhibited increased flexibility. The MD simulations show increased average proton donor–acceptor distances for both proton transfer positions relative to those of the WT (Table 1), and the probability distributions of these distances are broader, as well (Figures S1 and S2 of the Supporting Information), consistent with the rate reductions observed experimentally. These probability distributions and the three-dimensional atomic isodensity surfaces of the catalytic base carboxylate (Figure 3) reveal two types of conformations: a “catalytically competent” type (loosely defined), in which the carboxylic acid group is situated near C6 of the steroid, and a “catalytically incompetent” type, in which the carboxylic acid is disordered and interacts with the solvent.b The most probable distances in each type of conformation are provided in Tables S1 and S2 of the Supporting Information. The functional and MD results together suggest that Pro39 works with the rest of the loop to position the catalytic base such that, when Pro39 is mutated to glycine, the mutant residue adopts the trans configuration, the remaining loop residues become more flexible, and the positioning of the general base side chain is disrupted. The cis conformation is ∼2.5 kcal/mol less stable than the trans conformation for non-prolyl amino acids in solution,[67] providing an estimate for the upper limit of the conformational preference of the remaining loop residues of the Pro39 mutants for the cis conformation relative to the trans conformation.

Val40

A recent study using continuum electrostatics calculations and a structure-based algorithm to predict catalytically important residues in enzymes suggested that remote residues make only small contributions to catalysis in the homologous KSI from Pseudomonas putida.[68] However, we observe a 71-fold rate decrease with the distal V40G mutation (Figure 7). Structural inspection revealed that Val40 is situated at the interface of the KSI dimer and is positioned to pack against Val74 in a loop in the adjacent monomer (Figure 9A), raising the possibility that packing interactions extending across the dimer interface contribute to function. To test whether loop–loop packing interactions across the dimer interface could help structure the loop adjacent to Asp38 and thereby help to position the general base, we determined the effect of the V74G mutation in KSI with the structured WT loop and in a construct with multiple loop residues mutated to glycine (P39G/V40G/S42G) to ensure that loop positioning was disrupted (as will be discussed further below).

Figure 9

Double-mutant cycle analysis designed to test the hypothesis that packing interactions across the dimer interface position the general base in KSI. (A) In WT KSI, Val40 of monomer A (gray) and Val74 of monomer B (gold) are situated near each other. (B) Effects on kcat for mutation of Val74 to Gly with a WT loop sequence and with the loop residues mutated to glycine are shown, where the Gly loop designation refers to the P39G/V40G/S42G mutant background. Values are averages from three or more independent measurements and are from Table 6.

Table 6

Effects of Mutation of Val74, a Residue Situated across the Dimer Interface from the Structured Loop, in WT KSI and in a KSI Mutant with Multiple Residues in the Loop Mutated to Glycinea

enzyme	k_cat (s^–1)	K_M (μM)	k_cat/K_M (M^–1 s^–1)	k_cat ratio (WT/mutant)b	K_M ratio (WT/mutant)b	k_cat/K_M ratio (WT/mutant)b
WT	36 ± 2	50 ± 4	(7.2 ± 0.3) × 10⁵	(1)	(1)	(1)
V74G	6.7 ± 2	35 ± 4	(1.9 ± 0.3) × 10⁵	5.4	1.4	3.8
P39G/V40G/S42G	(4.8 ± 0.6) × 10^–2	55 ± 11	(8.7 ± 0.2) × 10²	750	0.9	830
P39G/V40G/S42G/V74G	(7.5 ± 0.2) × 10^–2	35 ± 2	(2.1 ± 0.2) × 10³	480	1.4	340

Kinetics experiments were performed for the 5(10)-EST substrate for which the chemical steps of the KSI reaction are rate-limiting.[4,58]

By definition, the ratio for WT KSI is 1, as represented by the values in parentheses.

The V74G mutation decreased activity 5-fold with the loop residues intact, whereas the same mutation slightly increased activity in an enzymatic background with the loop residues mutated to glycine (Figure 9B). These data suggest that packing interactions across the dimer interface provide a modest but significant contribution to the positioning of the general base. To gain insight into the effects of the V74G mutation and the residues with which it interacts, we examined the average structures from the MD simulations of each mutant in the residues proximal to Val74. In MD simulations of the P39G mutant (and other mutants containing the P39G mutation), we observe a conformational change in the backbone of residues 74–76 (Figure 10 and Figure S13 of the Supporting Information), whereas the average structure of residues 74–76 is well maintained in MD simulations of WT KSI and mutants not containing the P39G mutation. This observation suggests the existence of a structural link between these sites such that a perturbation in one loop affects the structure of the other across the dimer interface. On the basis of the contributions of the residues in the general base loop to positioning the general base described above, these results are consistent with packing interactions between these loops that affect each loop’s structure and thereby help to position the general base.

Figure 10

Ribbon diagrams of the average structure of WT KSI (left) and the P39G mutant (right), both colored gray, highlighting the conformational changes in the catalytic base loop residues (38–43, green) and the proximal residues across the dimeric interface (74–76, blue). The backbone structures of these two regions in the D38N-equilenin crystal structure (PDB entry 1QJG) are colored red for reference. Structures were aligned by minimization of the rmsd of the Cα atoms in the dimer. An alternative model for the effects of the V40G mutation is suggested by inspection of crystal structures. Val40 is situated to pack against the residues in the β-strand bearing Ala114, which positions the general base via hydrophobic packing interactions with its side chain as described above. Mutation of Val40 might disrupt the positioning of the general base by perturbing the packing interactions with the adjacent β-strand, thereby perturbing the packing interactions between Ala114 and the general base. The 71- and 5-fold rate reductions due to the V40G and V74G mutations, respectively, are smaller than the 200-fold rate reduction due to the A114G mutation, which is consistent with the rate effects of the V40G and V74G mutations acting through Ala114, whereby hydrophobic packing interactions with Val40 and secondary effects from Val74 would help position Ala114, which in turn positions the general base as described above. To probe this alternative model, we examined the average structure from MD simulations of the V40G/S42G mutant. The structure of this mutant exhibits a conformational change in the general base side chain that is similar to the conformational change observed upon mutation of Ala114 (Figure 5). In addition to this change, the average structure of Ala114 in the V40G/S42G mutant is shifted further from the general base side chain relative to its position in the WT enzyme. These observations are consistent with the model in which mutations of the more distal Val40 and Val74 affect the positioning of the general base via perturbation of the packing interactions between the general base Asp38 and the hydrophobic side chain of Ala114. This model leads to the testable prediction that the V40G and V74G mutations will be less deleterious in the A114G background.

Gly41

The loop residue Gly41 was found to be conserved among KSI homologues in a BLAST sequence search and occurs at a position of a β-turn that is frequently glycine according to surveys of the Protein Data Bank.[56,69] Furthermore, KSI crystal structures indicate that the backbone dihedral angles of Gly41 are in a region of the Ramachandran plot that is unfavorable for all other amino acids (Figure 8). The 14- and 20-fold rate decreases observed experimentally for the G41A and G41V mutations, respectively (Table 4), support the hypothesis that adoption of a backbone conformation more favorable for glycine than for the other amino acids at position 41 is important for the loop structure, Asp38 positioning, and thus KSI activity. Furthermore, the smaller 4-fold effect of the G41V mutation in the D38E mutant background suggests that Gly41 helps to position the general base. The similar rate reductions from mutations of Gly41 to alanine and the bulkier valine provide no support for the alternative model in which unfavorable steric overlap involving the side chain is responsible for the rate reductions in these mutants. MD simulations of WT KSI and the D38E mutant reveal the preponderance of a single conformation of Gly41, one that is unfavorable for all other amino acids (Figure 8). We did not perform MD simulations of the Gly41 mutants, because of the unavailability of an appropriate crystal structure, but in simulations of mutants with P39G, multiple conformations of Gly41 are observed, including regions of Ramachandran space that are accessible to all other amino acids and not unique to glycine residues (Figure 8). Gly41 appears to aid general base positioning and catalysis by allowing a loop conformation that requires occupancy of a region of Ramachandran space that is inaccessible to other amino acids.

Ser42, Glu43, and Pro44

The observation, described above, that the overall structure of the loop is important for general base positioning and KSI activity might suggest significant contributions from all of the residues in the loop. In particular, crystal structures of KSI illustrate that the side chain hydroxyl group of Ser42 is positioned to hydrogen bond with the backbone carbonyl group of Pro39, suggesting that this interaction could stabilize the native loop structure. Nevertheless, the S42G mutation led to an only 6-fold rate decrease (Figure 7). The results from MD simulations of the S42G mutant are virtually indistinguishable from those of WT KSI in all analyses performed: donor–acceptor distances (Table 1), atomic isodensity surfaces (Figure S4 of the Supporting Information), and Ramachandran plots (Figure S5 of the Supporting Information). These similarities in the MD simulations are consistent with the small experimental rate reduction. Mutations of the remaining residues comprising the loop, Glu43 and Pro44, led to rate reductions of 2- and 11-fold, respectively, for kcat (Figure 7). Surprisingly, mutation of Glu43 to glycine in the mispositioned D38E background gave effects that were ∼2-fold larger than in the native Asp38 background. Alternate conformations may be allowed with E43G that increase the probability of reaction utilizing the alternative Glu38 general base. Future tests of our understanding might involve attempts to reoptimize catalysis with Glu at position 38 or at other positions in the loop.[70−72]

Multiple Glycine Mutations for Evaluating the Contribution of the Full Loop to General Base Positioning

To obtain an upper bound on the catalytic effect of the loop, we mutated all of the loop residues to a polyglycine loop and conducted a double-mutant cycle analysis as described above. Mutation of the loop to an all-glycine sequence with Asp38 present led to a 1700-fold rate decrease (Figure 11A), and mutation to Glu38 had no additional effect (Figure 11B), indicating that any preferential positioning of Asp at position 38 had been eliminated.

Figure 11

Double-mutant cycle analysis designed to test the collective role of the catalytic base loop in positioning the general base in the KSI active site. (A) Effects on kcat for mutation of the residues in the loop to glycine, where the Gly loop designation refers to the P39G/V40G/S42G/E43G/P44G mutant. (B) Effects on kcat for mutation of the Asp38 general base to Glu with a WT loop sequence and with the residues in the loop mutated to glycine. Values are from Table 7.

Table 7

Effects of Mutating Multiple Loop Residues to Glycine on KSI Kinetic Parameters with an Asp (WT) and Glu (D38E) General Basea

enzyme	k_cat (s^–1)	K_M (μM)	k_cat/K_M (M^–1 s^–1)	k_cat ratio (WT/mutant)b	K_M ratio (WT/mutant)b	k_cat/K_M ratio (WT/mutant)b
WT	36 ± 2	50 ± 4	(7.2 ± 0.3) × 10⁵	(1)	(1)	(1)
P39G	(7.1 ± 0.6) × 10^–1	100 ± 11	(7.1 ± 0.2) × 10³	51	0.5	100
P39G/V40G	(8.6 ± 0.3) × 10^–2	55 ± 12	(1.6 ± 0.3) × 10³	420	0.9	460
P39G/V40G/S42G	(4.8 ± 0.6) × 10^–2	55 ± 11	(8.7 ± 0.2) × 10²	750	0.9	830
P39G/V40G/S42G/E43G	(3.9 ± 0.1) × 10^–2	35 ± 8	(1.1 ± 0.4) × 10³	920	1.4	650
P39G/V40G/S42G/E43G/P44G	(2.1 ± 0.5) × 10^–2	50 ± 16	(4.2 ± 0.4) × 10²	1700	1.0	1700

Kinetics experiments were performed for the 5(10)-EST substrate for which the chemical steps of the KSI reaction are rate-limiting.[4,58]

By definition, the ratio for WT KSI is 1, as represented by the values in parentheses.

By definition, the ratio for D38E KSI is 1, as represented by the values in parentheses.

The 1700-fold effect from conversion to the all-glycine loop (Figure 11A) is larger than the 240-fold effect from mutating Asp to Glu with the WT loop (Figure 11B) and constitutes our best estimate for the full contribution of the loop to positioning, relative to an unpositioned loop. The smaller effect from the D38E mutation alone, with the WT loop present, would be expected if the WT loop helped position Glu to function as the general base, relative to an all-glycine loop. Alternatively, the larger effect on activity from conversion to the all-glycine loop with Asp38 present could arise from secondary structural effects from mutation of the loop that impair catalysis in other ways. However, the similar KM values for WT KSI and the all-glycine loop mutant (Table 7) provide no evidence for binding effects due to the mutations, and several additional observations, described below, also provide no indication of structural rearrangements beyond the general base position. We therefore suggest that the 1700-fold effect provides a reasonable measure of the overall positioning effect from the loop, relative to the unstructured all-glycine loop and in the context of the rest of KSI being folded. Kinetics experiments were performed for the 5(10)-EST substrate for which the chemical steps of the KSI reaction are rate-limiting.[4,58] By definition, the ratio for WT KSI is 1, as represented by the values in parentheses. Kinetics experiments were performed for the 5(10)-EST substrate for which the chemical steps of the KSI reaction are rate-limiting.[4,58] By definition, the ratio for WT KSI is 1, as represented by the values in parentheses. By definition, the ratio for D38E KSI is 1, as represented by the values in parentheses. To directly test whether mutating multiple loop residues to glycine introduces further structural rearrangements, we determined the crystal structures of the P39G/V40G/S42G (PDB entry 3OV4) and D38E/P39G/V40G/S42G (PDB entry 3NM2) mutants at 1.8 and 1.9 Å resolution, respectively. Refinement details are given in Table S4 of the Supporting Information. The catalytic rates of the these mutants are within 2-fold of the catalytic rates of the all-glycine loop mutants, and the D38E mutation has a negligible rate effect in both the P39G/V40G/S42G and all-glycine mutant backgrounds (Table 7), suggesting that these structures are representative of the all-glycine loop mutants. The overall structures of these mutants can be superimposed with that of WT KSI [PDB entry 8CHO(73)] with Cα rmsds of 0.71 and 0.52 Å, respectively, for a single monomer, excluding residues 38–42, suggesting that these mutations do not cause significant rearrangements outside of the loop. We observed poor electron density for the loop backbone atoms in both structures, suggesting that the loop is disordered, as would be expected. This disorder observed in the loop residues appears to affect general base positioning. In the P39G/V40G/S42G mutant, the loop residues have poorly defined electron density, the refined position of the Asp38 carboxylate group in all four chains of the asymmetric unit is displaced by ∼2.5 Å relative to that of the WT (Figure 12), and the normalized B factors in the Asp38 side chain are larger than those in the WT enzyme (Table S5 of the Supporting Information).[74] Furthermore, a distinct refined position of this carboxylate group is found in each asymmetric unit of this structure. These structural data, similar to the functional data presented above, suggest that the all-glycine loop disrupts positioning of the Asp38 general base.

Figure 12

Superposition of the 1.8 Å structure of the P39G/V40G/S42G mutant determined herein (PDB entry 3OV4, carbon atoms colored green) and the previously determined 2.3 Å structure of WT KSI (PDB entry 8CHO, carbon atoms colored blue). In both structures, oxygen atoms are colored red and sulfur atoms yellow. The crystal structure of the P39G/V40G/S42G mutant shows that the base is mispositioned relative to WT KSI in all four chains in the asymmetric unit. The overall root-mean-square deviation between the two structures for backbone atoms is 0.6 Å. For the sake of clarity, only the A chain is shown for the P39G/V40G/S42G structure. The Asp38 carboxylate group is displaced by ∼2.5 Å in the mutant compared to WT KSI, as indicated by the green and blue circles, respectively. To further investigate the positioning of the general base in the polyglycine loop mutants, we performed MD simulations of the P39G/V40G/S42G and D38E/P39G/V40G/S42G mutants and compared these results to those of the single mutants examined above. The calculated rmsfs of Cα atoms in the loop (Table 5) suggest that the rmsfs of the polyglycine mutants are similar to or smaller than those of the single glycine mutant, a result that is somewhat surprising. Ramachandran plots of the residues in the loop (Figure 8), however, illustrate that the polyglycine loop mutants P39G/V40G/S42G and D38E/P39G/V40G/S42G exhibit many local minima in residues 39–42, consistent with the presence of a disordered loop. We caution that while these distributions illustrate an increase in the number of allowed conformations for the polyglycine mutants, they may not be quantitatively accurate on the basis of the limitations of equilibrium MD simulations. The increased flexibility observed in MD simulations of the loop due to polyglycine mutation would impact the rate only if it affects the reactive interface of the general base and the substrate. The average donor–acceptor distances from these simulations are larger than those of the single-glycine mutants and significantly larger than those observed in the WT (Table 1). The distance distributions are slightly broader than those of the single mutants, as well (Figures S1 and S2 of the Supporting Information). These results support the presence of an increased level of disorder in the general base side chain and are consistent with the experimentally observed rate reductions (Table 7). The slightly broader configuration space sampled by the general base can be visualized in the atomic isodensity surfaces (Figure 3). Finally, we asked whether the loop residues act independently or cooperatively to position the general base. In a model of full cooperativity of all of the loop residues, any mutation in the loop would disrupt the positioning of the general base. This was ruled out by the data presented above showing that single mutations in different loop residues gave different effects (Figure 7). Figure 13 shows the results of successively mutating loop residues to Gly. The increasing effect from adding more residues again argues against the unlikely fully cooperative model. On the other hand, the effect from mutation to the all-glycine loop of 1700-fold is much smaller than that predicted from a model of fully independent contributions from each residue. Such a model predicts multiplicative effects from the individual mutations and an effect of 105.7-fold (Table 4). Thus, an intermediate model is supported in which the residue identity at individual loop positions affects the contributions from certain other residues. Accurately describing the detailed nature of the partially cooperative interactions among these loop residues, as well as designing alternative yet effective loop sequences, provides future challenges for both theory and experiment.

Figure 13

Effects of mutating multiple loop residues to glycine on kcat in KSI with a positioned Asp38 general base and a mispositioned Glu (D38E mutant) general base. Values are plotted relative to that of WT KSI on the left-hand side and relative to that of the D38E mutant on the right-hand side. Data are from Table 7.

Conclusions and Implications

We draw enzyme mechanisms showing so-called catalytic residues, but we know that these residues must be placed within the context of a folded protein. Understanding the functional interrelationships between the catalytic groups and their protein surroundings is an important and difficult current challenge in enzymology. Developments in this area have implications for understanding how enzymes have evolved and for developing effective strategies for designing protein and nonprotein catalysts with activities and specificities that rival those of naturally occurring enzymes. Our results reveal interrelationships involving and surrounding a highly efficient general base within an enzyme active site and provide valuable benchmarks to facilitate the development of general and quantitative models for these effects. Prior work provided strong evidence of anion–aromatic interactions between the Asp38 carboxylate oxygen atoms and two active site Phe residues, and such interactions appear to be a common alternative to hydrogen bond networks for positioning catalytic residues.[22] Packing interactions of Ala114 but not Met112 with the Asp38 side chain contribute to its positioning and function. These disparate observations underscore the need to supplement structural studies with functional analysis, and double-mutant cycles as conducted herein are particularly incisive as functional probes. The loop that begins with the general base (Figure 14) also contributes to its positioning, and we have uncovered catalytic contributions as large as 103-fold from this loop. From another perspective, the anion–aromatic and packing interactions are insufficient to optimally position the general base, a conclusion that could not have been drawn from structural inspection alone. Further investigation of the loop contributions, via functional analysis, structural determination, and simulations, revealed differential contributions from individual loop residues and further suggested contributions both from disfavoring alternative, noncatalytic conformations and from limiting the mobility of the general base.

Figure 14

Multiple interactions position the general base in the KSI active site. The structural model shows the residues that have been mutated as sticks. Residues are colored according to the interaction suggested from the functional and structural data to position the general base. The residues of the loop bearing the general base in KSI exhibit partial cooperativity [i.e., mutations of different loop residues have different effects, and mutations of multiple residues have effects beyond mutation of the first (Figure 13)], but these effects fall far short of exhibiting energetic independence. Structurally, given the limited packing of protein loops relative to their interiors, we might generally expect multistate structural behavior of loops, with multiple conformational states explored and mutations affecting the “shape” of the conformational energy landscape, in contrast to a single dominant folded–unfolded transition as in a two-state, fully cooperative process. Protein sectors have been suggested to evolve and function together,[75,76] and evolution has “mixed and matched” domains, as in dehydrogenases that use the same NAD(P)H cofactor but reduce different substrates.[77,78] The KSI loop adjoining Asp38 appears to position this residue to act as a general base without influencing substrate binding or the neighboring oxyanion hole. Identifying functional regions such as this may ultimately define the extent of interactions needed to create a functional catalytic motif, whether through evolution or design, and may allow us to independently alter substrate specificity and/or mutate the catalytic residue into another catalytically active group to perform distinct chemistries. Understanding protein substates, the conformational ensembles that describe them, and how such ensembles can be shaped and limited by engineering and by evolution to allow specificity and catalysis present major current challenges that will likely require a tight feedback loop between experiment and computation. Here we have begun this interaction focusing on the groups interacting with KSI’s general base, and new predictions arise from the MD simulations conducted herein. Nevertheless, we emphasize the need, moving forward, to further develop computational methods that can efficiently provide extensive quantitative and nontrivial predictions that can subsequently and independently be experimentally tested.

67 in total

1. The Protein Data Bank.

Authors: H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal: Nucleic Acids Res Date: 2000-01-01 Impact factor: 16.971

Review 2. Enzymatic mechanisms for catalysis of enolization: ketosteroid isomerase.

Authors: Ralph M Pollack
Journal: Bioorg Chem Date: 2004-10 Impact factor: 5.275

3. Redesigning the substrate specificity of an enzyme by cumulative effects of the mutations of non-active site residues.

Authors: S Oue; A Okamoto; T Yano; H Kagamiyama
Journal: J Biol Chem Date: 1999-01-22 Impact factor: 5.157

4. Ketosteroid isomerase provides further support for the idea that enzymes work by electrostatic preorganization.

Authors: Shina C L Kamerlin; Pankaz K Sharma; Zhen T Chu; Arieh Warshel
Journal: Proc Natl Acad Sci U S A Date: 2010-02-11 Impact factor: 11.205

Review 5. NAD-binding domains of dehydrogenases.

Authors: A M Lesk
Journal: Curr Opin Struct Biol Date: 1995-12 Impact factor: 6.809

6. Use of anion-aromatic interactions to position the general base in the ketosteroid isomerase active site.

Authors: Jason P Schwans; Fanny Sunden; Jonathan K Lassila; Ana Gonzalez; Yingssu Tsai; Daniel Herschlag
Journal: Proc Natl Acad Sci U S A Date: 2013-06-24 Impact factor: 11.205

7. Hot spots for allosteric regulation on protein surfaces.

Authors: Kimberly A Reynolds; Richard N McLaughlin; Rama Ranganathan
Journal: Cell Date: 2011-12-23 Impact factor: 41.582

8. A tale of two isomerases: compact versus extended active sites in ketosteroid isomerase and phosphoglucose isomerase.

Authors: Srinivas Somarowthu; Heather R Brodkin; J Alejandro D'Aquino; Dagmar Ringe; Mary Jo Ondrechen; Penny J Beuning
Journal: Biochemistry Date: 2011-10-04 Impact factor: 3.162

9. Contribution of the hydrogen-bond network involving a tyrosine triad in the active site to the structure and function of a highly proficient ketosteroid isomerase from Pseudomonas putida biotype B.

Authors: D H Kim; D S Jang; G H Nam; G Choi; J S Kim; N C Ha; M S Kim; B H Oh; K Y Choi
Journal: Biochemistry Date: 2000-04-25 Impact factor: 3.162

10. The spatial architecture of protein function and adaptation.

Authors: Richard N McLaughlin; Frank J Poelwijk; Arjun Raman; Walraj S Gosal; Rama Ranganathan
Journal: Nature Date: 2012-10-07 Impact factor: 49.962

8 in total

1. BIOPHYSICS. Response to Comments on "Extreme electric fields power catalysis in the active site of ketosteroid isomerase".

Authors: Stephen D Fried; Steven G Boxer
Journal: Science Date: 2015-08-27 Impact factor: 47.728

2. Oscillatory Active-site Motions Correlate with Kinetic Isotope Effects in Formate Dehydrogenase.

Authors: Philip Pagano; Qi Guo; Chethya Ranasinghe; Evan Schroeder; Kevin Robben; Florian Häse; Hepeng Ye; Kyle Wickersham; Alán Aspuru-Guzik; Dan T Major; Lokesh Gakhar; Amnon Kohen; Christopher M Cheatum
Journal: ACS Catal Date: 2019-10-25 Impact factor: 13.084

Review 3. Electric Fields and Enzyme Catalysis.

Authors: Stephen D Fried; Steven G Boxer
Journal: Annu Rev Biochem Date: 2017-03-24 Impact factor: 23.643

4. A Critical Test of the Electrostatic Contribution to Catalysis with Noncanonical Amino Acids in Ketosteroid Isomerase.

Authors: Yufan Wu; Steven G Boxer
Journal: J Am Chem Soc Date: 2016-09-01 Impact factor: 15.419

Review 5. Computational design of structured loops for new protein functions.

Authors: Kale Kundert; Tanja Kortemme
Journal: Biol Chem Date: 2019-02-25 Impact factor: 4.700

6. Assessment of enzyme active site positioning and tests of catalytic mechanisms through X-ray-derived conformational ensembles.

Authors: Filip Yabukarski; Justin T Biel; Margaux M Pinney; Tzanko Doukov; Alexander S Powers; James S Fraser; Daniel Herschlag
Journal: Proc Natl Acad Sci U S A Date: 2020-12-21 Impact factor: 12.779

7. Accurate positioning of functional residues with robotics-inspired computational protein design.

Authors: Cody Krivacic; Kale Kundert; Xingjie Pan; Roland A Pache; Lin Liu; Shane O Conchúir; Jeliazko R Jeliazkov; Jeffrey J Gray; Michael C Thompson; James S Fraser; Tanja Kortemme
Journal: Proc Natl Acad Sci U S A Date: 2022-03-07 Impact factor: 11.205

8. Bottom-Up Nonempirical Approach To Reducing Search Space in Enzyme Design Guided by Catalytic Fields.

Authors: Wiktor Beker; W Andrzej Sokalski
Journal: J Chem Theory Comput Date: 2020-04-23 Impact factor: 6.006

8 in total