Fanny Sunden1, Ishraq AlSadhan1, Artem Y Lyubimov2, Susanne Ressl3, Helen Wiersma-Koch1,4, Jamar Borland1, Clayton L Brown1, Tory A Johnson1, Zorawar Singh1, Daniel Herschlag1,5. 1. Department of Biochemistry, Beckman Center, Stanford University , Stanford, California 94305, United States. 2. Departments of Molecular and Cellular Physiology, Neurology and Neurological Science, Structural Biology, and Photon Science, Howard Hughes Medical Institute, Stanford University , Stanford, California 94305, United States. 3. Molecular and Cellular Biochemistry Department, Indiana University , Bloomington, Indiana 47405, United States. 4. Department of Biology, Indian River State College , Fort Pierce, Florida 34981, United States. 5. Departments of Chemical Engineering and Chemistry, and Stanford ChEM-H (Chemistry, Engineering, and Medicine for Human Health), Stanford University , Stanford, California 94305, United States.
Abstract
Naively one might have expected an early division between phosphate monoesterases and diesterases of the alkaline phosphatase (AP) superfamily. On the contrary, prior results and our structural and biochemical analyses of phosphate monoesterase PafA, from Chryseobacterium meningosepticum, indicate similarities to a superfamily phosphate diesterase [Xanthomonas citri nucleotide pyrophosphatase/phosphodiesterase (NPP)] and distinct differences from the three metal ion AP superfamily monoesterase, from Escherichia coli AP (EcAP). We carried out a series of experiments to map out and learn from the differences and similarities between these enzymes. First, we asked why there would be independent instances of monoesterases in the AP superfamily? PafA has a much weaker product inhibition and slightly higher activity relative to EcAP, suggesting that different metabolic evolutionary pressures favored distinct active-site architectures. Next, we addressed the preferential phosphate monoester and diester catalysis of PafA and NPP, respectively. We asked whether the >80% sequence differences throughout these scaffolds provide functional specialization for each enzyme's cognate reaction. In contrast to expectations from this model, PafA and NPP mutants with the common subset of active-site groups embedded in each native scaffold had the same monoesterase:diesterase specificities; thus, the >107-fold difference in native specificities appears to arise from distinct interactions at a single phosphoryl substituent. We also uncovered striking mechanistic similarities between the PafA and EcAP monoesterases, including evidence for ground-state destabilization and functional active-site networks that involve different active-site groups but may play analogous catalytic roles. Discovering common network functions may reveal active-site architectural connections that are critical for function, and identifying regions of functional modularity may facilitate the design of new enzymes from existing promiscuous templates. More generally, comparative enzymology and analysis of catalytic promiscuity can provide mechanistic and evolutionary insights.
Naively one might have expected an early division between phosphate monoesterases and diesterases of the alkaline phosphatase (AP) superfamily. On the contrary, prior results and our structural and biochemical analyses of phosphate monoesterase PafA, from Chryseobacterium meningosepticum, indicate similarities to a superfamily phosphate diesterase [Xanthomonas citri nucleotide pyrophosphatase/phosphodiesterase (NPP)] and distinct differences from the three metal ion AP superfamily monoesterase, from Escherichia coli AP (EcAP). We carried out a series of experiments to map out and learn from the differences and similarities between these enzymes. First, we asked why there would be independent instances of monoesterases in the AP superfamily? PafA has a much weaker product inhibition and slightly higher activity relative to EcAP, suggesting that different metabolic evolutionary pressures favored distinct active-site architectures. Next, we addressed the preferential phosphate monoester and diester catalysis of PafA and NPP, respectively. We asked whether the >80% sequence differences throughout these scaffolds provide functional specialization for each enzyme's cognate reaction. In contrast to expectations from this model, PafA and NPP mutants with the common subset of active-site groups embedded in each native scaffold had the same monoesterase:diesterase specificities; thus, the >107-fold difference in native specificities appears to arise from distinct interactions at a single phosphoryl substituent. We also uncovered striking mechanistic similarities between the PafA and EcAP monoesterases, including evidence for ground-state destabilization and functional active-site networks that involve different active-site groups but may play analogous catalytic roles. Discovering common network functions may reveal active-site architectural connections that are critical for function, and identifying regions of functional modularity may facilitate the design of new enzymes from existing promiscuous templates. More generally, comparative enzymology and analysis of catalytic promiscuity can provide mechanistic and evolutionary insights.
Obtaining a fundamental
understanding of how enzymes achieve their
enormous rate enhancements and exquisite specificities and elucidating
how new enzymes have evolved are central goals of biochemistry. Progress
in these areas has biological implications and implications for the
ultimate practical goal of effectively and efficiently designing enzymes
with new and beneficial activities.Site-directed mutagenesis,
coupled with structural analysis, has
been the prevailing approach in enzymology over the past decades.
Nevertheless, approaches beyond subtractive site-directed mutagenesis
are needed because enzyme energetics are non-additive.[1−16] Our recent quantitative dissection of the functional interplay of
five residues in the active site of Escherichia coli alkaline phosphatase (EcAP) provided unique mechanistic
and evolutionary insights, as well as information that may help guide
the design of new enzymes.[17] Nevertheless,
enzyme function is also dependent on its overall context, as the enzyme’s
structure is, of course, needed for catalysis, but it is not yet practical
to comprehensively dissect the interconnections between active-site
residues and the sea of surrounding residues and structural elements.Given the natural complexities of enzymes and these practical limitations,
the study of enzyme superfamilies has been enormously valuable. These
studies have identified common and potentially critical structural
and catalytic elements and, conversely, suggested residues and structural
features that may allow individual enzymes to specialize in different
reactions (e.g., refs (18−20)). Of particular
utility is the observation that enzymes across a superfamily often
exhibit low but substantial activity for reactions of other superfamily
members, a property referred to as ‘catalytic promiscuity.[21,22] Catalytic promiscuity can be used as a comparative tool in elucidating
mechanism and as a probe of factors that may have influenced enzyme
evolution (see refs (23 and 24)).The alkaline phosphatase (AP) superfamily has been particularly
amenable to comparative studies, as members exhibit promiscuity for
multiple reactions catalyzed by other superfamily members.[24−35] We compare PafA, a phosphate monoesterase from Chryseobacterium
meningosepticum, with two other enzymes within the AP superfamily
that share its Zn2+ bimetallo core, one catalyzing the
same reaction and the other catalyzing phosphate diester hydrolysis.
Intriguingly, the PafA monoesterase has similarities to superfamily
phosphate diesterases and differences from the three metal ion AP
superfamily monoesterases such as EcAP. These observations
raise intriguing evolutionary questions and provide compelling opportunities
to obtain mechanistic insights via comparative enzymology and analysis
of catalytic promiscuity.[32]
Experimental Section
Plasmid Design
C. meningosepticum expresses
the alkaline phosphatase gene pafA. This gene has
been modified with a sequence expressing a C-terminal strepII tag
with a Factor Xa cleavage site between the tag and the natural C-terminal
end. The gene was cloned into a pET22b vector (Supporting Information S1). Control experiments with and without
the strepII-tag removed from expressed wild-type (WT) PafA suggest
that the tag has no effect on catalysis; the tag was therefore retained
in the kinetic experiments but cleaved for crystallization.
Expression
and Purification of PafA and NPP
PafA was
expressed from E. coli SM547(DE3) cells containing
the PafAstrepII construct. The cells were grown to an optical density
of 0.6–0.8 OD600 in rich medium and glucose (10
g of tryptone, 5 g of yeast extract, 5 g of NaCl, and 2 g of glucose
per liter) with 50 mg/L carbenicillin at 37 °C. To induce protein
expression, Isopropyl thiogalactopyranoside (IPTG) was added to a
final concentration of 0.3 mM. The cultures were further grown at
30 °C for 16–20 h and were harvested by centrifugation.
The pellets were resuspended in column buffer (100 mM Tris-HCl, pH
8.0, 150 mM NaCl, and 10 μM ZnCl2) and either frozen
for later purification or lysed by passing the suspension through
an EmulsiFlex-C5 apparatus (Avestin, Ottawa, ON) three times. The
lysate was cleared from cell debris by centrifugation (20000g, 20 min), and the supernatant was filtered through a 0.45
μm filter. The filtrate was loaded over a 1 mL gravity column
containing Strep-Tactin Superflow resin (IBA Life
Sciences, Göttingen, Germany). The resin was washed with 6
column volumes of column buffer, and the protein was eluted with 2.5
mM desthiobiotin in column buffer. The purity of the PafA-containing
fractions were assessed with sodium dodecyl sulfate–polyacrylamide
gel electrophoresis (SDS-PAGE) and was >95% pure as estimated by
staining
with Coomassie Blue. Fractions containing purified PafA were pooled
and buffer exchanged into storage buffer (10 mM sodium MOPS, pH 8.0,
50 mM NaCl, and 100 μM ZnCl2).Mutant F91A/L123A/Y205A
of NPP, from Xanthomonas axonopodis pv citri, was expressed and purified from a construct containing an N-terminal
maltose binding protein (MBP) fusion and C-terminal strepII tags with
a Factor Xa cleavage site between it and the natural C-terminal end
of NPP, as previously described.[36] The
NPP mutant was expressed and purified over a Strep-Tactin Superflow column as described above for PafA. Fractions containing
purified NPP were pooled and buffer exchanged into storage buffer
(100 mM sodium MOPS, pH 8.0, 150 mM NaCl, and 100 μM ZnCl2).
Kinetic Assays of PafA Variants
Activity measurements
for all enzymes were performed at 25 °C in a Perkin-Elmer UV/vis
Lambda 25 spectrophotometer (Perkin Elmer, Waltham, MA) in 0.1 M sodium
MOPS (or/and Tris-HCl), pH 8.0, 0.5 M NaCl, 100 μM ZnCl2, unless otherwise noted. Hydrolysis of the substrates containing
a p-nitrophenolate leaving group (pNPP and me-pNPP, Chart ) were followed by
assaying the formation of free p-nitrophenolate continuously
at 400 nm (Scheme ). Rate constants were determined from initial rates, and the activity
of the free enzyme, kcat/KM, was determined. The kinetic parameters were shown to
be first-order in both enzyme and substrate concentration, with concentrations
varied over at least a 5-fold range. For pNPP, substrate concentrations
high enough to obtain KM and kcat values were achievable, and these values are also
reported; the concentration of pNPP was varied at least 5-fold below
and above the measured KM in all cases.
The following pNPP concentrations were used: WT, 2.0 × 10–2–2.5 mM; T79S, 0.55–110 μM; K162A,
1.1 × 10–3–2.5 mM; R164A, 1.1 ×
10–3–4.5 mM. Fits had R2 values
of >0.98 in all cases. Reported errors were estimated from at least
two independent kinetic measurements. Comparisons with independent
enzyme preparations for each of the PafA mutants gave values within
the reported error.
Chart 1
Substrates Used in This Study
Scheme 1
Reaction Scheme for Phosphate Ester Hydrolysis
The KM value for
pNPP of the N100A
mutant was too low to measure directly at pH 8.0. We therefore used
an indirect method described previously.[24] Briefly, we used an inhibitor (tungstate), whose inhibition constant
was determined using a substrate with a much higher KM (me-pNPP), to raise the observed KM value in these pNPP reactions. Apparent values for kcat and KM were
obtained at 2.1, 6.3, and 12.5 mM tungstate. As expected for competitive
inhibition, the kcat values were constant
(and the same as obtained in the absence of tungstate), and the observed KM values increased with increasing tungstate
concentration. From these observed values, KM for the pNPP reaction was calculated using eq , the measured KM,apparent, and the independently determined Ki.Values of kcat/KM for Me-P (Chart ) were obtained using a discontinuous kinetic
assay, following the formation of free phosphate by a Malachite green
assay (as described in ref (38)). For the N100A and T79S mutants, the KM was too low, given the sensitivity of this assay to
obtain reliable kcat/KM values. Tungstate could not be used as above to raise
the KM and allow determination of kcat/KM because the
Malachite Green assay is not compatible with tungstate. We therefore
measured the activity of these mutants at pH 9.0, where the KM is higher, and extrapolated to pH 8.0 using
the ratio of pNPP kcat/KM values at pH 8.0 and pH 9.0 (eq ; Supporting Information S2). As a control, the same measurements were carried out for WT
PafA, and the measured and calculated values agreed within 2-fold.Hydrolysis rates for phenyl phosphate (Ph-P)
were measured by following production of free phenol. Production of
free phenol was assayed as phenolate by taking aliquots of the reaction
at specified time points and quenching in equal volume of 0.1 M sodium
hydroxide. Absorbance was measured at 278 nm with 500 nm as a reference
wavelength on a Tecan Infinite 200 PRO Microplate Reader (Tecan, Männedorf,
Switzerland), against a standard curve of phenol (Sigma-Aldrich, St.
Louis, MO).The following buffers were used for pH dependences:
MES (pH 6.0),
MOPS (pH 7.0), CHES (pH 9.0), CAPS (pH 10.0) at 100 mM, each as a
sodium salt in the presence of 500 mM NaCl and 100 μM ZnCl2, to ensure that there were no changes in pH-dependences or
protonation states introduced by the mutations (Supporting Information S2).Inhibition constants were
determined for tungstate, vanadate, and
inorganic phosphate (Pi) in 0.1 M sodium MOPS, pH 8.0,
0.5 M NaCl, 100 μM ZnCl2, using subsaturating concentrations
of me-pNPP (Chart ). The following inhibitor concentrations were used: WT, 0.16–28
μM vanadate, 1.6 × 10–4–13 mM
tungstate, 0.13–5.0 mM Pi; T79S, 16–450 μM
vanadate, 6.3 × 10–2–13 mM tungstate,
6.3 × 10–3–1.2 mM Pi; N100A,
6.3 × 10–2–13 mM vanadate, 6.3 ×
10–2–13 mM tungstate, 0.2–13 mM Pi; K162A, 16–450 μM vanadate, 6.3 × 10–2–13 mM tungstate, 0.38–13 mM Pi; R164A, 1.6 × 10–2–13 mM vanadate,
6.3 × 10–2–13 mM tungstate, 0.38–13
mM Pi.Inhibition constants for vanadate were also
measured with pNPP
for WT PafA and the T79S, K162A, and R164A mutants to test if the
observed activity arose from the same active site (Supporting Information S3). As the KM for pNPP hydrolysis by N100APafA at pH 8.0 is very small,
the Ki for tungstate was measured at pH
9.0 (0.1 M sodium CHES, 0.5 M NaCl, 100 μM ZnCl2)
for me-pNPP and pNPP. The inhibition constant was calculated with
a non-linear least-squares fit to the data using eq for competitive inhibition.
Accounting for Non-chemical
Rate-Limiting Steps and Limits in kcat/KM Measurements
As previously
observed for WT EcAP,[39] the kcat/KM values for pNPP by WT PafA and a subset of
its mutants are within the range observed when substrate binding,
rather than a chemical step, is rate limiting. To allow comparisons
of catalysis by PafA and its mutants that assess the ability of these
enzymes to carry out the chemical step, we used information from intrinsically
less reactive substrates so that comparisons reflect effects on substrate
binding and the subsequent chemical step and not simply association
kinetics. We followed the approach previously taken by O’Brien
et al. for EcAP.[34,39]In principle
we could obtain and compare kcat/KM values for all PafA variants with a poorly
reactive substrate, and thereby avoid any need to correct values for
pNPP or other more activated substrates. However, reactivity of the
least reactive PafA mutant, K162A, was too low to measure with Me-P.
Nevertheless, we were able to obtain an upper limit for this mutant
with Me-P and kcat/KM values for all of the other PafA variants, and all of the
conclusions herein could be made by these comparisons alone. Further,
we used comparisons between substrates of differing intrinsic reactivity
to obtain a (kcat/KM)Me-P estimate for K162APafA relative to
the other variants. The only underlying assumption is that each variant
has the same ratio of chemical catalysis for the substrates, and this
assumption is supported by comparisons of several variants and substrates
(Supporting Information S4).
Occupancy of
Metals and Phosphate in the PafA Active Site and
Structure
To test for possible metal ion concentration dependent
activation, such as observed for EcAP mutant E322Y,[24] WT PafA was incubated with the following metal
concentrations: 10 μM ZnCl2, 100 μM ZnCl2, 500 μM ZnCl2, 1.0 mM MgCl2 and
100 μM ZnCl2 in 10 mM sodium MOPS, pH 8.0, and 50
mM NaCl at 25 °C. No activation was observed after a week of
incubation. The occupancy of metals and phosphorus in the WT PafA
active site was determined with atomic emission spectroscopy as described
previously.[24] The following ratios were
obtained: Zn2+:protein = 3.1; Mg2+:protein =
0.012; Ca2+:protein = 0.53; Pi:protein = 2.9.
The metal ion occupancies suggested that Mg2+ does not
bind to the PafA active site, consistent with the lack of activation
by 1 mM MgCl2. The Zn2+ value is consistent
with occupancy of the Zn2+-bimetallo site and partial occupancy
of one or both of the remote Zn2+ sites observed in the
crystal structure (see below). Although the results suggest half occupancy
of a Ca2+ ion, addition of Ca2+ (1 mM) did not
increase activity.
Crystallization and Crystallographic Data
Collection for WT
PafA
The affinity tag was cleaved from PafA by incubating
WT PafA with factor Xa (New England BioLabs, Ipswich, MA) for 4 days
in 50 mM Tris-HCl, pH 8.0, 200 mM NaCl, 10 mM maltose, 5 mM CaCl2. PafA was buffer exchanged into 10 mM Tris-HCl, pH 8.0, 50
mM NaCl, 100 μM ZnCl2 and concentrated to 2.5 mg/mL.
Crystal growth was achieved via the hanging drop method using vapor
diffusion where equal volumes of PafA and precipitant solution (22%
polyethylene glycol (PEG) MW 3350, 0.1 M sodium acetate, pH 4.4, 0.2
mM ammonium sulfate) were mixed on a cover slide and placed over a
reservoir of 1 mL of precipitant solution at room temperature. Crystals
were harvested using nylon loops (Hampton Research, Aliso Viejo, CA)
or LithoLoops (Molecular Dimensions Inc., Altamonte Springs, FL) and
immediately flash frozen into liquid nitrogen; the high concentration
of PEG MW 3350 in the precipitation solution was determined to be
sufficient for cryopreservation. Crystallographic data were collected
at the Stanford Synchrotron Radiation Laboratory (SSRL) Beamline 11-1.
Radiation sensitivity of the crystals was immediately apparent during
data collection, as the overall intensity of recorded reflections
decreased with concurrent rapid increase of mosaicity after ∼20–40
5-second exposures. To alleviate this problem, we collected data sets
from seven PafA crystals and merged high-quality frames into a single
data set.
Processing of Diffraction Data and Structure Determination
Raw diffraction images were indexed and integrated using iMosflm,[40,41] followed by a merging and scaling step using Pointless and Aimless.[42] The final
merged data set included ∼20–40 initial frames from
each of the seven separate crystals and extended to 1.7 Å. The
L-test[43] detected significant twinning,
with the twin fraction of 0.242; as a result, the merging R-values
appear somewhat inflated. The final merging statistics are provided
in Table . The structure
was solved by molecular replacement (MR) using Phaser,[44,45] with the modified structure of SPAP (PDB ID: 3Q3Q) used as a search
model. MR was successful only when most of the divergent regions were
omitted, with essentially the conserved alpha-beta core retained in
the search model. The maps were steadily improved with multiple rounds
of incremental manual modeling of the best-resolved portions in Coot[46] alternating with automated refinement in phenix.refine.[47] A set of twin
operators (−h, –k, –l) was utilized throughout the refinement. The final
model included nearly the entire expressed PafA monomer, with the
exception of 12 residues of the N-terminus and 1 residue of the C-terminus.
At this stage, clear electron density could be seen for the two active-site
Zn2+ ions as well as two additional Zn2+ sites
at the PafA exterior. Once these were modeled, a strong residual electron
density feature in the active site could be identified as a phosphate
moiety covalently attached to Thr79. The final model converged with Rwork/Rfree of 15.9%/17.7%,
are lower than typical for structures at this resolution (as reported
by the Phenix statistical tool[48]), as a
result of twinning.[49] The model exhibited
good stereochemistry and reasonable refinement statistics as per Molprobity[50] as well as the Polygon tool[48] in the Phenix suite of software.[47] Refinement statistics are summarized in Table .
Table 1
X-ray Crystallographic
Data Collection
and Refinement Statistics
Data Collection
space group
I4
unit cell axes
a, b, c (Å)
113.8, 113.8, 71.4
α,
β, γ (deg)
90, 90, 90
resolution range (Å)
36.0–1.7
Rmerge (%)
28.7 (50.0)
Rpim (%)
13.1 (52.0)
⟨I⟩/⟨σI⟩
8.4 (3.2)
completeness (%)
96.6 (85.3)
multiplicity
5.4 (1.9)
CC1/2
95.8% (41.6%)
PafA, EcAP, and NPP Sequence Comparisons
EMBOSS Stretcher
was used to align the amino acid sequence translated from the genes
for PafA (UniProtKB/Swiss-Prot: AF157621.2) with SPAP (UniProtKB/Swiss-Prot: A1YYW7.1), NPP
(UniProtKB/Swiss-Prot: AAM37669.1), and EcAP,
(UniProtKB/Swiss-Prot: AAG54729.1) using BLOSUM 62 (see Supporting Information S5). Structures were overlaid
using the Chimera[51] alignment tool Matchmaker.
The following PDB codes were used: SPAP, 3Q3Q; EcAP, 1ALK; and NPP, 2RH6, using the Smith–Waterman
algorithm (local), BLOSUM-50, and a secondary structure score of 80%.
The structural alignment was used to make a multiple sequence alignment
(Supporting Information S5).
Results
The PafA
Structure
Crystals of PafA were obtained,
and its structure was determined to 1.7 Å resolution (Table ). As expected based
on sequence similarities, its core architecture matches that of EcAP, NPP, and other members of the AP superfamily (Figure A,B, gray and white),
and its Zn2+ bimetallo center is highly homologous to those
of NPP and EcAP (Figure ; see Supporting Information S5 for a structure-based sequence alignment).
Figure 1
Global structure and
active site of PafA, NPP, and EcAP. (A) Comparison
of the global structure of PafA, NPP, and EcAP. The
structurally conserved core and bimetallo Zn2+ (spheres)
are shown in gray, conserved zinc ligands are
shown in black, and auxiliary domains that are not conserved between
all three enzymes are shown in color; PafA in green, NPP in magenta,
and EcAP in blue. The active-site nucleophile (threonine
or serine) is shown in blue, and other active-site residues and EcAP Mg2+ (sphere) are shown in orange. (B) Topographic
diagrams for PafA, NPP, and EcAP, with arrows representing
β-sheets and rods representing α-helices. Gray represents
conserved sheets and white conserved helices, and the auxiliary regions
that are not conserved between all three enzymes are colored: PafA,
helices in olive, sheets in green; NPP, helices in purple, sheets
in magenta; and EcAP, helices in light blue, sheets
in blue. Structures are from 3tg0 (EcAP,[97]) and 2gso (NPP,[32]) with ligands removed for clarity.
Figure 2
Active-site schematics for PafA, NPP, and EcAP.
The enzymes have a conserved bimetallo core (black, including the
threonine or serine nucleophile), conserved zinc ligands (gray), and
additional non-universally conserved active-site residues, which are
colored for each enzyme. The phosphoryl group, in orange, is shown
with partial bonding to the nucleophile and its leaving group (OR),
corresponding to the reaction’s presumed transition state.
Global structure and
active site of PafA, NPP, and EcAP. (A) Comparison
of the global structure of PafA, NPP, and EcAP. The
structurally conserved core and bimetallo Zn2+ (spheres)
are shown in gray, conserved zinc ligands are
shown in black, and auxiliary domains that are not conserved between
all three enzymes are shown in color; PafA in green, NPP in magenta,
and EcAP in blue. The active-site nucleophile (threonine
or serine) is shown in blue, and other active-site residues and EcAP Mg2+ (sphere) are shown in orange. (B) Topographic
diagrams for PafA, NPP, and EcAP, with arrows representing
β-sheets and rods representing α-helices. Gray represents
conserved sheets and white conserved helices, and the auxiliary regions
that are not conserved between all three enzymes are colored: PafA,
helices in olive, sheets in green; NPP, helices in purple, sheets
in magenta; and EcAP, helices in light blue, sheets
in blue. Structures are from 3tg0 (EcAP,[97]) and 2gso (NPP,[32]) with ligands removed for clarity.Active-site schematics for PafA, NPP, and EcAP.
The enzymes have a conserved bimetallo core (black, including the
threonine or serine nucleophile), conserved zinc ligands (gray), and
additional non-universally conserved active-site residues, which are
colored for each enzyme. The phosphoryl group, in orange, is shown
with partial bonding to the nucleophile and its leaving group (OR),
corresponding to the reaction’s presumed transition state.The overall structure and active-site
configuration of PafA closely
match those of Sphingomonas sp. BSAR-1 (SPAP), an
AP superfamily member closely related to PafA (Supporting Information S6). As noted by Bihani et al., SPAP
shares specific active-site features with the AP superfamily phosphodiesterases
such as NPP.[52] These similarities are also
seen with PafA (Figure ). PafA, SPAP, and NPP share a nucleophilic threonine, while EcAP has a serine, and PafA, SPAP, and NPP contain a homologous
active-site asparagine that appears to make identical interactions
with oxygen atom (O1) of the transferred phosphoryl group
(Figure , Scheme , Supporting Information S5). EcAP lacks this
asparagine residue, and, although the PafA, SPAP, and EcAP active sites all have arginine and lysine residues, the arginine
and lysine residues in the EcAP active site originate
from distinct helices and loops and make distinct interactions (Figures B and 2, Supporting Information S5 and
S6, ref (52)).
Scheme 2
Transition-State Model for Alkaline Phosphatase Superfamily Reactions
PafA was previously isolated
and expressed by Berlutti et al.,[53] and
initial structure–function studies
were carried out by Bihani et al. with SPAP.[52,54] Our studies build upon this prior research and are consistent with
their results.
PafA’s Reaction Specificity for Phosphate
Monoester versus
Diester Hydrolysis
Based on structural inspection, SPAP and
PafA are expected to be phosphate monoesterases, as they lack a binding
pocket for the second diester substituent and have positively charged
residues situated to interact with both non-bridging oxygen atoms
of phosphomonoester substrates (Figure , Supporting Information S6, ref (52)). To
formally test this expectation, we followed the PafA hydrolysis of
phosphate monoester and diester substrates, pNPP and me-pNPP, respectively
(Chart ). We observed
a preference of ∼105-fold for catalysis of the monoester
reaction (Table ,
WT). This value represents a lower limit for the preferential chemical
catalysis, as binding rather than the chemical step appears to be
rate limiting for the pNPP reaction (see Experimental
Section, Accounting for Non-chemical Rate-Limiting Steps and
Limits in kcat/KM Measurements, and Supporting Information S4), and as the methyl substituent of the diester substrate does
not make binding interactions made with substituents of physiological
diesterase substrates.[24,32,55]
Table 2
PafA Phosphomonoesterase and Diesterase
Reaction Kineticsa
pNPP
me-pNPPc
(kcat/KM)pNPP
enzyme
kcat/KM (M–1 s–1)
krelb
KM (M)
kcat (s–1)
me-pNPP (M–1 s–1)
krelb
(kcat/KM) Me-pNPP
WT
1.6(0.4) × 106
(1)
1.9(0.4) × 10–4
3.0 × 102
1.7(0.5) × 101
(1)
9.4 × 104
T79S
6.3(1.5) × 105
2.5
2.9(0.3) × 10–6
1.8
6.6(2.2) × 10–1
26
9.5 × 106
N100A
7.6(2.0) × 105
2.1
5.1(1.3) × 10–7
3.9 × 10–1
1.1(0.4) × 10–1
1.5 × 102
6.9 × 106
K162A
1.3(0.1) × 102
1.2 × 104
2.3(0.8) × 10–4
3.0 × 10–2
1.3(0.1) × 101
1.3
1.0 × 101
R164A
1.3(0.4) × 105
1.2 × 101
5.1(1.5) × 10–5
6.6
1.4(0.1) × 101
1.2
9.3 × 103
Assay conditions:
100 mM sodium
MOPS, pH 8.0, 500 mM NaCl, 100 μM ZnCl2.
krel values
are kcat/KM for WT divided by kcat/KM for each mutant. By definition the value for
WT PafA is one.
(KM)Me-pNPP ≫ 1.0 mM
for WT and all mutants.
Assay conditions:
100 mM sodium
MOPS, pH 8.0, 500 mM NaCl, 100 μM ZnCl2.krel values
are kcat/KM for WT divided by kcat/KM for each mutant. By definition the value for
WT PafA is one.(KM)Me-pNPP ≫ 1.0 mM
for WT and all mutants.
Effect
of Active-Site Mutations on PafA Catalysis
We
next assessed the effect of PafA active-site mutations on the phosphate
monoesterase and diesterase activities. Each PafA active-site residue
that appears to contact the phosphoryl group, N100, K162, and R164,
was mutated to alanine, and the nucleophilic threonine residue, T79,
was mutated to serine. Values of kcat/KM were first determined with a p-nitrophenoxide leaving group (Table ; pNPP and me-pNPP, Chart ). Whereas the diester reactions were uniformly
slow, and thus very likely limited by the chemical step, the high kcat/KM values of
∼106 M–1 s–1 for pNPP, the monoester substrate, were within the range seen for
rate constants for substrate binding to enzymes.[56] Furthermore, mutations that gave 20–200-fold effects
on the non-cognate diesterase reaction had essentially no effect on
the cognate monoesterase reaction for a subset of the mutations (Table , i.e., T79S and N100A).
This result is most simply accounted for by a non-chemical rate-limiting
step that masks effects on the chemical step for the pNPP monoester
reaction for WT PafA (Supporting Information S4).To further investigate the effects of the mutations on
phosphomonoester hydrolysis we therefore carried out reactions for
each PafA variant with less reactive phosphate monoester substrates,
phenyl phosphate (Ph-P) and methyl phosphate (Me-P; Chart , Table ). The larger mutational effects observed
with the less reactive substrates supported the model of rate limiting
binding for reaction of pNPP with the faster mutants. Hence, we used
Me-P to compare the PafA mutants. As only an upper limit could be
obtained for the Me-P reactivity of the most affected mutant (K162A),
we used the observed constant ratio of reactivity of the different
substrates to estimate this kcat/KM value (see Experimental Section and Supporting Information S4); use of
this value rather than the limit does not affect the conclusions drawn
herein.
Table 3
kcat/KM Values for Phosphomonoester Hydrolysis by
WT and Mutant PafA Enzymesa
enzyme
pNPPa (M–1 s–1)
krelb
Ph-Pa (M–1 s–1)
krelb
Me-Pa (M–1 s–1)
krelb
WT
1.6(0.4) × 106
(1)
8.3(1.2) × 105
(1)
8.5(0.6) × 104
(1)
T79S
6.3(1.5) × 105
2.5
1.4 × 105b
5.9
3.8 × 103b
22
N100A
7.6(2.0) × 105
2.1
1.5 × 105b
5.5
2.2 × 102b
3.9 × 102
K162A
1.3(0.1) × 102
1.2 × 104
8.2(0.1)
1.0 × 105
1.8 × 10–2c
4.7 × 106
R164A
1.3(0.4) × 105
1.2 × 101
1.0(0.3) × 104
8.3 × 101
4.2(0.3) × 101
2.0 × 103
Assay conditions:
100 mM sodium
MOPS, pH 8.0, 500 mM NaCl, 100 μM ZnCl2, unless annotated
(pH 9.0) for which 100 mM sodium CHES, pH 9.0, 500 mM NaCl, 100 μM
ZnCl2 was used.
kcat/KM was
measured at pH 9.0 rather than
pH 8.0 for these substrates and mutants, as the low KM values at pH 8.0 prevented measurement of accurate kcat/KM values. The kcat/KM at pH 8.0
were estimated using the ratio of kcat/KM for pNPP at pH 8.0 and pH 9.0, as
described in the Experimental Section (see
Kinetic Assays of PafA Variants).
The pH
9.0 rate constants are given in Supporting Information Table S4.
Estimated assuming
constant ratios
of reactivity ratios for the substrates as described in the Experimental Section (see Accounting for Non-chemical
Rate-Limiting Steps and Limits in kcat/KM Measurements). The measured limit
is ≤2.0 × 10–1 M–1 s–1.
Assay conditions:
100 mM sodium
MOPS, pH 8.0, 500 mM NaCl, 100 μM ZnCl2, unless annotated
(pH 9.0) for which 100 mM sodium CHES, pH 9.0, 500 mM NaCl, 100 μM
ZnCl2 was used.kcat/KM was
measured at pH 9.0 rather than
pH 8.0 for these substrates and mutants, as the low KM values at pH 8.0 prevented measurement of accurate kcat/KM values. The kcat/KM at pH 8.0
were estimated using the ratio of kcat/KM for pNPP at pH 8.0 and pH 9.0, as
described in the Experimental Section (see
Kinetic Assays of PafA Variants).The pH
9.0 rate constants are given in Supporting Information Table S4.Estimated assuming
constant ratios
of reactivity ratios for the substrates as described in the Experimental Section (see Accounting for Non-chemical
Rate-Limiting Steps and Limits in kcat/KM Measurements). The measured limit
is ≤2.0 × 10–1 M–1 s–1.Given the uniformly low kcat/KM values with the phosphate diester substrate,
additional precautions and controls are required to ensure that a
contaminant that is physically undetectable, but highly proficient
at the alternative reaction, does not account for the observed promiscuous
activity. We therefore determined inhibition constants for the phosphate
monoester and diester reactions (Supporting Information S3). For each PafA variant, the same inhibition constant was observed
for the phosphate mono- and diester reactions, suggesting that catalysis
of both reactions arises from the same active site. Also, different
inhibition constants were observed for different mutants, providing
further support for the conclusion that the promiscuous diesterase
activities arose from PafA and its variants (Table ).
Table 4
Inhibition of WT
and Mutant PafA by
Ground- and Transition-State Analoguesa
Pi
tungstate
vanadate
enzyme
Ki (mM)
Krel
Ki (mM)
Krel
Ki (mM)
Krel
WT
8.3(1.0) × 10–1
(1)
3.9(0.1) × 10–3
(1)
6.7(0.4) × 10–4
(1)
T79S
9.6(0.4) × 10–3
1.2 × 10–2
2.8(0.8) × 10–1
7.2 × 101
9.0(0.2) × 10–2
1.3 × 102
N100A
1.3(0.2)
1.6
5.7(2.3) × 10–1
1.5 × 102
1.9(0.1) × 10–1
2.8 × 102
K162A
4.7(1.2)
5.7
>13
>3.3 × 103
7.9(2.6) × 10–2
1.2 × 102
R164A
>13
>16
1.3(0.4)
3.3 × 102
3.0(0.7) × 10–1
4.5 × 102
Conditions: 100
mM sodium MOPS,
pH 8.0, 500 mM NaCl, 100 μM ZnCl2. Obtained as described
in Experimental Section.
Conditions: 100
mM sodium MOPS,
pH 8.0, 500 mM NaCl, 100 μM ZnCl2. Obtained as described
in Experimental Section.
Effect of Active Site Mutations on Binding
of Ground- and Transition-State
Analogues
For WT PafA and each of the active-site mutants,
we determined the inhibition constants for Pi and for two
potential transition-state analogues, vanadate and tungstate (Table ). The T79S mutation,
which decreases catalysis (Table ) and, correspondingly, decreases binding of both transition-state
analogues, lead to an increase in Pi affinity
(Table , decreased Ki). The paradoxical stronger binding accompanied
by weaker catalysis is accounted for by an electrostatic destabilization
model, akin to that observed for EcAP,[57,58] as described in the Discussion. There is
also a remarkable congruence of the mutational effects on activity
and binding of these ligands to PafA and EcAP, despite
the lack of conservation between the mutated residues, as also described
in the Discussion.
Comparative Enzymology
of PafA versus NPP
As noted
by Bihani et al. for SPAP, and emphasized above for PafA, these phosphate
monoesterases share significant homology with AP superfamily diesterases
such as NPP, despite their specialization to catalyze a different
reaction (Figures and 2, Table ). Accordingly, we created mutant versions of PafA
and NPP that remove the active-site residues that are distinct between
these enzymes and retain the common Zn2+ bimetallo center,
threonine nucleophile, and active-site asparagine, and then we compared
their activities (Figure A). For diesterase activity, we used me-pNPP, as its methyl
substituent minimizes interactions between the substituent of the
transferred phosphoryl group and the diesterase (NPP) binding site
for this substituent.[32,36] Whereas wild-type PafA and NPP
show large preferences for monoester and diester substrates, respectively
(Figure B,C), the
mutants of each show no significant preference (≤3-fold) between
these reactions and also exhibit extensive catalysis of ∼1012-fold for both reactions (Table ).
Figure 3
Comparative kinetics for wild-type and “common”
PafA
and NPP variants. (A) Schematics of the active residues in PafA (green)
and NPP (magenta) that are removed to give mutants with common active-site
residues. The residues removed interact with substrate oxygen O2 in the PafA monoesterase reaction (Scheme ) and the R′ group attached to this
oxygen atom in the NPP diesterase reaction. (B) kcat/KM values for WT PafA,
WT NPP, and the common core mutants. Phosphomonoester reactions are
shown by black bars (pNPP) and phosphodiester reactions (me-pNPP)
by gray bars. Rate constants are from Table . (C) The ratio of phosphomonoester to phosphodiester
hydrolysis for WT PafA and WT NPP (gray) and for the common core mutants
(black). The ratio of these ratios gives the relative specificity
for the enzyme.
Table 5
Comparison
of Rate Constants for WT
and Mutants of PafA and NPPa
kcat/KM (M–1 s–1)
rate enhancementd
enzyme
pNPP
me-pNPP
krelc
pNPP
me-pNPP
WT PafA
1.6(0.4) × 106
1.7(0.5) × 101
9.4 × 104
5.3 × 1016
5.3 ×
1012
WT NPPb
1.1
230
4.8 × 10–3
3.7 × 1010
7.0 × 1013
PafA K162A/R164A
16(6)
10(0.1)
1.6
5.3 × 1011
3.0 × 1012
NPP F91A/L123A/Y205A
1.6(0.1)
2.6(0.4)
0.6
5.3 × 1010
7.9 × 1011
Assay conditions: 100 mM sodium
MOPS, pH 8.0, 500 mM NaCl, 100 μM ZnCl2.
Rate constants from ref (32).
Calculated as the ratio of (kcat/KM)pNPP/(kcat/KM)Me-pNPP.
Rate enhancements are
calculated
as (kcat/KM)/kW using the water hydrolysis rate
constants of KWpNPP = 3.0 × 10–11 M–1 s–1 and KWMe-pNPP =
3.3 × 10–12 M–1 s–1.[66]
Comparative kinetics for wild-type and “common”
PafA
and NPP variants. (A) Schematics of the active residues in PafA (green)
and NPP (magenta) that are removed to give mutants with common active-site
residues. The residues removed interact with substrate oxygenO2 in the PafA monoesterase reaction (Scheme ) and the R′ group attached to this
oxygen atom in the NPP diesterase reaction. (B) kcat/KM values for WT PafA,
WT NPP, and the common core mutants. Phosphomonoester reactions are
shown by black bars (pNPP) and phosphodiester reactions (me-pNPP)
by gray bars. Rate constants are from Table . (C) The ratio of phosphomonoester to phosphodiester
hydrolysis for WT PafA and WT NPP (gray) and for the common core mutants
(black). The ratio of these ratios gives the relative specificity
for the enzyme.Assay conditions: 100 mM sodium
MOPS, pH 8.0, 500 mM NaCl, 100 μM ZnCl2.Rate constants from ref (32).Calculated as the ratio of (kcat/KM)pNPP/(kcat/KM)Me-pNPP.Rate enhancements are
calculated
as (kcat/KM)/kW using the water hydrolysis rate
constants of KWpNPP = 3.0 × 10–11 M–1 s–1 and KWMe-pNPP =
3.3 × 10–12 M–1 s–1.[66]
Discussion
Enzymes of the AP superfamily catalyze a range
of reactions, with
its bimetallo branch containing phosphate monoesterases and diesterases,
along with phosphomutases and phosphonoacetate hydrolases.[24−27,32,33,59,60] Naively, one
might expect an early division between phosphate monoesterase and
diesterase superfamily members, such that there would be similarities
between all monoesterases and persistent distinctions between these
enzymes and the diesterases. However, PafA and SPAP are phosphate
monoesterases with similarities to superfamily phosphate diesterases
and differences from the three metal ion AP superfamily monoesterases
such as E. coli AP (EcAP) (Figures and 2; Supporting Information S6; ref (52), comment (61)). These observations raised
intriguing evolutionary questions and provided us with compelling
opportunities to obtain mechanistic insights via comparative enzymology
and analysis of catalytic promiscuity.[32]
Evolution
of Phosphate Monoesterases with Distinct Functional
Properties
Why are there AP superfamily phosphate monoesterases
with distinct active-site architectures, and why would one more resemble
the AP superfamily phosphate diesterases? EcAP comes
from a gut bacteria (E. coli), whereas PafA is derived
from a bacterium found in soil and salt and fresh water (C.
meningosepticum).[53,62] Further, phosphate
is often a scarce resource for growth, and there is complex regulation
of Pi uptake and storage and roles of phosphate in stress
and pathogenesis.[63,64] The different catalytic constants
we observe for PafA and EcAP are consistent with
different metabolic evolutionary pressures, arising from different
lifestyles that favored distinct active-site architectures.EcAP exhibits very strong product inhibition by
Pi, with a Ki in the low micromolar
and more than 3 orders of magnitude stronger than that for PafA (Figure A). The strong binding
could be functional—for example, limiting enzyme turnover until
Pi that has been produced can be cleared from the periplasm
and utilized. Alternatively, the strong product inhibition may be
a necessary trade-off to achieve the ∼4-fold higher kcat/KM for EcAP relative to PafA (Figure B).[65]
Figure 4
Comparison
of EcAP and PafA kinetics. (A) Values
of KM for phosphomonoester hydrolysis
by WT PafA (black) and EcAP (gray) and inhibition
constants (Ki) for binding the product,
inorganic phosphate (Pi). Values are from Tables and 4 and refs (39 and 57). (B) kcat/KM values for
phosphomonoester (Me-P) hydrolysis by WT PafA (black) and EcAP (gray). Values are from Table and ref (17).
Comparison
of EcAP and PafA kinetics. (A) Values
of KM for phosphomonoester hydrolysis
by WT PafA (black) and EcAP (gray) and inhibition
constants (Ki) for binding the product,
inorganic phosphate (Pi). Values are from Tables and 4 and refs (39 and 57). (B) kcat/KM values for
phosphomonoester (Me-P) hydrolysis by WT PafA (black) and EcAP (gray). Values are from Table and ref (17).While it is always possible that the functional differences
between EcAP and PafA arose from trapping of a suboptimal
evolutionary
solution as a local maximum on the enzyme’s fitness landscape
for one of enzymes, there is also an intriguing possibility that E. coli and a subset of other bacteria utilize strong product
inhibition for phosphate homeostasis and signaling in ways that remain
to be elucidated. Our results open new biological questions that would
be indiscernible without quantitative comparative enzymology.
Modular
Active-Site Architecture Facilitates Transitions between
AP Superfamily Phosphate Monoesterases and Diesterases
Within
the EcAP active site, R166 interacts with both of
the phosphoryl oxygen atoms that face away from the Zn2+ bimetallo site (Figure ). Consistent with this structural picture, mutation of R166
is deleterious for both phosphate monoesterase and diesterase reactions
but is more deleterious for monoesterase reactions.[24] Thus, if R166 of EcAP were mutated to
allow the introduction of residues that interact with a diester R′
substituent, interactions with the other, non-esterified oxygen atom
would be compromised, and the promiscuous diester reaction would be
adversely affected. Based on these observations, the EcAP active-site architecture would be expected to impede evolutionary
transitions between mono- and diesterases, and these observations
may account for the absence of diesterase superfamily members that
are highly related to EcAP. For analogous reasons, EcAP might be a suboptimal starting point for engineering
a new phosphodiesterase.In contrast, residues within PafA’s
active site interact with either of the two phosphoryl
oxygen atoms facing away from the Zn2+ bimetallo center
(O1 and O2 in Scheme ). Oxygen atom O1 accepts hydrogen
bonds from the side chain of N100 and the backbone amide of T79 (the
active-site nucleophile), and O2 accepts hydrogen bonds
from K162 and R164 (Figure ). Correspondingly, mutation of N100 gives a uniform deleterious
effect on monoester and diester hydrolysis, suggesting that this interaction
facilitates both reactions, and mutations of K162 and R164 are deleterious
only to phosphate monoester hydrolysis, with essentially no effect
on the diester reaction (Figure A). Further, the opposite behavior is observed for
NPP—i.e., mutations of the residues that constitute the R′
binding pocket are deleterious to NPP’s diesterase activity
but not to its monoesterase activity.[36]
Figure 5
Effect
of PafA mutations on phosphate monoester and diester hydrolysis.
(A) Ratio of kcat/KM values for PafA mutants for phosphomonoester (blue) and phosphodiester
(yellow) reactions. Reactions are for Me-P and me-pNPP (Chart ), and kinetic values are from Tables and 3. (B) Ratio of the mutational effects on the phosphate monoester
and diester reactions, with larger values representing larger effects
on the monoester relative to diester reaction. The phosphoryl oxygen
atom (Scheme ) that
each residue interacts with is noted and is based on the structural
schematic of Figure .
Effect
of PafA mutations on phosphate monoester and diester hydrolysis.
(A) Ratio of kcat/KM values for PafA mutants for phosphomonoester (blue) and phosphodiester
(yellow) reactions. Reactions are for Me-P and me-pNPP (Chart ), and kinetic values are from Tables and 3. (B) Ratio of the mutational effects on the phosphate monoester
and diester reactions, with larger values representing larger effects
on the monoester relative to diester reaction. The phosphoryl oxygen
atom (Scheme ) that
each residue interacts with is noted and is based on the structural
schematic of Figure .The presence of distinct interactions
with each of the two phosphoryl
oxygen atoms or substituents in PafA and NPP allow interactions around
one of the phosphoryl oxygen atoms to be altered and optimized for
phosphate monoester or diester catalysis, while the interactions with
the phosphoryl oxygen atom that contributes equally to both reactions
can be maintained. We propose that the structural and functional modularity
of the phosphoryl oxygen interactions facilitated evolutionary transitions
between phosphate mono- and diesterases across the AP superfamily.[63] Information of this type about the functional
architecture of active sites may also be powerful in guiding efforts
to reengineer enzymes for new functions.
Beyond the O2 and R′ Site in PafA and NPP:
Extensive Sequence Variation in These Enzymes’ Scaffolds Does
Not Contribute to Reaction Specificity
The Zn2+ bimetallo site is a hallmark of a major branch of the AP superfamily
and, as is generally the case for conserved features of superfamilies,
is thought to play the same role across different reactions.[30,31] For the AP Zn2+ bimetallo site, one Zn2+ presumably
deprotonates and positions the nucleophilic serine or threonine, the
other Zn2+ stabilizes charge accumulation on the leaving
group oxygen atom, and there are also preferential interactions with
the non-bridging oxygen atom that is situated between the Zn2+ ions in the transition state (Scheme ).[66]Nevertheless,
the transition states for phosphate monoester and diester reactions
differ in bond lengths and charge distributions, so the catalytic
contributions from the Zn2+ interactions could differ quantitatively.[66] In particular, the different scaffolds surrounding
the conserved Zn2+ ions and ligands could, in principle,
adjust their positions or tune their charge densities to favor interactions
with a phosphate monoester or diester transition state.[67] There is no structural evidence to support such
differences,[67] but the differences could
be small, transient, or only in electric field.We therefore
tested whether the PafA and NPP scaffolds, which differ
in >80% of their residues (Supporting Information S5), provide functional specialization for each enzyme’s
cognate reaction. We created “common” mutant forms of
PafA and NPP with the disparate residues interacting with the O2 atom in PafA and the R′ group in NPP removed but the
rest of the surrounding PafA and NPP scaffolds maintained (Figure A). Remarkably, whereas
WT PafA and NPP differ in their preference for reaction with phosphate
mono- and diesters by >107-fold, the mutant versions
catalyze
both reactions equally well (Figure B,C; the 107-fold difference in preference
was calculated as the ratio of the ratios of the specificity for pNPP
over me-pNPP for each enzyme in Figure C; see also ref (68)). Thus, despite having evolved to carry out different cognate
reactions and despite very low sequence identity, PafA and NPP have
essentially identical specificities after removal of only five characteristic
local active-site residues—three from NPP and two from PafA
(Figure A). While
this represents the simplest and most likely model, the possibility
of secondary conformational rearrangements of either enzyme that coincidentally
lead to the same activity will require structural comparisons.As the rate enhancements of these mutants are substantial (∼1011–1012-fold) and rivaling enhancements observed
for many fully evolved enzymes (Table ),[69], and as the >107-fold difference in native specificities appears to arise
from interactions at a single phosphoryl position, these scaffolds
seem to provide excellent jumping off points for evolving or engineering
highly efficient and specific phosphate monoesterases and diesterases.
Comparisons across the AP Superfamily Suggest Active-Site Roles
for Specific Structural Elements
As emphasized in the Introduction,
understanding enzyme function will require the identification of structural
connections beyond the active site. Indeed, the notion that the enzyme
scaffold positions binding and catalytic groups for catalysis is the
fundamental precept of enzymology (e.g., refs (70−73)). While the importance of elements beyond the active site is clear,
identifying their roles has been difficult. For example, most attempts
to convert one enzyme to another start from highly related enzymes
with a finite set of amino acid differences and have used trial-and-error
strategies,[74−76] and engineering efforts have generally not reproduced
natural catalytic efficiencies.[77−81] Here, comparisons across the AP superfamily allowed us to identify
two structural elements that may make specific functional connections
to the active site.As emphasized above, PafA and NPP catalyze
distinct reactions but have common interactions with phosphoryl oxygen
atom O1 (Figures and 3A). Strikingly, PafA and NPP
share an inserted structural element that is not present in EcAP, which catalyzes the same reaction as PafA but has
distinct O1 interactions. This insertion is situated between
conserved sheets 6 and 7 in PafA and NPP and sits directly behind
the common asparagine residue in their three-dimensional structures
(Figure B and Supporting Information S7). Given this observation,
we surveyed all 27 bimetallo AP superfamily members for which high-resolution
structures are available. Of these, 10 contained a residue homologous
to N100 of PafA and N111 of NPP, and in all cases these enzymes also
contained an insertion at the position observed for PafA and NPP.
Further, none of the enzymes lacking a corresponding asparagine contained
this insertion (Supporting Information S8, Table ). This correspondence
suggests that the inserted structural elements play a common role
in positioning the active-site asparagine for its interactions with
an oxygen O1 of the transferred phosphoryl group (Figures and 2).There is also evidence for a helical element that
may govern phosphate
monoesterase activity. PafA and EcAP contain a helix
(4a) that is inserted after β sheet 3 and is absent in NPP (Figure B; Supporting Information S9). Within or adjoining this helix
are residues interacting with O2 in PafA (K162 and R164)
and EcAP (D153) as well as the EcAP Mg2+ ligand T155 (Figures B and 2). Repeating
the above structural analysis revealed that helix 4a is present in
all structurally characterized AP superfamily members that catalyze
reactions of monosubstituted phosphates and is absent in all members
that catalyze reactions of phosphate diesters (Supporting Information S8, Table ). Additional elements accompany the helix
4a insertion (e.g., helices 4a-e and β sheet 4a for PafA; helices
4a-g, and β sheet 4a-b for EcAP; Figure B) but are absent in the diesterases,
perhaps because their role is to position helix 4a and the nearby
active-site residues, akin to the structural elements behind the conserved
asparagine of PafA and NPP described above. Thus, helix 4a and the
remainder of this monoester-specific insert may help position active-site
residues for interactions with O2 of monosubstituted phosphate
substrates.The absence of the 4a helix in diesterases could
reflect the absence
of selective pressure to maintain it, or it could be selected against
to provide space for a bulky diester substituent on O2 and
allow a more remote binding site to be sculpted for that substituent.
Most generally, understanding whether active-site elements work cooperatively
with adjacent structural elements or distinctly will help elucidate
the evolutionary history of enzymes within and beyond the AP superfamily
and may also suggest effective modular approaches to the problem of
engineering new enzymes.
Mechanistic Commonalities between EcAP and
PafA Phosphate Monoesterases
For catalysis to occur, an enzyme
must stabilize a reaction’s
transition state to a greater extent than the reaction’s ground
state; otherwise, if both are stabilized to the same extent, the reaction
barrier is the same on the enzyme as off of the enzyme so that there
is no catalysis.[82−86] In essence, such destabilization occurs in all enzymatic reactions,
as fixation of reactants in the active site can be considered as entropically
destabilizing, even though quantification of this destabilization
has proven elusive. Prior experiments with EcAP showed
that Pi binds orders of magnitude stronger to EcAP mutants when the anionic serine residue is removed. These and
additional data provided strong evidence for electrostatic ground-state
destabilization and a quantitative estimate of at least 103-fold for this effect.[57,58]Our results with
PafA suggest that its active site harnesses analogous ground-state
destabilization. Mutation of the nucleophilic threonine to serine
decreases catalysis and weakens binding to vanadate and tungstate
transition-state analogues[87,88] by ∼100-fold
(Figure B), similar
to the ∼20-fold decrease in activity (Figure ) as expected for transition-state analogues
(see also Discrimination between Transition-State
Analogues in AP Superfamily Active Sites Suggests Precise Transition-State
Recognition, below). But in contrast, this mutation increases inhibition by Pi by >100-fold (Figure A, Table ).
Figure 6
Evidence for electrostatic
ground-state destabilization. (A) Ratio
of affinities (Ka) for binding of the
ground-state analogue Pi to PafA active-site mutants, relative
to WT PafA. The dashed line represents a ratio of one, the value for
WT PafA by definition. The arrow for R164A indicates an upper limit.
Data are from Table . (B) Ratio of affinities (Ka) for binding
of putative transition-state analogues vanadate (black) and tungstate
(gray) to PafA active-site mutants, relative to WT PafA. The dashed
line represents a ratio of one, the value for WT PafA by definition.
The arrow for K162A indicates an upper limit. Data are from Table . (C) Model for electrostatic
ground-state destabilization in WT PafA that is partially relieved
in the T79S mutant due to lessened conformational restriction of the
oxyanion of this residue. This model predicts lowered reactivity,
weakened binding of transition-state analogues, and stronger binding
of a ground-state analogue, as is observed. Only the T79S mutation
exhibits this set of effects.
Evidence for electrostatic
ground-state destabilization. (A) Ratio
of affinities (Ka) for binding of the
ground-state analogue Pi to PafA active-site mutants, relative
to WT PafA. The dashed line represents a ratio of one, the value for
WT PafA by definition. The arrow for R164A indicates an upper limit.
Data are from Table . (B) Ratio of affinities (Ka) for binding
of putative transition-state analogues vanadate (black) and tungstate
(gray) to PafA active-site mutants, relative to WT PafA. The dashed
line represents a ratio of one, the value for WT PafA by definition.
The arrow for K162A indicates an upper limit. Data are from Table . (C) Model for electrostatic
ground-state destabilization in WT PafA that is partially relieved
in the T79S mutant due to lessened conformational restriction of the
oxyanion of this residue. This model predicts lowered reactivity,
weakened binding of transition-state analogues, and stronger binding
of a ground-state analogue, as is observed. Only the T79S mutation
exhibits this set of effects.These effects are predicted from the ground-state electrostatic
destabilization model shown in Figure C. The decreased activity and decreased transition-state
analogue binding are predicted from loosening the positioning of the
nucleophilic oxyanion, resulting from a loss of interactions with
the methyl group of T79. The reaction and formation of covalent vanadate
and tungstate adducts now requires overcoming an additional barrier
for proper positioning and are correspondingly less favorable. In
contrast, greater freedom of motion of the seryl oxyanion (Figure C, right) allows
it to move away from anionic phosphate ligand, thereby lessening electrostatic
repulsion and allowing the observed increase in Pi binding.
The 80-fold stronger binding corresponds to 2.2 kcal/mol of destabilization
energy, providing a lower limit for the extent of ground-state destabilization,
as some destabilization may remain in the mutant.[89]Interestingly, while EcAP and PafA
both appear
to utilize electrostatic ground-state destabilization to facilitate
catalysis, the interactions that are responsible for positioning the
oxyanion for nucleophilic attack and providing ground-state repulsion
differ for the two enzymes. EcAP has a nucleophilic
serine and so lacks the methyl group of PafA’s threonine residue
that contributes to oxyanion positioning. Understanding the strategies
used to position the threonyl and seryl nucleophiles will help uncover
distinct evolutionary solutions to a common problem and may help reveal
the types and extent of interactions needed to efficiently engineer
new enzymes.Another functional analogy between PafA and EcAP that involves distinct interactions was suggested from
our mutagenesis
results. Prior results revealed a ∼105-fold rate
decrease from removal of the active-site Mg2+ of EcAP (via mutation of one of its ligands, E322; Figure )[24] and showed that this Mg2+ ion acts as part of
a functional network with K328, D153, and a set of bound water molecules
(Figure A, blue).[17,24] Such a large effect is rare for a group not directly involved in
chemical catalysis (like the serine nucleophile) and implies important
functional role of this interaction network. Thus, it was particularly
noteworthy to observe a 106-fold rate decrease in phosphate
monoester catalysis upon mutation of K162 (Figure A), a residue that has no direct role in
PafA’s reaction chemistry (Figure ) and, similar to the observation of EcAP’s Mg2+ removal, had little effect
on PafA’s diesterase activity (Figure A).
Figure 7
Analogous hydrogen bond networks in PafA (A)
and EcAP (B). Each active-site schematic highlights
a hydrogen bond network,
that involving the active-site Mg2+ of EcAP and its associated network (A, blue) or K162 of PafA (B, green).
The red lines highlight the analogous interactions made with other
active-site elements. The common conserved Zn2+ bimetallo
sites, including the serine or threonine nucleophile, are shown in
black, and other active-site residues are shown in gray.
Analogous hydrogen bond networks in PafA (A)
and EcAP (B). Each active-site schematic highlights
a hydrogen bond network,
that involving the active-site Mg2+ of EcAP and its associated network (A, blue) or K162 of PafA (B, green).
The red lines highlight the analogous interactions made with other
active-site elements. The common conserved Zn2+ bimetallo
sites, including the serine or threonine nucleophile, are shown in
black, and other active-site residues are shown in gray.Intriguingly, although there is no structural homology
or sequence
identity in this region of PafA and EcAP, analogous
interaction networks can be drawn for K162 of PafA and the Mg2+ ion of EcAP (Figure ). In PafA (Figure B), K162 donates a hydrogen bond to the O2 phosphoryl oxygen atom, like the Mg2+·OH2 of EcAP (Figure A). K162 also hydrogen bonds to D38, the
residue homologous to D51 of EcAP that is also a
Mg2+ ligand. In addition, K162hydrogen bonds to D305,
the Zn1 ligand that is homologous to D327 of EcAP.Given the remarkable correspondence of active-site connections
and similar large functional effects, the Mg2+ ion of EcAP and K162 of PafA may play analogous important functional
roles and may represent distinct evolutionary solutions to analogous
structural and functional challenges within AP superfamily active
sites. Structural inspection suggests that these groups make connections
within the active site that may simultaneously optimize the position
of the two Zn2+ ions, the threonine or serine nucleophile,
and the transferred phosphoryl group, via one of its non-bridging
oxygen atoms (Figure ). These observations may help define the type and degree of interconnections
that are needed to engineer and re-engineer highly efficient enzymes.
Discrimination between Transition-State Analogues in AP Superfamily
Active Sites Suggests Precise Transition-State Recognition
A recent high-resolution crystal structure of WT EcAP revealed that tungstate, like vanadate, binds as a covalent pentavalent
adduct, mimicking the reaction’s transition-state geometry.[88] Extensive analysis of the catalytic properties
of 20 EcAP variants relative to the binding of Pi, vanadate, and tungstate showed that tungstate binding mirrors
activity much more closely than Pi binding and also more
closely than vanadate binding.[88] Thus,
tungstate meets the most stringent criterion for acting as a transition-state
analogue, although no stable analogue can perfectly mimic a transient,
partially bonded transition state.[90]We also see a better correlation of activity and binding for tungstate
for the five PafA variants studied herein (Figure ). Tungstate gives a higher coefficient of
determination (Pearson R2) than vanadate
and Pi and a best-fit slope closer to one (Table ), as expected for a transition-state
analogue.[90,91] Similarly, when the correlation slope is
fixed at one (Figure ), the RMSD is considerably lower for tungstate than vanadate or
Pi (Table ). Figure and Table include the results
for EcAP (semi-transparent squares), highlighting
that tungstate more closely mimics the behavior expected for a transition-state
analogue with both EcAP and PafA.
Figure 8
Correlation of activity
and affinity for ground- and transition-state
analogues. Correlations of activity values (kcat/KM measured with Me-P normalized
to WT PafA, and kcat/KM measured with pNPP normalized to WT EcAP) with inhibition (Ka = 1/Ki) by inorganic phosphate (Pi), tungstate,
and vanadate. Data for PafA (circles) from Table [(kcat/KM)Me-P] and Table ; data for EcAP (light squares) from ref (88). The solid lines are fits to the PafA data with fixed slopes
of 1, as expected for a transition-state analogue;[90,91] the rmsd given is for deviation from these lines; and Pearson’s R2 values are also listed. Statistical tests
were carried out as described by Peck et al.,[88] and additional statistical parameters for PafA and EcAP are given in Table . Values at the vertical dashed lines marked with arrows represent
lower limits for affinity for AP variants and were not included in
the fits. Similarly PafA points that are limits are denoted with an
arrow and were excluded from fitting of the data. The PafA mutants
are color-coded as noted in the attached legend. The EcAP mutants are specified and color-coded as follows: WT, white; mutants
with the Mg2+ ion removed (E322Y or E322A), red; all other
mutants, blue; these data represent combinations of mutations at five
active-site residues, and the individual data points can be found
in ref (88).
Table 6
Statistical Test
for Transition Analog
Behavior.a
phosphate
tungstate
vanadate
PafA
EcAP
PafA
EcAP
PafA
EcAP
Pearson’s r2
0.54
0.74
0.94
0.94
0.57
0.48
orthogonal
distance regression
mb
3.8
3.3
1.3
1.4
3.4
3.8
rmsdc
0.82
0.54
0.28
0.28
0.75
0.66
constrained regression (m = 1)
rmsdc
1.41
1.15
0.35
0.41
1.25
1.04
Analysis of data
presented in Figure .
Slope of best fit line
determined
from orthogonal distance regression.
Root-mean-squared deviation (RMSD)
calculated from the orthogonal distances between data points and the
linear fit with either a variable slope (m) or a
slope fixed at m = 1.
Correlation of activity
and affinity for ground- and transition-state
analogues. Correlations of activity values (kcat/KM measured with Me-P normalized
to WT PafA, and kcat/KM measured with pNPP normalized to WT EcAP) with inhibition (Ka = 1/Ki) by inorganic phosphate (Pi), tungstate,
and vanadate. Data for PafA (circles) from Table [(kcat/KM)Me-P] and Table ; data for EcAP (light squares) from ref (88). The solid lines are fits to the PafA data with fixed slopes
of 1, as expected for a transition-state analogue;[90,91] the rmsd given is for deviation from these lines; and Pearson’s R2 values are also listed. Statistical tests
were carried out as described by Peck et al.,[88] and additional statistical parameters for PafA and EcAP are given in Table . Values at the vertical dashed lines marked with arrows represent
lower limits for affinity for AP variants and were not included in
the fits. Similarly PafA points that are limits are denoted with an
arrow and were excluded from fitting of the data. The PafA mutants
are color-coded as noted in the attached legend. The EcAP mutants are specified and color-coded as follows: WT, white; mutants
with the Mg2+ ion removed (E322Y or E322A), red; all other
mutants, blue; these data represent combinations of mutations at five
active-site residues, and the individual data points can be found
in ref (88).Analysis of data
presented in Figure .Slope of best fit line
determined
from orthogonal distance regression.Root-mean-squared deviation (RMSD)
calculated from the orthogonal distances between data points and the
linear fit with either a variable slope (m) or a
slope fixed at m = 1.We
noted previously that a subset of EcAP variants—mutants
that removed the active-site Mg2+ ion addressed in the
preceding section—were predominantly responsible for the poorer
correlation of binding and activity for vanadate than tungstate (Figure , red squares).[88] Intriguingly, for PafA, the K162 mutant deviated
substantially with vanadate but fell closer to the line with tungstate
(Figure , red circles,
the tungstate value is a limit). This result extends the analogy between
K162 of PafA and the active-site Mg2+ ion of EcAP (Figure ) and
suggests a high degree of precision to discriminate between tungstate
and vanadate species within the active site of these extremely efficient
enzymes.
General Implications
Over the past
decades, there has been an enormous amount learned
about how enzymes catalyze reactions, in terms of the underlying chemical
mechanisms, the residues and cofactors that facilitate those reactions,
and the overall structural context in which these reactions take place.
Nevertheless, there is also an enormous amount that we do not yet
understand, as exemplified by the limited ability to engineer new
enzymes and the prevalence of trial-and-error approaches.Given
that we can identify the residues directly involved in binding
and catalysis, the next stage of understanding will entail deciphering
how protein scaffolds and interaction networks assemble and position
active-site residues for catalysis. Enzyme superfamilies provide sets
of enzymes with functional distinctions in similar overall structural
contexts,[92,93] and catalytic promiscuity provides a means
to probe the functional consequences of the structural elements and
sequence features that differ between superfamily members.[24] Thus, superfamily members provide sufficient
“contrast” to make meaningful comparisons, but sufficient
similarities to allow inferences to be drawn from their juxtaposition,
and catalytic promiscuity allows systematic and multi-dimensional
functional analysis across superfamily members and properties.Propitiously, enzymes of the AP superfamily have been highly amenable
to comparative approaches (e.g., refs (32, 35, 94, and 95)). We have taken advantage of the occurrence
of distinct AP superfamily phosphate monoesterases, one of which has
features similar to the superfamily diesterases that are absent in
another monoesterase (Figures and 2). Multi-faceted comparisons
of structure, sequence, and cognate and promiscuous reactions of these
superfamily members and their mutants in this work and previously
have extended our mechanistic understanding, have led to models for
the roles of active site and more distal structural elements, and have suggested evolutionary driving
forces and adaptations.[17,24,32,34−36,39,52,53,57,59,60,88,96−99]While this work represents modest steps toward
a deeper and more
comprehensive understanding of enzyme assembly and function, we are
heartened by how many insights have arisen by making comparisons across
superfamilies of cognate and promiscuous reactions. This approach
to a mechanistic puzzle might be likened, abstractly, to an algebraic
problem. In algebra one needs the number of equations to at least
match the number of variables; for the complex problem of enzyme mechanism,
we require a large number of comparisons. In addition, as in algebra
where the equations used need to be orthogonal to one another, different
types of comparisons are needed to assess different aspects of mechanism.
Structural, sequence, and functional comparisons of superfamily members,
in conjunction with comparisons of cognate and promiscuous activities,
provide complementary and powerful approaches toward the goal of understanding
the features and factors responsible for assembly of an enzyme and
for its catalytic function. While this enormously complex problem
will undoubtedly require many studies and the development of new approaches,
the partial structural and functional modularity suggested herein
and by prior results[17] may simplify the
path to the robust engineering of novel, highly effective enzymes.
Authors: Elena Bobyr; Jonathan K Lassila; Helen I Wiersma-Koch; Timothy D Fenn; Jason J Lee; Ivana Nikolic-Hughes; Keith O Hodgson; Douglas C Rees; Britt Hedman; Daniel Herschlag Journal: J Mol Biol Date: 2011-10-28 Impact factor: 5.469
Authors: T Geoff G Battye; Luke Kontogiannis; Owen Johnson; Harold R Powell; Andrew G W Leslie Journal: Acta Crystallogr D Biol Crystallogr Date: 2011-03-18
Authors: Charlotte M Miton; Stefanie Jonas; Gerhard Fischer; Fernanda Duarte; Mark F Mohamed; Bert van Loo; Bálint Kintses; Shina C L Kamerlin; Nobuhiko Tokuriki; Marko Hyvönen; Florian Hollfelder Journal: Proc Natl Acad Sci U S A Date: 2018-07-16 Impact factor: 11.205
Authors: Gloria Komazin; Michael Maybin; Ronald W Woodard; Thomas Scior; Dominik Schwudke; Ursula Schombel; Nicolas Gisch; Uwe Mamat; Timothy C Meredith Journal: J Biol Chem Date: 2019-11-08 Impact factor: 5.157
Authors: Ian D E A Lidbury; Chiara Borsetto; Andrew R J Murphy; Andrew Bottrill; Alexandra M E Jones; Gary D Bending; John P Hammond; Yin Chen; Elizabeth M H Wellington; David J Scanlan Journal: ISME J Date: 2020-11-30 Impact factor: 10.302
Authors: Ian D E A Lidbury; David J Scanlan; Andrew R J Murphy; Joseph A Christie-Oleza; Maria M Aguilo-Ferretjans; Andrew Hitchcock; Tim J Daniell Journal: Proc Natl Acad Sci U S A Date: 2022-02-01 Impact factor: 11.205
Authors: Abhishek Srivastava; Daniel E M Saavedra; Blair Thomson; Juan A L García; Zihao Zhao; Wayne M Patrick; Gerhard J Herndl; Federico Baltar Journal: ISME J Date: 2021-05-28 Impact factor: 10.302