N-Acetylneuraminic acid lyase (NAL) is a Class I aldolase that catalyzes the reversible condensation of pyruvate with N-acetyl-d-mannosamine (ManNAc) to yield the sialic acid N-acetylneuraminic acid (Neu5Ac). Aldolases are finding increasing use as biocatalysts for the stereospecific synthesis of complex molecules. Incomplete understanding of the mechanism of catalysis in aldolases, however, can hamper development of new enzyme activities and specificities, including control over newly generated stereocenters. In the case of NAL, it is clear that the enzyme catalyzes a Bi-Uni ordered condensation reaction in which pyruvate binds first to the enzyme to form a catalytically important Schiff base. The identity of the residues required for catalysis of the condensation step and the nature of the transition state for this reaction, however, have been a matter of conjecture. In order to address, this we crystallized a Y137A variant of the E. coli NAL in the presence of Neu5Ac. The three-dimensional structure shows a full length sialic acid bound in the active site of subunits A, B, and D, while in subunit C, discontinuous electron density reveals the positions of enzyme-bound pyruvate and ManNAc. These 'snapshot' structures, representative of intermediates in the enzyme catalytic cycle, provided an ideal starting point for QM/MM modeling of the enzymic reaction of carbon-carbon bond formation. This revealed that Tyr137 acts as the proton donor to the aldehyde oxygen of ManNAc during the reaction, the activation barrier is dominated by carbon-carbon bond formation, and proton transfer from Tyr137 is required to obtain a stable Neu5Ac-Lys165 Schiff base complex. The results also suggested that a triad of residues, Tyr137, Ser47, and Tyr110 from a neighboring subunit, are required to correctly position Tyr137 for its function, and this was confirmed by site-directed mutagenesis. This understanding of the mechanism and geometry of the transition states along the C-C bond-forming pathway will allow further development of these enzymes for stereospecific synthesis of new enzyme products.
N-Acetylneuraminic acid lyase (NAL) is a Class I aldolase that catalyzes the reversible condensation of pyruvate with N-acetyl-d-mannosamine (ManNAc) to yield the sialic acid N-acetylneuraminic acid (Neu5Ac). Aldolases are finding increasing use as biocatalysts for the stereospecific synthesis of complex molecules. Incomplete understanding of the mechanism of catalysis in aldolases, however, can hamper development of new enzyme activities and specificities, including control over newly generated stereocenters. In the case of NAL, it is clear that the enzyme catalyzes a Bi-Uni ordered condensation reaction in which pyruvate binds first to the enzyme to form a catalytically important Schiff base. The identity of the residues required for catalysis of the condensation step and the nature of the transition state for this reaction, however, have been a matter of conjecture. In order to address, this we crystallized a Y137A variant of the E. coli NAL in the presence of Neu5Ac. The three-dimensional structure shows a full length sialic acid bound in the active site of subunits A, B, and D, while in subunit C, discontinuous electron density reveals the positions of enzyme-bound pyruvate and ManNAc. These 'snapshot' structures, representative of intermediates in the enzyme catalytic cycle, provided an ideal starting point for QM/MM modeling of the enzymic reaction of carbon-carbon bond formation. This revealed that Tyr137 acts as the proton donor to the aldehyde oxygen of ManNAc during the reaction, the activation barrier is dominated by carbon-carbon bond formation, and proton transfer from Tyr137 is required to obtain a stable Neu5Ac-Lys165 Schiff base complex. The results also suggested that a triad of residues, Tyr137, Ser47, and Tyr110 from a neighboring subunit, are required to correctly position Tyr137 for its function, and this was confirmed by site-directed mutagenesis. This understanding of the mechanism and geometry of the transition states along the C-C bond-forming pathway will allow further development of these enzymes for stereospecific synthesis of new enzyme products.
Aldolases catalyze a fundamental
reaction in biology, namely, the formation or cleavage of carbon–carbon
bonds. Examples are found in many pathways of both central and secondary
metabolism. The basic aldol reaction involves the condensation of
an aldehyde (the aldol acceptor) with a ketone (the aldol donor) to
form the new carbon–carbon bond. Aldolases are generally highly
stereoselective in their catalysis and form the new carbon–carbon
bond while controlling the configuration of up to two new chiral centers.
This ability to form new carbon–carbon bonds under mild conditions
without the need for elaborate protecting group chemistry, while controlling
the stereoselectivity of aldol condensation, has made the aldolase
family an important group of enzymes for use in biocatalysis. Indeed,
protein engineering and directed evolution studies have been directed
against a number of aldolases to improve their utility in chemical
synthesis.[1]N-Acetyl-d-neuraminic acid lyase (NAL) (EC 4.1.3.3) is a pyruvate-dependent
aldolase that catalyzes the reversible aldol condensation between N-acetyl-d-mannosamine (ManNAc) and pyruvate to
yield N-acetyl-d-neuraminic acid (Neu5Ac),
the most abundant of the sialic acids. The enzyme is a member of the
Class I aldolase family in which the reaction proceeds through a Schiff
base intermediate between a lysine and the substrate pyruvate.[2] NAL has been shown to follow an ordered Bi-Uni
aldol condensation kinetic mechanism[3] (Scheme 1) in which pyruvate binds first as a Schiff base,
followed by ManNAc. Aldol addition then produces Schiff base-bound
Neu5Ac, which is then hydrolyzed to release Neu5Ac. A number of crystallographic
studies[4−9] have determined three-dimensional NAL structures containing bound
substrates and substrate analogues, including pyruvate,[5,9] a pyruvate analogue,[7] and a Neu5Ac analogue.[4] These structures, while informative, provide
only limited information on the enzyme mechanism since they do not
reveal the transitions between these low energy, stable states. Obtaining
atomic-resolution structural information for high energy transition
states is impossible by experiment, but such information is available
through computational modeling. Quantum mechanical/molecular mechanical
(QM/MM) methods, in particular, have been very effective in modeling
enzyme reactions.[10] Here we use a combined
crystallographic and QM/MM approach to follow carbon–carbon
bond formation catalyzed by NAL, with a view to revealing how substrates
are bound and interact with the enzyme and determine the stereochemical
outcome of the reaction.
Scheme 1
General Reaction Mechanism of NAL
The ordered sequential Bi-Uni
reaction leading to the formation of Neu5Ac is illustrated. After
enamine formation between pyruvate and Lys-165, nucleophilic attack
of the pyruvate-Schiff base onto the aldehyde carbon of ManNAc and
protonation of the aldehyde oxygen generates the enzyme Neu5Ac-Schiff
base, which is hydrolyzed to yield Neu5Ac as product. The reaction
is reversible. The numbering scheme for Neu5Ac is shown, and for ease
of following the fate of the substrates these numbers are also used
in the enzyme-Pyr-ManNAc complex.
General Reaction Mechanism of NAL
The ordered sequential Bi-Uni
reaction leading to the formation of Neu5Ac is illustrated. After
enamine formation between pyruvate and Lys-165, nucleophilic attack
of the pyruvate-Schiff base onto the aldehyde carbon of ManNAc and
protonation of the aldehyde oxygen generates the enzyme Neu5Ac-Schiff
base, which is hydrolyzed to yield Neu5Ac as product. The reaction
is reversible. The numbering scheme for Neu5Ac is shown, and for ease
of following the fate of the substrates these numbers are also used
in the enzyme-Pyr-ManNAc complex.The formation
of the carbon–carbon bond between ManNAc and pyruvate depends
on not only the proximity and orientation of the two bonding carbon
atoms but also the transfer of a proton onto the aldehyde oxygen of
ManNAc. The determination of several three-dimensional NAL structures
revealed a lack of acidic or basic residues near the reaction center
and led to a suggestion that the pyruvate carboxylate group could
be involved in a “substrate-assisted” protonation step.[11] In addition, the three-dimensional structure
of Haemophilus influenzae NAL with a substrate analogue,
4-oxo-Neu5Ac, bound as a Schiff base with the catalytic lysine, revealed
an interaction between the inhibitor C-4 hydroxyl group and the hydroxyl
group of a tyrosine residue (Tyr136 in H. influenzae NAL).[4] This led to the hypothesis that
this tyrosine mediates a proton transfer to and from the substrate
carboxylate group.[4,12] The importance of this tyrosine
was supported by the observation that its mutation to phenylalanine
(Y130F) in the Clostridium perfringens NAL (equivalent
to Tyr136 in H. influenzae NAL and Tyr137 in E. coli NAL (E. coli numbers used throughout
the rest of this manuscript)) resulted in a significant decrease in
activity.[13] Here we use crystallography,
QM/MM modeling, and mutagenesis to explore this proposed mechanism
in further mechanistic detail.
Results and Discussion
Crystal Structures
The ideal starting point for QM/MM modeling of the formation or breakage
of the carbon–carbon bond between ManNAc and pyruvate in NAL
would be a crystal structure corresponding to either the ManNAc and
pyruvate-bound structure or the Neu5Ac-bound structure. No such structures
had been reported until this study. The structure of the H.
influenzae NAL containing the substrate analogue 4-oxo-Neu5Ac[4] is a possible starting structure for QM/MM simulation
but does not necessarily represent the true binding mode of Neu5Ac
because the C4 atom is sp2 hybridized in this structure,
rather than the sp3 hybridization of the natural substrate,
Neu5Ac. Soaking crystals of wild-type E. coli NAL
with Neu5Ac always resulted in structures of the enzyme-pyruvate complex,[5] most likely due to rapid turnover of Neu5Ac and
unbinding of ManNAc.We reasoned that inactivation of NAL by
a Y137A mutation might produce an enzyme that could be crystallized
with Neu5Ac to act as a starting point for our modeling studies. Both
Y137A and Y137F NAL variants were constructed by site-directed mutagenesis
and expressed in E. coli as for the wild-type enzyme.[14] Steady-state enzyme kinetics showed that, as
expected, the variants were severely impaired with only ca. 0.2% activity (as judged by kcat/Km) for the Y137F variant and no detectable activity
for the Y137A variant (Table 1). The Y137A
variant was crystallized in the same conditions and space group as
the wild-type enzyme. The crystals were then soaked with 75 mM Neu5Ac,
and the three-dimensional structure was solved (PDB code 4BWL; see Supplementary Table S1 for crystallographic data)
by direct Fourier methods using the wild-type E. coli structure[5] as a starting model. In each
subunit of the Y137A variant, additional electron density contiguous
with the catalytic lysine residue, Lys165, was found, demonstrating
that, despite the fact that no activity had been detected for this
variant, the enzyme was still able to form a Schiff base with incoming
substrates. This indicates that Tyr137 is not critical in forming
the covalent enzyme-pyruvate complex. In subunits A, B, and D of the
tetramer, the electron density observed in the active site was continuous
and of the approximate size and shape to be fitted by Neu5Ac (Figure 1A). In subunit C, however, two distinct areas of
electron density were visible, one similar to the NAL-pyruvate complex
previously seen[5] and one of the approximate
size and shape of ManNAc (Figure 1B). Extensive
model building and testing (see Supporting Information) revealed that soaking with Neu5Ac in the Y137A variant serendipitously
produced a structure of the Enz-ManNAc-pyruvate complex in subunit
C. This result arises presumably because although we could not detect
activity in our standard kinetic assays (Table 1), the Y137A mutant has an extremely low level of activity in the
crystal sufficient for turnover to take place. Perhaps even more surprisingly,
the continuous electron density found in subunits A, B, and D was
consistent only with the 4-epimer of Neu5Ac (4-epi-Neu5Ac) rather
than Neu5Ac (see Supporting Information for details of model fitting and testing). This result is unexpected
because of the stereoselectivity of the wild-type enzyme where the
stereochemical outcome of the NAL condensation reaction depends on
the structure of the substrate.[25] In the
case of the wild-type enzyme acting on ManNAc and pyruvate, the substrate
aldehyde group is attacked from the si face, generating
a new stereocenter with 4S-configuration, whereas
with other substrate pairs, re-face attack and subsequent
4R-configured products are found. This lack of overall
stereochemical control together with 1H NMR studies of
product distribution during the course of the reactions with different
substrates[26] implies that NAL can bind
substrate in different conformations. We suggest that in the crystals
of the Y137A variant, cleavage of Neu5Ac and reformation of the carbon–carbon
bond with the opposite stereochemical configuration at C4 must be
possible, leading to 4-epi-Neu5Ac being trapped. In this respect it
is important to note that the kinetics of various steps in the enzyme
reaction in the cryo-cooled crystalline state may well be different
from the kinetics measured in solution at RT,[27] leading to trapping of different intermediates. However, it is interesting
to note that 4-epi-Neu5Ac can bind to NAL active site as it has been
shown to be a competitive inhibitor of the wild-type Clostridium
perfringens NAL.[28]
Table 1
Steady-State Kinetic Parameters of E. coli Tyr137
NAL Variants
enzyme
kcat (min–1)
Km (mM)
kcat/Km (min–1 mM–1)
wild-type
250 ± 5
2.2 ± 0.1
114
Y137F
0.8 ± 0.1
3.7 ± 1.1
0.2
Y137A
ND
ND
ND
Steady-state kinetic
parameters for the cleavage of N-acetyl neuraminic
acid (Neu5Ac) by wild-type and variant E. coli NAL
determined using an LDH coupled enzyme assay.[14,15] Data were fitted to the Michaelis–Menten equation, and the
fitted values ± standard error of the fit are shown. ND = not
detectable.
Figure 1
Ligand binding in the
active site of the Y137A NAL crystal structure. (A) Continuous 2Fo – Fc electron
density, contoured at 2 RMSD around the Schiff base of Lys165 covalently
linked to 4-epi-Neu5Ac in subunit A of the solved structure (PDB code 4BWL). (B) 2Fo – Fc map (at the
same contour level) for subunit C. The discontinuous electron density
in subunit C reveals that cleavage of Neu5Ac has occurred, producing
the Lys165-pyruvate Schiff base and ManNAc. Hydrogen bonds between
the substrate and Glu192, Thr167, Thr48, and Ser47 are shown. For
clarity other hydrogen bonds between the enzyme and substrate(s) are
omitted.
Steady-state kinetic
parameters for the cleavage of N-acetyl neuraminic
acid (Neu5Ac) by wild-type and variant E. coli NAL
determined using an LDH coupled enzyme assay.[14,15] Data were fitted to the Michaelis–Menten equation, and the
fitted values ± standard error of the fit are shown. ND = not
detectable.Ligand binding in the
active site of the Y137A NAL crystal structure. (A) Continuous 2Fo – Fc electron
density, contoured at 2 RMSD around the Schiff base of Lys165 covalently
linked to 4-epi-Neu5Ac in subunit A of the solved structure (PDB code 4BWL). (B) 2Fo – Fc map (at the
same contour level) for subunit C. The discontinuous electron density
in subunit C reveals that cleavage of Neu5Ac has occurred, producing
the Lys165-pyruvate Schiff base and ManNAc. Hydrogen bonds between
the substrate and Glu192, Thr167, Thr48, and Ser47 are shown. For
clarity other hydrogen bonds between the enzyme and substrate(s) are
omitted.In all four subunits of the crystal
structure, the carboxylate moiety of the substrate forms the same
hydrogen bonding pattern with the backbone of Thr48 and Ser47 and
with the hydroxyl group of Thr48 as found in the wild-type enzyme-pyruvate
complex (PDB code 2WNN) (Figure 1A and B).[5] In subunits A, B, and D, the N-acetyl group of
4-epi-Neu5Ac at C5 lies close (∼3.4 Å) to the side chain
of Phe252 but does not appear to form any hydrogen bonds with the
surrounding residues, with the exception of chain B, where it forms
a hydrogen bond with an ordered water molecule. This is in agreement
with the tolerance of NAL for substitutions at C5.[25] The hydroxyl groups of the C6–C9 moiety of 4-epi-Neu5Ac
form slightly different hydrogen bonds with Ser208, Asp191, Glu192,
and the backbone of Gly189 in the three subunits, consistent with
a degree of flexibility of this portion of the substrate (see Supplementary Figure S2). The C4 hydroxyl group
of 4-epi-Neu5Ac results from the protonation of the aldehyde oxygen
of ManNAc, and it is interesting to note that this group forms a hydrogen
bond with Thr167 in all of the subunits A, B, and D (Figure 1)In subunit C, ManNAc and pyruvate are bound.
The planarity of the electron density of the covalent Lys165-pyruvate
is consistent with the pyruvate enamine complex previously seen in
crystal structures of NAL.[5] The hydrogen
bond network of ManNAc differs slightly from that of the C4–C9
portion of 4-epi-Neu5Ac in chains A, B, and D. The hydroxyl groups
at carbons positions C4, C7, C8, and C9 of ManNAc (Neu5Ac numbering)
form a similar hydrogen bond network as the corresponding hydroxyl
groups of 4-epi-Neu5Ac, but a shift of the hydrogen bond network is
seen at C6 mainly due to the different orientation of ManNAc, which
interacts with Asp191 and Ser208 instead of Ser208 alone.
QM/MM Modeling
After in silico mutation of Ala137 back to Tyr,
the structure found in subunit C, with pyruvate and ManNAc bound,
was used as a starting point for QM/MM modeling of the reaction mechanism.
Initially, we modeled the carbon–carbon bond formation between
ManNAc and pyruvate by performing QM/MM umbrella sampling molecular
dynamics (MD) simulations along d(C–C), the
distance between the carbons involved (the ManNAc aldehyde carbon
atom and the carbon of the methyl group of pyruvate). ManNAc, the
pyruvate-Schiff base (modeled in its enamine form) and the Tyr137
side chain were included in the QM region. Five series of umbrella
sampling simulations (with different starting structures originating
from unbiased QM/MM MD simulations) were performed. In all cases,
the Tyr137 hydroxyl proton transferred spontaneously onto the ManNAc
aldehyde oxygen, producing a Neu5Ac-Lys165 Schiff base in the final
structures. Free energy profiles obtained from the umbrella sampling
(Supplementary Figure S3) indicate that
this intermediate is approximately 4–8 kcal mol–1 lower in energy than the transition state. The spontaneity of the
proton transfer and the apparently low free energy of the intermediate
indicate that Tyr137 is the proton donor in the wild-type NAL reaction,
consistent with previous speculations.[4,12] To test whether
water might also be involved as a proton donor in the reaction, we
repeated the simulations including an additional water molecule, in
proximity to Tyr137 and the ManNAc aldehyde oxygen, in the QM region.
In our simulations, this water molecule often moved away from this
position, and proton transfer from Tyr137 onto the ManNAc aldehyde
oxygen took place as before. It is therefore unlikely that water is
directly involved in this reaction step. Water however is certainly
required for the breakdown of the Schiff base between Lys165 and the
substrate/product and may also serve to reprotonate the Tyr137 phenolate.
In every umbrella sampling series along d(C–C),
the ManNAc aldehyde underwent attack on the si face,
producing the 4S-configured diastereoisomer. This
is consistent with the previously reported high stereoselectivity
of wild-type NAL with pyruvate and ManNAc.[25,28] Analysis of the MD trajectories revealed that the attacking pyruvate
carbon is almost always positioned on the si face,
and this preference becomes more pronounced as the carbon atoms approach
each other. The simulations therefore strongly suggest that the ManNAc
binding mode in wild-type NAL is the main contributory factor to the
high stereoselectivity reported.[25,28]QM/MM potential
energy profiles for the carbon–carbon bond formation between
ManNAc and the pyruvate-Lys165 Schiff base, with Tyr137 as proton
donor. (A) SCC-DFTB/CHARMM22 potential energy surface along two reaction
coordinates: the carbon–carbon distance, dCC, and the proton transfer between Tyr137 and the ManNAc aldehyde, PT. Minima and approximate transition state energy are indicated
by solid black and dashed gray lines, respectively. (B) Potential
energies from two reaction path optimizations (blue and red, from
ManNAc and pyruvate-Lys165 Schiff base on the left to the Neu5Ac-Lys165
Schiff base on the right). X-axis values are calculated
as a combined reaction coordinate from optimized structures: ((PT minus d(C–C)), see text). Structures
were optimized at the B3LYP/6-31+G(d)/CHARMM22 level with energies
calculated at the SCS-MP2/aug-cc-pVDZ/CHARMM22 level. (See Supplementary Figure S5 for B3LYP/MM energies.)To model the mechanism of bond
formation between ManNAc and pyruvate, QM/MM potential energy surfaces
(PESs) of the reaction were obtained along two reaction coordinates: d(C–C) and PT, the proton transfer
between Tyr137 and the ManNAc aldehyde (defined as the difference
between the proton-donor and proton-acceptor distances) (Figure 2). Seven PESs at the SCC-DFTB/CHARMM22 level were
obtained from different enzyme–substrate conformations of the
Neu5Ac-Lys165 Schiff base intermediate as sampled from QM/MM MD (Figure 2A, Supplementary Figure S4). These multiple PESs allowed for examination of conformational
effects,[23,29] to test reproducibility and to provide starting
points for reaction pathway optimization at higher levels of QM/MM
theory (see below). Minimum energy paths along these PESs indicated
three of the PESs had approximate activation energies (differences
between the reactant minimum and the approximate transition state)
of 10.1–12.0 kcal mol–1 and four PESs with
approximate activation energies of 6.2–7.7 kcal mol–1. The four PESs with lower activation energies all showed the same
hydrogen bonding network (see also Figure 3): the Tyr137 hydroxyl oxygen accepts a hydrogen bond from Ser47
OγH and Ser47 Oγ accepts a hydrogen bond from the hydroxyl
oxygen of a tyrosine residue in an adjacent subunit (Tyr110′).
In the other three PESs, the hydrogen bond between Ser47 OγH
and the Tyr137 hydroxyl oxygen was not formed. All seven PESs indicate
that the carbon–carbon bond formation and proton transfer are
asynchronous: proton transfer occurs only when d(C–C)
is 1.7 Å or smaller.
Figure 2
QM/MM potential
energy profiles for the carbon–carbon bond formation between
ManNAc and the pyruvate-Lys165 Schiff base, with Tyr137 as proton
donor. (A) SCC-DFTB/CHARMM22 potential energy surface along two reaction
coordinates: the carbon–carbon distance, dCC, and the proton transfer between Tyr137 and the ManNAc aldehyde, PT. Minima and approximate transition state energy are indicated
by solid black and dashed gray lines, respectively. (B) Potential
energies from two reaction path optimizations (blue and red, from
ManNAc and pyruvate-Lys165 Schiff base on the left to the Neu5Ac-Lys165
Schiff base on the right). X-axis values are calculated
as a combined reaction coordinate from optimized structures: ((PT minus d(C–C)), see text). Structures
were optimized at the B3LYP/6-31+G(d)/CHARMM22 level with energies
calculated at the SCS-MP2/aug-cc-pVDZ/CHARMM22 level. (See Supplementary Figure S5 for B3LYP/MM energies.)
Figure 3
Stereo image showing
the transition state structures (TSs) for the carbon–carbon
bond formation between ManNAc and the pyruvate-Lys165 Schiff base,
optimized at the B3LYP/6-31+G(d)/CHARMM22 level. The TS structure
for the first profile is shown with green carbons (and hydrogen bonds
in orange), and the TS structure for the second profile with light
blue carbons. Side chains for the residues targeted in the mutagenesis
studies are labeled.
To confirm the mechanism and obtain
more accurate reaction energetics and transition state structures,
two reaction path optimizations were performed at the density functional
theory (DFT) (B3LYP/6-31+G(d)/CHARMM22) QM/MM level, using starting
conformations from two PESs with the Tyr137-Ser47-Tyr110′ hydrogen
bond network in place (Figure 2B). After geometry
optimization, frequency calculations of the reactant, transition,
and intermediate states verified that the structures were true minima
(no imaginary frequency) and transition states (one imaginary frequency)
and gave zero-point energy corrections (Supplementary
Table S2). Energies of the optimized conformations were also
calculated at the SCS-MP2/aug-cc-pVDZ/CHARMM22 QM/MM level; this ab initio method has been shown to give results in good
agreement with ‘gold standard’ coupled-cluster theory
and provides a high-level test of the DFT results.[23,29]Zero-point corrected activation energies at the SCS-MP2/aug-cc-pVDZ/CHARMM22
level are 9.5 and 10.0 kcal mol–1 for the two profiles,
respectively. The activation free energy is likely to be higher due
to entropy (as expected for a bond-forming reaction). The transition
states of both pathways are very similar (Figure 3) and occur during the carbon–carbon bond formation
(d(C–C) at 2.13 and 2.04 Å, respectively),
before proton transfer from Tyr137 takes place (d(O–H) of the Tyr137 hydroxyl is 1.00 Å in both cases).
The reaction paths obtained at the higher QM/MM level confirm that
Tyr137 is the proton donor in the condensation reaction and that the
deprotonated Tyr137 is stable in the context of the enzyme active
site with a Neu5Ac-Lys165 Schiff base.Stereo image showing
the transition state structures (TSs) for the carbon–carbon
bond formation between ManNAc and the pyruvate-Lys165 Schiff base,
optimized at the B3LYP/6-31+G(d)/CHARMM22 level. The TS structure
for the first profile is shown with green carbons (and hydrogen bonds
in orange), and the TS structure for the second profile with light
blue carbons. Side chains for the residues targeted in the mutagenesis
studies are labeled.
Enzyme Mechanism and Mutagenesis
Based on the crystal structure
presented here and the QM/MM simulations, a detailed mechanism for
formation of the carbon–carbon bond formation between ManNAc
and pyruvate can be proposed (Scheme 2).
Scheme 2
Mechanism of Carbon–Carbon Bond Formation and Protonation
during NAL Catalysis As Derived from the Crystal Structure of the
Y137A Variant in the Presence of Pyruvate and ManNAc and QM/MM Modelling
The pyruvate-Lys165 enamine is shown in grey as it lies
below the plane of the ManNAc aldehyde. Hydrogen bonds are shown as
dotted lines. The central structure is unstable and indicated to be
a transition state by the modelling (see Figure 3).
Mechanism of Carbon–Carbon Bond Formation and Protonation
during NAL Catalysis As Derived from the Crystal Structure of the
Y137A Variant in the Presence of Pyruvate and ManNAc and QM/MM Modelling
The pyruvate-Lys165 enamine is shown in grey as it lies
below the plane of the ManNAc aldehyde. Hydrogen bonds are shown as
dotted lines. The central structure is unstable and indicated to be
a transition state by the modelling (see Figure 3).The enamine form of pyruvate, in Schiff-base
complex with Lys165, specifically attacks the si face
of the ManNAc aldehyde to form Neu5Ac. The increasing electron density
around the (former) ManNAc aldehyde oxygen (due to the emerging tetrahedral
conformation around the aldehyde carbon) is stabilized by hydrogen
bonds from the Thr167 and Tyr137 hydroxyls. Once the carbon–carbon
bond is (almost) formed, the (former) ManNAc aldehyde oxygen is protonated
by the Tyr137 hydroxyl group. The resulting Tyr137 phenolate is stabilized
by a hydrogen bond donated from the Ser47 hydroxyl, which may in turn
be supported by a hydrogen donated to the Ser Oγ from the Tyr110′
hydroxyl. The QM/MM energies indicate that this Neu5Ac Schiff base
intermediate with a Tyr137 phenolate is quite stable. It is therefore
possible that this phenolate exists until Neu5Ac release allows its
reprotonation by solvent.The triad of Tyr110′, Ser47,
and Tyr137 is similar to the Tyr107′, Thr44, Tyr133 triad in
the related (α/β)8 TIM barrel lyase, dihydrodipicolinate
synthase (DHDPS)[30,31] and to Tyr103′, Thr43,
and Tyr130 in another pyruvate-dependent aldolase, 2-keto-3-deoxygluconate
aldolase.[32] In DHDPS these residues are
thought to act as a proton shuttle to assist Tyr133 in its role as
a proton donor during the DHDPS reaction. The presence of this catalytic
triad in related enzymes provides strong evidence that the identified
catalytic triad in wild-type E. coli NAL also plays
a key role in its catalytic mechanism. In order to test this hypothesis
further the active site residues identified here (Ser47, Tyr110, Tyr137,
and Thr167) along with two other residues close to this area of the
active site (Thr48, which interacts with the pyruvate carboxylate,
and Phe252, which lies close to the N-acetyl of ManNAc)
were targeted for site-directed mutagenesis. The two variants at Tyr137
(Y137A and Y137F) (Table 1) are extremely detrimental
to activity, and it is now clear that this is because Tyr137 plays
a direct, crucial role in the donation of a proton during the carbon–carbon
bond formation between ManNAc and pyruvate. The importance of Thr167
in stabilizing the transition state by hydrogen bonding to the (former)
aldehyde oxygen is confirmed by the mutagenesis results: removal of
the possibility of H-bonding in the T167A mutant results in a 4-fold
decrease in kcat, whereas the replacement
with serine (T167S) left the kinetic parameters unaffected (Table 2).
Table 2
Steady-State Kinetic
Parameters of E. coli NAL Variantsa
enzyme
kcat (min–1)
Km (mM)
kcat/Km (min–1 mM–1)
wild-type
250 ± 5
2.2 ± 0.1
114
S47A
18 ± 0.4
3.4 ± 0.2
5.3
S47C
13 ± 0.3
4.7 ± 0.3
2.8
S47T
340 ± 7.8
2.9 ± 0.2
120
T48A
510 ± 17
3.0 ± 0.3
170
T48S
680 ± 15
3.4 ± 0.2
200
Y110A
12 ± 0.8
4.3 ± 0.7
2.8
Y110F
170 ± 1.9
1.8 ± 0.1
94
T167A
71 ± 0.7
2.0 ± 0.1
36
T167S
200 ± 2.8
1.9 ± 0.1
110
F252A
260 ± 9.9
3.1 ± 0.3
84
F252Y
120 ± 2.3
1.3 ± 0.1
92
Steady-state kinetic parameters for
the cleavage of N-acetyl neuraminic acid (Neu5Ac)
by wild-type and variant E. coli NAL determined using
an LDH coupled enzyme assay.[14,15] Data were fitted to
the Michaelis–Menten equation, and the fitted values ±
standard error of the fit are shown.
Steady-state kinetic parameters for
the cleavage of N-acetyl neuraminic acid (Neu5Ac)
by wild-type and variant E. coli NAL determined using
an LDH coupled enzyme assay.[14,15] Data were fitted to
the Michaelis–Menten equation, and the fitted values ±
standard error of the fit are shown.Whereas mutations of active site residues postulated
not to be involved in the catalytic mechanism (Thr48 and Phe252) had
no or only minor effects on the kinetic behavior of the enzyme, replacement
of Ser47 with alanine or cysteine (thus removing the potential to
form the natural hydrogen bonding network) had a significant effect.
However, in line with its postulated importance in a hydrogen bond
network, the replacement of Ser47 with a threonine resulted in no
significant change in kinetic parameters for the enzyme. In the case
of Tyr110′, the most distal of the triad from the active site,
replacement with alanine results in a significant decrease (40-fold)
in kcat/Km, whereas replacement with the bulkier phenylalanine results in no
significant change in kinetic parameters. It seems that, in this case,
the hydrogen bond is less important than the steric bulk of the residue
in holding Ser47 in a correct position to hydrogen bond to Tyr137
and thereby stabilize its deprotonated, phenolate form once it has
donated its proton to the aldehyde oxygen of ManNAc during catalysis.
Summary
and Conclusions
The current work exemplifies the mutual benefit
of protein crystallography, QM/MM simulation, and mutagenesis in discerning
enzyme mechanisms. The high-resolution structural information of the
enzyme–substrate complex necessary for reliable QM/MM reaction
modeling was obtained for NAL, allowing new insight into the mechanism
of proton transfer in this enzyme in atomic detail. In turn, the modeled
mechanism was then confirmed by kinetic analysis of enzyme mutants.
On the basis of the X-ray crystallographic data, we have modeled the
ManNAc-pyruvate-Schiff base complex with the pyruvate carboxylate
group deprotonated (and it remained as such throughout the simulations).
Our results indicate that a deprotonated Tyr137 in the vicinity of
the Neu5Ac-Lys165 Schiff-base carboxylate is energetically stable,
and there is thus no need to invoke a substrate-assisted mechanism
as postulated for this enzyme[11] to allow
reprotonation of Tyr137. This increased understanding of the catalytic
cycle of NAL should provide the basis for future protein engineering
to develop new aldolase-based biocatalysts.
Methods
Expression and Purification of NAL
Wild-type and variant E. coli NAL were expressed
from the plasmid pKnanA-His6 as previously described.[14]
Site-Directed
Mutagenesis
Site-directed mutagenesis was performed using
a QuikChange Lightning Site-Directed Mutagenesis Kit, (supplied by
Agilent Technologies, Cheshire, U.K.) according to the manufacturer’s
guidelines
Kinetic Analyses
Kinetic parameters
of the aldol cleavage reaction were determined at 30 °C using
a standard coupled assay[14,15] with lactate dehydrogenase
(LDH) and NADH. The reaction (1 mL final volume) contained varying
volumes (2–300 μL) of substrate (Neu5Ac) (100 mM, in
Tris/HCl 50 mM, pH 7.4), 0.5 units LDH, 0.2 mM NADH, and Tris/HCl
(50 mM; pH 7.4). NAL samples, in Tris/HCl (50 mM; pH 7.4), were added
in volumes of between 10 and 200 μL. The decrease in absorbance
at 340 nm was recorded on a Uvikon 930 spectrophotometer as the measure
of enzyme activity. The rate of substrate cleavage was calculated
using a molar extinction coefficient for NADH of 6220 M–1 cm–1. Kinetic parameters were estimated by fitting
the data to the Michaelis–Menten equation.
Protein Crystallization
E. coli NAL was crystallized using previously
established conditions.[5] To form the Neu5Ac-
and ManNAc/Pyr complexes NAL crystals were successively briefly soaked
in the mother liquor containing 15% (w/v) PEG400, then 20% (w/v) PEG400,
and subsequently for 5 min in 25% (w/v) PEG400 containing 75 mM Neu5Ac.
Crystals were then flash cooled in liquid nitrogen prior to data collection.
Data Collection and Refinement
Diffraction data were collected
from single crystals at the Diamond Light Source macromolecular crystallography
beamline I02. The temperature for data collection was 100 K. Integration
and scaling of data were carried out by MOSFLM[16] and SCALA.[17] REFMAC5[18] was used for refinement of the data, and after
each refinement cycle model building was performed in COOT.[19] As the data were highly twinned (twin fraction
0.488, operator −h, −k, h+l) the twin refinement option
of REFMAC5 was used. Coordinates and restraint library files for the
lysine residue covalently bound to a pyruvoyl moiety (HET code KPI)
were as previously described.[6] Those for
the Neu5Ac (HET code SI3) covalently bound to lysine and for ManNAc
(HET code MN9) were generated using the PRODRG server. A number of
models for the subunit C active site were tested (see Supporting Information for details) in order
to identify that which best fit the observed electron density. The
final models were validated using the PDB validation server.
QM/MM
Modeling
A starting structure for QM/MM modeling was obtained
by building the Tyr137 side chain onto Ala137 in subunit C of the
Y137A crystal structure (PDB code 4BWL), based on the conformation of Tyr137
in the previously solved wild-type enzyme with a pyruvate-Lys165 Schiff
base (PDB code 2WNN).[5] ManNAc, pyruvate, Lys165 (from Cγ),
and Tyr137 (from Cβ) were treated quantum mechanically with
self-consistent charge density functional tight-binding (SCC-DFTB),
all other atoms with CHARMM22,[20] and all
SCC-DFTB/CHARMM22 simulations were performed with CHARMM.[21] A 25 Å solvent sphere was added around
the pyruvate-Lys165 Schiff base. Further details of the model and
methods are described in the Supporting Information. In short, two 300 ps QM/MM MD trajectories (with different starting
velocities) were run at 300 K. After at least 150 ps of simulation,
5 conformations were selected to start a series of short QM/MM MD
simulations with a varying restraint on the carbon–carbon bond-forming
distance d(C–C) (‘umbrella sampling’). Subsequently,
a series of energy minimizations (‘adiabatic mapping’)
was applied along two reaction coordinates (see Results
and Discussion), starting from the obtained intermediate. A
set of structures along the thus identified SCC-DFTB/CHARMM22 minimum
energy paths was selected to perform optimization of two reaction
paths using B3LYP/6-31+G(d) for the QM region and applying the nudged-elastic
band (NEB) and climbing-image NEB (CI-NEB) methods[22,23] in QoMMMa.[24] Transition states and minima
were confirmed using QM/MM frequency calculations. Single-point spin-component
scaled (SCS)-MP2/aug-cc-pVDZ/CHARMM22 calculations were performed
on the optimized conformations.
Authors: Ivan Campeotto; Stephen B Carr; Chi H Trinh; Adam S Nelson; Alan Berry; Simon E V Phillips; Arwen R Pearson Journal: Acta Crystallogr Sect F Struct Biol Cryst Commun Date: 2009-10-24
Authors: Alex Theodossis; Helen Walden; Elaine J Westwick; Helen Connaris; Henry J Lamble; David W Hough; Michael J Danson; Garry L Taylor Journal: J Biol Chem Date: 2004-07-20 Impact factor: 5.157
Authors: Jake A LeVieux; Brenda Medellin; William H Johnson; Kaci Erwin; Wenzong Li; Ingrid A Johnson; Yan Jessie Zhang; Christian P Whitman Journal: Biochemistry Date: 2018-06-11 Impact factor: 3.162
Authors: Xinglong Zhang; Simon J Bennie; Marc W van der Kamp; David R Glowacki; Frederick R Manby; Adrian J Mulholland Journal: R Soc Open Sci Date: 2018-02-14 Impact factor: 2.963
Authors: Claire L Windle; Katie J Simmons; James R Ault; Chi H Trinh; Adam Nelson; Arwen R Pearson; Alan Berry Journal: Proc Natl Acad Sci U S A Date: 2017-02-14 Impact factor: 11.205
Authors: María Inmaculada García-García; Fernando Gil-Ortiz; Francisco García-Carmona; Alvaro Sánchez-Ferrer Journal: PLoS One Date: 2014-05-09 Impact factor: 3.240
Authors: Jennifer Stockwell; Adam D Daniels; Claire L Windle; Thomas A Harman; Thomas Woodhall; Tomas Lebl; Chi H Trinh; Keith Mulholland; Arwen R Pearson; Alan Berry; Adam Nelson Journal: Org Biomol Chem Date: 2015-11-05 Impact factor: 3.876
Authors: Ivan Campeotto; Andrey Lebedev; Antoine M M Schreurs; Loes M J Kroon-Batenburg; Edward Lowe; Simon E V Phillips; Garib N Murshudov; Arwen R Pearson Journal: Sci Rep Date: 2018-10-05 Impact factor: 4.379