Natural sesquiterpene synthases have evolved to make complex terpenoids by quenching reactive carbocations either by proton transfer or by hydroxylation (water capture), depending on their active site. Germacradien-11-ol synthase (Gd11olS) from Streptomyces coelicolor catalyzes the cyclization of farnesyl diphosphate (FDP) into the hydroxylated sesquiterpene germacradien-11-ol. Here, we combine experiment and simulation to guide the redesign of its active site pocket to avoid hydroxylation of the product. Molecular dynamics simulations indicate two regions between which water molecules can flow that are responsible for hydroxylation. Point mutations of selected residues result in variants that predominantly form a complex nonhydroxylated product, which we identify as isolepidozene. Our results indicate how these mutations subtly change the molecular choreography in the Gd11olS active site and thereby pave the way for the engineering of terpene synthases to make complex terpenoid products.
Natural sesquiterpene synthases have evolved to make complex terpenoids by quenching reactive carbocations either by proton transfer or by hydroxylation (water capture), depending on their active site. Germacradien-11-ol synthase (Gd11olS) from Streptomyces coelicolor catalyzes the cyclization of farnesyl diphosphate (FDP) into the hydroxylated sesquiterpenegermacradien-11-ol. Here, we combine experiment and simulation to guide the redesign of its active site pocket to avoid hydroxylation of the product. Molecular dynamics simulations indicate two regions between which water molecules can flow that are responsible for hydroxylation. Point mutations of selected residues result in variants that predominantly form a complex nonhydroxylated product, which we identify as isolepidozene. Our results indicate how these mutations subtly change the molecular choreography in the Gd11olS active site and thereby pave the way for the engineering of terpene synthases to make complex terpenoid products.
Terpenoids are the largest group of natural products universally present in all living
systems with immense diversity in their structure and functions.[1,2] To date, over 80000 naturally occurring
isoprenoids have been reported, including more than 7000 sesquiterpenoids derived from
∼300 stereochemically distinct hydrocarbon skeletons.[3,4] A highly complex and structurally diverse array
of hydrocarbon or hydroxylated terpenes is synthesized by terpene synthase catalysis from
acyclicisoprenyl diphosphates such as geranyl diphosphate (GDP), farnesyl diphosphate
(FDP), and geranylgeranyl diphosphate (GGDP).[4,5] The diversity of terpene skeletons arises not only from the
number of terpene synthases but also from the ability of some terpene synthases to form
multiple products from a single substrate.[6−8] The product
specificity and activity of terpene synthases are highly dependent on a small number of
amino acids present inside or near the hydrophobic active site pocket. Previous studies have
shown the profound effect that slight changes to these amino acids can have on the
electronic distribution and the geometry of the active site pocket, which in turn affect the
enzyme activity and specificity.[9−16]In class I terpene synthases, a cluster of three Mg2+ ions (referred to as
Mg2+A–C) is responsible for binding and triggering loss of
the substrate diphosphate group to give a carbocation–inorganic diphosphate ion
(PPi) pair. Two highly conserved metal-binding motifs on helices D (DDXXD/E
motif) and H (NSE/DTE motif) face each other across the active site entrance. The motif on
helix H binds a Mg2+ ion (Mg2+B) which allows substrate
binding, whereupon Mg2+ binding to the motif on helix D triggers transition from
an open to a closed active site conformation (including loop movement and changes in the
G-helix).[3,17,18] Once the active site is closed with all three Mg2+ ions
bound, the carbocation–PPi pair is formed with the highly reactive
carbocation intermediates sheltered from premature quenching by bulk
solvent.[3,17,18] The carbocations formed can undergo a series of intramolecular reactions
involving cyclization, double-bond isomerization, hydride, methyl, and alkyl migrations, and
finally proton loss to form hydrocarbons or hydroxylation to form terpene
alcohols.[3,19,20]Several class I terpene synthases have evolved to produce hydroxylated terpenes by
mediating the capture of water in the reaction cascade.[21−25] Although many terpene synthases likely harbor water
molecules in the active site pocket (as observed in the crystal structures of several
terpene synthases),[26−30] the majority
avoid water capture and do not produce hydroxylated compounds. Despite extensive
experimental and theoretical investigations of the cyclization mechanism and structural
elucidation of terpene synthases, there are still many questions regarding what determines
water capture and what mechanism is involved. Some reports suggest that loop movements
enable ingress of bulk water into the active site pocket to mediate water capture in
germacradien-4-ol synthase[31] and mutants of δ-cadinene
synthase.[32] Structural analysis of mutant Aspergillus
terreus aristolochene synthase (ATAS) inhibitor complexes indicated that a
Mg2+C-bound water molecule is most likely involved in the formation
of (linear) non-native hydroxylated products.[28] Isotope labeling studies
on, for example, the epi-cedrol[33] and
epi-cubenolsesquiterpene synthases[34] demonstrate that
the oxygen atom present in these compounds is derived from bulk water. Mutation analysis of
hedycaryol synthase showed the catalytic involvement of Asp82 of the DDXXD motif in the
activation of water for the hydroxylation,[35] whereas in bacterial cineole
synthase, a specific active site Asn has been implicated in water activation.[36] In the hydroxylating diterpene synthase CotB2, extensive experimental and
computational work has highlighted key active site interactions and suggestions of how these
are altered by mutations that lead to different main products.[16,37] However, a complete picture of how
(sesqui)terpene synthases mediate water capture remains elusive.Germacradien-11-ol synthase (Gd11olS), a class I sesquiterpene synthase, is the N-terminal
domain of geosmin synthase from S. coelicolor which catalyzes the
cyclization of FDP (1) into the hydroxylated sesquiterpenegermacradien-11-ol
(Gd11ol, 2) as the major product and germacrene D (3) as the minor
product (Scheme ).[38−40] Formation of germacrene D involves a 1,3-hydride shift in the germacryl
cation and final deprotonation at C15. However, formation of germacradien-11-ol putatively
involves deprotonation of C1 of the germacryl cation and formation of the strained bicyclic
intermediate isolepidozene (4), which then undergoes ring opening upon water
attack to form Gd11ol (Scheme ).[38,41]
Scheme 1
Reactions Catalyzed by Gd11olSWT and Mutants
In the present study, a combination of a molecular dynamics (MD) simulation of the
enzyme–substrate complex (Figure ) and
kinetic characterization of enzyme variants generated by site-directed mutagenesis is used
to obtain insight into and modulate the water capture behavior of Gd11olS. A total of 15
mutants were created in regions that the simulations suggest are involved in guiding a
reactive water molecule to quench the neutral intermediate isolepidozene (4)
(Scheme ). Mutation of these residues resulted
in changes in the product profile and water capture behavior of Gd11olS. A Gd11olS mutant
(G188A) was identified which (almost fully) avoids water capture and predominantly produces
isolepidozene. The findings presented here will aid in the engineering of terpene synthases
to channel the production of novel terpenoids by manipulating their water capture
ability.
Figure 1
Representative MD snapshot of the FDP·Gd11olSWT complex in binding mode
I. Protein residues shown with sticks and transparent spheres are important for FDP
binding. Hydrogens are omitted for clarity. In binding mode II, the farnesyl chain of
FDP turns in the opposite direction (see Figure S2).
Representative MD snapshot of the FDP·Gd11olSWT complex in binding mode
I. Protein residues shown with sticks and transparent spheres are important for FDP
binding. Hydrogens are omitted for clarity. In binding mode II, the farnesyl chain of
FDP turns in the opposite direction (see Figure S2).
Results and Discussion
Characterization of Wild-Type Gd11olS and Identification of the FDP Binding
Mode
First, we set out to confirm the product profile and kinetics for wild-type Gd11olS
(hereafter Gd11olSWT). The gene for the N-terminal domain of geosmin synthase
(coding for residues 1–366 only of the whole protein) was codon optimized for
E. coli and synthesized in a pET28a vector (see the Supporting Information). After overexpression in E. coli,
the recombinant protein was purified to homogeneity (see the Supporting Information for more details). Incubation of Gd11olSWT
with FDP resulted in the formation of germacradien-11-ol as the major product
(2, 92.4%) and germacrene D as the minor product (3, 7.6%)
(Figure and Table S2), and the kinetic parameters for turnover of FDP
(KM = 0.5 ± 0.08 μM,
kcat = (5.0 ± 0.02) × 10–4
s–1) were in good agreement with previous work.[40]
Figure 2
Product distribution of Gd11olSWT and its mutants (obtained using GC-MS).
The product yield here refers to the relative abundance expressed as a percentage of
the total amount of products formed from FDP. Product yield % values are given in
Table S2. Total ion chromatograms for all variants are given in
Figures S9–S24, with mass spectra for all six compounds provided
in Figures S28–S33.
Product distribution of Gd11olSWT and its mutants (obtained using GC-MS).
The product yield here refers to the relative abundance expressed as a percentage of
the total amount of products formed from FDP. Product yield % values are given in
Table S2. Total ion chromatograms for all variants are given in
Figures S9–S24, with mass spectra for all six compounds provided
in Figures S28–S33.Having established the product profile of Gd11olSWT, we turned to a MD
simulation to obtain detailed structural information to guide in the engineering of its
active site to change the product profile. For Gd11olSWT, a crystal structure
in a complex with three Mg2+ ions and alendronic acid (PDB code 5DZ2) mimics the active
“closed” form of the enzyme (with alendronic acid indicating the location of
the diphosphate);[42] however, the positioning of the farnesyl moiety of
the FDP substrate is unknown. We thus modeled the farnesyl chain in four distinct possible
orientations that could lead to cyclization. The first was based on the orientation of the
substrate analogue in a complex with aristolochene synthase (PDB code 4KUX),[43] with the others
obtained by changing the rotation of single bonds accompanied by geometry optimization
(Figure S2). For each of these FDP·Gd11olSWT complexes, 10
independent 30 ns MD simulations were performed in explicit solvent (see Supporting Information for details).Only two of the four orientations of the farnesyl chain were stable in the MD
simulations: binding modes I and II (Figure and
Figure S2). The other two starting orientations changed into either binding
mode I or II during the MD simulation; thus, they were discarded from further analysis.
Although all simulations of binding modes I and II indicate the same overall orientation,
the farnesyl chain is flexible and thus shows a wide ensemble of conformations during
simulation (Figure S3), as shown, for example, by the wide distribution sampled of the
interatomic distance between the atoms C1 and C10 (Figure a,b; see Scheme for atom
numbering). The C1–C10 distance reaches values of ≤5 Å in both binding
modes, suggesting that the farnesyl chain could, in principle, cyclize into a germacryl
cation (upon ionization of FDP) regardless of the binding mode. However, while binding
mode I shows a marked preference for the formation of conformations of FDP that can lead
to the expected R enantiomer of the germacryl cation, mode II almost
exclusively samples conformations in line with the formation of the S
enantiomer (Figures c,d). Thus, mode I is
identified to be the most likely binding mode of FDP. This binding mode is equivalent to
that observed in the crystal structure of other sesquiterpene synthases in complexes with
FDP analogues (e.g., aristolochene synthase with FSDP and selinadiene synthase with
(3S,6E)-3,7,11-trimethyldodeca-6,10-dien-1-yl
trihydrogen diphosphate; see Figure S4).[30,43]
Figure 3
Histograms of the C1–C10 distance (a, b) and percentage of (total,
pro-R, and pro-S) FDP conformations with
C1–C10 ≤ 5 Å (c, d) for the FDP·Gd11olS (WT and mutant
variants) complexes in both binding modes I and II, calculated over the 10 MD runs for
each (200 ns). Histograms are in 0.1 Å bins. A C1–C10 distance of
≤5 Å is used here as a qualitative measure of the ability of FDP to
undergo cyclization and form a germacryl cation. To define the prochirality of the C10
atom, apart from the C1–C10 distance, a proper re or
si orientation of the C1 atom with respect to the C10=C11
double bond was considered (to determine an FDP conformation as pro-R
or pro-S; Figure S5). The trends remain the same when a smaller cutoff value of
the C1–C10 distance is used.
Histograms of the C1–C10 distance (a, b) and percentage of (total,
pro-R, and pro-S) FDP conformations with
C1–C10 ≤ 5 Å (c, d) for the FDP·Gd11olS (WT and mutant
variants) complexes in both binding modes I and II, calculated over the 10 MD runs for
each (200 ns). Histograms are in 0.1 Å bins. A C1–C10 distance of
≤5 Å is used here as a qualitative measure of the ability of FDP to
undergo cyclization and form a germacryl cation. To define the prochirality of the C10
atom, apart from the C1–C10 distance, a proper re or
si orientation of the C1 atom with respect to the C10=C11
double bond was considered (to determine an FDP conformation as pro-R
or pro-S; Figure S5). The trends remain the same when a smaller cutoff value of
the C1–C10 distance is used.To further confirm the FDP binding mode in Gd11olS obtained from simulation, we turned to
mutagenesis. The bulky side chains of residues Y79, F83, W312, and H320 line the farnesyl
moiety in binding mode I (Figure and Figure S3), with W312 and H320 likely enforcing the final part of the
farnesyl chain to bend back toward the C1 atom, facilitating C1–C10 cyclization. We
thus generated and assayed the W312A and H320A mutants of Gd11olS experimentally, to
corroborate this prediction. As expected, both mutations lead to significant amounts of
the “early exit” noncyclic product nerolidol (6) and reduce
production of 2 (Figure and
Table S2), which further supports binding mode I to be the catalytically
relevant substrate complex (Michaelis complex, MC) in Gd11olSWT. Note that
H320A still forms approximately the same amount of germacrene D, indicating that His320 is
unlikely to be the base required for its formation (leaving PPi and His226 as the possible
bases involved). The formation of nerolidol was confirmed by coeluting the assay sample of
W312A with an authentic standard (Figure S25). As a complement, using the same computational approach used
with Gd11olSWT, we modeled and simulated the W312A and H320A mutants with both
FDP binding modes I and II. For binding mode I, there is a significant decrease (by
>70%) in the number of FDP conformations with C1–C10 ≤ 5 Å
(hereafter referred to as PCCs, productive conformations for cyclization) for both mutants
in comparison to Gd11olSWT, whereas the number of extended conformations
(C1–C10 > 7 Å) increases from 14% to 42% for W312A and to 25% for H320A
(Figure a,c). This agrees well with the
reduction in the total yield of cyclic products and the formation of nerolidol observed
experimentally. The higher proportion of more extended conformations of FDP in W312A
Gd11olS also correlates well with the higher yield of nerolidol (41.6% vs 22.5%) that it
produces (Figure ). Meanwhile, due to the larger
space available in the active site of the mutants for the distal end of the farnesyl
chain, there is no clear preference for pro-R or pro-SPCCs (Figure c). This indicates that the active
site cavity contour of Gd11olSWT guides the stereoselectivity of germacryl
cation formation in mode I, while for the W312A and H320A mutants the formation of the
observed (R)-germacryl cation products are enforced by the reaction
energetics.[44] In binding mode II, pro-SPCCs are
highly favored over pro-R PCCs for both mutants (as for
Gd11olSWT). Further, a decrease in PCCs in comparison to Gd11olSWT
is only observed for H320A (33%), which also shows a higher increase in extended FDP
configurations in comparison to W312A (Figure b,d). This is in contrast with the higher reduction in cyclization and higher
yield of nerolidol observed with the latter (Figure ). Thus, the simulations with the mutants also support binding mode I to be the
MC. The mutant W312A has a similar KM (0.4 ± 0.07
μM) and the H320A variant has an ∼3-fold higher
KM (1.12 ± 0.10 μM) in comparison to
Gd11olSWT, whereas an ∼10-fold reduction is observed in
kcat for both mutants (Table ), indicating that such alterations of the active site cavity lead
to less efficient turnover. Replacement of W312 with phenylalanine leads to the formation
of fully cyclized products germacrene A (5, 71.5%) and 2
(28.5%). The formation of 5 was confirmed by coinjection with an authentic
standard generated from FDP upon incubation with aristolochene synthase from
Penicillium roquefortii (which also makes 5; Figure S26). The kinetic efficiency of this mutant was found to be similar
to that of Gd11olSWT (Table ). These
results show that Phe312 can still provide sufficient stabilization for C1–C10
closure but that Trp312 is required to selectively produce 2 (e.g., by
stabilizing conversion of the germacryl cation to 4). The stabilization of
carbocation intermediates by cation−π interactions with aromatic residues is
frequently indicated in terpene synthase catalysis.[3,30,45,46]
Table 1
Kinetic Parameters for Gd11olSWT and Mutants
KM (μM)
kcat (10–5 s–1)
kcat/KM
(10–5 μM–1 s–1)
Gd11olSWT
0.5 ± 0.08
52.4 ± 0.02
116.4
R228V
0.8 ± 0.10
18.14 ± 0.04
22.7
Q313A
1.10 ± 0.14
6.9 ± 0.02
6.3
Q313N
0.5 ± 0.07
5.02 ± 0.01
10.04
H320A
1.12 ± 0.10
3.8 ± 0.01
3.4
H320F
1.8 ± 0.5
2.3 ± 0.02
1.3
V187D
N/Aa
V187N
NDb
V187A
0.3 ± 0.06
19.4 ± 0.01
64.7
G188V
N/Aa
G188A
3.8 ± 1.5
3.7 ± 0.5
1.0
G188S
5.7 ± 0.9
4.08 ± 0.2
0.7
A190V
4.6 ± 2.2
1.3 ± 0.3
0.3
A190L
N/Aa
W312F
0.9 ± 0.09
25.5 ± 0.06
28.3
W312A
0.4 ± 0.07
4.02 ± 0.01
10.05
Not applicable (inactive).
Not determined due to low activity.
Not applicable (inactive).Not determined due to low activity.
Dynamics of Water in the Gd11olS Active Site Pocket
With the (catalytically relevant) FDP-Gd11olS complex confirmed to be binding mode I, we
focused on analyzing the behavior of water surrounding FDP, with the aim of identifying
water molecules that could react with the FDP:C11 atom at the final stages of the reaction
to form 2 (Scheme ). There are two
major regions of the protein close to FDP where water molecules are often present during
the MD simulations: (i) the region composed of the residues R228, Q313, and H320, adjacent
to FDP:C11 (hereafter referred to as the RQH site) and (ii) the region composed of the
residues E161, R184, H226, and D230, next to the G-helix (hereafter referred to as the
G-helix site) (Figure a). Water molecules are
observed to flow between the G-helix site and the RQH site (Figure b), suggesting that even though FDP:C11 is closer to the latter,
both sites can play a role in the availability of a reactive water molecule and changes in
either could affect the hydroxylating activity of the enzyme. Notably, exchange of waters
between the G-helix site and bulk water is observed regularly in our simulations, whereas
direct exchange between the RQH site and bulk water is not. The highly conserved kink of
the G-helix is an important catalytic feature of terpene synthases that has been
previously shown to have a profound effect on product specificity.[7,11,12,47] For example, TPS4 and TPS5 (sesquiterpene synthases from Zea
mays that share 98% sequence identity) produce the same mixture of compounds
but with different proportions of certain stereoisomers, an amino acid substitution (G by
A) in the kink of the G-helix being the major factor determining the
stereoselectivity.[11] Another example is the conversion of
δ-cadinene synthase (DCS) into a germacradien-4-ol synthase (Gd4olS) by mutation of
the hinge points of the kink (DCS-N403P/L405H).[47] Moreover, in
bacterial class I terpene synthases, two residues (G188 and V187 in Gd11olS) of the
so-called effector triad, which is involved in substrate binding, substrate ionization,
and active site closure, are located at the kink.[30] Thus, the RQH site
and G-helix kink (187VGGA190) were selected as mutation targets to
modulate the water capture behavior of Gd11olS.
Figure 4
(a) Representative MD snapshot of the FDP·Gd11olSWT complex in
binding mode I showing the two regions of the active site where water molecules are
observed during the MD simulations: the G-helix and R228Q313H320 sites. Non-water
hydrogen atoms are omitted for clarity. (b) Superimposed structures of the
FDP·Gd11olSWT complex in binding mode I (at intervals of 4 ps for
water molecules and 200 ps for the rest of the system) during one of the 10 MD runs.
Hydrogens are omitted for clarity; the distribution of red spheres thus indicates the
location of water along the trajectory. Water can flow between the G-helix and RQH
sites.
(a) Representative MD snapshot of the FDP·Gd11olSWT complex in
binding mode I showing the two regions of the active site where water molecules are
observed during the MD simulations: the G-helix and R228Q313H320 sites. Non-waterhydrogen atoms are omitted for clarity. (b) Superimposed structures of the
FDP·Gd11olSWT complex in binding mode I (at intervals of 4 ps for
water molecules and 200 ps for the rest of the system) during one of the 10 MD runs.
Hydrogens are omitted for clarity; the distribution of red spheres thus indicates the
location of water along the trajectory. Water can flow between the G-helix and RQH
sites.
Mutation of the RQH Site
Each residue of the RQH site was mutated (R228 to V, Q313 to N and A, and H320 to F),
aiming to reduce the polarity and thus the likely availability of water in this site. H320
was mutated to F to keep a similar side-chain size, considering that the H320A mutant
displayed a shift to linear product formation (see above and Figure ). R228 was changed to V and Q313 to N because, on the basis of the
substrate–analogue complex structure of the (nonhydroxylating) ATAS (PDB code
4KUX),[43]
residues V212, N299, and S303 of ATAS are positioned equivalently to R228, Q313, and H320
of Gd11olSWT (Figure a),
respectively.[42,43]
Note that, despite the absence of hydroxylating activity in ATAS, a water molecule is
found at its active site hydrogen-bonded to N299 and S303.[31,43] To avoid any possible
water-activating role of the residue at 313, Q313 was also changed to the nonpolar A;
previously, a water-activating role for an active-site Asn residue was indicated (N305 in
the hydroxylating bacterial 1,8-cineole synthase, bCinS).[36]Overall, modifications of the RQH site residues to nonpolar residues led to an increase
in the total yield of nonhydroxylated compounds (i.e., isolepidozene and germacrene D, and
also germacrene A for H320F; Figure and
Table S2) with a concomitant decrease in the total yield for hydroxylated
compounds (i.e., Gd11ol). However, a significant shift away from hydroxylation occurred
only with the H320F mutant, which produced isolepidozene as the major nonhydroxylated
compound, with a 43.2% yield. On consideration that the modification of H320 by F is
almost isosteric, it is reasonable to expect that FDP still binds in binding mode I (Figure ). Since the H320F variant still produces
2 (45.3%), H320 is not a required general base for hydroxylation at C11. A
likely explanation for the large shift away from hydroxylation with F320 is that FDP:C11
is now less accessible to water, possibly due to the disruption of the G-helix–RQH
water flow observed in Gd11olSWT (for which H320 is in a key position; Figure ). Since R228 is exposed to bulk solvent, the
small perturbation of hydroxylation of the R228V mutation indicates that a flux of water
directly from the solvent to the RQH site (only observed once in one of the
Gd11olSWT MD simulations) is not a major factor determining water
availability for the reaction. While the Q313A mutation also has a slight effect on
hydroxylation, the product profile remains the same with Q313N. The kinetic constants for
these two mutants are very similar to wild type (Table ). This suggests that Q313 mediates in the availability of a reactive water
molecule but does not participate in the reaction as a (transient) proton acceptor. The
results with the mutants thus show that the three residues of the RQH site work in synergy
to favor hydroxylation of the isolepidozene intermediate in Gd11olSWT
catalysis, with H320 being the most important.
Mutation of the G-Helix Kink Region
Residues V187 and G188A of the G-helix kink (187VGGA190) of Gd11olS
correspond to the so-called linker and effector residues of the conserved effector triad
of class I terpene synthases. The former plays a key role in rearrangement of the G-helix
in active-site closure, and the latter may assist in substrate ionization.[30] In bacterial terpene synthases, the most common residues present at the
linker position are V, D, T, and N, and those at the effector position are G, A, V, and
S.[30] This was used as a reference to generate several Gd11olS mutants
of the G-helix kink region (G188A, G188S, G188V, V187D, and V187N) to be tested. Three
additional mutants of the G-helix kink (V187A, A190V, and A190L) were included for further
comparison. The corresponding product profiles for all of these mutants are depicted in
Figure and Table S2. Replacement of V187 by the smaller hydrophobic residue alanine had
no effect on the product profile apart from producing small quantities of 4
(4.1%). However, V187N and (mainly) G188A led to a pronounced shift of the product profile
in favor of the formation of 4 (77.9% and 88.4%, respectively), with reduced
production of 2. Meanwhile, replacing G188 by the small polar serine reduced
the formation of 2 to 59.4% and yielded 37.4% of 4. The mutant
A190V displayed the widest product diversity and generated four products in total:
2 (25.3%), 4 (62.5%), 5 (5.7%), and
7 (6.5%). The remaining mutants V187D, G188V, and A190L led to a complete
loss of activity, which may be attributed to the inability of the protein to form the
corresponding reactive closed enzyme–substrate complex (MC) due to a significant
perturbation of G-helix conformational behavior. Interestingly, mutation of the equivalent
residue of G188 in selinadiene synthase (G182) with valine also led to a complete loss of
activity.[30] The importance of the G-helix for the formation of the
relevant Michaelis complex is further indicated by the >8-fold increase in the
KM value obtained for the mutants G188A, G188S, and A190V in
comparison to Gd11olSWT (Table ).These results show that the G-helix kink region has a major influence on hydroxylation
(as well as MC formation). To shed more light on this, we modeled the MC of the mutants
G188S and G188A using MD simulations. Again, we performed simulations for both binding
modes I and II. A preference for pro-R PCCs was again observed for both
mutants in mode I, with no pro-R PCCs sampled in G188A and no
pro-R/S preference for G188S in mode II (Figure ). It is thus likely that mode I is again the
relevant MC for these variants, and we focused our analysis on this binding mode.As with Gd11olSWT, water molecules were observed in the RQH and G-helix sites
for both mutants, in similar quantities. At least one water is present in the RQH site for
the majority of the time in Gd11olSWT and the G188A and G188S variants
(>85%, Figure S6). Further, an evaluation of the interatomic distance between
FDP:C11 and the closest RQH site water molecule when pro-R PCCs are
formed does not reveal a significant difference either (Figure S7), with water molecules able to come close to FDP:C11 similarly.
Thus, simulations of the MC complexes do not indicate water availability and its proximity
to FDP:C11 to be a factor in the reduction of product hydroxylation in the G188S and G188A
mutants.The MD simulations do indicate that both the G188A and G188S mutations affect the
conformation of the G-helix. As expected due to their preferred backbone orientations, the
kink is more helical in the mutant enzymes than in Gd11olSWT; hydrogen bonds
between the amide of residue 188 and the carbonyl oxygens of R185 and R184 are formed more
frequently (Figure and Figure S8). In the G188S mutant, the hydroxyl group of S188 can additionally
form hydrogen bonds with surrounding polar residues (e.g., H226 and S222) and the backbone
of A190 and R185, which also changes the helix kink conformation with respect to
Gd11olSWT. For example, the hydrogen bond between the residues M183 and V187
is less stable in the G188S mutant (Figure and
Figure S8). The changes in hydrogen bonds subtly affect the hydration
pattern around the G-helix site. Further, the shape of the active site varies among these
enzyme variants, which will have an influence on the reaction energetics and therefore on
product specificity, by subtly changing substrate–enzyme–water interactions
along the reaction.
Figure 5
Dominant configurations of the G-helix (green, mutation target) in the MD simulations
of the MCs of Gd11olSWT and the mutants G188A and G188S. Relevant hydrogen
bond interactions of residue 188 are shown as dashed lines.
Dominant configurations of the G-helix (green, mutation target) in the MD simulations
of the MCs of Gd11olSWT and the mutants G188A and G188S. Relevant hydrogen
bond interactions of residue 188 are shown as dashed lines.
Confirmation of Isolepidozene as the Major Product in Gd11olS G188A
To determine the structure of the major sesquiterpene produced by the G188A mutant,
preparative-scale incubation with FDP (see the Supporting Information for more details) was performed, resulting in 12 mg
of product. After chromatographic purification using deactivated silica (neutralized with
1% triethylamine) with n-pentane as eluent, a total of 7.5 mg of a
colorless oil was isolated with >99% purity, as judged by GCMS (Figure S27). The purified compound was analyzed by NMR spectroscopy and
found to have a fused dimethylcyclopropane–germacrene type skeleton (spectroscopic
assignment of all the 1H and 13C NMR spectra is given in Figures S46–S52 in the Supporting Information). The 13C
and 13C DEPT 135 spectra revealed the presence of a distinctive peak for a
fused cyclopropane ring with signals at δC 24.3, 36.2, and 41.1 ppm
corresponding to C11, C1, and C10 respectively. Hence, the major product could be either
isolepidozene or its cis-fused isomer bicyclogermacrene (Figure ).[48,49] To determine the relative stereochemistry of the C1 and C10 atoms and
to confirm the product, the coupling constant of the H1 and H10 protons
(3JHH) was measured by 1H homonuclear
decoupling NMR spectroscopy and predicted computationally (see the Supporting Information for more details). The experimentally determined
coupling constant was 5.20 Hz (Figure S53). Spin–spin coupling constant
calculations[50,51]
were performed at the ωB97X-D[52]/6-311G(d,p)[53]
level on QM optimized structures (using Gaussian 09;[54] see the
Supporting Information for details), which predicted the coupling constant
between the H1 and H10 protons to be 4.84 Hz for isolepidozene and 8.82 Hz for
bicyclogermacrene. This confirms that the major product formed by Gd11olS G188A and also
produced by other variants, is indeed isolepidozene (4). This supports the
proposal that 4 is the neutral intermediate that undergoes ring opening and
water attack to form germacradien-11-ol.[55]
Figure 6
QM (DFT) optimized geometries of (a) isolepidozene and (b) bicyclogermacrene, with H1
and H10 protons highlighted.
QM (DFT) optimized geometries of (a) isolepidozene and (b) bicyclogermacrene, with H1
and H10 protons highlighted.
Conclusions
We have used a combination of computational simulation and site-directed mutagenesis to
gain insight into the water activation and hydroxylation catalyzed by the sesquiterpeneGd11ol synthase. The FDP binding mode in the active site was confirmed by a combination of
MD simulation and mutagenesis to be similar to that observed in other (nonhydroxylating)
sesquiterpene synthases. On the basis of this binding mode, we identified two active site
regions, namely the G-helix and RQH sites, that are directly or indirectly involved in
managing water access to FDP:C11 and its hydroxylation at the right stage of the reaction.
Water can flow freely between these sites in Gd11olSWT and investigated mutants,
with the RQH site able to hold a water molecule close to FDP:C11 for nucleophilic attack on
isolepidozene. Mutagenesis indicates, however, that none of the residues in this site are
essential for hydroxylation: e.g., as a general base for water activation. Mutations in the
G-helix kink region confirm its involvement in forming a reactive enzyme–substrate
complex, as previously observed for other sesquiterpene synthases. Remarkably, certain
mutations in this region resulted in a drastic change in the product profile, with G188A in
particular leading to formation of the complex nonhydroxylated isolepidozene as the major
product. Our work thereby shows that alteration of the G-helix kink motif reduces
hydroxylation more efficiently than alteration of the RQH site. This indicates that subtle
changes in the water binding regions of terpene synthases can profoundly affect their
hydroxylation activities, and understanding the precise mechanism of such changes is a major
challenge. Future work, including simulation of key intermediates and reactions alongside an
experimental determination of mutant structures, may provide further insight into the exact
origin of the reduction in hydroxylation caused by G-helix kink mutations. Our work
highlights the intricate molecular choreography involved in water management by terpene
synthases. A detailed understanding of how terpene synthases control substrate hydroxylation
will pave the way for targeted enzyme engineering to allow efficient production of specific
and novel terpenoids.