Okke Melse1,2,3, Samuel Sutiono1,2, Magdalena Haslbeck1,2, Gerhard Schenk4,5, Iris Antes2,3, Volker Sieber1,2,4,6. 1. Chair of Chemistry of Biogenic Resources, Campus Straubing for Biotechnology and Sustainability, Technical University of Munich, Schulgasse 16, 94315, Straubing, Germany. 2. SynBiofoundry@TUM, Technical University of Munich, Schulgasse 22, 94315, Straubing, Germany. 3. TUM Center for Functional Protein Assemblies, Technical University of Munich, Ernst-Otto-Fischer-Straße 8, 85748, Garching, Germany. 4. School of Chemistry and Molecular Biosciences, The University of Queensland, 68 Cooper Road, 4072, St. Lucia, Australia. 5. Sustainable Minerals Institute, The University of Queensland, 47 Staff House Road, 4072, St. Lucia, Australia. 6. Catalytic Research Center, Technical University of Munich, Ernst-Otto-Fischer-Straße 1, 85748, Garching, Germany.
Abstract
The FeS cluster-dependent dihydroxyacid dehydratases (DHADs) and sugar acid-specific dehydratases (DHTs) from the ilvD/EDD superfamily are key enzymes in the bioproduction of a wide variety of chemicals. We analyzed [2Fe-2S]-dependent dehydratases in silico and in vitro, deduced functionally relevant sequence, structure, and activity relationships within the ilvD/EDD superfamily, and we propose a new classification based on their evolutionary relationships and substrate profiles. In silico simulations and analyses identified several key positions for specificity, which were experimentally investigated with site-directed and saturation mutagenesis. We thus increased the promiscuity of DHAD from Fontimonas thermophila (FtDHAD), showing >10-fold improved activity toward D-gluconate, and shifted the substrate preference of DHT from Paralcaligenes ureilyticus (PuDHT) toward shorter sugar acids (recording a six-fold improved activity toward the non-natural substrate D-glycerate). The successful elucidation of the role of important active site residues of the ilvD/EDD superfamily will further guide developments of this important biocatalyst for industrial applications.
The FeS cluster-dependent dihydroxyacid dehydratases (DHADs) and sugar acid-specific dehydratases (DHTs) from the ilvD/EDD superfamily are key enzymes in the bioproduction of a wide variety of chemicals. We analyzed [2Fe-2S]-dependent dehydratases in silico and in vitro, deduced functionally relevant sequence, structure, and activity relationships within the ilvD/EDD superfamily, and we propose a new classification based on their evolutionary relationships and substrate profiles. In silico simulations and analyses identified several key positions for specificity, which were experimentally investigated with site-directed and saturation mutagenesis. We thus increased the promiscuity of DHAD from Fontimonas thermophila (FtDHAD), showing >10-fold improved activity toward D-gluconate, and shifted the substrate preference of DHT from Paralcaligenes ureilyticus (PuDHT) toward shorter sugar acids (recording a six-fold improved activity toward the non-natural substrate D-glycerate). The successful elucidation of the role of important active site residues of the ilvD/EDD superfamily will further guide developments of this important biocatalyst for industrial applications.
Production of higher chain alcohols, such as isobutanol, has gained major interest in the last decade because of their potential as biofuels with properties similar to gasoline. Isobutanol has a higher energy density and is less hygroscopic than ethanol, the traditional biofuel.
Bioproduction of isobutanol from fermentation can be achieved by a modified Ehrlich pathway, i. e. catabolism of branched chain amino acids (BCAA).
One key enzyme in this pathway is dihydroxyacid dehydratase (DHAD; Scheme 1), which catalyzes the dehydration of (R)‐2,3‐dihydroxyisovalerate (DHIV) to 2‐ketoisovalerate (KIV). As an alternative to the fermentation approach, a cell‐free system with minimized cofactor and enzyme requirements was developed.
This in vitro system is able to convert D‐glucose to isobutanol utilizing only eight enzymes, in contrast to the 15 enzymes needed in the in vivo approach. The key enzyme in this system is a promiscuous DHAD from Saccharolobus solfataricus (SsDHAD). In addition to the dehydration of DHIV to KIV, this enzyme also catalyzes the dehydration of the sugar acids D‐gluconate to 2‐keto‐3‐deoxy‐D‐gluconate (KDG) and D‐glycerate to pyruvate (Scheme 1).
While the first reaction is part of the semi‐ or non‐phosphorylative Entner‐Doudoroff (ED) pathway, the latter is a non‐natural reaction that proceeds at a very slow rate, thus serving as the major bottleneck in the cell‐free system.[
,
] The dehydration of D‐glycerate is also the key step in the valorization of glycerol, thus enhancing the significance of DHAD for applications in biotechnology.[
,
]
Scheme 1
Minimized synthetic pathway for the production of isobutanol from D‐glucose and glycerol enabled by SsDHAD (orange). Furthermore, the dehydration products of the SsDHAD‐catalyzed reaction, i. e. pyruvate and 2‐ketoisovalerate, can be converted into additional chemical building blocks, alcohols, and amino acids (shown in blue, red and magenta, respectively).
Minimized synthetic pathway for the production of isobutanol from D‐glucose and glycerol enabled by SsDHAD (orange). Furthermore, the dehydration products of the SsDHAD‐catalyzed reaction, i. e. pyruvate and 2‐ketoisovalerate, can be converted into additional chemical building blocks, alcohols, and amino acids (shown in blue, red and magenta, respectively).In contrast to DHADs, which use DHIV as preferred substrate, a closely related group of dehydratases (DHTs) prefer various sugar acids. D‐xylonate DHT from Caulobacter crescentus (CcXylDHT) and L‐arabinonate DHT from Rhizobium leguminosarum (RlArDHT) have recently been characterized; both are candidate alternatives for SsDHAD in the in vitro cascade.
However, while CcXylDHT and RlArDHT are reactive toward long sugar acid substrates (e. g. D‐gluconate) we recently showed that they are practically inactive toward D‐glycerate.
In that study, several novel dehydratases (DHADs and DHTs) with activity toward D‐glycerate were described, with the DHAD from Fontimonas thermophila (FtDHAD) and the DHT from Paralcaligenes ureilyticus (PuDHT) being the most promising. Both showed up to 50‐fold higher activity toward D‐glycerate when compared to SsDHAD. Further substrate profiling demonstrated however that FtDHAD, in contrast to CcXylDHT and RlArDHT, is significantly less active toward longer sugar acids (such as D‐gluconate), while PuDHT is not active on branched‐chain substrates such as DHIV, in contrast to SsDHAD (Table 1).
Table 1
Experimental specific activities of DHTs and DHADs toward sugar acid substrates of varying size and the branched chain acid DHIV.
V (U/mg)[e]
D‐glycerate
L‐threonate
D‐xylonate
D‐gluconate
DHIV
PuDHT[a]
0.31±0.03
5.71±0.98
19.86±1.06
48.23±1.18
n. a.
RlArDHT[b]
<0.01±0.00
n. d.
6.19±0.76
7.65±0.71
n. d.
CcXylDHT[b]
<0.01±0.00
n. d.
25.65±0.67
47.91±2.86
n. d.
FtDHAD[c]
0.96±0.07
2.23±0.17
0.36±0.01
n. a.
12.92±0.63
MtDHAD[12]
n. d.
n. d.
n. d.
n. d.
1.87±0.06
AthDHAD[17]
n. d.
n. d.
n. d.
n. d.
Active[d]
SsDHAD[c]
0.02±0.00
0.03±0.00
2.18±0.03
0.77±0.04
0.75±0.03
[a] Activity determined in this study at 30 °C. [b] Activity toward D‐glycerate was taken from Sutiono et al. (2020)
and that toward D‐xylonate and D‐gluconate from Andberg et al. (2016).
[c] Activities in this study were determined at 50 °C. The loading of [Fe−S] was not determined. Thus, although the values of the activities for the DHADs may be higher than reported here, the substrate preference will remain the same. [d] Activity was determined, but no absolute value provided in cited study. [e] Activity of PuDHT, FtDHAD and SsDHAD was determined with a substrate concentration of 25 mM. n. d.=activity was not determined in the previous studies; n. a.=activity <0.01 U/mg.
Experimental specific activities of DHTs and DHADs toward sugar acid substrates of varying size and the branched chain acid DHIV.V (U/mg)[e]D‐glycerateL‐threonateD‐xylonateD‐gluconateDHIVPuDHT[a]0.31±0.035.71±0.9819.86±1.0648.23±1.18n. a.RlArDHT[b]<0.01±0.00n. d.6.19±0.767.65±0.71n. d.CcXylDHT[b]<0.01±0.00n. d.25.65±0.6747.91±2.86n. d.FtDHAD[c]0.96±0.072.23±0.170.36±0.01n. a.12.92±0.63MtDHADn. d.n. d.n. d.n. d.1.87±0.06AthDHADn. d.n. d.n. d.n. d.Active[d]SsDHAD[c]0.02±0.000.03±0.002.18±0.030.77±0.040.75±0.03[a] Activity determined in this study at 30 °C. [b] Activity toward D‐glycerate was taken from Sutiono et al. (2020)
and that toward D‐xylonate and D‐gluconate from Andberg et al. (2016).
[c] Activities in this study were determined at 50 °C. The loading of [Fe−S] was not determined. Thus, although the values of the activities for the DHADs may be higher than reported here, the substrate preference will remain the same. [d] Activity was determined, but no absolute value provided in cited study. [e] Activity of PuDHT, FtDHAD and SsDHAD was determined with a substrate concentration of 25 mM. n. d.=activity was not determined in the previous studies; n. a.=activity <0.01 U/mg.DHTs and DHADs belong to the ilvD/EDD (isoleucine, leucine, valine Dehydratase/Entner‐Doudoroff Dehydratase) superfamily of enzymes.[
,
,
] To date, there is no consistent naming system for enzymes from this superfamily, which complicates their comparative analysis. Most of the characterized representatives are tetrameric, containing four identical monomers arranged as a dimer of homodimers. Each dimer contains two active sites, located at the dimer interface (Figure 1). Despite the central role of these enzymes in metabolism and potential in biotechnology, X‐ray structures are currently only available for a few of these enzymes including the DHADs from Arabidopsis thaliana (AthDHAD; PDB: 5ze4)
and Mycobacterium tuberculosis (MtDHAD; PDB: 6ovt),
and the DHTs CcXylDHT (PDB: 5oyn)
and RlArDHT (PDB: 5j84/5j85).
This paucity of available structures limits both in‐depth comparative functional studies among members of the ilvD/EDD superfamily and the (semi‐)rational engineering of variants with properties suitable for industrial applications. No in silico studies of DHADs or DHTs belonging to the ilvD/EDD superfamily have yet been reported, partially due to the limited availability of experimental structures, but also because of the complex nature of the catalytic site, which requires both an FeS cluster ([2Fe−2S] or [4Fe−4S]) and a nearby Mg2+ ion (Figure 1).[
,
] The complex electronic properties of the FeS cluster and the highly polarized binding site are difficult to parameterize and simulate, making computational studies challenging and time‐consuming. There are also no bound substrates in any of the available DHAD or DHT X‐ray structures. Therefore, molecular docking simulations are required to identify likely binding poses of the substrates, as well as residues that play an important role in substrate binding and catalysis. Such residues may represent mutational hotspots to engineer DHAD/DHT variants with tailored properties.
Figure 1
Oligomerization of [2Fe−2S]‐dependent dehydratases. The figures were created based on a DHAD from M. tuberculosis (MtDHAD, PDB: 6ovt). The single monomers are shown as respective green and grey cartoon, forming a dimer (A). The termini and the binding site are indicated in the figure. Two dimers pack together, forming a tetramer (B). The octahedral coordination of the Mg2+ ion is completed by water molecules in the X‐ray structure (C). The alignment numbers (see Figure S1) are used for residue numbering; residue numbers from the original sequence of MtDHAD are shown in brackets.
Oligomerization of [2Fe−2S]‐dependent dehydratases. The figures were created based on a DHAD from M. tuberculosis (MtDHAD, PDB: 6ovt). The single monomers are shown as respective green and grey cartoon, forming a dimer (A). The termini and the binding site are indicated in the figure. Two dimers pack together, forming a tetramer (B). The octahedral coordination of the Mg2+ ion is completed by water molecules in the X‐ray structure (C). The alignment numbers (see Figure S1) are used for residue numbering; residue numbers from the original sequence of MtDHAD are shown in brackets.In this work, we introduce a new classification system, which will allow more straightforward comparison among members of the ilvD/EDD superfamily based on their substrate profiles and evolutionary relationship, in particular the [2Fe−2S]‐dependent dehydratases. Furthermore, we used the available crystal structures of DHADs and DHTs to build homology models of SsDHAD, FtDHAD and PuDHT. These models, together with computational simulations, allowed predictions of the origin of the substrate selectivity of these enzymes. We used this insight to rationally design variants of these three enzymes with improved activity toward substrates of interest. The success of our predictions was highlighted in the H653F variant of PuDHT (residue numbering according to the sequence alignment; Figure S1), which has a six‐fold higher activity toward D‐glycerate than its wild‐type counterpart. This study sheds light on the sequence, structure and activity relationship within the ilvD/EDD superfamily and thus facilitates targeted bioengineering studies to design optimized biocatalysts for industrially relevant biotransformations.
Results and Discussion
Substrate profiles and phylogenetic relationship: New classification of DHADs and DHTs
In previous work we identified dehydratases that are active on D‐glycerate and partially characterized more than 20 enzymes.
We focused primarily on the dehydratases that contain a [2Fe−2S] center because of their relative stability in the presence of oxygen.
These enzymes can be grouped into two clusters based on their substrate profiles. The first cluster consists of DHTs, which are active toward longer sugar acids, and the second cluster contains DHADs, which have a preference for branched dihydroxyacids such as DHIV. We used one enzyme from each cluster as model enzyme in this work, namely PuDHT and FtDHAD.
We included SsDHAD in our study as this enzyme was reported to be active on both substrate classes.
We characterized these enzymes under similar experimental conditions, and compared the results to other known dehydratases (Table 1). In the assays, sugar acids with increasing chain length (i. e. from D‐glycerate via L‐threonate and D‐xylonate to D‐gluconate) and the branched DHIV were used as substrates. The concentration for each substrate was 25 mM, which is well‐above the KM of previously characterized DHADs and DHTs.[
,
,
,
] Furthermore, we focused primarily on relative activities rather than absolute values in order to compare the change in the substrate profiles of the investigated enzymes.In agreement with our previous study, PuDHT is most active toward D‐gluconate, with the activity decreasing as the chain length of the sugar acid substrate decreases; no activity toward DHIV is recorded (Table 1). In contrast, FtDHAD has virtually no activity toward D‐gluconate, but is active toward shorter sugar acid substrates, in particular L‐threonate. Maximum activity for FtDHAD is recorded with its biological substrate, DHIV, more than five‐times faster than that measured with L‐threonate. The substrate preference of SsDHAD lies somewhere in between those of PuDHT and FtDHAD with the relatively large sugar acid D‐xylonate being the most preferred reactant, but the enzyme displaying significant activity for DHIV as well.
To explore the evolutionary relationship of these dehydratases further, we constructed a phylogenetic tree based on the sequences of all reported DHADs and DHTs in the literature (Figure 2). Both FtDHAD and SsDHAD cluster with other DHADs such as the enzymes from A. thaliana (AthDHAD) and M. tuberculosis (MtDHAD). However, SsDHAD appears to have diverged from other DHADs early, maintaining larger similarities to DHTs. PuDHT clusters closely with other known DHTs, including CcXylDHT and RlArDHT.
Figure 2
Phylogenetic tree of characterized and selected putative enzymes from the IlvD/EDD superfamily. The blue branch represents sugar acid dehydratases (SADHTs) and the red branch represents DHADs active on branched chain acids (BCADHTs). The yellow branch represents promiscuous DHADs active on sugar acids and branched chain acids (PADHT). Sto and MsDHADs have not yet been characterized but were included in the tree to highlight the PADHT clade. The green branch represents DHADs, which contain a [4Fe‐4S] cluster, and the purple branch represents 6‐phosphogluconate dehydratases. The tree was constructed using MAFFT with the default method and visualized using iTOL.[
,
]
Phylogenetic tree of characterized and selected putative enzymes from the IlvD/EDD superfamily. The blue branch represents sugar acid dehydratases (SADHTs) and the red branch represents DHADs active on branched chain acids (BCADHTs). The yellow branch represents promiscuous DHADs active on sugar acids and branched chain acids (PADHT). Sto and MsDHADs have not yet been characterized but were included in the tree to highlight the PADHT clade. The green branch represents DHADs, which contain a [4Fe‐4S] cluster, and the purple branch represents 6‐phosphogluconate dehydratases. The tree was constructed using MAFFT with the default method and visualized using iTOL.[
,
]Although DHADs and DHTs all belong to the ilvD/EDD superfamily, only DHADs are associated with a function in BCAA biosynthesis (due to their activity toward DHIV). DHTs, with preference for sugar acids, are key enzymes in the oxidative pentose (OP) pathways (Dahms and Weimberg pathways).[
,
] SsDHAD, on the other hand, appears to occupy a unique evolutionary position, demonstrating significant activity toward both sugar acids and DHIV, and is thus believed to play a role in the BCAA, ED and OP pathways. However, strictly speaking, since all of these substrates contain a hydroxyl group at the α‐ (C2) and β‐positions (C3) of an acid substrate, all of these enzymes are, in fact, dihydroxyacid dehydratases, i. e. DHADs. Thus, to make a clearer distinction, we propose another naming scheme, where enzyme homologous to PuDHT, CcXylDHT and RlArDHT are named sugar acid dehydratases (SADHTs), and enzymes homologous to FtDHAD, which are predominantly active toward DHIV, are named branched chain acid dehydratases (BCADHTs). Finally, enzymes that display comparable activities toward sugar and branched chain acid substrates (such as SsDHAD) are labelled as promiscuous acid dehydratases (PADHTs). To date, SsDHAD is the only PADHT that has been characterized but the phylogenetic inference (Figure 2) suggests that enzymes such as StoDHAD from Sulfurisphaera tokodaii and MsDHAD from Metallosphaera sedula should also have a promiscuous substrate profile.
In silico modeling of representative DHADs
To gain structural insights relevant to the engineering of DHADs with optimized catalytic properties, homology models of representative DHADs for each of the three classes were generated (i. e. PuDHT, FtDHAD and SsDHAD). The crystal structures of AthDHAD, MtDHAD, CcXylDHT and RlArDHT were used as templates. A multiple sequence comparison was performed to align all templates and target proteins (Figure S1). Since residues sharing an alignment position are located in the same three‐dimensional space in the protein, the alignment positions allow easier identification and comparison of residues based on their location in the protein. Therefore, we used the alignment positions of amino acids, rather than their position in the sequence throughout the remainder of this study. The correlation between sequence and alignment positions for relevant residues is shown in Table S1. The homology models (Figure S2) scored well during the QMEAN
quality estimation analysis, with QMEAN4 Z‐scores ranging from −1.34 to −0.62 (Table S2), indicating a high quality for the homology models (see the Computational Biology section in Supporting Information for further details).Two models for the reaction mechanism employed by DHADs and DHTs were recently proposed. Rahman et al.,
based on their studies with CcXylDHT, suggested a mechanism for the dehydration reaction where the proton from the C2 atom of the substrate (Scheme 1) is removed by the deprotonated alkoxide side chain of the serine residue in alignment position 552 (see Figure 3 for details). The resulting carbanion is stabilized by the Mg2+ in the active site, leading to a weakening of the C3−O bond. Subsequently, one of the iron atoms of the [2Fe−2S] cluster (Fe2) promotes the abstraction of the hydroxyl group from C3, which triggers the tautomerization of the product to its keto form. This mechanistic model is supported by mutagenesis data that demonstrate the essential role of Ser552, as well as Mössbauer and EPR spectra that highlight the significance of the Lewis acid behavior of the [2Fe−2S] cluster in the catalytic cycle.[
,
,
] More recently, an alternative reaction mechanism was proposed for the DHIV dehydration catalyzed by a DHAD from the cyanobacterium Synechocystis sp. PCC 6803 (SnDHAD).
In this model, the C2 hydrogen is abstracted by a base (possibly a water molecule coordinated by Asp180). Subsequently, Ser552 donates a proton to 3‐OH and thereby facilitates elimination of this hydroxyl group, releasing a water molecule. We performed docking simulations (see below) of relevant substrates in the homology models of SsDHAD, FtDHAD and PuDHT, which largely support the reaction mechanism proposed by Rahman and colleagues. All of our docking simulations indicate that Ser552 is in close proximity to the C2 hydrogen, Fe2 is near C3−OH and the Mg2+ ion is close to C2−OH, where it can stabilize the negative charge of the carbanion (Figure 3).
Figure 3
Structural representation of the binding site of FtDHAD (A), PuDHT (B) and SsDHAD (C). The binding pocket is visualized by the mesh surface and residues identified as mutation hotspots and other relevant binding site residues are shown as orange and grey sticks, respectively. DHIV docked in FtDHAD is used to visualize the conserved binding mode of dihydroxyacids in DHADs (D). Alignment numbers are used for the residue numbering, and amino acid side chains are indicated using their single letter abbreviation, except KCX, which stands for carboxylated lysine; a singular X indicates any amino acid. Note that the amino acid at alignment position 73 originates from the other monomer. The sequence alignment of the hotspot regions is shown (E), where the numbers above the sequences indicate the alignment numbers, and helices and arrows indicate the secondary structure of that corresponding sequence (alpha‐helix or beta‐sheet, respectively). The dotted boxes highlight the mutation hotspots.
Structural representation of the binding site of FtDHAD (A), PuDHT (B) and SsDHAD (C). The binding pocket is visualized by the mesh surface and residues identified as mutation hotspots and other relevant binding site residues are shown as orange and grey sticks, respectively. DHIV docked in FtDHAD is used to visualize the conserved binding mode of dihydroxyacids in DHADs (D). Alignment numbers are used for the residue numbering, and amino acid side chains are indicated using their single letter abbreviation, except KCX, which stands for carboxylated lysine; a singular X indicates any amino acid. Note that the amino acid at alignment position 73 originates from the other monomer. The sequence alignment of the hotspot regions is shown (E), where the numbers above the sequences indicate the alignment numbers, and helices and arrows indicate the secondary structure of that corresponding sequence (alpha‐helix or beta‐sheet, respectively). The dotted boxes highlight the mutation hotspots.
Hotspot identification
From the multiple sequence alignment (Figure S1), a number of conserved residues were identified within each of the three DHT classes (i. e. SADHT, BCADHT and PADHT), including the three cysteine residues that coordinate the two iron atoms of the FeS cluster (Cys106, Cys179 and Cys257), and the catalytic serine residue (Ser552), but there are some interesting variations. For example, the motif around the second cysteine ligand of the [2Fe−2S] cluster (Cys179; underlined) is identical for SA‐ and PADHTs (i. e. GCDKTT; Figure S1) but is replaced by GCDKNM in BCADHTs. However, in the motif surrounding the third cysteine ligand of the FeS cluster (Cys257), PA‐ and BCADHTs are well conserved (GTCSG vs GACGG), whereas the SADHTs are less so (GHCM/NT). Since representatives from the three DHT classes differ significantly in their substrate profiles while maintaining a similar overall structure and mechanism (Table 1), residues with high intra‐class but low inter‐class conservation are expected to be major determining factors (i. e. “hotspots”) that control substrate specificity in these enzymes. Using the homology models of PuDHT, FtDHAD and SsDHAD (Figure S2), an amino acid sequence comparison (Figure S1) and in silico substrate docking (Figure 3D), we identified several hotspots. Only residues in or within proximity of the active site were considered. The identified hotspots are (alignment positions): 73, 138, 224, 261, 554–555 (SGXX motif) and the C‐terminal histidine of SADHTs (Figure 3A–C, E). We performed site‐directed mutagenesis or saturation mutagenesis of these hotspots to validate our predictions, and to define the contribution of each of these hotspots on the substrate profile and enzyme activity (Table 2).
Table 2
Relative activity of PuDHT, FtDHAD and SsDHAD variants against substrates with varying size. The highest activity observed in the wild‐type enzyme was set as 100 %. The sequence numbers are the alignment positions.[a]
D‐glycerate
L‐threonate
D‐xylonate
D‐gluconate
DHIV
PuDHT – WT
0.65±0.06
11.90±2.04
41.38±2.20
100.00±2.45
n. a.
E138D
n. a.
0.71±0.06
1.23±0.03
4.23±0.42
n. a.
E138A
n. d.
n. d.
n. d.
0.05±0.00
n. d.
T224E
n. a.
n. a.
n. a.
n. a.
n. a.
G261F
n. a.
n. a.
n. a.
n. a.
n. a.
G261Y
n. a.
n. a.
n. a.
n. a.
n. a.
T554G, A555T (SGGT)
n. a.
n. a.
n. a.
n. a.
n. a.
H653F (C‐terminus)
4.02±0.22
4.31±0.93
5.56±0.06
0.50±0.02
n. a.
FtDHAD – WT
7.44±0.56
18.04±1.30
2.81±0.07
0.02±0.00
100.00±4.91
P73G
1.76±0.1
4.68±1.72
1.55±0.14
0.22±0.01
64.20±7.59
D138E
0.10±0.00
2.04±0.13
0.15±0.00
n. a.
16.55±1.15
D138A
n. d.
n. d.
n. d.
n. d.
0.04±0.00
V224E
0.02±0.01
0.14±0.01
0.03±0.01
n. a.
0.60±0.08
Y261G
0.03±0.00
0.32±0.00
0.11±0.00
n. a.
1.03±0.04
G554T, T555A (SGTA)
n. a.
0.08±0.00
n. a.
n. a.
n. a.
SsDHAD – WT
1.12±0.00
1.58±0.09
100.00±1.54
35.5±1.78
34.73±1.16
D138E
0.92±0.00
1.28±0.02
11.53±1.50
16.76±2.30
22.54±3.34
D138A
n. d.
n. d.
0.23±0.01
n. d.
n. d.
A554T, T555A (SGTA)
n. a.
n. a.
n. a.
n. a.
n. a.
[a] n. d.=not determined; n. a.=relative activity <0.01 %.
Relative activity of PuDHT, FtDHAD and SsDHAD variants against substrates with varying size. The highest activity observed in the wild‐type enzyme was set as 100 %. The sequence numbers are the alignment positions.[a]D‐glycerateL‐threonateD‐xylonateD‐gluconateDHIVPuDHT – WT0.65±0.0611.90±2.0441.38±2.20100.00±2.45n. a.E138Dn. a.0.71±0.061.23±0.034.23±0.42n. a.E138An. d.n. d.n. d.0.05±0.00n. d.T224En. a.n. a.n. a.n. a.n. a.G261Fn. a.n. a.n. a.n. a.n. a.G261Yn. a.n. a.n. a.n. a.n. a.T554G, A555T (SGGT)n. a.n. a.n. a.n. a.n. a.H653F (C‐terminus)4.02±0.224.31±0.935.56±0.060.50±0.02n. a.FtDHAD – WT7.44±0.5618.04±1.302.81±0.070.02±0.00100.00±4.91P73G1.76±0.14.68±1.721.55±0.140.22±0.0164.20±7.59D138E0.10±0.002.04±0.130.15±0.00n. a.16.55±1.15D138An. d.n. d.n. d.n. d.0.04±0.00V224E0.02±0.010.14±0.010.03±0.01n. a.0.60±0.08Y261G0.03±0.000.32±0.000.11±0.00n. a.1.03±0.04G554T, T555A (SGTA)n. a.0.08±0.00n. a.n. a.n. a.SsDHAD – WT1.12±0.001.58±0.09100.00±1.5435.5±1.7834.73±1.16D138E0.92±0.001.28±0.0211.53±1.5016.76±2.3022.54±3.34D138An. d.n. d.0.23±0.01n. d.n. d.A554T, T555A (SGTA)n. a.n. a.n. a.n. a.n. a.[a] n. d.=not determined; n. a.=relative activity <0.01 %.
Influence of alignment position 73 on substrate specificity and catalytic efficiency
The residue located in alignment position 73 is on an N‐terminal helix of a monomer of the DHAD/DHT homodimer (Figure S2); however, this residue forms part of the active site of the other monomer (Figures 3 and S2). Diverse residues of varying size are found at this position, ranging from proline in SsDHAD and FtDHAD, to the polar and bulky tyrosine in RlArDHT and CcXyDHT (Figure 3). In PuDHT an isoleucine is found in this position, and saturation mutagenesis did not result in variants with a significant change in specificity toward D‐gluconate, L‐threonate and DHIV (Figure S3). This observation stands in contrast to an earlier study on the structure of CcXylDHT, which predicted that the residue in this position is important for the substrate preference of SADHTs.
The activity landscape of the FtDHAD P73X library toward DHIV indicates that >70 % of the variants show a significant decrease in activity in comparison to the wild‐type enzyme (Figure S3). Decreased activity was also observed toward L‐threonate. However, several variants in this library were observed that have almost >three‐fold higher activity toward D‐gluconate than wild‐type FtDHAD. Importantly, and in contrast to the wild‐type enzyme, three of them display comparable activity toward DHIV and L‐threonate (Figure S3). Since our main goal is to find enzymes which show enhanced substrate promiscuity, we selected one of these three variants for further studies. It contains the proline to glycine substitution at position 73, and has a >10‐fold higher activity toward D‐gluconate, while retaining ∼64 % of the activity toward DHIV when compared to wild‐type FtDHAD (Table 2 and Figure 4A). A possible explanation for this observed effect is a slight increase of binding site volume. Thus, the combined data demonstrate that the amino acid at position 73 plays an important role in altering the substrate preference of a BCADHT to that of a SADHT, but not vice versa (i. e. PuDHT is not sensitive toward mutations in that position).
Figure 4
Effect of site‐directed mutations on the substrate profile of FtDHAD. Relative activities of FtDHAD variants shown in a radar plot in logarithmic scale, with the highest observed activity set to 100 % (A). Exact values are provided in Table 2. Docking of DHIV (B) and D‐gluconate (C) in the active site of FtDHAD. P73 is shown as orange sticks. Alignment numbers are used for the residue numbering.
Effect of site‐directed mutations on the substrate profile of FtDHAD. Relative activities of FtDHAD variants shown in a radar plot in logarithmic scale, with the highest observed activity set to 100 % (A). Exact values are provided in Table 2. Docking of DHIV (B) and D‐gluconate (C) in the active site of FtDHAD. P73 is shown as orange sticks. Alignment numbers are used for the residue numbering.
Influence of alignment position 138 on Mg2+ coordination
Apart from the FeS cluster, members of the ilvD/EDD superfamily also require a Mg2+ ion (or other divalent cations) in the active site for their catalytic function. The majority of the residues coordinating this ion are highly conserved and include D180 and E524, as well as a carboxylated lysine (KCX181) and two water molecules (Figure 1C). Some modest variability is observed with the ligand in alignment position 138, where an aspartate is present in BCADHTs but a glutamate in SA‐ and PADHTs (Figure 3E). Our docking studies reveal that sugar acid substrates replace the water molecules and coordinate the Mg2+ ion with one carboxyl oxygen and the C2−OH, thereby completing the octahedral coordination sphere of the metal ion (Figure 3D). To evaluate the effect of the non‐conserved residue in position 138 on the catalytic properties of the dehydratases, we performed the following mutations: E138D and E138A in PuDHT and D138E and D138A in FtDHAD and SsDHAD (Table 2). In all variants containing the alanine substitution a virtually complete loss of activity was observed for all substrates, likely to be due to impaired Mg2+ binding. The E138D mutation in PuDHT had a similar effect, possibly because the shorter aspartate side chain is not able to interact effectively with the Mg2+. Less dramatic is the effect of the D138E substitution in FtDHAD and SsDHAD, but in general a reduction of the activity was observed for each substrate tested. The in silico simulations could not provide a conclusive explanation for this effect, but the longer side chain of glutamate may lead to a shift of the Mg2+ ion away from the [2Fe−2S] cluster, and thereby prevent optimal substrate orientation. Importantly, while the residue in position 138 clearly plays an essential role in Mg2+ binding, none of the mutations in this position had a significant effect on the substrate specificity of the dehydratases.
The effect of a negative charge in position 224 in the active site
The residues in alignment position 224 are either a hydrophobic branched chain amino acid in BCADHTs, a hydrophilic threonine in SADHTs or a negatively charged glutamate in the PADHT SsDHAD (Figure 3). The promiscuity of SsDHAD is beneficial for the cell‐free bioproduction of isobutanol (Scheme 1), a trait we aimed to establish in SADHTs and BCADHTs. We hypothesized that the negative charge at position 224 plays an important role in the promiscuity of PADHTs. Accordingly, T224E and V224E variants of PuDHT and FtDHAD, respectively, were generated. This mutation, however, completely abolished the activity of PuDHT toward all substrates tested (Table 2), and a similar effect was observed for FtDHAD, although the mutant retained modest (<1 %) activity for most of the substrates. Thus, substituting an amino acid at this position in PuDHT and FtDHAD into a negatively charged glutamate is not sufficient to introduce the promiscuity of SsDHAD into SADHTs or BCADHTs.
Binding site volume (alignment position 261)
Another marked difference between the three classes of DHTs is observed in position 261. While glycine is present in the SADHTs, this small side chain is replaced by the much bulkier tyrosine and phenylalanine in the BCADHTs and PADHTs, respectively. An analysis of the active sites of the homology models for PuDHT, FtDHAD and SsDHAD (Figure 3) reveals that these residues point inside the active site of these enzymes. Consequently, the SADHTs have a considerably larger active site volume than the other DHTs. Hence, in order to test if the active site volume plays an important role in their substrate selection, G261F and G261Y mutations were introduced in PuDHT and the Y261G in FtDHAD (we did not perform the corresponding mutation in SsDHAD, i. e. F261G, as our primary interest was in broadening the substrate scope of PuDHT and FtDHAD to a level similar to that of SsDHAD). The substitutions Y261G in FtDHAD resulted in drastic loss of activity, while the substitutions G261F and G261Y in PuDHT both completely abolished the activity toward all substrates. Thus, changing the binding volume of FtDHAD and PuDHT by single substitution is not enough to alter their substrate preferences.
The role of the SGXX motif on the catalytic properties (alignment positions 552–555)
The crystal structure of the SADHT CcXylDHT highlighted the crucial role of the highly conserved serine residue in position 552 in the reaction mechanism of ilvD/EDD dehydratases (see above).
It has been proposed that the deprotonated side chain of Ser552 initiates the catalytic cycle by abstracting a proton from the C2 atom of the substrate. Furthermore, structural information from another SADHT, RlArDHT, also ascribes an important role for the threonine at position 554 in the mechanism, stabilizing the catalytic serine during the reaction.
In BCADHTs and PADHTs a glycine or alanine, respectively, is located in this position. However, in these enzymes a threonine is present in position 555 (SGGT and SGAT, respectively), whereas an alanine is present in SADHTs instead (SGTA). In order to assess the importance of the residues in these positions for the mechanism of these ilvD/EDD dehydratases, a series of mutants were generated (Tables 2 and S3). For PuDHT, the threonine in position 554 is indeed crucial for the activity toward its preferred substrate, D‐gluconate, with (>0.5 % remaining upon substitution with glycine or valine). Interestingly, the alanine in position 555 also plays an important role; upon its replacement by another threonine only ∼0.7 % of the activity remains. Furthermore, replacing the wild‐type Thr554‐Ala555 sequence in PuDHT with the Gly554‐Thr555 sequence present in BCADHTs leads to a completely inactive double mutant (irrespective of the substrate used). A similar trend was observed for FtDHAD, where the conversion of its native SGGT motif to the SGTA motif present in SADHTs led to the virtually complete inactivation of the enzyme (again irrespective of the substrates used; Table 2). The threonine in this motif is also important for the function of FtDHAD, but contrasting the observations made for PuDHT, its replacement by a non‐polar amino acid does not completely abolish the catalytic function of the enzyme (the SGGA variant retains over 18 % of the activity). However, in both enzymes it is essential that a non‐polar side chain is located next to this threonine ‐ replacing it with another threonine is highly detrimental to the activities of these enzymes (Table S3). The same observations were also made for corresponding mutations in the PADHT SsDHAD. Hence, while functionally essential, residues of the SGxx motif do not play an important role in determining the substrate preference of these enzymes. This interpretation is supported by in silico mutation studies, suggesting that a potential interaction between the substrate and Thr554 in PuDHT may also exist in the G554T−T555A double mutant of FtDHAD (i. e. a variant with the SGTA motif present in DHT). Since the residues of the SGxx motif are located on a rather flexible loop, we performed MD simulations with wild‐type enzymes and variant forms with inverted motifs to probe if the mutations alter the loop dynamics. However, no significant difference was observed (data not shown). It is thus likely that long‐range interactions, possibly mediated via an extensive hydrogen bond network, may play an important role in the mechanism. An in‐depth analysis of such effects is beyond the scope of the current study.
Altering the substrate specificity of DHADs toward D‐glycerate
The hotspots identified in this study play an important role in the function of the ilvD/EDD dehydratases, and in some cases the substrate preference could be broadened through targeted mutations (e. g. the P73G mutation which widens the substrate preference of FtDHAD to accept D‐gluconate; Table 2). However, none of the mutations in these hotspots altered the substrate preferences of the SA‐ and PADHTs. Furthermore, none of the variants displayed increased activity toward D‐glycerate, an important substrate in biomanufacturing processes (Scheme 1). The crystal structures of CcXylDHT and RlArDHT and the homology model of PuDHT reveal that the conserved C‐terminal histidine residue of the SADHTs extends toward the active site, located at the dimer interface. In contrast, in the available crystal structures of the BCADHTs AthDHAD and MtDHAD, as well as the homology models of FtDHAD and SsDHAD, the C‐terminal residue is not conserved and is not in the vicinity of the active site (Figures S1 and S4). Docking simulations were thus performed with the homology model of PuDHT and various sugar acid substrates to probe if the C‐terminal histidine contributes to catalysis in SADHTs. We hypothesized that this residue may play a role in positioning sugar acid substrates in the active site, a hypothesis that is based on the observation that PuDHT is more reactive toward larger sugar acids (Table 1). Indeed, the docking simulations demonstrate that both D‐gluconate and D‐xylonate are able to form a hydrogen bond with the C‐terminal histidine via their C5−OH groups; L‐threonate and D‐glycerate are too short to form a similar H‐bond (Figure S5). These findings were further supported by MD simulations with enzyme‐D‐gluconate and enzyme‐L‐threonate complexes. In these simulations, both C5−OH and C6−OH of D‐gluconate alternately form a hydrogen bond with the terminal histidine residue of PuDHT (Figures 5 and S6), an interaction that is conserved throughout the entire MD simulation. In contrast, MD simulations with the docked enzyme‐L‐threonate complex confirmed that no interactions between this substrate and the C‐terminal histidine is formed. In order to substantiate these computational predictions, two saturation libraries for PuDHT were generated. One library was designed to target the terminal histidine residue, and the other added an amino acid prior to this terminal histidine, thus extending the C‐terminal end by one amino acid. The two libraries were screened using different sugar acids and DHIV as substrates. The extension of the C‐terminal end showed a highly detrimental effect, completely inactivating the enzyme regardless of which substrate was used in the assays (Figure S7). Saturating the terminal histidine to other amino acids demonstrated a similar effect for the reaction with D‐gluconate. However, one variant (H653F) showed an improved activity toward D‐glycerate. While wild‐type PuDHT is minimally active toward D‐glycerate (less than 1 % when compared to D‐gluconate), the variant is ∼six‐fold more reactive toward this substrate, and the ratio of D‐gluconate/D‐glycerate activity has improved >1000‐fold in comparison to the wild‐type enzyme (Table 2 and Figure 5A). Hence, the C‐terminal end plays indeed an important role in the mechanism of SADHTs. Extending the length of the C‐terminal end did not promote interactions with the smaller substrates, possibly because of steric clashes. However, replacing the native histidine by a slightly larger phenylalanine altered the size of the active site cavity sufficiently to exclude the large D‐gluconate substrate and favor binding of smaller sugar acids.
Figure 5
Effect of site‐directed mutations on the substrate profile of PuDHT. Relative activities of PuDHT variants shown in a radar plot in logarithmic scale with the highest observed activity set to 100 % (A). Exact values are provided in Table 2. Two preferred binding orientations for D‐gluconate in the active site of PuDHT were modelled (B, C), where the percentage indicates the frequency of the MD simulations in which the respective binding mode was observed. Alignment numbers (Figure S1) are used for the residue numbering.
Effect of site‐directed mutations on the substrate profile of PuDHT. Relative activities of PuDHT variants shown in a radar plot in logarithmic scale with the highest observed activity set to 100 % (A). Exact values are provided in Table 2. Two preferred binding orientations for D‐gluconate in the active site of PuDHT were modelled (B, C), where the percentage indicates the frequency of the MD simulations in which the respective binding mode was observed. Alignment numbers (Figure S1) are used for the residue numbering.
Stereoselective substrate specificity of PuDHT and FtDHAD
We also probed the stereoselective substrate specificity of SADHTs and BCADHTs. SADHTs only accept sugar acid substrates that have the (R) and (S) configuration at positions C2 and C3, respectively.
Similarly, spinach DHAD (SoDHAD), a BCADHT, has a preference for substrates with (R) configuration at position C2.
Here, we examined the stereoselective substrate specificity of PuDHT and FtDHAD using several C5 sugar acids with different configurations in positions C2 and C3, namely D‐ribonate (R,R), D‐arabinonate (S,R), D‐xylonate (R,S) and D‐lyxonate (S,S). PuDHT, similar to other SADHTs, is only active toward D‐xylonate, while the BCADHT FtDHAD is equally active toward D‐ribonate and D‐xylonate (Figure 6). Molecular docking simulations using the homology models of PuDHT and FtDHAD demonstrate the (R) conformation at position C2 is essential as only in this stereoisomer both the OH group at C2 and a carboxyl oxygen at C1 are able to coordinate the catalytically essential Mg2+ ion. The docking simulations also indicate that D‐ribonate can be accommodated in the active site of FtDHAD, maintaining its orientation and distance to Ser552, which initiates the catalytic cycle by abstracting a proton from the substrate (see above and Figure S8). However, docking simulations were not able to rationalize the stricter stereochemical requirement of SADHTs at position C3.
Figure 6
The effect of the stereochemistry at positions C2 and C3 on the substrate acceptance of SADHTs (represented by PuDHT) and BCADHTs (represented by FtDHAD). Exact values are provided in Table S4.
The effect of the stereochemistry at positions C2 and C3 on the substrate acceptance of SADHTs (represented by PuDHT) and BCADHTs (represented by FtDHAD). Exact values are provided in Table S4.
Conclusions
Dihydroxyacid dehydratases, in particular those that contain a [2Fe−2S] cluster in their active sites, encompass enzymes with broad substrate spectra. Three distinct classes (SADHTs, BCADHTs and PADHTs) can be discerned based on their phylogenetic relationship (Figure 2) and substrate profiles (Table 1). We built homology models of three representative enzymes (PuDHT, FtDHAD and SsDHAD) from each class. Together with in silico simulations and an analysis of sequence conservation, we were able to predict several hotspots that may play an important role in the function of these enzymes. Site‐directed mutations within these hotspots demonstrated that all of them affect the substrate selectivity and/or enzyme activity (Tables 2 and S3; Figures 4 and 5). We probed the catalytic roles of several of these hotspots in more detail. For example, the amino acid at alignment position 138 plays a role in coordinating the catalytically essential Mg2+ ion across all of these enzymes. Furthermore, their substrate promiscuity (or lack thereof) may be governed by three amino acids located at alignment positions 73, 224 and 261. Substituting the native proline at position 73 by a glycine (P73G) improved the substrate promiscuity of FtDHAD; its P73G variant has >10‐fold improved activity toward D‐gluconate, while the activity toward DHIV (the biological substrate of the wild‐type enzyme) remains high. We used structural and sequence information to engineer dehydratases with enhanced preference for the non‐natural substrate D‐glycerate, which is an intermediate in the biotransformation of D‐glucose and glycerol to various chemicals (Scheme 1). The C‐terminal histidine was identified as relevant hotspot, as it stabilizes and orients larger sugar acid substrates in SADHTs. Replacing this histidine in PuDHT by a bulkier phenylalanine indeed shifted the substrate preference of this enzyme toward shorter sugar acids, improving the activity toward D‐glycerate ∼six‐fold. In summary, the combination of in silico and mutagenesis studies with representatives from the three classes of the ilvD/EDD superfamily provides a “roadmap” for the engineering of optimized dehydratases for biotransformation that are of great interest to the chemical industry.
Experimental Section
Enzyme production and site‐directed mutagenesis: All dehydratases in this study, except the ones from S. solfataricus (SsDHAD) were produced using E. coli BL21 (DE3) in Terrific Broth (TB) and purified as described previously.
SsDHAD and its variants were produced using E. coli BL21 (DE3) in Autoinduction media and purified as well as activated as described previously.
Site‐directed or saturation mutagenesis experiments were performed using either QuikChange, two‐step PCR, or overlap extension PCR. More details are described in the Supporting Information.Enzyme activity measurements: Kinetic parameters of the enzymes were measured using HPLC as described previously.
Assays with wild type and variant forms of PuDHT were performed at 30 °C, while wild type and variant forms of FtDHAD and SsDHAD were assayed at 50 °C. More details are described in the Supporting Information.Molecular modelling: Homology models were produced based on the templates with the highest sequence identity. The homology models were generated with Modeller and the resulting model with the best DOPE score was selected for further use. Molecular Mechanics parameters were generated for the [2Fe−2S] cluster and the deprotonated serine. Follow‐up MD simulations were conducted with AMBER. Molecular docking simulations were performed with AutoDock, and in silico mutations were performed with PyMOL using Dunbrack's rotamer library. More details are described in the Supporting Information.Supporting information: Supporting experimental and computational methodology, multiple sequence alignment and structural representations of the homology models and a conversion table between sequence and alignment numbers. Activity landscape of the saturation library of alignment position 73 and C‐terminal residue, structural properties of the binding sites and substrate binding in DHAD (variants). Docked poses of D‐xylonate and D‐ribonate in DHAD. Activity data for produced DHAD (variants) and the primers and auxiliary enzymes used in this study.Accession codes of enzymes discussed in this study (UniProtKB); PuDHT: A0A4R3LQ44, FtDHAD: A0A1I2J0Y3, SsDHAD: Q97UB2, RlArDT: B5ZZ34, MtDHAD: A0A0E8UWV6, AtDHAD: Q9LIR4, CcXylDHT: Q9A9Z2.
Conflict of interest
The authors declare no conflict of interest.As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.Supporting InformationClick here for additional data file.
Authors: Ghader Bashiri; Tyler L Grove; Subray S Hegde; Thomas Lagautriere; Gary J Gerfen; Steven C Almo; Christopher J Squire; John S Blanchard; Edward N Baker Journal: J Biol Chem Date: 2019-07-16 Impact factor: 5.157
Authors: Peilan Zhang; Brian S MacTavish; Guang Yang; Manyun Chen; Jaehyeok Roh; Kevin R Newsome; Steven D Bruner; Yousong Ding Journal: ACS Chem Biol Date: 2020-08-12 Impact factor: 5.100
Authors: Tenuun Bayaraa; Jose Gaete; Samuel Sutiono; Julia Kurz; Thierry Lonhienne; Jeffrey R Harmer; Paul V Bernhardt; Volker Sieber; Luke Guddat; Gerhard Schenk Journal: Chemistry Date: 2022-06-16 Impact factor: 5.020