Literature DB >> 25222710

Insights into substrate and metal binding from the crystal structure of cyanobacterial aldehyde deformylating oxygenase with substrate bound.

Benjamin C Buer¹, Bishwajit Paul, Debasis Das, Jeanne A Stuckey, E Neil G Marsh.

Abstract

The nonheme diiron enzyme cyanobacterial aldehyde deformylating oxygenase, cADO, catalyzes the highly unusual deformylation of aliphatic aldehydes to alkanes and formate. We have determined crystal structures for the enzyme with a long-chain water-soluble aldehyde and medium-chain carboxylic acid bound to the active site. These structures delineate a hydrophobic channel that connects the solvent with the deeply buried active site and reveal a mode of substrate binding that is different from previously determined structures with long-chain fatty acids bound. The structures also identify a water channel leading to the active site that could facilitate the entry of protons required in the reaction. NMR studies examining 1-[(13)C]-octanal binding to cADO indicate that the enzyme binds the aldehyde form rather than the hydrated form. Lastly, the fortuitous cocrystallization of the metal-free form of the protein with aldehyde bound has revealed protein conformation changes that are involved in binding iron.

Entities: Chemical Disease Gene Mutation Species

Mesh：

Substances：
Metals
Oxygenases

Year: 2014 PMID： 25222710 PMCID： PMC4245163 DOI： 10.1021/cb500343j

Source DB: PubMed Journal: ACS Chem Biol ISSN： 1554-8929 Impact factor: 5.100

The biosynthesis of aliphatic hydrocarbons has recently garnered much interest as drop in replacements for conventional transportation fuels.[1−6] In Nature, various organisms such as plants,[7] insects,[8] birds,[9] and microbes[5] are known to produce long-chain alkanes. These alkanes are derived from fatty acyl-CoA esters in a two-step biosynthetic pathway that involves a fatty-acyl-CoA reductase (FAR) that first converts the fatty acyl-CoA ester to the corresponding aldehyde.[10−13] Next, an aldehyde decarbonylase (AD) cleaves the C1–C2 (aldehyde carbon−α-carbon) bond of the aldehyde to generate an alkane and, depending upon the organism, CO,[14,15] CO2,[8,16] or HCO2H.[17,18] Although alkanes are very simple molecules, their production from aldehydes is a chemically difficult transformation,[19] and the AD enzymes as a group remain very poorly understood.[20] This is due in large part to the fact that most of these enzymes are integral membrane proteins that have proved hard to purify and express in recombinant form. Recently, however, a soluble version of AD was discovered in cyanobacteria,[5] which is referred to as either cADO[21] (cyanobacterial aldehyde deformylating oxygenase) or in earlier reports as cAD (cyanobacterial aldehyde decarbonylase).[5,17,18,22,23] Although the physiological role of cADO in cyanobacteria remains unknown, the amenability of the enzyme has stimulated significant interest in understanding its mechanism and its use in engineering new pathways to generate alkane-based biofuels. cADO is a nonheme di-iron protein that catalyzes the conversion of aldehydes to alkanes and formate in a reaction that requires O2 and reducing equivalents, as shown in Scheme 1.[17,18,22,23]

Scheme 1

Deformylation Reaction of Aldehydes Catalyzed by cADO

Labeling studies have shown that the aldehyde proton is retained in formate[17,18] and that one of the oxygen atoms derives from molecular oxygen,[22] whereas the proton in the product alkane derives from the solvent.[17,18] Spectroscopic studies point to the initial formation of a diferric intermediate in the cADO catalyzed reaction.[24] Addition of a further electron to this complex is proposed to lead to its breakdown and scission of the C1–C2 bond. A radical mechanism for C1–C2 bond cleavage is supported by the observed ring-opening of cyclopropyl aldehydes and oxiranyl aldehydes designed to act as “radical clocks” during deformylation by cADO.[25,26] The crystal structure of the enzyme reveals it to possess an α-helical structure with the di-iron center housed in a ferritin-like four-helix bundle at the core of the protein.[5,27] The iron atoms are coordinated by two histidine and four carboxylate residues, as is seen in other di-iron oxygenases, such as methane monoxygenase, ribonucleotide reductase, stearoyl-ACP-Δ9-desaturase, and toluene-4-monooxygenase.[28−34] All cADO structures solved so far include an aliphatic fatty acid that is bound in a hydrophobic channel, with the carboxylate group providing a bridging ligand to the two iron atoms.[27,35] The fatty acid presumably mimics the binding of the aldehyde substrate; however, it is unclear how the substrate gains access to the active site because the hydrophobic channel is completely enclosed by the protein. Here, we address the question of how substrates and metal ions gain access to the active site of cADO. We have solved crystal structures of cADO with a long-chain, water-soluble aldehyde bound and a medium-chain carboxylic bound that reveal a new mode of substrate binding and protein conformational changes involved in binding iron. We have also undertaken NMR studies to establish whether the enzyme binds the aldehyde or its hydrated form, which predominates in solution.

Results and Discussion

NMR Studies of Substrate Binding to cADO

In aqueous solution, aldehydes exist in equilibrium with the hydrated (gem-diol) forms, which predominate. Mechanistic studies have assumed that it is the aldehyde form that is reactive toward deformylation. However, it is unclear which form initially binds to the enzyme, as it would be plausible for the enzyme to bind the hydrated aldehyde and for dehydration to occur at the active site; this could be catalyzed by the Lewis acidic diferric form of the enzyme. The gem-diol and carbonyl forms of the aldehyde carbon resonate at very different frequencies in the 13C NMR spectrum, making NMR an excellent technique to probe which form binds to the enzyme. We examined the binding of 1-[13C]-octanal to cADO (Figure 1). These experiments were conducted on the diferric form of the enzyme in the absence of an auxiliary reducing system so that turnover could not occur. 1-[13C]-octanal exhibits peaks at 208 ppm due to the carbonyl form and at 92 ppm due to the diol form. Addition of increasing concentrations of cADO to a 1 mM solution of octanal in assay buffer resulted in a progressive reduction in intensity for the peak at 208 ppm. At 125 μM cADO, the 208 ppm peak could no longer be detected, whereas the peak at 92 ppm retained significant intensity. The loss of the 208 peak indicates that the enzyme binds the aldehyde form. The peak is lost due to broadening caused by chemical exchange with the enzyme-bound form on the NMR time scale; some broadening may also be due to paramagnetic relaxation associated with the iron center. Note that the unbound aldehyde remains in chemical equilibrium with the hydrated form but this equilibrium is slow on the NMR time scale so that the peak at 92 ppm is not broadened. Further increasing the enzyme concentration to be equimolar with the substrate resulted in almost complete loss of the 92 ppm peak as well. This is due to the re-establishment of the aldehyde-hydrate equilibrium in solution as more enzyme is added and available to bind the aldehyde form. Eventually all the substrate is converted to the aldehyde form and becomes bound to the enzyme.

Figure 1

Binding of 1-[13C]-octanal to cADO monitored by 13C NMR. Octanal (1 mM) was titrated with increasing concentrations of cADO and the peaks correspond to the carbonyl (208 ppm) and hydrate forms (92 ppm) were monitored. At 125 μM cADO, the carbonyl peak is completely lost indicating that the enzyme selectively binds this form.

Synthesis and Activity of a Water-Soluble Long-Chain Aldehyde Substrate

Previously solved crystal structures of cADO all feature a long-chain fatty acid bound that was presumed to occupy the binding site of the aldehyde substrate.[5,27,35] Recent studies have identified a mixture of palmitic, stearic, and oleic acids as being bound,[35] which most likely are derived from E. coli during purification. Furthermore, it appears that the fatty acids are necessary for crystallization as even extensively lipid-depleted enzyme crystallizes with fatty acids bound. We aimed to obtain a structure of the enzyme with an aldehyde bound, which would be more mechanistically informative. Preliminary screens with short-chain aldehydes in the crystallization buffer failed to obtained crystals of cADO and the cocrystallization of the enzyme with long-chain aldehydes, such as octadecanal, proved infeasible due to the very low solubility of these compounds. Therefore, we synthesized a water-soluble mimic of icosanal, 11-(2-(2-ethoxyethoxy)ethoxy)undecanal (1) that incorporates three oxygen atoms in the carbon chain (Scheme 2). This compound was straightforwardly synthesized by coupling silyl-protected 11-bromoundecanol to 2-(2-ethoxyethoxy)ethanol followed by deprotection and oxidation of the alcohol to the aldehyde, as described in the Supporting Information.

Scheme 2

Structures of 11-(2-(2-Ethoxyethoxy)ethoxy)undecanal, 1, and trans-2-Nonylcyclopropane-1-carboxylic Acid, 2

Compound 1 was soluble at millimolar concentrations under the buffer conditions typically used to assay cADO.[18,25] We compared the activity of cADO in catalyzing the deformylation of compound 1 (500 μM) and octadecanal (500 μM + 4% DMSO to improve solubility) using previously described assay conditions with NADH and PMS as the auxiliary reducing system.[25,26] cADO deformylated 1 to produce 1-(2-(2-ethoxyethoxy)ethoxy)decane at a rate 0.049 ± 0.002 min–1, a rate only ∼2-fold slower than the deformylation of octadecanal to heptadecane under similar conditions.

Crystallization of cADO with Substrate Analogs Bound

During the course of this investigation, we solved the structure of cADO with the following substrate analogs bound: stearate (i.e., enzyme as isolated from E. coli), PDB ID 4TW3; with compound 1 bound, PDB ID 4PGK; and with trans-2-nonylcyclopropane-1-carboxylate, 2, bound (Scheme 2), PDB ID 4PG1. We also solved the structure of a mutant enzyme, cADO-L194A, with 1 bound, PDB ID 4PGI. These structures are discussed individually in detail in the following. In each case, the protein crystallized in the space group, P43212, and the structures were solved by molecular replacement to resolutions ranging from 1.6 to 2.2 Å, with refinement statistics given in Table 1.

Table 1

Data Collection and Refinement Statistics for Crystal Structures

data set	cADO (stearate)	cADO-(1)	cADOL194A-(1)	cADO-(2)
PDB ID	4TW3	4PGK	4PGI	4PG1
space group	P4₃2₁2	P4₃2₁2	P4₃2₁2	P4₃2₁2
unit cell	a = b = 77.21; c = 116.57	a = b = 77.34; c = 116.96	a = b = 77.07; c = 117.26	a = b = 77.12; c = 116.20
unit cell	α = β = γ = 90	α = β = γ = 90	α = β = γ = 90	α = β = γ = 90
wavelength (Å)	0.97872	0.97856	0.97872	1.0781
resolution (Å)	1.60 (1.60–1.63)	2.17 (2.17–2.21)	2.08 (2.08–2.12)	1.70 (1.70–1.73)
R_sym (%)	6.6 (71.9)	8.5 (92.3)	8.7 (57.0)	7.3 (86.4)
{I/σI}	20 (3)	20 (5)	20 (3)	20 (3)
completeness (%)	100.0 (100.0)	100.0 (100.0)	100.0 (100.0)	99.9 (100.0)
redundancy	14.4 (14.3)	14.4 (14.7)	14.3 (14.6)	14.2 (14.3)
Refinement Statistics
resolution (Å)	1.60 (38.61–1.60)	2.17 (22.13–2.17)	2.08 (22.48–2.08)	1.70 (36.60–1.70)
R-factor (%)	17.3	19.4	19.9	18.6
R_free (%)	18.9	21.3	22.8	20.5
protein atoms	1815	1907	1886	1783
water molecules	306	227	148	204
unique reflections rmsd	47 171	19 412	21 862	39 033
bonds	0.010	0.010	0.010	0.010
angles	0.89	0.97	0.91	0.85
MolProbity score	0.95	1.19	1.26	0.64
clash score	1.91	2.37	4.47	0.28

Structure of cADO with Stearate Bound

To facilitate our analysis of the enzyme, we redetermined the structure of cADO with stearate bound at 1.60 Å resolution (PDB ID 4TW3). The enzyme comprises 8 α-helices (Figure 2A), of which helices 1 (blue), 2 (cyan), 4 (green), and 5 (yellow-green) make up the canonical antiparallel 4-helix bundle that provides the coordinating protein ligands to the di-iron center. Helix 3 (cyan–green), which is kinked in the middle, crosses over helix 1 before connecting through a short loop to helix 4. Helices 6 (yellow), 7 (orange), and 8 (red) form a subdomain in which helices 6 and 7 pack against helices 1, 3, and 4, whereas the shorter helix 8 packs at an angle of ∼45 across the interface of helices 6 and 7. Close inspection of the structure revealed a shallow hydrophobic cavity on the surface composed of side chains from helices 6, 7, and 8. This region of the surface is notably devoid of crystallographically resolved water molecules, as illustrated in Figure 2B. Although crystallographic symmetry places identical hydrophobic patches from neighboring protein molecules in close proximity, there are no intermolecular van der Waals contacts between hydrophobic residues of different proteins and the protein in solution is a monomer.

Figure 2

Structure of cADO with stearate bound (PDB ID 4TW3). (A) Ribbon diagram of cADO with helices colored from the N-terminus: 1 (blue), 2 (cyan), 3 (cyan-green), 4 (green), 5 (yellow–green), 6 (yellow), 7 (orange), and 8 (red); the iron atoms are brown and stearate is shown in gray. (B) The region containing the surface of helices 6, 7, and 8 shows a narrow hydrophobic cavity that is devoid of crystallographically resolved water molecules; L194 is shown in stick form at the center of the hydrophobic cavity.

At the center of the hydrophobic cavity is L194, which is located on helix 6 at the interface between the aqueous protein exterior and hydrophobic protein interior. Interestingly, this residue appeared highly mobile as evidenced by the poor electron density and high temperature factors of L194 side chain carbon atoms. This degree of side chain mobility is not observed for surrounding side chains in the hydrophobic patch or buried hydrophobic side chains elsewhere in the protein. These observations suggested that L194 might serve as a gateway for substrate entry. Therefore, to assess the extent to which L194 might constrict access to the substrate-binding channel, we mutated this residue to alanine. However, this mutant, referred to as cADO-L194A, had kinetic properties very similar to the wild-type enzyme (data not shown) suggesting that L194 does not play a kinetically significant role in limiting substrate access to the active site. Structure of cADO with stearate bound (PDB ID 4TW3). (A) Ribbon diagram of cADO with helices colored from the N-terminus: 1 (blue), 2 (cyan), 3 (cyan-green), 4 (green), 5 (yellow–green), 6 (yellow), 7 (orange), and 8 (red); the iron atoms are brown and stearate is shown in gray. (B) The region containing the surface of helices 6, 7, and 8 shows a narrow hydrophobic cavity that is devoid of crystallographically resolved water molecules; L194 is shown in stick form at the center of the hydrophobic cavity.

Structure of cADO with 1 Bound

We were next able to solve the structures of cADO and cADO-L194A with 1 bound at 2.17 and 2.08 Å resolution, respectively (PDB IDs 4PGK and 4PGI, respectively). These structures were essentially identical, differing only in minor alterations to side chain conformations of residues in the immediate vicinity of the mutation. However, because the L194A mutant structure is at higher resolution, we have used this structure in our discussion of 1 binding to cADO in complex as described in the following. The initial inspection of the electron density 2F0 – Fc map contoured at 1σ (Figure 3 and Supporting Information Figure S10) for the complex of cADO-L194A with 1 revealed several interesting features. First, where the substrate was expected to bind, there was T-shaped region of electron density extending from the metal center with one fork of the T occupying the cavity previously observed to bind stearic acid. The other fork exited the protein structure adjacent to L194 (A194 in the mutant cADO), supporting the view that this is the substrate entry point. The overall shape and the relatively poor definition of this electron density made it apparent that 1 must bind to the enzyme in two quite different, and somewhat mobile, conformations. A second area of ambiguity concerned the electron density of helix 5, which appeared partially unfolded between residues 154 and 165. Notably, this region encompasses E157 and H160, which supply ligands to Fe2. Lastly, the electron density for the metal ions in the structure refined to an occupancy of only ∼60%, which is considerably lower than that of any fatty acid-containing cADO structure.

Figure 3

Electron density 2F0 – Fc map contoured at 1σ for cADO-L194A (PDB ID 4PGI) complexed with 1. The substrate can be modeled in two conformations to fit the T-shaped region of electron density, suggesting that at least two conformations of the substrate binding are possible. The electron density map also indicates that the region encompassing E157 and H160 also exists in two conformations: one in which these residues coordinate the metal; the other in which helix 5 is partially unfolded. The carboxylate group of E157 is disordered in both conformations and has therefore not been modeled in the structure. Taken together, these observations led to the conclusion that (at least) two structural forms of cADO were represented within the crystal. Although there is some ambiguity associated with interpreting electron density maps in which residues and ligands appear to adopt more than one conformation, we consider that, as discussed below, the most reasonable interpretation is that one form represents the metal-containing protein in which the substrate enters the protein at L194, whereas the other represents the metal-free protein in which helix 5 is partially unfolded and the substrate binds in a similar conformation to stearate. These conformations are shown in Figure 4. To support this conclusion, each conformation was fitted and refined independently. The resulting electron density maps produced density for the alternate conformation in both the Fo–Fc maps (contoured at 3σ) and in the 2Fo–Fc maps (contoured at 1σ). A figure showing the alternate positions of His160 and Glu157 overlaid with a map calculated from the refined B conformation of the helix is available in the Supporting Information (Figure S10). The low metal content of the crystals is not surprising because, as purified from E. coli, only 30–35% of the active sites contain metal, which we have previously characterized as being mainly iron and zinc,[18] and no additional divalent metal ions were present in the crystallization buffer.

Figure 4

Structure of metal-bound (purple) and metal-free (yellow) cADO-L194A in complex with 1 (PDB ID 4PGI). (A) Overlay of the two conformations of cADO; metal-bound form in purple; metal-free form in yellow, in which helix 5 is partially unfolded between the residues 154 and 165. (B) Details of the active site geometry for the metal-bound form. (C) Detail of the substrate entry channel showing conformational changes in L194 and F130 with the structure of the stearate-bound form of cADO superimposed for comparison. (D) Details of the active site geometry in the metal-free form in which the substrate intrudes into the metal-binding site. The carboxylate group of E157 is disordered in both conformations and has therefore not been modeled in the structure.

Structure of Metal-Bound cADO

In this structure, the coordinating ligands surrounding the two metal atoms adopt a similar geometry to that seen in the structures of the cADO/stearate complex and other dinuclear metalloenzymes that possess the canonical ‘4-carboxylate, 2-histidine’ binding motif. As shown in Figure 4B, for Fe1, the carboxylate of E45 from helix 1 provides a bidentate ligand, whereas E73 and H76 from helix 2 provide monodentate ligands. Similarly, for Fe2, E128 from helix 4 provides a bidentate ligand, and H160 from helix 5 provides a monodentate ligand. The carboxylate of E157, which should provide the final protein ligand, was not resolved in the electron density maps. This suggests that it is conformationally labile and because of this we do not include it in the model. The fifth coordination site to Fe1 is occupied by water, whereas it is the aldehyde oxygen that supplies the fifth ligand to Fe2, with an Fe–O distance of 2.5 Å. In the metal-bound form of cADO, there is insufficient space available within the hydrophobic cavity for compound 1 to bind in the same conformation as stearate. We therefore modeled the aldehyde chain to fit the electron density that exits the protein between helices 6 and 7 (Figure 4B). To accommodate 1, the side chain of L194 is displaced by ∼2 Å, (measured with respect to Cγ) compared to the structure of the stearate-binding enzyme. V197 on the consecutive helical turn accommodates the rotomeric shift of L194. The phenyl ring of F130 undergoes a χ2 rotation of 50 degrees so the face of the ring now lines the channel that terminates at the protein exterior as illustrated in Figure 4C. Attempts to crystallize the enzyme in the fully metalated form by including Fe2+ or Zn2+ ions in the crystallization buffer together with compound 1 were unsuccessful.

Structure of Metal-Free cADO

The cocrystallization of the metal-free form of the enzyme was unexpected and has provided insight into the pathway by which metals gain access to the active site of cADO. Several attempts to crystallize apo-cADO in the presence of 1 failed to yield crystals, presumably due to the conformational lability of the protein. Therefore, we conclude that the fortuitous presence of an approximately 50:50 mixture of apo- and metallo-enzyme, as isolated from E. coli, allowed the apoenzyme to cocrystallize with the metallo-enzyme and thereby be trapped for crystallographic analysis. In the apo enzyme, residues 154–165, which in the metalated enzyme are contained within helix 5 directly adjacent to the diiron center, unfold to form a solvent-exposed loop (Figure 4D). This region of the protein includes E157 and H160, which coordinate Fe2. A less dramatic change in the conformation of the other ligand to Fe2, E128, is also seen; small changes in the positions of Y52 and Q123 also occur, which adopt alternate hydrogen bonding arrangements within the protein. In contrast, the positions of the ligands to Fe1 are very little changed; interactions with surrounding residues appear to maintain their metal-binding conformations in the absence of metal. The conformation of helix 5 does not appear to be influenced by crystal packing effects in either conformation. None of the residues in helix 5 make contacts with other symmetry related protein molecules within the crystal lattice. The side-chain of the closest residue Tyr163 is makes the closest approach, 4.4 Å. Helical residues N-terminal to Y163 are completely solvent exposed whereas residues C-terminal to Y163 are greater than 5.3 Å from any symmetry related molecule. Helix 5 is one of the two more solvent exposed helices housing the di-iron center and its partial unfolding suggests a mechanism by which metal ions can enter the active site. The metal-free structures of other structurally related di-iron enzymes have been described previously, notably Class I ribonucleotide reductase R2 subunit (RNR R2)[36] and methane monooxygenase hydroxylase subunit (MMOH).[37] In both these enzymes, the tertiary structure remains intact and the metal-chelating ligands adopt conformations that are close to their metal-bound conformations. In contrast, in cADO, one metal binding site remains intact, but the localized unfolding of helix 5 between residues 154 and 165 completely dissembles the second metal-binding site. Interestingly, the corresponding helices in RNR R2 and MMOH are similarly the more solvent exposed of the helices housing the di-iron center.[37,38] cADO is smaller than either RNR R2 or MMOH and so may require metal coordination to stabilize the folded structure, whereas for RNR and MMO more extensive tertiary interactions with other helices in the protein presumably stabilize the metal-free structure. This suggests that, although RNR R2 and MMOH remain folded in the absence of metals, the same localized unfolding may provide a mechanism for metalation in these and other enzymes of this structural class. In the metal-free structure, the electron density due to the substrate, 1, extends into the region that would be occupied by Fe2 (Figures 4D and 3). It is possible that the intrusion of the substrate into the metal-binding site may have helped stabilize the open conformation of helix 5, allowing this conformation to be trapped in the crystal. The additional space available to the substrate in the metal-free structure, allows the tail end of 1 to be accommodated completely within the protein in a conformation very similar to that seen for cADO with stearate bound. Although we chose to model the substrate in two conformations associated with metal-bound and metal-free forms of cADO, we do not rule out the possibility that the substrate may bind in other orientations. The conformational mobility of the long aliphatic chain, combined with the lack of any hydrogen-bonding residues in the substrate-binding channel that might serve as “anchor points” for the substrate, suggest that many energetically similar configurations may be possible. In particular, it is possible to model the substrate in a nonproductive binding mode in which the aliphatic chain stretches across the top of the T-shaped channel, with the aldehyde carbonyl extended into the solvent rather than binding to the metal.

Structure of cADO with trans-2-Nonylcyclopropane-1-Carboxylate (2) Bound

Because the electron density map associated with 1 bound to cADO left some ambiguity as to its mode of binding, we also attempted to cocrystallize cADO with other aldehydes. We were able to obtain highly diffracting crystals in the presence of trans-2-nonylcyclopropane-1-carbaldehyde, which allowed the structure to be solved at 1.7 Å resolution, PDB ID 4PG1. During refinement it became apparent that the aldehyde carbon had oxidized to the carboxylic acid, to give compound 2 (Scheme 2), a finding confirmed by chromatographic analysis of the mother liquor. This is not especially surprising given that aldehydes are sensitive to oxidation by oxygen in the presence of metal ions, and no attempt was made to rigorously exclude oxygen from the crystallization trays. Nevertheless, the structure is instructive because it reveals two molecules of 2 bound to the protein (Figure 5), which clearly delineate the entry point for the substrate. The high quality of the electron density map allowed the cyclopropyl ring to be clearly resolved, removing any ambiguity about the identity and orientation of the ligands. Interestingly, although the aldehyde of 2 was synthesized in racemic form, it is the 1R,2R enantiomer of 2 that cocrystallizes with the enzyme in both positions.

Figure 5

Structure of cADO in complex with cyclopropyl carboxylic acid, 2 (PDB ID 4PG1). (A) The structure of cADO shows two molecules of 2 bound to the enzyme; one molecule directly coordinates to the di-iron center by the carboxyl group, other molecule assumes a “reverse” orientation where the carboxyl group extends into the solvent. (B) Overlay of structures of cADO with stearate bound (tan) and 2 bound (orange) illustrating large scale movement of helix 8 (for clarity the stearate molecule has been omitted from the figure).

As shown in Figure 5, one molecule of 2 is coordinated to the metal center through the carboxylate group, the other molecule binds to the protein in the substrate entry channel in a “reverse” orientation and extends into the solvent, with the carboxylate group making a hydrogen bond with the guanidine group of R191 and last three carbon atoms of 2 contacting L194. As seen with 1, access to the protein interior is provided through a displacement of the L194 side chain to accommodate the aliphatic tail of 2. The binding of 2 also results in a large-scale movement of helix 8 (Figure 5B), which pivots about its N-terminus resulting in a displacement of the C-terminus of the helix of ∼4.3 Å with respect to the stearate-bound structure. This movement is accompanied by the formation of additional hydrophobic contacts between 2 and residues in helix 8. Structure of cADO in complex with cyclopropyl carboxylic acid, 2 (PDB ID 4PG1). (A) The structure of cADO shows two molecules of 2 bound to the enzyme; one molecule directly coordinates to the di-iron center by the carboxyl group, other molecule assumes a “reverse” orientation where the carboxyl group extends into the solvent. (B) Overlay of structures of cADO with stearate bound (tan) and 2 bound (orange) illustrating large scale movement of helix 8 (for clarity the stearate molecule has been omitted from the figure). The structure of cADO with 2 bound also provided insight into another aspect of the reaction—the pathway by which the proton that is introduced into the product alkane is shuttled between the solvent and the interior of the protein. A chain of water molecules is seen in the structure that enters the protein between helices 1 and 3 close to the metal center (Figure 6). These waters are stabilized by hydrogen bonds to Q108, E47, N51, and Q123 and could potentially form a proton relay to the solvent. The three closest potential proton donors, a water molecule, the side-chain of Q123 and an iron-bound water/hydroxide, are each about 4.8 Å from the α-carbon of 2. Although this is too far for them to act as proton donors, one may imagine that in the catalytically active enzyme relatively small movements in the active site may bring any of these donors within range. Furthermore, recently published solvent isotope effect measurements have provided evidence that the proton is most likely transferred from the iron-bound water molecule.[39]

Figure 6

Water molecules within the structure of cADO with 2 bound (PDB ID 4PG1). A chain of water molecules were resolved, which enter the enzyme in between helices 1 and 3 and are hydrogen bonded to Q108, E47, N51, and Q123. The channel potentially provides a conduit to supply the proton necessary for alkane formation. The structures of cADO with 1 and 2 bound have illuminated the likely entry point for the substrate, which was not apparent from previous structures with stearate or palmitate bound. The structures also demonstrate that for long-chain aldehydes several modes of binding are possible and that for short-chain aldehydes more than one molecule can bind at a time. These observations suggest that nonproductive modes of substrate binding are possible and may contribute to the low rates of turnover measured for cADO. For example, substrate inhibition might occur with short aldehydes if a second substrate molecule is bound in the channel preventing product release. Likewise, if a long-chain aldehyde bound across the top of the ‘T’-shaped substrate channel, instead of in the channel leading to the metal center, this would effectively inhibit the enzyme complex. This suggests that protein engineering could be used to occlude the “dead-end” branch of the substrate-binding channel and possibly improve catalytic efficiency. A final intriguing observation is that the entry point for the aldehyde is in the center of a hydrophobic depression on the enzyme’s surface that is flanked by several positively charged residues. This suggests that cADO may interact with either another protein for the delivery of these very hydrophobic molecules, or possibly the cell membrane. No interaction partners for cADO have been identified and the enzyme does not appear to be membrane associated; however, it seems unlikely that cytosolic long-chain aldehydes would be the physiological substrates for cADO because of their extremely low solubilities. Indeed, it is unclear why some species of cyanobacteria biosynthesize alkanes, and it remains possible that the physiological substrates for cADO are other hydrophobic molecules.

Conclusion

The deformylation reaction catalyzed by cADO is chemically complex and unlike that of other nonheme iron oxygenases in that O2 is completely reduced to water during the catalytic cycle. This reaction requires that, in addition to the aldehyde substrate and O2, further electrons and a proton be delivered to the active site during the reaction.[17,18,22,23] cADO deformylates aldehydes extremely slowly,[17,18,21,23,35] with kcat for even the fastest substrates not exceeding ∼1 min–1. We had considered that the improved solubility of 1 and its product might result in faster turnover; however, the sluggish rate at which 1 is deformylated indicates that the solubility of the substrate is not a limiting factor in the rate of turnover. A key step in understanding both the mechanism of the enzyme and its possible function in vivo is to understand how the substrates gain access to the active site. The NMR studies presented here indicate that the enzyme selectively binds the aldehyde form from solution, rather than the hydrate. The selectivity may arise because the hydrated form is too hydrophilic to enter the hydrophobic substrate-binding channel. The crystal structures of cADO with substrate analogs bound have identified local protein conformational changes that allow iron to enter the active site of cADO and a pathway by which the proton required in the reaction may enter the active site. Notably, the substrate-binding channel identified in these structures is different from that inferred for previous structures containing fatty acids. These observations should facilitate protein engineering to improve the efficiency of the enzyme.

Methods

Materials

Phenazine methosulfate (PMS) and ferrous ammonium sulfate were from Sigma-Aldrich; NADH was obtained from Acros Organics; 1-[13C]-octanal was obtained from Cambridge Isotope Laboratories.

Synthesis of 11-(2-(2-Ethoxyethoxy)ethoxy)undecanal, 1, and trans-2-Nonylcyclopropane-1-Carbaldehyde (Aldehyde of Compound 2)

Details of the synthesis and characterization of compound 1 and the aldehyde of compound 2 are described in the Supporting Information.

Enzyme Purification and Reconstitution

The purification of recombinant P marinus cADO from Escherichia coli was performed as described previously.[18,23] Activity assays of cADO with compound 1 were performed as described previously.[25,26]

Preparation of Samples for NMR

NMR samples of the diferric form of cADO were prepared in 50 mM potassium phosphate buffer (pH 7.2) and 20 mM CHAPS (500 μL volume). 1-[13C]-octanal was dissolved in d6-DMSO for a stock concentration of 100 mM and added to the proteins solution to give the desired final concentration. Spectra were acquired using a Varian VNMRS 700 MHz spectrometer with a 5 mm PFG AutoX broadband probe at 4 °C using 2000 scans.

Crystallization and Structure Determination

For crystallization of cADO or cADOL194A with 1, the compound was added directly to a solution of 396 μM cADO in 50 mM HEPES, pH 7.4 for a final concentration of 10 mM 1. Samples were incubated for 16 h at 4 °C immediately prior to crystallization screening. The protein samples used for these experiments did not undergo removal/replacement of endogenous metals and cADO retained the metal content as isolated from E. coli. The crystals of cADO and cADOL194A with 1, from which the highest resolution data were collected and were crystallized against 20% PEG 3350, 0.3 M NaCl and 20% PEG 3350, 0.2 M sodium tartrate, respectively, with drops containing equal volumes of protein sample and precipitant. For crystallization of cADO with 2, the sample was prepared by adding 2.0 equiv of Fe(II) in the form of ferrous ammonium sulfate to 400 μM apo cADO in 50 mM HEPES buffer, pH 7.4 with 2% glycerol under a microaerobic environment. Protein solutions were oxidized for 1 h under atmospheric conditions at 4 °C where they acquired a slight yellow color and metal oxidation was verified by UV absorbance at 360 nm, which is characteristic of a iron(III)-oxo species. Stock solutions of 100 mM of the aldehyde of 2 in DMSO were prepared and added to diferric cADO to give final concentrations of 2 mM or 4 mM of the aldehyde of 2. Diferric cADO solutions containing of the aldehyde of 2 were incubated for 16 h at 4 °C immediately prior to crystallization. The crystals of cADO with 2 mM and 4 mM 2 from which the highest resolution data were collected and were crystallized in 35% PEG 400, 0.1 M Tris pH 8.5, and 30% PEG 550MME, 0.1 M Tris, 0.1 M ammonium acetate pH 8.5, respectively. Crystals were cryoprotected in liquid N2 in their mother liquor for data collection. Data were collected at the Advanced Photon Source (APS) (LS-CAT Beamlines 21-D, 21-F, and 21-G) at Argonne National Laboratory on a MarCCD (Mar USA, Evanston, IL) at wavelengths of 1.0781, 0.97872, and 0.97857 Å, respectively, at −180 °C. Data were processed and scaled with HKL2000.[40] All proteins for this study crystallized in space group P43212 with unit cell parameters as listed in Supporting Information Table S1. All proteins contain a monomer in the asymmetric unit. Phases were initially determined by molecular replacement with a native structure of cADO having all ligands and water molecules removed (PDB ID: 2OC5) using Phaser[41] in the CCP4i program suite.[42] Smiles for the compounds were created using Molinspiration WebME Editor 3.81 (Dr. Peter Ertl, http://www.molinspiration/docu/webme/index.html) while restraints were produced using mogul=qm option in grade.[43,44] Protein models were refined by rigid body refinement and restrained refinement using Buster.[44] Side chains were visualized and adjusted using Coot[45] with 2F0–Fc and F0–Fc electron density maps from Buster. Structures were validated using Molprobity.[46] All residues are in allowed regions of the Ramachandran plot. Data refinement statistics are given in Table 1.

40 in total

1. Dioxygen Activation by Enzymes Containing Binuclear Non-Heme Iron Clusters.

Authors: Bradley J. Wallar; John D. Lipscomb
Journal: Chem Rev Date: 1996-11-07 Impact factor: 60.622

2. PRODRG: a tool for high-throughput crystallography of protein-ligand complexes.

Authors: Alexander W Schüttelkopf; Daan M F van Aalten
Journal: Acta Crystallogr D Biol Crystallogr Date: 2004-07-21

Review 3. Manufacturing molecules through metabolic engineering.

Authors: Jay D Keasling
Journal: Science Date: 2010-12-03 Impact factor: 47.728

4. Solubilization and purification of aldehyde-generating fatty acyl-CoA reductase from green alga Botryococcus braunii.

Authors: X Wang; P E Kolattukudy
Journal: FEBS Lett Date: 1995-08-14 Impact factor: 4.124

5. Molecular characterization of the CER1 gene of arabidopsis involved in epicuticular wax biosynthesis and pollen fertility.

Authors: M G Aarts; C J Keijzer; W J Stiekema; A Pereira
Journal: Plant Cell Date: 1995-12 Impact factor: 11.277

6. Conversion of fatty aldehydes to alka(e)nes and formate by a cyanobacterial aldehyde decarbonylase: cryptic redox by an unusual dimetal oxygenase.

Authors: Ning Li; Hanne Nørgaard; Douglas M Warui; Squire J Booker; Carsten Krebs; J Martin Bollinger
Journal: J Am Chem Soc Date: 2011-04-04 Impact factor: 15.419

7. Unusual mechanism of hydrocarbon formation in the housefly: cytochrome P450 converts aldehyde to the sex pheromone component (Z)-9-tricosene and CO2.

Authors: J R Reed; D Vanderwel; S Choi; J G Pomonis; R C Reitz; G J Blomquist
Journal: Proc Natl Acad Sci U S A Date: 1994-10-11 Impact factor: 11.205

8. MolProbity: all-atom structure validation for macromolecular crystallography.

Authors: Vincent B Chen; W Bryan Arendall; Jeffrey J Headd; Daniel A Keedy; Robert M Immormino; Gary J Kapral; Laura W Murray; Jane S Richardson; David C Richardson
Journal: Acta Crystallogr D Biol Crystallogr Date: 2009-12-21

9. Production of propane and other short-chain alkanes by structure-based engineering of ligand specificity in aldehyde-deformylating oxygenase.

Authors: Basile Khara; Navya Menon; Colin Levy; David Mansell; Debasis Das; E Neil G Marsh; David Leys; Nigel S Scrutton
Journal: Chembiochem Date: 2013-06-11 Impact factor: 3.164

10. Phaser crystallographic software.

Authors: Airlie J McCoy; Ralf W Grosse-Kunstleve; Paul D Adams; Martyn D Winn; Laurent C Storoni; Randy J Read
Journal: J Appl Crystallogr Date: 2007-07-13 Impact factor: 3.304

8 in total

Review 1. Divergent mechanisms of iron-containing enzymes for hydrocarbon biosynthesis.

Authors: Courtney E Wise; Job L Grant; Jose A Amaya; Steven C Ratigan; Chun H Hsieh; Olivia M Manley; Thomas M Makris
Journal: J Biol Inorg Chem Date: 2016-12-21 Impact factor: 3.358

2. Ferritin-Like Proteins: A Conserved Core for a Myriad of Enzyme Complexes.

Authors: Rahul Banerjee; Vivek Srinivas; Hugo Lebrette
Journal: Subcell Biochem Date: 2022

3. A consensus-guided approach yields a heat-stable alkane-producing enzyme and identifies residues promoting thermostability.

Authors: Tabinda Shakeel; Mayank Gupta; Zia Fatma; Rakesh Kumar; Raubins Kumar; Rahul Singh; Medha Sharma; Dhananjay Jade; Dinesh Gupta; Tasneem Fatma; Syed Shams Yazdani
Journal: J Biol Chem Date: 2018-04-09 Impact factor: 5.157

4. Structure-oriented substrate specificity engineering of aldehyde-deformylating oxygenase towards aldehydes carbon chain length.

Authors: Luyao Bao; Jian-Jun Li; Chenjun Jia; Mei Li; Xuefeng Lu
Journal: Biotechnol Biofuels Date: 2016-08-31 Impact factor: 6.040

5. Identification of residues important for the activity of aldehyde-deformylating oxygenase through investigation into the structure-activity relationship.

Authors: Qing Wang; Luyao Bao; Chenjun Jia; Mei Li; Jian-Jun Li; Xuefeng Lu
Journal: BMC Biotechnol Date: 2017-03-16 Impact factor: 2.563

6. Comparison of orthologous cyanobacterial aldehyde deformylating oxygenases in the production of volatile C3-C7 alkanes in engineered E. coli.

Authors: Pekka Patrikainen; Veronica Carbonell; Kati Thiel; Eva-Mari Aro; Pauli Kallio
Journal: Metab Eng Commun Date: 2017-05-08

7. Identification of non-conserved residues essential for improving the hydrocarbon-producing activity of cyanobacterial aldehyde-deformylating oxygenase.

Authors: Hisashi Kudo; Yuuki Hayashi; Munehito Arai
Journal: Biotechnol Biofuels Date: 2019-04-17 Impact factor: 6.040

8. Hydrocarbon Desaturation in Cyanobacterial Thylakoid Membranes Is Linked With Acclimation to Suboptimal Growth Temperatures.

Authors: Eerika Vuorio; Kati Thiel; Duncan Fitzpatrick; Tuomas Huokko; Jari Kämäräinen; Hariharan Dandapani; Eva-Mari Aro; Pauli Kallio
Journal: Front Microbiol Date: 2021-11-26 Impact factor: 5.640

8 in total