Oxalate is an abundant plant metabolite that serves diverse functions including pH homeostasis,[1] ion balance and regulation of calcium levels,[2] metal detoxification and tolerance,[1,3-5] defense against insects and herbivores,[6] tissue support,[7] nutrient sharing with bacterial symbionts,[8] and regulation of light distribution to chloroplasts in shade plants.[9] In plants, it is produced from glyoxylate, glycolate, oxaloacetate or ascorbate during photorespiration and the glyoxylate cycle.[10] Despite its beneficial functions, oxalate is also highly acidic and a strong chelator that may reduce the availability of essential ions. Consequently, its levels in plants need to be tightly regulated. This is achieved by mechanisms that control its synthesis, precipitation (predominantly into calcium-based crystals), and degradation.[2,10,11] The latter proceeds mainly by oxidation or by CoA-dependent decarboxylation,[12] in plants, and by direct decarboxylation, in bacteria and fungi.[13] Decarboxylation of oxalyl-CoA is catalyzed by the enzyme oxalyl-CoA decarboxylase (OXC; 4.1.1.8), which catalyzes the thiamine diphosphate-dependent conversion of oxalyl-CoA to formyl-CoA and carbon dioxide.[14] Formyl-CoA can then be further converted to formate and, subsequently, to carbon dioxide in reactions catalyzed by formyl-CoA hydrolase and formate dehydrogenase, respectively.[11] The initial step in this pathway is performed by the enzyme oxalyl-CoA synthetase (OCS).Oxalyl-CoA synthetase (EC 6.2.1.8) is an ATP-dependent enzyme that catalyzes the ligation of oxalate to CoA, forming oxalyl-CoA (Scheme 1a and b). It has been identified in plants such as Arabidopsis thaliana,[15]Medicago truncatula,[16]Vigna umbellate,[17]Oryza sativa[18] and Glycine soja[5] and in the yeast Saccharomyces cerevisiae,[19] where it plays a central role in oxalate catabolism. Its encoding gene, AAE3, is induced by oxalate and its deletion in dicots, such as Arabidopsis thaliana and Medicago truncatula, in which there is no known oxalate oxidase,[20] was shown to inhibit oxalate degradation to carbon dioxide and to increase the intracellular concentration of oxalate.[15,16] The subsequent increase in oxalate concentration was shown to interfere with seed coat-development and lead to reduced and delayed seed germination.[15,21] An important role of OCS in plants is to metabolize exogenous oxalate secreted by certain pathogenic fungi, e.g., Sclerotinia sclerotiorum, as a virulence factor that facilitates infection.[22,23] Indeed, inactivation of OCS both in A. thaliana and in M. truncatula was shown to increase their susceptibility to infection by S. sclerotiorum.[15,16]
Scheme 1
The enzymatic reactions catalyzed by LsOCS and LsBOS. a. In the first step, LsOCS catalyzes the ligation of oxalate (red) to ATP in the presence Mg2+ ions. A high-energy oxalyl-adenylate intermediate is formed and di-phosphate (PPi) is released. b. In the second step, the pantetheine thiol group of coenzyme A (CoA, blue) attacks the carboxylate carbon of oxalyl-adenylate, releasing AMP and oxalyl-CoA. c. The ligation of oxalyl-CoA to l-α,β-diaminopropionic acid is catalyzed by β-ODAP synthase (BOS) and produces β-N-oxalyl-l-α,β-diaminopropionic acid (β-ODAP), releasing CoA (blue).
OCS was identified more than 50 years ago[24] in grass pea (Lathyrus sativus), a legume crop that is grown for food and forage purposes in parts of South Asia and Africa.[25] It was suggested to be part of a metabolic network that synthesizes a neurotoxic compound β-N-oxalyl-l-α,β-diaminopropionic acid (β-ODAP, Scheme 1c) using an enzyme that ligates oxalyl-CoA and l-α,β-diaminopropionic acid.[24,26] The compound β-ODAP may cause lathyrism, a neurological disorder characterized by spastic paralysis of the lower limbs,[27-29] if consumed as a primary diet component over a prolonged period of time. Since OCS in grass pea uses oxalate as a substrate and is likely involved in the synthesis of β-ODAP, an attempt has been made to eliminate β-ODAP production in grass pea by the introduction of a constitutively expressed fungal oxalate decarboxylase gene.[30] Indeed, a reduction of up to 75% in oxalate levels and 73% in β-ODAP levels were obtained in the seeds of the transgenic grass pea relative to the wild type plant. However, despite the significant reduction in β-ODAP concentration, this genetically modified crop was still not devoid of the neurotoxin.We identified and cloned the gene encoding OCS from L. sativus (LsOCS), expressed the protein recombinantly, characterized its catalytic activity and determined its structure. We show that LsOCS is a bonne fide oxalyl-CoA synthetase that catalyzes the ligation of oxalate to CoA, yet exhibits low promiscuous activity with glyoxalate. We determine that LsOCS has a moderate thermal stability, which is not significantly enhanced by binding of its substrates. We then describe the crystal structure we obtained of LsOCS, captured in a thioester-forming conformation, and compare it with the structure of its homolog from Arabidopsis thaliana (AtAAE3) – the only other plant OCS crystalized so far, and with structures of other members of the ANL superfamily of enzymes.
Results
Identification of the gene encoding LsOCS in grass pea
In order to identify the ORF encoding LsOCS in grass pea we searched sequence databases for homologous sequences in other legumes. Specifically, we chose four related legumes: Arachis duranensis (XR_001592575.1), Cicer arietinum (XM_004514691.2), Vigna radiata (XM_014659057.1) and Glycine max (XM_014764165.1). Using multiple sequence alignments, we identified a consensus sequence region of ∼1 kb and used it to design gene-specific primers. The region was then PCR-amplified from purified genomic grass pea DNA (Fig S1, ESI†) and sequenced. The sequence of the amplified region was compared to a published grass pea transcriptome,[31] enabling the identification of the entire transcript. The identified transcript sequence has been deposited in the GenBank database (GenBank: MK492104.1) as the sequence of LsOCS.We performed a database search using this sequence and found OCS homologs from >90 different plants that had a relatively high sequence identity (>76%) to LsOCS (Fig. S2, ESI†). The most similar sequence, of an oxalyl-CoA ligase from Medicago truncatula (GenBank: XP_003599555.1), displayed ∼88% amino-acid identity to LsOCS. During the course of this work, a putative OCS sequence from grass pea was deposited by Kushwah, N. S. et al., in GenBank (GenBank: MH469748.1). There are, however, two differences between the protein sequence we derived and the one deposited by Kushwah, N. S. et al. While the sequence we identified contains Ala at position 474 and Thr at position 497, the deposited sequence lists Pro at both positions. In the multiple sequence alignment of OCS homologs we made, none of them possesses a Pro residue at these positions.
LsOCS was expressed and purified from E. coli cells as a soluble, monomeric protein
To ensure a high expression level of the recombinant protein, the gene encoding LsOCS was N-terminally fused to a cleavable His-bdSUMO tag.[32,33] The expressed protein was purified from bacterial cells using a Ni-NTA column followed by on-column cleavage using a bdSUMO protease. An additional size-exclusion chromatography step resulted in >95% pure, untagged protein (Fig. S4, ESI†), which migrated as a single band with an apparent molecular weight of ∼56 kDa (data not shown), consistent with the expected size of a monomeric protein.
LsOCS displays its highest catalytic efficiency with oxalic acid
We examined the ability of the purified LsOCS protein to ligate CoA and oxalate in the presence of ATP and Mg2+ (Scheme 1). Enzymatic activity was measured using a coupled enzyme assay (see Materials and methods). The turnover number (kcat) and Michaelis (KoxalateM) constant derived for the reaction were 7.6 ± 0.7 s−1 and 71.5 ± 13.3 μm respectively, giving rise to a catalytic efficiency (kcat/KoxalateM) of 1.1 × 105 M−1 s−1 (Fig. 1 and Table 1). The corresponding specific activity was 8.2 ± 0.8 μmole min−1 mg−1. To verify that ATP hydrolysis is coupled to the ligation of CoA to oxalic acid we measured the concentration of unligated CoA at different time points along the reaction, using a DTNB assay that detects free thiol groups (Materials and ethods, Fig. S5, ESI†). We found that free CoA levels were reduced by ∼80% within 10 min following the addition of substrates (Fig. S5, ESI†). Control reactions lacking the enzyme, ATP, or oxalic acid resulted in no reduction in the concentration of free CoA (Fig. S5, ESI†). Finally, we examined samples of the reaction mixture before and after incubation with the enzyme using LC-MS, and found that >95% of the initial CoA substrate had been converted to oxalyl-CoA under the reaction conditions used (Fig. S6, ESI†).
Fig. 1
Kinetics of organic acid ligation to CoA by LsOCS. The rates of CoA ligation to: oxalate (), malonate (), glyoxalate (), succinate (), lactate (○) and glycolate () were assayed using purified LsOCS (1 μM) and excess concentrations of ATP, CoA, and Mg+2 (≥1 mM). The data for oxalate was fitted to the Michaelis–Menten equation; error bars denote SD of three independent biological replicates. Data for all other substrates was fitted to a linear regression equation; error bars denote SE of the fit.
Catalytic efficiencies of LsOCS with different substrates
Substrate
kcat [s−1]
±[s−1]
KM [μM]
±[μM]
kcat/KM [s−1 M−1]
±[s−1 M−1]ab
Oxalate
7.6
0.7
71.5
13.3
1.1 × 105
2.2 × 104
Glyoxalate
—
—
—
—
1.8 × 103
3.8 × 101
Malonate
—
—
—
—
1.2 × 101
2.1
Succinate
—
—
—
—
NDc
ND
Glycolate
—
—
—
—
ND
ND
Lactate
—
—
—
—
ND
ND
Errors derived from two independent measurements per substrate except for oxalate, for which 3 independent measurements were performed. All measurements were preformed using excess amounts of CoA, ATP and Mg2+ over the carboxylate substrates.
The error in kcat/kM for oxalate was calculated using the individual errors in kcat and KM.
ND – not detectable.
Errors derived from two independent measurements per substrate except for oxalate, for which 3 independent measurements were performed. All measurements were preformed using excess amounts of CoA, ATP and Mg2+ over the carboxylate substrates.The error in kcat/kM for oxalate was calculated using the individual errors in kcat and KM.ND – not detectable.In order to examine its substrate specificity, we measured the activity of LsOCS with a number of carboxylates, all of which are prevalent plant metabolites that have similar structures to oxalate (Table 2). Two of these, malonate and succinate, are 3- and 4-carbon chain dicarboxylic acid analogues of oxalate (Table 2). Moving from the 2-carbon chain oxalate to malonate and to the longer succinate substrate, resulted in a 4-order of magnitude decrease in the enzyme's catalytic efficiency (kcat/KM) for malonate (Table 1) and the activity with succinate was bellow detection limit. With glyoxylate, a C2 monocarboxylic acid, in which one of the carboxylic groups present in oxalate was reduced to an aldehyde, a sharp 61-fold decrease in catalytic efficiency (kcat/KM) was observed (Table 1). This is likely due to loss of planarity of the hydrated species (the aldehyde form is converted to a geminal-diol in aqueous solutions). The complete loss of planarity probably underlies the lack of detectable catalytic efficiencies observed with another C2 monocarboxylate, glycolate, and with the 3-carbon monocarboxylate, lactate (Fig. 1 and Table 1).
Structures of carboxylic acids used in this study
Name
Structure
Oxalate
Malonate
Succinate
Glyoxylate
Glycolate
Lactate
LsOCS is most active at pH 8
The activity of LsOCS was assayed at different pHs, ranging from 6 to 8.5 (Fig. S7, ESI†). The highest activity was measured at pH 8. Similar results were reported for the AtAAE3[15] and MtAAE3[16] homologues of LsOCS.
The thermal stability of LsOCS is slightly increased by CoA or ATP binding
We used differential scanning fluorimetry (DSF) to assess the effect of ligand binding on the stability of LsOCS. In its Apo state, LsOCS was found to have moderate thermal stability, with an apparent midpoint temperature of unfolding (Tappm) of 43.9 °C (Table 3). Addition of CoA at 100-fold or ATP + Mg2+ at 10-fold molar excess over LsCOS, led to an increase of 1.8 °C and 2.7 °C in the Tappm of the protein, respectively (Table 3). The addition of either oxalic acid at 100-fold, CoA at 10-fold or Mg2+ at 70-fold excess, on the other hand, had no effect the thermal stability of the enzyme (Table 3). Thus, the thermal stability of OCS was only slightly increased in the presence of a large, 100-fold, excess of CoA but not by a similar concentration of oxalic acid. A greater increase was obtained using only a 10-fold excess of ATP + Mg2+ but not by a large excess of magnesium. This may indicate that in its Apo form, LsOCS prefers binding the combination of ATP and magnesium over the binding of CoA or oxalate.
Thermal stability of LsOCS
Molar ratioa
Tappmb (°C)
±(°C)c
ΔTappm relative to LsOCS-Apo (°C)
LsOCS – apo
43.9
0.02
—
LsOCS + CoA
1 : 100
45.7
0.03
1.8
LsOCS + CoA
1 : 10
44.2
0.01
0.3
LsOCS + Oxalate
1 : 100
44.1
0.02
0.2
LsOCS + Mg2+
1 : 70
44.2
0.01
0.3
LsOCS + ATP+ Mg2+
1 : 10
46.6
0.02
2.7
Molar ratio of LsOCS to the added ligand.
T
app
m is the apparent midpoint temperature of melting determined by differential scanning fluorimetry (DSF). Results represent the mean of two independent measurements.
Experimental error from two independent measurements.
Molar ratio of LsOCS to the added ligand.T
app
m is the apparent midpoint temperature of melting determined by differential scanning fluorimetry (DSF). Results represent the mean of two independent measurements.Experimental error from two independent measurements.
The crystal structure of LsOCS complexed with AMP
LsOCS was co-crystallised with Mg·ATP and sodium oxalate and the structure of the enzyme, bound to AMP, was obtained at 2.7 Å resolution (Table 4). No electron density was observed for the oxalyl moiety or for a Mg2+ ion. The crystals appeared after a few weeks, at which time the phosphoester bond between the oxalyl and AMP may have been hydrolyzed and the oxalate could have dissociated from the active site. Similar incidents of hydrolysis of the adenylate intermediate were reported for AtAAE3,[34] and for the phenylalanine-adenylating domain of gramicidin S non-ribosomal peptide synthetase, GrsA.[35]
Data collection and refinement statistics for LsOCS
Data collection
PDB code
6QJZ
Space group
P22121
Cell dimensions:
a,b,c (Å)
61.99 74.26 101.68
α,β,γ (°)
90, 90, 90
No. of copies in a.u.
1
Resolution (Å)
47.59–2.7
Upper resolution shell (Å)
2.797–2.7
Unique reflections
13 429 (1294)a
Completeness (%)
99.97 (100.00)
Multiplicity
19.7 (20.2)
Average I/σ(I)
21.78 (6.76)
R-pim
0.1858 (0.2636)
CC1/2
0.851 (0.782)
Refinement
Resolution range (Å)
47.59–2.7
No. of reflections (I/σ(I) > 0)
13 429
No. of reflections in test set
649
R-working/R-free
0.2152/0.2306
No. of protein atoms
3797
No. of water molecules
117
Overall average B factor (Å2)
22.37
Root mean square deviations:
– Bond length (Å)
0.014
– Bond angle (°)
1.69
Ramachandran plot
Most favored (%)
96.2
Additionally allowed (%)
3.6
Disallowed (%)
1.0
Values in parentheses refer to the data of the corresponding upper resolution shell.
Values in parentheses refer to the data of the corresponding upper resolution shell.By virtue of its activity and sequence, LsOCS belongs to the acyl CoA-synthetase enzyme family, which is a subfamily of the ANL superfamily of adenylating enzymes.[36] Similar to other members of this superfamily (Fig. 2), LsOCS exhibits a two-domain architecture, comprising a large N-terminal domain and a smaller C-terminal domain (Fig. 3a). The N-terminal domain consists of residues 1–415 and contains three β-sheets and 12α-helices arranged in an αβαβα domain structure. The C-terminal domain (residues 420–521) contains two β-sheets: an antiparallel, two stranded β-sheet and a central three-stranded β-sheet packed between three helices. The N- and C-terminal domains are connected via a small hinge-loop region 417IKEL420 (Fig. 3a). This loop contains a central hinge residue, Lys418, which is commonly replaced by an Asparagine or even a Serine residue in other homologs of the ANL superfamily (Fig. 2). In homologous enzymes, the hinge residue serves as a pivot for the rotation of the C-terminal domain upon the transition between the adenylate- and thioester-forming reactions.[36] The hinge loop is preceded by Arg416, a highly conserved residue known to function in the positioning and stabilization of the ATP substrate and the resulting AMP moiety in the binding pocket through interactions with the nucleotide ribose hydroxyls.[36] In the structure of LsOCS, Lys418 was found bound to the ribose of AMP, however, Arg416 was detached from it (Fig. 3b).
Fig. 2
Structural based sequence alignment of LsOCS and homologous proteins. Depicted is a protein sequence alignment of OCS from Lathyrus Sativus (LsOCS) to homologs with known structures from the ANL superfamily: OCS from Arabidopsis thaliana (AtAAE3), 4-chlorobenzoyl-CoA ligase/synthetase (CBL) from Alcaligenes sp. (AsCBL), methylmalonyl Coenzyme A synthetase (MatB) from Rhodopseudomonas palustris (RpMatB), long chain fatty acyl-CoA synthetase from Thermus thermophillus (TtFattyAcs), 4-coumarate CoA ligase from Populus tomentosa (Pt4CL), and o-succinylbenzoate CoA synthetase (MenE) from Bacillus subtilis (BsMenE). LsOCS secondary structure elements are labeled above the corresponding sequence; α-helices are depicted as spirals, and β-strands as arrows. Residues conserved in all proteins are shown as red blocks. P-loop residues are marked by a green arrow. Residues of LsOCS that interact with AMP are marked by green asterisks. The hinge residue Lys418 is marked by an inverted, green triangle. The alignment was done using MultAlin[61] and the figure was created using ESPript.[62]
Fig. 3
Structural features of LsOCS. a. The crystal structure of LsOCS with the N- (residues 1–411) and C-terminal (residues 420–521) domains shown in grey and cyan, respectively. The P-loop (177SGTTSRP183) is shown in red and the hinge-loop region (417IKEL420) is shown in purple. AMP is shown as a stick model with carbon atoms in orange, oxygen in red and nitrogen in blue. b. Plot of the interactions between LsOCS and the bound AMP. Residues making hydrogen bonds with the ligand are represented in ball and stick, whereas residues making hydrophobic interactions are shown as red crowns. The figure was created using LigPlus. c. The structures of LsOCS (PDB ID: 6QJZ), AsCBL in its thioester forming conformation (PDB ID: 3CW9) and AtAAE3 in its adenylate-forming confirmation (PDB ID: 5IE3) were superimposed, yet only their P-loops and AMP ligands are shown for simplicity. The P-loops of LsOCS (red), AtAAE3 (blue) and AsCBL (yellow) are shown next to their respective AMP ligands: LsOCS (magenta sticks), AtAAE3 (blue sticks) and AsOCS (yellow sticks). d. The oxalyl-binding pocket of AtAAE3 is shown as a gray surface. The oxalate and AMP ligands (blue sticks) are surrounded by the pocket forming residues (yellow sticks). The pocket forming residues of LsOCS (pink sticks) were overlaid. Figures a, c and d were created using PyMOL.[63]
The active site of LsOCS, like that of other ANL enzymes, is located at the interface between the N- and C-terminal domains. The AMP moiety resides in a cleft on the surface of the N-terminal domain (Fig. 3a and Fig. S8, ESI†). This position is the same as that occupied by AMP in other structures, including its Arabidopsis homolog (PDB ID: 5IE3)[34] and a chlorobenzoyl-CoA ligase (CBL) homolog from Alcaligenes sp. (AsCBL, PDB ID: 3CW9)[37] (Fig. 3c). Within the cleft, AMP interacts with eleven residues (Fig. 2 and 3b), nine of which are located in the N-terminal domain (His221, Ser296, Ala297, Ser298, Ala318, Ala320, Met321, Thr322, Asp401), one in the hinge region (Lys418) and one in the C-terminal domain (Lys427). Of these nine residues, Thr322 and Asp401 are highly conserved amongst all ANL family members, with the former residue interacting with the phosphate oxygens of the nucleotide and the latter with its ribose hydroxyls. As discussed below, the P-loop (177SGTTSRP183; marked with a green arrow in Fig. 2 and in red in Fig. 3a), which plays a critical role in the binding of ATP, points away from the binding site, similar to its orientation in AsCBL crystalized in its thioester-forming conformation (Fig. 3c).Oxalyl CoA-synthetase from Arabidopsis thaliana (AtAAE3) is a close homolog (74.7% sequence identity) of LsOCS. Currently, it is the only oxalyl CoA-synthetase whose structures have been deposited in the Protein Data Bank (PDB).[34] As mentioned above, the oxalyl moiety was not detected in the crystal structure of LsOCS. However, a binding pocket for oxalate was observed in the structure of AtAAE3, which was co-crystalized with oxalate (PDB ID: 5IE3). The pocket is shaped by eight residues in AtAAE3 (Val215-His216, Cys288-Ser289, Ala313-Met314, His319 and Lys500), three of which (Ser289, His319 and Lys500) directly interact with the substrate.[34] While Lys500 is present in all ANL enzymes and is essential for the adenylation reaction,[36] Ser289 and His319 are conserved only amongst oxalyl-CoA synthetases.[34] The latter two residues were therefore proposed to serve primarily in the binding and positioning of oxalate, governing the substrate specificity of this subset of acyl-CoA synthetases.[34] We found that the conformations of the corresponding residues in LsOCS (Val222-His223, Cys295-Ser296, Ala320-Met321, and His326) overlapped with those of AtAAE3 (Fig. 3d). The only oxalyl binding residue missing from the binding pocket structure of LsOCS was Lys507, which corresponds to Lys500 in AtAAE3. This is because no electron density was observed for the loop region 502KTATGKIL509, within which Lys507 resides in LsOCS (Fig. 3a). In addition, since Lys507 is part to the C-terminal region of the enzyme, it moves away from the active site upon transition to the thioester binding conformation (see below), thus losing its binding contacts with the oxalate substrate.
The LsOCS structure exhibits a thioester forming conformation
Acyl-CoA synthetases catalyze the ligation of carboxylate containing substrates to CoA via a two-step reaction mechanism (Scheme 1). In the first step, the substrate's carboxylate reacts with ATP, forming an adenylated substrate and releasing inorganic phosphate (PPi). In the second, the adenylated substrate reacts with CoA, forming a CoA-thioester and releasing AMP. The enzymes employ two different conformations, the adenylate- and thioester-forming conformations, to catalyze the two reaction steps. The transition between the two conformations is mediated by changes in the position of the small C-terminal domain relative to the large N-terminal domain.[36] In order to obtain structures of LsOCS in both conformations, we attempted to crystalize the enzyme with its oxalate substrate and ATP, with CoA only, and without substrates. Unfortunately, we obtained a structure of the enzyme in only one conformation. To determine in which conformation we crystalized LsOCS, we compared its structure to structures of other members of the superfamily, crystalized in either one or both conformations.Currently, there are at least 200 crystal structures from 79 different proteins of the ANL superfamily of adenylating enzymes deposited in the Protein Data Bank (http://www.acsu.buffalo.edu/∼amgulick/RANLChart.html). The enzymes have been crystallised in different liganded states, providing detailed insights into the catalytic mechanism they employ. We compared the conformation of LsOCS with the adenylate- and thioester-forming conformations of six CoA synthetases/ligases: 4-coumarate CoA ligase from Populus tomentosa (Pt4CL), o-succinylbenzoate CoA synthetase from Bacillus subtilis (BsMenE), 4-chlorobenzoyl-CoA ligase/synthetase from Alcaligenes sp (AsCBL), the long chain fatty acyl-CoA synthetase from Thermus thermophillus (TtFattyAcs), oxalyl CoA-synthetase from Arabidopsis thaliana (AtAAE3), and methylmalonyl CoA synthetase from Rhodopseudomonas palustris (RpMatB) (Fig. 4). Three of these enzymes had been crystalized in both their adenylating and thioester forming conformations (AsCBL, BsMenE, TtFattyAcs).
Fig. 4
Ribbon representation of the LsOCS crystal structure and related ANL family members. a. Structural overlay of LsOCS (PDB ID: 6QJZ) and the thioester-forming conformations of Pt4CL (PDB: 3A9V), BsMenE (PDB ID: 5X8G), AsCBL (PDB ID: 3CW9) and TtFattyAcs (PDB ID: 1V25). The N-terminal domains are in grey and the C-terminal domain are shown in cyan, green, orange, red and yellow, respectively. AMP is shown as stick model with the carbon atoms in pink, oxygen in red and nitrogen in blue. b. Structural overlay of LsOCS (PDB ID: 6QJZ) and the adenylate-forming conformations of AtAAE3 (PDB ID: 5IE3), RpMatB (PDB ID:4FUT), BsMenE (PDB ID: 5BUS) and AsCBL (PDB ID:3CW8). The N-terminal domains are in grey and the C-terminal domains are shown in cyan, blue, purple, pink and lemon-green respectively. c. Structural overlay of LsOCS (PDB ID: 6QJZ) and the adenylate-forming conformation of AtAAE3 (PDB ID: 5IE3). The N-terminal domains are in grey and the C-terminal domains are shown in cyan and blue, respectively. Also highlighted are the symmetry rotation axis between the conformations of the two C-terminal regions, the pivot residue of LsOCS, Lys418 (green sticks) and the beta hairpin following the hinge region in LsOCS (red). The figure was created using PyMOL.[63]
A structural overlay of LsOCS with the thioester-forming conformations of AsCBL, Pt4CL, BsMenE and TtFattyAcs, revealed a very good alignment of both N- and C-terminal domains (RMSD = 1.8 ± 0.7 Å), despite a low amino-acid sequence identity of 27 ± 2% (Fig. 4a). However, when we overlaid the structure of LsOCS with the adenylate-forming conformation of AsCBL, BsMenE, RpMatB and AtAAE3 (29 ± 1% aa seq. identity to LsOCS except for AtAAE3 that is 75% identical), only the N-terminal domains aligned well (Fig. 4b). The C-terminal domain of LsOCS was shifted by 138–150 degrees with respect to those of the other structures (Fig. 4c).We concluded that, while LsOCS was crystallised in the presence of ATP and oxalate, the substrates of the adenylation reaction, and in the absence of CoA, its crystal structure had adopted the thioester-forming conformation of the enzyme. Similar results had been obtained with the d-alanine-d-alanyl carrier protein ligase (DltA) from Bacilus subtilis,[38] and with the medium chain acyl-CoA synthetase (AAE) from Methanosarcina acetivorans;[39] Likewise, they both crystalized in the thioester-forming conformation in the absence of the coenzyme in the crystallization buffer.
Structural features of LsOCS indicate a partial transition to the thioester-forming conformation
As was mentioned earlier, the transition between the two partial reactions of acyl-CoA synthetases involves a large rotation of their C-terminal domain, which pivots, in LsOCS, around the hinge region residue Lys418 (Fig. 2 and 4c). This movement, termed “domain alternation”, brings the bulk of the C-terminal domain over the N-terminal domain, giving rise to multiple interactions between the two.[36] The structure of LsOCS exhibits many of the expected features of such a rotation. Lys507, a highly conserved residue that is likely involved in the catalysis of the adenylation reaction, and the P-loop are removed from the active site, whereas, the first two strands of the C-terminal (β21, β22 Fig. 2) have moved into the active site (Fig. 3d). The removal of the P-loop from the active site is essential for the initiation of the thioesterification reaction because it enables the rotation of the C-terminal domain, which is otherwise hindered by steric clashes between the P-loop and the hinge region.[34,36] Similarly, the side chain of Arg416 points away from the nucleotide rather than pointing towards its ribose hydroxyls with which it probably interacts in the ATP- and oxalyl-AMP-bound states (Fig. S9, ESI†).Another structural prerequisite for thioesterification is the opening of the pantetheine tunnel, by which the pantetheine moiety of CoA passes into the active site. This tunnel is obstructed by a histidine or by another aromatic residue, when it binds the α-phosphate group of ATP during the adenylation reaction. Upon transitioning into the thioester conformation, the obstructing residue detaches from the α-phosphate group and moves away from the tunnel.[36] In AtAAE3, this residue, His214, had been crystalized both in its ATP bound, tunnel-obstructing rotamer (PDB ID: 5IE2) and in its perpendicular, tunnel-opening rotamer (PDB ID: 5IE3, 5IE2).[34] In the structure of LsOCS, the corresponding residue (His221) is bound to AMP and blocks the pentathein tunnel (Fig. S10, ESI†). It therefore appears, that while LsOCS has been crystalized in the thioester-forming conformation, its transition to this conformation has not been fully completed.
Discussion
The activity of the enzyme oxalyl-CoA synthetase was first identified 60 years ago in extracts of germinated pea seeds (Pisum sativum) during an investigation into the metabolism of oxalic acid in plants.[40] OCS itself was later partially purified and characterized from extracts of pea seeds[41] and grass pea (Lathyrus sativus L.), in which it was first suggested to produce oxalyl-CoA, and that the latter is a precursor to the neurotoxin β-ODAP.[24,26] Grass pea seeds have high nutritional value[42,43] and serve as a source of food and forage mostly in South Asia and Sub-Saharan Africa.[25,44,45] Unfortunately, if consumed as a primary diet component over a prolonged period of time, the β-ODAP they contain may cause neurolathyrism, a disorder characterized by spastic paralysis of the lower limbs.[27-29] Since β-ODAP is synthesized by the ligation of oxalyl-CoA and l-α,β-diaminopropionic acid, the concentration of oxalyl-CoA in grass pea is key to regulating the amount of β-ODAP it produces. This was exemplified by the concomitant reduction in both oxalate and β-ODAP concentrations in grass pea following the introduction of a constitutively expressed fungal oxalate decarboxylase gene into its genome.[30] It also indicates the role of LsOCS in grass pea, which may serve not only to reduce the levels of oxalate but also to promote the synthesis of β-ODAP.The properties of LsOCS we characterized in this work highlight its resemblance to previously characterized OCS homologs. The specific activity of LsOCS with oxalate, 8.2 ± 0.2 μmole min−1 mg−1, is similar to that of several plant homologs such as AtAAE3, MtAAE3 and GsAAE3 (11.4 ± 1, 19 ± 0.9 and 12.64 ± 0.34 μM min−1 mg−1), respectively,[5,15,16] as well as to that of a yeast homolog ScAAE3 (12 ± 1 μM min−1 mg−1).[19] The KM we determined for oxalate binding by LsOCS (71.5 ± 13.28 μM) is also similar to those reported for AtAAE3, MtAAE3 and GsAAE3 (149 ± 12.7, 81 ± 8.1 and 105.1 ± 12.3 μM), but ∼3 fold greater than that reported for ScAAE3 (20 ± 2.7 μM). All five OCS's have similar molecular weights ∼55–60 kDa and display a maximal activity at pH 8. Additionally, the two enzymes crystalized, LsOCS and AtAAE3, present similar structures[15] and display a strong preference for oxalate as a substrate over other carboxylic acids such as malonate, succinate, glycolate, glyoxalate and lactate.[15] The main difference between LsOCS and AtAAE3 was the higher catalytic efficiency we found with glyoxalate (Table 1) with respect to the low activity found by Foster et al.[15] This difference may result from intrinsic differences in the substrate promiscuity of the two orthologs or may be attributed to differences in the experimental systems used.The ANL superfamily of adenylating enzymes consists of acyl- and aryl-CoA synthetases, the adenylation domains of the non-ribosomal peptide synthetases (NRPSs) and firefly luciferase. These enzymes are highly diverse, share little overall sequence identity and catalyze different overall reactions. Nonetheless, all of them contain two independent domains: a large N-terminal domain and a small C-terminal domain, connected by a short hinge region, and all of them employ a two-step reaction mechanism, in which the active site alternates between two main active conformations. The first conformation relates to the initial partial reaction, in which the carboxylate substrate is activated by interaction with ATP, forming an adenylate intermediate (e.g. acyl/aryl-AMP) followed by the release of inorganic phosphate (PPi) (Scheme 1). The second conformation is formed by a rotation of the C-terminal domain around the hinge region (Fig. 3). This conformation is suitable for catalyzing the second step; the interaction of a nucleophile (e.g. an amine, an alcohol or a thiol) with the adenylated substrate and the release of AMP (Scheme 1). The adenylate intermediate, a high-energy acid anhydride, provides the activation energy for a diverse set of second partial reactions.[36] In the acyl-CoA synthetase family, to which LsOCS belongs, during the second part of the reaction the carboxylate carbon of the acyl-AMP intermediate undergoes a nucleophilic attack by the pantetheine thiol group of CoA, displacing AMP and forming the acyl-CoA product (Scheme 1). Thus, the two distinct active conformations of acyl-CoA synthetases are those promoting the adenylate intermediate and the thioester product. While AtAAE3 adapted the adenylate forming conformation in its crystal structure, LsOCS had crystalized in the thioester conformation. The similarity between LsOCS and AtAAE3 in sequence, structure and activity enables their crystal structures to be seen to complement each other and provides information on the rotation of their C-terminal domain between its two conformational states.The inactivation of AtAAE3 in Arabidopsis thaliana resulted in the accumulation of 3-fold higher oxalate levels in its seeds, the formation of oxalate crystals and seed coat defects that reduced their germination substantially.[15] Similarly, the inactivation of MtAAE3 in Medicago truncatula resulted in reduced vegetative growth and seed germination, as well as, increased calcium levels, calcium-oxalate crystal number and permeability of its seeds.[21] In both cases, the reduction in oxalate degradation capabilities had also increased the susceptibility of their host plants to infection by the pathogenic fungus Sclerotinia sclerotiorum.[15,16] In view of this, we suspect that simply inactivating LsOCS in grass pea using genetic engineering may serve to reduce the biosynthesis of β-ODAP, but may also result in the accumulation of toxic oxalate in its cells and increase its susceptibility to pathogens such as S. sclerotiorum. Thus, we suggest that replacing the gene encoding LsOCS in grass pea with an exogenous oxalate-oxidase or decarboxylase may enable the plant to regulate cellular oxalate levels while reducing the levels of β-ODAP in the plant.Finally, the importance of identifying new OCS enzymes extends beyond the need to generate improved grass pea cultivars. Oxalate-degrading enzymes have potential use not only for crop improvement,[10] and for human therapeutic purposes,[46,47] but also in other areas. For example, in the pulp and paper industry and in forest bio-refineries, oxalate-degrading enzymes are used in the prevention of scaling, the formation of calcium oxalate incrusts.[48] Similarly, in the brewery industry, the use of these enzymes reduces calcium oxalate deposits in beer production. Thus, the identification of LsOCS and additional homologous may serve a broad range of research fields and applications.
The solution buffer of purified LsOCS was exchanged to phosphate buffer (PBS 20 mM, pH 8) by repeated (4×) concentration (4000 × g, 20 min) and washing cycles, using an Amicon® Ultra-2 centrifugal filter unit (10 kDa, MWCO). Protein concentration was determined using solution absorbance at 280 nm. Solutions of purified LsOCS (final concentration 0.5 mg ml−1 in PBS) with CoA (0.1, 1 mM), Oxalate (1 mM), MgCl2 (0.7 mM), ATP + MgCl2 (0.1 mM) or without any added compounds (Apo), were incubated for 15–20 min on ice. Differential scanning fluorometry (DSF) was measured for duplicate samples at 330 and 350 nm using a NanoDSF Prometheus NT.48™ instrument.
We co-crystallised LsOCS in the presence of ATP, MgCl2 and oxalate, the substrates of the adenylate-forming half-reaction. LsOCS complexes formed rod-like crystals using the hanging drop vapor diffusion method and a Mosquito robot (TTP LabTech) at 19 °C utilizing the precipitants 0.2 M NaCl and 25% Polyethylene glycol (PEG) 3350 in 100 mM Tris pH 8.5. The LsOCS complex crystals formed in the orthorhombic space group P22121, with one monomer per asymmetric unit and diffracted to 2.7 Å resolution. Data collection was performed under cryo conditions (100 K), in-house, using a Rigaku RU-H3R X-ray instrument. All diffraction images were indexed and integrated using the Mosflm program,[53] and the integrated reflections were scaled using the SCALA program.[54] Structure factor amplitudes were calculated using TRUNCATE[55] from the CCP4 program suite. The structure of LsOCS was solved by molecular replacement with the program PHASER,[56] using the refined homologous (75% sequence identity) structure of the LsOCS from Arabidopsis thaliana (AtAAE3), PDB-ID 5IE2.[34] All steps of the atomic refinements were performed with the PHENIX.refine program.[57] The model was built into 2mFobs-DFcalc, and mFobs – DFcalc maps using COOT.[58] Refinement movements were accepted only when they produced a decrease in the Rfree value. The model was optimized using PDB_REDO,[59] and was evaluated with MOLPROBIDITY.[60] Details of the data collection and refinement statistics of the LsOCS structure are described in Table 4.Coenzyme ALathyrus sativus oxalyl-CoA synthetaseArabidopsis thaliana oxalyl-CoA synthetaseMedicago truncatula oxalyl-CoA synthetaseAcyl activating enzyme 3l-β-N-Oxalyl-α-β-diaminopropionic acid.
Author contributions
M. G. performed the kinetic analyses, the thermal stability assays, and the HPLC analysis, analyzed the data and wrote the paper with contributions from all authors. S. B. identified and isolated the gene from grass pea. A. B. and T. M. performed the LC-MS analysis. Y. P. cloned and expressed the protein for purification. S. A. purified the protein. O. D. crystalized the protein and determined its structure. Z. R. initiated and supervised the study and wrote the manuscript with contributions from all authors.
Data and materials availability
Refined coordinates were deposited in the Protein Data Bank Database under accession code 6QJZ. The identified transcript sequence has been deposited in the GenBank database (GenBank MK492104.1) as the sequence of LsOCS. Materials are available from the authors upon request.