Moshe Goldsmith1, Shiri Barad2, Maor Knafo2, Alon Savidor3, Shifra Ben-Dor4, Alexander Brandis4, Tevie Mehlman4, Yoav Peleg4, Shira Albeck4, Orly Dym4, Efrat Ben-Zeev5, Ranjit S Barbole6, Asaph Aharoni7, Ziv Reich8. 1. Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel. Electronic address: moshe.goldsmith@weizmann.ac.il. 2. Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel. 3. De Botton Institute for Protein Profiling, The Nancy and Stephen Grand Israel National Center for Personalized Medicine, Weizmann Institute of Science, Rehovot, Israel. 4. Department of Life Science Core Facilities, Weizmann Institute of Science, Rehovot, Israel. 5. Medicinal Chemistry Unit, The Nancy and Stephen Grand Israel National Center for Personalized Medicine, Weizmann Institute of Science, Rehovot, Israel. 6. Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel; Plant Molecular Biology Unit, Division of Biochemical Sciences, Council of Scientific and Industrial Research-National Chemical Laboratory, Pune, Maharashtra, India. 7. Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel. 8. Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel. Electronic address: ziv.reich@weizmann.ac.il.
Abstract
Grass pea (Lathyrus sativus L.) is a grain legume commonly grown in Asia and Africa for food and forage. It is a highly nutritious and robust crop, capable of surviving both droughts and floods. However, it produces a neurotoxic compound, β-N-oxalyl-L-α,β-diaminopropionic acid (β-ODAP), which can cause a severe neurological disorder when consumed as a primary diet component. While the catalytic activity associated with β-ODAP formation was demonstrated more than 50 years ago, the enzyme responsible for this activity has not been identified. Here, we report on the identity, activity, 3D structure, and phylogenesis of this enzyme, β-ODAP synthase (BOS). We show that BOS belongs to the benzylalcohol O-acetyltransferase, anthocyanin O-hydroxycinnamoyltransferase, anthranilate N-hydroxycinnamoyl/benzoyltransferase, deacetylvindoline 4-O-acetyltransferase superfamily of acyltransferases and is structurally similar to hydroxycinnamoyl transferase. Using molecular docking, we propose a mechanism for its catalytic activity, and using heterologous expression in tobacco leaves (Nicotiana benthamiana), we demonstrate that expression of BOS in the presence of its substrates is sufficient for β-ODAP production in vivo. The identification of BOS may pave the way toward engineering β-ODAP-free grass pea cultivars, which are safe for human and animal consumption.
Grass pea (Lathyrus sativus L. [Ls]) is an annual legume crop grown for food and forage, mainly in South Asia and Sub-Saharan Africa (1, 2). In addition to its high grain yield (3) and the high nutritional value of its seeds (4, 5), its attractiveness as a farming crop stems from its remarkable tolerance to harsh environmental conditions such as drought, high salinity, and flooding (6, 7, 8), as well as its resistance to insects and fungal diseases (9, 10). Unfortunately, it produces a neurotoxic glutamate analog, named β-N-oxalyl-L-α,β-diaminopropionic acid (β-ODAP) (11, 12). This neurotoxin may cause neurolathyrism, a nutritional neurodegenerative disorder characterized by lower limb paralysis (13, 14). Neurolathyrism results from a chronic intoxication caused by the long-term ingestion of seeds or flour made from grass pea as a primary diet component (15). While grass pea cultivars with reduced concentrations of β-ODAP have been bred (16), none is devoid of β-ODAP. Furthermore, β-ODAP levels may increase under stress conditions such as drought (17). As an alternative approach to conventional breeding, a constitutively expressed fungal oxalate decarboxylase gene was introduced into grass pea to degrade oxalate and, subsequently, reduce the concentration of oxalyl-CoA, a precursor of β-ODAP (18). This attempt reduced β-ODAP concentrations by up to 73% in the seeds of the transgenic plant but was unable to prevent β-ODAP production completely. Thus, despite its high nutritional value and enormous agricultural potential, grass pea remains an underutilized crop of limited economic importance in global markets. Yet, it constitutes an essential source of food and income security for resource-poor farmers in developing countries (2, 19).
To ensure safe consumption of grass pea seeds and increase its use as a food crop, a cultivar that does not contain β-ODAP is required. For such a cultivar to be developed by genetic engineering using, for example, CRISPR/CRISPR-associated protein 9 (20), the biosynthetic pathway leading to β-ODAP production needs to be elucidated.
Studies aiming to identify the enzymes responsible for the biosynthesis of β-ODAP in grass pea were initiated more than 50 years ago (21, 22, 23). They indicated that β-ODAP is synthesized by the ligation of oxalyl-CoA and L-α,β-diaminopropionic acid (L-DAPA), catalyzed by a dedicated synthase (23). This synthase, however, has hitherto not been identified, precluding the use of genome editing tools to generate β-ODAP–free cultivars.
Here, we report on the isolation, identification, and characterization of a β-ODAP synthase (BOS) from grass pea. The identification of BOS paves the path toward the application of genome editing techniques to generate grass pea cultivars devoid of β-ODAP.
Results and discussion
Isolation and identification of BOS from grass pea
Grass pea seeds, seedlings, and developing leaves accumulate the highest levels of β-ODAP in the plant (24). We, therefore, used seeds and seedlings to identify and isolate BOS by means of fractionation and protein purification. To trace the enzyme during purification, we assayed protein fractions for their ability to ligate oxalyl-CoA to L-DAPA (Fig. 1), using a colorimetric assay that detects the presence of free L-DAPA following its derivatization by o-phthalaldehyde (OPT) (25). While L-DAPA is commercially available, the availability of oxalyl-CoA is quite limited (26). We, therefore, synthesized it in vitro from oxalic acid and CoA using a recombinant oxalyl-CoA synthetase (OCS) (27). OCS is an ATP-dependent enzyme that catalyzes the ligation of oxalate to CoA, forming oxalyl-CoA (Scheme S1). Its activity and involvement in the production of β-ODAP in grass pea were discovered more than 50 years ago (21, 22). We cloned the OCS gene from grass pea, expressed and purified it from Escherichia coli (27), and used it to produce oxalyl-CoA as a substrate for β-ODAP activity assays.
Figure 1
Reaction scheme and catalytic efficiencies of BOS. A, a scheme of the reaction catalyzed by BOS. B, additional CoA substrates analyzed in this work. C, apparent catalytic efficiencies of BOS with different acyl-CoA substrates. The data for oxalyl-CoA were derived from a fit to the Michaelis–Menten equation; error bars denote SD of three independent repeats. Data for all other substrates were fitted to the linear regime of the Michaelis–Menten model, and kcat/Km was deduced from the slope; error bars denote SE of the fit. BOS, β-ODAP synthetase.
Following sequential chromatographic separations of soluble grass pea extracts on multiple columns (Experimental procedures section), we obtained a protein fraction enriched in BOS activity. The process was repeated four times, and the proteins present in the final fractions were then subjected to proteolysis and peptide sequencing using liquid chromatography-mass spectrometry (LC–MS/MS) analysis (Experimental procedures section).
To correlate the peptide sequences with their encoding genes, we sequenced the grass pea transcriptome using long-read PacBio sequencing. The constructed set of annotated full-length mRNA transcripts was translated and used as a database for the identification of the proteins in the enzymatically active fractions (Table S1). Between 244 and 1960 different proteins, including BOS, were identified per fraction, of which only 219 were found in all active fractions (Table S2). Of these, 45 were predicted to be ligases (gene ontology [GO]:0016874), transferases (GO:0016740), synthases (e.g., GO:0004019), or CoA-binding enzymes (GO:0120225). From this latter set, we selected 18 sequences for functional analysis, based on the fact that they were annotated as CoA-binding enzymes or shikimate pathway ligases and transferases and seemed likely to bind the relevant substrates and catalyze the ligation reaction. The corresponding genes were cloned and expressed in E. coli cells, the proteins were purified, and the level of BOS-like activity was assayed in vitro. Of the 18 clones tested, only one exhibited a significant level of such activity (Fig. S1).
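The candidate-selection logic described above (keep only proteins detected in every active fraction, then keep those annotated with a relevant GO term) can be illustrated with a minimal Python sketch. The function names, protein identifiers, and annotation table are hypothetical and are not taken from the study; only the four GO terms come from the text. Matching on exact GO identifiers is a simplification, since a real analysis would also consider descendant terms in the GO hierarchy.

```python
# GO terms named in the text: ligase, transferase, a synthase example, CoA binding.
TARGET_GO_TERMS = {"GO:0016874", "GO:0016740", "GO:0004019", "GO:0120225"}


def shared_proteins(fractions):
    """Return the protein IDs that occur in every fraction (set intersection)."""
    if not fractions:
        return set()
    return set.intersection(*(set(f) for f in fractions))


def select_candidates(protein_ids, go_annotations, target_terms=TARGET_GO_TERMS):
    """Keep proteins annotated with at least one of the target GO terms."""
    return {
        pid
        for pid in protein_ids
        if target_terms & set(go_annotations.get(pid, ()))
    }


# Hypothetical identifiers for four active fractions (illustration only).
fractions = [
    {"t1", "t2", "t3", "t4"},
    {"t2", "t3", "t5"},
    {"t2", "t3", "t4"},
    {"t2", "t3", "t6"},
]
# Hypothetical GO annotations (illustration only).
annotations = {
    "t2": ["GO:0016740"],  # transferase activity
    "t3": ["GO:0005515"],  # protein binding, not a target term
    "t4": ["GO:0016874"],  # ligase activity, but absent from some fractions
}

shared = shared_proteins(fractions)
candidates = select_candidates(shared, annotations)
print(sorted(shared), sorted(candidates))
```

Applying the intersection first and the annotation filter second mirrors the order in the text, where 219 shared proteins were narrowed to 45 by predicted function.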
The catalytic activity of BOS
To verify that the observed activity was that of a bona fide BOS, BOS was purified from E. coli and the catalytic activity of the recombinant enzyme, an untagged monomer of 439 aa (49.3 kDa, Fig. S2), was assayed using several CoA substrates. The catalytic efficiencies of BOS toward L-DAPA were determined using fixed concentrations of the CoA substrates in excess over varying concentrations of L-DAPA (Figs. 1 and S3). With oxalyl-CoA as substrate, BOS exhibited moderate efficiency, with an apparent turnover rate (kcat) of 118 ± 15 sec⁻¹ and an apparent Michaelis constant for L-DAPA (Km(L-DAPA)) of 2.5 ± 0.6 mM, giving rise to an apparent catalytic efficiency (kcat/Km(L-DAPA)) of (4.7 ± 1.3) × 10⁴ sec⁻¹ M⁻¹ (Fig. S3) and a specific activity of 13.2 ± 1.6 mmol⋅sec⁻¹⋅mg⁻¹. However, when oxalyl-CoA was substituted by other CoA substrates, such as acetyl-CoA, malonyl-CoA, or glutaryl-CoA, the catalytic efficiency dropped by 30- to 48-fold (Figs. 1 and S3), indicating a catalytic specificity of BOS toward oxalyl-CoA. The enzyme used for synthesizing the substrate oxalyl-CoA, OCS (LsOCS), did not produce any β-ODAP in the presence of L-DAPA and oxalyl-CoA (data not shown).
Although L-DAPA is, by itself, a stable compound in vitro, attempts to isolate it from grass pea have failed, suggesting that it is a short-lived metabolic intermediate (28). Its precursor β-(isoxazolin-5-on-2-yl)alanine (BIA) and its product (β-ODAP) (Scheme S1), on the other hand, accumulate to high concentrations in the seeds and seedlings of grass pea: up to 2% of the dry weight for BIA (29) and 0.5 to 2.5% for β-ODAP (11). Notwithstanding the large variabilities in seed weight (34–350 mg) (30, 31) and water content (7.5–30.7%) (12, 32) of grass pea seeds, their effective concentrations of BIA and β-ODAP may reach 0.25 to 1.7 M, suggesting that L-DAPA is produced in grass pea at concentrations well above the observed Km(L-DAPA) of BOS. The magnitude of the latter is similar to that of other enzymes on the biosynthetic pathway of β-ODAP in grass pea, such as cysteine synthase A and B (Km = 6.1 mM and 7.2 mM, respectively) (33), enzymes involved in secondary metabolism (34) and many other enzymes (35).
The formation of β-ODAP and its nontoxic isomer α-ODAP in the reaction, along with the consumption of L-DAPA, was monitored by LC–MS. The results showed that >99% of the L-DAPA in the BOS-catalyzed reaction had been converted to β-ODAP, while the concentration of α-ODAP was at least an order of magnitude lower (<1%) and similar to that obtained in the uncatalyzed control reaction (Fig. S4). BOS, thus, displays a high degree of regioselectivity toward acylating the β-amino group of L-DAPA in vitro.
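As a worked check of the kinetic figures quoted in this section, the following Python sketch recomputes the apparent catalytic efficiency from the reported kcat and Km for L-DAPA. The error is propagated with the standard first-order formula for a quotient of independent quantities; this is an assumption, since the text does not state how the reported uncertainty was obtained. The function names are illustrative.

```python
import math


def catalytic_efficiency(kcat, kcat_sd, km_molar, km_sd_molar):
    """Return kcat/Km and its uncertainty (first-order propagation,
    assuming independent errors in kcat and Km)."""
    efficiency = kcat / km_molar
    relative_sd = math.sqrt((kcat_sd / kcat) ** 2 + (km_sd_molar / km_molar) ** 2)
    return efficiency, efficiency * relative_sd


def michaelis_menten_rate(kcat, km_molar, enzyme_molar, substrate_molar):
    """Initial rate v = kcat * [E] * [S] / (Km + [S]), in M per second."""
    return kcat * enzyme_molar * substrate_molar / (km_molar + substrate_molar)


# Values reported in the text for oxalyl-CoA: kcat in s^-1, Km converted from mM to M.
eff, eff_sd = catalytic_efficiency(118.0, 15.0, 2.5e-3, 0.6e-3)
print(f"kcat/Km = {eff:.2e} +/- {eff_sd:.1e} M^-1 s^-1")

# At [S] << Km the rate is approximately (kcat/Km) * [E] * [S]; this is the
# linear regime from which a slope-based kcat/Km estimate can be read.
# The enzyme and substrate concentrations below are illustrative only.
enzyme, substrate = 1e-7, 1e-5
exact = michaelis_menten_rate(118.0, 2.5e-3, enzyme, substrate)
linear = eff * enzyme * substrate
print(f"exact = {exact:.3e} M/s, linear approximation = {linear:.3e} M/s")
```

Under these assumptions the recomputed value, about 4.7 × 10⁴ M⁻¹ s⁻¹ with an uncertainty of about 1.3 × 10⁴, agrees with the figure quoted above.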
Production of β-ODAP by BOS in tobacco leaves
To determine if BOS activity is sufficient to produce β-ODAP in vivo, we transiently expressed BOS in Nicotiana benthamiana leaves using Agrobacterium tumefaciens infiltration and injected them with L-DAPA (Experimental procedures section). The presence of the other precursor, oxalyl-CoA, in N. benthamiana leaves was expected due to the identification of putative OCSs in database searches of several Nicotiana species, including benthamiana (data not shown). The concentrations of α- and β-ODAP, as well as of L-DAPA, were then determined in leaf samples using LC–MS (Fig. S5). The results revealed that only leaves that expressed BOS and were injected with L-DAPA produced α- and β-ODAP (Fig. S5). The relative concentrations of the two isomers in the leaves could not be determined since the extraction process converts part of β-ODAP to α-ODAP (36). Wild-type N. benthamiana leaves or leaves infiltrated with A. tumefaciens carrying an empty vector and injected with L-DAPA did not produce α- or β-ODAP. Thus, BOS expression in the presence of its substrates is both necessary and sufficient to produce β-ODAP in planta.
Phylogeny
Sequence analysis of BOS revealed that almost its entire sequence (residues 3–434 of 439) corresponds to a transferase domain (PF02458) of a family found predominantly in plants and fungi. The closest relative identified was enhanced Pseudomonas susceptibility 1—an Arabidopsis thaliana protein that belongs to the benzylalcohol O-acetyltransferase, anthocyanin O-hydroxycinnamoyltransferase, anthranilate N-Hydroxycinnamoyl/benzoyltransferase, deacetylvindoline 4-O-acetyltransferase (BAHD) acyltransferase superfamily (37). Correspondingly, BOS was found to possess the two hallmarks of this superfamily: a conserved active site HXXXD motif (residues 162–166, HSVVD, Fig. 2) and a structural DFGWG motif (residues 381–385, Fig. 2) (37, 38, 39). We note that the sequence we determined is identical to that of a transcript that was identified as a BOS candidate and published in a PhD thesis (40).
Figure 2
Secondary structure–based sequence alignment of BOS and HCT homologs. The sequences of hydroxycinnamoyl:CoA-shikimate hydroxycinnamoyl transferases (HCTs) from different species were aligned to that of BOS. Sequences are labeled by their PDB codes: Selaginella moellendorffii (6DD2); Plectranthus scutellarioides (5KJV); Coffea canephora (4G0B); Arabidopsis thaliana (5KJU); Panicum virgatum (5FAL); and Sorghum bicolor (4KE4). BOS secondary structure elements are noted above the sequences and those of Sorghum bicolor (4KE4) - at the bottom. α-Helices and η-helices are shown as spirals, and β-strands are indicated by arrows. Conserved residues are shown in red blocks. Green stars denote BOS His162 and Asp166, which are part of the conserved BAHD acyltransferase HXXXD motif. BAHD, Benzylalcohol O-acetyltransferase, Anthocyanin O-hydroxycinnamoyltransferase, anthranilate N-Hydroxycinnamoyl/benzoyltransferase, Deacetylvindoline 4-O-acetyltransferase; BOS, β-ODAP synthetase.
A search for other BAHD acyltransferases in the grass pea transcriptome that we generated identified 30 additional members of this superfamily (SI Appendix A). A phylogenetic tree of BAHD proteins was constructed to determine the clade to which BOS belongs (Fig. 3). In total, 299 sequences were aligned, 93 of which had known biological functions (SI Appendix A and B). The tree (Fig. 3) shows the typical clades of the superfamily, namely, Ia, Ib, II, IIIa, IIIb, IV, Va, and Vb (37, 41, 42) with many branches exhibiting species-specific expansions (Fig. S6). BOS, along with 10 other grass pea proteins, forms part of clade Ib (Figs. 3 and S6) and is the second protein in that clade, after A. thaliana defective in cuticular ridges (43), to be characterized. Of these 10 proteins (LsIb1–10), three (LsIb3, LsIb4, and LsIb9) were not found in any of the four enzymatically active fractions of BOS. Four sequences (LsIb7–10) constitute the closest homologs of BOS (Fig. 3). Nonetheless, all clade Ib members were cloned, purified, and assayed for BOS activity. Apart from BOS itself, none of them had BOS activity (Fig. S1). Additional BAHD proteins we identified in grass pea belong to other, more distant clades (Fig. 3), have much lower sequence similarity to BOS, and are highly unlikely to have BOS activity. BOS is, thus, likely to be the sole enzyme capable of producing β-ODAP in grass pea.
Figure 3
Phylogenetic tree of BAHD superfamily proteins. Sequence names are prefixed by a two-letter species code and colored by species: L. sativus (Ls), blue; A. thaliana (At), green; M. truncatula (Mt), purple; P. trichocarpa (Pt), brown; other species are colored black. The outer ring and clades are colored as follows (clockwise from the open end on the left): putative clade IIIb (dark purple), clade II (red), clade IV (pink), clade Vb (light green), clade Va (dark green), clade IIIa (light purple), clade Ia (light blue), and clade Ib (dark blue). Sequences of proteins with known functions are shaded in light blue; BOS is marked by a red arrow and shaded in yellow. BAHD, Benzylalcohol O-acetyltransferase, Anthocyanin O-hydroxycinnamoyltransferase, anthranilate N-Hydroxycinnamoyl/benzoyltransferase, Deacetylvindoline 4-O-acetyltransferase.
Crystal structure
BOS was crystallized in its apo state, and its structure was solved to 2.35-Å resolution (Table S3). It consists of 15 β-strands and 11 α-helices and is organized into two equally sized domains of ∼200 aa. Each domain centers around a six-stranded β-sheet flanked by α-helices (Fig. 4A). The two domains are connected by two loops, one (residues 183–209) linking the N- and C-terminal domains (Fig. 4A) and the other (residues 371–388) joining β-strand 13 from the C-terminal domain (residues 389–393) to the β-sheet of the N-terminal domain (Fig. 4A). The latter loop contains the aforementioned DFGWG motif (Fig. 4B).
Figure 4
Structure of BOS and docked substrates. A, ribbon diagram of BOS (PDB ID: 6ZBS). The N- (cyan) and C-terminal (green) domains are connected by a loop (residues 183–209; blue) and a segment comprising a β-strand and a short loop (residues 371–393, red). B, surface representation with oxalyl-CoA (magenta) docked into the active site. The conserved DFGWG motif is colored yellow. C, side view showing both oxalyl-CoA (magenta) and L-DAPA (cyan) docked into the active site. D, a close-up view showing oxalyl-CoA (magenta) and L-DAPA (green) docked into the active site along with the conserved catalytic residues His162 and Asp166 (orange). A rotamer of Asp166 that places its side chain within hydrogen-bonding distance of His162 was chosen manually to demonstrate a possible interaction. H-bonds are denoted by dashed lines. BOS, β-ODAP synthetase; L-DAPA, L-α,β-diaminopropionic acid.
A search for structural homologs of BOS yielded six enzymes (Fig. 2), all of which are plant hydroxycinnamoyl transferases (HCTs; EC:2.3.1.133; Fig. S7). The average RMSD between the structure of BOS and HCTs from Selaginella moellendorffii, A. thaliana, Coffea canephora, Panicum virgatum, Sorghum bicolor, and Plectranthus scutellarioides is 2.7 ± 0.1 Å. The average sequence identity between BOS and these HCTs is only 22 ± 2%, while the sequence identity among the HCTs themselves ranges from 56% to 93%. The main differences between BOS and these enzymes are the lengths and conformations of some of its loops (Fig. S8, A and B) and the geometry of the active site tunnel, which is approximately straight in HCTs (Fig. S8, C and D), but bent to ∼15° in its middle in BOS (Fig. S8, E and F). Thus, despite its overall structural similarity, BOS is different from other plant HCTs, both in sequence and in key structural features.
Catalytic mechanism
Our attempts to cocrystallize BOS with either of its substrates or products were unsuccessful. To gain insight into the catalytic mechanism of BOS, we docked its two substrates to the active site, using the structure of P. virgatum HCT (44) (Protein Data Bank [PDB] ID: 5FAL) as a guide. The docking result showed that oxalyl-CoA resides within the active-site tunnel with the CoA nucleotide moiety situated at the entrance to the tunnel and its oxalyl moiety buried deep inside the tunnel (Fig. 4, B–D). As predicted, the other substrate, L-DAPA, is positioned within binding distance from the thioester carbonyl carbon, with which it interacts (Fig. 4D). Histidine 162 of the conserved catalytic HXXXD motif is located 3.5 Å away from the β amine group of L-DAPA (Fig. 4D). Thus, similar to its role in other BAHD acyltransferases (38), His162 likely acts to deprotonate the terminal amino nitrogen of L-DAPA, enabling a nucleophilic attack on the carbonyl carbon. This, in turn, results in the formation of a tetrahedral intermediate that subsequently collapses to release CoA and the product β-ODAP (Figs. 4D and S9). The role of the conserved aspartic acid residue of the HXXXD motif, Asp166, may be structural (38) or may serve to activate His162 through hydrogen bonding (Fig. 4D). Our analysis also suggests that other residues, including Lys40, Asp284, Ser371, Ile369, and Lys401, may participate in substrate binding in the active site (Fig. S10). To verify the requirement of His162 and Asp166 for catalysis, we mutated them, individually, to residues with similar side-chain volumes and polarities (His162Gln and Asp166Ser). Both single mutants expressed and purified as soluble proteins but exhibited little to no activity (Fig. S1), supporting their essential roles in the catalytic mechanism of BOS.
Molecular docking simulations
We used molecular docking simulations to predict the likelihood that different CoA substrates will occupy the active site of BOS in a catalytically productive manner. The simulations generated different ligand orientations, and for each orientation, a docking score was calculated. The ligand orientations were sorted by their docking score, and the top-scoring (most negative, thus favorable to binding) orientations were analyzed to estimate their catalytic efficiency. We ran 385 docking simulations using CoA, acetyl-CoA, oxalyl-CoA, malonyl-CoA, glutaryl-CoA, and 4-coumaroyl-CoA as acyl substrates. The simulations were carried out using the structure of BOS predocked with L-DAPA. The rotamer of Asp166 that places it in close, hydrogen bonding distance with His162 (Fig. 4D) was manually chosen in accordance with our proposed catalytic mechanism (Fig. S9). We surmised that optimal substrate binding would place the CoA substrate at a distance and an angle that would favor a productive nucleophilic attack by the β-amine group of L-DAPA on the thioester carbonyl carbon. The optimal angle of attack for a nucleophile on an unsaturated carbonyl, that is, the Bürgi–Dunitz angle (45), is generally taken to be 105 ± 5°, and the initial distance between the nucleophile and the electrophile is expected to range between 2.5 and 3.4 Å (46, 47). Accordingly, we analyzed the top 10% docking results (data not shown) by the proximity and angle of their carbonyl group to the nucleophilic amine of L-DAPA. Of the seven best-ranking binding simulations (Table S4), five were of oxalyl-CoA. Malonyl-CoA received the highest docking score, yet its carbonyl was found a long distance (4.58 Å) away from its nucleophile and at an attack angle of only 59.5 degrees. Binding in this mode is less likely to be productive than that of oxalyl-CoA. 
Acetyl-CoA, on the other hand, received the lowest docking score of the top seven results, yet its carbonyl was positioned at the shortest distance from the nucleophile (3.2 Å) and at a favorable angle of 102 degrees. Two substrates were not included in the top seven ranking solutions: glutaryl-CoA and 4-coumaroyl-CoA. The former was chosen since it is a three-carbon backbone analog of oxalyl-CoA, and the latter is the substrate of the HCT structural analogs of BOS. In both cases, the simulations resulted in their thioester carbonyl groups positioned at long distances (10 ± 5 Å) from the β-amine group of L-DAPA. Such orientations are less likely to lead to a productive ligation reaction. As a reference, we docked CoA itself and obtained docking scores similar to those of other substrates (Table S4), indicating that much of the binding interaction between BOS and the CoA substrates is mediated by their large CoA moiety. In our simulations, the average distance of the oxalyl-CoA carbonyl from its attacking nucleophile was 3.49 ± 0.1 Å (Table S4). This is longer than the expected distance limit for bond formation, but it was derived from rigid-body simulations, which do not account for conformational changes in the enzyme following substrate binding or for enzyme dynamics. Despite their limitations, our docking simulations may help explain the observed substrate specificity profile of BOS and predict that it will be inefficient in catalyzing a reaction with 4-coumaroyl-CoA as a substrate.
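The geometric screen applied to the docking poses can be sketched in a few lines of Python. The sketch measures the nucleophile-to-carbonyl-carbon distance and the N···C=O attack angle from three atomic positions, then tests them against the windows quoted above (2.5 to 3.4 Å and 105 ± 5°). The function names and the synthetic coordinates are illustrative assumptions; this is not the software or scoring procedure used for the reported simulations.

```python
import math


def attack_angle_deg(nucleophile, carbonyl_c, carbonyl_o):
    """Angle (degrees) at the carbonyl carbon between the vectors pointing
    to the nucleophile and to the carbonyl oxygen (N...C=O)."""
    v1 = [n - c for n, c in zip(nucleophile, carbonyl_c)]
    v2 = [o - c for o, c in zip(carbonyl_o, carbonyl_c)]
    dot = sum(a * b for a, b in zip(v1, v2))
    norm = math.hypot(*v1) * math.hypot(*v2)
    cosine = max(-1.0, min(1.0, dot / norm))  # guard against rounding error
    return math.degrees(math.acos(cosine))


def is_productive(distance, angle, d_range=(2.5, 3.4),
                  angle_target=105.0, angle_tol=5.0):
    """True if a pose falls inside the distance and angle windows in the text."""
    in_distance = d_range[0] <= distance <= d_range[1]
    in_angle = abs(angle - angle_target) <= angle_tol
    return in_distance and in_angle


# Synthetic coordinates (in Å) built to give a 3.0 Å, 105 degree approach.
carbon = (0.0, 0.0, 0.0)
oxygen = (1.2, 0.0, 0.0)
nitrogen = (3.0 * math.cos(math.radians(105.0)),
            3.0 * math.sin(math.radians(105.0)),
            0.0)
d = math.dist(nitrogen, carbon)
angle = attack_angle_deg(nitrogen, carbon, oxygen)
print(f"distance = {d:.2f} Å, angle = {angle:.1f} degrees, "
      f"productive = {is_productive(d, angle)}")

# Distances and angles quoted in the text for two of the docked substrates.
print("acetyl-CoA:", is_productive(3.2, 102.0))
print("malonyl-CoA:", is_productive(4.58, 59.5))
```

With hard cutoffs like these, the average oxalyl-CoA distance of 3.49 Å falls just outside the window, which is why the text attributes the discrepancy to the rigid-body nature of the simulations rather than treating those poses as unproductive.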
Conclusions
This work presents the isolation, identification, and characterization of a BOS from grass pea, more than 50 years after its activity was first demonstrated and its existence proposed (21, 22, 23). BOS is part of a superfamily of BAHD acyltransferases that has at least 30 more members in grass pea. However, none of the closest sequence homologs of BOS we identified in grass pea have similar enzymatic activity. The crystal structure of BOS and the results of our molecular docking analysis suggest that both of its substrates interact within a long V-shaped active-site tunnel. They further suggest that upon binding, the thioester carbonyl of oxalyl-CoA is positioned within hydrogen-bonding distance to the terminal β-amine group of L-DAPA and to the conserved His162 residue of BOS. This likely facilitates catalysis through a proton shuttle between the highly conserved Asp166 and His162 residues and is supported by the fact that mutating either of them results in a soluble, folded yet inactive enzyme. Aside from grass pea, β-ODAP has been found in the seeds of 13 species of Crotalaria, 17 species of Acacia, and 20 additional species of Lathyrus (48, 49), all from the legume family (Fabaceae), as well as in several species of Panax (29, 50). We searched for BOS orthologs in the available transcriptomes of Panax ginseng, Panax notoginseng, Crotalaria juncea, and Acacia koa in the NCBI transcriptome shotgun assembly database. However, while we were able to find BAHD family Ib members in all of these species, none branched closely enough to BOS to consider them as orthologs (data not shown). Thus, it is unknown if these and other species that produce β-ODAP utilize a different synthetic pathway that evolved independently for this purpose or use the same pathway and express a BOS-like gene that evolved in parallel and has little sequence similarity to LsBOS.
Grass pea is a nutritious and robust crop whose seeds have a high-protein and low-fat content (4, 5). It is inexpensive, easy to grow, and tolerant to drought and high salinity as well as flooding (2) and possesses a hardy and penetrating root system that enables it to grow on a wide range of soils, including poor soils and heavy clays (51). Sometimes, it is the only surviving crop in drought-prone (and often famine-ravaged) areas in Africa and Asia (52). It has, therefore, a potential to significantly improve food security, particularly in arid and semiarid regions, which are likely to expand due to a looming climate change (53).
These attributes, however, are overshadowed by the hazards associated with the production of β-ODAP, which greatly limit its use. Indeed, the sale and storage of grass pea, in any form, have been banned in various jurisdictions in India since 1961 (2). Since the biological role of β-ODAP in grass pea is still unknown, the consequences of eliminating its production in the plant are not clear. β-ODAP has been proposed to transport zinc in soils depleted of zinc or rich in iron (54), to act as a radical scavenger, or to enhance symbiosis with Rhizobium bacteria (17, 55, 56), and while it was shown to accumulate under water stress (57, 58), its role, if any, in grass pea drought tolerance is unknown. The latter has, in fact, been attributed to upregulation of antioxidant enzymes and increased concentrations of osmoprotectants, such as proline and soluble sugars (59), as well as to an extensive root system and leaf rolling during water shortage (2). It is hoped that the identification of BOS, along with its encoding gene, will pave the way toward the generation of β-ODAP–free grass pea cultivars. These would, hopefully, preserve the various beneficial traits of grass pea while being safe for consumption, adding a valuable resource to the world’s battery of food crops.
Grass pea (L. sativus) seeds were surface sterilized by a brief wash with 70% ethanol, followed by soaking in 3% sodium hypochlorite for 20 min with constant, gentle shaking (50 rpm). They were then rinsed four times with autoclaved double-distilled water (DDW) and kept in DDW overnight (O/N) at 4 °C to synchronize their germination. For RNA extraction, the seeds were inoculated in a container supplemented with half-strength Murashige and Skoog salts and 30 g/l sucrose and gelled with 0.3% GELRITE at pH 5.8. For protein extraction, the seeds were inoculated in plastic pots containing 250 cm3 of commercial potting mixture. Seeds were germinated under a 12-h light (150 μmol m−2 s−1)/dark cycle at 25 °C for 1 week.
The gene encoding OCS was amplified from extracted genomic DNA of L. sativus, cloned into a bacterial expression vector, expressed in E. coli, and purified (27). Purified OCS (0.25 μM) was added to a reaction solution (125 mM Hepes, pH 7.5; 2 mM MgCl2; 10 mM ATP) containing oxalic acid (5 mM) and CoA (1.5 mM) and incubated at 37 °C for 1 h. The reaction mixture was sampled before the addition of OCS and at the end of the incubation. The samples, diluted twofold, were mixed with an equal volume of buffered DTNB solution (50 mM Hepes, pH 7.5; 1 mM DTNB) and incubated for 15 min at room temperature (RT). The amount of unreacted CoA was determined by comparing the absorbance of the sample solutions at 412 nm, before and after incubation with OCS. Typically, under these conditions, more than 95% of CoA reacted to form oxalyl-CoA. Reaction mixtures containing in vitro synthesized oxalyl-CoA were used directly as substrates for in vitro β-ODAP synthesis, as described in the following.
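The CoA conversion estimate described above can be sketched as a short calculation. DTNB reacts with the free thiol of CoA but not with the thioester of oxalyl-CoA, so the drop in absorbance at 412 nm tracks CoA consumption; because both samples are diluted identically, a ratio of blank-corrected readings is sufficient. The function name, the optional blank correction, and the example readings are illustrative assumptions, not values from this study.

```python
def fraction_coa_consumed(a412_before, a412_after, a412_blank=0.0):
    """Estimate the fraction of free CoA consumed from DTNB readings at 412 nm.

    a412_before: absorbance of the sample taken before adding OCS
    a412_after:  absorbance of the sample taken after incubation with OCS
    a412_blank:  absorbance of a no-CoA blank (assumed here; defaults to 0)
    """
    before = a412_before - a412_blank
    after = a412_after - a412_blank
    if before <= 0:
        raise ValueError("pre-reaction absorbance must exceed the blank")
    # Free thiol remaining is proportional to the blank-corrected absorbance.
    return 1.0 - after / before


# Hypothetical readings: 0.80 before and 0.04 after incubation.
print(round(fraction_coa_consumed(0.80, 0.04), 2))  # 0.95
```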
L-α,β-Diaminopropionic acid elimination assay
The activity of BOS was determined in purified protein fractions as described (25). Oxalyl-CoA was freshly prepared in vitro as described above. Samples containing oxalyl-CoA (1.5 mM, 40 μl) were mixed with buffered DAPA solution (0.7 mM DAPA in 50 mM Hepes, pH 7.5; 30 μl) and purified grass pea protein fractions (30 μl). The mixtures were incubated at 37 °C for 15 to 30 min in 96-deep-well plates and then mixed with OPT reagent solution (7.5 mM OPT in 50 mM boric acid, pH 9.9, with 60 mM β-mercaptoethanol; 400 μl). DAPA concentrations were determined by measuring solution absorbance at 422 nm. The reaction produces a yellow color with free DAPA and other vicinal diamine compounds, but not with primary amines or with β-ODAP. The formation of β-ODAP in the reactions, performed in plant or E. coli extracts, or in vitro, resulted in a reduction in free DAPA concentrations and was verified independently using LC–MS (detailed in “Determination of α/β-ODAP and L-DAPA concentrations by LC–MS”).
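The text does not state how absorbance readings were converted into DAPA concentrations. One common approach, assumed here purely for illustration, is a linear standard curve built from DAPA standards of known concentration. The sketch below fits such a curve by ordinary least squares and inverts it for an unknown sample; all names and numbers are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def dapa_concentration(absorbance, slope, intercept):
    """Invert the standard curve to get a concentration from an absorbance."""
    return (absorbance - intercept) / slope


# Hypothetical standards (mM DAPA) and their absorbance readings.
standards_mm = [0.0, 0.1, 0.2, 0.4]
readings = [0.05, 0.25, 0.45, 0.85]
slope, intercept = fit_line(standards_mm, readings)

# Free DAPA remaining in a hypothetical reaction well.
remaining_mm = dapa_concentration(0.45, slope, intercept)
```

A lower value of `remaining_mm` relative to a no-enzyme control would then indicate DAPA consumption in that fraction.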
Grass pea complementary DNA library preparation and sequencing
Raw data were processed using SMRTlink 4.0 software (PacBio). Circular consensus sequences were generated into full-length reads, which were clustered into full-length nonchimeric transcripts by IsoSeq3. From these transcripts, high-quality isoforms (Qscore > 30) were selected for further analysis. Since a reference genome for grass pea was not available at the time, the Coding Genome Reconstruction Tool (Cogent) was used to generate a fake coding genome on which Cupcake ToFU was used to remove redundant sequences. This pipeline yielded a nonredundant high-likelihood gene assembly. The ‘Dumb’ algorithm was used to find the longest open reading frames (ORFs). We then used the ‘ANGEL’ (62) algorithm to predict the most likely ORFs based on the length and coding potential of each given sequence. GO annotation was performed using Blast2GO (http://www.blast2go.org/), version 2.5.0. After gene-ID mapping, GO term assignment, annotation augmentation, and generic GO-slim process, the final annotation file was generated. Data were submitted as BioProject ID PRJNA662941 (http://www.ncbi.nlm.nih.gov/bioproject/662941).
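To illustrate what a longest-ORF search involves, the sketch below scans all six reading frames of a nucleotide sequence for the longest stretch running from an ATG to an in-frame stop codon. It is a simplified stand-in written for this explanation, not the actual 'Dumb' implementation, whose rules (for example, its handling of partial ORFs) may differ. The function names and the test sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def longest_orf(seq):
    """Return the longest ATG...stop ORF (stop codon included) in six frames.

    Returns an empty string when no complete ORF is present.
    """
    seq = seq.upper()
    best = ""
    for strand in (seq, reverse_complement(seq)):
        for frame in range(3):
            start = None
            for i in range(frame, len(strand) - 2, 3):
                codon = strand[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first in-frame start codon opens an ORF
                elif start is not None and codon in STOP_CODONS:
                    orf = strand[start:i + 3]
                    if len(orf) > len(best):
                        best = orf
                    start = None  # look for the next ORF in this frame
    return best


print(longest_orf("CCATGAAATTTGGGTAACC"))  # ATGAAATTTGGGTAA
```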
Proteomic analysis
Protein fractions were subjected to in-solution tryptic digestion, using suspension trapping as described (63). Briefly, protein concentration was measured using the bicinchoninic acid assay (Thermo Scientific). Protein fractions were supplemented with SDS (5%, final concentration) in 50 mM Tris–HCl, reduced with 5 mM dithiothreitol, and alkylated with 10 mM iodoacetamide in the dark. Samples were loaded onto suspension trapping microcolumns (ProtiFi) according to the manufacturer’s instructions, and the columns were washed with 90:10% methanol/50 mM ammonium bicarbonate. Samples were digested with trypsin (1:50 trypsin/protein) for 1.5 h at 47 °C. The digested peptides were eluted using 50 mM ammonium bicarbonate. Trypsin was added to this fraction, which was further incubated O/N at 37 °C. Two more elutions were made using 0.2% formic acid and 0.2% formic acid in 50% acetonitrile. The three eluates were pooled together and vacuum-centrifuged to dryness. Samples were kept at −80 °C until further analysis. Dry digested samples were dissolved in 97:3% H2O/acetonitrile + 0.1% formic acid. Peptides were separated by chromatography using nano-UPLC (10 kpsi nanoAcquity; Waters). The mobile phase was (A) H2O + 0.1% formic acid and (B) acetonitrile + 0.1% formic acid. Desalting of the samples was performed online using a reversed-phase Symmetry C18 trapping column (180 μm internal diameter, 20 mm length, 5 μm particle size; Waters). Peptides were separated using a T3 high-strength silica nanocolumn (75 μm internal diameter, 250 mm length, 1.8 μm particle size; Waters) at 0.35 μl/min and were eluted into the mass spectrometer using the following gradient: 4% to 30% B in 50 min, 30% to 90% B in 5 min, maintained at 90% B for 5 min, and then back to initial conditions.
The nano-UPLC was coupled online through a nanoelectrospray ionization emitter (10 μm tip; New Objective) to a quadrupole Orbitrap mass spectrometer (sample BOS1: Q Exactive Plus, samples BOS2-3: Fusion Lumos, BOS4: Q Exactive HFX, Thermo Scientific) using a FlexIon nanospray apparatus (Proxeon). Data were acquired in data-dependent acquisition mode, using the Top10 method. MS1 resolution was set to 120,000 (at 200 m/z), mass range of 375 to 1650 m/z, automatic gain control of 3e6, and the maximum injection time was set to 60 msec. MS2 resolution was set to 15,000, quadrupole isolation 1.7 m/z, automatic gain control of 1e5, dynamic exclusion of 20 s, and maximum injection time of 60 msec. Raw data were processed with Proteome Discoverer V2.2 (Thermo Fisher Scientific) and searched using SequestHT (64) and Mascot (v. 2.5.1) (65) engines against the translated transcriptome as the protein database, appended with common lab protein contaminants. The number of database sequence entries used for the search was 18,649 for the Dumb database and 49,324 for the Angel database. Enzyme specificity was set to trypsin, and up to two missed cleavages were allowed. Fixed modification was set to carbamidomethylation of cysteines, and variable modifications were set to oxidation of methionines, protein N-terminal acetylation, and deamidation of glutamines and asparagines. Peptide precursor ions were searched using a maximum mass deviation of 10 ppm and fragment ions using maximum mass deviation of 0.02 Da. Peptide and protein identifications were filtered at a false discovery rate of 0.01 using the decoy database strategy.
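The decoy-based filtering mentioned above can be illustrated with a minimal sketch. It ranks peptide-spectrum matches (PSMs) by score, estimates the false discovery rate at each rank as the ratio of decoy to target hits, converts these estimates to q-values, and keeps target hits at or below the threshold. This is a generic textbook version written for illustration; the procedure actually used inside Proteome Discoverer may differ, and the function name, input layout, and scores are hypothetical.

```python
def filter_by_fdr(psms, max_fdr=0.01):
    """Keep target PSM scores whose q-value is at or below max_fdr.

    psms: iterable of (score, is_decoy) pairs; higher scores are better.
    """
    ranked = sorted(psms, key=lambda p: p[0], reverse=True)

    # Running FDR estimate at each rank: decoys seen / targets seen.
    fdrs = []
    targets = decoys = 0
    for _score, is_decoy in ranked:
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        fdrs.append(decoys / max(targets, 1))

    # q-value: the lowest FDR at this rank or at any lower-scoring rank.
    qvalues = []
    running_min = float("inf")
    for fdr in reversed(fdrs):
        running_min = min(running_min, fdr)
        qvalues.append(running_min)
    qvalues.reverse()

    return [score for (score, is_decoy), q in zip(ranked, qvalues)
            if not is_decoy and q <= max_fdr]


# Hypothetical PSM scores; True marks a decoy hit.
example = [(10, False), (9, False), (8, False), (7, True), (6, False)]
print(filter_by_fdr(example))  # [10, 9, 8]
```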
The 3D crystal structure of BOS (PDB ID: 6ZBS) was optimized prior to docking using the Protein Preparation Wizard in Schrödinger Maestro Suite 2019 (Schrödinger Suite 2019-2 Protein Preparation Wizard; Epik, Schrödinger, LLC, 2019; Impact, Schrödinger, LLC, 2018; Prime, Schrödinger, LLC, 2018). Inconsistencies in the structure, such as missing side chains or hydrogens, incorrect bond orders, or side-chain orientation, were rectified during optimization (85) and the resulting structure was used for Glide docking (86, 87, 88). To support the proposed catalytic mechanism of BOS, a rotamer of Asp166 facing His162 was manually selected. The LigPrep module in Schrödinger Maestro Suite 2019 (LigPrep, Schrödinger, 2019-2) was used to convert the structures of the oxalyl-CoA, L-DAPA, β-ODAP, 4-coumaroyl-CoA, malonyl-CoA, and glutaryl-CoA from 2D to 3D, to correct improper bond distances and bond orders, to generate the ionization states of the ligands, and to minimize their energy. The optimized structures were then used for rigid ligand docking using the standard precision mode of Schrödinger’s Glide docking tool. Of the two substrates of BOS, oxalyl-CoA was docked first, using the crystal structure of PvHCT in complex with CoA and p-coumaroyl-shikimate (PDB ID: 5FAL) to generate the receptor grid, which represents the active site of the receptor for Glide ligand-docking jobs. For the docking of L-DAPA and β-ODAP, the best oxalyl-CoA docking pose was selected and truncated to a four-carbon chain, which was used as a dummy ligand for grid generation. The poses were sorted based on the Glide docking score. For docking the library of CoA-derivative ligands, L-DAPA, the nucleophile that attacks the carbonyl carbon of the ligand thioester moiety, was included as part of the BOS protein. For three-body molecular docking, L-DAPA was docked and merged as BOS–L-DAPA, with a negative charge assigned to the nucleophilic β-amine group (NH−). The docking result of oxalyl-CoA was used to generate the second grid, which was then used for docking the ligand library of 4-coumaroyl-CoA, malonyl-CoA, glutaryl-CoA, oxalyl-CoA, CoA, and acetyl-CoA using the Extra Precision (XP) Glide docking algorithm.
Transient expression of BOS in N. benthamiana leaves
The BOS-encoding gene was cloned into the binary vector pART27 (89) and transformed into A. tumefaciens (GV3101) electrocompetent cells. Transformed cells were selected on LB agar plates containing 200 μg/ml spectinomycin and 50 μg/ml gentamicin at 28 °C. Single colonies were then used to inoculate 10 ml of LB media containing the respective antibiotics and grown at 28 °C O/N. Cells were harvested (3500 rpm, 10 min), washed with 10 ml of infiltration solution (100 mM Mes buffer, 2 mM Na3PO4, 100 μM acetosyringone), and repelleted. The cell pellet was resuspended in infiltration solution to a final A600 ≈ 0.3 and incubated at RT for 2 h. The resulting Agrobacterium suspension (2 ml) was infiltrated into 4- to 5-week-old N. benthamiana leaves. A. tumefaciens cells transformed with an empty cloning vector pDGB3Ω1 were used as a control. Plants were grown under 16-h light/8-h dark conditions. Three days after agroinfiltration, the same leaves were infiltrated with 2 ml of substrate solution (1 mM L-DAPA, pH 7.5). Leaves were collected 2 days after substrate infiltration and frozen in liquid nitrogen. Ground tissue was further used for metabolic extraction. Six individual plants were used for the BOS expression group and four for each of the controls (vector only and wt).
Data and materials availability
Refined coordinates were deposited in the Protein Data Bank under accession code 6ZBS (BOS). Transcriptomic data are available on GenBank, BioProject ID PRJNA662941. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE (90) partner repository (http://www.ebi.ac.uk/pride/) with the data set identifier PXD030847. Details of all proteins identified using the “Dumb” and “Angel” databases are specified in Appendices C and D, respectively. Materials are available from the authors upon request.
Supporting information
This article contains supporting information (64, 65).