Teresa P Martins1,2, Nathaniel R Glasser3, Duncan J Kountz3, Paulo Oliveira4,5,6, Emily P Balskus3, Pedro N Leão1. 1. CIIMAR - Interdisciplinary Centre of Marine and Environmental Research, University of Porto, 4450-208 Matosinhos, Portugal. 2. ICBAS - Institute of Biomedical Sciences Abel Salazar, University of Porto, 4050-313 Porto, Portugal. 3. Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138, United States. 4. i3S - Institute for Research and Innovation in Health, University of Porto, 4200-135 Porto, Portugal. 5. IBMC - Institute of Molecular and Cell Biology, University of Porto, 4200-135 Porto, Portugal. 6. Department of Biology, Faculty of Sciences, University of Porto, 4169-00 Porto, Portugal.
Abstract
Nocuolin A is a cytotoxic cyanobacterial metabolite that is proposed to be produced by enzymes of the noc biosynthetic gene cluster. Nocuolin A features a 1,2,3-oxadiazine moiety, a structural feature unique among natural products and, so far, inaccessible through organic synthesis, suggesting that novel enzymatic chemistry might be involved in its biosynthesis. This heterocycle is substituted with two alkyl chains and a 3-hydroxypropanoyl moiety. We report here our efforts to elucidate the origin of the carbon skeleton of nocuolin A. Supplementation of cyanobacterial cultures with stable isotope-labeled fatty acids revealed that the central C13 chain is assembled from two medium-chain fatty acids, hexanoic and octanoic acids. Using biochemical assays, we show that a fatty acyl-AMP ligase, NocH, activates both fatty acids as acyl adenylates, which are loaded onto an acyl carrier protein domain and undergo a nondecarboxylative Claisen condensation catalyzed by the ketosynthase NocG. This enzyme is part of a phylogenetically well-defined clade within similar genomic contexts. NocG presents a unique combination of characteristics found in other ketosynthases, namely in terms of substrate specificity and reactivity. Further supplementation experiments indicate that the 3-hydroxypropanoyl moiety of 1 originates from methionine, through an as-yet-uncharacterized mechanism. This work provides ample biochemical evidence connecting the putative noc biosynthetic gene cluster to nocuolin A and identifies the origin of all its carbon atoms, setting the stage for elucidation of its unusual biosynthetic chemistry.
Nocuolin A is a cytotoxic cyanobacterial metabolite that is proposed to be produced by enzymes of the noc biosynthetic gene cluster. Nocuolin A features a 1,2,3-oxadiazine moiety, a structural feature unique among natural products and, so far, inaccessible through organic synthesis, suggesting that novel enzymatic chemistry might be involved in its biosynthesis. This heterocycle is substituted with two alkyl chains and a 3-hydroxypropanoyl moiety. We report here our efforts to elucidate the origin of the carbon skeleton of nocuolin A. Supplementation of cyanobacterial cultures with stable isotope-labeled fatty acids revealed that the central C13 chain is assembled from two medium-chain fatty acids, hexanoic and octanoic acids. Using biochemical assays, we show that a fatty acyl-AMP ligase, NocH, activates both fatty acids as acyl adenylates, which are loaded onto an acyl carrier protein domain and undergo a nondecarboxylative Claisen condensation catalyzed by the ketosynthase NocG. This enzyme is part of a phylogenetically well-defined clade within similar genomic contexts. NocG presents a unique combination of characteristics found in other ketosynthases, namely in terms of substrate specificity and reactivity. Further supplementation experiments indicate that the 3-hydroxypropanoyl moiety of 1 originates from methionine, through an as-yet-uncharacterized mechanism. This work provides ample biochemical evidence connecting the putative noc biosynthetic gene cluster to nocuolin A and identifies the origin of all its carbon atoms, setting the stage for elucidation of its unusual biosynthetic chemistry.
Cyanobacteria are well-known producers
of natural products with
intriguing structures ranging from terpenes and alkaloids to polyketides
and nonribosomal peptides.[1,2] Cyanobacterial secondary
metabolites also exhibit a wide range of pharmacologically relevant
bioactivities, mostly anticancer-related.[1] To synthesize such unique natural products, these organisms make
use of complex secondary metabolic pathways.[3] Enzymes in these pathways often catalyze challenging reactions currently
unachievable by available synthetic methodologies.Among the
most structurally unique cyanobacterial secondary metabolites
is nocuolin A (1, Figure a), initially isolated by Voráčová
and co-authors from the cyanobacterium Nostoc sp.
CCAP1453/38 on the basis of its cancer cell line cytotoxicity.[4] Metabolite 1 was later isolated
independently from Nodularia sp. LEGE 06071 and shown
to impair mitochondrial oxidative phosphorylation.[5,6] IC50 values for 1 in cancer cell line cytotoxicity
assays ranged from the high nanomolar to low micromolar.[4,5] The structure of 1 features a substituted 1,2,3-oxadiazine
moiety, unprecedented among reported natural products. In fact, natural
hydrazones per se are a very restricted class of compounds.[7] A natural product with close structural resemblance
to the nocuolin A scaffold is geralcin D (Figure a), isolated from Streptomyces sp. LMA-545.[8] It is also noteworthy that
while 1,3,4- and 1,2,4-oxadiazines have been prepared by chemical
synthesis, 1,2,3-oxadiazines are yet to be accessed synthetically.[4,9] The unique structure of 1 is likely generated by previously
unrecognized enzymatic chemistry or by a unique combination of known
biochemistry. For instance, it is unclear whether the central carbon
scaffold of 1 derives from a single fatty acid (FA) precursor
or if it is the product of carbon–carbon (C–C) bond
formation between two shorter-chain substrates. The biosynthetic origins
of the 3-hydroxypropanoyl moiety and the oxadiazine ring are also
intriguing, and so is the basis for N–N bond formation, as
there are few examples of characterized N–N bond-forming enzymes[10,11] and no homologs of such enzymes are present in the noc biosynthetic gene cluster (BGC).
Figure 1
Schematic highlighting the structure of
nocuolin A, the noc BGC, and the proposed biosynthetic
steps. (a) Chemical
structures of nocuolin A (1) and geralcin D, the natural
product with the closest structural resemblance to the nocuolin A
scaffold. (b) noc locus and its proposed relationship
with nocuolin A (1), the chlorosphaerolactylates (e.g., 2) and nocuolactylates (e.g., 3). (c) Representation
of the proposed biosynthesis of 1. (d) Detailed annotation
of noc-encoded biosynthetic enzymes (some have previously
been included in the cly BGC) with a potential role
in the biosynthesis of 1.
Schematic highlighting the structure of
nocuolin A, the noc BGC, and the proposed biosynthetic
steps. (a) Chemical
structures of nocuolin A (1) and geralcin D, the natural
product with the closest structural resemblance to the nocuolin A
scaffold. (b) noc locus and its proposed relationship
with nocuolin A (1), the chlorosphaerolactylates (e.g., 2) and nocuolactylates (e.g., 3). (c) Representation
of the proposed biosynthesis of 1. (d) Detailed annotation
of noc-encoded biosynthetic enzymes (some have previously
been included in the cly BGC) with a potential role
in the biosynthesis of 1.The initial report of the discovery of 1 by Voráčová
and co-authors suggested that a 50 kb locus (noc, Figure a) shared by the
genomes of cyanobacteria producing 1 was the likely nocuolin
BGC. This proposal was based on the presence of genes encoding fatty
acyl-AMP ligases (FAALs) and different nitrogen processing/incorporating
enzymes in this locus (Figure b).[4,12] However, several of the putative noc genes have recently been implicated in the biosynthesis
of chlorosphaerolactylates (e.g., 2, Figure b).[12] To acknowledge these findings, those genes were renamed as clyA-F, constituting the cly BGC. Our group later showed that Nodularia sp.
LEGE 06071 produces not only 1 and chlorosphaerolactylates[6] but also hybrids of these two molecules named
nocuolactylates (e.g., 3, Figure b). This discovery led us to propose that
the entire noc locus would be involved in the production
of these larger metabolites.[6] However,
a direct experimental connection between noc genes
and 1 has not been established.In this study,
we experimentally interrogate the formation of the
carbon skeleton of 1. Using stable-isotope labeled precursor
supplementation experiments and in vitro enzymatic assays, we show
that the carbon atoms of 1 originate from three different
building blocks: hexanoic acid, octanoic acid, and l-methionine.
Both fatty acids are activated by NocH (FAAL) and condensed by the
ketosynthtase (KS) NocG with high specificity. We show that NocG is
part of phylogenetically well-supported clade with no other characterized
members and that the enzyme combines, in a unique way, the specificity
and reactivity of other stand-alone KSs. l-Methionine is
converted into the 3-hydroxypropanoyl moiety through an as-yet-uncharacterized
mechanism. By identifying the origins of all carbon atoms in 1 and the enzymatic steps leading to the formation of its
C13 alkyl moiety (Figure c), our study provides a first glimpse at the biosynthetic
events that underlie the unique molecular scaffold of 1 and firmly establishes its connection to the noc pathway.
Results and Discussion
Annotation of the noc Gene
Products and Acyl-ACP
Dehydrogenase Activity of NocI and NocC
To identify the Noc
enzymes potentially involved in the biosynthesis of the carbon skeleton
of 1, the previous noc gene product
annotation from Voráčová et al.[4] was reviewed using the remote homolog detection bioinformatics
tools HHPred and Swiss Model.[13,14] The new data suggested
that NocG could act as a ketosynthase (KS) and NocI as an acyl-[acyl
carrier protein (ACP)] dehydrogenase (Figure d, Table S1).
The noc locus also encodes a second predicted acyl-ACP
dehydrogenase, NocC, as well as two putative FAALs, NocH and ClyA
(NocL), and a stand-alone ACP, ClyB (NocM). Following heterologous
expression and purification of NocC and NocI as well as the ACP ClyB
(Figure S1), both enzymes were found to
desaturate several fatty acyl-ClyB substrates (from hexanoyl- to dodecanoyl-thioesters),
generated via incubation of different chain length acyl-CoAs with
ClyB and the promiscuous 4′-phosphopantetheinyl transferase,
Sfp.[15] This result confirmed the annotation
of NocC and NocI as acyl-ACP dehydrogenases (Figures S2 and S3). We considered that these enzymes could be involved
in the formation of the carbon skeleton of 1 by generating
unsaturated intermediates suitable for C–C bond formation by
the putative KS NocG. However, we detected unsaturated versions of
the chlorosphaerolactylates in Nodularia sp. LEGE
06071 (Figure S4), in coherence with the
structures of the nocuolactylates produced by this cyanobacterium,
which are also unsaturated.[6] We also observed
a higher activity of NocC (compared to NocI) toward ClyB-bound fatty
acyl substrates (Figure S2). Taken together
these data are suggestive of an involvement of NocC in generating
the β,γ-unsaturation of the C12 chain in the
chlorosphaerolactylates.
The C13 Alkyl Chain in 1 Is
Derived from Hexanoic
and Octanoic Acids
In our previous study,[6] nocuolactylates were found to incorporate three alkyl moieties
derived from either hexanoic acid or longer FAs. The lactylate portion
of these molecules accounts for one such incorporation,[12] indicating that the two remaining moieties are
used for the biosynthesis of 1. We considered that these
building blocks could be used to generate the C13 alkyl
moiety of 1. To explore this possibility, stable isotope-labeled
FA supplementation experiments were carried out. In a first assay, Nodularia sp. LEGE 06071 was pulse fed with perdeuterated
hexanoic acid (d11-hexanoic acid) and
compared to a nonsupplemented control. Following LC-HRMS analysis,
we detected versions of 1 with shifts of m/z 11.069 and m/z 22.138, which are in agreement with
incorporation of one and two d11-hexanoic
acids, respectively, into 1 (Figure a). A shift of m/z 21.132
was also evident (Figure a) and could result from a loss of deuterium due to keto-enol
tautomerism of a putative biosynthetic intermediate. MS/MS analysis
confirmed that the two FAs are incorporated into the C13 chain (Figures S5 and S6). These results
indicate that the C13 skeleton of 1 is formed
from two FAs, implying that C–C bond formation must occur during
its assembly.
Figure 2
Supplementation of Nodularia sp. LEGE06071
with d11-hexanoic acid and d15-octanoic acid leads to d11-1/d22-1 and d13-1, respectively. LC-HRMS-derived
extracted ion chromatograms (EICs) of 1 and labeled version
with (a) d11-hexanoic acid and (b) d15-octanoic acid in Nodularia sp. LEGE 06071. (c) HRMS/MS spectrum of d13-1 and 1 ([M + H]+) highlighting
the diagnostic fragmentations supporting the structural proposal.
Supplementation of Nodularia sp. LEGE06071
with d11-hexanoic acid and d15-octanoic acid leads to d11-1/d22-1 and d13-1, respectively. LC-HRMS-derived
extracted ion chromatograms (EICs) of 1 and labeled version
with (a) d11-hexanoic acid and (b) d15-octanoic acid in Nodularia sp. LEGE 06071. (c) HRMS/MS spectrum of d13-1 and 1 ([M + H]+) highlighting
the diagnostic fragmentations supporting the structural proposal.We next sought to determine the precise length
of the FAs involved
in the biosynthesis of 1 by performing supplementation
experiments with a range of additional even-chain perdeuterated FAs
(octanoic to tetradecanoic acids). The data revealed incorporation
of a single d15-octanoic acid moiety with
the loss of two deuterium atoms (d13-1) (Figure b, Figure S6). No incorporation from FAs
with longer chains was observed (Figure S7). We also supplemented Nodularia sp. LEGE 06071
with perdeuterated heptanoic acid (d13-heptanoic acid) and found no deuterium incorporation from this substrate
(Figure S7). Overall, these experimental
results indicate that one hexanoic and one octanoic acid unit are
used to generate the C13 moiety of 1.To pinpoint the positions of the hexanoic and octanoic acid-derived
atoms in the C13 chain of 1, the [M + H]+ ion of d13-1, resulting
from supplementation of Nodularia sp. LEGE 06071
with d15-octanoic acid (m/z 312.3145), was subjected to MS/MS analysis. Two diagnostic peaks
that include position C-5 (Figure S5) were
prominent (m/z 110.096 and 127.123) and showed a
mass shift of 1.006 amu relative to the corresponding MS/MS peak for 1, indicating the incorporation of a single deuterium atom
at position 5 (Figure c). With these data, we concluded that hexanoic acid is incorporated
into positions C-6 to C-11 and octanoic acid into the remaining portion
of the C13 alkyl moiety, with a new C–C bond being
formed between C-5 and C-6.Based on the deuterium incorporation
pattern from these previous
experiments, we reasoned that C–C bond formation must occur
with loss of C1 from octanoic acid, perhaps through decarboxylation.
To test this hypothesis, Nodularia sp. LEGE06071
was supplemented with hexanoic acid-1-13C (1-13C-hexanoic acid) or octanoic acid-1-13C (1-13C-octanoic acid). Incorporation of one or two 1-13C-hexanoic
acid units into 1 was evident, and MS/MS analysis of
the labeled species was consistent with the predicted labeling at
C-4 and C-6 (Figures S8 and S9). On the
other hand, 1-13C-octanoic acid supplementation showed
a low-abundance and unspecific (from MS/MS analysis) incorporation
into 1 and other non-FA-derived molecules such as chlorophyll a and phaeophytin a, indicating that the
observed labeling pattern is caused by 13C scrambling (likely
as a result of 13CO2 recycling through the Calvin
cycle) (Figure S10). Overall, these supplementation
experiments with stable isotope-labeled FAs showed that construction
of the C13 alkyl chain of 1 involves C–C
bond formation between C-1 of hexanoic acid and C-2 of octanoic acid,
with loss of the C-1 carboxylate from the latter substrate.
NocH Activates
Hexanoic and Octanoic Acids
Having established
the origin of the C13 alkyl moiety in 1, we
tried to identify the biosynthetic machinery involved in its generation.
We considered whether the noc locus, in particular
the non-cly genes, could encode enzymes capable of
performing hexanoic and octanoic acid activation. The noc locus encodes two putative FAALs, NocH and ClyA (NocL). The latter
has been implicated in the biosynthesis of the chlorosphaerolactylates
and was predicted to activate dodecanoic acid and also decanoic and
tetradecanoic acids to a minor extent.[12] We heterologously expressed and purified each FAAL as well as their
cognate ACPs in E. coli. NocH contains
both FAAL and ACP domains and was expressed as a single polypeptide,
while ClyA and its associated ACP ClyB were expressed and purified
separately. Competition assays were performed for each FAAL using
a range of even chained FAs from hexanoic to dodecanoic acids. As
expected, ClyA mainly activated dodecanoic acid (Figure ). NocH showed a clear preference
for hexanoic and octanoic acids (Figures , S11 and S12)
and can therefore activate and load both FA substrates involved in
the generation of the C13 alkyl chain of 1. Additionally, we tested and observed loading of tetradecanoic acid
by ClyA (Figure S12), further supporting
the role of this enzyme in chlorosphaerolactylate biosynthesis. These
experiments provided the first biochemical evidence supporting the
connection between the noc locus and the synthesis
of nocuolactylates, chlorosphaerolactylates, and 1.
Figure 3
FAALs
NocH and ClyA (NocM) activate hexanoic/octanoic acids and
dodecanoic acid as acyl-ACP thioesters, respectively, in competition
assays. Shown are LC-HRMS-derived EICs of a trypsinated ACP fragment
loaded with each fatty acid.
FAALs
NocH and ClyA (NocM) activate hexanoic/octanoic acids and
dodecanoic acid as acyl-ACP thioesters, respectively, in competition
assays. Shown are LC-HRMS-derived EICs of a trypsinated ACP fragment
loaded with each fatty acid.
NocG Is a Ketosynthase That Generates the C13 Alkyl
Moiety in 1
We then considered candidate enzymes
for carrying out C–C bond formation between hexanoyl- and octanoyl-NocH
thioesters encoded within the noc BGC. Based on the
bioinformatic analysis detailed above (Figure b, Table S1),
NocG was annotated as a putative KS. Recombinant NocG was obtained
after codon optimization, expression in the toxic-protein resistant E. coli C43 strain, and cobalt resin purification
(Figures S1, S13 and Tables S2, S3). We
then carried out coupled enzymatic assays with NocG, the FAAL-ACP
NocH, and hexanoic and octanoic acid substrates. In assay mixtures,
we detected a low-abundance feature with m/z 241.1809
(Figure a), compatible
with the formation of a 13-carbon alkyl-β-ketoacid (4). The corresponding decarboxylated molecule 6-tridecanone (5) was also detected in this assay, while neither 4 nor 5 was detected in control assays lacking NocG (Figure a). Compound 4 (but not 5) was also detected in assays with
NocG and hexanoyl- and octanoyl-S-N-acetylcysteamine (SNAC) thioester substrate surrogates but in a
lower amount (Figure a). MS/MS analysis of 4 corroborated the proposed structure
(Figure S14). In addition, we repeated
the NocG enzymatic assays with different combinations of 2H- and 13C-labeled FAs and observed the formation of products
with the predicted m/z values for the corresponding
labeled versions of 4 (Figure S15). With these results, we confirmed the role of NocG as the KS responsible
for condensation of hexanoyl- and octanoyl-thioesters in the C13 chain of 1.
Figure 4
NocG belongs to a new class of KSs. (a)
LC-HRMS-derived EICs of
the β-ketoacid 4 and ketone 5 produced
by NocG upon enzymatic assays with hexanoyl- and octanoyl-thioesters. (b) LC-HRMS-derived EICs of 6 upon reduction
of 4 with sodium borohydride proving the nondecarboxylative
nature of NocG reaction. (c) Phylogenetic tree composed of NocG (*),
its homologs and other known ketosynthases (the scale bar indicates
the degree of divergence as substitutions per site).
NocG belongs to a new class of KSs. (a)
LC-HRMS-derived EICs of
the β-ketoacid 4 and ketone 5 produced
by NocG upon enzymatic assays with hexanoyl- and octanoyl-thioesters. (b) LC-HRMS-derived EICs of 6 upon reduction
of 4 with sodium borohydride proving the nondecarboxylative
nature of NocG reaction. (c) Phylogenetic tree composed of NocG (*),
its homologs and other known ketosynthases (the scale bar indicates
the degree of divergence as substitutions per site).To clarify whether the C–C bond forming
reaction catalyzed
by NocG is decarboxylative, we quenched assay mixtures with sodium
borohydride (NaBH4). Under these conditions, the β-hydroxy
acid (6) was obtained and no 6-tridecanol was obtained
(Figure b), indicating
that NocG catalyzes a nondecarboxylative Claisen condensation between
hexanoyl- and octanoyl-ACP thioesters, giving rise to a C13 β-ketoacid.Because the two acyl-ACP dehydrogenases
encoded in the noc locus (NocC and NocI) were able
to accept octanoyl-ACP
substrates (Figures S1 and S2), we considered
that NocG might accept 2-octenoyl-ACP as a substrate (since the hexanoic
acid-derived alkyl portion remains unchanged in 1). However,
NocG was unable to generate 4 in vitro when substituting
the octanoyl substrate with 2-octenoyl (Figure S16). C–C bond formation also did not occur with a single
thioester and one free FA (Figure S17a).
Furthermore, in assay mixtures with hexanoyl-CoA and octanoyl-CoA,
NocG failed to generate detectable amounts of 4 (Figure S17b). In NocG assays with one of the
FAs as a CoA thioester and the other as a NocH thioester, compound 4 was produced but in slightly lower yield than in assays
in which both FAs were activated as NocH thioesters (Figure S17b). This provides further support to the hypothesis
that NocH activates both FAs prior to NocG activity. Therefore, saturated
acyl-ACP thioesters are the substrates for this enzyme and the C–C
bond formation reaction it catalyzes does not require the action of
a dehydrogenase. Likewise, we considered that a prehydroxylated (C-3)
version of octanoic acid could be a substrate of NocG, as observed
for the heterodimer KS LstAB, during lipstatin biosynthesis.[16] Such a possibility would be consistent with
the C-4 oxymethine in 1. However, no activity was detected
when NocG and NocH were incubated with hexanoic acid and 3-hydroxyoctanoic
acid (Figure S18a) and no NocH loading
was observed for the hydroxylated FA (Figure S18b).Considering the lack of C-5 functionalization in 1, we hypothesize that 4 spontaneously decarboxylates
to give 5. Formation of such an enolizable ketone intermediate
is consistent with the presence of an abundant +21 amu peak in addition
to the expected but less prominent +22 amu peak observed upon d11-hexanoic acid supplementation of Nodularia sp. LEGE 06071 cultures (Figure a). To gain further insight into the activity
of NocG, we carried out kinetic assays with octanoyl-SNAC and variable
concentrations of hexanoyl-SNAC and observed apparent Km and Vmax values of 135 ±
29 μM and 0.224 ± 0.016 μM min–1 for the formation of 4, respectively (Figure S19).Interestingly, despite the typical promiscuity
of KSs regarding
the alkyl chain length[17−19] and the fact that both hexanoic and octanoic acids
are activated by NocH, we were unable to detect longer (C15, from two octanoic acid substrates) or shorter (C11,
from two hexanoic acid substrates) versions of the β-ketoacid
in NocG assay mixtures (Figure S20a). Such
reactivity contrasts with closely related stand-alone KSs like OleA,
which is able to accept C8 to C16 acyl-CoA substrates[18] or PpyS, which can condense C8 to C14 (with different branching)
thioesters to generate photopyrones A to H.[17,20] On the other hand, LstAB generates a C22 aliphatic skeleton with
no longer or shorter homologs having been described.[16] However, to our knowledge, the promiscuity of the LstAB
heterodimer toward fatty-acyl substrates of different chain lengths
was not tested.Consequently, the selectivity of NocG when generating 4 from its structurally very similar substrates (hexanoyl-NocH
and
octanoyl-NocH thioesters) is striking and unique. This selectivity
possibly arises from the need to control the size of downstream products,
as no longer or shorter versions of 1 could be found
in extracts of Nodularia sp. LEGE 06071 (Figure S20b). Another interesting feature of
the biosynthesis of the C13 chain of 1 arises
from the fact that NocH loads and activates the two substrates later
used by NocG in vitro. Typically, FAALs load one free fatty-acid substrate,
which can be further elongated by KSs, usually as part of polyketide
synthases assembly lines.[20,21] To our knowledge, no
previous studies have described the use of the same FAAL/ACP pair
in loading two different substrates to be used simultaneously by the
same downstream KS.To gain insight into how NocG compares to
previously characterized
KSs, we performed a phylogenetic analysis of 266 different KS sequences
from the main phylogenetic groups described so far. In addition, the
top 43 hits to NocG from BLASTp searches were also included in the
analysis (Table S4). The resulting phylogenetic
tree shows that NocG clades separately from all other characterized
KSs (Figure c). The
NocG-containing clade comprises mostly proteins from Cyanobacteria
and Actinobacteria, along with some Proteobacteria and Acidobacteria
sequences. The “NocG clade” is most closely related
to the clade that features PpyS enzymes.[22] PpyS (identity/similarity to NocG: 36/57%, respectively) and its
closest homologs are involved in the biosynthesis of photopyrones.
Analogously to the NocG C–C bond formation, this class of KSs
performs a nondecarboxylative Claisen condensation between a β-ketoacyl-ACP
thioester and an acyl-ACP thioester partner. However, unlike what
is observed for NocG, this intermediate is not released and instead
undergoes an intramolecular cyclization to generate the final pyrone
product (Figure c).[22] Previous phylogenetic studies have shown that
PpyS homologs clustered in two different clades, one including the
two characterized enzymes involved in pyrone biosynthesis (PpyS and
PyrS) and a second clade containing Nocardia, Microcystis, and other genera with no clear biosynthetic role.[22] It is within this second clade that NocG and its closest
homologs cluster; hence, this is a functionally separate clade from
pyrone-forming KSs. Sequence similarity network (SSN) analysis of
the NocG supported these findings, with NocG and PpyS forming distinct
clusters (Figure S21). Additionally, the
genomic neighborhood network analysis of the SSN data indicated that
the NocG and PpyS SSN clusters had entirely different genomic contexts
(Figure S22). Members of the SSN cluster
that encompasses NocG and its closest homologs are most often associated
with fatty acyl activation/tethering (NocH homologs) and amino group
transfer (NocF homologs) (Figure S21).
Some of these clusters harbor additional homologs of the Noc enzymes,
especially in Nocardia spp. (Figure S23). On the other hand, PpyS homologs show much more
diverse genomic contexts, mainly associated with sugar metabolism,
and possess no obvious connection to Noc-related enzymes (Figure S22, Table ). NocG also clades separately from other reported
stand-alone KSs that use β-ketoacyl-CoAs as substrates, like
MxnB (myxopyronin biosynthesis)[23] and CorB
(corallopyronin biosynthesis).[24]
Table 1
Comparison between NocG and Its Closest
Homologs, PpyS and OleA
fatty acyl activation/tethering
and amino acid transference
sugar metabolism
-a
final product
C13 alkyl-β-ketoacid
photopyrone A-H (different
chain lengths and branchings)
C19 to C32 alkyl-β-ketoacid
Genomic context of OleA was not
investigated.
Genomic context of OleA was not
investigated.Notably, our
phylogenetic analysis also shows that
despite showing
similar reactivity, NocG is phylogenetically distant from OleA, a
thiolase that performs a nondecarboxylative Claisen condensation between
two long-chain fatty acyl-CoAs in the olefin biosynthetic pathway.[18] The reaction catalyzed by NocG differs in its
use of protein-tethered thioester substrates. Key catalytic residues
are shared between NocG, PpyS, and OleA homologs—in NocG, these
are E100 (deprotonation of the acyl intermediate), C125, H275 and
N304 (covalent binding of the precursor) (Figures S24 and S25). Based on this and on the similar position of
the predicted catalytic residues (Figure S26), we hypothesize a similar reaction mechanism that starts with the
deprotonation of the α-carbon of octanoyl-NocH thioester by
Glutamate 100, which is able to form a hydrogen bond with the α-carbon
of the substrates, creating nucleophile species that subsequently
attacks the carbonyl carbon of hexanoyl-NocH thioester to form a new
C–C bond (Figure S27).[22] To confirm this hypothesis, we constructed an
E100A NocG mutant resulting in loss of detectable activity (Figure S28). Thus, NocG is the first characterized
member of a new class of KS and expands the diversity of this C–C
bond forming enzyme family.
The 3-Hydroxypropanoyl Moiety of 1 Is Derived
from l-Methionine
Having elucidated the biosynthesis
of the C13 alkyl chain of 1, we then focused
on the origin
of the remaining carbon atoms in this compound, namely the 3-hydroxypropanoyl
moiety. We noticed that NocA and NocB are homologous to methylthioribulose-1-phosphate
dehydratase (MtnB) and acireductone dioxygenase (MtnD), respectively.
These enzymes are part of the methionine salvage pathway (MTA salvage
pathway) (Figure d, Table S1), which regenerates l-methionine
from its downstream metabolic products. In Bacillus
subtilis, the MTA salvage pathway converts SAM (or
a product of SAM metabolism) to methylthioribose (MTR), which is then
phosphorylated by a methylthioribose kinase (MtnK) and converted to
methylthioribose-1-phosphate (MTR-1P) by methylthioribose-1-phosphate
isomerase (MtnA). MTR-1P is then converted to 2,3-diketo-5-methylthiopentyl-1-phosphate
(2,3-DK-MTP-1-P) by MtnB. Alignment of NocA with characterized methylthioribulose-1-phosphate
dehydratases and alignment of NocB with characterized acireductone
dioxygenases showed conservation of key residues (Figures S29 and S30), which could indicate the involvement
of methionine in the biosynthesis of 1 and potential
recycling of MTA by the salvage pathway. To confirm the suspected
activity of NocA, this enzyme was heterologously expressed in E. coli, purified, and tested in a coupled assay
with MtnK and MtnA (from B. subtilis) with MTR as the substrate.[25,26] The assay yielded 2,3-DK-MTP-1-P
in LC-HRMS analyses, confirming that NocA has methylthioribulose-1-phosphate
dehydratase activity (Figure S31).The biochemical characterization of NocA and the bioinformatically
predicted function of NocB as MTA salvage pathway enzymes support
a role for methionine or S-adenosyl methionine (SAM)
in the biosynthesis of 1. There is precedence for SAM
decarboxylation followed by transfer of its aminopropyl group to various
metabolites.[27] We hypothesized that a similar
transformation could give rise to the 3-hydroxypropanoyl moiety in
the biosynthesis of 1. To test this proposal, we supplemented Nodularia sp. LEGE 06071 with d8-l-methionine. LC-HRMS data showed a prominent M + 4 peak
in the isotope cluster of 1 ([M + H]+, Figure a), indicating that
methionine is in fact incorporated into 1. To clarify
which atoms in 1 derive from methionine, we supplemented Nodularia sp. LEGE 06071 with 13C5-l-methionine and 13C5-15N-l-methionine (Figure a). Three 13C5-l-methionine
carbon atoms were found to be incorporated into the 3-hydroxypropanoyl
moiety, as revealed by MS/MS analysis (Figure b), thereby establishing the origin of all
carbon atoms in 1. LC-HRMS/MS data from 13C5-15N-l-methionine supplementation
experiments revealed a much higher 15N incorporation into 1 in MS/MS fragments containing N-1, suggesting a direct incorporation
of one nitrogen atom from methionine. Additionally, the calculated
ratios between direct 15N incorporation (15N113C3-1/15N013C3-1) versus scrambled
single 15N incorporation in 1 (15N1-1/15N0-1) indicate a 10-fold increase as a result of direct incorporation
from 13C5-15N-l-methionine
(Figures c and S32). Further feeding with 1-13C-l-methionine also allowed us to conclude that the C-1 of l-methionine is not incorporated into 1 and, therefore,
that the 3-hydroxypropanoyl moiety is formed by l-methionine
carbons C-2 to C-4 (Figures d and S33).
Figure 5
3-Hydroxypropanoyl moiety
of 1 is derived from C-2
to C-4 and N-1 of l-methionine. (a) HRMS spectra of 1 and its stable isotope labeled versions following supplementation
with d8-l-methionine, 13C5-l-methionine, and 15N,13C5-l-methionine, confirming incorporation of
this precursor into 1. (b) HRMS/MS spectrum of 13C3-1 confirming methionine incorporation
into the 3-hydroxypropanoyl moiety. (c) Fine structure of the M peak
in the isotope clusters of [M + H]+ ions of key isotopologues
of 1 (values next to each peak correspond to peak areas,
and their ratios are tabulated), supporting incorporation of a nitrogen
atom from methionine into 1. (d) HRMS spectra of 1 and its stable isotope labeled versions following supplementation
with 1-13C-l-methionine, supporting a nonspecific
incorporation of C-1 from l-methionine. (e) Schematic representation
of incorporation of C and N atom-derived l-methionine into the 3-hydroxypropanoyl
moiety of 1.
3-Hydroxypropanoyl moiety
of 1 is derived from C-2
to C-4 and N-1 of l-methionine. (a) HRMS spectra of 1 and its stable isotope labeled versions following supplementation
with d8-l-methionine, 13C5-l-methionine, and 15N,13C5-l-methionine, confirming incorporation of
this precursor into 1. (b) HRMS/MS spectrum of 13C3-1 confirming methionine incorporation
into the 3-hydroxypropanoyl moiety. (c) Fine structure of the M peak
in the isotope clusters of [M + H]+ ions of key isotopologues
of 1 (values next to each peak correspond to peak areas,
and their ratios are tabulated), supporting incorporation of a nitrogen
atom from methionine into 1. (d) HRMS spectra of 1 and its stable isotope labeled versions following supplementation
with 1-13C-l-methionine, supporting a nonspecific
incorporation of C-1 from l-methionine. (e) Schematic representation
of incorporation of C and N atom-derived l-methionine into the 3-hydroxypropanoyl
moiety of 1.Despite the role of SAM as a reactive one-carbon
donor in the methylation
of a wide range of substrates, the prosthetic group can also transfer
amino groups, ribosyl groups, and (as mentioned above) aminopropyl
groups.[27] Recently, Cui and co-workers[28] and Barra et al.[29] have reported the transfer of a C4N group derived from
SAM to the muraymycin and altemicidin, respectively, by PLP-dependent
enzymes, further highlighting the potential of SAM as a versatile
biosynthetic precursor. Still, none of these reactions can explain
by themselves the observed incorporation pattern for stable isotope-labeled
methionine substrates into 1 (Figure e). Overall, our findings strongly indicate
that generation of the 3-hydroxypropanoyl scaffold in 1 involves unprecedented biochemical transformations.
Conclusions
This study shows that the carbon skeleton
of the cytotoxic cyanobacterial
metabolite 1 is formed from three different building
blocks: hexanoic acid, octanoic acid, and l-methionine (Figure c). Both FAs are
ultimately condensed by NocG with remarkable selectivity, forming
a new C–C bond between carbons 5 and 6. A detailed study of
NocG shows that it belongs to a new class of KSs, using saturated
acyl-ACP thioesters as substrates and whose members are found mainly
in cyanobacteria and actinobacteria. l-Methionine is incorporated
into the 3-hydroxypropanoyl moiety of 1 through an as-yet-unclear
mechanism that involves bond cleavage between C-1 and C-2. This work
provides the first biochemical evidence connecting the putative noc pathway to 1 and reveals several instances
of unusual biochemistry leading to a unique molecular scaffold. Further
investigations on the biosynthesis of 1, namely, of the
transformations leading to the 1,2,3-oxadiazine moiety, are now facilitated
by the identification of such key substrates and biosynthetic intermediates.
Authors: Sebastian W Fuchs; Kenan A J Bozhüyük; Darko Kresovic; Florian Grundmann; Veronica Dill; Alexander O Brachmann; Nicholas R Waterfield; Helge B Bode Journal: Angew Chem Int Ed Engl Date: 2013-02-19 Impact factor: 15.336
Authors: Tai L Ng; Monica E McCallum; Christine R Zheng; Jennifer X Wang; Kelvin J Y Wu; Emily P Balskus Journal: Chembiochem Date: 2019-12-19 Impact factor: 3.164
Authors: Georg Zocher; Joachim Vilstrup; Daniel Heine; Asis Hallab; Emilie Goralski; Christian Hertweck; Mark Stahl; Till F Schäberle; Thilo Stehle Journal: Chem Sci Date: 2015-08-06 Impact factor: 9.825