Isabel Pardo1, David Bednar2,3, Patricia Calero1, Daniel C Volke1, Jiří Damborský2,3, Pablo I Nikel1. 1. The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800 Kongens Lyngby, Denmark. 2. Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, 601 77 Brno, Czech Republic. 3. International Clinical Research Centre, St. Anne's University Hospital, 656 91 Brno, Czech Republic.
Abstract
Fluorinases, the only enzymes known to catalyze the transfer of fluorine to an organic molecule, are essential catalysts for the biological synthesis of valuable organofluorines. However, the few fluorinases identified so far have low turnover rates that hamper biotechnological applications. Here, we isolated and characterized putative fluorinases retrieved from systematic in silico mining and identified a nonconventional archaeal enzyme from Methanosaeta sp. that mediates the fastest SN2 fluorination rate reported to date. Furthermore, we demonstrate enhanced production of fluoronucleotides in vivo in a bacterial host engineered with this archaeal fluorinase, paving the way toward synthetic metabolism for efficient biohalogenation.
Fluorinases, the only enzymes known to catalyze the transfer of fluorine to an organic molecule, are essential catalysts for the biological synthesis of valuable organofluorines. However, the few fluorinases identified so far have low turnover rates that hamper biotechnological applications. Here, we isolated and characterized putative fluorinases retrieved from systematic in silico mining and identified a nonconventional archaeal enzyme from Methanosaeta sp. that mediates the fastest SN2 fluorination rate reported to date. Furthermore, we demonstrate enhanced production of fluoronucleotides in vivo in a bacterial host engineered with this archaeal fluorinase, paving the way toward synthetic metabolism for efficient biohalogenation.
Fluorinated organic compounds
(organofluorines), containing at least one fluorine (F) atom, are
chemicals of enormous industrial interest[1,2]—as
evidenced by their increasing prevalence in pharmaceuticals (almost
one-third of the pharma molecules in the market contain F) and agrochemicals.[3−5] The unique physicochemical properties of F endow organofluorines
with advantageous properties with respect to their nonfluorinated
counterparts, e.g. increased chemical stability or improved bioavailability.[6] However, the abundance of human-made organofluorines
contrasts with their relative scarcity in Nature.[7,8] 5′-Fluoro-5′-deoxyadenosine
(5′-FDA) synthase, or fluorinase (FlA), is the only one enzyme
known to naturally catalyze the formation of the C–F bond,
which requires a high activation energy for desolvation of the fluoride
ion (F–). This enzyme, originally identified in Streptomyces cattleya,[9,10] catalyzes the SN2 transfer of F– to the C5′ of the
essential methyl donor S-adenosyl-l-methionine
(SAM), thereby generating 5′-FDA and l-methionine
(l-Met) as products[11] (step I
in Scheme ). Since
the discovery of FlA in 2003, only six other fluorinases have been
reported in the literature, all of them sourced from actinomycetes.[12−14] A chlorinase, catalyzing 5′-chloro-5′-deoxyadenosine
(5′-ClDA) synthesis and closely related to FlAs, has also been
identified in the marine actinomycete Salinispora tropica(15) (step II in Scheme ). FlA from S. cattleya is
capable of catalyzing the chlorination reaction as well, albeit much
less efficiently than fluorination.[16] Conversely,
SalL, the chlorinase of S. tropica, cannot catalyze the formation of C–F bonds. This activity
difference has been attributed to the presence of a 23-residue loop,
present in all known FlAs but absent in SalL.[17] It was hypothesized that this loop, located near the catalytic site,
could influence halide specificity by modifying the architecture of
the binding pocket.[18]
Scheme 1
Fluorometabolite
Biosynthesis Pathways and Reactions Catalyzed by
Fluorinase/Chlorinase
Reactions catalyzed by fluorinase/chlorinase
are indicated in gray: (I) forward fluorination reaction, (II) forward
chlorination reaction, and (III) reverse chlorination reaction. The
common step in fluorometabolite biosynthetic pathways is shaded in
orange. The canonical fluoroacetate and 4-fluoro-l-threonine
biosynthetic pathway are show in purple. The 5′-fluoro-5′-deoxy-d-ribose biosynthetic route is indicated in light blue. Compound
abbreviations (blue): 5′-ClDA, 5′-chloro-5′-deoxyadenosine;
SAM, S-adenosyl-l-methionine; 5′-FDA,
5′-fluoro-5′-deoxyadenosine; 5-FDRP, 5′-fluoro-5′-deoxy-d-ribose 1-phosphate; 5-FDRulP, 5-fluoro-5-deoxy-d-ribulose
1-phosphate; FAld, fluoroacetaldehyde; FAc, fluoroacetate; 4-FT, 4-fluoro-l-threonine; 5-FDR, 5′-fluoro-5′-deoxy-d-ribose; 5-FHPA, 5-fluoro-2,3,4-trihydroxypentanoic acid. Enzyme
abbreviations (black bold): FlA, fluorinase; FlB, 5′-fluoro-5′-deoxyadenosine
phosphorylase; FlIso, 5-fluoro-5-deoxy-d-ribose 1-phosphate
isomerase; FlFT, 4-fluoro-l-threonine transaldolase; FdrA,
5-fluoro-5-deoxy-d-ribose 1-phosphate phosphoesterase; and
FdrC, 5-fluoro-5-deoxy-d-ribose dehydrogenase.
Fluorometabolite
Biosynthesis Pathways and Reactions Catalyzed by
Fluorinase/Chlorinase
Reactions catalyzed by fluorinase/chlorinase
are indicated in gray: (I) forward fluorination reaction, (II) forward
chlorination reaction, and (III) reverse chlorination reaction. The
common step in fluorometabolite biosynthetic pathways is shaded in
orange. The canonical fluoroacetate and 4-fluoro-l-threonine
biosynthetic pathway are show in purple. The 5′-fluoro-5′-deoxy-d-ribose biosynthetic route is indicated in light blue. Compound
abbreviations (blue): 5′-ClDA, 5′-chloro-5′-deoxyadenosine;
SAM, S-adenosyl-l-methionine; 5′-FDA,
5′-fluoro-5′-deoxyadenosine; 5-FDRP, 5′-fluoro-5′-deoxy-d-ribose 1-phosphate; 5-FDRulP, 5-fluoro-5-deoxy-d-ribulose
1-phosphate; FAld, fluoroacetaldehyde; FAc, fluoroacetate; 4-FT, 4-fluoro-l-threonine; 5-FDR, 5′-fluoro-5′-deoxy-d-ribose; 5-FHPA, 5-fluoro-2,3,4-trihydroxypentanoic acid. Enzyme
abbreviations (black bold): FlA, fluorinase; FlB, 5′-fluoro-5′-deoxyadenosine
phosphorylase; FlIso, 5-fluoro-5-deoxy-d-ribose 1-phosphate
isomerase; FlFT, 4-fluoro-l-threonine transaldolase; FdrA,
5-fluoro-5-deoxy-d-ribose 1-phosphate phosphoesterase; and
FdrC, 5-fluoro-5-deoxy-d-ribose dehydrogenase.Considering the environmentally harsh conditions currently
required
for the chemical synthesis of organofluorines, FlAs are promising
biocatalysts for “green” production[19] of new-to-Nature, bioderived organofluorines and for the
implementation of synthetic metabolism with fluorinated intermediates
in living cells.[20−22] However, all known FlAs are poor biocatalysts,[23] with turnover rates <1 min–1. So far, the handful of protein engineering efforts aimed at the
improvement of FlA activity have had limited success.[24−26] Furthermore, these studies mostly relied on employing surrogate
substrates, for example, 5′-ClDA, to select for enzyme variants
with improved transhalogenation activity[25] (see steps III and I in Scheme ). This strategy hampers the applicability of FlAs
in a consolidated, whole-cell bioprocess where only F– and an appropriate carbon substrate would be supplied as feedstock
to support de novo biofluorination.[27]Genome-wide databases are a rich source of potentially valuable
enzymes,[28] yet their continuous, exponential
expansion makes the selection of catalytically attractive candidates
challenging. The EnzymeMiner platform[29] has been recently developed to address this
issue as an interactive Web site (https://loschmidt.chemi.muni.cz/enzymeminer). This user-friendly bioinformatic tool searches through databases
upon submitting a sequence of at least one representative member of
the target enzyme family, together with the identification of essential
(i.e., catalytic) residues. EnzymeMiner conducts
multiple database searches and accompanying calculations, which provide
a set of hits and their systematic annotation based on protein solubility,
possible extremophilicity, domain structures, and other structural
information. These collected and calculated annotations provide users
with key information needed for the selection of the most promising
sequences for gene synthesis, small-scale protein expression, purification,
and functional characterization.[30]With the goal of expanding the FlA toolset for the biological production
of organofluorines in engineered bacterial cell factories, here we
describe the systematic screening, in vitro characterization, and
in vivo implementation of hitherto unknown FlAs retrieved from genome
databases. First, in an effort to identify “Nature’s
best” biocatalyst, the fluorinase from Streptomyces sp. MA37 (FlAMA37) was used as the query sequence (UniProt
W0W999), and the amino acid residues D16, Y77, S158, D210 and N215
were specified as essential based on their implication in catalysis
and substrate binding in EnzymeMiner (Figure a). We selected this enzyme
since it is one of the most efficient fluorinases reported in the
literature thus far, and it has been used as template for directed
evolution experiments.[12,24]
Figure 1
Putative fluorinases identified by genome
mining. (a) Residues
specified as essential for the EnzymeMiner search,
based on the crystal structure of FlAMA37 (PDB ID 5B6I). The SAM substrate
is shown as a ball-and-stick representation. (b) Phylogenetic tree
of retrieved fluorinase sequences obtained using the MEGAX software,[31] inferred using the Neighbor-Joining method with
a bootstrap of 10 000 iterations. The percentage of replicate
trees in which the associated taxa clustered together in the bootstrap
test are shown next to the branches. The tree is drawn to scale, with
branch lengths in the same units as those of the evolutionary distances
used to infer the phylogenetic tree. Sequences sourced from Actinomycetes
are highlighted as blue squares. Enzymes previously characterized
in the literature are indicated in blue bold font. (c-e) 3D structures
for FlAMA37 (c), wild-type SalL (d, PDB ID 6RYZ) and FlAPtaU1 (e, modeled with the SWISS-MODEL Alignment
Mode tool using the FlA crystal structure
PDB ID 2 V7 V as template). The loop hypothesized to differentiate
fluorinases from chlorinases is circled in a dashed gray line. Two
chains from the homotrimer for each structure are shown as cartoon
and surface representations, respectively.
Putative fluorinases identified by genome
mining. (a) Residues
specified as essential for the EnzymeMiner search,
based on the crystal structure of FlAMA37 (PDB ID 5B6I). The SAM substrate
is shown as a ball-and-stick representation. (b) Phylogenetic tree
of retrieved fluorinase sequences obtained using the MEGAX software,[31] inferred using the Neighbor-Joining method with
a bootstrap of 10 000 iterations. The percentage of replicate
trees in which the associated taxa clustered together in the bootstrap
test are shown next to the branches. The tree is drawn to scale, with
branch lengths in the same units as those of the evolutionary distances
used to infer the phylogenetic tree. Sequences sourced from Actinomycetes
are highlighted as blue squares. Enzymes previously characterized
in the literature are indicated in blue bold font. (c-e) 3D structures
for FlAMA37 (c), wild-type SalL (d, PDB ID 6RYZ) and FlAPtaU1 (e, modeled with the SWISS-MODEL Alignment
Mode tool using the FlA crystal structure
PDB ID 2 V7 V as template). The loop hypothesized to differentiate
fluorinases from chlorinases is circled in a dashed gray line. Two
chains from the homotrimer for each structure are shown as cartoon
and surface representations, respectively.After curing out redundant sequences, 16 unique candidates were
obtained (Table and Figure b). Some of the retrieved
amino acid sequences were found to be missing several N-terminal residues, which were added after manually curating the
deposited genome sequences where the fluorinase genes had been predicted
(Table S1). Out of the 16 sequences retrieved,
five corresponded to fluorinases reported in the literature (thus
serving as an internal quality control of the prediction routine),
while nine corresponded to new putative fluorinases. Another two sequences
corresponded to a site-directed mutagenesis variant of the chlorinase
from Salinispora tropica CNB-440 (SalL; carrying
point substitutions Y70T and G131S)[15] and
a putative chlorinase from the archaea Methanosaeta sp. PtaU1.Bin055 (FlAPtaU1). Both of these sequences
lack the 23-residue loop previously hypothesized to differentiate
fluorinases from chlorinases (Figure c–e). Notably, only four of all the retrieved
sequences were not sourced from Actinobacteria. These include the
putative enzymes from a Chloroflexi bacterium (Chloroflexi), Peptococcaceae bacterium CEB3 (Clostridia), Thermosulforhabdus
norvegica (Deltaproteobacteria), and Methanosaeta sp. PtaU1.Bin055 (Methanomicrobia). Phylogenetic analysis of the
16S rRNA sequences of the fluorinase-encoding organisms gave a similar
result to that obtained when using the fluorinase amino acid sequences,
except that, expectedly, S. tropica groups together with the other Actinomycetes, in a clade separate
from the one formed by Streptomyces sp. (Figure S1 and Table S2).
Table 1
Putative Fluorinases Retrieved from EnzymeMiner Analysis Using FlAMA37 as the Query
name
organism and
reference
ID (%)a
FlAMA37
Streptomyces sp. MA37[12]
query
FlAScat
Streptomyces cattleya10
87.6%
FlASxin
Streptomyces xinghaiensis13
86.0%
FlASAJ15
Streptomyces sp. SAJ15
85.0%
FlAN902
Actinoplanes sp. N902–10912
80.7%
FlAAmza
Actinopolyspora mzabensis14
78.9%
FlAAbar
Amycolatopsis bartoniae
79.1%
FlACA12
Amycolatopsis sp. CA-128772
78.6%
FlAAN11
Goodfellowiella sp. AN110305
77.7%
FlANbra2
Nocardia brasiliensis IFM 10847
75.7%
FlANbra3
Nocardia brasiliensis NCTC 11294
75.3%
FlANbra1
Nocardia brasiliensis ATCC 70035812
75.3%
FlACbac
Chloroflexi bacterium
69.3%
FlAPbac
Peptococcaceae bacterium CEB3
64.8%
FlATnor
Thermodesulforhabdus
norvegica
54.5%
FlAPtaU1
Methanosaeta sp. PtaU1.Bin055
49.5%
SalLStro
Salinispora tropica CNB-44015
35.6%
Sequence identity.
References to
known FlAs are indicated.
Sequence identity.
References to
known FlAs are indicated.The genomic context of the different flA genes
was likewise examined (Table S3). As reported
for the fluorination gene clusters of Streptomyces sp. MA37, N. brasiliensis, Actinoplanes sp. N902–109, and S. xinghaiensis, all Actinomycetes
harbor gene clusters resembling that of S. cattleya, the most studied source of fl genes described
to date[23,32] (Figure ). The genes flB (encoding a 5′-FDA
phosphorylase), flG (encoding a response regulator), flH (encoding a putative cation:H+ antiporter),
and flI (encoding a S-adenosyl-L-homocysteinase)
were highly conserved in all actinomycetes. Most of them also presented
the genes flIso (5-fluoro-5-deoxy-d-ribose
1-phosphate isomerase) and flFT (4-fluoro-l-threonine transaldolase), involved in the synthesis of fluoroacetate
and 4-fluoro-l-threonine. These are the two canonical end
fluorometabolites described thus far.[1] Also,
genes encoding a prolyl-tRNA synthetase-associated protein and an
EamA family transporter were usually found in proximity to flFT. In S. cattleya, the products of these
genes (termed fthB and fthC, respectively)
play a role in detoxification by deacylation of 4-fluoro-l-threoninyl-tRNA and export of 4-fluoro-l-threonine.[33] Interestingly, Amycolatopsis bartoniae and Goodfellowiella sp. AN110305 lacked either flIso and flFT orthologues within the fl cluster, presenting, instead, orthologues to the fdr genes from Streptomyces sp. MA37. The
genes are probably involved in the biosynthesis of 5-fluoro-2,3,4-trihydroxypentanoic
acid via the fluorosugar intermediate 5-fluoro-5-deoxy-d-ribose.[34] Further biochemical activities encoded in these
gene clusters include phosphoesterases, short chain dehydrogenases,
dihydroxyacid dehydratases and cyclases, suggesting that the main
fluorinated compounds produced by these microorganisms could be different
from the canonical fluorometabolites fluoroacetate and 4-fluoro-l-threonine. Similar activities seem to be also encoded by genes
in the vicinity of flA in Chloroflexi bacterium and salL in S. tropica.[35] Other genes widely
distributed among the different actinomycotal clusters encoded activities
related to SAM synthesis (i.e., SAM synthetase) and S-adenosyl-l-homocysteine degradation (i.e., S-adenosyl-L-homocysteinase), a competitive inhibitor of fluorinase
activity.[10] As indicated above, the latter
gene (flI) was present in all actinomycotal clusters.
Since SAM and S-adenosyl-l-homocysteine
are involved in essential cellular reactions, it is likely that these
enzymes modulate the levels of these compounds during secondary metabolism,
when organofluorines are actively produced.[36] Further analysis of the genes found in these fl clusters will provide clues as to what activities are needed to
establish robust and efficient biofluorination pathways in heterologous
hosts. This prospect is particularly exciting at the light of the
need of novel organofluorine biosynthesis enzymes that could be sourced
from environmental microbes.[1]
Figure 2
Fluorination
gene clusters in actinomycetes. For clarity, the clusters
are drawn centered on flA (identified as A) in the sense orientation. Numbers under dashed lines
indicate the distance between open reading frames (ORFs) found in
the same sequence entry; ORFs in separate entries are not connected
by a line. Italicized letters indicate orthologues to the corresponding fl genes from S. cattleya. J′ indicates
duplicate flJ copies (encoding DUF190 domain-containing
protein). FT* is a truncated pseudogene homologous
to flFT. Orthologues to fdr genes
from Streptomyces sp. MA37 are indicated as white
blocks with blue outlines. ORFs outlined in black represent genes
with other/unknown functions. MFS, major facilitator superfamily;
HTH, helix-turn-helix.
Fluorination
gene clusters in actinomycetes. For clarity, the clusters
are drawn centered on flA (identified as A) in the sense orientation. Numbers under dashed lines
indicate the distance between open reading frames (ORFs) found in
the same sequence entry; ORFs in separate entries are not connected
by a line. Italicized letters indicate orthologues to the corresponding fl genes from S. cattleya. J′ indicates
duplicate flJ copies (encoding DUF190 domain-containing
protein). FT* is a truncated pseudogene homologous
to flFT. Orthologues to fdr genes
from Streptomyces sp. MA37 are indicated as white
blocks with blue outlines. ORFs outlined in black represent genes
with other/unknown functions. MFS, major facilitator superfamily;
HTH, helix-turn-helix.Next, the coding sequences
of all FlA candidates were codon-optimized
for production in Escherichia coli as N-terminal His-tag fusions (flAMA37, flA and flA had been previously
codon-optimized for expression in Gram-negative hosts;[27] see also Tables S4 and S5). SalL was not included in this
experimental set since it is reportedly inactive on F–.[15] The expression of the 16 candidate
genes was initially evaluated in 96-well microtiter plate cultures.
FlA, FlA, and FlA could not be obtained
as soluble enzymes and were not included in further analyses. Moreover,
very faint bands of the expected size were observed in SDS-PAGE of E. coli extracts producing either FlA or FlA, suggesting limited expression levels or poor translation (Figure S2). Therefore, we proceeded to obtain
the remaining 13 candidates in medium-scale shaken-flask cultures
for His-tag purification and activity assays. The purified enzymes
were incubated in the presence of increasing SAM concentrations for
1 h, after which 5′-FDA was measured by HPLC. 5′-FDA
synthase activity could be detected for 12 out of the 13 candidates
(Figure S3). The protein concentration
was normalized for these assays, although the enzymes were recovered
with varying degrees of purity due to differences in solubility—typical
of proteins from high-G+C-content species when produced in a Gram-negative
host.[37] Notably, the enzyme from Methanosaeta sp. (FlAPtaU1, predicted to be a
chlorinase), was one of the top performers. FlASAJ15 also
had high 5′-FDA synthase activity in vitro. These two enzymes
had specific activities comparable to those of FlAMA37 and
FlA, with the highest catalytic efficiencies
on SAM-dependent SN2 fluorination reported to date.FlAPtaU1 and FlASAJ15 were selected for large-scale
shaken-flask production and a more detailed biochemical characterization.
Steady-state kinetics assays with 1 μM of the purified protein,
varying concentrations of SAM (1.5–800 μM) and 75 mM
KF revealed that both of these enzymes presented higher turnover rates
(kcat) than FlAMA37 and FlA (Figure a and Table ). In particular, the kcat of
FlAPtaU1 was 2.6-fold larger than that of FlAMA37. Surprisingly, KMSAM values
were consistently <10 μM, much lower than what had been previously
reported in the literature for fluorinases.[12−14] Notably, previous
studies used high enzyme concentrations (>10 μM), which impedes
reaching a steady state of the reaction for substrate concentrations
below 10 μM. We also used a KF concentration that ensures F– saturation without causing any inhibitory effect (previous
studies have used KF concentrations >200 mM).
Figure 3
Biochemical
characterization and residue conservation of selected
fluorinases. (a) Steady-state fluorination assays using increasing
SAM concentrations. Reactions were carried out at 37 °C in 50
mM HEPES buffer, pH = 7.8, with 75 mM KF. Dotted lines show fits to
the Michaelis–Menten equation (R2 > 0.95 in all cases). (b) End-point (1 h) transhalogenation assays
with increasing 5′-ClDA concentrations. Reactions were carried
out at 37 °C in 50 mM HEPES buffer, pH = 7.8, with 75 mM KF and
1 mM l-Met. Error bars represent standard deviations from
triplicate independent assays. Symbols and color codes are kept in
both panels. Simplified schematics for the corresponding reactions
are shown above each panel. (c–f) Variable residues in the
substrate binding pocket of FlAMA37 (c), FlASAJ15 (d), FlAPtaU1 (e), and SalL (f). Residues that differ from those of FlAMA37 are labeled in bold font, whereas conserved residues are labeled
in italics. FlA residues are identical
with those of FlAMA37. The SAM substrate is shown in ball-and-stick
representation.
Table 2
Michaelis–Menten Kinetic Constants
of Selected Fluorinasesa
fluorinase
KMSAM (μM)
kcat (min–1)
kcat/KMSAM (mM–1 min–1)
FlAMA37
4.42 ± 0.58
0.16 ± 0.01
36.36 ± 4.82
FlASxin
3.76 ± 0.15
0.22 ± 0.01
58.63 ± 2.63
FlASAJ15
9.62 ± 1.43
0.34 ± 0.01
35.81 ± 5.43
FlAPtaU1
6.99 ± 1.06
0.41 ± 0.01
57.54 ± 8.85
Assays
conducted in 50 mM HEPES,
pH = 7.8, with 75 mM KF and varying SAM concentrations incubated at
37 °C. Average and standard deviations are given for triplicate
independent measurements.
Assays
conducted in 50 mM HEPES,
pH = 7.8, with 75 mM KF and varying SAM concentrations incubated at
37 °C. Average and standard deviations are given for triplicate
independent measurements.Biochemical
characterization and residue conservation of selected
fluorinases. (a) Steady-state fluorination assays using increasing
SAM concentrations. Reactions were carried out at 37 °C in 50
mM HEPES buffer, pH = 7.8, with 75 mM KF. Dotted lines show fits to
the Michaelis–Menten equation (R2 > 0.95 in all cases). (b) End-point (1 h) transhalogenation assays
with increasing 5′-ClDA concentrations. Reactions were carried
out at 37 °C in 50 mM HEPES buffer, pH = 7.8, with 75 mM KF and
1 mM l-Met. Error bars represent standard deviations from
triplicate independent assays. Symbols and color codes are kept in
both panels. Simplified schematics for the corresponding reactions
are shown above each panel. (c–f) Variable residues in the
substrate binding pocket of FlAMA37 (c), FlASAJ15 (d), FlAPtaU1 (e), and SalL (f). Residues that differ from those of FlAMA37 are labeled in bold font, whereas conserved residues are labeled
in italics. FlA residues are identical
with those of FlAMA37. The SAM substrate is shown in ball-and-stick
representation.To gain insight on the structural
factors that could determine
these differences in fluorination activity, we inspected the predicted
crystal structures of FlAMA37, FlA, FlASAJ15, FlAPtaU1, and SalL. Examination of the amino acid residues potentially
interacting with SAM (at distances <5 Å) revealed important
variations between the substrate binding pocket of FlAPtaU1 and that of the other fluorinases known to date (Figure c–f). The alterations
could be mapped near to the adenyl moiety of SAM, and involve the
substitution of a conserved proline for an arginine residue and an
RNAA motif for YYGG. This motif is found in the C-terminal domain of other fluorinases, which is more variable than
the N-terminal domain and is presumably also involved
in hexamer formation[38] (Figure S4). Interestingly, the catalytic features found in
FlAPtaU1 do not resemble those of the SalL chlorinase, which would place FlAPtaU1 in a different functional group of SN2 halogenases. Evaluating
the effect of these amino acid differences in fluorinase activity
will be of interest for enzyme engineering efforts.Since FlAPtaU1 was predicted to be a chlorinase, we
evaluated whether it was also active in SN2-dependent addition
of Cl– to SAM. Unexpectedly, no 5′-ClDA accumulation
could be detected in enzymatic reactions in which KF was replaced
by KCl—in contrast to what has been reported for SalL.[15] Previous studies
have shown that FlA can also catalyze
the chlorination reaction.[16] However, this
feature requires the simultaneous removal of l-Met or 5′-ClDA,
the reaction products, since the reverse dehalogenation reaction is
favored. We could observe transhalogenation on 5′-ClDA (i.e.,
5′-FDA production in the presence of l-Met and F–, steps III and I in Scheme ; Figure b). Again, FlAPtaU1 catalytically outperformed
all other fluorinases, with a 3-fold higher Vmax value. Although we cannot rule out that FlAPtaU1 could also execute de novo chlorination, the 23-residue loop reportedly
found in “conventional” fluorinases is not essential
for the activity toward F–.With this background,
we tested the biosynthesis of fluorometabolites in vivo by engineering selected fluorinases in the bacterial
platform Pseudomonas putida, a robust
chassis for engineering complex chemistries using synthetic biology
tools.[39−43] We have designed a fluoride-responsive genetic circuit that enabled
biofluorination in this Gram-negative host.[27] Here, this system was adapted to express either flAPtaU1 or flASAJ15, the best-performing
fluorinases according to the kinetic parameters in Table . FlAMA37 and FlA were included in control experiments,
as we have previously used them for engineering in vivo fluorination.[27] Upon inducing gene expression with NaF (which
is also the substrate of the reaction of interest) and producing the
fluorinases for 20 h at 30 °C, 5′-FDA biosynthesis was
determined by LC-MS to evaluate de novo fluorination activity (Figure a).
Production of 5′-FDA by engineered P. putida could be detected in all cases (Figure b). Notably, the 5′-FDA
content, indicative of in vivo biofluorination, was 12-fold higher
in cells expressing flAPtaU1 with respect
to any other fluorinase gene. Fluorination activity in cell-free extracts
of P. putida incubated for 20 h
at 30 °C in the presence of exogenously added 200 μM SAM
and 5 mM NaF was similar for the fluorinases tested (Figure S5), with a higher activity detected in cell-free extracts
carrying FlAPtaU1, the Archaeal fluorinase. In the cell-free
extract assay, the final 5′-FDA concentrations detected were
within the ranges previously reported.[27,38] Interestingly,
no other fluorometabolites than 5′-FDA could be detected in
these assays.
Figure 4
Engineering in vivo biofluorination in P. putida. (a) Schematic representation of the fluoride-responsive genetic
circuit based on the T7 phage RNA polymerase (T7RNAP)[17] and workflow for the biofluorination assay. Expression
of the different fluorinase genes was induced when the cultures reached
an OD600 = 0.4–0.6 by adding NaF at 15 mM. Next,
following an incubation at 30 °C for 20 h, aliquots were taken
for metabolite extraction and quantification by LC-MS. Further details
are provided in the Supporting Information. (b) Quantification of the intracellular 5′-FDA content in
engineered P. putida expressing the different fluorinase
genes. In this case, the intracellular 5′-FDA concentration
is normalized by the cell dry weight (CDW). Black dots show individual
values from six independent biological replicates, and the error bars
represent standard deviations. Asterisks indicate significant differences
with p-values <0.1 (*) or <0.05 (**) for a
two-sample, one-sided Welch’s t-test.
Engineering in vivo biofluorination in P. putida. (a) Schematic representation of the fluoride-responsive genetic
circuit based on the T7 phage RNA polymerase (T7RNAP)[17] and workflow for the biofluorination assay. Expression
of the different fluorinase genes was induced when the cultures reached
an OD600 = 0.4–0.6 by adding NaF at 15 mM. Next,
following an incubation at 30 °C for 20 h, aliquots were taken
for metabolite extraction and quantification by LC-MS. Further details
are provided in the Supporting Information. (b) Quantification of the intracellular 5′-FDA content in
engineered P. putida expressing the different fluorinase
genes. In this case, the intracellular 5′-FDA concentration
is normalized by the cell dry weight (CDW). Black dots show individual
values from six independent biological replicates, and the error bars
represent standard deviations. Asterisks indicate significant differences
with p-values <0.1 (*) or <0.05 (**) for a
two-sample, one-sided Welch’s t-test.In conclusion, out of the 10 newly identified enzymes,
the nonconventional
FlA from the archaea Methanosaeta sp. PtaU1.Bin055
(FlAPtaU1) was found to present turnover rates superior
to those of all FlAs reported to date. Surprisingly, this enzyme lacks
the loop that was so far hypothesized to be a differentiating feature
between fluorinases and chlorinases, challenging the hypothesis that
this loop is required for activity toward F–. Engineering
this nonconventional fluorinase in P. putida mediated the highest in vivo production of 5′-FDA described
to date—and, for that matter, the highest fluorometabolite
levels reported for any biological system, either natural or engineered.
This work highlights the importance of systematic and efficient biocatalyst
selection across the ever-expanding genomic databases, followed by
careful characterization in vitro and cell factory engineering in
vivo. This study also expands the known sequence diversity for fluorinase
enzymes, helping in the identification of other nonintuitive sequence
features. Interestingly, when the mining run was repeated with either
FlAMA37 or FlAPtaU1 as query, the number of
putative fluorinase sequences retrieved (24 hits) was essentially
the same as obtained with the enzyme from S. cattleya as the template. These features will be useful
for predicting protein function(s) from genomic databases annotations.
Additionally, this fundamental knowledge will inform future engineering
endeavors of fluorinases by rational and semirational design. Taken
together, our results open avenues for the implementation of neo-metabolic pathways to incorporate F atoms in bacterial
hosts by synthetic biology approaches.
Authors: Huihua Sun; Wan Lin Yeo; Yee Hwee Lim; Xinying Chew; Derek John Smith; Bo Xue; Kok Ping Chan; Robert C Robinson; Edward G Robins; Huimin Zhao; Ee Lui Ang Journal: Angew Chem Int Ed Engl Date: 2016-10-14 Impact factor: 15.336
Authors: Alessandra S Eustáquio; Ryan P McGlinchey; Yuan Liu; Christopher Hazzard; Laura L Beer; Galina Florova; Mamoun M Alhamadsheh; Anna Lechner; Andrew J Kale; Yoshihisa Kobayashi; Kevin A Reynolds; Bradley S Moore Journal: Proc Natl Acad Sci U S A Date: 2009-07-09 Impact factor: 11.205
Authors: Xiaofeng Zhu; David A Robinson; Andrew R McEwan; David O'Hagan; James H Naismith Journal: J Am Chem Soc Date: 2007-11-07 Impact factor: 15.419
Authors: Hai Deng; Steven L Cobb; Andrew R McEwan; Ryan P McGlinchey; James H Naismith; David O'Hagan; David A Robinson; Jonathan B Spencer Journal: Angew Chem Int Ed Engl Date: 2006-01-23 Impact factor: 15.336