Douglas M M Soares1,2, Letícia C P Gonçalves1, Caroline O Machado1, Larissa C Esteves1, Cassius V Stevani1, Carla C Oliveira2, Felipe A Dörr3, Ernani Pinto3,4, Flávia M M Adachi2, Carlos T Hotta2, Erick L Bastos1. 1. Departamento de Química Fundamental, Instituto de Química, Universidade de São Paulo, 05508-000 São Paulo, São Paulo Brazil. 2. Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, 05508-000 São Paulo, São Paulo Brazil. 3. Departamento de Análises Clínicas e Toxicológicas, Faculdade de Ciências Farmacêuticas, Universidade de São Paulo, 05508-000 São Paulo, São Paulo Brazil. 4. Centro de Energia Nuclear na Agricultura, Universidade de São Paulo, 13400-970 Piracicaba, São Paulo Brazil.
Abstract
The l-DOPA dioxygenase of Amanita muscaria (AmDODA) participates in the biosynthesis of betalain- and hygroaurin-type natural pigments. AmDODA is encoded by the dodA gene, whose DNA sequence was inferred from cDNA and gDNA libraries almost 30 years ago. However, reports on its heterologous expression rely on either the original 5'-truncated cDNA plasmid or artificial gene synthesis. We provide unequivocal evidence that the heterologous expression of AmDODA from A. muscaria specimens is not possible by using the coding sequence previously inferred for dodA. Here, we rectify and reannotate the full-length coding sequence for AmDODA and express a 205-aa His-tagged active enzyme, which was used to produce the l-DOPA hygroaurin, a rare fungal pigment. Moreover, AmDODA and other isozymes from bacteria were submitted to de novo folding using deep learning algorithms, and their putative active sites were inferred and compared. The wide catalytic pocket of AmDODA and the presence of the His-His-His and His-His-Asp motifs can provide insight into the dual cleavage of l-DOPA at positions 2,3 and 4,5 as per the mechanism proposed for nonheme dioxygenases.
The l-DOPA dioxygenase of Amanita muscaria (AmDODA) participates in the biosynthesis of betalain- and hygroaurin-type natural pigments. AmDODA is encoded by the dodA gene, whose DNA sequence was inferred from cDNA and gDNA libraries almost 30 years ago. However, reports on its heterologous expression rely on either the original 5'-truncated cDNA plasmid or artificial gene synthesis. We provide unequivocal evidence that the heterologous expression of AmDODA from A. muscaria specimens is not possible by using the coding sequence previously inferred for dodA. Here, we rectify and reannotate the full-length coding sequence for AmDODA and express a 205-aa His-tagged active enzyme, which was used to produce the l-DOPA hygroaurin, a rare fungal pigment. Moreover, AmDODA and other isozymes from bacteria were submitted to de novo folding using deep learning algorithms, and their putative active sites were inferred and compared. The wide catalytic pocket of AmDODA and the presence of the His-His-His and His-His-Asp motifs can provide insight into the dual cleavage of l-DOPA at positions 2,3 and 4,5 as per the mechanism proposed for nonheme dioxygenases.
Molecular
studies on the biosynthesis of secondary metabolites
of basidiomycete fungi have been resurfacing because of modern heterologous
gene expression platforms and multiomic technologies.[1] Among these, the fly agaric Amanita muscaria (L.) Lam., with its white-dotted red pileus, is possibly the most
iconic mushroom-forming species.[2] Apart
from its cultural and symbolic importance, the study of the chemical
constituents of fly agaric provides us with the opportunity to explore
the evolutionary connection between plants and fungi, to establish
structure–bioactivity relationships of its metabolites, and
to unravel the biochemical pathways leading to bright natural colors.The biological and ecological functions of pigments in plants and
fungi are more complex than meets the eye.[3] The fly agaric and other basidiomycetes of the genus Amanita are pigmented by betalains and hygroaurins, whose biosynthesis requires
the enzymatic extradiol insertion of molecular oxygen into the catechol
ring of l-DOPA (Figure ).[4−6] Betalains also impart bright red-violet and orange
colors and occasionally green fluorescence[7,8] to
most plants of the order Caryophyllales and have gained considerable
attention because of their broad applicability,[9−14] prompt availability,[15] and importance
as molecular markers in phylogenetic studies.[16,17] In contrast, hygroaurins occur exclusively in some species of Amanita, Hygrocybe,[18] and Hygrophorus fungi,[19] and information on their occurrence, synthesis, and application
is virtually nonexistent.[20]
Figure 1
Biosynthesis of betalains,
hygroaurins, and related compounds.
Betacyanins and betaxanthins, the two classes of betalains, are produced
by the spontaneous coupling between betalamic acid with cyclo-DOPA
derivatives or amino acids/amines, respectively. The reaction of muscaflavin
with amines or amino acids produces hygroaurins. The dehydrative cyclization
of l-4,5-seco-DOPA is spontaneous under acidic aqueous conditions.[32] Matching colored names (red/orange) indicate
isomers. “E” designates enzymatic transformations. Amino
and carboxyl groups are presented in the uncharged forms for clarity.
Adapted with permission from Bastos, E. L.; Schliemann, W. Betalains
as Antioxidants. In Plant Antioxidants and Health, Ekiert, H. M., Ramawat, K. G., Arora, J., Eds.; Springer International
Publishing: Cham, 2021; pp 1–44. Copyright 2021 Springer Nature.
Biosynthesis of betalains,
hygroaurins, and related compounds.
Betacyanins and betaxanthins, the two classes of betalains, are produced
by the spontaneous coupling between betalamic acid with cyclo-DOPA
derivatives or amino acids/amines, respectively. The reaction of muscaflavin
with amines or amino acids produces hygroaurins. The dehydrative cyclization
of l-4,5-seco-DOPA is spontaneous under acidic aqueous conditions.[32] Matching colored names (red/orange) indicate
isomers. “E” designates enzymatic transformations. Amino
and carboxyl groups are presented in the uncharged forms for clarity.
Adapted with permission from Bastos, E. L.; Schliemann, W. Betalains
as Antioxidants. In Plant Antioxidants and Health, Ekiert, H. M., Ramawat, K. G., Arora, J., Eds.; Springer International
Publishing: Cham, 2021; pp 1–44. Copyright 2021 Springer Nature.Hygroaurins and betalains are produced by the spontaneous
coupling
of nitrogen nucleophiles, such as amino acids and amines, with either
muscaflavin or its isomer betalamic acid. The DOPA dioxygenase of
fly agaric (AmDODA, EC:1.14.99.-) catalyzes the conversion of l-DOPA into l-2,3- and l-4,5-seco-DOPA, that
spontaneously cyclize to muscaflavin and betalamic acid (Figure ).[21,22] However, the nonheme DOPA 4,5-extradiol dioxygenases of betalain-producing
plants only catalyze the formation of only l-4,5-seco-DOPA
(Figure ).[7]OBDC (l-4-(2-oxo-3-butenoic-acid)-4,5-dihydropyrrole-2-carboxylic
acid),[23] as well as stizolobinic and stizolobic
acids[24,25] are other secondary metabolites of applied
interest, whose biosynthesis is strictly related to the catalytic
properties of DOPA extradiol dioxygenases and the metabolism of seco-DOPAs.
Discerning betalamic acid, muscaflavin, and OBDC requires the use
of standards and modern mass spectrometry techniques, as these isomers
have a mass-to-charge ratio (m/z) of 212 and produce a main daughter ion of m/z 166.[26]The dodA gene (GenBank Y12886, 1629-bp) encodes
the AmDODA protein (Uniprot P87064), and its description by Hinz and
coauthors was the starting point to unravel the genetic basis of betalain
biosynthesis.[27] By screening a cDNA library
from the pileus of A. muscaria using anti-DOPA dioxygenase
antibodies, 20 positive clones were identified, all having a 612-bp
open reading frame (ORF).[27] The recombinant
expression of these cDNA clones resulted in a fully functional enzyme
as active as the native enzyme, and an extended coding sequence (687-bp)
for this 228-aa AmDODA was predicted from an upstream ATG found in
genomic clones.[28,29] However, to date AmDODA recombinant
expression was demonstrated using either the original cDNA plasmid[28,30] or artificial gene synthesis.[31]In this work, we rectify and reannotate the dodA gene and provide insights into the structures of AmDODA and other
DODAs from muscaflavin-producing organisms. Combining the DNA sequencing
of amplicons for dodA gene obtained from A. muscaria complementary and genomic DNA (cDNA and gDNA)
samples, the heterologous expression of AmDODA, and in silico protein
structure analysis, we provide the genetic basis for the heterologous
expression of AmDODA from natural resources and contribute to the
understanding of the catalytic properties and natural occurrence of
DODAs.
Results and Discussion
Attempt
to Express the Recombinant AmDODA
from the Coding DNA Sequence (CDS) Isolated from A. muscaria Specimens
To obtain the DNA template for cloning the CDS
for AmDODA, RNA was extracted from A. muscaria specimens
collected in São Paulo, Brazil and used to synthesize cDNA.
PCR amplification of cDNA samples using primers designed from the
687-bp CDS[27] produced a 784-bp long DNA
amplicon that is 97-bp longer than expected. The AmDODA CDS was inferred
by Hinz and coauthors using a partial genomic library constructed
from A. muscaria mushrooms collected in the Jorat
forest near Lausanne, Switzerland.[27,28] Sequencing
of the internal transcribed spacer elements (ITS1 and 2) of the nuclear
rDNA cluster of our specimens confirmed they correspond to the Eurasian
fly agaric (GenBank: AB080779 and AB015700, Figure S1), eliminating any inconsistencies caused by the study of
a different species.[33−37]A multiple alignment between the DNA sequences of the 784-bp
amplicon and dodA gene revealed that the exceeding
97-bp segment in the amplicon corresponds to the retention of the
first intron of dodA, probably due to a GA sequence
replacing the canonical AG signal at the 3′ splice site of
this putative intron of dodA (positions 748–749, Figure ).[27] This result was confirmed by DNA sequencing of dodA amplicons from A. muscaria gDNA (Figure a), which also revealed
that all other introns exhibited the conserved GT-AG at the 5′
and 3′ splice sites, respectively. In silico translation of
the 784-bp amplicon produced a truncated 35-aa polypeptide encoded
by a 105-bp upstream open reading frame (uORF) beginning at the same
start codon inferred for the CDS for AmDODA but ending earlier due
to a stop codon (TAG) located at the putative first intron (Figure b).
Figure 2
Comparison of the study
reported by Hinz and coauthors[27] and this
work. (a) Structures of cDNA and gDNA
clones isolated by Hinz and coauthors and the inferred 687-bp mRNA.
Introns are presented as lines and exons as colored boxes; this schematic
representation was created from the sequences reported in ref (27). The 228-aa AmDODA is
represented by the black and orange bar showing the initial and final
amino acid triads. The sequence was deduced by Hinz and coauthors
from the mRNA and is presented in Figure S3. (b) DNA amplicon obtained by PCR amplification of the CDS for AmDODA
from cDNA samples of Eurasian A. muscaria specimens
collected in São Paulo, Brazil (inset photo; credit: Prof.
C. V. Stevani). The “GA” replacing the canonical “AG”
reported by Hinz and coathors leads to the retention of the first
intron. A TAG stop codon indicates the 3′-end of the ORF used
for the in silico translation of the 784-bp DNA fragment, which would
result in a 35-aa truncated polypeptide. The structure of the dodA gene, as determined from DNA sequencing of gDNA amplicons,
is shown and the transcription initiation site is indicated. The 558-bp
CDS encoding the 185-aa AmDODA described in this work is presented.
PCR primer sites are indicated using dark blue harpoon arrows. The
complete sequence of the dodA gene is presented in Figure S2.
Comparison of the study
reported by Hinz and coauthors[27] and this
work. (a) Structures of cDNA and gDNA
clones isolated by Hinz and coauthors and the inferred 687-bp mRNA.
Introns are presented as lines and exons as colored boxes; this schematic
representation was created from the sequences reported in ref (27). The 228-aa AmDODA is
represented by the black and orange bar showing the initial and final
amino acid triads. The sequence was deduced by Hinz and coauthors
from the mRNA and is presented in Figure S3. (b) DNA amplicon obtained by PCR amplification of the CDS for AmDODA
from cDNA samples of Eurasian A. muscaria specimens
collected in São Paulo, Brazil (inset photo; credit: Prof.
C. V. Stevani). The “GA” replacing the canonical “AG”
reported by Hinz and coathors leads to the retention of the first
intron. A TAG stop codon indicates the 3′-end of the ORF used
for the in silico translation of the 784-bp DNA fragment, which would
result in a 35-aa truncated polypeptide. The structure of the dodA gene, as determined from DNA sequencing of gDNA amplicons,
is shown and the transcription initiation site is indicated. The 558-bp
CDS encoding the 185-aa AmDODA described in this work is presented.
PCR primer sites are indicated using dark blue harpoon arrows. The
complete sequence of the dodA gene is presented in Figure S2.
Reannotation of the dodA Gene
Structure
Thirteen possible ATG codons were found in the
784-bp CDS amplicon. The ATG start codon at the beginning of exon
2 (position 227–229, Figure b) is embedded in a consensus Kozak sequence for basidiomycetes
(ACUACCAUGU)[38] and remains
in the same reading frame as the original CDS for AmDODA. In silico
translation of the resulting 558-bp sequence leads to a 185-aa polypeptide
lacking the first 43 N-terminal amino acid residues
of the published 228-aa AmDODA (Figures b and S3).[27] All other start codons produce truncated polypeptides.The 558-bp DNA sequence was amplified from A. muscaria cDNA samples and cloned into a pET28b-AmDODA plasmid, which encodes
for 20 additional N-terminal amino acid residues,
including a 6×His-tag. After the transformation of E.
coli BL21(DE3) cells with the plasmid, the 205-aa His-tagged
recombinant AmDODA expression was successfully induced, and its activity
to promote the oxidation of l-DOPA was fully characterized,
as described in the following section. These results confirm the correct
CDS for AmDODA.Hinz and coauthors used genomic clones to infer
the structure of
the dodA gene and the CDS for AmDODA.[27] Even though their cDNA clones contain a 612-bp
ORF encoding for functional AmDODA, which includes the 558-bp DNA
sequence reported in this work, there are no further reports on the
cloning of the predicted 687-bp CDS and its heterologous expression
from A. muscaria specimens. The engineered production
of betalains in stable transgenic non-Caryophyllales plants, namely,
potato (Solanum tuberosum L.) and snapdragon (Antirrhinum majus L.),[30] was
performed using the original cDNA clones obtained by Hinz and coauthors.[27] Likewise, the production of fluorescent betaxanthins
in mammalian cells was reported by the expression of a plasmid containing
a synthetic codon optimized CDS for AmDODA.[31] Therefore, the rectification and reannotation of the dodA gene reported here is important for the description of the gene
and protein sequences and enables its cloning and the heterologous
expression of an active AmDODA enzyme from A. muscaria specimens. Moreover, AmDODA has the potential application as a genetically
encodable reporter for noninvasive monitoring gene expression.[39,40] The structure proposed for dodA comprises five
exons interrupted by four introns (Figure b) and the complete CDS sequence was deposited
at the GenBank (accession number MK922469).
Catalytic
Activity of AmDODA and Chemoenzymatic
Synthesis of l-DOPA-Hygroaurin
The 205-aa recombinant
His-tagged AmDODA catalyzed the oxidation of l-DOPA into
two different seco-DOPAs under slightly alkaline conditions at 25
°C. Ascorbic acid was used as an antioxidant to prevent the auto-oxidation
of l-DOPA that is known to produce browning substances (Figures S4 and S5).[41] The addition of l-DOPA to the biocatalytic system promptly
turned the reaction yellow and was accompanied by the appearance of
a broad absorption band with a maximum at 414 nm and shoulders at
approximately 380 and 450 nm (Figure a). HPLC/PDA analysis (Figure S6) and comparison with literature data[23] supports the formation of betalamic acid (λabs =
405 nm), muscaflavin (λabs = 400 nm), 2,3- and 4,5-seco-DOPA
(λabs = 385 and 383 nm, respectively), and l-DOPAxanthin (λabs 466 nm). d-DOPA is converted
into the respective stereoisomers and condensation products thereof,
that is, isobetalamic acid and isomuscaflavin, at a much lower rate
(Figure a). The maximum
specific activity was reached at pH 8.5 (Figure b), as reported for the native enzyme,[42] and the activity of AmDODA toward l-DOPA (0.5 U mg–1) was higher than d-DOPA
(0.3 U mg–1).
Figure 3
Enzyme activity. (a) Change in the UV–vis
spectra during
the reaction of d- and l-DOPA with oxygen in the
presence of AmDODA and AscH. Effect of pH (b) and l-DOPA
concentration (c) on the specific activity of AmDODA. The red shaded
region shows the 95% confidence band for the nonlinear fit of the
data to the Michaelis–Menten equation. (d) Kinetic traces of
the reaction products and intermediates formed by the oxidation of l-DOPA in the presence of AmDODA. The reaction was monitored
for up to 7 d. The structures of the seco-DOPAs are shown for clarity.
Chromatograms, peak retention time, absorption spectra, and reaction
conditions are shown in Figure S6 and were
obtained using the chromatographic condition 1. (e) Chromatogram of
products formed after 2 h of reaction at room temperature and a freezing-thawing
cycle (chromatographic condition 2). Figure S8 shows the MS2 spectra and ion fragments of muscaflavin
and betalamic acid. Experimental conditions: [AmDODA] = 1 μM,
[AscH] = 10 mM, [l-DOPA] = 2.5 mM, sodium phosphate buffer
(50 mM, pH 8.5).
Enzyme activity. (a) Change in the UV–vis
spectra during
the reaction of d- and l-DOPA with oxygen in the
presence of AmDODA and AscH. Effect of pH (b) and l-DOPA
concentration (c) on the specific activity of AmDODA. The red shaded
region shows the 95% confidence band for the nonlinear fit of the
data to the Michaelis–Menten equation. (d) Kinetic traces of
the reaction products and intermediates formed by the oxidation of l-DOPA in the presence of AmDODA. The reaction was monitored
for up to 7 d. The structures of the seco-DOPAs are shown for clarity.
Chromatograms, peak retention time, absorption spectra, and reaction
conditions are shown in Figure S6 and were
obtained using the chromatographic condition 1. (e) Chromatogram of
products formed after 2 h of reaction at room temperature and a freezing-thawing
cycle (chromatographic condition 2). Figure S8 shows the MS2 spectra and ion fragments of muscaflavin
and betalamic acid. Experimental conditions: [AmDODA] = 1 μM,
[AscH] = 10 mM, [l-DOPA] = 2.5 mM, sodium phosphate buffer
(50 mM, pH 8.5).At pH 8.5 and 25 °C,
the KM and Vmax of AmDODA were found to be 4.2 ± 0.4
mM and 2.6 ± 0.1 mM min–1 (Figure c), respectively, corresponding
to a turnover number (kcat) of 54.6 ±
0.1 min–1 (0.9 s–1), and a specificity
constant (kcat/KM) of 13.0 ± 0.1 min–1 mM–1 (2.2 × 10–4 s–1 M–1). The KM of AmDODA was nearly identical
to that of the native enzyme (3.9 mM)[42] and Zrÿd’s recombinant enzyme (4.5 mM).[28] Incubation of AmDODA with Chelex 100 to produce
the apoenzyme precludes catalysis (Figure S7), indicating the requirement of a metal cofactor. Attempts to restore
enzyme activity by adding different metal cations, including iron(II),[21] were unsuccessful, possibly due to enzyme denaturation.
Further, the participation of other metal cations such as Zn(II) and
Mn(II) in AmDODA catalysis could not be ruled out by existing experimental
evidence.Although temporal changes in absorption spectra have
been used
to investigate the catalytic activity of AmDODA and other l-DOPA dioxygenases,[28,42−45] spectral overlap makes the identification
of intermediates and products challenging.[46] Hence, the reaction kinetics were monitored by HPLC-PDA-ESI-qTOF-MS
analysis (Figures d, S6, and S8), and the area of each chromatographic
peak was converted into an estimated concentration using the kinetic
model presented in Figure S9.The
concentration of the 2,3- and 4,5-seco DOPA intermediates reached
their maxima after approximately 170 min of reaction (Figure d). Accordingly, change in
the concentration of betalamic and muscaflavin over time showed sigmoid
profiles, although betalamic acid was produced faster and in slightly
higher (about 40%) amounts. DOPA-betaxanthin, but not OBDC and DOPA-hygroaurin,
were detected under these experimental conditions. The use of a freezing–thawing
cycle promotes the regeneration of hydrolyzed betalains[47,48] and could favor the coupling of muscaflavin and l-DOPA.
To test this hypothesis, we repeated the reaction by subjecting the
mixture to a freezing–thawing cycle after 2 h at 25 °C,
which led to the formation of l-DOPA-hygroaurin (Figure e). No evidence of
hygroaurin formation was obtained when the reaction was performed
at 25 °C possibly because of its labile character.
Phylogenetic Relationships and Insights into
the Structure of AmDODA
DODAs participating in muscaflavin
biosynthesis were found in fungi (A. muscaria, AmDODA),[27,28,42] bacteria [(G. diazotrophicus, GdDODA),[43]A. cylindrica, AcDODA,[49] and E. coli, EcDODA],[49] and insect (B. mori, BmDODA)[50] but not in plants [M. jalapa, MjDODA,[51]B. vulgaris, BvDODAs[44] and P. grandiflora, PgDODA[5]]. A time
tree from these taxa, P. xenovorans, whose 2NYH protein
shows high similarity with the fungal DODA,[43] and the anthocyanin-pigmented A. thaliana was constructed
to estimate the timing of species divergence (Figure a).[52]
Figure 4
Phylogenetic
relations and structural models of DODAs. (a) Divergence
times for selected species producing dioxygenases. (b) Neighbor-Joining
consensus tree inferred from protein sequences of functionally characterized l-DOPA extradiol dioxygenases (DODAs). Different groups of organisms
are presented in the following colors: fungi (red), bacteria (orange),
insects (brown), and plants (green). Branch support values >50%
are
indicated. Except for AmDODA, whose amino acid sequence was generated
from the translation of the 558-bp CDS, protein sequences were obtained
from the Uniprot database. (c) Structural alignment of AmDODA, GdDODA,
and AcDODA. Enzymes were treated as monomeric units for simplicity,
and the cutoff distance for the α-Cs was set to 140 pm (picometer).
(d) Sequence alignment according to the structural alignment, and
the results of the metal site prediction and homology search. The
two regions of the putative catalytic pocket were assigned as I (blue)
and II (orange). The star indicates the highest score motif for all
three sequences. (e) Amino acid residues that are involved in complexation.
Triangles show triads of amino acids that were predicted to bind metal
cations in AmDODA (gray), GdDODA (green), and AcDODA (orange); underlined
aa are not conserved in all three enzymes. (f) Amino acids at the
putative catalytic pocket of the three DODAs. His (H), Glu (E), and
Asp (D) residues are colored in cyan, pink, and red, respectively.
Thick residues are from AmDODA. Colored volumes show the most probable
region of metal cation binding; AmDODA (blue), GdDODA (green), and
AcDODA (orange).
Phylogenetic
relations and structural models of DODAs. (a) Divergence
times for selected species producing dioxygenases. (b) Neighbor-Joining
consensus tree inferred from protein sequences of functionally characterized l-DOPA extradiol dioxygenases (DODAs). Different groups of organisms
are presented in the following colors: fungi (red), bacteria (orange),
insects (brown), and plants (green). Branch support values >50%
are
indicated. Except for AmDODA, whose amino acid sequence was generated
from the translation of the 558-bp CDS, protein sequences were obtained
from the Uniprot database. (c) Structural alignment of AmDODA, GdDODA,
and AcDODA. Enzymes were treated as monomeric units for simplicity,
and the cutoff distance for the α-Cs was set to 140 pm (picometer).
(d) Sequence alignment according to the structural alignment, and
the results of the metal site prediction and homology search. The
two regions of the putative catalytic pocket were assigned as I (blue)
and II (orange). The star indicates the highest score motif for all
three sequences. (e) Amino acid residues that are involved in complexation.
Triangles show triads of amino acids that were predicted to bind metal
cations in AmDODA (gray), GdDODA (green), and AcDODA (orange); underlined
aa are not conserved in all three enzymes. (f) Amino acids at the
putative catalytic pocket of the three DODAs. His (H), Glu (E), and
Asp (D) residues are colored in cyan, pink, and red, respectively.
Thick residues are from AmDODA. Colored volumes show the most probable
region of metal cation binding; AmDODA (blue), GdDODA (green), and
AcDODA (orange).Hygroaurin-producing
organisms diverged earlier than plants pigmented
by either betalains or anthocyanins, an observation that is confirmed
by the phylogenetic relationships between these species obtained comparing
the amino acid sequences of their DODAs (Figures b and S10). Although
AmDODA and plant DODAs share no common evolutive origin, AmDODA shows
a pairwise identity of 31.7% with the bacterial AcDODA and GdDODA
and all three enzymes show a significant identity with putative dioxygenases
from the proteobacterium P. xenovorans strain LB400
(PDB Databank: 2P8I_D)[53] and the filamentous cyanobacterium Nostoc punctiforme strain PCC 73102 (PDB Databank: 2PEB_A) (Table S1), whose structure has been elucidated.
Therefore, to gain further insight into the structure of AmDODA, we
used de novo folding based on deep learning algorithms to create models
of DODAs from betalain- and hygroaurin-producing organisms. The resulting
protein structures were subjected to conformational refinement, scanning
for metal binding sites, and structural alignment (Figure c).Structural differences
in the N-terminal region
of the 185-aa AmDODA fragment and the 228-aa protein described by
Hinz and coauthors did not affect the putative active site, whose
location is loosely indicated by a rectangle in Figure c and magnified in Figure S11. The active site of AmDODA, AcDODA, and GdDODA was predicted
considering the most likely binding sites for metal cations, as well
as by homology with metalloenzymes (Figure d–f and Table S2). The initial portion of AmDODA was not conserved in other
fungal dioxygenases (Figure S12), as well
as in AcDODA and GdDODA, in agreement with previous reports which
suggest that the lack of about 30 N-terminal residues
does not affect the overall stability and activity of the fungal enzyme.[27,28] These findings also corroborate our observation that the lack of
the first 43 amino acids in the 185-aa AmDODA compared to the inferred
228-aa AmDODA does not affect its catalytic properties.The
resulting structural alignment of AmDODA, AcDODA, and GdDODA
reveals that these proteins have more than 99-aa overlapping residues
(Table S1) and that a conserved His-His-His
motif in the region I of the catalytic pocket is the most likely metal
cation binding site (Figure e). This prediction is supported by the fact that this motif
includes the conserved His residue found from the multiple alignment
of all DODAs (Figure S10 and Figure d; AmDODA/H102, Pg/H177, Bv/H175,
Mj/H168, Ec/H177, Bm/H175, Ac/H88, and Gd/H91), which has been considered
essential for the catalytic activity of extradiol dioxygenase class
III enzymes,[5] and by the high homology
with the Zn(II) binding site of the putative dioxygenase from N. punctiforme PCC 73102. Notably, an Asp residue near the
3His motif is conserved in these DODAs, and proximal His-His-His and
His-His-Asp motifs are involved in Fe(II) complexation at the active
site of several extradiol dioxygenases.[21,54] Although the
His21-His23-Glu75-Trp77 motif of AmDODA has high homology with the
binding site for Mn(II) in glutamine synthetase from Salmonella
typhimurium (1F52), it is not conserved in AcDODA and GdDODA,
and hence it is unlikely to participate in the oxidation of l-DOPA. Region II has two conserved His residues and the Glu75 residue
for AmDODA or the His48 and His51 residues for GdDODA and AcDODA,
respectively, making its participation in catalysis unlikely as well.
However, the combined regions I and II make the catalytic pocket wide
enough to accommodate l-DOPA, and perhaps to favor the cyclization
of the 2,3- and 4,5-seco-DOPAs, as proposed for the EcDODA protein.[55]
Conclusion
The complete
CDS of the fly agaric l-DOPA dioxygenase
was rectified and reannotated, enabling its heterologous expression
from A. muscaria cDNA samples to produce muscaflavin
and hygroaurins. The recombinant AmDODA produced is 43-aa residues
shorter than previously reported but shows the same catalytic performance
of both the native and larger recombinant enzymes, confirming the
cloning and expression of the full-length cDNA. Hygroaurins can be
conveniently obtained using the freezing–thawing protocol described
here, making it possible to study this rare class of natural pigments.
The dual cleavage of l-DOPA by DODAs participating in muscaflavin
biosynthesis could be explained by their conserved His-His-His and
His-His-Asp motifs in the putative catalytic pocket, which is wide
enough to accommodate large substrates.
Material
and Methods
Molecular Identification of A. muscaria
A. muscaria mushrooms were collected in
Santana de Parnaíba, São Paulo, Brazil (23°28′18.8″
S, 46°51′50.2″ W) on June 20, 2018. The internal
transcribed spacer (ITS) region of its nuclear rRNA gene cluster was
analyzed for taxonomic purposes. Initially, 100 mg of the red-colored
cap tissue was disrupted using a mortar and pestle in the presence
of liquid nitrogen. Genomic DNA was extracted using the DNeasy Plant
Mini Kit (QIAGEN), following the manufacturer’s instructions.
PCR reactions were carried out in a final volume of 25 μL with
the enzyme Platinum Taq DNA Polymerase (Invitrogen) using 50 ng of
gDNA and the conserved primers flanking the rDNA region containing
the ITS sequences: ITS5F (5′-GGAAGTAAAAGTCGTAACAAGG-3′)
and ITS4R (5′-TCCTCCGCTTATTGATATGC-3′).
The reactions were incubated in the SimpliAmp thermal cycler (Applied
Biosystems) at 94 °C for 2 min, followed by 35 cycles of denaturation
at 94 °C for 30 s, primer annealing at 50 °C for 30 s, and
DNA extension at 72 °C for 1 min. PCR products were separated
by electrophoresis on a 1.5% agarose gel for 35 min at 130 V. DNA
bands (∼700 bp) were excised and purified using the GenElute
gel extraction kit (Sigma-Aldrich). Purified PCR amplicons were cloned
into pGEM-T Easy vectors (Promega) using a molar ratio of 5:1 (insert/vector),
following the manufacturer’s instructions. Chemocompetent Escherichia coli Stellar cells were transformed by ligation
reactions. Reactions were plated in selective LB media containing
ampicillin and incubated at 37 °C for 16 h. Positive clones were
confirmed by plasmid DNA extraction, followed by EcoRI digestion and electrophoresis on a 1.5% agarose gel. DNA sequencing
reactions were performed with the BigDye Terminator v3.1 Cycle Sequencing
Kit (Applied Biosystems) using 5 μL of plasmid DNA (100 ng/μL)
and 2.5 μL of 5 μM sequencing primers M13F (5′-CGCCAGGGTTTTCCCAGTCACGAC-3′)
or M13R (5′-CAGGAAACAGCTATGAC-3′). Two positive
clones were sequenced in duplicate using the Sanger method on an ABI
3730 DNA analyzer (Applied Biosystems) at the Centro de Pesquisa sobre
o Genoma Humano e Células-Tronco da Universidade de São
Paulo (CEGH-USP, Brazil). DNA sequences were analyzed using the software
Geneious Prime 2020.2.4 (Biomatters), and the consensus sequences
produced were aligned with the nucleotide database of the National
Center for Biotechnology Information (NCBI) using the online tool
Blastn.
Cloning of the Coding Sequence of dodA
DNA and mRNA samples were extracted from the
red-pigmented pileus using DNeasy and RNeasy kits (QIAGEN). cDNA was
synthesized from mRNA (1 μg) using SuperScript III reverse transcriptase
(Thermo Fisher Scientific). Primer design was carried out using the
DNA sequence of A. muscaria dodA (GenBank Y12886).[27] The coding sequence of dodA was PCR amplified from DNA and cDNA samples using the primers dodA-F
(CACCATGGTGCCAAGCTTCGTTGT) and dodA-R (CTATGCATCTCGATGGGGCGCTCT).
PCR was carried out under standard conditions using Phusion High-Fidelity
DNA Polymerase (Thermo Fisher Scientific). Samples were kept at 98
°C for 30 s (1 cycle), 98 °C for 10 s (30 cycles), 62 °C
for 30 s (1 cycle), and 72 °C for 1 min (1 cycle), followed by
a final extension phase at 72 °C for 7 min. Products were cloned
into pENTR/SD/d-TOPO (Thermo Fisher Scientific), and transformed
into DH5α-competent E. coli (Thermo Fisher
Scientific), according to the manufacturer’s instructions.
The sequence was checked by DNA sequencing of the positive colonies
using the primers M13 forward (5′-GTAAAACGACGGCCAG-3′)
and M13 reverse (5′-CAGGAAACAGCTATGAC-3′).
On the basis of the results obtained, a new CDS was proposed for dodA, which was deposited in the NCBI database (GenBank
accession number MK922469). cDNA samples from A. muscaria were used for PCR gene amplification using the AmDODA-F (5′-ACTTTAAGAAGGAGATATACATGTCCACCAAGCCAGAG-3′)
and AmDODA-R (5′-GTCGACGGAGCTCGAATTCGGTGCATCTCGATGGGGCG-3′)
primers. PCR was performed under standard conditions using Q5 High-Fidelity
DNA Polymerase (New England Biolabs). Samples were kept at 98 °C
for 30 s (1 cycle), 98 °C for 10 s (30 cycles), 60 °C for
30 s (1 cycle), and 72 °C for 1 min (1 cycle), followed by a
final extension phase at 72 °C for 2 min. PCR products were cloned
into the pET28b vector (Novagen) linearized with BamHI and NotI, following the sequence and ligation-independent
cloning (SLIC) method.[56] The recombinant
plasmid pET28b-AmDODA was confirmed by DNA sequencing.
CDS amplification from A. muscaria cDNA samples was
repeated in triplicate to eliminate any technical issues, and the
DNA sequence of the 784-bp amplicon was determined subsequently.
Expression and Purification of AmDODA
E. coli strain BL21(DE3) (New England Biolabs) was
transformed with the recombinant plasmid pET28b-AmDODA to express the C-terminal His-tagged AmDODA. 2-YT (tryptone/yeast
extract) medium (3 mL) supplemented with kanamycin (50 μg mL–1) was inoculated with E. coli BL21(DE3)
pET28b-AmDODA and maintained overnight at 37 °C
in an orbital shaker operated at 200 rpm. The cultures were transferred
to 1.0 L baffled Erlenmeyer flasks containing 250 mL 2-YT/kanamycin
medium and shaken at 200 rpm and 37 °C for approximately 2 h
to a final optical density of 0.4 to 0.6 at 600 nm. Isopropyl β-d-thiogalactopyranoside (IPTG) was added at a final concentration
of 0.5 mM to induce AmDODA expression, and the flasks were incubated
for 16 h at 30 °C. Cells were harvested by centrifugation (8000×g, 4 °C, 30 min) and resuspended in sodium phosphate
lysis buffer (Supporting Information).
Cell lysis was performed in a French Press Cell G-M (Thermo Fisher
Scientific), and the recombinant protein was purified by gravity-flow
chromatography using a nickel-charged resin Ni-NTA Agarose (QIAGEN)
equilibrated with 10 mM imidazole in lysis buffer. An elution buffer
(linear gradient of imidazole from 100 mM to 500 mM) was used to elute
AmDODA. Fractions containing the enzyme were identified by sodium
dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE)[57] using 15% polyacrylamide gels and stained using
the standard Coomassie Blue method. Pure fractions were pooled and
desalted by dialysis in a sodium phosphate buffer (50 mM, pH 7.4).
Protein concentrations were determined by the dye-binding method of
the Bradford assay (Bio-Rad); bovine serum albumin was used as the
calibration standard. The revised nucleotide sequence was deposited
in the GenBank database (accession number MK922469).[58]
Enzyme Catalysis
Oxidation of l-DOPA in the presence of AmDODA was monitored
every 18 s for
5 min by UV–vis absorption spectroscopy (300–600 nm,
scan rate: 2400 nm min–1) using a Varian Cary 50
Bio spectrophotometer, equipped with a cell holder thermostatic at
25 °C. The reaction was initiated by adding 1 μM of the
enzyme to a solution of l-DOPA (1 mM) and ascorbic acid (AscH,
10 mM) in sodium phosphate buffer (50 mM, pH 8.5) unless otherwise
stated. The pH was corrected using a base after the addition of AscH.
Product formation was monitored by the increase in absorption at 414
nm over time, and the initial rate of product formation (in μM
min–1) was calculated by linear regression, assuming
the molar absorptivity coefficient (ε) at 424 nm to be 24 000
M–1 cm–1 for both products.[43,59] The specific activity of the enzyme was calculated by dividing the
initial rate of product formation (in μM min–1) by the initial enzyme concentration (in mg L–1) and thus corresponding to the molar quantity of product converted
each minute per unit mass of enzyme, that is, μmol min–1 mg–1 or U mg–1.[60] The specific activity at each substrate concentration (l-DOPA, 0.5 to 7 mM range) was plotted, and the values of KM and Vmax were
calculated by nonlinear fitting of the data to the Michaelis–Menten
equation without considering substrate inhibition. All regression
analyses were performed using Origin 2016 software (OriginLab).
Phylogenetic Analysis and In Silico Enzyme
Modeling
The amino acid sequences of DOPA extradiol dioxygenases
of Anabaena cylindrica Lemmermann 1896 (AcDODA, GenBank
ID WP_015213489), A. muscaria (AmDODA, GenBank ID MK922469), Beta vulgaris L. (BvDODA1, GenBank ID HQ656027), Bombyx mori strain Dazao (BmDODA1, GenBank ID MG882761), E. coli (EcDODA, GenBank ID WP_000188362), Gluconacetobacter diazotrophicus (GdDODA, GenBank
ID WP_012222467), Mirabilis jalapa L. (MjDODA, GenBank ID B6F0W8), and Portulaca grandiflora (PgDODA, GenBank ID AJ580598) were
aligned using the MUSCLE (Multiple Sequence Comparison by Log-Expectation)
tool in the Geneious Prime software (Biomatters, v. 2020.2.4). Pairwise
distances obtained from this multiple sequence alignment were used
to produce a neighbor-joining consensus tree with 100 000 bootstrap
replicates. All of the amino acid sequences were also subjected to
de novo folding with restraints, using both deep residual neural networks
and homologous templates using the trRosetta server.[61] The resulting structures were refined using the GalaxyRefine2
service at the GalaxyWEB server,[62] scanned
for metal binding sites using the BioMetAll software,[63] and submitted to a homology search for metalloenzymes using
the COACH-D server.[64] Structural alignment
was performed considering a maximum distance between residues of 140
pm and restrained to a minimum number of matching residues of 50.
The timing of species divergence was estimated by constructing a time
tree from the species listed above, Paraburkholderia xenovorans, and the anthocyanin-pigmented Arabidopsis thaliana using the TimeTree online server.[52]
Chromatographic Analysis
The reaction
of l-DOPA (2.5 mM) and oxygen in the presence of AscH (10
mM) and AmDODA (1.0 μM) in sodium phosphate buffer (50 mM, pH
8.5) at 25 °C was monitored over time by HPLC-PDA using a Shimadzu
Prominence liquid chromatograph equipped with an Ascentis C18 column
(5 μm, 250 × 4.6 mm, Supelco) and an SPD-M20A detector.
The reaction was analyzed at 1.0 mL min–1 at 25
°C under a linear gradient from 2% to 60% B over 20 min (solvent
A, water; solvent B, acetonitrile, both containing 0.05% v/v formic
acid) (condition 1), or isocratic 5% B for 5 min, followed by a linear
gradient from 5% to 25% B over 15 min (condition 2). After equilibrium
was reached, the reaction mixture was subjected to HPLC-HRMS analysis
using a Shimadzu Prominence liquid chromatograph equipped with a Luna
C18 column (3 μm, 150 × 2 mm, Phenomenex), and coupled
to a Bruker Daltonics microTOF-QII mass spectrometer fitted with an
electrospray source operated in positive mode. The reaction mixture
analyzed at 0.2 mL min–1 at 30 °C under a linear
gradient from 5% to 95% B over 15 min (solvent A, 0.05% v/v formic
acid in water; solvent B, 0.05% v/v formic acid in acetonitrile). l- and d-DOPA, betalamic acid, and dopaxanthin were
used as standards (see Supporting Information). Betalamic acid was quantified by absorption spectroscopy using
a molar absorption coefficient of 24 000 M–1 cm–1 at 424 nm.[43]
Authors: Markus Gressler; Nikolai A Löhr; Tim Schäfer; Stefanie Lawrinowitz; Paula Sophie Seibold; Dirk Hoffmeister Journal: Nat Prod Rep Date: 2021-04-28 Impact factor: 13.423
Authors: Tatiana C Penna; Gustavo Cervi; André F Rodrigues-Oliveira; Bruno D Yamada; Rafael Z C Lima; Jair J Menegon; Erick L Bastos; Thiago C Correra Journal: Rapid Commun Mass Spectrom Date: 2020-02-08 Impact factor: 2.419