Chin-Yuan Chang1, Jeremy R Lohman1, Hongnan Cao2, Kemin Tan3, Jeffrey D Rudolf1, Ming Ma1, Weijun Xu2, Craig A Bingman4, Ragothaman M Yennamalli2,5, Lance Bigelow3, Gyorgy Babnigg3, Xiaohui Yan1, Andrzej Joachimiak3, George N Phillips2, Ben Shen1. 1. Department of Chemistry, The Scripps Research Institute , Jupiter, Florida 33458, United States. 2. BioScience at Rice and Department of Chemistry, Rice University , Houston, Texas 77251, United States. 3. Midwest Center for Structural Genomics and Structural Biology Center, Biosciences Division, Argonne National Laboratory , Argonne, Illinois 60439, United States. 4. Department of Biochemistry, University of Wisconsin-Madison , Madison, Wisconsin 53705, United States. 5. Department of Biotechnology and Bioinformatics, Jaypee University of Information Technology , Waknaghat, Himachal Pradesh, India 173234.
Abstract
C-1027 is a chromoprotein enediyne antitumor antibiotic produced by Streptomyces globisporus. In the last step of biosynthesis of the (S)-3-chloro-5-hydroxy-β-tyrosine moiety of the C-1027 enediyne chromophore, SgcE6 and SgcC compose a two-component monooxygenase that hydroxylates the C-5 position of (S)-3-chloro-β-tyrosine. This two-component monooxygenase is remarkable for two reasons. (i) SgcE6 specifically reacts with FAD and NADH, and (ii) SgcC is active with only the peptidyl carrier protein (PCP)-tethered substrate. To address the molecular details of substrate specificity, we determined the crystal structures of SgcE6 and SgcC at 1.66 and 2.63 Å resolution, respectively. SgcE6 shares a similar β-barrel fold with the class I HpaC-like flavin reductases. A flexible loop near the active site of SgcE6 plays a role in FAD binding, likely by providing sufficient space to accommodate the AMP moiety of FAD, when compared to that of FMN-utilizing homologues. SgcC shows structural similarity to a few other known FADH2-dependent monooxygenases and sheds light on some biochemically but not structurally characterized homologues. The crystal structures reported here provide insights into substrate specificity, and comparison with homologues provides a catalytic mechanism of the two-component, FADH2-dependent monooxygenase (SgcE6 and SgcC) that catalyzes the hydroxylation of a PCP-tethered substrate.
C-1027 is a chromoprotein enediyne antitumor antibiotic produced by Streptomyces globisporus. In the last step of biosynthesis of the (S)-3-chloro-5-hydroxy-β-tyrosine moiety of the C-1027 enediyne chromophore, SgcE6 and SgcC compose a two-component monooxygenase that hydroxylates the C-5 position of (S)-3-chloro-β-tyrosine. This two-component monooxygenase is remarkable for two reasons. (i) SgcE6 specifically reacts with FAD and NADH, and (ii) SgcC is active with only the peptidyl carrier protein (PCP)-tethered substrate. To address the molecular details of substrate specificity, we determined the crystal structures of SgcE6 and SgcC at 1.66 and 2.63 Å resolution, respectively. SgcE6 shares a similar β-barrel fold with the class I HpaC-like flavin reductases. A flexible loop near the active site of SgcE6 plays a role in FAD binding, likely by providing sufficient space to accommodate the AMP moiety of FAD, when compared to that of FMN-utilizing homologues. SgcC shows structural similarity to a few other known FADH2-dependent monooxygenases and sheds light on some biochemically but not structurally characterized homologues. The crystal structures reported here provide insights into substrate specificity, and comparison with homologues provides a catalytic mechanism of the two-component, FADH2-dependent monooxygenase (SgcE6 and SgcC) that catalyzes the hydroxylation of a PCP-tethered substrate.
C-1027 is a chromoprotein enediyne antitumor
antibiotic isolated
from Streptomyces globisporus, consisting of an apoprotein
and the C-1027 chromophore.[1] The structure
of the C-1027 chromophore is comprised of a nine-membered enediyne
core with three appended moieties, including a deoxyaminosugar, benzoxazolinate,
and (S)-3-chloro-5-hydroxy-β-tyrosine (Figure A).[1] The enediynes share a common mechanism of action. The enediyne
core undergoes a Bergman cycloaromatization to generate a transient
benzenoid diradical capable of abstracting hydrogen atoms from DNA,
leading to double-strand breaks.[2] C-1027
is unique in that it can also generate interstrand cross-links under
anaerobic conditions.[3,4] The β-tyrosine moiety of
C-1027 plays an important role in determining the ratio of double-strand
breaks to interstrand cross-links, such that removal of the C-20′
chloride or the C-22′ hydroxyl functionalities almost abolishes
interstrand cross-links and decreases the number of double-strand
breaks by 2–20-fold.[4,5]
Figure 1
Structures of the C-1027
chromophore and engineered analogues.
(A) The C-1027 chromophore consists of four components: a nine-membered
enediyne core, a benzoxazolinate, a deoxyaminosugar, and (S)-3-chloro-5-hydroxy-β-tyrosine. (B) Biosynthesis
of the (S)-3-chloro-5-hydroxy-β-tyrosyl moiety
of the C-1027 chromophore from l-tyrosine, featuring SgcC3-catalyzed
chlorination and SgcC-catalyzed hydroxylation, both of which prefer
a SgcC2-tethered substrate.
Structures of the C-1027
chromophore and engineered analogues.
(A) The C-1027 chromophore consists of four components: a nine-membered
enediyne core, a benzoxazolinate, a deoxyaminosugar, and (S)-3-chloro-5-hydroxy-β-tyrosine. (B) Biosynthesis
of the (S)-3-chloro-5-hydroxy-β-tyrosyl moiety
of the C-1027 chromophore from l-tyrosine, featuring SgcC3-catalyzed
chlorination and SgcC-catalyzed hydroxylation, both of which prefer
a SgcC2-tethered substrate.Deletion of the halogenase gene sgcC3 and
monooxygenase
gene sgcC from S. globisporus afforded
mutant strains that produced 20′-deschloro-C-1027 and 22′-deshydroxy-C-1027,
respectively (Figure A).[1,6] Characterization of the (S)-3-chloro-5-hydroxy-β-tyrosyl moiety biosynthetic pathway
(Figure B) demonstrated
that (i) SgcC4 is an aminomutase that converts l-tyrosine
to (S)-β-tyrosine,[7] (ii) SgcC1 is a free-standing nonribosomal peptide synthetase (NRPS)
adenylation protein that loads (S)-β-tyrosine
onto the free-standing peptidyl carrier protein (PCP), SgcC2, leading
to (S)-β-tyrosyl-S-SgcC2,[6,8] (iii) SgcC3 is a FAD-dependent halogenase that catalyzes regioselective
chlorination of (S)-β-tyrosyl-S-SgcC2 to form (S)-3-chloro-β-tyrosyl-S-SgcC2,[9] and finally (iv) SgcC
is a FADH2-dependent monooxygenase that catalyzes the hydroxylation
of (S)-3-chloro-β-tyrosyl-S-SgcC2 to form (S)-3-chloro-5-hydroxy-β-tyrosyl-S-SgcC2.[10] The SgcE6 flavin reductase
supplies FADH2 to both SgcC3 and SgcC.[11]Three families of enzymes catalyze most hydroxylation
reactions
in natural product biosynthesis, cytochrome P450 monooxygenases, α-ketoglutarate-dependent
non-heme iron monooxygenases, and flavin-dependent monooxygenases.[12] Flavin-dependent monooxygenases are further
divided into two different classes, one-component and two-component
monooxygenases, which are not related by similarities in sequence
or structure.[13−15] The one-component systems use an FMN or FAD prosthetic
group and can directly reduce it via NAD(P)H, while the two-component
systems consist of a flavin reductase component and a flavin-dependent
monooxygenase component.[16−21] The flavin reductase and monooxygenase components have been further
divided into subfamilies, classes, or groups. The flavin reductase
components can be divided into two groups on the basis of whether
flavin is a substrate (class I) or a prosthetic group (class II).
Class I flavin reductases can be further divided into at least three
subfamilies based on sequence and structure as exemplified by (i)
Fre from Escherichia coli, (ii) FRaseI from Vibrio fischeri, and (iii) HpaC from E. coli W (EcHpaC).[20] The monooxygenase components
have been divided into groups, A–H, based on sequence/structure
and function, with groups C–F having a flavin reductase component.[14,22]SgcE6 and SgcC form a two-component monooxygenase system.
On the
basis of previous bioinformatics analysis, SgcE6 belongs to the HpaC-like
flavin reductase subfamily possessing conserved (S/T/C)XXPP and GDH
motifs.[11,20] This subfamily contains a dual-substrate
binding pocket to accommodate FAD/FMN and NAD(P)H. The HpaC-like family
catalyzes a two-electron transfer from NAD(P)H to generate reduced
flavin.[23] SgcE6 uses FAD and NADH but not
NADPH or FMN as substrates and provides FADH2 to SgcC and
SgcC3 through free diffusion.[11] On the
basis of previous bioinformatics analysis, SgcC belongs to a group
of monooxygenases that act on p-hydroxyphenylacetate
(4HPA), such as HpaB from E. coli strain W (EcHpaB)[24] and HpaA from Pseudomonas aeruginosa (PaHpaA).[25] Thus, SgcC falls into “group
D”, whereas most monooxygenases in natural product biosynthesis
are in “group A”.[22] The “group
D” monooxygenases share structural homology with acyl-CoA dehydrogenases
(ACAD) and are split into two types represented by HpaB from E. coli W (EcHpaB) and HpaH from Acinetobacter baumanii (AbHpaH), which are 520 and 422 residues in length, respectively.[22] Structures of HpaB from Thermus thermophilis HB8 (TtHpaB),[17] which is ∼25%
identical to EcHpaB and similar in length, reveals mechanistic insights
into the catalysis of EcHpaB and SgcC homologues.In vitro,
SgcC efficiently catalyzes the regioselective hydroxylation
of 3-substituted β-tyrosyl-S-SgcC2 analogues,
including F-, Cl-, Br-, I-, and methyl-substituted analogues; however,
a 3-OH-substituted analogue was not hydroxylated.[10] Two-component monooxygenases typically catalyze hydroxylations
of small molecule substrates,[17,19,26−29] whereas SgcC is active on only carrier protein-tethered substrates.
The other known carrier protein-dependent oxidase is BtrO that participates
in the biosynthesis of butirosin.[30] However,
BtrO is an FMN-dependent, two-component monooxygenase, which catalyzes
hydroxylation at the α carbon of the acyl-S-ACP substrate and falls into “group C”. Other putative
homologues of SgcC that might act on carrier protein-tethered substrate
can be found in the nine-membered enediyne biosynthetic pathways of
kedarcidin (KedY),[31] maduropeptin (MdpC),[32] and sporolides A and B (SpoT3).[33] There have been few studies investigating two-component
monooxygenases in which the carrier protein plays an essential role;
thus, a structure of SgcC could provide insight into its family.Enediynes are promising candidates for anticancer therapies, and
alteration of the appended moieties can have important impacts on
biological function. Homologues of sgcC and sgcE6 are widespread in nine-membered enediyne biosynthetic
gene clusters, with sgcE6 being conserved. Here,
we present the crystal structures of the PCP-dependent two-component
monooxygenase, SgcC, and the flavin reductase, SgcE6. The structure
of SgcC reveals insight into the “group D” class of
flavin-dependent monooxygenases that act on carrier protein-tethered
substrates. The molecular details responsible for the substrate specificity
of SgcC could now be explored and exploited for protein engineering,
potentially leading to new enediyne analogues.
Materials and Methods
Gene Cloning
and Production and Purification of SgcE6
The sgcE6 gene from S. globisporus was amplified from genomic
DNA by polymerase chain reaction (PCR)
using two primers, SgcE6-F and SgcE6-R (Table S1), and subcloned into expression vector pMCSG73,[34] yielding APC109096 (pBS1159). This construct
produced a fusion protein containing an N-terminal NusA, followed
by a His6 tag and a TEV protease cleavage site with the
target protein, which leaves an N-terminal Ser-Asn-Ala sequence after
TEV cleavage.To overproduce the selenomethionyl (SeMet)-SgcE6
protein, the APC109096 construct was transformed into E. coliBL21(DE3)-Gold (Stratagene), and the bacterial culture was then
grown at 37 °C and 190 rpm in 1 L of enriched M9 medium[35] until it reached an OD600 of 1.0.
After the sample had been cooled in air at 4 °C for 60 min, methionine
biosynthesis inhibitory amino acids (l-valine, l-isoleucine, l-leucine, l-lysine, l-threonine,
and l-phenylalanine, each at 25 mg/L) and 90 mg/L selenomethionine
were added. Protein overproduction was induced by 0.5 mM isopropyl
β-d-thiogalactoside (IPTG). The cells were incubated
overnight at 18 °C and subsequently harvested and resuspended
in lysis buffer [500 mM NaCl, 5% (v/v) glycerol, 50 mM HEPES (pH 8.0),
20 mM imidazole, and 10 mM β-mercaptoethanol]. The cells were
disrupted by sonication. The insoluble cellular material was removed
by centrifugation. SeMet-SgcE6 was purified using Ni-NTA affinity
chromatography and the ÄKTAxpress system (GE Healthcare Life
Sciences). The N-terminal tag was cleaved from pure protein using
recombinant His6-tagged TEV protease (Sigma), and an additional
step of Ni-NTA affinity chromatography was performed to remove the
protease, uncut protein, and affinity tag. Pure SeMet-SgcE6 was concentrated
using Amicon Ultra-15 concentrators (Millipore) in 20 mM HEPES buffer
(pH 8.0), 250 mM NaCl, and 2 mM dithiothreitol (DTT). Protein concentrations
were determined from the absorbance at 280 nm using a calculated molar
absorption coefficient (ε280 = 12615 M–1 cm–1).[36] The concentration
of protein samples used for crystallization was 30.2 mg/mL.
Crystallization
of SgcE6
SgcE6 crystallization screens
were prepared with a Mosquito liquid dispenser (TTP Labtech) using
the sitting-drop vapor-diffusion technique in 96-well CrystalQuick
plates (Greiner Bio-one). For each condition, 0.4 μL of protein
and 0.4 μL of crystallization formulation were mixed; the mixture
was equilibrated against 140 μL of the reservoir in the well.
The protein–ligand complex was prepared by mixing protein with
27.7 mM FAD and 27.7 mM NADH at 4 °C for several hours before
setting up crystallizations. The following commercially available
crystallization screens were used: MCSG-1-3 (Microlytic Inc.) at 24
°C for the ligand-free protein and MCSG-1-4 (Microlytic Inc.)
at 24 °C for the protein–ligand complexes. The crystals
for the ligand-free protein were obtained under 25% PEG 3350, 0.1
M HEPES (pH 7.5), and 0.2 M ammonium sulfate. The best crystal of
the protein–ligand complex of SgcE6 was produced under 20%
PEG 8000, 0.1 M MES (pH 6.0), and 0.2 M calcium acetate.
Data Collection
and Structure Determination of SgcE6
The diffraction data
of ligand-free SeMet-SgcE6 (apo-SgcE6) were
collected at Argonne National Laboratory on the APS (19-ID) beamline
using a wavelength of 0.97912 Å with the ADSC QUANTUM 315r CCD
detector. The data sets were collected to a resolution of 1.90 Å.
For the complex structure of SeMet-SgcE6 (SgcE6-FAD), the diffraction
data were collected on the same beamline using a wavelength of 0.97935
Å with the same detector. The data sets were collected to a resolution
of 1.66 Å. The diffraction data were indexed, integrated, and
scaled using HKL-3000.[37] Both apo-SgcE6
and SgcE6-FAD structures were determined by the single-wavelength
anomalous diffraction (SAD) method, and the initial models were built
using HKL-3000.[37] The structures were completed
with alternating rounds of manual model building with COOT[38] and refinement with phenix.refine.[39] Waters were added and updated during refinement.
The final structures were refined to the same resolution limit as
in data collection with favorable Rcryst and Rfree values (Table ).
Table 1
Data Collection,
Phasing, and Refinement
Statistics for Structures
apo-SgcE6
SgcE6-FAD
SgcC
Data Collection
Protein Data Bank entry
4HX6
4R82
4OO2
space group
P21
P21
P212121
cell dimensions
a, b, c (Å)
56.37, 213.31,
56.72
42.05, 62.83, 70.09
99.16, 173.73,
113.86
α, β,
γ (deg)
90.00, 88.83, 90.00
90.00,
92.06, 90.00
90.00, 90.00, 90.00
wavelength (Å)
0.97912
0.97935
0.91165
resolutiona (Å)
30.10–1.90 (1.93–1.90)
34.93–1.66 (1.69–1.66)
47.67–2.63 (2.72–2.63)
Rsymb or Rmergec (%)
9.5 (45.2)
9.0 (52.1)
29.6 (231.4)
CC1/2 (%)
99.2 (84.1)
99.4 (69.5)
98.0 (17.3)
I/σ
19.5 (2.4)
22.6 (2.0)
5.3 (0.62)
completeness (%)
98.3 (80.9)
99.5 (97.3)
96.7 (97.9)
redundancy
3.3 (2.8)
3.7 (2.9)
5.2 (4.3)
Refinement
resolution (Å)
30.10–1.90
34.93–1.66
47.67–2.63
no. of reflections
104829
43097
296449
Rwork/Rfree
0.169/0.212
0.164/0.188
0.216/0.246
Ramachandran plot (%)d
favored
99
98
95
outliers
0
2
0
no. of atoms
protein
10513
2515
15938
ligand/ion
50
176
8
water
774
240
217
B-factor (Å2)
protein
29.0
21.1
57.1
ligand/ion
53.8
22.4
61.9
water
37.0
30.9
47.8
root-mean-square deviation
bond lengths (Å)
0.007
0.006
0.002
bond angles (deg)
1.083
1.158
0.67
Numbers in parentheses
are values
for the highest-resolution bin.
Rmerge = ∑∑|I(hkl) – I̅(hkl)|/∑∑I(hkl), where I(hkl) is
the ith observation of reflection hkl and I̅(hkl) is the weighted
average intensity for all observations i of reflection hkl.
Numbers in parentheses
are values
for the highest-resolution bin.Rmerge = ∑∑|I(hkl) – I̅(hkl)|/∑∑I(hkl), where I(hkl) is
the ith observation of reflection hkl and I̅(hkl) is the weighted
average intensity for all observations i of reflection hkl.Rmeas = ∑[N/(N – 1)1/2]∑|I(hkl) – I̅(hkl)|/∑∑I(hkl).As defined by MolProbity.
Gene Cloning and Production and Purification of SgcC
The sgcC gene from S. globisporus was amplified
from the genomic DNA by PCR using two primers, SgcC-F
and SgcC-R (Table S1), and subsequently
subcloned into expression vector pBS3080,[40] yielding APC109081 (pBS1160), which produced SgcC with an N-terminal
His6 tag and a TEV protease cleavage site, which leaves
an N-terminal Ser-Pro sequence after TEV cleavage. For SeMet-SgcC
production, the APC109081 construct was transformed into E.
coli methionine auxotroph strain B834 (DE3) pLysS (Invitrogen).
SeMet-SgcC overproduction was accomplished using autoinduction medium
at 25 °C according to the standard protocol.[41] The cells were disrupted by sonication. The insoluble cellular
material was removed by centrifugation. Ni-NTA affinity chromatography
was used to initially purify the His6-tagged protein with
a supplement of 0.5 mM FAD during purification. The His6 tag was then removed by TEV protease (Sigma) cleavage at 4 °C
overnight in 50 mM Tris (pH 8.0) containing 200 mM NaCl, 1 mM DTT,
0.5 mM EDTA, and 0.5 mM FAD. The sample after TEV protease cleavage
was run through Ni-NTA resin again under loading buffer conditions,
and the flow-through was collected, which contained the purified untagged
SeMet-SgcC. The untagged protein was subsequently run through a Supderdex
200 size-exclusion column using the ÄKTAxpress system (GE Healthcare
Life Sciences) in 20 mM Tris buffer (pH 7.5) containing 100 mM NaCl.
SeMet-SgcC was concentrated to 15 mg/mL by Amicon Ultra-15 concentrators
(Millipore) for protein crystallization. Protein concentrations were
determined from the absorbance at 280 nm using a calculated molar
absorption coefficient (ε280 = 94435 M–1 cm–1).[36]
Crystallization
of SgcC
SeMet-SgcC was screened for
crystallization conditions as described for SgcE6. SeMet-SgcC was
crystallized by the hanging drop vapor diffusion method at 20 °C
by mixing 1 μL of 15 mg/mL SeMet-SgcC with 1 μL of reservoir
solution [100 mM MES buffer (pH 6.5), 60 mM mixture of MgCl2 and CaCl2, 100 mM imidazole, and 30% mixture of PEG 550
MME and PEG 20000].
Data Collection and Structure Determination
of SgcC
The diffraction data of SgcC were collected at Argonne
National Laboratory
on the GM/CA (23-ID-B) beamline using a wavelength of 0.911650 Å
with the MAR 300 CCD detector. The data sets were collected to a resolution
of 2.63 Å. The data were indexed, integrated, and scaled using
XDS.[42] The SgcC structure was determined
by molecular replacement using Phaser-MR[43] and Autobuild programs[44] of the PHENIX
suite[39] with monomer polypeptide coordinates
without ligands or water using the oxygenase component of the 4-hydroxyphenylacetate
3-monooxygenase from Thermus thermophilus (TtHpaB)
as the search model [Protein Data Bank (PDB) entry 2YYK]. The structural
refinement of SgcC was conducted with the same procedures that were
used for SgcE6. The final data collection and refinement statistics
can be found in Table .
Size-Exclusion Chromatography
Size-exclusion chromatography
was performed using a Superdex 200 16/600 column (GE Healthcare Life
Sciences) with an ÄKTAxpress system (GE Healthcare Life Sciences)
at 4 °C and 500 μL of sample loaded per run. The column
was calibrated with a size-exclusion calibration kit (GE Healthcare
Life Sciences) and developed with the elusion buffer [200 mM NaCl
and 100 mM Tris (pH 8.0)] at a flow rate of 0.5 mL/min with UV detection
at 280 nm.
Protein–Protein Docking of SgcC and
SgcC2
Two
methods of SgcC–SgcC2 docking were performed, both using a
model of SgcC2 generated by I-TASSER[45] (Figure S1, trimmed conserved core sequence).
The ClusPro server[46] was used to compute
a global blind docking, and the Rosie server[47] was used for local flexible docking (allowing local rotation and
tilting), which generated similar results.
Results and Discussion
In Nine-Membered
Enediyne Biosynthetic Gene Clusters, Homologues
of SgcE6 Are Conserved and Those of SgcC Are Widespread
Our
previous studies showed that SgcE6 supplies FADH2 to both
SgcC and SgcC3 in biosynthesis of the β-tyrosyl moiety of the
C-1027 chromophore.[9,10] Bioinformatics analysis of the
C-1027 biosynthetic gene cluster revealed the existence of other genes
encoding putative flavin-dependent enzymes, such as sgcE9, sgcD2, and sgcL, the functions
of which are still unknown.[1] A genome neighborhood
network of enediyne biosynthetic gene clusters from the NCBI and JGI
databases[48] revealed that sgcE6 homologues are conserved in enediyne biosynthetic gene clusters,
especially those encoding nine-membered enediynes, suggesting a role
in enediyne core biosynthesis (Figure S2). On the other hand, sgcC homologues are conserved
in the aromatic amino acid-containing enediyne biosynthetic gene clusters,
kedarcidin,[31] maduropeptin,[32] and sporolides A and B,[33] indicating this group of flavin-dependent monooxygenases specifically
catalyzes hydroxylation onto aromatic rings. Homologues of sgcC are also present in the other 14 putative nine-membered
enediyne biosynthetic gene clusters (Figure S3) with high levels of amino acid sequence identity among them (56.6–99.8%).
Hydroxylation of an aromatic amino acid moiety is therefore thought
to be involved in their biosynthetic pathways. We previously determined
that SgcE6 is kinetically capable of providing more reduced flavin
than is necessary for both SgcC and SgcC3, further supporting a broader
role for SgcE6 in the biosynthesis of enediynes.[11]
Structure Solution, Refinement, and Quality
of SgcE6 Crystal
Structures
The crystal structures of the ligand-free form
of SgcE6 (apo-SgcE6) was obtained in monoclinic space group P21 and was determined by the single-wavelength
anomalous diffraction (SAD) method; the summary of crystallographic
data is given in Table . Apo-SgcE6 was refined to a resolution of 1.90 Å with an Rwork factor of 16.9% and an Rfree factor of 21.2%. The structure of apo-SgcE6 contains
eight polypeptide chains, five acetates, and six sulfates per asymmetric
unit. The eight polypeptide chains had varying density for the N-
and C-termini, with the start of the chain varying from residue −2
to 8 and ending between residues 175 and 180. Many monomers also lack
interpretable density for the region between residues 97 and 105,
which is part of the SgcE6 “flexible loop” and will
be discussed further below (Figure S4).Crystals of the FAD complex form of SgcE6 (SgcE6-FAD) were prepared
by mixing protein with high concentrations of FAD and NADH via cocrystallization
under various conditions. SgcE6-FAD crystallized in the P21 space group, but was not isomorphous with apo-SgcE6, as it had different
cell dimensions. SgcE6-FAD was refined to a high resolution of 1.66
Å with an Rwork of 16.4% and an Rfree of 18.8%. The asymmetric unit of SgcE6-FAD
contains two polypeptide chains, each with associated electron density
that is modeled unambiguously as FMN and two adenosine 5′-monophosphates
(AMP) in the substrate/cofactor binding cavity (Figure C). On average, the atoms from the FMN and
AMP molecules had B factors similar to those of the
surrounding protein, indicating a lack of disorder. The FMN moiety
is from FAD, and the two AMPs are from either FAD or NADH due to the
large excess used in crystallization and apparent degradation. Each
monomer could be modeled from residue 9 to 99 and from residue 104
to 177 and again was lacking density in the “flexible loop”.
Each monomer is associated with one Mg2+, two Ca2+, two Cl–, and two acetates.
Figure 2
Structure-based sequence
alignment of SgcE6 and homologues. (A)
The sequence alignment is colored according to the loops lining the
active and flavin binding sites. The sequence/structure highlighted
in red or magenta correlates with either NADH or NADPH preference,
respectively; the regions highlighted in orange or yellow correlate
with FMN or FAD preference, respectively, and the regions highlighted
in green and cyan highlight the (S/T/C)XXPP and GDH motifs, respectively.
The yellow region corresponds to the “flexible AMP binding
loop”. The alignment was created with RaptorX,[59] edited by hand, and rendered with ESPript 3.0.[60] (B) Structural comparison of SgcE6 and other
HpaC-like flavin reductases. The labels indicate the preferred substrates.
The bound ligands are displayed as black balls and sticks, with key
hydrogen bonds shown as black dashes. The structures of SgcE6, TtHpaC,
and TtFMNbp are from PDB entries 4R82, 2ED4, and 3ZOE, respectively. Further comparisons are
shown in Figure S3. (C) σ-A-weighted
difference (mFo – DFc) omit map contoured at 3σ with the positive and
negative features shown as green and red mesh, respectively. The modeled
FMN and AMP moieties are accurately modeled.
Structure-based sequence
alignment of SgcE6 and homologues. (A)
The sequence alignment is colored according to the loops lining the
active and flavin binding sites. The sequence/structure highlighted
in red or magenta correlates with either NADH or NADPH preference,
respectively; the regions highlighted in orange or yellow correlate
with FMN or FAD preference, respectively, and the regions highlighted
in green and cyan highlight the (S/T/C)XXPP and GDH motifs, respectively.
The yellow region corresponds to the “flexible AMP binding
loop”. The alignment was created with RaptorX,[59] edited by hand, and rendered with ESPript 3.0.[60] (B) Structural comparison of SgcE6 and other
HpaC-like flavin reductases. The labels indicate the preferred substrates.
The bound ligands are displayed as black balls and sticks, with key
hydrogen bonds shown as black dashes. The structures of SgcE6, TtHpaC,
and TtFMNbp are from PDB entries 4R82, 2ED4, and 3ZOE, respectively. Further comparisons are
shown in Figure S3. (C) σ-A-weighted
difference (mFo – DFc) omit map contoured at 3σ with the positive and
negative features shown as green and red mesh, respectively. The modeled
FMN and AMP moieties are accurately modeled.
Overall Structure Establishing SgcE6 as a Member of the HpaC-like
Subfamily of Flavin Reductases
As previously mentioned, bioinformatics
analyses revealed that SgcE6 is a member of the HpaC-like flavin reductase
subfamily as determined by conserved (S/T/C)XXPP and GDH sequence
motifs. Although the sequence identities between SgcE6 and the other
HpaC-like flavin reductases are low and vary from 16 to 34%, they
all share similar three-dimensional structures with root-mean-square
deviations (rmsds) of 0.52–1.24 Å for Cα atom superposition.
These SgcE6 homologues include HpaC from T. thermophilus HB8 (TtHpaC),[16] PheA2 from Geobacillus
thermoglucosidasius (GtPheA2),[18] NTA-MoB from Mycobacterium thermoresistible (MtMoB),[49] TftC from Burkholderia cepacia AC1100 (BcTftC),[19] SMOB from Pseudomonas putida S12 (PpSMOB),[50] HpaC from Sulfolobus tokodaii strain 7 (StHpaC),[51] FMN binding protein form Pyrococcus
horikoshii (PhFMNbp),[52]T. thermophilus (TtFMNbp),[52] and Methanothermobacter thermautotrophicus (MtFMNbp),[53] and FeR from Archaeoglobus fulgidus (AfFeR)[54] (Figure and Figure S5). Because of the low degrees of similarity between many of these
members, we generated a structure-based alignment that captures nonconserved
structural elements and regions where residues overlap in space rather
than based on sequence alone (Figure and Figure S5). Important
regions will be discussed in detail below in reference to substrate
binding.All of the HpaC-like flavin reductases are homodimers,
with monomers related by 2-fold rotational axes. Not surprisingly,
in the crystal structures of apo-SgcE6 and SgcE6-FAD, each asymmetric
unit contains dimers, which were confirmed by size-exclusion chromatography
(Figure S6A). PISA analysis[55] of SgcE6 reveals that the dimerization interface
is stabilized by many hydrogen bonds and electrostatic interactions
and buries ∼2770 Å2 of accessible surface area
(ASA). In addition, β-sheets 4 and 5 and a loop containing the
(S/T/C)XXPP motif are swapped between dimers and make up portions
of the active site, making the HpaC-like flavin reductases obligate
homodimers.
Flavin Binding and Preference in SgcE6 and
Homologues
Previous characterization of SgcE6 revealed a
preference for FAD
over FMN and revealed that there were no interactions necessary for
FADH2 transfer to SgcC or SgcC3. The presence of FMN in
the active site of SgcE6-FAD reveals how the flavin ring and ribose
are bound. However, because of degradation of FAD during crystallization,
how SgcE6 differentiates between FAD and FMN remains unknown. Nevertheless,
insight can be gained through comparison of SgcE6-FAD with homologues
and apo-SgcE6.Superposition of the SgcE6 monomers reveals significant
conformational differences in the “flexible loop” near
the flavin binding site, which is flanked by α-helix 3 and β-sheet
7 (Figure and Figures S4 and S5). The differences in conformation
depend greatly on crystal contacts with the symmetry mates. Comparison
of SgcE6 with the other HpaC-like flavin reductases reveals that for
members preferring FAD [TtHpaC, BtPheA2, BcTftC, and PpSMOB (Table S2)], the flavin rings and ribose are bound
in identical conformations; however, the AMP portion of FAD is bound
in various conformations (Figure S5). We
did not try to model the AMP moiety due to the conformational heterogeneity
of the nonconserved “flexible loop” and because the
conformation of AMP differs among homologues. SgcE6 homologues that
prefer FMN [StHpaC, TtFMNbp, PhFMNbp, MtFMNbp, and AfFeR (Table S2)] also bind the flavin rings and ribose
in an identical location; however, the “flexible loop”
and more specifically α-helix 3 are extended into places where
the AMP moieties would bind. Thus, it has been suggested that longer
“flexible loops” correlate with FMN preference; however,
SgcE6 has an extended “flexible loop” yet strictly binds
FAD. Rather, the length of α-helix 3 and residues emanating
from β-sheet 3 correlate with flavin preference (Figure ).
Reductant Preference of
SgcE6 and Homologues
The previous
characterization of SgcE6 revealed it had a preference for NADH as
the reductant. Four regions interact with the NAD(P)H reductant, α-helix
1, β-sheets 11 and 12, and the (S/T/C)XXPP and GDH sequence
motifs (Figure and Figure S5). In most structures with NAD+ bound (TtHpaC, BtPheA2, BcTftC, and StHpaC), it is found in a “U”
conformation with the nicotinamide ring sandwiched between the isoalloxazine
rings of the flavin and the adenine ring (Figure C and Figure S5). The binding of NADPH remains a mystery for most of the members
preferring it, but a single-residue insertion in β-sheet 11
and shortened α-helix 1 distinguishes TtFMNbp, PhFMNbp, and
MtFMNbp from NADH-utilizing members. As an outlier, AfFeR has a bound
NADP+ moiety, where the adenine ring is bound by an α-helix
in place of β-sheets 11 and 12. Nevertheless, binding of NADPH
to TtFMNbp, PhFMNbp, and MtFMNbp may be similar to that of AfFeR.
The two AMP moieties bound in the active site of SgcE6, and comparison
with TtHpaC/BtPheA2/BcTftC/StHpaC structures reveals that the “U”
conformation binding mode of NADH is identical.
Structure Solution
and Refinement of SgcC
SgcC crystallized
in orthorhombic space group P212121. The anomalous signal from the SeMets was weak; thus,
the structure was determined by molecular replacement. The structure
of TtHpaB (PDB entry 2YYK) was used as the search model as it was the most similar in the
PDB (identity and similarity scores of 34 and 52%, respectively) (Figure ). Four copies of
the polypeptide chain were found and built in the asymmetric unit.
The chains could be reliably traced between residues 18–167
and 178–526, with the region between residues 168 and 177 lacking
significant density for three of the monomers. Two Ca2+ ions were built into the model based on Fo – Fc regular and anomalous electron
density maps, even though a mixture of Mg2+ and Ca2+ was included in the crystallization condition. The Ca2+ ions fit the following criteria better than Mg2+ ions based on the refinement of occupancy, B factor,
coordination number, and distance. The asymmetric unit also contained
one molecule of glycerol. The crystal structure of SgcC was refined
to a resolution of 2.63 Å with an R factor of
21.6% and an Rfree factor of 24.6%. Data
processing and refinement statistics are summarized in Table .
Figure 3
Structure-based sequence
alignment of SgcC and homologues. According
to the crystal structures of TtHpaB, the regions colored yellow (flavin
binding loop), green (substrate binding loop), and magenta (AMP responsive
loop) change conformation upon FAD binding, and the region colored
green further changes conformation upon substrate binding. The residues
highlighted in cyan are involved in binding the phenolic hydroxyl
group of their substrates. The residue highlighted in orange differs
from TtHpaB to the others, which likely opens the active site.
Structure-based sequence
alignment of SgcC and homologues. According
to the crystal structures of TtHpaB, the regions colored yellow (flavin
binding loop), green (substrate binding loop), and magenta (AMP responsive
loop) change conformation upon FAD binding, and the region colored
green further changes conformation upon substrate binding. The residues
highlighted in cyan are involved in binding the phenolic hydroxyl
group of their substrates. The residue highlighted in orange differs
from TtHpaB to the others, which likely opens the active site.
Overall Structure Establishing
SgcC as a Member of the ACAD-Fold
Flavin-Dependent Monooxygenases
The crystal structure of
SgcC was found to be packed as a tetramer in the asymmetric unit (Figure A), which was confirmed
in solution by size-exclusion chromatography (Figure S6B). PISA analysis revealed that the tetrameric interface
is formed by several tight hydrogen bonds and salt bridges and buries
∼8395 Å2 of ASA. Dali alignments[56] reveal that the SgcC monomer has a core structure
with an ACAD fold as judged by comparison with mammalian mitochondrial
medium chain ACAD with a rmsd of 3.2 Å for 312 Cα atoms
(Figure B,C). ACADs
are also tetrameric. SgcC, EcHpaB, and homologues are much longer
than ACADs by at least 100 residues. The extra residues are found
in many loops around the structure; however, the tetramerization interfaces
are relatively conserved. ACAD and SgcC are comprised of three domains,
an N-terminal α-helical domain, a core domain containing a β-barrel-like
structure, and a C-terminal domain containing a group of α-helices.
The N-terminal α-helix 1 and β-sheets 1 and 2 of SgcC
are found in a conformation completely different from that of ACADs,
and the C-terminal α-helices 18 and 19 and β-sheets 15
and 16 of SgcC are lacking from ACADs. As mentioned previously, flavin-dependent
monooxygenases with ACAD folds fall into two types, those similar
to EcHpaB/TtHpaB or AbHpaH. AbHpaH is very similar to ACADs with very
little deviation in structure, while SgcC is more similar to EcHpaB
and TtHpaB.
Figure 4
Overall structure of SgcC and comparison with acyl-CoA dehydrogenase
(PDB entry 3MDD). (A) The tetramer of SgcC is colored as follows: monomer A in gray,
monomer B in cyan, monomer C in black, and monomer D in orange. The
active site loops “flavin binding loop”, “AMP
responsive loop”, and “substrate binding loop”
are colored as in Figure . (B) A monomer from SgcC is colored by homology to acyl-CoA
dehydrogenase as seen in panel C. The N-terminal α-helices and
loops are in completely different orientations and colored cyan. The
rest of the N-terminal α-helical bundle is colored blue, the
β-barrel green, and the C-terminal α-helical bundle red.
The black regions are extended regions not found in acyl-CoA dehydrogenases
when compared to SgcC and regions that are not homologous in structure
and deviate by ≥5 Å.
Overall structure of SgcC and comparison with acyl-CoA dehydrogenase
(PDB entry 3MDD). (A) The tetramer of SgcC is colored as follows: monomer A in gray,
monomer B in cyan, monomer C in black, and monomer D in orange. The
active site loops “flavin binding loop”, “AMP
responsive loop”, and “substrate binding loop”
are colored as in Figure . (B) A monomer from SgcC is colored by homology to acyl-CoA
dehydrogenase as seen in panel C. The N-terminal α-helices and
loops are in completely different orientations and colored cyan. The
rest of the N-terminal α-helical bundle is colored blue, the
β-barrel green, and the C-terminal α-helical bundle red.
The black regions are extended regions not found in acyl-CoA dehydrogenases
when compared to SgcC and regions that are not homologous in structure
and deviate by ≥5 Å.Other structurally characterized ACAD-fold enzymes that are
less
similar to SgcC include AnaB, a prolyl-ACP dehydrogenase that acts
on a carrier protein-tethered substrate; FkbI, a dehydrogenase proposed
to act on an acyl carrier protein (ACP)-tethered aldehyde to generate
hydroxylmalonyl-S-ACP; and KijD3 and EvdC, which
are amino sugar oxidizing enzymes. Interestingly, all these enzymes
function in natural product biosynthetic pathways.
Structural
and Sequence Comparisons of SgcC with EcHpaB/TtHpaB-Type
Monooxygenases Reveal the Binding Site for Substrates
Only
three structures in the PDB are highly similar to that of SgcC. These
include the structures of TtHpaB,[17] 2,4,5-trichlorophenol
4-monooxygenase from B. cepacia AC1100 (BcTftD),[19] and 2,4,6-trichlorophenol 4-monooxygenase from Cupriacidus necator JMP134 (CnTcpA).[26] Despite the low levels of sequence identity among them
(20–29%), they all share nearly identical structures (Figure ). SgcC shares high
to low levels of sequence similarity with various TtHpaB-type monooxygenases
that act on phenolic substrates and afford ortho hydroxylated
products and a lower level of sequence similarity with monooxygenases
that act on highly halogenated or nonphenolic aromatic substrates
(Table S3, Figure , and Figures S7 and S8). Structural comparisons of apo-SgcC and FAD/4HPA-bound
TtHpaB reveal the substrate binding residues Arg119, Tyr123, His161,
and Thr202 of SgcC are conserved (numbering throughout will be based
on the SgcC sequence and alignment in Figure ), along with the tertiary structure of most
of the residues surrounding the active site.
The Structure of SgcC Is
Most Similar to That of Apo-TtHpaB,
Revealing That Conformational Changes Are Necessary for FAD and Substrate
Binding
TtHpaB was crystallized in three states, (i) in the
absence of substrates as an apo form, (ii) with FAD, and (iii) with
FAD and 4HPA.[17] Three loops in the active
site undergo changes in conformation from apo to substrate-bound and
consist of a “flavin binding loop”, “AMP responsive
loop”, and “substrate binding loop” (Figures A, 5, and S9). The loop between residues
159 and 179, “flavin binding loop”, carries His161 and
Phe163, and the backbone of these residues moves to form hydrogen
bonds with the flavin ring (Figure S9).
In addition, an “AMP responsive loop” emanating from
an adjacent monomer comprised of residues 428–435 also undergoes
a change in conformation to interact with the “flavin binding
loop”, which orders the conserved Arg170 to interact with the
FAD diphosphate moiety. A “substrate binding loop” that
interacts with 4HPA consists of residues 212–221, the sequence
of which is highly dependent on substrate, undergoes an order to disorder
transition upon FAD binding, and again becomes ordered upon 4HPA binding.
The “flavin binding loop” and “substrate binding
loop” of SgcC have the largest deviations between the monomers,
and the “flavin binding loop” contains disordered residues
(Figure A). A comparison
of SgcC to each of the TtHpaB structures reveals that the structure
of SgcC is most similar to that of apo-TtHpaB; thus, conformational
changes are necessary for FAD and substrate binding from the current
SgcC structure (Figure S9). As conformational
changes are necessary, we avoided modeling to further understand FAD
binding.
Figure 5
Comparison of SgcC monomers and SgcC with TtHpaB. (A) Comparison
of the SgcC monomers, with the “flavin binding loop”
colored tan, the “AMP responsive loop” purple, and the
“substrate binding loop” dark green. This highlights
the comparative disorder found in the active site loops compared to
the surrounding regions. (B) Stereoimage of an overlay of the active
site of SgcC shown in gray and dark colors as in panel A, and TtHpaB
in white and light colors with FAD in black and the 4HPA substrate
bound. Hydrogen bonds and interactions with 4HPA are colored red and
those with FAD black. This reveals the conformational changes required
for SgcC to bind FAD and substrate; namely, the loop containing His161
and Phe163 needs to rearrange, and the side chains need to rotate.
In addition, the catalytic Arg119 side chain would need to be repositioned.
This figure also reveals the stacking of the TtHpaB Phe at position
466 over the 4HPA substrate, which would preclude a modification at
position C-3; however, in SgcC, this residue is a serine, which makes
plenty of space for a modification.
Comparison of SgcC monomers and SgcC with TtHpaB. (A) Comparison
of the SgcC monomers, with the “flavin binding loop”
colored tan, the “AMP responsive loop” purple, and the
“substrate binding loop” dark green. This highlights
the comparative disorder found in the active site loops compared to
the surrounding regions. (B) Stereoimage of an overlay of the active
site of SgcC shown in gray and dark colors as in panel A, and TtHpaB
in white and light colors with FAD in black and the 4HPA substrate
bound. Hydrogen bonds and interactions with 4HPA are colored red and
those with FAD black. This reveals the conformational changes required
for SgcC to bind FAD and substrate; namely, the loop containing His161
and Phe163 needs to rearrange, and the side chains need to rotate.
In addition, the catalytic Arg119 side chain would need to be repositioned.
This figure also reveals the stacking of the TtHpaBPhe at position
466 over the 4HPA substrate, which would preclude a modification at
position C-3; however, in SgcC, this residue is a serine, which makes
plenty of space for a modification.
Putative Binding Mode of the 3-Chloro-β-tyrosyl Moiety
of the SgcC Substrate
To gain some insight into FAD and substrate
binding, we compared the structure of SgcC with that of TtHpaB substrate
complex (Figure B).
The major differences in substrate binding between the 3-chloro-β-tyrosyl
moiety and 4HPA are the presence of the 3-chloro group and an extended
chain in place of the acetic acid portion of 4HPA; however, both substrates
contain a p-phenol. In the structure of TtHpaB, the
4HPAphenol is bound by Arg119, Tyr123, His161, and hydrophobic contacts
are made with a phenylalanine and leucine at positions 466 and 163,
respectively. For the FAD and substrate to bind, the backbone and
side chain of His161 and Phe163 and the side chain of the conserved
Arg119 rearrange. Specificity for a phenol and accommodation of the
3-chloro and pantetheine arm will be discussed below.For SgcC
homologues acting on phenol-bearing substrates, Tyr123 and His161
are conserved. However, these residues are replaced by residues incapable
of hydrogen bonding in GtHpaH, which acts on anthranilate as a substrate,
as well as the nitrophenol- and chlorophenol-degrading homologues.[57] Furthermore, comparison of the active sites
of AbHpaH with TtHpaH reveals that FAD is bound almost identically,
but Tyr123 is replaced with a histidine and His161 with a serine (Figure S10). The histidine and serine bind the
common 4HPA substrate phenol such that position 3 is within 5 Å
of the C-4a atom of the flavin, which upon formation of the C-4a peroxyflavin
intermediate would be close enough for reaction with the peroxide.
This suggests that p-hydroxy-bearing substrates are
bound similarly within both AbHpaH and EcHpaB/TtHpaB families.If a 3-chloro-β-tyrosyl moiety is superimposed on the ring
of 4HPA, the chloro group would clash with the phenylalanine in position
466 of TtHpaB; however, this residue is a serine in SgcC and either
a glycine, alanine, or serine in homologues (Figure ). In the structure of SgcC, there does not
appear to be residues that would fill this space, allowing for the
halogenated substrate to bind. This suggests that the 3-chloro-β-tyrosyl
moiety of the SgcC substrate is bound in a manner identical to that
of the 4HPA substrate of TtHpaB, with the halide facing away from
the flavin. The acetic acid moiety of 4HPA interacts specifically
with the “substrate binding loop”, and this loop is
not conserved between TtHpaB and SgcC. However, this loop is conserved
between the SgcC homologues proposed to act on PCP-tethered substrates
(Figure S3), suggesting that there is a
common mode to accommodate the pantetheine and thioester moieties
of their substrates.
Implications for the Proposed Substrate Binding
Model
SgcC efficiently catalyzes the regioselective hydroxylation
of (S)-3-substituted β-tyrosyl-S-SgcC2
analogues, including F-, Cl-, Br-, I-, and Me-substituted derivatives.
The reactivity order of these analogues is as follows: I > Br >
Cl
> Me > F > H.[10] The reactivity
trend indicates,
in general, that higher electronegativity decreases the hydroxylation
activity, with the exception of methyl and hydrogen. Highly electronegative
functional groups such as fluorine limit the π-electrons from
attacking the hydroxy group of the hydroperoxyflavin intermediate
(FADHOOH). The electronegativity of the 3-substitutent therefore likely
plays a role in catalysis larger than that of specific interactions
with active site residues. The apparent lack of activity for the hydrogen-bearing
substrate is perplexing in light of the activity of homologues for
4HPA. This suggests that changing Ser466 and other active site residues
could increase the activity of SgcC for the non-3-substituted substrate
to increase yields of 20′-deschloro-C-1027, should SgcC activity
become limiting in engineering efforts.
Putative SgcC2–SgcC
Binding Interface
SgcC specifically
acts on its SgcC2-tethered substrate, as the free substrate is not
recognized, implying an interaction between SgcC and SgcC2. Comparison
of SgcC to the ACADs is not useful for gaining insight into the binding
of the shared phosphopantetheine moieties, as the structures are not
homologous in the regions around the “substrate binding loop”.
Similarly, comparisons with the structural homologues AnaB and FkbI
did not provide insight. Therefore, SgcC–SgcC2 docking models
were used to predict the interaction interface (Figure S11). The majority of the top-ranked models reveals
that the phosphopantetheine attachment point, Ser31, of SgcC2 points
toward the active site of SgcC (Figure S11A). According to this SgcC–SgcC2 docking model, SgcC2 is bound
at the center of the SgcC tetramer, contacting each of the subunits,
with an area of 985 Å (Figure S11B), and detailed interactions are shown in Figure S11C. While this docking experiment seems reasonable, it was,
however, performed using an SgcC tetramer as the protein acceptor
and an SgcC2 monomer as the protein donor and may not reflect the
true binding mode of the SgcC tetramer, which most likely interacts
with multiple molecules of SgcC2.Modifications catalyzed by
tailoring enzymes acting on PCP-tethered substrates are known, including
cyclization, halogenation, methylation, oxidation, and reduction.
In the biosynthesis of C-1027, SgcC3 and SgcC catalyze chlorination
and hydroxylation of SgcC2-tethered (S)-β-tyrosine,
respectively. By covalently tethering, PCPs sequester the substrates
from endogenous metabolite pools, thereby increasing their effective
concentrations at the active sites for catalysis.[58] In addition, PCP-tethered substrates also prevent promiscuous
activity from occurring on free substrates.
Proposed Overall Reaction
Scheme of SgcE6, SgcC, and SgcC2
The E. coli flavin reductase (Fre) could be substituted
for SgcE6 without impacting the activity of SgcC, implying that SgcC
produces FADH2 from SgcE6 by diffusion rather than delivery
via a specific protein–protein interaction.[11] The catalytic mechanisms of flavin-dependent monooxygenases
that share a mechanism common to SgcC are well studied.[17,27] According to the crystal structures of SgcE6 and SgcC and the docking
models, a general mechanism could now be proposed as shown in Figure .
Figure 6
Proposed catalytic mechanism
of SgcC. (i) SgcE6 is responsible
for the production of FADH2 at the expense of NADH. (ii)
FADH2 diffuses to SgcC, where Arg119 activates an O2 molecule to form a C-4a-hydroperoxyflavin intermediate. (iii)
SgcC2 forms protein–protein interactions with SgcC and guides
the (S)-3-chloro-β-tyrosyl moiety into the
active site. (iv) The (S)-3-chloro-β-tyrosyl
moiety is activated by a conserved general base His161, which plays
a key role in deprotonation of the 4-hydroxy group. The electrons
delocalize into the aromatic ring system and undergo electrophilic
attack of the hydroxy group of the C-4a-hydroperoxyflavin intermediate,
introducing a hydroxy group at the C-5 position of the (S)-3-chloro-β-tyrosyl moiety. (v) The hydroxyflavin (FADHO–) abstracts a proton from the C-5 atom of the intermediate,
resulting in the formation of the re-aromatizated (S)-3-chloro-5-hydroxy-β-tyrosyl-S-SgcC2, which
then dissociates from SgcC. (vi) Finally, the hydroxyflavin (FADHOH)
undergoes dehydration to form FAD, which is released and recycled
by SgcE6.
Proposed catalytic mechanism
of SgcC. (i) SgcE6 is responsible
for the production of FADH2 at the expense of NADH. (ii)
FADH2 diffuses to SgcC, where Arg119 activates an O2 molecule to form a C-4a-hydroperoxyflavin intermediate. (iii)
SgcC2 forms protein–protein interactions with SgcC and guides
the (S)-3-chloro-β-tyrosyl moiety into the
active site. (iv) The (S)-3-chloro-β-tyrosyl
moiety is activated by a conserved general base His161, which plays
a key role in deprotonation of the 4-hydroxy group. The electrons
delocalize into the aromatic ring system and undergo electrophilic
attack of the hydroxy group of the C-4a-hydroperoxyflavin intermediate,
introducing a hydroxy group at the C-5 position of the (S)-3-chloro-β-tyrosyl moiety. (v) The hydroxyflavin (FADHO–) abstracts a proton from the C-5 atom of the intermediate,
resulting in the formation of the re-aromatizated (S)-3-chloro-5-hydroxy-β-tyrosyl-S-SgcC2, which
then dissociates from SgcC. (vi) Finally, the hydroxyflavin (FADHOH)
undergoes dehydration to form FAD, which is released and recycled
by SgcE6.
Conclusion
In
the last step of biosynthesis of the (S)-3-chloro-5-hydroxy-β-tyrosyl
moiety of the C-1027 chromophore, ScgE6 and SgcC comprise a two-component
monooxygenase that catalyzes the hydroxylation of (S)-3-chloro-β-tyrosine into (S)-3-chloro-5-hydroxy-β-tyrosine
only upon tethering of the substrate to the PCP of SgcC2. In this
study, we determined the crystal structures of SgcE6 and SgcC, to
investigate the substrate selectivity and the enzyme catalytic mechanism.
The crystal structures show that SgcE6 possesses a flexible loop,
which provides sufficient space to accommodate FAD. This structural
feature is consistent with our previous studies that showed SgcE6
specifically reacts with FAD but not FMN. The crystal structure of
SgcC is highly homologous to that of TtHpaB, providing insight into
substrate binding and catalysis. Finally, the crystal structures of
SgcE6 and SgcC give new insight into the intriguing group of carrier
protein-dependent two-component monooxygenases in a three-protein
system and could be explored and exploited to engineer C-1027 diversity
by structure-based enzyme and biosynthetic pathway engineering.
Authors: Steven G Van Lanen; Pieter C Dorrestein; Steve D Christenson; Wen Liu; Jianhua Ju; Neil L Kelleher; Ben Shen Journal: J Am Chem Soc Date: 2005-08-24 Impact factor: 15.419
Authors: Carl V Christianson; Timothy J Montavon; Steven G Van Lanen; Ben Shen; Steven D Bruner Journal: Biochemistry Date: 2007-05-22 Impact factor: 3.162
Authors: Y Zhang; T E Edwards; D W Begley; A Abramov; K B Thompkins; M Ferrell; W J Guo; I Phan; C Olsen; A Napuli; B Sankaran; R Stacy; W C Van Voorhis; L J Stewart; P J Myler Journal: Acta Crystallogr Sect F Struct Biol Cryst Commun Date: 2011-08-16
Authors: Chin-Yuan Chang; Jeremy R Lohman; Tingting Huang; Karolina Michalska; Lance Bigelow; Jeffrey D Rudolf; Robert Jedrzejczak; Xiaohui Yan; Ming Ma; Gyorgy Babnigg; Andrzej Joachimiak; George N Phillips; Ben Shen Journal: Biochemistry Date: 2018-03-21 Impact factor: 3.162
Authors: David J Levy-Booth; Morgan M Fetherolf; Gordon R Stewart; Jie Liu; Lindsay D Eltis; William W Mohn Journal: Front Microbiol Date: 2019-08-20 Impact factor: 5.640