Literature DB >> 28945067

Intertwined Precursor Supply during Biosynthesis of the Catecholate-Hydroxamate Siderophores Qinichelins in Streptomyces sp. MBT76.

Jacob Gubbens1, Changsheng Wu2, Hua Zhu2, Dmitri V Filippov1, Bogdan I Florea1, Sébastien Rigali3, Herman S Overkleeft1, Gilles P van Wezel2.   

Abstract

The explosive increase in genome sequencing and the advances in bioinformatic tools have revolutionized the rationale for natural product discovery from actinomycetes. In particular, this has revealed that actinomycete genomes contain numerous orphan gene clusters that have the potential to specify many yet unknown bioactive specialized metabolites, representing a huge unexploited pool of chemical diversity. Here, we describe the discovery of a novel group of catecholate-hydroxamate siderophores termed qinichelins (2-5) from Streptomyces sp. MBT76. Correlation between the metabolite levels and the protein expression profiles identified the biosynthetic gene cluster (named qch) most likely responsible for qinichelin biosynthesis. The structure of the molecules was elucidated by bioinformatics, mass spectrometry, and NMR. The genome of Streptomyces sp. MBT76 contains three gene clusters for the production of catecholate-peptide siderophores, including a separate cluster for the production of a shared catecholate precursor. In addition, an operon in the qch cluster was identified for the production of the ornithine precursor for qinichelins, independent of primary metabolism. This biosynthetic complexity provides new insights into the challenges scientists face when applying synthetic biology approaches for natural product discovery.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28945067      PMCID: PMC5696649          DOI: 10.1021/acschembio.7b00597

Source DB:  PubMed          Journal:  ACS Chem Biol        ISSN: 1554-8929            Impact factor:   5.100


Actinobacteria are renowned for their ability to manufacture a diversity of bioactive small molecules.[1,2] High-throughput screening of actinomycetes has yielded many useful therapeutic agents but also turned big pharma away from NPs for drug-discovery programs due to high cost and chemical redundancy.[3,4] The increase in genome-sequence information has uncovered a vast and yet untapped biosynthetic potential and metabolic diversity, which has brought the microbial NPs back into the spotlight. However, many of the biosynthetic gene clusters (BGCs) discovered by genome mining are poorly expressed under laboratory conditions, and a major new challenge lies in finding the triggers and cues to activate their expression.[5] Such approaches include, among others, chemical triggers, microbial cocultivation, induction of antibiotic resistance, and heterologous gene expression.[6−10] In addition, the advances in genetic tools applied in synthetic biology, such as transformation-associated recombination (TAR), Red/ET recombination, and CRISPR-Cas9, had aided in the discovery of cryptic products through engineering of their biosynthetic pathways.[11] A second bottleneck in genomics-based approaches is to establish a link between genomic and metabolomic data.[5,12] It is difficult to assign the genetic basis for specific chemical scaffolds through bioinformatics analysis alone, largely due to nature’s flexibility in catalytic enzymology, i.e., enzyme promiscuity[13] and crosstalk among different gene clusters.[14,15] The latter offers a significant hurdle in drug-discovery approaches that are based solely on heterologous expression of single gene clusters.[16] This gap can be bridged by genomics-based methodologies based on tandem MS analysis of metabolites[17,18] that allow the linkage of specific biosynthetic genes to the bioactivity of interest. As we and others have exemplified, statistical correlation between transcript or protein expression levels and the presence of bioactive molecules is equally feasible.[19−21] Subsequent bioinformatics analysis of a biosynthetic gene cluster (BGC) provides important (partial) structural information.[22] This information can guide researchers to optimize compound isolation and identification, so as to recover sufficient quantity of targeted metabolites(s) from highly complex matrices to warrant de novo structural elucidation.[23] A specific class of natural products is the siderophores, which are synthesized by nonribosomal peptide synthases (NRPS) and act as iron scavengers.[24] Their chemical topologies and biosynthetic machineries have been studied extensively,[14,15,25] and a wide range of structures have been reported. Siderophores are generally classified into catecholates, hydroxamates, (hydroxy)-carboxylates, and mixed ligands thereof.[24] Members of the mixed catecholate-hydroxamate subfamily, including rhodochelin,[15] heterobactins,[26] rhodobactin,[27] lystabactins,[28] mirubactin,[29] and S-213L,[30] feature both a 2,3-dihydroxybenzoate(s) (2,3-DHB) moiety and (modified) δ-N-hydroxyornithine residues within the same molecule. Consequently, the biosynthesis of catecholate-hydroxamate siderophores is always initiated by loading 2,3-DHB as a starter unit into the modular NRPS assembly line, followed by successive incorporation of amino acids, including ornithine, into the growing peptide chain.[15,26,29] Streptomyces sp. MBT76 was previously identified as a prolific producer of antibiotics,[31] such as isocoumarins, prodiginines, acetyltryptamine, and fervenulin.[32] More recently, the activation of a type II polyketide synthase (PKS) gene cluster (qin) in Streptomyces sp. MBT76 induced the production of novel glycosylated pyranonaphthoquinones (qinimycins).[33] As this metabolic spectrum was dominated by polyketides, while the BGCs for NRPS are as commonplace as PKS in bacterial genomes,[34] we anticipated that peptides were underrepresented in our studies. Here, we describe the discovery and characterization of qinichelins, new mixed-type catecholatehydroxamate siderophores from Streptomyces sp. MBT76. The aforementioned limitations in genome-mining strategies were overcome through varying the growth conditions to fluctuate peptide production, after which quantitative proteomics allowed the connection of the NRPS gene clusters to their metabolic products.

Results and Discussion

Biosynthetic Loci for Catechol-Peptide Siderophores Are Dispersed through the Genome of Streptomyces sp. MBT76

Previous analysis of the genome of Streptomyces sp. MBT76 by AntiSMASH[35] identified 55 putative biosynthetic gene clusters (BGCs) specifying secondary metabolites.[33] A total of 16 of these contained gene(s) encoding NRPS, suggesting rich peptide metabolism. Our attention was in particular directed to three distinct NRPS BGCs, containing genes for the biosynthesis of catechol-peptide siderophores. One BGC (Figure c) matched the described BGC for griseobactin,[36] but this lacked the dhb genes required for 2,3-DHB synthesis. However, a copy of this operon was present in a second cluster, entA-C-E-B (Figure b). Together with entF and entD, this cluster contains all genes necessary for the biosynthesis of enterobactin, a catechol-peptide siderophore from E. coli, albeit in a different order than in the original cluster.[37]
Figure 1

NRPS BGCs involved in catechol-type siderophore biosynthesis in Streptomyces sp. MBT76. BGCs for a new siderophore (a), for enterobactin (b), and for griseobactin (c) could be identified. Carrier protein domains (ACP/PCP) are depicted in light blue, condensation domains in dark blue, epimerization domains in green, and thioesterase (termination) domains in yellow. Adenylation domains are shown in purple, together with their predicted substrates, and transport proteins in red. Triangles indicate the position of iron boxes likely bound by the iron repressor DmdR.

NRPS BGCs involved in catechol-type siderophore biosynthesis in Streptomyces sp. MBT76. BGCs for a new siderophore (a), for enterobactin (b), and for griseobactin (c) could be identified. Carrier protein domains (ACP/PCP) are depicted in light blue, condensation domains in dark blue, epimerization domains in green, and thioesterase (termination) domains in yellow. Adenylation domains are shown in purple, together with their predicted substrates, and transport proteins in red. Triangles indicate the position of iron boxes likely bound by the iron repressor DmdR. A third NRPS BGC, designated qch (Table and Figure a), also lacked the dhb genes but contained the qchG gene for a 2,3-DHB ACP homologous to the EntB-ACP domain. The starter condensation (C) domain of the NRPS QchI, which likely appends a 2,3-DHB unit to the N-terminus of the peptide, also indicated the presence of a 2,3-DHB moiety in the final structure.[37] Through phylogenetic analysis of adenylation (A) domains[38] of the two core NRPS (QchI and QchJ), a nonribosomal peptide with the sequence 2,3-DHB-Ser-Orn(ornithine)/Asp-Ser-Ser-Orn-Orn was predicted as the product specified by the BGC, whereby no clear consensus prediction could be made for the second A domain. Two epimerization (E) domains in the first and third modules of QchI probably transform the stereochemistry of l-Ser into d-Ser, while two other genes are likely involved in tailoring of Orn: qchF coding for an l-ornithine-5-monooxygenase and qchQ coding for a GCN5-related N-acetyltransferase.[39] This strongly suggests that the product of the cluster is a mixed hydroxymatecatecholate siderophore, which is further supported by the presence of the siderophore-related transporter genes qchH and qchL-P. Interestingly, the absence of an esterase (TE) domain at the terminus of QchJ indicates an unusual release of the mature peptide, potentially leading to a linear structure, in contrast to a cyclic peptide usually produced by TE domains.
Table 1

Blastp Analysis of NRPS Cluster

Qchlength (AA)predicted functionidentity (%)alignment lengthE valuebitscore
A325N-acetyl-gamma-glutamyl-phosphate reductase 2 ArgC76.313252.00 × 10–175498
B32questionable ORF    
C383arginine biosynthesis bifunctional protein ArgJ85.943840652
D311acetylglutamate kinase ArgB79.862833.00 × 10–152437
E398acetylornithine aminotransferase ArgD76.13850566
F474l-ornithine 5-monooxygenase71.44370627
G82isochorismatase ACP domain55.41742.00 × 10–1877.8
H326iron(III) dicitrate transport permease33.943305.00 × 10–36139
I4295nonribosomal peptide synthetase45.92315102075
J3247nonribosomal peptide synthetase49.28325302474
K70MbtH-like protein74.24669.00 × 10–30106
L670ABC transporter related protein51.215376.00 × 10–154465
M570ABC transporter related protein52.875924.00 × 10–167497
N268siderophore-interacting protein46.212649.00 × 10–67217
O362ferric enterobactin transport system permease53.083411.00 × 10–109333
P372transport system permease protein60.453548.00 × 10–115346
Q203GNAT family N-acetyltransferase49.681555.00 × 10–34125
A search of the CAS database (American Chemical Society, http://scifinder.cas.org), using the predicted sequence of the qch-specified peptide product as a query, yielded S213L (1, Figure ),[30] an antibiotic/antifungal siderophore with the sequence DHB-Ser-Orn-Ser-Orn-hOrn-chOrn, as a closest hit. The partial sequence for the S213L BGC has been described[40] but differed from the qch BGC sequence. Moreover, the fourth residue of S213L is Orn, instead of Ser predicted for the qch-specified product. This strongly suggested that the qch cluster might not produce S213L, but a related compound 2 (Figure ). The stereochemistry of the individual qinichelin residues was further deduced using Marfey’s protocol (see the Supporting Information Data section for details).
Figure 2

Molecular structures of S213L (1), qinichelin (2), acetyl-qinichelin (3,4), and dehydroxy-qinichelin (5). Abbreviations of moieties are shown to facilitate the comparison of respective structure. DHB, dihydroxybenzoate; Ser, serine; Orn, ornithine; hOrn, δ-N-hydroxy-ornithine; chOrn, cyclized δ-N-hydroxy-ornithine hydroxamate. Acetylation of qinichelin can occur on two positions as indicated. The shown configurations of DHB–d-Ser–l-Orn–d-Ser–l-Ser–l-hOrn–l-chOrn in qinichelins were based on Marfey’s protocol (Supporting Information data and Figures S10–S15) combined with biosynthetic considerations (see Figure ).

Molecular structures of S213L (1), qinichelin (2), acetyl-qinichelin (3,4), and dehydroxy-qinichelin (5). Abbreviations of moieties are shown to facilitate the comparison of respective structure. DHB, dihydroxybenzoate; Ser, serine; Orn, ornithine; hOrn, δ-N-hydroxy-ornithine; chOrn, cyclized δ-N-hydroxy-ornithine hydroxamate. Acetylation of qinichelin can occur on two positions as indicated. The shown configurations of DHBd-Serl-Ornd-Serl-Ser–l-hOrn–l-chOrn in qinichelins were based on Marfey’s protocol (Supporting Information data and Figures S10–S15) combined with biosynthetic considerations (see Figure ).
Figure 5

Model of intertwined biosynthetic pathways of catechol-peptide siderophores in Streptomyces sp. MBT76. The functional crosstalk among four entirely separate gene clusters ensures the assembly of two types of catechol-peptide siderophores, namely qinichelins and griseobactin. The DHB is supplied by the ent gene cluster and is shared by three NRPS systems. The DHB-hexapeptide backbone in qinichelin follows an orthodox colinear extension model, while the Orn building block could arise from either qchA-E or the canonical arg gene cluster. The differently modified Orn in the dashed boxes are likely accepted by the AOrn domain of modules 2 and 5 to produce different qinichelin variants.

Interestingly, qch contains four genes (qchA and qchCDE) that are highly similar to the arg genes argC, argJ, argB, and argD, respectively, which are required for the synthesis of the precursor ornithine from glutamate.[41] In addition, a canonical arg cluster for ornithine/arginine metabolism was also found in the Streptomyces sp. MBT76 genome, including the regulatory gene argR and argE-H for the subsequent conversion of ornithine to arginine, all of which were lacking in the qch gene cluster. Taken together, bioinformatics analysis suggested that up to three different catecholate–peptide siderophores might be produced by the strain, sharing one set of the dhb operon for 2,3-DHB synthesis, while ornithine, as a precursor for compound 2, might be produced by primary metabolism or by enzymes derived from the qch BGC.

Proteomics Analysis of the qch Cluster and Identification of the Qinichelins

We previously described the natural product proteomining pipeline, which makes use of the strong correlation between the amount of a (bioactive) molecule produced and the expression level of its biosynthetic proteins.[20] This was applied to efficiently connect genes (genotype) to a given metabolite or bioactivity of interest (chemotype). The reverse analysis whereby the expression level of a targeted BGC (known genotype) is used to predict its yet uncharacterized molecule that is produced (unknown chemotype) should be equally feasible. Accordingly, this reverse proteomining could complement a genome-mining strategy to facilitate the discovery of novel compounds. As a prerequisite, sufficient fluctuation of protein levels should be achieved as a result of varying growth conditions.[20] Accordingly, Streptomyces sp. MBT76 was grown in modified liquid minimal medium (NMMP), supplemented with (A) no additive (control), (B) 2% (w/v) NaCl, (C) 1% (w/v) starch, (D) 0.8% (w/v) peptone, or (E) 0.6% (w/v) yeast extract, as these conditions have been proven successful previously. Subsequent quantitative proteomics analysis of whole-cell lysates, using two mixtures of three samples to compare all growth conditions, yielded 1472 protein identifications, wherein relative expression levels of 1174 proteins were quantified with at least two independent events, including proteins belonging to the BGCs of interest (Table ). Cultures grown in NMMP with peptone and, remarkably, in NMMP without additives, showed strong expression of the Qch proteins, as demonstrated by the marked upregulation of QchF and QchH-J when compared to, e.g., condition B (NMMP with 2% NaCl). This may have been caused by low iron content in the growth media. The changes in expression level for QchA and QchG were not in line with the other proteins from the qch BGC. QchA and ArgC could not be differentiated due to their high sequence similarity. However, the fluctuation pattern of QchA/ArgC for the five culture conditions was in line with that of all detected Arg proteins, strongly suggesting that the observed signals for QchA/ArgC were most likely dominated by ArgC. For QchG, the data set contained only three quantification events, due to its small size, potentially leading to errors in quantifications.
Table 2

Quantitative Proteomics Analysis

 normalized expression ratio (log2)a
number of quantificationsb
proteinsD/AB/AB/DD/CE/CE/DD/AB/AB/DD/CE/CE/D
qinichelin
QchA/ArgCc–1.6–0.81.0–1.7–2.2–0.3777333
QchF–0.6–2.4–2.30.10.40.0888777
QchG   1.71.2–0.7111333
QchH–1.2–2.5–1.2–1.0–1.5–1.2666444
QchI–0.2–3.1–3.3–0.7–1.1–0.5121212777
QchJ–1.3–3.0–1.7–0.2–0.4–1.0222222131313
enterobactin
EntA–2.1–0.61.3   333000
EntB-IC–0.80.91.1   333111
griseobactin
GriG–1.3–1.50.1–2.2–2.2–0.7888877
GriF   –1.4–1.2–0.1000666
GriE–0.8–1.4–0.3   333000
arginine biosynthesis cluster
QchA/ArgCc–1.6–0.81.0–1.7–2.2–0.3777333
ArgJ–0.9–0.90.0–0.6–0.8–0.9777655
ArgD–1.0–0.20.7–1.7–0.50.9222222
ArgR–1.50.11.5–2.5–1.01.1999444
ArgG–0.6–0.6–0.1–0.6–0.70.0444222
ArgH–1.5–1.10.6–1.7–0.80.5888999

Changes in protein expression levels observed for the indicated proteins when compared among growth conditions A–E.

Number of quantification events used to calculate the expression ratios. Quantifications based on less than two events (italicized) were discarded.

The sequences of QchA and ArgC were very similar, resulting in insufficient unique peptides for quantification. Instead, quantification is shown for nonunique peptides.

Changes in protein expression levels observed for the indicated proteins when compared among growth conditions A–E. Number of quantification events used to calculate the expression ratios. Quantifications based on less than two events (italicized) were discarded. The sequences of QchA and ArgC were very similar, resulting in insufficient unique peptides for quantification. Instead, quantification is shown for nonunique peptides. The proteomics analysis demonstrated the expression of the Qch proteins in, among others, culture condition D (NMMP with peptone) and thus indicated the existence of the corresponding catecholate-peptide siderophore under these growth conditions. In our previous metabolomics study of Streptomyces sp. MBT76 under the same conditions,[32] no siderophores were identified, which is most likely due to the use of ethyl acetate for the extraction, which is not suited for the isolation of the hydrophilic peptidic siderophores. Therefore, here, spent media from five culturing conditions were desalted only and directly subjected to reverse-phase LC-MS analysis (in positive mode) without any prior extraction, resulting in the detection of a signal at m/z 772.3 for NMMP (A) and NMMP with peptone (D), with the strongest signal obtained for A (Figure a). The fluctuation pattern of this molecule correlated well with the expression level of the qch gene cluster, suggesting this may be the sought-after compound 2. In addition to the molecular ion [M + H]+ at m/z 772.3 for the iron-free compound, a coeluting peak was observed at m/z 825.3 corresponding to the iron-bound [M + Fe3+2H]+ species. Figure a depicts the combined signals for both species to compensate for any differences in iron(III) concentration among the different culture conditions.
Figure 3

Comparison of qinichelin production by LC-MS analysis. Spent medium samples of Streptomyces sp. MBT76 grown in conditions A–E (a) and in condition A in the absence (red line) or presence (black line) of Fe3+ (b), respectively, were compared. Shown are summed extracted ion chromatograms of [M + H]+, 772.3 m/z; [M + Fe3+ – 2H]+, 825.3 m/z ± 0.5 Da.

Comparison of qinichelin production by LC-MS analysis. Spent medium samples of Streptomyces sp. MBT76 grown in conditions A–E (a) and in condition A in the absence (red line) or presence (black line) of Fe3+ (b), respectively, were compared. Shown are summed extracted ion chromatograms of [M + H]+, 772.3 m/z; [M + Fe3+2H]+, 825.3 m/z ± 0.5 Da. To confirm the structure of 2, the spent medium of condition A was reanalyzed on a high resolution LTQ-orbitrap instrument, including both MS1 and MS2 analysis. Due to the use of formic acid instead of trifluoroacetic acid in the eluent, the MS1 spectrum of 2 presented the highest intensity at m/z 386.6773 assignable to [M + 2H]2+ species, followed by the [M + H]+ peak at m/z 772.3471 (Figure S1), within 0.5 ppm accuracy from the predicted mass. Indeed, the MS2 analysis yielded almost all the expected fragmentation products of the predicted compound 2, with complete sequence coverage for both the b- and y-ion series (Figure a). Moreover, the MS2 analysis corroborated the hydroxylation of two ornithines (hOrn-5 and chOrn-6) at the C-terminus, and the cyclization of the last ornithine (chOrn-6). The most intensive signals were obtained for the b5 and y2 ions, indicating that a potential hydroxamate bond might be more susceptible to cleavage than an amide bond. However, it was noteworthy that MS/MS analysis alone was not enough to indicate the presence of a peptide or isopeptide bond between Ser-4 and hOrn-5. To clarify this, the m/z 772.3 was used as a probe to guide the separation of target compound from the spent medium of condition A on reversed phase HPLC. The obtained semipurified compound 2 was analyzed by 1H NMR (850 MHz, in D2O, Table ), COSY, HSQC, and HMBC techniques (Figures S2–S6), which indeed supported a catecholate–hexapeptide architecture comprising three serine and three ornithine residues. In particular, a key HMBC correlation from H2-5 of hOrn-5 to C-1 of Ser-4 established that the linkage between these two residues was through the δ-hydroxylated-amine rather than α-amine of hOrn-5. The free amine group at C-2 of hOrn-5 could be also reflected by the upfield shifted H-2 (δH 3.99), in contrast to the amidated H-2 of Orn-2 (δH 4.44) and chOrn-6 (δH 4.40).
Figure 4

High resolution MS/MS analysis of qinichelin. A spent medium sample of Streptomyces sp. MBT76 grown in condition A was subjected to high resolution LC-MS/MS analysis to obtain insights into the structures of unacetylated (a) and acetylated (b) qinichelin.

Table 3

NMR Data Assignment of Qinichelin[2] in D2O

  13Ca1H
 
residuepositionδcδHintensitymultiplicityJ (Hz)carbon correlated in HMBC
DHB1171.0     
 2117.8     
 3147.6     
 4145.5     
 5120.77.111dd8.5, 1.7DHB (C-3, C-4, C-7)
 6120.76.901t8.5DHB (C-2, C-4)
 7120.67.341dd8.5, 1.7DHB (C-1, C-3, C-5)
Ser-11173.5     
 256.94.661t5.1Ser-1 (C-1, C-3), DHB (C-1)
 362.14.003bm Ser-1 (C-1, C-2)
Orn-21174.5     
 254.54.441dd8.5, 5.1Orn-2 (C-1, C-3, C-4), Ser-1 (C-1)
 328.51.96; 1.83   Orn-2 (C-1, C-2, C-4, C-5)
 424.21.76; 1.72   Orn-2 (C-2, C-3, C-5)
 539.73.022td8.5, 0.85Orn-2 (C-3, C-4)
Ser-31172.2     
 256.54.531t5.1Ser-3 (C-1, C-3), Orn-2 (C-1)
 362.03.862m Ser-3 (C-1, C-2)
Ser-41171.2     
 253.65.031t5.1Ser-4 (C-1, C-3), Ser-3 (C-1)
 361.43.782d5.1Ser-4 (C-1, C-2)
hOrn-51170.1     
 253.83.993bm hOrn-5 (C-1, C-3, C-4)
 328.81.87   hOrn-5 (C-1, C-2, C-5)
 422.21.76   hOrn-5 (C-5)
 548.63.65; 3.67   hOrn-5 (C-3, C-4), Ser-4 (C-1)c
chOrn-61167.2     
 251.54.401dd11.1, 6.0chOrn-6 (C-1, C-3), hOrn-5 (C-1)
 327.11.84; 2.05   chOrn-6 (C-1, C-2, C-4, C-5)
 421.02.01; 1.94    
 552.53.61; 3.67   chOrn-6 (C-3, C-4)

Chemical shifts of the carbon resonances are estimated from the HMBC data set.

Signals from C-3 of Ser-1 and C-2 of hOrn-5 overlapped, and no clear integral could be measured.

Key HMBC correlation confirmed the hydroxamate bond between Ser-4 and hOrn-5.

High resolution MS/MS analysis of qinichelin. A spent medium sample of Streptomyces sp. MBT76 grown in condition A was subjected to high resolution LC-MS/MS analysis to obtain insights into the structures of unacetylated (a) and acetylated (b) qinichelin. Chemical shifts of the carbon resonances are estimated from the HMBC data set. Signals from C-3 of Ser-1 and C-2 of hOrn-5 overlapped, and no clear integral could be measured. Key HMBC correlation confirmed the hydroxamate bond between Ser-4 and hOrn-5. Together, these experiments confirmed the existence and the precise chemical structure of compound 2. With three iron-coordinating groups including one DHB moiety and two hydroxamates, our new compound resembles other mixed-ligand siderophores like amychelin[25] and gobichelin.[42] This strongly suggested that compound 2 was a siderophore, which was named qinichelin. The name refers to the origin of Streptomyces sp. MBT76, which was isolated from the Qinling Mountains in China.[31]

High Resolution MS/MS Analysis Reveals Production of Qinichelin Variants (3–5), Griseobactin, but Not Enterobactin

We suspected that an acetylated analogue of qinichelin could be produced by Streptomyces sp. MBT76, because acetylation by an N-acetyltransferase encoded by qchQ had not yet been found in qinichelin. Indeed, we observed an [M + H]+ species at m/z 814.3587 for acetylated qinichelin, with a slightly longer retention time than qinichelin. The high abundance of an [M + H]+ species instead of [M + 2H]2+ already indicated that one of the two free amines in qinichelin, δ-NH2 in Orn-2 or α-NH2 in hOrn-5, was acetylated, while a derivative with both acetylations was not detected. Upon fragmentation for MS/MS analysis, a surprising result was obtained because the fragmentation spectrum (Figure b) corresponded to a mixture of two different acetylated peptides 3 and 4 (Figure ). Some masses could only be assigned to acetylation at δ-NH2 in Orn-2 while other masses indicated acetylation of α-NH2 in hOrn-5. Since fragmentation of this [M + H]+ ion was less efficient than the unacetylated [M + 2H]2+ ion (Figure a), a complete sequence coverage could not be achieved for b and y ions. However, at least one b or y ion was present for each peptide/hydroxamate bond for both variants, thus providing strong evidence for the position of the post-translational modification. In addition, qinichelin variant 5 gave a [M]+ peak at m/z 755.3314, and the characteristic fragment at m/z 512.2096 indicated an Orn-5 instead of an hOrn-5 residue (Figure S7). We did not obtain sufficient amounts of compounds 3–5 for 2D NMR analysis, as they are minor relative to 2. Since the proteomics analysis also revealed expression of the ent and gri clusters (Table ), we attempted to find their respective products, enterobactin and griseobactin, by MS/MS analysis. Indeed, griseobactin could be readily detected with highest intensity at m/z 394.1720 for the [M + 3H]3+ species, within 0.5 ppm of the expected mass. Another signal was observed for the [M + 2H]2+ species at m/z 590.7538, with an MS/MS fragmentation pattern corresponding exactly with published data.[43] Surprisingly, no enterobactin could be detected. This suggests that only the dhb operon in the ent cluster may be functional for 2,3-DHB precursor supply for griseobactin and qinichelin, but not enterobactin, production in Streptomyces sp. MBT76.

Qinichelin Production Belongs to the Iron Homeostasis Regulon

To support the iron-chelating function of qinichelin and its possible role in iron homeostasis of Streptomyces sp. MBT76, we searched for the occurrence of iron boxes within the qinichelin BGC. Iron boxes are cis-acting elements with a 19 bp palindromic consensus sequence TTAGGTTAGGCTAACCTAA that are bound by DmdR1, the global iron regulator in Streptomyces species.[44] When sufficient iron is available, the DmdR1–Fe2+ complex binds to iron boxes and represses the expression of siderophore biosynthetic and importer genes.[44] The dramatic reduction in qinichelin production under iron-rich conditions suggested that the expression of the qch cluster would also be under the negative control of DmdR1 (Figure b). Indeed, four highly conserved iron boxes were found within the BGC: (i) upstream of the predicted pentacistronic operon qchA-E involved in ornithine synthesis from glutamate, (ii) upstream of qchF coding for the l-ornithine 5-monooxygenase, (iii) upstream of the tricistronic operon (qchN-P) predicted to be involved in qinichelin transport, and (iv) upstream of qchQ that encodes the predicted qinichelin N-acetyltransferase (Figure a, and Table S1). The iron box identified 109 nt upstream of qchF that displayed the perfect palindromic sequence TTAGGTTAGGCTAACCTAA, which made it highly likely that the central NRPS genes of the qch cluster were regulated by DmdR1. Furthermore, the iron box upstream of the predicted qinichelin transporter system (qchN-P in Figure ) presents greater identity to the palindromic consensus sequence bound by DmdR1, compared to most of the iron boxes identified upstream of other siderophore uptake system genes present in the Streptomyces sp. MBT76 genome (Table S1). In addition, three iron boxes were identified in the gri cluster and one in the ent cluster (Figure b,c, and Table S1), suggesting that siderophore production in Streptomyces sp. MBT76 is indeed under control of DmdR1. Interestingly, scanning for ARG boxes (consensus sequence CCATGCATGCCCATTGCATA) that are bound by the arginine repressor ArgR[45] revealed no reliable cis-acting sequences upstream of the qchA-E operon. Instead, the canonical argCJBDR gene cluster outside the qinichelin biosynthetic cluster displayed the putative ARG box at position −87 nt upstream of argC. This suggests differential regulation of the ornithine biosynthetic genes from primary metabolism and those involved in secondary metabolism.

Biosynthesis of Qinichelins Relies on Coordination between Multiple BGCs

The theoretical analysis and the experimental identification of griseobactin and qinichelins allowed us to postulate an intertwined model for the production of catecholate–peptide siderophores in Streptomyces sp. MBT76 (Figure ). The chorismate pathway within the ent gene cluster provides the building block 2,3-DHB to the three NRPS EntF, GriE, and QchI-QchJ, for enterobactin, griseobactin, and qinichelin, respectively. The 2,3-DHB moiety is activated by 2,3-dihydroxybenzoate-AMP ligase EntE and subsequently transferred to stand-alone aryl carrier proteins QchG or EntB2. As the necessary gene coding for the aryl carrier protein is lacking in the griseobactin BGC, this requirement could be remedied by either QchG or EntB2 to deliver the activated 2,3-DHB starter unit for GriE. The further mechanisms for NRPS assembly of enterobactin and griseobactin have been elaborated elsewhere.[36,37] The coordinated expression of multiple NRPS gene clusters for siderophore production in Streptomyces sp. MBT76 is striking but not unprecedented. Similar functional crosstalk between different NRPS BGCs was demonstrated for the assembly of the siderophores erythrochelin in Saccharopolyspora erythraea(14) and rhodochelin in Rhodococcus jostii RHA1.[15] Such crosstalk could enable structural diversity for siderophores on the basis of a limited number of biosynthetic genes and thus confer an evolutionary advantage for the producing bacteria in terms of iron acquisition. In particular, it would be advantageous for one bacterium to evolve specific siderophore(s) for their own benefit, to compete with the “siderophore pirates” that use siderophores biosynthesized by other species.[46] For example, the structurally novel amychelin produced by Amycolatopsis sp. AA4 seems to frustrate “siderophore piracy” of Streptomyces coelicolor by inhibiting its development.[25] Model of intertwined biosynthetic pathways of catechol-peptide siderophores in Streptomyces sp. MBT76. The functional crosstalk among four entirely separate gene clusters ensures the assembly of two types of catechol-peptide siderophores, namely qinichelins and griseobactin. The DHB is supplied by the ent gene cluster and is shared by three NRPS systems. The DHB-hexapeptide backbone in qinichelin follows an orthodox colinear extension model, while the Orn building block could arise from either qchA-E or the canonical arg gene cluster. The differently modified Orn in the dashed boxes are likely accepted by the AOrn domain of modules 2 and 5 to produce different qinichelin variants. The assembly of the catecholate–hexapeptide backbone in qinichelin follows an orthodox linear logic of modular NRPS. Each module in QchI and QchJ contains an adenylation (A) domain for recognition of correct amino acid substrate, whereby Ser-1, Orn-2, Ser-3, Ser-4, hOrn-5, and hOrn-6 are sequentially bound and converted to aminoacyl adenylates. The two serine residues are converted from the initial L form[47] into their D stereoisomer by the epimerization (E) domain in modules 1 and 3. After QchG-mediated incorporation of 2,3-DHB, each condensation (C) domain is successively used to elongate the chain by formation of a peptide bond with the activated amino acid, except for the isopeptide bond catalyzed by C domain 5, while the growing peptide chain is tethered to peptidyl carrier proteins (PP). Finally, qinichelin is released from the last PP domain through an intramolecular nucleophilic substitution of the δ-hydroxylamino group of l-hOrn-6 to the carbonyl group of the thioester. However, it is challenging to understand the enzymology responsible for this reaction, because a usual thioesterase (TE) domain (e.g., in NRPS assembling gobichelin (42) and heterobactin[26]) required for peptide chain release is lacking in the C-terminus of QchJ. It is tempting to speculate that the C domain in module 6 catalyzes both the α-amidation of hOrn-6 to finalize the growing peptide chain and δ-amidation to self-cyclize the last hydroxyornithine (chOrn-6) to release the peptide chain from the NRPS system. A similar scenario for peptide chain release has recently been reported in the biosynthesis of scabichelin,[48] a pentapeptide siderophore containing a C-terminal cyclic hydroxyornithine residue as in qinichelin. The ornithine building block for qinichelin assembly may originate from either the qch cluster or from the canonical arg gene cluster,[41] regulated by DmdR1 and ArgR, respectively. This would allow decoupling of qinichelin production from primary metabolism. The generated Orn precursor is further tailored, including hydroxylation at δ-NH2 by QchF, and/or acetylation at α-NH2 and δ-NH2 by QchQ. Alternatively, α-N-acetylation could arise from the bifunctional enzyme ArgJ (or its counterpart QchC) during ornithine precursor synthesis.[41] The characterization of qinichelin congeners (3–5) provides evidence for substrate flexibility of the AOrn domain in modules 2 and 5, whereby unmodified ornithine (Orn), δ-N-hydroxyl ornithine (hOrn), α-N-acetyl ornithine, δ-N-acetyl ornithine, and δ-N-hydroxyl-α-N-acetyl ornithine can be recognized and incorporated into the NRPS assembly. Still, we cannot rule out that QchQ post-translationally acetylates either free amine after construction of the final qinichelin. Indeed, it is difficult to discriminate between A domains activating Orn and/or hOrn through bioinformatics alone.[49] However, since qinichelin[2] was the major chemical output of the qch gene cluster, unmodified ornithine (Orn) and δ-N-hydroxyl ornithine (hOrn) are most likely preferred by AOrn in module 2 and module 5, respectively.

Conclusions

Actinomycetes adopt versatile strategies to biosynthesize structurally diverse secondary metabolites. This includes the production of a variety of siderophores, although it is not always clear what the advantage is in terms of the competition for iron in the environment. Functional crosstalk among multiple distantly located BGCs is not always predicted well by bioinformatics analysis. Therefore, chemical novelty may be missed if we solely rely on synthetic biology approaches, such as heterologous expression of a single BGC. The “protein-first” method, via reverse natural product proteomining, effectively identified the qch gene cluster expression in Streptomyces sp. MBT76 and further guided the characterization of qinichelins 2–5, a family of new catecholatehydroxamate siderophores. The principles presented in this work can be exploited to discover a broader range of chemical frameworks and to elucidate other intertwined biosynthetic scenarios.

Experimental Section

Strains and Growth Conditions

Streptomyces sp. MBT76 isolation from Qinling mountain soil,[31] general growth conditions, and genome sequencing (GenBank accession number: LNBE00000000) have been described.[32,33] Here, Streptomyces sp. MBT76 was grown in a liquid NMMP medium[50] containing 1% (w/v) glycerol and 0.5% (w/v) mannitol as carbon sources, but lacking polyethylene glycol. This basic NMMP media were perturbed by using four different additives (or no additive) to create varying growth conditions: (A) no additive, (B) 2% (w/v) NaCl, (C) 1% (w/v) starch, (D) 0.8% (w/v) Bacto peptone (Difco), and (E) 0.6% (w/v) Bacto yeast extract (Difco). For the iron-starvation study, the minor element solution[50] was omitted from condition A. All the cultures of Streptomyces sp. MBT76 were incubated at 30 °C for 72 h, with constant shaking at 220 rpm.

Proteomics

Streptomyces sp. MBT76 cells were lysed using acetone/SDS as described.[51] Around 167 μg of protein was precipitated for each sample using chloroform/methanol[52] and then dissolved using RapiGest SF surfactant (Waters). The proteins were further digested with trypsin after iodoacetamide treatment,[53] and the resulting primary amines of the peptides were dimethyl labeled using three combinations of isotopomers of formaldehyde and cyanoborohydride on Sep-Pak C18 200 mg columns (Waters), via CH2O + NaBH3CN, CD2O + NaBH3CN, and 13CD2O + NaBD3CN, as described.[54] Light-, medium-, and heavy-labeled peptides with 4 Da mass differences were mixed 1:1:1 to obtain 0.5 mg for fractionation by cationic exchange (SCX) chromatography using a polysulfethyl A column (PolyLC, 100 × 2.1 mm, particle size 5 μm, average pore size 200 Å). Mobile phases for SCX chromatography consisted of solvent A (10 mM KH2PO4, 20% (v/v) acetonitrile, pH 3) and solvent B (10 mM KH2PO4, 20% (v/v) acetonitrile, 0.5 M KCl, pH 3). The running program for SCX was a gradient of 0–18% solvent B in 18 CV (column volume), 18–30% solvent B in 6 CV, and 30–100% solvent B in 5 CV, at a constant flow rate of 250 μL min–1. In total, 24 peptide fractions were collected for LC-MS/MS analysis on an LTQ-Orbitrap instrument (Thermo).[53] Data analysis was performed using MaxQuant 1.4.1.2,[55] whereby MS/MS spectra were searched against a database of translated coding sequences obtained from the genome of Streptomyces sp. MBT76. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository with the data set identifier PXD006577.

LC-MS Analysis of Metabolites

Spent medium samples were acidified with 1% (v/v) formic acid final concentration) and desalted using StageTips.[56] Next, 20 μL of samples were separated on an Finnigan Surveyor HPLC (Thermo) equipped with a Gemini C18 column (Phenomenex, 4.6 × 50 mm, particle size 3 μm, pore size 110 Å) at a flow rate of 1 mL min–1 and using a 0–50% acetonitrile gradient buffered with 0.1% (v/v) TFA in 10 CV. Mass spectrometry was performed using an Finnigan LCQ advantage (Thermo) equipped with an ESI source in the positive mode and scanning at 160–2,000 m/z. For high resolution LC-MS/MS analysis on an LTQ-orbitrap the same setup was used as above for proteomics analysis,[53] but using different run parameters. Mobile phases were as follows: (A) 0.1% (v/v) formic acid in H2O and (B) 0.1% formic acid in acetonitrile. A 30 min 10–20% B gradient was followed by a 15 min 20–50% B gradient, both at a flow rate of 300 μL/min split to 250 nL min–1 by the LTQ divert valve. For each data-dependent cycle, one full MS scan (100–2000 m/z) acquired at a resolution of 30 000 was followed by two MS/MS scans (100–2000 m/z), again acquired in the orbitrap at a resolution of 30 000, with an ion selection threshold of 1 × 107 counts but no charge exclusions. Other fragmentations parameters were as described for the proteomics analysis.[53] After two fragmentations within 10 s, precursor ions were dynamically excluded for 120 s with an exclusion width of ±10 ppm. The data have been deposited in the GNPS repository (http://gnps.ucsd.edu/) with data set identifier MSV000081504.

Isolation of Qinichelins

A total of 5 mL of spent medium from NMMP-grown cultures was desalted on Sep-Pak SPE C18 200 mg columns (Waters). Columns were first washed with 1 mL of 80% (v/v) acetonitrile + 0.1% (v/v) formic acid and then equilibrated with 1 mL of 0.1% (v/v) formic acid. A total of 5 mL of spent medium was mixed with 1 mL of 5% (v/v) formic acid and loaded onto the column. After s wash with 1 mL of 0.1% (v/v) formic acid, the column was eluted with 600 μL of 80% (v/v) acetonitrile + 0.1% (v/v) formic acid. The resulting sample was dried in a speedvac to remove acetonitrile and resuspended in 900 μL of 3% (v/v) acetonitrile + 0.1% (v/v) formic acid. This desalted sample was separated by HPLC on an Agilent 1200 series instrument equipped with a Gemini C18 column (Phenomenex, 250 × 10 mm, particle size 5 μm, pore size 110 Å), eluting with a gradient of acetonitrile in H2O adjusted with 0.15% (v/v) trifluoroacetic acid from 6% to 12%. The HPLC run was performed in 3 CV at a flow rate of 5 mL/min, and the fractions were collected based on UV absorption at 307 nm. All fractions were analyzed by LC-MS (positive mode) to check the existence of the targeted mass at m/z 772.3. The fraction of interest was lyophilized and subsequently reconstituted in deuterated water (D2O) for NMR (850 MHz) measurement.

DmdR1 and ArgR Regulon Predictions

The putative binding sites for the iron utilization regulator DmdR1 and for the arginine biosynthesis regulator ArgR were detected on the chromosome of Streptomyces sp. MBT76 using the PREDetector software[57] and according to the method described.[58] For the generation of the DmdR1 position weight matrix (PWM), we used the sequence of the iron box which lies at position −82 nt upstream of desA (SCO2782) and was previously shown to be bound by DmdR1 in S. coelicolor.[59] In order to acquire more highly reliable iron boxes to generate the PWM, we scanned the upstream region of the orthologues of desA in five other Streptomyces species and retrieved their respective iron boxes (see Supporting Information Figure S8). A set of ARG boxes experimentally validated in S. clavuligerus(60) and S. coelicolor(45) were used to generate the ArgR PWM (see Supporting Information Figure S9).
  58 in total

Review 1.  Bioactive microbial metabolites.

Authors:  János Bérdy
Journal:  J Antibiot (Tokyo)       Date:  2005-01       Impact factor: 2.649

2.  PREDetector: a new tool to identify regulatory elements in bacterial genomes.

Authors:  Samuel Hiard; Raphaël Marée; Séverine Colson; Paul A Hoskisson; Fritz Titgemeyer; Gilles P van Wezel; Bernard Joris; Louis Wehenkel; Sébastien Rigali
Journal:  Biochem Biophys Res Commun       Date:  2007-04-12       Impact factor: 3.575

Review 3.  Triggers and cues that activate antibiotic production by actinomycetes.

Authors:  Hua Zhu; Stephanie K Sandiford; Gilles P van Wezel
Journal:  J Ind Microbiol Biotechnol       Date:  2013-08-02       Impact factor: 3.346

4.  Purification and characterization of rhodobactin: a mixed ligand siderophore from Rhodococcus rhodochrous strain OFS.

Authors:  Suraj Dhungana; Ryszard Michalczyk; Hakim Boukhalfa; Joseph G Lack; Andrew T Koppisch; Jason M Fairlee; Mitchell T Johnson; Christy E Ruggiero; Seth G John; Matthew M Cox; Cindy C Browder; Jennifer H Forsythe; Laura A Vanderberg; Mary P Neu; Larry E Hersman
Journal:  Biometals       Date:  2007-02-02       Impact factor: 2.949

5.  Goodbye to brute force in antibiotic discovery?

Authors:  Roberto Kolter; Gilles P van Wezel
Journal:  Nat Microbiol       Date:  2016-01-27       Impact factor: 17.745

6.  Protocol for micro-purification, enrichment, pre-fractionation and storage of peptides for proteomics using StageTips.

Authors:  Juri Rappsilber; Matthias Mann; Yasushi Ishihama
Journal:  Nat Protoc       Date:  2007       Impact factor: 13.491

7.  Structural characterization of the heterobactin siderophores from Rhodococcus erythropolis PR4 and elucidation of their biosynthetic machinery.

Authors:  Mattia Bosello; Mustafa Zeyadi; Femke I Kraas; Uwe Linne; Xiulan Xie; Mohamed A Marahiel
Journal:  J Nat Prod       Date:  2013-11-26       Impact factor: 4.050

8.  Discovery of C-Glycosylpyranonaphthoquinones in Streptomyces sp. MBT76 by a Combined NMR-Based Metabolomics and Bioinformatics Workflow.

Authors:  Changsheng Wu; Chao Du; Koji Ichinose; Young Hae Choi; Gilles P van Wezel
Journal:  J Nat Prod       Date:  2017-01-27       Impact factor: 4.050

9.  A proteomics approach to discovering natural products and their biosynthetic pathways.

Authors:  Stefanie B Bumpus; Bradley S Evans; Paul M Thomas; Ioanna Ntai; Neil L Kelleher
Journal:  Nat Biotechnol       Date:  2009-09-20       Impact factor: 54.908

10.  Metabolomics-guided analysis of isocoumarin production by Streptomyces species MBT76 and biotransformation of flavonoids and phenylpropanoids.

Authors:  Changsheng Wu; Hua Zhu; Gilles P van Wezel; Young Hae Choi
Journal:  Metabolomics       Date:  2016-03-30       Impact factor: 4.290

View more
  2 in total

Review 1.  Mining for Microbial Gems: Integrating Proteomics in the Postgenomic Natural Product Discovery Pipeline.

Authors:  Chao Du; Gilles P van Wezel
Journal:  Proteomics       Date:  2018-06-10       Impact factor: 3.984

2.  A Single Biosynthetic Gene Cluster Is Responsible for the Production of Bagremycin Antibiotics and Ferroverdin Iron Chelators.

Authors:  Loïc Martinet; Aymeric Naômé; Benoit Deflandre; Marta Maciejewska; Déborah Tellatin; Elodie Tenconi; Nicolas Smargiasso; Edwin de Pauw; Gilles P van Wezel; Sébastien Rigali
Journal:  mBio       Date:  2019-08-13       Impact factor: 7.867

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.