Literature DB >> 29666226

Discovery of recombinases enables genome mining of cryptic biosynthetic gene clusters in Burkholderiales species.

Xue Wang1, Haibo Zhou1, Hanna Chen1,2, Xiaoshu Jing1, Wentao Zheng1, Ruijuan Li1, Tao Sun1, Jiaqi Liu1, Jun Fu1, Liujie Huo1, Yue-Zhong Li1, Yuemao Shen1, Xiaoming Ding3, Rolf Müller4, Xiaoying Bian5, Youming Zhang5.   

Abstract

Bacterial genomes encode numerous cryptic biosynthetic gene clusters (BGCs) that represent a largely untapped source of drugs or pesticides. Mining of the cryptic products is limited by the unavailability of streamlined genetic tools in native producers. Precise genome engineering using bacteriophage recombinases is particularly useful for genome mining. However, recombinases are usually host-specific. The genome-guided discovery of novel recombinases and their transient expression could boost cryptic BGC mining. Herein, we reported a genetic system employing Red recombinases from Burkholderiales strain DSM 7029 for efficient genome engineering in several Burkholderiales species that currently lack effective genetic tools. Using specialized recombinases-assisted in situ insertion of functional promoters, we successfully mined five cryptic nonribosomal peptide synthetase/polyketide synthase BGCs, two of which were silent. Two classes of lipopeptides, glidopeptins and rhizomides, were identified through extensive spectroscopic characterization. This recombinase expression strategy offers utility within other bacteria species, allowing bioprospecting for potentially scalable discovery of novel metabolites with attractive bioactivities.
Copyright © 2018 the Author(s). Published by PNAS.

Entities:  

Keywords:  Burkholderiales; genome mining; lipopeptide; natural product; recombinases

Mesh:

Substances:

Year:  2018        PMID: 29666226      PMCID: PMC5939090          DOI: 10.1073/pnas.1720941115

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


Bacteria represent an important source of medicines and pesticides (1, 2). Current genome sequencing projects have revealed a large number of biosynthetic gene clusters (BGCs), but their natural products have not yet been identified (3–5). The enzymes encoded by these unexploited biosynthetic genes may synthesize metabolites that could serve as a reservoir of novel scaffolds for drug screening (6). Two major genome mining strategies, featuring homologous and heterologous expression, were employed to mine cryptic BGCs (7). Homologous expression in the native producers ensures proper substrates, precursors, cofactors, and accessory factors for biosynthesis, which could maximize generation of cryptic BGC biosynthesis products. However, this strategy requires efficient genetic manipulation tools in the producers to meet the demand of high-throughput genome mining in the postgenomic era. Recently, yeast homologous recombination and CRISPR-Cas9 strategies were harnessed to activate silent BGCs by promoter engineering in metagenomic DNA and natural product proficient streptomycetes, respectively (8, 9). However, in other natural product producers, such as bacilli, Burkholderiales, myxobacteria, and cyanobacteria, scalable and efficient genome manipulation is still a frustrating challenge. Red/ET recombineering is an in vivo homologous recombination-based genetic engineering method employed primarily in Escherichia coli by using short homology arms (∼50 bp) based on the expression of either Redα/Redβ from the Red operon of λ phage or the analogous RecE/RecT from Rac prophage, encoded in the E. coli chromosome (10–13). Full-length RecE and RecT mediate linear−linear homologous recombination (LLHR), which can be used to directly clone large DNA regions from genomic DNA into E. coli expression vectors (14), while Redα and Redβ promote homologous recombination between linear and circular DNA molecule [linear plus circular homologous recombination (LCHR)]. In addition, Redαβ recombineering is particularly relevant to bacterial genome engineering for genome mining, because the linear targeting DNA carrying short homology regions (conveniently generated by PCR) is sufficient for gene modification, and dedicated targeting plasmids are not required, unlike in single- or double-crossover mutagenesis (15). The Redαβ system from E. coli has been directly applied to several closely related Gram-negative bacteria for genome engineering, e.g., Salmonella enterica (16), Klebsiella pneumoniae (17), Agrobacterium tumefaciens (18), Pantoea ananatis (19), and naturally transformable Burkholderia species (20, 21). However, it is not functional in other Burkholderiales strains (20, 21). In more distant species, host-specific Redαβ or RecET homolog systems are mandatory (22–29). The order Burkholderiales belongs to β-Proteobacteria and includes several pathogenic bacteria and several environmental species. A number of Burkholderiales genomes contain plentiful putative natural product BGCs (4, 5, 30, 31). The proportion of polyketide synthases (PKSs) and nonribosomal peptide synthetases (NRPSs) in Burkholderia genomes is second only to that of actinobacteria and is higher than that of bacilli, cyanobacteria, and myxobacteria (32). Burkholderia species also exhibit great biosynthetic potential for production of siderophores and lipopeptides (31). Thus, Burkholderiales species have emerged as a new source of natural products with a plethora of BGCs still cryptic or silent (31–34). To access these BGCs, it is urgent to invent Burkholderiales specific homologous expression systems, particularly, novel Red/ET recombinases from Burkholderiales strains for efficient genome engineering. This study invented a workflow for the discovery of novel recombinases and the associated transient expression system to facilitate in situ promoter insertion for cryptic BGC mining (Fig. 1). In particular, a pair of Redαβ homologs (Redαβ7029) in Burkholderiales strain DSM 7029 (35) was discovered, cloned for transient expression, and optimized for the efficient deletion of large chromosomal DNA fragments (up to 200 kb). The specialized recombinases-assisted in situ constitutive promoter insertion also facilitated mining of cryptic BGCs. Three cryptic NRPS-PKS BGCs were activated or mined in the native host. These specialized recombinases were also applied to other Burkholderiales strains that currently lack effective genetic tools and native Redαβ homologs, which resulted in the successful identification of two cryptic NRPS BGCs. Two of five identified BGCs were silent (i.e., products not detectable by liquid chromatography-mass spectrometry (LC-MS) analysis in their native hosts under the given conditions). In addition, two classes of lipopeptides, glidopeptins and rhizomides, were identified from DSM 7029 and Paraburkholderia rhizoxinica HKI 454 (36), respectively.
Fig. 1.

Workflow for the discovery of novel recombinases and a transient expression system to facilitate in situ promoter insertion for cryptic BGC mining. (A) Access to bacterial genome sequences. (B) Bioinformatics analysis to identify cryptic BGCs and also the phage recombinase pairs. (C) Construction of a recombinase transient expression system and verification of its function. (D) Use of a transient recombinase expression to generate BGC-inactivated mutants by gene replacement and BGC activation mutants by in situ promoter insertion. (E) Fermentation and comparative metabolic profiling analysis of mutants and wild-type strains to identify different peaks. The asterisks indicate activated products of a cryptic gene cluster. (F) Structural elucidation of potential products by HRMS and NMR measurements. P, promoter; SM, antibiotic selection marker.

Workflow for the discovery of novel recombinases and a transient expression system to facilitate in situ promoter insertion for cryptic BGC mining. (A) Access to bacterial genome sequences. (B) Bioinformatics analysis to identify cryptic BGCs and also the phage recombinase pairs. (C) Construction of a recombinase transient expression system and verification of its function. (D) Use of a transient recombinase expression to generate BGC-inactivated mutants by gene replacement and BGC activation mutants by in situ promoter insertion. (E) Fermentation and comparative metabolic profiling analysis of mutants and wild-type strains to identify different peaks. The asterisks indicate activated products of a cryptic gene cluster. (F) Structural elucidation of potential products by HRMS and NMR measurements. P, promoter; SM, antibiotic selection marker.

Results

Host-Specific Phage-Derived Recombinase Pairs in Burkholderiales.

It had been shown that the Redαβ or RecET recombinases from E. coli could not be applied to nonnaturally transformable Burkholderia strains (21). This study also confirmed that Redαβ was not functional in the Burkholderiales strain DSM 7029 (Fig. 2). BlastP analysis using the amino acid sequences from E. coli Redαβ or RecET as queries revealed 7 Redαβ-like and 10 RecET-like recombinase pairs in ∼1,000 Burkholderiales strains (most of which belong to Burkholderia and Paraburkholderia genera) (). A Redαβ-like operon in DSM 7029 contains a putative single-strand annealing protein AAW51_RS10370 (Redβ7029) that is functionally equivalent to Redβ, a neighboring 5′-3′ exonuclease AAW51_RS10365 (Redα7029) equivalent to Redα, a putative recombinase AAW51 RS10355 (H7029), and six hypothetical proteins (Fig. 2). A pair of RecET-like recombinases (ET-BDU) in Burkholderia sp. BDU8 harbors a three-gene operon encoding a putative single-strand annealing protein WS71_RS13960 (RecT-BDU), an adjacent exonuclease WS71_RS13965 (RecE-BDU), and a putative inhibitor of the exonuclease WS71_RS13955 (H-BDU) which is similar to the E. coli exonuclease inhibitor Redγ (Fig. 2). These two pairs of recombinases were chosen as representatives of the Redαβ and RecET systems for recombination assays in DSM 7029, because Redαβ7029 are the native recombinases and Redβ7029 shows a high identity to Redβ of E. coli, and RecET-BDU operon includes a putative redγ-like gene (Fig. 2 and ).
Fig. 2.

Selected Red/ET recombinase pairs in Burkholderiales and their functional characterization. (A) Recombinase pairs from E. coli and their homologs in DSM 7029 and Burkholderia sp. BDU8. Arrows with the same shading represent similar functional genes. (B) Functional characterization of recombinase combinations in E. coli using an LCHR assay. BA_7029, Redαβ7029; RedG_BA_7029, Redγ from E. coli combined with Redαβ7029; pluG_BA_7029, Pluγ from Photorhabdus luminescens combined with Redαβ7029; ETh_BDU, Rec ET homologs and a hypothetical protein of Burkholderia sp. BDU8; ETpsy, RecET homologs from P. syringae pv. tomato DC3000. (C) Functional characterization of recombinase combinations for plasmid and genome modification in DSM 7029. RedGBA, Redαβγ from E. coli; RedG, Redγ from E. coli. Error bars, SD; n = 3.

Selected Red/ET recombinase pairs in Burkholderiales and their functional characterization. (A) Recombinase pairs from E. coli and their homologs in DSM 7029 and Burkholderia sp. BDU8. Arrows with the same shading represent similar functional genes. (B) Functional characterization of recombinase combinations in E. coli using an LCHR assay. BA_7029, Redαβ7029; RedG_BA_7029, Redγ from E. coli combined with Redαβ7029; pluG_BA_7029, Pluγ from Photorhabdus luminescens combined with Redαβ7029; ETh_BDU, Rec ET homologs and a hypothetical protein of Burkholderia sp. BDU8; ETpsy, RecET homologs from P. syringae pv. tomato DC3000. (C) Functional characterization of recombinase combinations for plasmid and genome modification in DSM 7029. RedGBA, Redαβγ from E. coli; RedG, Redγ from E. coli. Error bars, SD; n = 3.

Functional Verification and Optimization of Recombinase Pairs.

To study their functions, we constructed a variety of combinations of recombinases (), and two Redγ-like proteins (Redγ or Pluγ) that inhibit endogenous exonucleases and enhance homologous recombination mediated by the Redαβ and RecET systems are also included (28, 37). Transient high-level expression of the phage recombinases is important when manipulating DNA in vivo (13). As Redαβ7029 were transcriptionally silent in DSM 7029 strain (), it is essential to place these phage recombinase genes under an inducible promoter, so that recombination can be induced when required and undesired recombination events can be temporally restricted (10). We first tested the efficiency of induction and stringency of four frequently used inducible promoters, the tetracycline-regulated promoter (tet) (38), the arabinose-regulated promoter (BAD) (39), the isopropyl thiogalactoside-regulated promoter (tac) (40), and the rhamnose-regulated promoter (Rha) (41), using the firefly luciferase gene in a pBBR1 plasmid as a reporter in DSM 7029 () (42). The relative light unit (RLU) of the rhamnose-induced Rha promoter was at least 1.5 times higher than that of the other three inducible promoters with corresponding inducers, and the naive Rha promoter showed very weak RLU (). Although tet promoter is also stringent, previous reports indicated leaky expression during epothilone production in DSM 7029, rendering it unsuitable for our experiments (43). Therefore, Rha promoter was utilized to control recombinase expression in the subsequent work on the basis of its induction efficiency and stringency in DSM 7029. We then tested the recombination efficiency between linear DNA and circular DNA (LCHR) of different recombinase combinations under the control of inducible Rha promoter in E. coli and DSM 7029 using plasmid and genome modification assay, respectively (). Redαβ7029 functions both in E. coli, with colony number per milliliter (cnpm) of ∼400 (Fig. 2), and in DSM 7029, with cnpm of ∼40 (Fig. 2). The hypothetical protein H7029 had no positive effect on recombination (). The addition of Redγ into Redαβ7029 remarkably enhanced the recombination efficiency in both E. coli (Fig. 2) and DSM 7029 (Fig. 2) as it synergizes in function with E. coli Redαβ (37), whereas, Pluγ or HBDU did not. The ETh-BDU recombinase pair and the TEpsy system from Pseudomonas syringae (25) were also functional, but at a low efficiency, with both cnpm less than 10. The Redαβγ from E. coli or Redγ alone did not exhibit recombination activity in DSM 7029 (Fig. 2). The Redαβ7029 operon was more efficient in the LCHR assay (), and the recombination efficiency increased significantly with increase in homology arm length (). Redγ-Redαβ7029 is the most efficient recombinase combination in DSM 7029 (cnpm was ∼160), prompting its use for further optimization and genome engineering. Because competent cell preparation and electroporation are critical for efficient recombineering in bacteria, we established optimal conditions for recombineering in DSM 7029 (). By doing so, the recombination efficiency was enhanced by threefold compared with initial conditions (Fig. 3). A detailed protocol for recombineering in DSM 7029 is included in . We then performed replacements of large DNA regions in the DSM 7029 genome. A 200-kb genome sequence (3054675–3254511) encoding two BGCs was successfully replaced using 50-bp homology arms (Fig. 3 and ). Furthermore, the biomass and growth rate of the 100-kb and 200-kb deletion mutants were improved compared with the wild type (Fig. 3). These results established a technical platform for precise DNA engineering to promote cryptic BGC mining and future genome reduction of DSM 7029 as a chassis for the expression of microbial natural products (43).
Fig. 3.

Optimization of the Redγ-Redαβ7029 recombineering procedure. (A) Comparison of different recombinase combinations as indicated in Fig. 2 in genome modification of DSM 7029 between the initial and optimized procedures. (B) Replacement of large genome sequences (50 kb to 200 kb) with an apramycin resistance gene containing varying homologous arms (50 bp to 100 bp) in DSM 7029 using optimized conditions. Error bars, SD; n = 3. (C) Growth curves of DSM 7029 and 100-kb and 200-kb deletion mutants.

Optimization of the Redγ-Redαβ7029 recombineering procedure. (A) Comparison of different recombinase combinations as indicated in Fig. 2 in genome modification of DSM 7029 between the initial and optimized procedures. (B) Replacement of large genome sequences (50 kb to 200 kb) with an apramycin resistance gene containing varying homologous arms (50 bp to 100 bp) in DSM 7029 using optimized conditions. Error bars, SD; n = 3. (C) Growth curves of DSM 7029 and 100-kb and 200-kb deletion mutants.

Redβ7029 Catalyzes Highly Efficient Single-Stranded DNA Recombination.

Phage recombinases mediating single-stranded oligonucleotide recombination is also efficient for genome engineering in E. coli (11, 44). We investigated the recombination efficiency of the Redγ-Redαβ7029 and Redγ-Redβ7029 mediated genome modification in DSM 7029 using a linear dsDNA and two ssDNA (leading strand and lagging strand) as substrates with the optimal procedure (). Redγ-Redαβ7029 also enabled the use of linear ssDNA as a substrate to modify the genome, but at extremely low efficiency, with cnpm less than 10. Redγ-Redβ7029 was competent at ssDNA recombination; especially when using the lagging ssDNA, the efficiency is substantially improved (cnpm was more than 5,000), 13 times higher than the Redγ-Redαβ7029 mediated dsDNA recombination (cnpm was ∼300) (Fig. 4). Thus, Redβ7029 catalyzes high-efficiency ssDNA recombination for genome engineering, greatly increasing the utility of this recombinase pair. Because the generation of long ssDNA requires in vitro exonuclease digestion (45, 46), we instead utilized the dsDNA recombination directly using a PCR product as a substrate for genome engineering.
Fig. 4.

Comparison of Redαβ7029 recombinase combinations using dsDNA and ssDNA (ld, leading strand; lg, lagging strand) as substrates to modify the genome of DSM 7029 as indicated in . Error bars, SD; n = 3.

Comparison of Redαβ7029 recombinase combinations using dsDNA and ssDNA (ld, leading strand; lg, lagging strand) as substrates to modify the genome of DSM 7029 as indicated in . Error bars, SD; n = 3.

Strategy for Deciphering Cryptic BGCs in DSM 7029.

The precise insertion of a functional promoter is a proven method for activating silent BGCs and enabling production (47). Thus, we employed this genetic tool to perform systematic interrogation of cryptic NRPS-PKS BGCs at low transcriptional levels () by comparative metabolite analysis between an inactivated mutant and a promoter-inserted activated mutant. The DNA sequence of the first NRPS-PKS gene in the target BGC was replaced with an antibiotic selection marker to generate a completely inactivated mutant. The same antibiotic marker along with its constitutive promoter was precisely inserted upstream of the first biosynthetic gene, substituting the promoter region of the main biosynthetic operon and generating an activated mutant in which the antibiotic resistance gene-driven target BGC is within a functional operon (Fig. 1). Using this approach, natural products encoded by cryptic BGCs were predicted, isolated, and characterized on the basis of bioinformatic, mass spectrometry, and NMR analysis. To eliminate the interference of the cytotoxic glidobactins, the predominant natural products in DSM 7029, we completely deleted the glidobactin biosynthetic genes glbB-glbG (48) using Redγ-Redαβ7029 recombineering without addition of an antibiotic marker (). We then constructed inactivated and promoter knock-in mutants of three cryptic BGCs in this glb deletion mutant. The products of three cryptic BGCs, 6A, 7, and 11, were detected by comparing their metabolite profiles using LC-MS (Fig. 5 and ). BGCs 6A and 11 from activated mutants produced several extra peaks at m/z 662, 676 (1), and 690 [M + 2H]2+, and at m/z 762, 776, and 785 [M + 2H]2+, respectively, compared with that of the inactivated mutants or the wild-type strain (Fig. 5). These results demonstrated that BGCs 6A and 11 remained silent under the laboratory fermentation conditions, but were effectively activated by promoter exchange using this system. An extra peak at m/z 478 [M + 2H]2+ was identified in the BGC 7 activated mutant and the wild-type strain but was absent in the inactivated mutant as well.
Fig. 5.

Mining of three cryptic gene clusters from DSM 7029. (A) HPLC-HRMS analysis (BPC+ 650 to 700) of extracts from (i) DSM 7029-Δglb, (ii) inactivated mutant DSM 7029-Δglb-ΔBGC6A, and (iii) activated mutant DSM 7029-Δglb-PApra-BGC6A; the m/z of each peak is indicated, except 1 (m/z = 676). (B) HPLC-HRMS analysis (BPC+ 450 to 500) of extracts from (i) DSM 7029-Δglb, (ii) DSM 7029-Δglb-ΔBGC7, and (iii) DSM 7029-Δglb-PApra-BGC7. (C) HPLC-HRMS analysis (BPC+ 750 to 800) of extracts from (i) DSM 7029-Δglb, (ii) DSM 7029-Δglb-ΔBGC11, and (iii) DSM 7029-Δglb-PApra-BGC11. (D) BGC6A (glpA-H) encoding the glidopeptins with promoter knock-in (P-Apra), and predicted NRPS domain and A domain selectivity for GlpCDE. Parentheses indicate amino acids observed in the NMR-determined final glidopeptin structure that differ from the bioinformatics prediction. A, adenylation domain; C, condensation domain; Cd, dual condensation/epimerization domain; CS, starter condensation domain; Dab, 2,4-diaminobutyric acid; Dhb, 2,3-dehydrobutyric acid; Dhv, α,β-dehydrovaline; T, peptidyl carrier domain. (E) Complete structure of glidopeptin A (1).

Mining of three cryptic gene clusters from DSM 7029. (A) HPLC-HRMS analysis (BPC+ 650 to 700) of extracts from (i) DSM 7029-Δglb, (ii) inactivated mutant DSM 7029-Δglb-ΔBGC6A, and (iii) activated mutant DSM 7029-Δglb-PApra-BGC6A; the m/z of each peak is indicated, except 1 (m/z = 676). (B) HPLC-HRMS analysis (BPC+ 450 to 500) of extracts from (i) DSM 7029-Δglb, (ii) DSM 7029-Δglb-ΔBGC7, and (iii) DSM 7029-Δglb-PApra-BGC7. (C) HPLC-HRMS analysis (BPC+ 750 to 800) of extracts from (i) DSM 7029-Δglb, (ii) DSM 7029-Δglb-ΔBGC11, and (iii) DSM 7029-Δglb-PApra-BGC11. (D) BGC6A (glpA-H) encoding the glidopeptins with promoter knock-in (P-Apra), and predicted NRPS domain and A domain selectivity for GlpCDE. Parentheses indicate amino acids observed in the NMR-determined final glidopeptin structure that differ from the bioinformatics prediction. A, adenylation domain; C, condensation domain; Cd, dual condensation/epimerization domain; CS, starter condensation domain; Dab, 2,4-diaminobutyric acid; Dhb, 2,3-dehydrobutyric acid; Dhv, α,β-dehydrovaline; T, peptidyl carrier domain. (E) Complete structure of glidopeptin A (1).

Identification of the Lipopeptide Glidopeptin A from DSM 7029.

One compound, namely glidopeptin A (1), was isolated from BGC 6A activated mutant. The glidopeptin gene cluster (glp, BGC 6A) contains eight unidirectional genes (glpA-H; Fig. 5 and ). Three large genes, glpCDE, encode NRPSs predicted to form a 12-amino acid lipopeptide because the first condensation (C) domain belongs to a starter type C domain that incorporates an acyl group as a starter unit (49). The additional genes are predicted to be responsible for product transport and for generating unnatural amino acids for use by the adenylation (A) domains of NRPS. The glpA gene encodes a 2,4-diaminobutyrate-4-aminotransferase that is commonly used in 2,4-diaminobutyric acid (Dab) biosynthesis, which is supported by the predicted Dab substrate binding specificity of two A domains, A3 and A7 (Fig. 5). The glpF, glpB, glpG, and glpH genes are predicted to encode for a hydroxylase, a major facilitator transporter in charge of product transport, a histidinol phosphatase, and a membrane protein, respectively (). Biosynthesis was activated by promoter knock-in, and the main compound glidopeptin A (1) was isolated as a colorless oil with a molecular formula established as C60H102N16O19 by the high-resolution electrospray ionization mass spectroscopy (HR ESIMS) ion at m/z 676.3833 [M + 2H]2+ (calculated 676.3826). Its structure was determined on the basis of analysis of 1H and 13C NMR spectroscopy as well as 2D correlation spectroscopies including 1H-1H correlation spectroscopy, heteronuclear multiple-bond correlation (HMBC), and heteronuclear single-quantum coherence. Interpretation of the NMR data established seven proteinogenic amino acid residues including Glu, Lys, Leu, Asn, Gly, and two Ser; five nonproteinogenic amino acid residues including two Dab, two 2,3-dehydrobutyric acids (Dhb), and one α,β-dehydrovaline (Dhv); and a decanoic acid residue (Fig. 5 and ). These residues were connected to form a linear 12-amino acid lipopeptide on the basis of HMBC correlations (). The configurations of the amino acid residues were assigned as d-Ser, l-Glu, d-Dab, l-Lys, l-Leu, d-Dab, l-Ser, and l-Asn, using Marfey’s method combined with the presence of dual condensation/epimerization domains (Fig. 5 and ). Therefore, glidopeptin A was characterized to be a linear 12-amino acid lipopeptide containing a rare C-terminal Dhv that was only found in the myxobacterial nonribosomal peptide myxovalargin (50, 51). The glidopeptin amino acid sequence is consistent with the prediction from glpCDE NRPS (Fig. 5) except that two Dhb and one Dhv are incorporated at the positions predicted to contain Thr (positions 5 and 9) and Val (position 12), respectively. The tandem thioesterases (TE1-TE2) observed at the C terminus of the NRPS are commonly found in large lipopeptide BGCs and generally predicted to catalyze both intramolecular cyclization and proofreading to guarantee fidelity of the biosynthetic assembly line (52, 53). However, our data showed that glidopeptin is a linear lipopeptide, indicating that the glp cluster indeed encodes a family of structural lipopeptides.

Extended Applications of Redαβ7029 Recombinases to Other Burkholderiales Strains for Genome Mining.

This recombineering and BGC mining strategy was not only functional in the native host but was also applied to other Burkholderiales strains that lack native Redαβ recombinase homologs, including P. rhizoxinica HKI 454, an endosymbiont of Rhizopus microspores (36), and Paraburkholderia phytofirmans strain PsJN, a plant growth-promoting endophyte (54). P. rhizoxinica HKI 454 produces antimitotic NRPS-PKS macrolide rhizoxins and harbors 14 additional NRPS gene clusters. However, no peptides corresponding to the NRPSs have been previously isolated (55). The rhizoxins account for a very large proportion of metabolites (56), and their biosynthetic pathway was successfully inactivated by gene replacement using Redγ-Redαβ7029 recombineering with high efficiency, greatly diminishing the metabolite content (). Afterward, inactivated and promoter knock-in mutants for selected cryptic NRPS BGCs were generated via the above strategy (). The products of BGC P1 rhizomides A-C [m/z 732 (2), 748 (3, 4)] and BGC P7 (m/z 833, 789, 803, 783, 817) from plasmid pBRH01were identified (Fig. 6). The yield of rhizomides in promoter knock-in mutants was 70-fold higher than that in the wild-type strain, indicating that the strong promoter remarkably enhanced product yields. Redγ-Redαβ7029 recombineering was also effective in P. phytofirmans strain PsJN for BGC manipulation, but no products were detected ().
Fig. 6.

Mining of two cryptic gene clusters from P. rhizoxinica HKI 454. (A) HPLC-HRMS analysis (BPC+ 700 to 750) of extracts from (i) HKI 454-Δrhi, (ii) HKI 454-Δrhi-ΔBGC P1, and (iii) HKI 454-Δrhi-PApra-BGC P1. (B) HPLC-HRMS analysis (BPC+ 750 to 850) of extracts from (i) HKI 454-Δrhi, (ii) HKI 454-Δrhi -ΔBGC P7, and (iii) HKI 454-Δrhi-PApra-BGC P7; the m/z of each peak is indicated. (C) BGC P1 encoding the rhizomides (gene RBRH_01504), with promoter insertion (P-Apra), predicted NRPS domain, and A domain selectivity for RzmA. (D) Complete structure of rhizomides A−C (2 to 4).

Mining of two cryptic gene clusters from P. rhizoxinica HKI 454. (A) HPLC-HRMS analysis (BPC+ 700 to 750) of extracts from (i) HKI 454-Δrhi, (ii) HKI 454-Δrhi-ΔBGC P1, and (iii) HKI 454-Δrhi-PApra-BGC P1. (B) HPLC-HRMS analysis (BPC+ 750 to 850) of extracts from (i) HKI 454-Δrhi, (ii) HKI 454-Δrhi -ΔBGC P7, and (iii) HKI 454-Δrhi-PApra-BGC P7; the m/z of each peak is indicated. (C) BGC P1 encoding the rhizomides (gene RBRH_01504), with promoter insertion (P-Apra), predicted NRPS domain, and A domain selectivity for RzmA. (D) Complete structure of rhizomides AC (2 to 4).

Identification of Peptide Rhizomides from P. rhizoxinica HKI 454.

The single rhizomide (rzm, BGC P1) biosynthetic gene RBRH01504 encodes a seven-module NRPS with a starter C domain, indicating that its product may be a class of lipopeptides (Fig. 6). Compounds 2 to 4, namely rhizomides A to C, were purified as white amorphous powder and analyzed by HR ESIMS and NMR. The molecular formula of the dominant product 2 was determined to be C35H53N7O10 according to the HR ESIMS ion at m/z 732.3930 [M + H]+ (calculated 732.3927). The NMR data indicated the presence of seven amino acids (). HMBC correlations between amide protons and adjacent carbonyl groups defined the order of the seven amino acids, an acetyl group attached to the Leu N terminus, and intramolecular cyclization through the Thr side chain (). The absolute configurations of the amino acids were confirmed by Marfey’s method and bioinformatics analysis of the C domains (Fig. 6 and ). The two minor products 3 and 4 differ from 2 with one more oxygen atom as indicated by the HR ESIMS-determined formula C35H53N7O11 (m/z 748.3873 [M + H]+, calculated 748.3876), which was supported by the NMR comparison of 3 or 4 with that of 2, indicating that a Ser residue in 3 and 4 replaced the third Ala or the second Ala in 2 (), respectively. Marfey’s analysis of 3 and 4 demonstrated the presence of d- and l-Ser in 3 and 4 (), respectively, supporting the proposed structures (Fig. 6). Therefore, rhizomides AC (2 to 4) were determined to be seven-amino acid cyclic lipopeptides with an N-acetyl-Leu linking to a rare 19-membered lactone. These structures are consistent with the bioinformatic analysis of NRPS RzmA. The starter C domain condenses an unusual short fatty acid chain acetyl-CoA and Leu, and chain extension continues with the incorporation of l-Thr, d-Tyr, d-Ala, l-Ala/Ser, d-Ala/Ser, and l-Val by Rzm modules 2 to 7. The peptide chain is then passed to the TE domain, and is catalyzed into a 19-membered ring. The broad substrate binding specificity of the A5 and A6 domains allowed loading and incorporation of Ser into the extended chain to form the minor rhizomides C and B, respectively.

Bioactivity of Glidopeptin A and Rhizomide A.

Glidopeptin A and rhizomide A showed weak antitumor activity against several human tumor cells lines (IC50 34 μM to 96 μM) (). Glidopeptin A and rhizomide A also showed weak protective activity against cucumber downy mildew (Pseudoperonospora cubenis) () and weak in vitro inhibition activities against Staphylococcus aureus and Bacillus subtilis (), respectively.

Discussion

The chance of a successful mining of cryptic BGCs in a native host is generally higher than that in a heterologous host. However, it is usually inefficient due to a low homologous recombinant efficiency, especially in nonmodel organisms. To address this challenge, we used genome-guided discovery of host-specific phage recombinases and a transient expression system to develop an efficient recombineering method in a natural product producer. This method greatly expedites the construction of an inactive mutant by gene replacement and the activation of the cryptic BGCs by precise promoter insertion upstream of the main biosynthetic gene. This method is superior to the classical but laborious single crossover mutagenesis, especially considering the scalable manipulation of abundant BGCs. The transient expression of the recombinase ensures high-level expression for efficient DNA recombination, and diminishes the toxicity of recombinases and undesirable genome rearrangements. In addition, we used the same antibiotic selection for BGC inactivation and activation, which minimized disturbances during metabolite analysis. The use of an antibiotic promoter driving a BGC bypassed the complex screening of a feasible promoter for each producer. In addition to precise promoter insertion, more complex engineering such as domain/module swapping and seamless mutagenesis can be accomplished using the recombineering method in native producers. Once a potentially new compound is detected, biosynthesis dissection by in-frame gene deletion, structural optimization by combinatorial biosynthesis, and yield increase by host improvement or manipulation of regulators could easily be performed in the original host. Recombineering based on the host-specific recombinases Redαβ7029, pair of native bacteriophage recombinases in Burkholderiales, allowed us to investigate cryptic BGCs efficiently not only in the native DSM 7029 but also in other Burkholderiales species that lack effective genetic tools or native Redαβ recombinase homologs. A total of 15 metabolites from five BGCs was discovered in this study, and four metabolites were isolated and elucidated as compounds. Intriguingly, of five activated BGCs, two did not produce detectable products in the native producers under the same growth conditions, indicating that the promoter insertion strategy activates the expression of silent clusters. One cluster also showed a remarkably increased yield in the promoter knock-in mutant. Admittedly, some cryptic BGC products remain undiscovered, partially because they are present in very low abundance despite the improved expression level of biosynthetic genes using the promoter insertion strategy. It is possible that the small quantity of products escaped detection, due to the limits of the analytical methods or statistical data evaluation. Our strategy, combined with comprehensive metabolome mining, could further improve the likelihood of discovering specialized metabolites using genome mining (57, 58). In conclusion, we have demonstrated that a recombinase transient expression strategy was able to systematically activate cryptic gene clusters from Burkholderiales species. The metabolites can be detected by LC-MS and characterized by NMR. This strategy can be applied to more natural product producers for cryptic BGC investigation, and, when combined with medium optimization or metabolome mining, can mine the biosynthetic potential of a strain for the discovery of new molecules. Recombineering of the BGCs in native producers greatly facilitates genome engineering for host amelioration, speeds up the discovery of new compounds with attractive bioactivities, and optimizes metabolite production through combinatorial biosynthesis for pharmaceutical and agricultural applications.

Materials and Methods

Methods describing plasmid construction, promoter screening, detailed recombineering protocol, gene cluster inactivation, promoter insertion, fermentation and extraction of strains, HPLC-ESI-HRMS analyses, isolation and characterization of compounds, and bioactivity test are described in detail in . tables include strains, plasmids, mutants in this work (), recombinase pairs (), gene annotation of involved gene clusters (), primers used in this study (), transcript level of selected genes (), promoter sequences () and NMR data of compounds (), and the protective activity of glidopeptin A and rhizomide A against six plant diseases (). figures include inducible promoter screening (), recombination efficiency of different recombinases in E. coli and DSM 7029 (), the recombination efficiency of the LCHR and LLHR mediated by Redγ-Redαβ7029 in E. coli (), optimization of the work condition of Redγ-Redαβ7029 in DSM 7029 (), seamless deletion of glidobactin gene cluster (), inactivation and activation of BGC 6A, BGC 7, and BGC 11 in DSM 7029 (), inactivation of rhizoxin gene cluster in P. rhizoxinica HKI 454 (), inactivation and activation of BGC P1 and BGC P7 in P. rhizoxinica HKI 454 (), inactivation and activation of BGC 2 in P. phytofirmans PsJN (), and NMR spectra of compounds 1 to 4 (). Cytotoxic activity and bioactivity of glidopeptin A and rhizomide A are described ().
  58 in total

1.  A rapid and simple method for inactivating chromosomal genes in Yersinia.

Authors:  Anne Derbise; Biliana Lesic; Denis Dacheux; Jean Marc Ghigo; Elisabeth Carniel
Journal:  FEMS Immunol Med Microbiol       Date:  2003-09-22

2.  Identification and characterization of the lysobactin biosynthetic gene cluster reveals mechanistic insights into an unusual termination module architecture.

Authors:  Jie Hou; Lars Robbel; Mohamed A Marahiel
Journal:  Chem Biol       Date:  2011-05-27

3.  Complete genome sequence of the glidobactin producing strain [Polyangium] brachysporum DSM 7029.

Authors:  Biao Tang; Yucong Yu; Youming Zhang; Guoping Zhao; Xiaoming Ding
Journal:  J Biotechnol       Date:  2015-06-30       Impact factor: 3.307

4.  The myxovalargins, new peptide antibiotics from Myxococcus fulvus (Myxobacterales). I. Cultivation, isolation, and some chemical and biological properties.

Authors:  H Irschik; K Gerth; T Kemmer; H Steinmetz; H Reichenbach
Journal:  J Antibiot (Tokyo)       Date:  1983-01       Impact factor: 2.649

Review 5.  Genome-guided discovery of diverse natural products from Burkholderia sp.

Authors:  Xiangyang Liu; Yi-Qiang Cheng
Journal:  J Ind Microbiol Biotechnol       Date:  2013-11-09       Impact factor: 3.346

6.  A new logic for DNA engineering using recombination in Escherichia coli.

Authors:  Y Zhang; F Buchholz; J P Muyrers; A F Stewart
Journal:  Nat Genet       Date:  1998-10       Impact factor: 38.330

Review 7.  Microbial genome mining for accelerated natural products discovery: is a renaissance in the making?

Authors:  Brian O Bachmann; Steven G Van Lanen; Richard H Baltz
Journal:  J Ind Microbiol Biotechnol       Date:  2013-12-17       Impact factor: 3.346

8.  A new recombineering system for Photorhabdus and Xenorhabdus.

Authors:  Jia Yin; Hongbo Zhu; Liqiu Xia; Xuezhi Ding; Thomas Hoffmann; Michael Hoffmann; Xiaoying Bian; Rolf Müller; Jun Fu; A Francis Stewart; Youming Zhang
Journal:  Nucleic Acids Res       Date:  2014-12-24       Impact factor: 16.971

9.  Complete genome sequence of the plant growth-promoting endophyte Burkholderia phytofirmans strain PsJN.

Authors:  Alexandra Weilharter; Birgit Mitter; Maria V Shin; Patrick S G Chain; Jerzy Nowak; Angela Sessitsch
Journal:  J Bacteriol       Date:  2011-05-06       Impact factor: 3.490

10.  Studies on the synthesis of peptides containing dehydrovaline and dehydroisoleucine based on copper-mediated enamide formation.

Authors:  Franziska Gille; Andreas Kirschning
Journal:  Beilstein J Org Chem       Date:  2016-03-22       Impact factor: 2.883

View more
  23 in total

1.  Atolypenes, Tricyclic Bacterial Sesterterpenes Discovered Using a Multiplexed In Vitro Cas9-TAR Gene Cluster Refactoring Approach.

Authors:  Seong-Hwan Kim; Wanli Lu; Mahmoud Kamal Ahmadi; Daniel Montiel; Melinda A Ternei; Sean F Brady
Journal:  ACS Synth Biol       Date:  2018-12-21       Impact factor: 5.110

2.  Burkholderia as a Source of Natural Products.

Authors:  Sylvia Kunakom; Alessandra S Eustáquio
Journal:  J Nat Prod       Date:  2019-07-11       Impact factor: 4.050

3.  Streptomyces spinosus sp. nov. and Streptomyces shenzhenensis subsp. oryzicola subsp. nov. endophytic actinobacteria isolated from Jasmine rice and their genome mining for potential as antibiotic producers and plant growth promoters.

Authors:  Onuma Kaewkla; Sudarat Sukpanoa; Chanwit Suriyachadkun; Nitcha Chamroensaksi; Theeraphan Chumroenphat; Christopher Milton Mathew Franco
Journal:  Antonie Van Leeuwenhoek       Date:  2022-05-21       Impact factor: 2.271

Review 4.  Synthetic biology enabling access to designer polyketides.

Authors:  Alexandra A Malico; Lindsay Nichols; Gavin J Williams
Journal:  Curr Opin Chem Biol       Date:  2020-08-04       Impact factor: 8.822

5.  Biotechnological and Ecological Potential of Micromonospora provocatoris sp. nov., a Gifted Strain Isolated from the Challenger Deep of the Mariana Trench.

Authors:  Wael M Abdel-Mageed; Lamya H Al-Wahaibi; Burhan Lehri; Muneera S M Al-Saleem; Michael Goodfellow; Ali B Kusuma; Imen Nouioui; Hariadi Soleh; Wasu Pathom-Aree; Marcel Jaspars; Andrey V Karlyshev
Journal:  Mar Drugs       Date:  2021-04-25       Impact factor: 5.118

6.  At the Gate of Mutualism: Identification of Genomic Traits Predisposing to Insect-Bacterial Symbiosis in Pathogenic Strains of the Aphid Symbiont Serratia symbiotica.

Authors:  François Renoz; Vincent Foray; Jérôme Ambroise; Patrice Baa-Puyoulet; Bertrand Bearzatto; Gipsi Lima Mendez; Alina S Grigorescu; Jacques Mahillon; Patrick Mardulyn; Jean-Luc Gala; Federica Calevro; Thierry Hance
Journal:  Front Cell Infect Microbiol       Date:  2021-06-29       Impact factor: 5.293

Review 7.  Alkaloids in Contemporary Drug Discovery to Meet Global Disease Needs.

Authors:  Sharna-Kay Daley; Geoffrey A Cordell
Journal:  Molecules       Date:  2021-06-22       Impact factor: 4.411

8.  Pan-Genome Analysis Reveals Host-Specific Functional Divergences in Burkholderia gladioli.

Authors:  Hyun-Hee Lee; Jungwook Park; Hyejung Jung; Young-Su Seo
Journal:  Microorganisms       Date:  2021-05-22

Review 9.  Exploring endophytes for in vitro synthesis of bioactive compounds similar to metabolites produced in vivo by host plants.

Authors:  Hemant Sharma; Arun Kumar Rai; Divakar Dahiya; Rajen Chettri; Poonam Singh Nigam
Journal:  AIMS Microbiol       Date:  2021-05-26

Review 10.  Mining and unearthing hidden biosynthetic potential.

Authors:  Kirstin Scherlach; Christian Hertweck
Journal:  Nat Commun       Date:  2021-06-23       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.