Literature DB >> 32547925

Emerging molecular biology tools and strategies for engineering natural product biosynthesis.

Wei Xu1, Evaldas Klumbys1, Ee Lui Ang1, Huimin Zhao1,2.   

Abstract

Natural products and their related derivatives play a significant role in drug discovery and have been the inspiration for the design of numerous synthetic bioactive compounds. With recent advances in molecular biology, numerous engineering tools and strategies were established to accelerate natural product synthesis in both academic and industrial settings. However, many obstacles in natural product biosynthesis still exist. For example, the native pathways are not appropriate for research or production; the key enzymes do not have enough activity; the native hosts are not suitable for high-level production. Emerging molecular biology tools and strategies have been developed to not only improve natural product titers but also generate novel bioactive compounds. In this review, we will discuss these emerging molecular biology tools and strategies at three main levels: enzyme level, pathway level, and genome level, and highlight their applications in natural product discovery and development.
© 2019 The Authors.

Entities:  

Keywords:  Biosynthesis; Enzyme engineering; Metabolic engineering; Natural products; Pathway engineering; Synthetic biology

Year:  2019        PMID: 32547925      PMCID: PMC7283510          DOI: 10.1016/j.mec.2019.e00108

Source DB:  PubMed          Journal:  Metab Eng Commun        ISSN: 2214-0301


Introduction

Natural products (NPs) have proven to be a rich source of bioactive compounds and drugs for thousands of years. Although numerous NPs, especially microbial NPs, have been isolated, identified, and engineered during the Golden Age of NPs in the 1950s, bioactivity guided NPs discovery strategies remain time consuming and labor intensive (Donadio et al., 2007; Newman and Cragg, 2012). With the rapid development of next-generation sequencing technologies, an increasing number of microbial genomes have been elucidated, enabling a new era of bioinformatics-guided NPs discovery. However, the number of biosynthetic gene clusters (BGCs) identified in silico far exceeds the number of natural products identified so far and most BGCs are silent or not expressed in native hosts under standard laboratory conditions (Banik and Brady, 2010; Katz et al., 2016; Ren et al., 2017b; M. M. Zhang et al., 2017a; M. M. Zhang et al., 2016). Moreover, the laboratory cultured microorganisms represent only a small part of the overall microbial populations in nature. Overall, the rate of discovering novel bioactive natural products has slowed down drastically after the Golden Age, necessitating the development of new molecular biology tools (Galm and Shen, 2006; Ren et al., 2017b; M. M. Zhang et al., 2017a). The traditional paradigm for natural product discovery is typically bioactivity-guided whereas characterization of the corresponding BGC is usually carried out without whole genome sequence information. Consequently, known NPs are kept being re-discovered (Penesyan et al., 2010). In comparison, the modern paradigm for natural product discovery is based on genome sequencing, bioinformatics, and synthetic biology, and focuses on the direct identification and cloning of target BGCs (Ren et al., 2017b; M. M. Zhang et al., 2017a; M. M. Zhang et al., 2016), thereby avoiding the re-discovery of same NPs. Compared to random cloning using library-based approaches (Nah et al., 2017) or in situ manipulation (Tao et al., 2018), direct cloning is a much faster and more rational approach for microbial NP discovery. Moreover, a wide variety of enzyme engineering, pathway engineering, and genome engineering tools are used to either improve the production of target NPs or generate new analogs of target NPs (Fig. 1). In this review, we will discuss new molecular biology tools and strategies for natural product discovery and engineering, with a focus on recent advances (mainly between 2015 and 2019).
Fig. 1

Flowchart of engineering microbial natural product biosynthesis. Briefly, natural product discovery and engineering can be divided into three levels. (1) At the pathway level. A target BGC can be cloned and expressed in a heterologous host if the native host is not suitable for genetic engineering and the biosynthetic pathway can be refactored and optimized for its expression. (2) At the enzyme level. Key enzyme(s) is optimized via directed evolution or rational design if its proficiency needs to be improved. (3) At the genome level. The host genome can be edited if an optimized production host is needed.

Flowchart of engineering microbial natural product biosynthesis. Briefly, natural product discovery and engineering can be divided into three levels. (1) At the pathway level. A target BGC can be cloned and expressed in a heterologous host if the native host is not suitable for genetic engineering and the biosynthetic pathway can be refactored and optimized for its expression. (2) At the enzyme level. Key enzyme(s) is optimized via directed evolution or rational design if its proficiency needs to be improved. (3) At the genome level. The host genome can be edited if an optimized production host is needed.

Molecular biology tools for pathway engineering of BGCs

Direct cloning

Since the late 1980s, numerous natural product BGCs, especially relatively small BGCs, were cloned and expressed in heterologous hosts. The most widely used strategy was random cloning, in which genomic DNA is randomly sheared and cloned into vectors to create libraries for heterologous expression (Nah et al., 2017). However, this strategy is labor-intensive and time consuming. To overcome its limitations, a number of direct cloning strategies have been developed in recent years, in which a target natural product BGC is identified by genome sequencing and bioinformatics tools and then directly cloned for heterologous expression (Fig. 2).
Fig. 2

Scheme for various molecular biology tools for pathway engineering. (A) Direct cloning: a target BGC is identified by bioinformatics and the boundary of the BGC (indicated by small arrows) is confirmed. Various nucleases such as restriction enzymes (RE), CRISPR/Cas9, and the PfAgo-based artificial restriction enzymes (AREs) can be used to excise the target BGC from genomic DNA. A capture vector recovers the target BGC by various methods such as transformation associated recombination (TAR), Gibson assembly (GA), and RecET recombination. (B) Pathway refactoring: every biological element can be amplified and tagged with two homologous ends. All the elements and plasmid backbone can be ligated together with different methods such as TAR, GA, RecET recombination, Golden Gate assembly, and Sequence and Ligation Independent Cloning (SLIC). (C) Pathway optimization: the captured or assembled BGC can be optimized with computational or combinatorial methods. The computational methods can optimize RBSs and promoters of target genes while the combinatorial methods can change some key enzymes to achieve high NP production yield or novel chemical structure.

Scheme for various molecular biology tools for pathway engineering. (A) Direct cloning: a target BGC is identified by bioinformatics and the boundary of the BGC (indicated by small arrows) is confirmed. Various nucleases such as restriction enzymes (RE), CRISPR/Cas9, and the PfAgo-based artificial restriction enzymes (AREs) can be used to excise the target BGC from genomic DNA. A capture vector recovers the target BGC by various methods such as transformation associated recombination (TAR), Gibson assembly (GA), and RecET recombination. (B) Pathway refactoring: every biological element can be amplified and tagged with two homologous ends. All the elements and plasmid backbone can be ligated together with different methods such as TAR, GA, RecET recombination, Golden Gate assembly, and Sequence and Ligation Independent Cloning (SLIC). (C) Pathway optimization: the captured or assembled BGC can be optimized with computational or combinatorial methods. The computational methods can optimize RBSs and promoters of target genes while the combinatorial methods can change some key enzymes to achieve high NP production yield or novel chemical structure. For example, Moore and coworkers took advantage of the natural in vivo homologous recombination mechanism in Saccharomyces cerevisiae to directly capture a BGC of interest from genomic DNA (Yamanaka et al., 2014). The S. cerevisiae/Escherichia coli shuttle vector pCAP01 contained two homology arms with the target BGC and was co-transformed into S. cerevisiae with restriction enzyme (RE) digested genomic DNA harboring the target BGC to yield a large plasmid via homologous recombination. Using this Transformation-associated recombination (TAR) cloning method, a 67 kb nonribosomal peptide synthetase BGC which encodes the taromycin backbone from the marine actinomycete Saccharomonospora sp. CNQ-490 was successfully cloned and expressed in model actinomycete expression host Streptomyces coelicolor. In a follow-up study, a URA3 gene was inserted into pCAP01 under ADH1 promoter as a counter selectable marker so as to achieve high efficiency recombination with shorter capture arms and minimize non-homologous end joining (NHEJ). With the optimized vector pCAP03, two thiotetronic acids BGCs were directly cloned and heterologously expressed in a Streptomyces model host (Tang et al., 2015). Besides Gram-positive actinomycete natural product BGCs, the Moore group also expanded this TAR cloning method into Gram-negative species with the pCAP01 vector (Ross et al., 2015). The resulting plasmid pCAP05 combines yeast cloning elements with Gram-negative elements for heterologous expression. It was successfully used to capture and express the violacein BGC from Pseudoalteromonas luteoviolacea 2ta16 in two proteobacterial hosts, Pseudomonas putida KT2440 and Agrobacterium tumefaciens LBA4404 with robust production (J. J. Zhang et al., 2017b). In addition to using yeast’s native homologous recombination system, Leadlay and coworkers utilized Gibson assembly to recover digested genomic DNA fragments. In this approach, the genomic DNA was digested by two REs and the desired 41 kb BGC fragment was cloned with Gibson assembly (Zhou et al., 2015). Similar to the TAR cloning method, this method also required the use of REs to digest the high-quality genomic DNA and the removal of the small DNA fragments (less than 20 kb) to improve the cloning efficiency. A major limitation of the RE-based cloning methods is that the RE recognition sequences have to be avoided within the target BGCs. Thus, selection of appropriate REs can be difficult or even impossible, especially for large size BGCs such as Type I polyketide synthases (PKS) and nonribosomal peptide synthetases (NRPS) BGCs. Programmable nucleases, exemplified by the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR associated protein (Cas) endonuclease system (Deltcheva et al., 2011; Hsu et al., 2014; Jinek et al., 2012), may offer an alternative to circumvent this constraint. In 2015, Lockey and coworkers linearized a plasmid using CRISPR/Cas9 coupled with a programmed sgRNA and seamlessly inserted a DNA fragment into the plasmid using Gibson assembly (Wang et al., 2015). In the same year, this same strategy was extended to genomic DNA (Jiang et al., 2015). The CRISPR/Cas9 endonuclease system was first used in vitro to digest the high-quality genomic DNA in an agarose matrix. Due to the special digestion method which minimizes mechanical shearing and the specific sgRNA programmed restriction sites, up to 100 kb long bacterial genomic DNA could be cut from the whole genome and completely captured by Gibson assembly with a PCR amplified cloning vector. Similar to traditional RE systems, the CRISPR/Cas9 endonuclease system has also been incorporated with TAR cloning to capture the desired DNA fragment or BGC. Kouprina and coworkers reported that double strands breaks near the target recombination region instead of random breaks dramatically increased the capture efficiency from 0.5% to up to 32% (Lee et al., 2015). In the same year, Keasling and coworkers utilized CRISPR/Cas9-mediated double-strand breaks to assemble tens of DNA parts into S. cerevisiae genome with high efficiency (Jakočiunas et al., 2015). This method, named CasEMBLR, was used to assemble fifteen DNA fragments into a carotenoid pathway, which was integrated into three targeted loci. Another class of programmable nucleases is the artificial restriction enzymes based on a hyperthermophilic Pyrococcus furiosus Argonaute (PfAgo) (Enghiad and Zhao, 2017). The PfAgo system is similar to the CRISPR/Cas system but uses a short DNA guide instead of an sgRNA guide to locate the target cleavage site. Compared to the CRISPR/Cas system, the PfAgo system is more flexible (it can target virtually any sequence) and more active (PfAgo is a multiple turnover enzyme while CRISPR/Cas is a single turnover enzyme). However, this system has not been demonstrated to clone large BGCs yet. Another strategy to overcome the main limitation of the RE-based cloning methods is to use the linear plus linear homologous recombination (LLHR) mechanism mediated by the prophage recombinase RecET (Wang et al., 2016a). The full-length RecE and RecT could mediate highly efficient LLHR for cloning of the target BGCs from genomic DNA. The capture backbone harbors standardized cassettes for horizontal gene transfer and different replicator for selection in different hosts. Subsequently, an improved RecET method (Wang et al., 2018), termed Exonuclease Combined with RecET recombination (ExoCET), was used to directly clone a BGC of >50 kb from bacterial genomic DNA with high precision. After digesting the genomic DNA with REs or CRISPR/Cas endonucleases, the ExoCET system could capture the 106 kb salinomycin BGC.

Pathway refactoring

As regulatory pathways may differ from organism to organism, the productivity of a BGC can significantly decrease when introduced into a heterologous host. In such cases, refactoring the BGCs through decoupling and rewiring of the native regulatory system may be required to achieve initial production (Tan and Liu, 2017). In addition, during refactoring design, BGCs can be dissected into modular units for easy and efficient swap of any required genetic parts (Smanski et al., 2016). The complex nature of NP biosynthesis usually involves large gene clusters, necessitating the development of high-throughput refactoring systems for simultaneous adjustment of multiple genetic elements. To aid in BGC refactoring, computational tools have been developed to help predict genetic element behaviour (Brewster et al., 2012; Salis et al., 2009). Novel molecular biology tools and their applications in natural product biosynthesis will be discussed below.

Homologous recombination based refactoring tools

The homologous recombination mechanism in S. cerevisiae was explored to refactor the BGCs of rebeccamycin, tetarimycin and lazarimides A and B (Montiel et al., 2015), and bottromycin-related metabolites (Eyles et al., 2018). In the first study, yeast homologous recombination was used together with an auxotrophic complementation-based yeast selection system to replace native promoters with orthogonal actinomycetes constitutive promoters and ribosomal binding sites (RBSs). Each bidirectional promoter exchange cassette was amplified with primers containing 40 bp homologous sequences for recombination. The target BGC was co-transformed with PCR-amplified cassette and plated in cassette-specific media for selection. However, the decrease in homologous recombination efficiency with increasing number of desired genetic replacement, and the need to couple selectable markers to the promoter cassettes limit the types of modification achievable by this method. In the second study, TAR-based cloning was employed together with yeast-mediated assembly of DNA fragments to refactor and remodel the bottromycin BGC from Streptomyces scabies (Eyles et al., 2018). The bottromycin BGC was first captured in yeast/E. coli shuttle vector pCAP01 using TAR. Subsequently, yeast-mediated recombination was used to assemble combinations of DNA fragments created by REs targeting naturally occurring restriction sites in the BGC, PCR products, and single stranded oligonucleotides to create remodelled BGCs with refactored promoters, gene deletions and targeted mutations in a single step. This method is not only highly efficient, not hindered by high GC content, marker free, and inexpensive, but also enables flexible modifications to the BGCs. Due to its programmability and specificity, the CRISPR/Cas9 system has opened new avenues for BGC refactoring. Yeast-based promoter engineering platform named multiplexed-CRISPR-TAR (mCRISTAR) was developed to enable single-marker multiplexed promoter engineering (Kang et al., 2016). This approach involves fragmentation of a BGC of interest using a CRISPR/Cas9 system targeting the native promoter sequences, followed by TAR mediated reassembly to incorporate synthetic promoters. Similarly, multiplex Cas9-TAR (miCASTAR) was engineered for activation and refactoring of the atolypenes BGC (Kim et al., 2019). Compared to mCRISTAR, miCASTAR does not require construction and transformation of unique CRISPR/Cas9 plasmids into yeast for each refactoring experiment. A strategy similar to the TAR method was developed in E. coli. This method is based on the Red/ET homologous recombination system, involving a λ phage-derived protein pair, Redα/Redβ or RecE/RecT, and 50 bp homology arms (Zhang et al., 2000). It has been widely applied for genetic manipulation purposes and is gaining attention as a refactoring platform (Horbal et al., 2018). For example, Red/ET was used to refactor the bottromycin BGC to increase the production titer by up to 50 fold by simply decoupling the native regulatory system with artificial promoters (Horbal et al., 2018).

Non-homologous recombination based refactoring

Discovery and application of rapid multi-fragment cloning methods is essential for efficient refactoring process. Golden Gate cloning was initially described in 2008 (Engler et al., 2008), and it has since been successfully applied to the refactoring and activation of various BGCs, including the discovery of the BGC for the production of phosphonoacetic acid derivatives (Freestone et al., 2017). In this case, three successive Golden Gate assemblies were required for complete refactoring of the whole BGC from Streptomyces sp. strain NRRL F-525 to the Streptomyces lividans expression platform. An improved two-tiered Golden Gate assembly employing a plug-and-play approach was used to refactor 96 combinatorial carotenoid pathways, simplifying the aforementioned method (Ren et al., 2017a). In the first tier of this approach, genes from a BGC were first individually cloned into helper plasmids preassembled with promoters and terminators to form separate expression cassettes. In the second tier, the expression cassettes were then assembled using Golden Gate assembly to yield refactored BGCs. In addition, the use of spacer plasmids to help bridge sequence gaps in the constructs increases the flexibility of this approach, and facilitates gene deletion and replacement studies. Golden Gate assembly-based refactoring approach requires the removal of unwanted Type II RE recognition sites, which can be tedious. Therefore, RE independent cloning and refactoring offers promising alternatives. ExoCET was used for simultaneous assembly of 12 PCR-amplified products with refactoring of natural spinosad BGC (Song et al., 2019). The efficiency of this cloning method was much higher than Gibson with RecET assembly, whereas Gibson cloning on its own did not yield any successful clones. Another strategy is to combine in vitro and in vivo tools for successful assembly of BGCs. For example, sequence- and ligation-independent cloning (SLIC) harnessing a bacteriophage T4 DNA polymerase for in vitro DNA hybridization and E. coli DNA repair system can be used for assembling refactored BGCs (Jeong et al., 2012; Li and Elledge, 2012). It was applied for direct pathway cloning (DiPaC) (Greunke et al., 2018) using long-amplification PCR for refactoring of the cyanobacterial hapalosin BGC (D’Agostino and Gulder, 2018). This method is superior to the HiFi DNA assembly for cloning, assembly and simultaneous promoter and terminator refactoring of small- and mid-sized BGCs.

Pathway optimization

Cloning and refactoring of target BGCs is essential for the initial production of NPs. However, further optimization of the refactored pathways of interest is almost always necessary in order to obtain high enough yields of the products. For example, expression of BGC enzymes in a heterologous host can negatively affect the host by impairing its growth, disrupting its central metabolism thus reducing yields of desired compounds, or even killing the host (Keasling, 2010). Furthermore, most enzymes in BGCs have not been extensively studied, and usually there is no solubility, stability and activity data available. Therefore, enzymes and pathways may have to be engineered in concert with the host in order to obtain a highly productive functional pathway (Nielsen and Keasling, 2016). The development of standardized parts in synthetic biology has allowed researchers to dissect natural product pathways into genetic elements, such as promoters, RBSs, terminators, and genes of interest, and assemble the elements into redesigned pathways for screening in the design-build-test-learn cycle, while metabolic engineering of the host strain is required to ensure flux balance and host viability. In this section, new combinatorial and computational tools for pathway and host optimization will be discussed.

Computational tools for pathway optimization

As the number of genetic elements increases in a redesigned pathway for BGC refactoring, the number of possible permutations increases exponentially, making it intractable to identify the optimal combination of genetic elements through conventional pathway construction and screening approaches. This drives the use of computational approaches to help reduce the screening load in such endeavours. A machine learning algorithm was applied for RBS optimization in the limonene biosynthetic pathway consisting of 8 genes (Jervis et al., 2019). Initially, the two rate-limiting enzymes, trAg-GPPS and trMs-LimS, were paired with 12 RBS variants each with translation initiation rates spanning over two orders of magnitude in a combinatorial fashion to create a library of 144 possible permutations in total. The resulting RBS sequence-product titer data was used to train a machine learning algorithm, which led to the development of a model capable of predicting limonene titer from RBS sequences. This model was then further applied to a novel reconstituted mevalonate pathway, allowing a reduced library to be screened for improved limonene producers. Similarly, a Reduced Libraries (RebLibs) algorithm was used to facilitate rational minimization of experimental library screening for violacein biosynthesis (Jeschek et al., 2016). It produced a reduced set of uniformly distributed sub-libraries of RBSs for different screening purposes spanning across a range of accessible expression levels and contained degenerate sequences that allow one-pot restriction cloning. In another example, a combinatorial library consisting of 2592 potential pathway configurations was reduced to 16 using a design of experiment (DoE) approach, which were automatically assembled using robot-assisted ligase cycling reaction method (Carbonell et al., 2018). Promoters, order of the four genes, and plasmid copy numbers were varied in the first and second design-build-test-learn cycles. The selected constructs showed up to 500-fold improvement of pinocembrin production. The DoE approach was also used to guide combinatorial pathway engineering containing five genes for violacein production (P. Xu et al., 2017). A T7 promoter library was applied in combination with Plackett–Bruman and Box–Behnken designs to probe the gene expression levels and discriminate between potential high and low producers. This approach led to identification of optimal strength promoters for VioAB, VioD, and VioEC protein expression levels and overall 3-fold increase in violacein titer. The sub-libraries of genetic elements and pathway assembly were constructed using ePathBrick vector system comprising of four compatible restriction sites allowing multiple enzyme manipulation simultaneously (Xu et al., 2012).

Combinatorial tools for pathway optimization

Combinatorial pathway optimization involves modifications of target BGCs by introducing genetic elements for easy replacement or deletion of regulators, RBSs, and enzymes. A ‘Plug and Play’ platform was developed for synthesis of C10 monoterpenoids on the basis of the common substrate geranyl pyrophosphate (GPP) and product-determining monoterpene cyclases and synthases (Leferink et al., 2016). The modular nature of the monoterpenoid pathway allows the flexible switch between a library of monoterpene cyclases and synthases using traditional restriction cloning. This approach produced more than 30 different terpenes of various classes from glucose, including several compounds never produced before in engineered microbes, demonstrating the efficacy of standardized part-based system in E. coli. In another related study, a method named Multidimensional Heuristic Process (MHP) in combination with Cross-Lapping In Vitro Assembly (CLIVA) (Zou et al., 2013) was used to optimize the biosynthetic pathways of lycopene related compounds (Zhang et al., 2018). This modular pathway optimization approach allows screening of a library of diverse promoters, RBSs and enzyme variants in combination or individually to identify most efficient E. coli producers. Moreover, the Synthetic Chromosome Rearrangement and Modification by LoxP-mediated Evolution system (SCRaMbLE) was applied in a two-step manner for optimization of β-carotene and violacein BGCs (Liu et al., 2018). In this method, termed SCRaMBLE-in, pathway diversity was first created using purified recombinases to integrate genetic regulatory elements into the corresponding pathways. This is followed by random integration of the whole BGC into synthetic yeast chromosomes while creating combinatorial genome rearrangements in the host through an in vivo genome reshuffling system. This method enabled pathway optimization and host evolution to be carried out simultaneously to generate strains with 2-fold and 17-fold improved β-carotene and violacein titers, respectively.

Molecular biology tools for engineering enzymes in a pathway

New strategies and tools for enzyme engineering

The titers of natural products in heterologous hosts usually are low due to many reasons such as specific substrate and product inhibition, enzyme instability, lack of high flux of intermediates or even promiscuity of the enzymes. This arises from the intrinsic nature of natural product synthesis as the native organisms do not require natural products at high concentrations and can generate many derivatives starting from the same intermediates. In the past few years, a number of enzyme engineering strategies have been used to enhance natural product production. The paradigm for optimizing natural product synthesis shifted towards dissecting each cascade into single enzyme units to find the rate limiting steps. In this section, enzyme engineering tools and approaches in natural product pathways will be discussed, with emphasis on enzyme engineering in the context of a pathway.

Improving enzyme activity

Transfer of substrate between enzymes in a natural product biosynthetic pathway in a heterologous host can be a rate limiting step due to potential side reactions, diffusion issues and instability of intermediates. To avoid these issues, a fusion protein strategy may be used. For example, four novel linkers were developed and studied including natural CrtB/CrtY linker, short segment connecting the FMN and FAD/NADPH domains of cytochrome P450 reductase (CPR), and two synthetic flexible linkers for a tridomain CrtB/CrtY/CrtI system producing β-carotene and neurosporene (Rabeharindranto et al., 2019). However, physical linkers can affect enzymatic activity and modular scaffolds can become a heavy burden for protein expressing cells (Lee et al., 2012). An alternative strategy involves naturally occurring self-assembly systems such as PDZ domains and the corresponding PDZ ligands with their interaction facilitated by formation of additional β strand of ligand in the groove between α and β helices of PDZ domain (Lee and Zheng, 2010). The integration of PDZ and the corresponding ligand in enzyme assembly was successfully demonstrated using squalene epoxidase (ERG1) and dammarenediol synthase (PgDDS) showing 2-fold increase in production of ginsenoside intermediates (Zhao et al., 2016). Rational enzyme segregation and engineering followed by re-integration to optimize necessary cascade reactions is an emerging strategy for optimizing and diverging natural product biosynthesis. Current enzyme engineering tools can be categorized into sequence based and structure-mechanism driven (Davids et al., 2013), and their application in this strategy will be briefly mentioned. Primary approaches involve homology modelling based on sequence or structure similarities followed by genetic manipulation using traditional molecular biology tools. This strategy in combination with site directed mutagenesis (SDM) was applied for fine tuning of Erg20p enzyme producing either GPP, farnesyl pyrophosphate (FPP) (Ignea et al., 2014), or geranylgeranyl pyrophosphate (GGPP) (Ignea et al., 2015) substrates for terpene synthases, which improved terpene yields by more than 10 fold. In addition, SDM integration with alanine scanning allowed engineering of S-limonene synthase active site plasticity (Srividya et al., 2015; J. Xu et al., 2017) and combination of the latter technique with saturation mutagenesis was shown to be effective in regulating stereoselectivity of taxadiene synthase (Edgar et al., 2017).

Functional expression of pathway genes

Reconstitution of plant BGCs in a microbial heterologous host is complicated due to different cellular compartmentalization of the pathways in the native hosts. In particular, efforts in enabling functional expression of P450 enzymes with their CPR partners in heterologous hosts will be highlighted, which has been the focus of many studies in the past few years. These P450 enzymes are crucial for formation of various enantiomeric centers giving rise to unique properties of natural products, but expression of some of the membrane-bound plant P450 enzymes in yeast or E. coli is one of the limiting steps in reconstitution of functional BGCs in heterologous hosts (Schlegel et al., 2009). Current strategies rely on expression of the rate limiting or insoluble enzymes from higher copy number; however this approach could only be applied in a few cases showing limited efficiency in natural product production (Brown et al., 2015). The emerging strategies for expression of functional P450 enzyme systems within BGCs include traditional tools for single enzyme systems such as a mix-and-match approach using CPR libraries to find the right partner enzymes (Trenchard and Smolke, 2015) as shown in the reconstitution of the sanguinarine branch of the benzylisoquinoline alkaloids. This strategy is not applicable in cases where P450 enzymes do not show soluble expression at any level, thereby making targeted N-terminal signal peptide engineering the primary approach. Target signal peptide replacement strategies involve swapping with either homologous P450 domain as shown in the case of salutaridine synthase enzyme (Galanie et al., 2015) or homologous non-P450 membrane bound enzyme signal peptide (Fossati et al., 2014). Advances in plant enzyme expression platforms are required because current strategies are limited to each specific BGC.

Screening methods and tools for enzyme engineering

Optimization of the enzymes responsible for natural product biosynthesis relies on the ability to detect intermediates or final products. Traditional analytical chemistry detection methods such as gas chromatography (GC), liquid chromatography (LC), their combination with mass spectrometry (MS) and nuclear magnetic resonance (NMR) require time-consuming sample preparation and have limited screening capacity (Xu et al., 2007). Within this context, in vivo biosensors for detecting small molecules as an alternative to analytical chemistry is gaining attention in the natural product biosynthesis field. Typical biosensors involve enzyme coupled responses, transcriptional factors, riboswitches and RNA (Liu et al., 2015; Yang Liu et al., 2017). In this section, the development of novel biosensor systems for the natural products and their intermediate detection will be discussed. Moreover, the development and versatility of molecular sensing circuits will be emphasized and only recently developed biosensors will be mentioned in this section.

Enzyme biosensor systems

Enzyme biosensors are based on conversion of one of the intermediates or final products to a detectable molecule with either fluorescent or colorimetric properties. An enzyme coupled biosensor was successfully applied in identifying a candidate tyrosine hydroxylase and improving its activity using error-prone PCR (epPCR) for the synthesis of L-3,4-dihydroxyphenylalanine (L-DOPA) from L-tyrosine in yeast (DeLoache et al., 2015). The assay is based on expression of plant DOPA dioxygenase that converts L-DOPA to betalamic acid, which undergoes spontaneous reaction with free amines forming fluorescent betaxanthins. The L-DOPA biosensor was further combined with fluorescent-activated cell sorting (FACS) and used for screening transposon-mediated gene disruption library in yeast (Savitskaya et al., 2019). Random mutations allowed identification of key deletions that improved final yields of L-DOPA. Biosynthetic pathways for isoprene and lycopene diverge at isopentenyl diphosphate and the latter pigment can be produced with commercially available pAC-LYC system harbouring CrtE, CrtB, and CrtI downstream enzymes (Cunningham Jnr et al., 1994). High-throughput screening based on visual detection of lycopene with directed co-evolution of 1-deoxyxylulose-5-phosphate synthase, 1-deoxyxylulose-5-phosphate reductoisomerase and isopentenyldiphosphate (IPP)-dimethylallyldiphosphate (DMAPP) isomerase enzymes using epPCR was investigated allowing identification of enzyme variants with enhanced activity (Lv et al., 2016). The same approach employing cascade enzymatic steps for production of colorimetric molecules using either CrtE/CrtB/CrtI for diterpene or CrtM/CrtN for mono and sesquiterpene detection was used in designing terpene synthase (TPS) biosensor (Furubayashi et al., 2014). In this case, lack of pigmentation in colonies would indicate higher TPS activity due to lower amounts of residual shared isoprenyl diphosphates (GPP, FPP and GGPP) substrates. The biosensor was optimized using taxadiene synthase (diterpene) and further applied for the improvement of 5-epi-aristolochene (sesquiterpene) and geraniol (monoterpene) synthases. The developed CrtM/CrtN sensory system producing diaponeurosporene (yellow pigment) was also applied for screening of pinene synthase mutants in E. coli (Tashiro et al., 2016). Colorimetric high-throughput screening was used in combination with epPCR targeting active site residues. The mutated enzyme was superior to wild-type in competition experiments with limonene synthase.

Genetic biosensors

Genetic biosensors employ the natural transcription and translation machinery for producing reporter signals as opposed to enzyme biosensors producing response molecules in cascade reactions. Phenylalanine is a crucial intermediate in the synthesis of various flavonoids including pinocembrin, galangin, resveratrol and others (Pandey et al., 2016). TyrR transcription factor was shown to induce tyrP promoter in the presence of tyrosine and phenylalanine, and was used as a chassis for the development and optimization of the phenylalanine biosensor (Liu et al., 2017a). As a reporter, yellow fluorescent protein (YFP) and resistance gene strA were chosen for FACS and as a selection marker, respectively. A natural flavonoid naringenin responsive regulatory system was engineered in E. coli based on the PfdeAR-FdeR promoter-transcription factor pair that is naturally responsible for the activation of the naringenin degradation pathway (De Paepe et al., 2018). Essential parameters for naturally occurring sensing circuits were optimized including decoupling and engineering of the detector and effector modules, and RBS optimization for fluorescent signal such as the far-red fluorescent protein mkate2. Another flavonoid molecular sensory tool was developed using a combination of two sets of natural regulatory systems: the PfdeAR-FdeR and a PnodAD1-NodD1 promoter-transcription factor pairs with mkate2 fluorescent reporter (De Paepe et al., 2019). Further, the double biosensor consisting of the previously mentioned FdeR transcription factor with a pG promoter and a cyan fluorescent protein (CFP) reporter gene, and transcription repressor QdoR with p441 promoter controlling GFP expression was engineered for detection of naringenin chalcone, naringenin, kaempferol and quercetin (Siedler et al., 2014). This designed molecular tool in combination with SDM and FACS was successfully used to detect efficient kaempferol producing mutants even at single cell level. The versatility of transcription factor based biosensors was exploited for detection of different classes of natural products including stilbenoids and macrolides. A molecular sensor based on the Bacillus subtilis PadR transcriptional repressor was engineered to detect p-coumaric acid, which is an intermediate of the resveratrol biosynthetic pathway (Siedler et al., 2017). This biosensor in E. coli was combined with p-coumaric acid overproducing S. cerevisiae in a co-culture system to identify the high producers, which were isolated using FACS. The subsequent ligation of p-coumaric acid with coenzyme A to form its CoA thioester performed by p-coumarate:CoA ligase (4CL) is also a crucial step in synthesis of resveratrol (Halls and Yu, 2008) and phenylpropanoids in plants (Ehlting et al., 1999). A resveratrol biosensor was engineered using random mutagenesis of TtgR transcriptional repressor in combination with a LacZ reporter system to optimize 4CL activity (Xiong et al., 2017). A natural macrolide sensory circuit which is used as a host defence mechanism against antibiotics was exploited for application in biosensor design (Rogers et al., 2015). The sensory system was based on the coupling of the MphR transcriptional repressor capable of binding a broad range of macrolides with a GFP reporter gene, yielding a biosensor that produces a macrolide concentration-dependent fluorescence. The sensitivity of MphR towards erythromycin A was improved by more than 10-fold using multisite saturation mutagenesis and epPCR in combination with positive-negative FACS (Kasey et al., 2018). To demonstrate its utility, the MphR-based biosensor was used to rapidly identify erythromycin A producing Aeromicrobium erythreum strains from knockout strains that do not produce erythromycin A in a 96-well plate. Finally, the systematic evolution of ligands by exponential enrichment (SELEX) approach was used to develop naringenin responsive RNA elements fused with tetracycline/H+ antiporter (TetA) and superfolder green fluorescent protein (sGFP) reporters (Xiu et al., 2017). The sensory device was used in conjunction with FACS to establish a platform for flavonoid biosensors. The efficacy of the naringenin biosensors in this study was demonstrated in co-cultures of the naringenin producing strains and the biosensor strains, which showed a positive correlation between naringenin titre and fluorescence signal. Identification of novel natural product responsive RNA elements using strategies such as Parallel Analysis of RNA Conformations Exposed to Ligand binding (PARCEL) (Tapsin et al., 2018) will be essential for development of this type of biosensors. A summary of described biosensors is listed in Table 1.
Table 1

Recently developed biosensors for optimization of natural product pathways.

Transducer
PathwayDetection
Reference
ClassNameSignalResponse
EnzymeDOPA dioxygenaseL-DOPABetaxanthinsFluorescenceDeLoache et al. (2015)
Crt enzymesTerpenesCarotenoidsPigmentationLv et al. (2016); Tashiro et al. (2016)
Transcription factorTyrRFlavonoidYFP/strAFluorescence/antibioticLiu et al. (2017a)
PfdeAR-FdeRFlavonoidmkate2FluorescenceDe Paepe et al. (2018)
PfdeAR-FdeR/PnodAD1-NodD1Flavonoidmkate2FluorescenceDe Paepe et al. (2019)
FdeR-pG/QdoR-p441FlavonoidGFPFluorescenceSiedler et al. (2014)
PadRStilbenoidYFPFluorescenceSiedler et al. (2017)
TtgRStilbenoidLacZPigmentationXiong et al. (2017).
MphRMacrolideGFPFluorescenceKasey et al. (2018)
RNARiboswitchFlavonoidGFP/tetAFluorescence/antibioticXiu et al. (2017)
Recently developed biosensors for optimization of natural product pathways.

Molecular biology tools for engineering the host genome

As the most important source for natural products discovery, Actinobacteria, particularly those of genus Streptomyces, has been the focus of attention in the genome engineering field for decades. Compared with other model organisms such as E. coli and S. cerevisiae, Actinobacteria shows poverty in genetic manipulation tools and a lot of valuable NP producer strains are not genetically tractable (Alberti and Corre, 2019). Recently, the CRISPR/Cas9 system has been successfully established in different species for genome editing. Zhao and coworkers (Cobb et al., 2015) designed the pCRISPomyces-2 system, which harbors a codon-optimized spCas9 endonuclease, a sgRNA expressing cassette and two traditional 1 kb homology repairing donor arms. After conjugated into Streptomyces, the Cas9 endonuclease could generate a double-strand-break (DSB) at the targeted site with the guidance of sgRNA. Subsequently, the chromosome DSB was repaired by homology-dependent-repairing system with DNA donor fragments possessing two homology repair donor arms, resulting in chromosome deletions ranging from 20 bp to 31 kb at 70–100% efficiency. Furthermore, this system could be equipped with multiplexed sgRNA cassettes and corresponding homology repair donors to achieve simultaneous multiplex genome editing. In the same year, three independent groups also successfully developed a CRISPR/Cas9-based genome engineering tool in Streptomyces (Huang et al., 2015; Tong et al., 2015; Zeng et al., 2015). In recent years, the CRISPR/Cas9 technology is applied not only in model Streptomyces strains, but also in more and more other NP producers, such as non-model Streptomyces (Jia et al., 2017; Qin et al., 2017), Micromonospora (Braesel et al., 2019; Cohen and Townsend, 2018; Wolf et al., 2016), Myxococcus (Yang et al., 2017), Clostridium (Q. Li et al., 2016; Wang et al., 2017; Y. Wang et al., 2016b), and even Yarrowia (Gao et al., 2016) and Cyanobacteria (Li et al., 2016a; Wendt et al., 2016). Although the CRISPR/Cas9 system has been widely applied to different hosts, it also has some limitations. The toxicity of Cas9 expression in some industrial Streptomyces strains and the requirement of complex expression constructs for targeting multiple genomic loci have limited its scope of application. Lu and coworkers employed another CRISPR system from Francisella novicida to Streptomyces and successfully developed a high-efficiency CRISPR/Cpf1 system for multiplex genome editing and transcriptional repression (Li et al., 2018). The authors engineered BGCs of three pigments (cryptic polyketone, prodiginine and actinorhodin) with CRISPR/Cpf1 technology and proved that it is a powerful alternative to CRISPR/Cas9 for industrial strains that cannot utilize the CRISPR/Cas9 system. A combination of CRISPR/Cas9 technology and λ Red recombineering based Multiplex Automated Genomic Engineering technology, termed CRMAGE, was shown to achieve extremely high recombineering efficiency between 96.5% and 99.7% for the gene recoding of three genomic targets, while traditional recombineering could only achieve between 0.68% and 5.4% efficiency (Ronda et al., 2016). CRMAGE utilizes two USER-cloned plasmids to quickly and cost-efficiently introduce more than two mutations in a single recombineering round. Besides, degenerate codon usage could be used to expand the CRISPR target range beyond loci with PAM sequence, making CRMAGE capable of targeting virtually any site to generate single point mutations as well as larger mutation libraries. This technology has the potential of automating genome wide engineering. During the engineering of NP producers with CRISPR systems, the toxic effects of Cas9 or Cpf1 expression and unwanted off-target effects are non-negligible problems (Tong et al., 2015). Furthermore, DNA double-strand breaks (DSB) are essential for high-efficiency HR or NHEJ, but DSB in the actinomycetes chromosomal arm region could trigger instability and mutagenesis (Hoff et al., 2018). CRISPR-Base Editing SysTem (CRISPR-BEST) (Tong et al., 2019) and cytidine base editors (CBEs) and adenine base editors (ABEs) (Zhong et al., 2019) enable precise C to T or A to G nucleotide substitutions without double-strand breaks and repairing donor. The cytidine or adenine deaminase was guided by the CRISPR system with dCas9 (catalytically defective Cas9) and performed base deamination reaction to change C to U or A to I. After DNA repair or replication, the U or I could change to T or G and complete C to T or A to G editing. As proof of concept, this base editing technology was applied to engineer different Streptomyces species and could realize simultaneous disruptions of nine different PKS gene clusters in one step. This technology provides a rapid, effective and multiplex genetic engineering approach for Streptomyces, and promises to be a powerful tool for other NP producers in future. Silent BGCs are a treasure grove of novel NPs, which may be activated by genome engineering tools. Zhao and coworkers developed a CRISPR-Cas9 mediated promoter knock-in strategy to activate silent BGCs in their native hosts (M. M. Zhang et al., 2017c). In this strategy, an inactive promoter region was targeted with the CRISPR-Cas9 nuclease and the resulting DSB was repaired through homologous recombination with a donor cassette harboring a constitutive promoter. In this way, the silent gene or genes could be activated, and the entire BGC could recover the ability to produce a NP. Similar to this strategy, Corre and coworkers triggered the expression of a silent gene cluster by inactivating a key transcriptional repressor with the CRISPR/Cas9 system (Alberti et al., 2019). The authors captured a silent BGC via TAR cloning and expressed this BGC in a validated Streptomyces host. However, the BGC was repressed by a transcriptional repressor. Inactivation of this repressor gene with CRISPR/Cas9, triggered the activation of the BGC to produce a novel NP, scleric acid. Another repressor related activation strategy was also developed by Zhao and coworkers (Wang et al., 2019a). This so-called transcription factor decoy strategy uses the repressor binding DNA fragments that are cloned from the target BGC into a plasmid to sequester the cognate repressor(s), resulting in the de-repression of the targeted BGC. This strategy has been successfully used to activate eight large BGCs ranging from 50 to 134 kilobases in multiple streptomycetes. More recently, inspired by the mechanisms of evolution and diversification of NP BGCs, Yoshikuni and coworkers developed a chassis-independent recombinase-assisted genome engineering (CRAGE) technology to activate a BGC via single-step integration of the entire BGC directly into different bacterial chromosomes (Wang et al., 2019b). As proof-of-concept, the authors expressed three known and six previously identified but experimentally elusive NRPS and NRPS-PKS hybrid BGCs. All nine BGCs from Photorhabdus luminescens were expressed in 25 diverse γ-Proteobacteria species with the CRAGE technology. The successful activation of six BGCs showed that the diversity and yield of products from BGCs expressed in closely related strains were greater than these from BGCs expressed in either native or more distantly related strains. Finally, to improve the yield, titer, and productivity of NPs in the heterologous hosts, multiple modes of optimization at the genome level such as gene overexpression, knock-down and knock-out are usually required. For example, Zhao and coworkers reported an orthogonal tri-functional CRISPR system, which combines transcriptional activation, transcriptional interference, and gene deletion (CRISPR-AID) in S. cerevisiae for modular, parallel and high-throughput combinatorial engineering (Lian et al., 2017). As proof of concept, the BGC of β-carotene was engineered with CRISPR-AID and the production was increased by 3-fold in a single step. In another example, Lu and coworkers developed an advanced multiplex site-specific genome engineering method (aMSGE) for BGC integration in actinomycetes (Li et al., 2019). Based on the multiple integrases-multiple attB sites notion, the authors established a plug-and-play toolkit to realize high-efficiency, single-step, multi-locus integration of BGCs. As proof of concept, acetyl-CoA carboxylase genes were introduced in S. coelicolor to produce actinorhodin and up to four copies of the entire 72 kb 5-oxomilbemycin BGC were integrated into Streptomyces hygroscopicus genome to achieve 185.6% increase in titer (from 2.23 to 6.37 g L−1).

Selected examples

Construction and optimization of BGCs for high-level production of complex NPs can be extremely challenging, requiring the use of cloning, refactoring, optimization, enzyme and host engineering tools simultaneously. In this section, we will discuss several selected examples that successfully integrated multiple molecular biology tools for production of industrially relevant compounds. Opioids such as morphine and codeine are important psychoactive compounds used clinically for pain management and palliative care. Currently, opium poppy plants are the main source of opioids, but the production of opioids from plants raises a lot of regulatory issues due to their potential for abuse in society. In addition, production in plants suffers from low yield and susceptibility to external environmental conditions. Therefore, targeted biosynthesis of active opioid compounds in enclosed bioreactor environment was envisioned to be big leap forward for application of opioids for medicinal purposes. Biosynthesis of thebaine, a crucial intermediate of opioids and hydrocodone were reconstituted in E. coli (Nakagawa et al., 2016) and S. cerevisiae (Galanie et al., 2015) production platforms. Both systems were designed using previous knowledge of intermediate synthesis, while enzyme and strain engineering demonstrated the importance of combinatorial pathway optimization. Bacterial and yeast central metabolism was modified and exploited initially to create tyrosine over-producing strains, followed by production of L-DOPA using tyrosine hydroxylase (TYR), dopamine using DOPA decarboxylase (DoDC) before taking diverging pathways for synthesis of (R)-reticuline in both systems. The S. cerevisiae production platform was built using a modular approach containing 17 biosynthetic enzymes for the synthesis of (S)-reticuline, a key intermediate in production of benzylisoquinoline alkaloids (Galanie et al., 2015). The pathway was subdivided into precursor overproduction, tetrahydrobiopterin recycling, as well as (S)-noroclaurine and (S)-reticuline synthesis modules. It required the expression of mammalian, plant, bacterial and yeast enzymes in soluble and active forms to produce detectable levels of (S)-reticuline (20.7 μg L−1). An additional module encoding various rate limiting enzymes was integrated to obtain 82 μg L−1 of (S)-enantiomer for further conversion. Subsequent bioinformatics search identified DRS-DRR epimerase for (R)-reticuline production, which was further combined with yeast codon-optimized P. somniferum salutaridine synthase (yPsSalSyn), P. bracteatum lutaridine reductase (PbSalR), and salutaridinol acetyltransferase (PbSalAT) for thebaine synthesis. SalSyn was incorrectly processed and therefore its N-terminus had to be replaced with cheilanthifoline synthase (CFS) to generate another module containing PbDRS-DRR, yEcCFS1–83-yPbSalSyn92–504, PbSalR, and PsSalAT for conversion of (S)-reticuline to thebaine. The final module encoding the expression of thebaine 6-O-demethylase (T6ODM) from P. somniferum and morphine reductase (MorB) from P. putida M10 was added to yield a strain producing up to ~0.3 μg L−1 of hydrocodone. Alternatively, stepwise fermentation was employed for E. coli production of thebaine and hydrocodone (Nakagawa et al., 2016). The total biosynthesis system using a four-step culture was investigated using strains AN1126 (dopamine), AN1055 ((R, S)-THP), AN1600 ((R,S)-reticuline) and AN1829 (thebaine and hydrocodone). Combinatorial (R, S)-THP and (S)-reticuline producing E. coli strains have been engineered in single batch and stepwise systems demonstrating its applicability (Nakagawa et al., 2014, 2012). Interestingly, absence of the norcoclaurine 6-O-methyltransferase (6OMT) allowed detection of racemic reticuline mixture rather than (S)-enantiomer in the reconstituted pathway (Fig. 3) (Nakagawa et al., 2016). The downstream biosynthesis of salutaridine was catalysed by another P450, SalSyn (SalS – salutaridine synthase), which could only be functionally expressed in an N-terminal truncation form. In comparison with the yeast system, the same enzyme faced issues due to incorrect sorting. Combination with recombinantly expressed plant salutaridine reductase (SalR) and salutaridinol 7-O-acetyltransferase (SalAT) yielded production of thebaine at 2.1 mg mL−1 (Fig. 3). The final two genes encoding thebaine 6-O-demethylase (T6ODM) and morphinone reductase (MorB) (Thodey et al., 2014) were introduced into the thebaine producing strain to generate 360 μg L−1 hydrocodone. The E. coli platform showed more than 1000-fold higher hydrocodone production compared to the yeast system, however neither of the strains could produce industrially relevant amounts. With the development of novel molecular biology tools for functional expression of plant enzymes, especially P450s, and greater understanding of metabolic flux and rapid refactoring of gene clusters, more medically important plant and bacterial metabolites will be produced in heterologous hosts. Comparison between E. coli and S. cerevisiae platforms for biosynthesis of thebaine and hydrocodone demonstrates an immense interest in combinatorial engineering approaches. Such approaches have also been applied in the biosynthesis of noscapine (Li and Smolke, 2016) and cannabinoid derivatives (Luo et al., 2019). Both studies harness yeast central metabolism for precursor production and further integrate enzyme discovery and characterization, and unknown intermediate identification tools.
Fig. 3

Comparison of engineered biosynthetic pathways for de novo production of thebaine and hydrocodone in E. coli (red) and S. cerevisiae (blue). Enzymes and substrates are labelled in different colours for variation of pathway reconstitution in different microorganisms. TYR, tyrosine hydroxylase; DoDC, dopa decarboxylase; NCS, (S)-norcoclaurine synthase; 6OMT, norcoclaurine 6-O-methyltransferase; CNMT, coclaurine N-methyltransferase; NMCH, N-methylcoclaurine hydroxylase; 4′OMT, 3′-hydroxy-N-methylcoclaurine 4′-O-methyltransferase; DRS-DRR/STORR, 1,2-dehydroreticuline synthase-1,2-dehydroreticuline reductase; SalSyn, salutaridine synthase; SalR, salutaridine reductase; SalAT, salutaridinol 7-O-acetyltransferase; T6ODM, thebaine 6-O-demethylase; morB, morphinone reductase. The strategies and tools to optimize different parts of pathways are highlighted in boxes above and below figure. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Comparison of engineered biosynthetic pathways for de novo production of thebaine and hydrocodone in E. coli (red) and S. cerevisiae (blue). Enzymes and substrates are labelled in different colours for variation of pathway reconstitution in different microorganisms. TYR, tyrosine hydroxylase; DoDC, dopa decarboxylase; NCS, (S)-norcoclaurine synthase; 6OMT, norcoclaurine 6-O-methyltransferase; CNMT, coclaurine N-methyltransferase; NMCH, N-methylcoclaurine hydroxylase; 4′OMT, 3′-hydroxy-N-methylcoclaurine 4′-O-methyltransferase; DRS-DRR/STORR, 1,2-dehydroreticuline synthase-1,2-dehydroreticuline reductase; SalSyn, salutaridine synthase; SalR, salutaridine reductase; SalAT, salutaridinol 7-O-acetyltransferase; T6ODM, thebaine 6-O-demethylase; morB, morphinone reductase. The strategies and tools to optimize different parts of pathways are highlighted in boxes above and below figure. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Future prospects and conclusions

A variety of emerging molecular biology tools and strategies have been successfully developed and applied for natural product biosynthesis at pathway, enzyme, and genome engineering levels. However, despite promising successes, many challenges still remain in engineering NP biosynthesis. For example, with the rapid development of genome sequencing technology and bioinformatics, BGCs encoding NPs are much easier to be identified. But it is still a challenge to identify and even clone target BGCs from uncultured microorganisms, which are the majority in the environment. Metagenomics may provide a good platform for identification of novel or cryptic BGCs from soil bacteria while new direct cloning technologies and even DNA synthesis technologies may be needed for the investigation and engineering of these BGCs. Moreover, as the other main source of NPs, BGCs in plants are less investigated, and the biosynthetic genes are usually not organized in clusters. New computational and experimental tools are needed for identification, cloning, expression and optimization of the BGCs from plants in microbial hosts. At the pathway engineering level, the existing methods are still time-consuming and labor intensive. Novel technologies, such as machine learning and automation, are increasingly applied to NP biosynthesis. With the help of computational methods, pathway engineering and optimization could be sped up and more rationally used to reduce screening efforts often associated with combinatorial strategies. At the enzyme engineering level, the engineering strategy strongly depends on the availability of biochemical and structural data. In the future, computational enzyme engineering, especially equipped with deep machine learning technology, could accelerate BGC optimization for NPs production. Moreover, de novo artificial enzyme design is another promising direction. The artificially designed enzymes may catalyze unnatural chemical reactions or achieve remarkable catalytic efficiencies. At the genome engineering level, the CRISPR/Cas-based technologies are the main emerging approach. However, CRISPR/Cas systems have some issues, such as toxicity and off-target effects. Thus, new molecular biology tools for genome engineering are still needed. For microbial host engineering, E. coli, yeast and Streptomyces, which are well studied in the laboratory for many years, are the most commonly used hosts. However, these hosts also have their respective limitations at industrial settings, and development of new non-model organisms as industrial production hosts is one of the future directions. In conclusion, numerous molecular biology tools and strategies have been developed for natural product discovery and engineering in recent years. However, given the vast number of uncharacterized BGCs and the ever-growing importance of NPs in the pharmaceutical industry, more powerful molecular biology tools and strategies are still needed for large-scale and more efficient natural product discovery and engineering. With these tools and strategies, we expect a renaissance of natural product research in the coming years.

Declaration of competing interest

The authors declare no competing financial interests.
  125 in total

Review 1.  Spatial organization of enzymes for metabolic engineering.

Authors:  Hanson Lee; William C DeLoache; John E Dueber
Journal:  Metab Eng       Date:  2011-09-18       Impact factor: 9.783

2.  Development of a Bacterial Biosensor for Rapid Screening of Yeast p-Coumaric Acid Production.

Authors:  Solvej Siedler; Narendar K Khatri; Andrea Zsohár; Inge Kjærbølling; Michael Vogt; Petter Hammar; Christian F Nielsen; Jan Marienhagen; Morten O A Sommer; Haakan N Joensson
Journal:  ACS Synth Biol       Date:  2017-06-21       Impact factor: 5.110

3.  Novel biosensors based on flavonoid-responsive transcriptional regulators introduced into Escherichia coli.

Authors:  Solvej Siedler; Steen G Stahlhut; Sailesh Malla; Jérôme Maury; Ana Rute Neves
Journal:  Metab Eng       Date:  2013-11-01       Impact factor: 9.783

4.  Enhancing Biosynthesis of a Ginsenoside Precursor by Self-Assembly of Two Key Enzymes in Pichia pastoris.

Authors:  Chengcheng Zhao; Xin Gao; Xinbin Liu; Yong Wang; Shengli Yang; Fengqing Wang; Yuhong Ren
Journal:  J Agric Food Chem       Date:  2016-04-21       Impact factor: 5.279

5.  CRAGE enables rapid activation of biosynthetic gene clusters in undomesticated bacteria.

Authors:  Gaoyan Wang; Zhiying Zhao; Jing Ke; Yvonne Engel; Yi-Ming Shi; David Robinson; Kerem Bingol; Zheyun Zhang; Benjamin Bowen; Katherine Louie; Bing Wang; Robert Evans; Yu Miyamoto; Kelly Cheng; Suzanne Kosina; Markus De Raad; Leslie Silva; Alicia Luhrs; Andrea Lubbe; David W Hoyt; Charles Francavilla; Hiroshi Otani; Samuel Deutsch; Nancy M Washton; Edward M Rubin; Nigel J Mouncey; Axel Visel; Trent Northen; Jan-Fang Cheng; Helge B Bode; Yasuo Yoshikuni
Journal:  Nat Microbiol       Date:  2019-10-14       Impact factor: 17.745

6.  Rationally reduced libraries for combinatorial pathway optimization minimizing experimental effort.

Authors:  Markus Jeschek; Daniel Gerngross; Sven Panke
Journal:  Nat Commun       Date:  2016-03-31       Impact factor: 14.919

7.  CRMAGE: CRISPR Optimized MAGE Recombineering.

Authors:  Carlotta Ronda; Lasse Ebdrup Pedersen; Morten O A Sommer; Alex Toftgaard Nielsen
Journal:  Sci Rep       Date:  2016-01-22       Impact factor: 4.379

8.  Engineering biosynthesis of the anticancer alkaloid noscapine in yeast.

Authors:  Yanran Li; Christina D Smolke
Journal:  Nat Commun       Date:  2016-07-05       Impact factor: 14.919

9.  Formicamycins, antibacterial polyketides produced by Streptomyces formicae isolated from African Tetraponera plant-ants.

Authors:  Zhiwei Qin; John T Munnoch; Rebecca Devine; Neil A Holmes; Ryan F Seipke; Karl A Wilkinson; Barrie Wilkinson; Matthew I Hutchings
Journal:  Chem Sci       Date:  2017-02-13       Impact factor: 9.969

Review 10.  CRISPR/Cas9-Based Editing of Streptomyces for Discovery, Characterization, and Production of Natural Products.

Authors:  Weixin Tao; Anna Yang; Zixin Deng; Yuhui Sun
Journal:  Front Microbiol       Date:  2018-07-24       Impact factor: 5.640

View more
  7 in total

1.  Site-Directed Mutagenesis of Large Biosynthetic Gene Clusters via Oligonucleotide Recombineering and CRISPR/Cas9 Targeting.

Authors:  Jia Jia Zhang; Bradley S Moore
Journal:  ACS Synth Biol       Date:  2020-07-06       Impact factor: 5.110

Review 2.  Heterologous production of cyanobacterial compounds.

Authors:  Dipesh Dhakal; Manyun Chen; Hendrik Luesch; Yousong Ding
Journal:  J Ind Microbiol Biotechnol       Date:  2021-06-04       Impact factor: 4.258

3.  Expression of fungal biosynthetic gene clusters in S. cerevisiae for natural product discovery.

Authors:  Zihe Liu; Zhenquan Lin; Jens Nielsen
Journal:  Synth Syst Biotechnol       Date:  2021-01-25

Review 4.  Successful Enzyme Colocalization Strategies in Yeast for Increased Synthesis of Non-native Products.

Authors:  Hannah C Yocum; Anhuy Pham; Nancy A Da Silva
Journal:  Front Bioeng Biotechnol       Date:  2021-02-09

5.  Characterization of cross-species transcription and splicing from Penicillium to Saccharomyces cerevisiae.

Authors:  Zhenquan Lin; Kang Xu; Guang Cai; Yangqingxue Liu; Yi Li; Zhihao Zhang; Jens Nielsen; Shuobo Shi; Zihe Liu
Journal:  J Ind Microbiol Biotechnol       Date:  2021-12-23       Impact factor: 4.258

6.  A Candidate Gene Cluster for the Bioactive Natural Product Gyrophoric Acid in Lichen-Forming Fungi.

Authors:  Garima Singh; Anjuli Calchera; Dominik Merges; Henrique Valim; Jürgen Otte; Imke Schmitt; Francesco Dal Grande
Journal:  Microbiol Spectr       Date:  2022-07-14

Review 7.  Bridging Cyanobacteria to Neurodegenerative Diseases: A New Potential Source of Bioactive Compounds against Alzheimer's Disease.

Authors:  Andrea Castaneda; Ricardo Ferraz; Mónica Vieira; Isabel Cardoso; Vitor Vasconcelos; Rosário Martins
Journal:  Mar Drugs       Date:  2021-06-16       Impact factor: 5.118

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.