| Literature DB >> 34171170 |
Abstract
The field of synthetic biology is evolving at a fast pace. It is advancing beyond single-gene alterations in single hosts to the logical design of complex circuits and the development of integrated synthetic genomes. Recent breakthroughs in deep learning, which is increasingly used in de novo assembly of DNA components with predictable effects, are also aiding the discipline. Despite advances in computing, the field is still reliant on the availability of pre-characterized DNA parts, whether natural or synthetic, to regulate gene expression in bacteria and make valuable compounds. In this review, we discuss the different bacterial synthetic biology methodologies employed in the creation of 5' regulatory regions - promoters, untranslated regions and 5'-end of coding sequences. We summarize methodologies and discuss their significance for each of the functional DNA components, and highlight the key advances made in bacterial engineering by concentrating on their flaws and strengths. We end the review by outlining the issues that the discipline may face in the near future.Entities:
Mesh:
Year: 2021 PMID: 34171170 PMCID: PMC8601185 DOI: 10.1111/1751-7915.13868
Source DB: PubMed Journal: Microb Biotechnol ISSN: 1751-7915 Impact factor: 5.813
Fig. 1Schematic representation of 5′ regulatory regions in bacteria. RNA polymerase α‐subunits, C‐terminal domain (CTD) I and II, interact with the proximal and distal subsites of the UP element (UP); sigma (σ)‐subunit region 2 (R2) and region 4 (R4) interact with the −10 and −35 promoter motifs respectively (please note this depiction applies to σ70, other σ‐factors may have different binding preferences). ITR, initially transcribed region; SD, Shine–Dalgarno sequence; 5′ UTR, 5′ untranslated region; TrSS, transcriptional start site (+1); TnSS, translational start site. The regulatory regions are not to scale.
Fig. 2Overview of methods used to create novel 5′ regulatory sequences. Promoter (green), 5′ untranslated region (blue) and CDS (orange).
Fig. 3Overview over methods discussed in the context dependency section. A: Translational coupling has two different mechanisms: Termination–reinitiation (i) and upstream‐dependent de novo initiation.
A. i: Translational coupling happens when the end of a gene overlaps with the start of the next gene. This can, for example, be achieved by having the stop and the start codon overlap. The ribosome that translated the first gene will continue translating the second gene after terminating translation of the first gene. For the bicistronic design, the first gene is shortened to a leader peptide sequence (or fore‐cistron). The leader sequence contains a second RBS close to the stop/start codon overlap to make the ribosome reinitiate. A. ii: Gene 1 and Gene 2 are part of the same operon. The mRNA builds a secondary structure within the second gene or in the intergenic region, which sequesters the RBS of the downstream gene. Once gene 1 is translated, the secondary structure unfolds, and the downstream RBS becomes available for the 30S subunit of a ribosome to bind to. The TARSyN method makes use of this principle by adding hairpins in between two genes in one operon. The hairpin sequesters the RBS of the downstream gene, and the secondary structure unfolds when the upstream gene is translated.
B: Insulators insulate the RBS from upstream genetic context by (i) self‐cleavage through a ribozyme or (ii) externally mediated RNA cleavage such as by endoRNase Csy4, which recognizes specific CRISPR sequences in the mRNA.
C: Toehold switches sequester the RBS and start codon within a secondary structure (OFF state), which unfolds when a complementary trigger RNA binds and thus makes the RBS accessible (ON state).
D. uASPIre is a method to test regulatory sequences (RESs) and to record phenotypically if the RES leads to expression. The to‐be‐tested RES is upstream of a gene, which codes for a DNA‐modifying enzyme. If the RES leads to expression, the DNA‐modifying enzyme modifies its target. Since the system is encoded on a plasmid, multiple copies of the same plasmid are present within the cell. The plasmids are extracted and sequenced, and the fraction of modified DNA target sequences is determined. This indicates how strongly the DNA‐modifying enzyme was expressed. Illustrations of the different figures were based on: A. i., A. ii. (Rennig et al., 2018; Huber et al., 2019), B. i. (Lou et al., 2012), B. ii. (Qi et al., 2012), C. (Valeri et al., 2020), D. (Höllerer et al., 2020).
Fig. 4Guide to gene expression engineering. The starting point (step 1) is a standard cassette with the elements: promoter, 5′ UTR including insulator and RBS, bicistronic operon, CDS, RNase III cutting site, bidirectional terminator. Many characterized parts for a standard cassette can be found in the iGEM registry. If step 1 does not lead to expression of the gene of interest, step 2 or 3 can be followed. Step 2 introduces rational design of standard or natural genetic elements. Sequences are mutated or merged; new sequences can be made through de novo motif discovery. Step 3 scales up the experiment to a high‐throughput set‐up. Here, parts are recombined and randomized. Step 1 is based on the standard cassette design suggested by Neves et al. (2020).