Literature DB >> 30897329

Systematic Evaluation of Genetic and Environmental Factors Affecting Performance of Translational Riboswitches.

Abstract

Since their discovery, riboswitches have been attractive tools for the user-controlled regulation of gene expression in bacterial systems. Riboswitches facilitate small molecule mediated fine-tuning of protein expression, making these tools of great use to the synthetic biology community. However, the use of riboswitches is often restricted due to context dependent performance and limited dynamic range. Here, we report the drastic improvement of a previously developed orthogonal riboswitch achieved through in vivo functional selection and optimization of flanking coding and noncoding sequences. The behavior of the derived riboswitches was mapped under a wide array of growth and induction conditions, using a structured Design of Experiments approach. This approach successfully improved the maximal protein expression levels 8.2-fold relative to the original riboswitches, and the dynamic range was improved to afford riboswitch dependent control of 80-fold. The optimized orthogonal riboswitch was then integrated downstream of four endogenous stress promoters, responsive to phosphate starvation, hyperosmotic stress, redox stress, and carbon starvation. These responsive stress promoter-riboswitch devices were demonstrated to allow for tuning of protein expression up to ∼650-fold in response to both environmental and cellular stress responses and riboswitch dependent attenuation. We envisage that these riboswitch stress responsive devices will be useful tools for the construction of advanced genetic circuits, bioprocessing, and protein expression.

Entities: CellLine Chemical Disease Gene Mutation Species

Keywords: Design of Experiments; FACS; cellular stress; riboswitch; robustness; synthetic biology

Mesh：

Substances：
Riboswitch

Year: 2019 PMID： 30897329 PMCID： PMC6492952 DOI： 10.1021/acssynbio.9b00017

Source DB: PubMed Journal: ACS Synth Biol ISSN： 2161-5063 Impact factor: 5.110

As the desire to construct ever more complex synthetic gene circuits grows, the robust performance of requisite parts becomes ever more important. Genetic circuits often exploit transcription factors to encode logic function.[1−3] The expression of such regulatory proteins can place a significant metabolic burden upon the host cell due to the cost of translation and ribosome sequestration with up to 50% of cellular energetic resources allocated to translation.[4,5] Minimizing the cost of genetic circuitry is a major challenge to the expansion of synthetic biology.[6] The use of “ribo-regulation” has become a promising alternative to traditional transcription-factor-based systems for heterologous gene expression control. These systems include ribozymes,[7,8] ribo-regulators,[9] synthetic toehold switches,[10] and riboswitches[11,12] which function without any requirements for accessory proteins. The use of post-transcriptional regulation provides an additional point of control, allowing fine-tuning of expression rates. Current methods for tuning genetic circuits often achieve this by modification of the promoter and/or the ribosome binding site (RBS).[13−15] These approaches often require iterative cloning and screening to select for the desired regulatory function. The use of small molecule inducible systems such as inducible promoters can be used to circumvent this need, but these systems often suffer from high basal expression.[16] Riboswitches are naturally occurring RNA structures found in the 5′ UTR of bacterial genes[11] capable of binding a wide array of small molecules,[17−20] including synthetic inducer ligands.[21,22] RNA devices have been engineered and selected to respond to environmental culture conditions such as pH[23,24] and a wide array of noncognate ligands using in vitro and in vivo selection methods.[25−30] Riboswitches have been employed in directed evolution[31] and metabolic engineering.[32,33] Additionally, riboswitches have been shown to function across a range of bacterial species[34,35] and even in eukaryotic organisms.[36,37] Riboswitches consist of two functional domains, a ligand binding aptamer, and a signal actuator, termed the expression platform. In the case of translation activating riboswitches,[20,38,39] ligand binding within the aptamer initiates a structural rearrangement of the expression platform by strand displacement of the RBS sequestering (anti-RBS) sequence, leading to RBS release and translation initiation (Figure A). A number of studies have attempted to capitalize on this bipartite structure by inserting aptamer domains upstream from terminator stems to generate riboswitches with modified transcription termination functionality.[35,40] Riboswitches provide a number of attractive advantages: (i) heterologous protein expression places a significant burden on the cellular metabolism;[41] therefore, cis-regulation RNA-mediated control of expression reduces cellular burden,[41] as protein production is not required for function.[11] (ii) Additionally, post-transcriptional regulation at the RNA level removes dependence on accessory protein, allowing a rapid regulatory response. (iii) Attenuation of expression levels using a single clone, rather than relying on RBS modification, reduces the need for library development and screening.[42] Taken together, the integration of riboswitches into the 5′UTR should allow the user to quickly obtain dynamic regulation of any given gene of interest.

Figure 1

Engineering of the PPDA responsive orthogonal riboswitch for enhanced function. (A) Illustrating the mechanism of the orthogonal riboswitch showing small molecule mediated regulation of protein expression. Protein production is activated in response to pyrimido[4,5-d]pyrimidine-2,4-diamine (PPDA), which causes a change in the secondary structure of the eGFP 5′UTR, allowing translation to proceed. (B) An overview of the screening and optimization workflow employed in this study, highlighting the use of in vivo FACS selection and design of experiments to improve riboswitch function. This functional enhancement enabled development of four stress responsive riboswitch devices. (C) Schematic overview of a synthetic riboswitch device under regulation of the IPTG inducible Ptac promoter showing transcriptional and translational regulation of an N-terminally His tagged eGFP. However, there are a number of additional challenges to the use of riboswitches. Translation initiation from an RBS is influenced by synonymous codon usage of the flanking 5′ coding sequence, and this context dependency has gained a lot attention in recent years,[43,44] with a number of studies using insulator regions and mechanisms to address this problem.[45,46] In addition to this genetic context-dependency, the function of expression systems is also known to be sensitive to contextual changes in environment such as temperature, population density, or metabolic state.[47] To build systems which function as robustly as possible, it is important to understand how both these genetic and environmental changes can affect protein production and regulatory performance. There are also a number of specific challenges to riboswitch engineering, namely sensitivity of function to the surrounding sequence context.[48,49] In addition, the potential for complex interplay between different riboswitch conformations means predictive molecular engineering is currently challenging.[50,51] In some translational riboswitches such as the adenine binding addA riboswitch from Vibrio vulnificus and the related pyrimido-pyrimidine-2,4-diamine (PPDA) responsive orthogonal riboswitch (ORS), this is further complicated by overlap of functional regions within the riboswitch at the junction between the RBS sequestering hairpin and the basal stem of the aptamer.[20,35,52] These overlapping regions are essential for the formation of the mutually exclusive OFF and ON structures (Figure A) because the RBS sequestering sequence (anti-RBS) is directly involved in formation of the OFF structure. Previous studies have selected artificial expression platforms to generate functional riboswitches from an aptamer, using fluorescence-activated cell sorting (FACS) and Förster resonance energy transfer (FRET) methods.[53,54] Genetic selection has been applied to improve riboswitch function through expression platform engineering.[27,55] Guided by this work and the riboswitch design rules generated following statistical analysis of in silico folding experimental performance metrics,[48] we sought to re-engineer the ORS to achieve high levels of gene expression under conditions of riboswitch activation and to minimize riboswitch expression in the absence of induction while considering the context dependencies which affect RBS riboswitch function.[48,49] To achieve this, we set out to improve performance in a structured and systematic manner using a combination of genetic design variables and environmental factors using Design of Experiments (DoE). DoE was originally laid out as an essential principle of experimentation.[56] By designing structured experiments, it becomes possible to assess both the effects of factors (genetic and environmental variables, in the case of biology) as well as the interactions between these factors, while efficiently mapping the entire multidimensional landscape of experimental space. DoE has been applied across a range of fields,[57] including bioprocessing[58] and metabolic engineering.[59,60] The complex structure of many designed experiments has previously prohibited the use of DoE within biomolecular engineering for exploration of sequence-function landscapes. With the advent of cheap and accurate DNA synthesis, recent years have begun to see the use of DoE in synthetic biology but with only a few studies exploring sequence level optimization or characterization.[44,61−64] Here, we describe the use of a high-throughput flow-cytometer-based selection method coupled with DoE for the selection and characterization of the post-transcriptional regulatory RNA devices. This methodology has been used in this study to enhance the function of the ORS. This riboswitch device was selected for development due to its simple structure, relatively modular structure, PPDA solubility, and orthogonality of the ligand from cellular metabolism.[65] We demonstrate that these improved variants afford high levels of translational control across an array of settings and use DoE to explore the sensitivity of riboswitch function to environmental and genetic alterations (Figure B). The most robust device was then selected for further testing in a wide selection of genetic contexts.

Results and Discussion

Previously, we showed that through optimization of N-terminal synonymous codon usage we enhanced riboswitch function.[48] However, although good riboswitch-dependent control was observed, the developed riboswitches exhibited low total expression levels. To further enhance the utility of riboswitches, we set out to achieve vertical extension of the riboswitch-dependent dose response curves by increasing the maximum level of protein production and allowing access to a greater proportion of the total gene expression landscape. To improve riboswitch function, we aimed to further improve the performance by increasing the translation initiation rate (TIR) when the riboswitch is activated by ligand binding (Figure A). In this study, the ORS was placed downstream of the isopropyl β-d-1-thiogalactopyranoside (IPTG) inducible Ptac promoter and upstream of an N-terminally His-tagged enhanced green fluorescent protein (eGFP) (Figure C). Expression performance was assessed by considering gene expression levels under the following conditions: full riboswitch and promoter induction (ON, 100 μM IPTG and 1000 μM PPDA), expression in the absence of riboswitch induction (OFF, IPTG 100 μM), and basal expression under uninduced conditions (UI). Riboswitch-dependent control and total gene expression control are determined from the respective ratios ON/OFF and ON/UI. To increase the TIR from the ORS, we modified the region containing the predicted native RBS (AGAGAA), −10 to −4 nucleotide relative to the start codon (Figure A)[20] contained with the expression platform region of the riboswitch. We first validated the location of the predicted RBS in the absence of the riboswitch (ΔRS) by mutating the native RBS to the same sequence as the 16S rRNA (TCCTCC) (Methods, Supplementary Table 1), termed the nonfunctional RBS (NF_RBS). Assessment of protein expression from the NF_RBS showed that this alteration greatly affected expression, resulting in very low levels of reporter protein signal when induced with IPTG (RFU/OD = 208) compared to the WT_RBS (RFU/OD = 2436) (Figure B and Supplementary Table 2). Following confirmation of the RBS within the ΔRS construct, we exchanged the −10 to −4 nucleotide sequence with the E. coli consensus RBS sequence (AGGAGG) (con_RBS).[66,67] The predicted RBS strengths of the WT_RBS and the con_RBS were 811.1 au and 21546.3, respectively (Salis RBS calc. v 2.0).[42,68] Measurement of the reporter protein levels from the con_RBS construct showed high levels of expression compared to WT_RBS and NF_RBS, with an increase of 5.5- and 64.5-fold, respectively (Figure B). Following induction with IPTG expression from con_RBS reached 13425 RFU/OD with a dynamic rage (± IPTG) of 11.5-fold, demonstrating that we had successfully been able to achieve a gain of function with substitution of native RBS with the consensus RBS.

Figure 2

Exchanging the ribosome binding site of the ORS. (A) Overview of expression platform engineering, showing RBS exchange from the native RBS (WT_RBS = AGAGAA, red semi-circle) to the E. coli six nucleotide consensus RBS (con_RBS = AGGAGG, green semi-circle), which was expected to improve protein production. Also shown is the ORS anti-con_RBS control sequence which contains the modified anti-RBS sequence. (B) End point measurement of eGFP production following modification of the RBS in constructs lacking the riboswitch aptamer region (left). We first tested the native addA WT_RBS under transcriptional regulation from the Ptac promoter in the presence (blue) and absence (gray) of IPTG (100 μM). Substitution of the native RBS with a predicted weak/nonfunctional RBS (NF_RBS = TCCTCC) yields greatly reduced protein expression, confirming correct identification of the RBS. Subsequently, the AGAGAA motif was exchanged with the six base consensus sequence (con_RBS = AGGAGG). The right hand side of this graph shows the performance of the ORS with the native and consensus RBS sequences and the ORS_con_RBS with a modified anti-RBS sequence. These constructs were tested in the presence (blue) and absence (gray) of IPTG (100 μM) plus induction of both transcription and translation with IPTG (100 μM) and PPDA (1000 μM) (orange). In light of the successful increase in TIR, the con_RBS sequence was incorporated into an orthogonal riboswitch-containing construct (Supporting Information and Methods, Supplementary Table 1, and Supplementary Figure 1). Comparison of protein production from the ORS constructs containing the WT RBS (ORS-WT_RBS) and the consensus RBS (ORS-con_RBS) induced with IPTG (100 μM) and PPDA (1000 μM) showed that after 3 h the maximal (ON) expression levels were 7.5-fold higher for ORS-con_RBS. However, riboswitch-dependent (ON/OFF) control was further reduced from 4-fold to 2-fold (Figure B). As a control, we designed an anti-RBS sequence (CCUCCC) with perfect complementary sequence to the consensus RBS (anti-con_RBS) (Supplementary Figure 1C). This showed poor function with very low OFF and ON expression and no PPDA-dependent regulation of expression (Figure B). Riboswitch function is determined by a fine balance between at least two competing RNA structures to afford ON and OFF switching function. Ligand binding causes a conformational change, resulting in a change in stability of the expression platform. The free energies involved in this structural rearrangement have been finely tuned by evolution, making rational design highly challenging.[69] To address these difficulties, we employed a high-throughput FACS-based selection/counter selection methodology to identify riboswitches with optimal OFF and ON states. First, an anti-RBS clonal library with degenerate nucleotides at positions −36 to −31 was generated by isothermal assembly using a synthetic ssDNA input library (46 possible variants) (Methods) (Figure A). The library was screened with two rounds of FACS using selection and counter selection (±PPDA) to isolate clones with enhanced riboswitch-dependent control (Methods) (Figure B, Supplementary Figure 2).

Figure 3

Anti-RBS library and FACS screening. (A) Design and construction of the anti-RBS library. (B) An overview of the FACS screening procedure. This method was used to screen variants of the anti-RBS library for riboswitch function. The library was grown under OFF (IPTG 100 μM, blue) and ON (IPTG 100 μM and PPDA 1000 μM, orange) induction conditions at 30 °C prior to sorting. The lowest 5% of the OFF population and the upper 5% of the ON population was sorted based on eGFP fluorescence intensity (green bow). The sorted populations were recovered and induced under the opposite induction condition, and the alternate sorting procedure was applied, giving two final sorting schemes (low > high, high > low). (C) OFF and ON expression levels (left y-axis) and riboswitch dependent fold change (ON/OFF) (right y-axis) from the 20 functional FACS selected riboswitches following 3 h of induction. (D) Plot showing the trade-off between two important performance metrics ON/OFF and total fold change (ON/UI). LH D4 and LH G7 show the best performance in terms of ON/OFF and ON/UI, respectively. All FACS screened constructs contain His-Linker 30. Error bars represent standard deviation. Data points represent the mean of at least biological duplicate measurements. Following FACS selection, 180 colonies were screened for function after induction with IPTG (100 μM) ± PPDA (200 μM) for 3 h (single replicate only, Supplementary Figure 3). Of these, the 20 best performers were taken forward for sequencing and further expression testing. FACS screening of the anti-RBS library successfully yielded riboswitches with a wide range of function (Figure C). A 223-fold range of ON performance was observed with relative expression values between 104 RFU/OD for HL-H3 to 23271 RFU/OD for LH-F9 (Figure C, left y-axis). In addition, a diverse range of riboswitch dependent fold change (ON/OFF) was observed (3 h induction, 30 °C) with clones displaying no riboswitch-dependent control (HL-H3) up to 16-fold (LH-D4) (Figure C, right y-axis). This riboswitch-dependent control is a significant improvement in both performance relative to the original riboswitch (ORS-WT_RBS), the riboswitch containing the consensus RBS (ORS-con_RBS), and the riboswitch with the consensus RBS perfect complementary anti-RBS sequence (anti-con_RBS) which displayed 4.4-fold, 2.0-fold, and 1.3-fold, respectively. Coupled with the increased expression level, this gain of function greatly improves the function of the ORS. The isolation of these riboswitches demonstrates the utility of the FACS sorting protocol for selecting a diverse range of functional riboswitches. Further comparison of riboswitch dependent fold change (ON/OFF) and total fold change (ON/UI) function for all functional FACS selected variants showed that the two best performers for ON/OFF and ON/UI were LH-G7 and LH-D4 (D4-ORS) (Figure D). Sanger sequencing of the functional variants was performed, and structural prediction analysis of the D4-ORS was performed (Supplementary Figure 1D). The anti-RBS sequences and expression performance of each variant can be found in Supplementary Table 3. To rationalize, predict, and perform forward molecular engineering of riboswitches with enhanced dynamic range, it is important to understand the respective ON and OFF conformations. For example, if a given anti-RBS variant binds too tightly to an RBS, then the expression platform may remain in an OFF conformation even when the aptamer is ligand-bound. Similarly, a weakly bound RBS–anti-RBS hairpin may lead to an expression platform that remains in an ON even when the aptamer is not ligand-bound. However, in silico prediction of ligand-bound riboswitch conformations remains a significant challenge. To rationalize the performance of new anti-RBS–consensus RBS variants, they were subjected to in silico folding analysis[70] to predict the structural stability of the OFF and ON states of the ORS expression platform. The minimum free energy of the OFF state (ΔG OFF) was predicted using the full-length expression platform and the 5′ of the coding sequence (−45 nt to +60 nt). This sequence was then truncated in silico to remove the aptamer region of the riboswitch (−36 to +60 nt) as a proxy for the ON state sequence, in a manner similar to that used in Kent et al.[48] (Methods). By using this sequence, the structural predictions allow interrogation of the expression platform when the riboswitch is in the activated RBS-released state. The minimum free energy was then predicted using RNAfold[70] (ΔG ON) and the difference between the two minimum free energies was then calculated (ΔΔG (kcal/mol) (Supplementary Figure 4A). These features make this an essential region in predicting riboswitch function; by using this in silico prediction, we hope to further understand the dynamic interplay between the OFF and ON state of the expression platform. The ΔΔG of each functional variant shows a correlation with riboswitch-dependent control (r = 0.56, P = 7.7 × 10–3) (Supplementary Figure 4B). To further assess the performance of the ORS, D4-ORS, we explored the effect of (i) exchanging the flanking coding sequence, (ii) transcription levels, and (iii) environmental factors. Previously, we demonstrated the context sensitivity of the ORS (ORS-WT_RBS) upon downstream coding sequence through the modification of synonymous codon usage.[48] This previous study revealed that riboswitch-dependent performance was dependent upon and could be rationalized on the basis of calculated structural/stability features of the riboswitch and that a fine balance was required in the relative stabilities of the respective ON and OFF conformations to achieve optimal riboswitch performance. Often when producing proteins it is desirable to modify environmental conditions or factors as well as genetic ones to achieve optimal expression and control, for example, lower incubation temperatures to improve folding of recombinant proteins. Typically, the performance of genetic elements or factors is evaluated under fixed environmental conditions. While this approach may yield constructs which function under these conditions, engineered strains and devices may fail to perform robustly under alternative environmental conditions.[47] Testing all possible combinations of these factors would be challenging, but by applying a DoE (Methods), we aimed to robustly test and optimize the behavior of the D4-ORS-con_RBS while simultaneously exploring the effect of multiple genetic and environmental factors. The selected factors were: transcription level (IPTG concentration) (n = 3), incubation temperature (°C) (n = 3), predicted TIR (n = 2), predicted anti-RBS:RBS hairpin stability (ΔΔG kcal/mol) (n = 3) based on minimum free energies calculated from the anti-RBS–consensus RBS variants (isolated using FACS), and N-terminal synonymous codon usage (n = 4)[48] (Figure A). Four of the previously identified selected N-terminal codon usage variants were tested with the D4-ORS. The selected codon usage variants were variant linker 30 (L30), linker (L35), linker 36 (L36), and linker 32 (L32).[48] For details on the DoE factor levels and data types, see Supplementary Table 4. The use of the fractional factorial D-optimal DoE facilitated a 5.4-fold reduction (216 experiments to 40) in the required number of experiments. For details on the generated experimental design, see Supplementary Table 5. Ribosome binding site engineering (Figure ) and in silico analysis (Supplementary Figure 4B) of the best performing FACS variants highlights the importance of balancing TIR modification with the complementarity of the anti-RBS. To allow integration of this design consideration into the DoE, the final genetic factor explored was the ΔΔG of the RBS sequestering structure. By using this in silico metric, we were able to treat this design factor as a continuous data type, allowing reduction of the required number of experimental runs.

Figure 4

DoE data. (A) An overview of genetic and environmental factors (hairpin ΔΔG, RBS strength, N-terminal synonymous codon variant, temperature and IPTG concentration) investigated using Design of Experiments. (B) Measured UI, OFF and ON performance of each riboswitch construct when tested under the factor settings dictated by the D-optimal design (Supplementary Table 5). (C) ON/OFF (x-axis) and ON/UI (y-axis) performance of each tested DOE run. Run number 4 shows the highest riboswitch dependent control. (D) End point eGFP expression testing of D4-ORS-L35 when induced with different concentration of IPTG (1–8, 0, 0.32, 1.6, 8, 40, 200, 400, and 1000 μM, respectively) and PPDA (A, 0, 0.32, 1.6, 8, 40, 200, 400, and 1000 μM, respectively) after 24 h of induction at 37 °C. (E) Calculated total fold change (ON/UI) of the two inducer titration analysis. All error bars represent standard deviation, and data points represent the mean of three technical replicates (B) or biological replicates (D and E). Data for the structured experimental design of reporter protein production was collect after 24 h (Methods). This data shows a large distribution in uninduced (UI), OFF and ON expression levels (Figure B). Comparison of the expression data shows that the factors tested greatly affect the riboswitch-dependent fold change (ON/OFF) and total fold change (ON/UI) in expression, changes in ON/OFF ranging from 2.5- up to an impressive 78.1-fold, and ON/UI control between 7.9- and 698-fold. Initial investigation of the optimal expression conditions show that plotting ON/OFF against ON/UI (Figure C) shows that run #20 yields the best trade-off between these two responses, while #4 and #20 show similar ON/OFF performance. However, #4 shows a reduced total fold change (ON/UI = 512). To allow improved investigation and understanding of the multidimensional data set generated from the DoE, a standard least squares (SLS) regression model was fit to the multifactorial expression data set (Methods). ANOVA was used to assess the importance of each factor in the model; significant effects and interactions were selected based on the Log worth statistic, and unimportant factors were removed from the model (Supplementary Table 6). This refined model facilitated deeper investigation into the effects of multiple factors on the four response metrics (OFF, ON, ON/OFF, and ON/UI). The SLS regression highlights the first order effects and second order interactions, which are significant in explaining each of the four responses (Supplementary Table 7). This modeling allows us to formalize several interesting observations about riboswitch expression dynamics. The generated explanatory model showed that across all four responses, the highly significant (P < 5 × 10–4) factors and interactions were the ON/OFF stability (ΔΔG) of the anti-RBS-RBS hairpin, the interaction between TIR and hairpin ΔΔG, TIR, and IPTG concentration (Supplementary Table 6). Several other interactions were also significant (P < 0.01) (Supplementary Table 6). To test the explanatory power of this statistical model, the prediction for each response was plotted against the actual performance (Supplementary Figure 5), showing that the explanatory model fits the data set well (OFF P = < 1 × 10–4, r2 = 0.87, root mean squared error (RSME) = 883.7; ON P = < 1 × 10 –4, r2 = 0.86, RSME = 6140.3; ON/OFF P = < 1 × 10 –4, r2 = 0.93, RSME = 8.0; ON/UI P = < 1 × 10 –4, r2 = 0.85, RSME = 77.7). Once the explanatory power of the model had been shown, we set out to investigate the multidimensional, multiresponse data set using the model projections of each respective response. By setting the desired response optimization criteria (minimize OFF expression and maximize ON expression, ON/OFF, and ON/UI, Supplementary Figure 6) interrogation of the standard least squares regression reveals factors with optimal OFF, ON, ON/OFF, and ON/UI performance. Details of the model coefficients for each factor used to build the refined model and the effect of these factors on the individual response variables can be found in Supplementary Table 7. As expected, RBS strength has a significant impact upon both the OFF (P = 2.8 × 10–7) and ON (P = 1.5 × 10–10) expression levels with the strongest RBS (con_RBS) affording a large increase in ON (Supplementary Table 7). Increasing IPTG concentration also has a significant positive effect on both OFF (P = 4.6 × 10–4) and ON (P = 4.6 × 10–5), leading to an increase in both responses with a larger effect upon ON expression than OFF (Supplementary Figure 6). Interestingly the model predicts that both RBS strength and hairpin ΔΔG have a significant positive effect upon riboswitch dependent control (RBS strength P = 7.6 × 10–10, hairpin ΔΔGP = 2.0 × 10–10) (Supplementary Figure 6, blue boxes). The interaction between these two factors also has a significant effect on ON/OFF function (P = 5.2 × 10–10) and significantly increase total fold change (RBS strength P = 1.0 × 10–6, hairpin ΔΔGP = 7.2 × 10–4). Together, these factor effects suggest that the improved riboswitch function is a byproduct of increasing ON expression while also reducing OFF and UI expression through proper formation of the anti-RBS hairpin OFF state. A number of other interacting factors impact OFF and ON expression levels. The concentration of IPTG used shows an additive effect on both OFF and ON expression when used with the stronger RBS (OFF P = 5.9 × 10–3, ON P = 4.8 × 10–3), meaning that when a stronger RBS is used, the effect of IPTG induction is enhanced, leading to higher expression both in the absence and presence of riboswitch activation. Increasing hairpin stability (ΔΔG) leads to a reduction in the impact of the IPTG effect on OFF expression (P = 4.1 × 10–3), which means a more stable hairpin reduces the leaky expression from the unactivated riboswitch (Supplementary Table 7). Intuitively, this observation seems to indicate that leaky expression from the riboswitches is caused by nonoptimal anti-RBS hairpin formation in the absence of the ligand (i.e., too weak/low ΔΔG), and that by increasing the ΔΔG of the hairpin, this leaky expression can therefore be reduced even at higher levels of transcription. Additionally, RBS strength and hairpin ΔΔG significantly interact, effecting both OFF (P = 5.6 × 10–7) and ON (P = 1.0 × 10–2) expression with the larger effect on OFF expression, suggesting that the importance of proper anti-RBS hairpin formation is greater when a strong RBS is used. In more practical terms, this means that any transcript which forms a ligand-bound ORS riboswitch structure produces greater levels of eGFP when translation is regulated by the strong consensus RBS. The set of experimental conditions that best satisfy all four response criteria is DoE run #20 (Figure C, Supplementary Figure 6). The plasmid construct used in this run consists of the D4-ORS riboswitch, N-terminal linker L36, induced with 200 μM IPTG at 30 °C. With these parameter settings, the model achieves an optimal balance between OFF, ON, ON/OFF, and ON/UI. However, the model suggests an interesting, albeit small, interaction between two factors. We see that the N-terminal linker L36 has a significant effect on this performance metric, but only when we consider it as part of an interaction with the incubation temperature (P = 3.0 × 10–2). The effect of this predicted interaction was initially investigated in silico by modification of the experimental parameter settings to study the effect of this predicted interaction upon riboswitch function (Figure ). We see that when linker L36 (Figure A) is used in the model, projections show an effect of temperature, with a higher incubation temperature having a negative effect on ON/OFF and ON/UI performance. While the predicted ON/OFF and ON/UI of the D4-ORS-L36 is largely unchanged when incubated at 30 °C, the real effect of this interaction occurs when temperature is increased to 37 °C. If we instead use the L35 linker with the D4-ORS, then the ON/OFF and ON/UI performance shows a reduced temperature dependency (Figure B) with the ON/UI performance being better maintained, irrespective of the induction temperature.

Figure 5

DoE explanatory model showing the effect of the temperature–N-terminal interaction. Standard least squares modeling of the DoE dataset. Model projections showing the effect of hairpin ΔΔG, RBS strength, N-terminal His tag, incubation temperature, and IPTG concentration when using the D4-ORS-L36 (A) and D4-ORS-L35 (B). (A) D4-ORS-L36 ON/OFF and ON/UI show an interaction with incubation temperature, as indicated by the slope of the projections in the orange boxes. (B) D4-ORS-L35 ON/OFF and ON/UI shows very little interaction with incubation temperature, as indicated by the reduced slope of the projections in the green boxes. (C) Direct comparison of UI (grey), OFF (blue), and ON (orange) performance of the original ORS construct prior to RBS-anti-RBS modification and DOE optimization and the optimized D4-ORS-His-L30. (D) Comparison of the ON/OFF (green) and ON/UI (pink) of the original ORS riboswitch, D4-ORS-L35, and D4-ORS-L36. Expression data (C–D) was collected from biological triplicate induction assays carried out at 37 °C after 24 h following induction with 100 μM IPTG in the presence and absence of 1000 μM PPDA. (E) Histogram of eGFP fluorescence intensity from ΔRS_conRBS-L35 analyzed by flow cytometry, showing the population distribution of single-cells when induced with different level of IPTG (red = uninduced, orange = 10 μM IPTG, green = 100 μM IPTG). (F) Flow cytometry analysis of the D4-ORS-L35 device following various induction conditions (Red = uninduced, orange = 100 μM IPTG, magenta = 100 μM IPTG + 8 μM PPDA, blue = 100 μM IPTG + 40 μM PPDA, yellow = 100 μM IPTG + 200 μM PPDA, magenta = 100 μM IPTG + 1000 μM PPDA). The expression landscape of D4-ORS-L35 in response to both IPTG and PPDA was mapped (Figure D), and the total fold change is shown in Figure E. This confirmed that D4-ORS-L35 displayed high levels of IPTG and PPDA dependent regulation and displays AND logic behavior with expression requiring the addition of both IPTG and PPDA. An optimal ON/UI of 500-fold was observed with 1000 μM PPDA and 200 μM IPTG. Further investigation of the expression response surface suggested that at 200 μM, IPTG maximal ON expression is achieved, but also that this IPTG concentration leads to an increase in the OFF expression, compared to those cells induced with 40 μM IPTG (Figure D). Indeed, at an intermediate IPTG concentration (100 μM) (Figure C), D4-ORS-L35 gave consistent ON expression but reduced OFF expression, affording improved ON/OFF performance of 72-fold and total expression control of 544-fold (Figure D). This highlights the need to balance transcriptional and post-transcriptional induction to properly achieve optimal riboswitch performance. To demonstrate the regulatory improvements achieved, the expression of the optimized constructs (D4-ORS-L35 and D4-ORS-L36) and the original ORS construct (ORS-WT_RBS-L30) (Figure C and D) were compared. Fluorescence quantification after 24 h of induction confirmed that the optimized constructs showed drastically improved performance compared to the original ORS construct. The impact of changing the N-terminal linker was then validated by comparing riboswitch performance of the D4-ORS-L36 and D4-ORS-L35 at 37 °C induced with 100 μM IPTG ± 1000 μM PPDA. These results confirmed that D4-ORS-L36 shows reduced performance at 37 °C when compared to D4-ORS-L35. To assess the expression profile of the D4-ORS construct at a single-cell level, we used flow cytometry to measure cellular eGFP fluorescence. This allowed us to assess the heterogeneity of eGFP expression 24 h after induction with IPTG ± PPDA. Here, we compared the performance of the D4-ORS-L35 with ΔRS-conRBS-L35 (Figure E and F). In the absence of any induction, we see two subpopulations in the ΔRS-conRBS-L35, one with low fluorescence (median = 332 RFU, 7.7% of events) and a second, broad population with higher fluorescence (median = 5661 RFU, 92.3% of events). If we compare this uninduced population with that of the D4-ORS-L35, this shows a single population of cells with low fluorescence (median = 265 RFU). Once induced with IPTG the ΔRS-conRBS-L35 shows strong induction (10 μM IPTG, median = 63 996, IPTG 100 μM 143 785). Upon induction of D4-ORS-L35 with 8, 40, 200, and 1000 μM PPDA and a fixed IPTG concentration of 100 μM, we see titratable regulation of protein production with the median fluorescence measurements from expressing cells reaching 1687 RFU, 4352 RFU, 16 326, and 55 543 RFU, respectively. We also observed a small, low fluorescent subpopulation following IPTG and PPDA addition, which represents between 7–10% of the total cellular population. Due to the complexity of interactions between genetic and environmental factors, selection of the best riboswitch design is not as simple as combining the best expression platform with the best N-terminal linker.[48] By identifying the temperature–linker interaction, we selected the final device based on both performance and robustness. This effect would have been difficult to identify using a more traditional “one factor at a time” method, thus supporting our use of a Design of Experiments approach in the multifactorial, multiobjective selection and optimization of this RNA device. For reasons of apparent robustness, the D4-ORS-L35 construct was selected as the best candidate for further evaluation and characterization. To test the robustness of the D4-ORS-L35 with an alternative gene of interest, we exchanged the eGFP for the alternative reporter protein mKate2 and assayed riboswitch performance (Methods). The D4-ORS riboswitch showed consistently improved performance compared to the original riboswitch with an increase in both ON and ON/OFF (Figure A). However, in spite of very similar OFF and ON expression, the D4-ORS riboswitch with an alternative linker L32 performs marginally better when it comes to fold change (ON/OFF = 59) than D4-ORS-L35 (ON/OFF = 53) (Figure A). Despite the inconsistent rank performance of the optimal N-terminal linkers when expressing the two different reporter genes, both D4-ORS-L35 and D4-ORS-L32 show high levels of riboswitch-dependent control over the expression of mKate2, making both viable tools for post-transcriptional regulation of protein expression. This performance is vastly improved relative to the original ORS riboswitch that has an ON/OFF performance of just 12-fold and demonstrates that the D4-ORS robustly provides enhanced regulatory performance regardless of the gene being regulated.

Figure 6

Regulation of plasmid-based mKate2 expression and chromosomally integrated eGFP. (A) Measurement of end point expression (UI = gray, OFF = yellow, ON = red) shows robust riboswitch performance of D4-ORS-L35 and comparison of each N-terminal variant when regulating mKate2 expression. (B) Following genomic integration of the D4-ORS-L35 device into the genome of E. coli DH10 beta Top10 F′, the device showed strong ON expression and high fold-change performance values compared to the original ORS construct (100 μM IPTG and 1000 μM PPDA) (ON/OFF = blue brackets, ON/UI = black brackets). (C) Genome integration of the D4-addA riboswitch show greatly improved ON, ON/OFF (blue brackets), and ON/UI (black brackets) performance compared to the original addA riboswitch (100 μM IPTG and 1000 μM 2-aminopurine). Bars represent mean of three biological triplicates; error bars show standard deviation. It may often be beneficial to integrate regulatory devices into the genome of the chassis to reduce genetic instability caused by plasmid maintenance, and to remove the need to maintain selective pressure through the use of antibiotics. We next wanted to investigate the function of the D4-ORS-L35 riboswitch when inserted into the genome (Methods). The native WT-ORS performs poorly when integrated into the genome, with very low expression levels (UI = 10 RFU/OD, OFF = 19, ON = 49) and poor riboswitch-dependent control (ON/OFF = 3.4-fold) (Figure B). However, by using the optimized D4-ORS-L35 we are able to regulate expression of the gene of interest from a single copy present on the genome (UI = 9, OFF = 26, ON = 598), affording riboswitch-dependent and total control overexpression of 23-fold and 70-fold respectively (Figure B). By optimizing the function of these riboswitches, it is now possible to use riboswitch from a genomic locus, and obtain high levels of control and tight basal expression. This development greatly expands the utility of the PPDA responsive riboswitch. To establish whether the optimized riboswitch would provide enhanced function when the aptamer was replaced, we exchanged the ORS aptamer domain for the related 2-aminopurine (2-AP) responsive addA aptamer domain from Vibrio vulnificus (Figure A). When the con_RBS is present, we observe ON expression far higher than previously reported,[48] affording a 29-fold total fold change. However, as expected without the cognate anti-RBS, the OFF expression level is also very high, and poor riboswitch-dependent control is observed. The addA aptamer was then assessed with the D4 expression platform and linker (L35, L36, and L32) combinations. These re-engineered addA riboswitches show tight basal expression and high ON expression with ON/UI of 162-fold, 167-fold, and 193-fold, respectively. However, while the ON/UI shows good function, the ON/OFF performance of the D4-addA was reduced in comparison to the D4-ORS variants (7.6-, 9.4-, and 10.2-fold for L35, L36, and L32 respectively, Figure B). However, this is likely due to elevated OFF expression caused by cognate inducer adenine present in the cell at 97 μmol/mg DCW.[71] The highest ON/OFF of 10.3-fold was observed in the D4-addA-L32. The D4-addA devices were also used to regulate the expression of mKate2 (Figure C). The performance of these constructs was similar to that observed when regulating eGFP production, but in this case, the D4-addA-L35 showed the great fold change performance (ON/UI = 476, ON/OFF = 8.4). The engineered performance of the D4 expression platform may serve as a useful platform for the future development of post-transcriptionally regulated riboswitches through integration of other ligand binding domains.

Figure 7

ORS–addA mutation. (A) Predicted structure (RNAfold) of the D4-ORS riboswitch, showing the point mutations responsible for changing ligand specificity (green circles, labeled with mutation) from PPDA to 2-AP, thus restoring the aptamers region of the addA riboswitch. The D4 anti-RBS (orange) and consensus RBS are indicated (green); numbering shown is relative to the start codon. (B) 2-AP mediated activation of the addA leads to an increase in protein expression. Blue bars show the level of protein synthesis when induced with IPTG only, whilst green bars show induction with IPTG and 2-AP. The optimized D4 expression platform shows robust performance when the aptamer region is exchanged with the native addA aptamer. (C) Performance of the D4-addA devices when regulating the gene expression of for mKate2 (UI = gray, OFF = yellow, ON = red). Error bars represent the standard deviation of three biological replicates. Blue brackets indicate ON/OFF values, whilst black brackets represent ON/UI. In addition to testing the D4-addA riboswitch on a plasmid, the riboswitch construct was also integrated into the genome of E. coli (Figure C). Similar trends to the D4-ORS were observed with the consensus RBS greatly increasing ON expression and the D4 anti-RBS leading to restoration of tight UI and OFF expression. The D4-addA riboswitch showed riboswitch-dependent control (ON/OFF, 22-fold) very similar to that of the D4-ORS riboswitch. This is a drastic improvement compared to the native addA riboswitch (ON/OFF = 1.4). Total fold change (ON/UI) was also greatly increased from 1.2-fold up to 138-fold. To further explore the context dependency of riboswitch performance and expand their utility as modular control elements for use in synthetic biology, we next looked to examine the dependency upon upstream sequence and promoter. To do so, we initially deleted the LacO sequence of the promoter (Ptac) immediately downstream of the transcription start site (TSS) (Methods), reducing the length of the transcript upstream of the basal aptamer stem by 21 nt (Figure A). The effect of this reduction in transcript length upstream of the riboswitch was investigated by designing an alternative upstream linker, restoring the original length of this upstream region to that of the Ptac system (49 nt). This nucleotide sequence was manually designed to reduce any predicted interaction with the OFF state riboswitch using RNAfold.[70] D4-ORS riboswitch constructs with both the short and long upstream linkers were tested under the control of 3 constitutive promoters across a range of promoter strengths (BBa_J23114 = 0.1 a.u, BBa_J23118 = 0.56 a.u, BBa_J23100 = 1 a.u) using promoters from the Anderson collection[72] (Figure A).

Figure 8

Integrating constitutive promoters into riboswitches allows inducible control of gene expression. (A) Modification of the upstream linker sequence between the transcription start site and the basal stem of the ORS. Deletion of the LacO site from Ptac yielded the constitutive Ptrp promoter with a reduced length transcript upstream of the riboswitch (short). An alternative (long) upstream linker was designed in silico to restore the transcript length to that of the original Ptrp construct. The Ptrp was then replaced with three different strength promoters from the Anderson library for further testing. The three Anderson promoter parts (BBa_J23100 (green), BBa_J23114 (red), and BBa_J23118 (orange)) also contained an upstream insulator sequence.[87] (B) End-point measurement of eGFP production from the Anderson promoter regulated devices showing OFF (blue) and ON (orange) performance. The black brackets indicate the level riboswitch dependent control. (C) PPDA titration of gene expression from Ptrp riboswitch constructs. The performance of the original ORS is shown in gray; the con_RBS ORS is shown in red, and the D4-ORS is shown in blue. Filled circles indicated the long linker, whilst empty circles indicate the short linker. (D) Riboswitch dependent control of each of the Ptrp constructs. Expression analysis with these constructs confirmed that the D4-ORS riboswitch functions robustly under constitutive promoter regulation with riboswitch-dependent control of up to 25-fold observed. We also observed that the rank order of the Anderson promoters was not impacted by the insertion of the riboswitches (Figure B). Comparison of the short linker vs the longer linker revealed that the longer upstream linker consistently performed better than those with the short linker both in terms of ON and ON/OFF performance (Figure B). The ON performance of the long linker was 20, 46, and 66% higher than the short linker constructs for BBa_J23114, BBa_J23118, and BBa_J23100 promoters, respectively. The effect of modifying the length of the upstream sequence was further expanded by testing both the short and long linker variants when place under the control of the strong Ptrp promoter (Figure C). Interestingly, the riboswitch-dependent control of both short and longer linker D4-ORS constructs showed no significant difference, and the discrepancy between linker length was reduced in comparison to that observed when transcription initiation was regulated by the weaker Anderson promoters. The importance of the upstream sequence explored here highlights this as another important factor to consider when incorporating these cis post-transcriptional regulation devices into any genetic control element. Importantly, incorporation of the riboswitches showed that simply by incorporating 154 nt into the 5′ UTR of a selection of constitutive promoters, we obtained inducible expression control units with riboswitch-dependent control of up to 25-fold, demonstrating the use of the D4-ORS riboswitch to convert constitutive promoters into small molecule induced expression units across a range of different promoter strengths. Given that the longer linker gave better or equivalent ON/OFF performance for the constitutive promoters tested, this linker was selected for future testing of additional transcriptional and post-transcriptionally regulated devices. Exploration of the upstream linker region for these constitutive promoters allowed us to standardize this transcript region for future engineering. Following the successful integration of the D4-ORS-L35 riboswitch downstream from a series of constitutive promoters, we looked to integrate the D4-ORS-L35 into a variety of endogenous environmentally regulated, stress response promoters. A number of recent studies have developed stress responsive, transcriptionally regulated devices.[73,74] While systems which utilize cellular stress to induce transcription provide a real time response to metabolic stress, the strength of this response cannot be dynamically regulated. By integrating post-transcriptional regulation downstream of such promoters, it should be possible to couple the stress response elements with small molecule induction and temporal control. Another common mechanism for regulating gene expression is through the use of bacterial starvation responses. Post-transcriptional regulation provides an interesting opportunity to advance the use of systems such as the phosphate starvation promoter, which has been used in the biological manufacture of therapeutic proteins.[75] In this case, the only way to modify the strength of the starvation response upon gene expression is to engineer elements such as the RBS. By integrating such promoters with riboswitch device, we can modulate the magnitude of gene expression to optimize the production of the desired protein of interest. By replacing the original promoter (Ptac) with a number of environmental and metabolic stress response promoters, namely hyperosmotic stress (PosmY), phosphate starvation (PphoA), redox stress (PsoxR), and carbon starvation (PcstA) responsive promoters (Methods, Supplementary Table 1), we constructed tools which are regulated by both the respective stress response and the riboswitch ligand concentration. In Figure A, we see that the no-riboswitch control (ΔRS) PphoA promoter is responsive to K2HPO4 over the concentrations assessed (10 to 0.0001 mM), affording expression output over a 38.6-fold range. For the PphoA-D4-ORS, we see that the expression is controlled in response to both varying K2HPO4 and PPDA concentrations (Figure A). The greatest ON is observed at 0.001 mM K2HPO4 and 1000 μM PPDA, while in the absence of PPDA, we observe tight control of basal expression regardless of the initial phosphate concentration (gray, Supplementary Figure 7A). Riboswitch-dependent control of expression of up to 48-fold was observed under 1000 μM PPDA and 10 mM K2HPO4 conditions. A maximal PPDA and K2HPO4 dependent dynamic range of the PphoA::D4-ORS-eGFP device was 646-fold, observed for 1000 μM PPDA and 0.001 mM K2HPO4 with both PPDA and phosphate dependent regulation of expression (Figure A). This is a significant increase in the dynamic range of this promoter compared to when no riboswitch is present. Despite the reduction in maximal ON expression, this enhanced dynamic range is achieved by drastically reducing the basal “leakiness” of the promoter under high phosphate conditions.

Figure 9

Integration of orthogonal, post-transcriptional regulation into endogenous stress response pathways. Induction matrix heat maps showing the observed ON/UI for the PphoA (A), PosmY (B), PsoxS (C), and PcstA (D) D4-ORS-L35 devices following different levels of transcriptional and post-transcriptional induction with PPDA and K2HPO4, NaCl, H2O2, or glucose starvation. ΔRS controls included for comparison. E–H show kinetic microfermentation characterization of eGFP production (RFU/OD) from the PphoA::D4-ORS-L35 devices when grown in EZ-rich media supplemented with varying levels of K2HPO4 (E = 10 mM, F = 1 mM, G = 0.1 mM, H = 0.01 mM) over 24 h. Data represent the mean of three replicates. Raw data and standard deviations can be seen in Supplementary Table 8. Similar trends were observed for the PosmY::D4-ORS-eGFP. This system was induced by both hyperosmotic stress and PPDA (Figure B, Supplementary Figure 7B), and GFP fluorescence was measured after 3 h of hyperosmotic stress and PPDA induction. Tight control of basal expression, regardless of the NaCl concentration, is afforded by riboswitch integration (Figure B). The maximal ON expression level was reached at 1 M NaCl and 1000 μM PPDA with riboswitch performance of 7.3-fold and ON/UI of 36-fold (Figure B, gray Supplementary Figure 7B). In comparison, the PosmY ΔRS construct showed NaCl dependent induction of gene expression with only 4.9-fold control. The PosmY is also active under stationary phase as well as being induced by osmotic stress.[76−78] To investigate whether the expression of eGFP during stationary phase could be regulated by the riboswitch, we also measured eGFP expression after 24 h (Supplementary Figure 8). At this time point, we see a loss of osmotic stress dependent expression in the PosmY::D4-ORS strain; in fact, expression is reduced at very high NaCl concentrations (0.8 and 1 M) in the PosmY ΔRS construct, likely due to inhibition of cell growth. Despite loss of NaCl dependent modulation of protein production, riboswitch-dependent control of expression was maintained with PPDA dependent regulation up to 15-fold (ON/OFF) after 24 h, showing that the D4-ORS is able to regulate expression independently of both osmotic stress and stationary phase dependent effects. Another stress response transcription factor dependent promoter was coupled to the D4-ORS-L35 and assessed for its ability to respond to peroxide stress. Comparatively, the PsoxS::D4-ORS-eGFP device displayed H2O2 dependent expression after 3 h of expression when induced with a range of H2O2 and PPDA concentrations. At 3 h, expression was altered in response to increasing both H2O2 (% v/v) and PPDA concentration (Supplementary Figure 7C). The ON/UI for the PsoxS reached 26.8-fold (0.03% H2O2, 1000 μM PPDA) (Figure C). PsoxS and the soxRS have recently been used for sensing imbalance of NADP+/H redox state,[79] giving the PsoxS::D4-ORS-L35 device great potential in optimizing redox dependent pathways in biotechnology. The final stress responsive riboswitch device constructed utilized the carbon starvation promoter PcstA. This strong stationary phase promoter is σ70 regulated and glucose repressible. Following cAMP accumulation, CRP positively regulates PcstA activity, leading to transcription initiation. Following expression analysis in LB ± 5% glucose, we observed that the PcstA::ΔRS is positively regulated in the absence of glucose, which corresponds to a 4.4-fold increase in eGFP production when cells are cultured in the absence of glucose. The same trend in observed upon integration of the D4-ORS; as expected, this device shows riboswitch-dependent control (Figure D, Supplementary Figure 7D). Glucose dependent expression control in the D4-ORS device reached 7.2-fold (at 200 μM PPDA), and PPDA addition led to a 9.5-fold increase in protein synthesis. Combined, the effect of glucose starvation and PPDA induction yielded an ON/UI of 34.8-fold change in eGFP production (Figure D) on induction with 1000 μM when induced by carbon starvation. In summary, riboswitch-dependent control of the output signal from bacterial stress response promoters proved successful, and Figure shows that each promoter was activated by the respective stimuli. By integrating D4-ORS into these endogenous promoters we have been able to decouple the signal processing of these cellular stress responses through post-transcriptional regulation. The introduction of a riboswitch into these promoter elements converts the simple input-output system into a two-input system, as both stress response promoter and riboswitch activation (stress + PPDA) is required to give an output (eGFP expression). In addition to production applications, these tools could be of use as sensors in developing understanding of cellular stress responses and metabolic and environmental stress. Coupling of stress response promoter with post-transcription, RNA regulated, small molecule inducible control of expression provides an interesting opportunity. This system allows cellular detection of a given stress stimuli, meaning that when this stress exceeds the limits deemed acceptable by the cell (selective pressure), transcription occurs. However, the riboswitch will adopt the OFF conformation until user addition of PPDA, allowing tunable, temporal control over the strength of the cellular stress response. Following the end-point characterization of the PphoA::D4-ORS device, we characterized the expression kinetics of the device in a mini-fermentation system (BioLector, M2P-Laboratories). Expression of eGFP for the PphoA riboswitch device was measured over a 24-h period (Figure E–H; Supplementary Table 8). Cells were supplemented with different amounts of K2HPO4 to allow comparison of eGFP expression in phosphate replete and depleted conditions. When supplemented with 10 mM K2HPO4, PphoA is not activated (Figure E); however, at 1 mM K2HPO4 and below, eGFP production is observed, indicating activation of PphoA (Figure F). Expression begins at ∼2 h, indicating the onset of the starvation response and plateaus after 4 h, which coincides with entry into the stationary growth phase (Supplementary Figure 9). When grown in 0.1 mM K2HPO4, this expression onset delay is reduced to ∼30 min (Figure G), and at 0.01 mM K2HPO4, expression is observed upon the initial measurement, suggesting that the cells have entered a starved state directly following resuspension in media at this concentration of K2HPO4. At both of these lower K2HPO4 levels, we see that eGFP production does not plateau throughout the 24-h measurement period. This is likely due to the severe phosphate limitation, as indicated by the poor growth of these cultures. At all phosphate concentrations, the basal level of expression is tightly repressed (Figure E–H) in the absence of PPDA, and expression is increased in a PPDA-dependent manner, highlighting the utility of the D4-ORS to tightly control expression even when transcription has been induced. To assess the modification of eGFP production rate upon addition of PPDA, linear regression was carried out during the linear region of the RFU/OD data sets (2.5–3.5 h). The slope coefficients detailing the effect of PPDA on the eGFP production rate can be seen in Supplementary Table 9. The fastest production rate is observed at 1 mM K2HPO4 and 1000 μM PPDA. After 24 h, the maximal eGFP signal is observed in the 0.1 mM K2HPO4 culture with 1000 μM PPDA (Figure G). However, when cells are starved at a phosphate concentration below this (0.01 mM K2HPO4), we see a reduction in the maximal fluorescent signal and rate of expression. This ability to attenuate both the expression levels and rate of expression further expands the utility of both stress induced promoter and riboswitches for protein production applications.

Conclusion

The findings presented here demonstrate the power of combining functional selection and multidimensional, multiobjective optimization for constructing RNA regulators with improved functionality. We show that through incorporation of both the consensus E. coli RBS and an anti-RBS library that subsequent FACS enrichment and selection of riboswitches with optimal anti-RBS hairpins can be achieved. Following further functional optimization through DoE riboswitch, function of the ORS can be greatly improved (820% improvement in ON state expression) with PPDA induced riboswitch-dependent control (ON/OFF) up to 72-fold and overall transcriptional (IPTG) and post-transcriptional (PPDA) regulation across a 544-fold dynamic range (ON/UI) (Figure D). Design of Experiments also allowed prediction of factors likely to affect riboswitch robustness using sparse sampling of a large experimental space to enhance understanding, probe complex interactions and environmental factors, and guide optimal construct selection. By improving the function of the PPDA responsive riboswitch, we built a device comparable with the widely used theophylline riboswitch,[53] thus expanding the RNA device toolbox, which will be of great use to the synthetic biology community and reduce dependence upon protein- and transcription-factor-based regulation. Unsurprisingly, the SLS regression model predicts that the stability of the expression platform hairpin dictates riboswitch function. In future work, we hope that this framework will allow further prediction of switches with further enhanced function. Currently, this has proved challenging due to the complexities of in silico structural prediction. A recent review of RNA device design highlights a number of the challenges in this area.[80] It is possible that incorporation of more advanced kinetic or three-dimensional structural predictions into the DoE framework could allow us to build a predictive model. Additionally, the SLS model built here assumes, for simplicity, only linear relationships between factors. While this has proven sufficient for construct selection and robustness testing, nonlinear modeling may facilitate future further improvement of riboswitch function. We demonstrate that the riboswitch developed here functions robustly in a wide array of contexts, including the regulation of an alternative gene of interest, when inserted on the genome, and downstream of a number of different constitutive promoters. In constructing the devices containing these constitutive transcriptional regulators, we also highlight the importance of the 5′ UTR, the linker region between the TSS and the riboswitch structure. The design of these devices was then used to inform the design of four endogenous E. coli stress response promoter riboswitch regulators. The orthogonal riboswitch allows translation control of expression from stress responsive promoters only when PPDA is present. This additional layer of expression control allows exogenous control of expression following the induction of transcription by the respective environmental metabolic stress. This additional layer of riboswitch-dependent control allows for temporal control and activation of gene expression in response to environmental and stress stimuli. Decoupling of growth and production phases has been previously implemented to maximize volumetric production titers using both process control strategies such as autoinduction media or late stage fed-batch induction[81,82] and autonomous genetic control.[73,83] However, a challenge with using autonomous control elements such as promoters responsive to environmental or stress stimulus is their undesired activation during preculture conditions and prior to induction. Here, use of a riboswitch permits tight basal control of gene expression in the absence of riboswitch activation, and upon further activation, the expression output is attenuated over a large range. This may prove particularly useful in bioprocessing of toxic or difficult to express proteins, when it is often desirable to accumulate biomass prior to protein production while maintaining tight regulation of expression. This can be challenging in traditional expression systems due to high basal expression and population-level expression heterogeneity. By utilizing post-transcriptional control, these issues can be easily addressed. Additionally, by combining the endogenous sensing capabilities of the cell with riboswitch control, we addressed a major limitation to the use of riboswitches. Currently, the numbers of riboswitch–ligand pairs are somewhat limited, and those which respond to orthogonal small molecule inducers are limited yet further. By tapping into a number of endogenous sensory pathways, we expanded the stimuli, which can regulate protein expression, through post-transcriptional control. In the case of these systems, promoter activation machinery provides stimuli specific induction, and the riboswitch affords tunable control. In other words, once transcription has be activated by the respective stress stimuli, the magnitude of this activation can be regulated by riboswitch addition, allowing the expression rate to be balanced with respect to the cellular capacity to produce, process, or secrete the desired protein of interest.[84] The combination of stress activated transcription and tunable control of protein production will hopefully prove of great use to the synthetic biology and bioprocessing communities.

Methods

Strains and Plasmids

All cloning, including library generation, was carried out using HiFi assembly (NEB #E5520S) and transformation into Escherichia coli DH5 alpha (NEB #C2987). Assembly DNA was synthesized either as dsDNA or ssDNA. Plasmid DNA was then isolated using silica column purification (Qiagen) and transformed into chemically competent E. coli DH10β Top10 F′ (ThermoFisher Scientific). This strain was used for all subsequent expression analyses, unless stated otherwise. pKIKO vectors were cloned in E. coli PIR2 strain (Invitrogen). For details on cloning, see Supporting Information and Methods. The sequence of the pTAC-ORS-L30-eGFP plasmid and key expression cassettes, consisting of the promoter-5′UTR-CDS-terminator region of the main devices used in this study, can be found in Supplementary Table 10.

Cultivation and Expression Conditions

Single colonies were inoculated into LB medium and carbenicillin 100 μg mL–1 (0.5% yeast extract, 0.5% NaCl, 1.0% bactotryptone) following calcium chloride mediated transformation of plasmid DNA into chemically competent cells prepared according to a modified procedure.[85] Liquid cultures were incubated at 37 °C with shaking at 200 rpm for 16 h overnight. These seed cultures were diluted 100× in 10 mL of LB supplemented with 0.2% glucose and incubated as before for 2 h (OD 600–0.3). Cultures were then aliquoted (500 μL) into deep well plates (Amgen 1.4 mL 96 well plates) with the designated final inducer concentrations (100 μM IPTG) and pyrimido-[4,5-d]-pyrimidine-2,4-diamine (PPDA) or 2-amino- purine (2-AP) at 1000 μM for standard OFF–ON expression. All constitutive promoter constructs were induced with PPDA only, i.e. no IPTG was required. Following incubation for 3 h, 250 μL of culture was aspirated into a separate deep well block, and cells were harvested by centrifugation. Cell pellets were then washed with PBS (Sigma), diluted 1:4, and the cell density and fluorescence were recorded using a ClarioStar microplate reader (BMG). EZ-rich media (Teknova) was used for testing and kinetic characterization of the PphoA::D4-ORS-L35 device. EZ-rich is a modified Neidhardt media[86] with the following composition: 40 mM MOPS, 4 mM tricine, 0.01 mM iron sulfate, 9.5 mM ammonium chloride, 0.276 mM potassium sulfate, 0.0005 mM calcium chloride, 0.525 mM magnesium chloride, 50 mM sodium chloride, 3 × 10–9 M ammonium molybdate, 4 × 10–7 M boric acid, 3 × 10–8 M cobalt chloride, 10–8 M cupric sulfate, 8 × 10–8 M manganese chloride, 10–8 zinc sulfate, 1.32 mM potassium phosphate, 1.5 mM potassium hydroxide, 0.2 mM adenine, 0.2 mM cytosine, 0.2 mM uracil, 0.2 mM guanine, 0.8 mM l-alanine, 5.2 mM l-arginine HCL, 0.4 mM l-asparagine, 0.4 mM l-aspartic acid (potassium salt), 0.6 mM l-glutamic acid (potassium salt), 0.6 mM l-glutamine, 0.8 mM l-glycine, 0.2 mM l-histidine HCL H2O, 0.4 mM L-isolucine, 0.4 mM l-proline, 10 mM l-serine, 0.4 mM l-threonine, 0.1 mM L-tryptophan, 0.6 mM l-valine, 0.8 mM l-leucine, 0.4 mM l-lysine, 0.2 mM l-methionine, 0.4 mM l-phenylalinine, 0.1 mM l-cysteine HCL, 0.2 mM l-tryosine, 0.05 mM thiamine, 0.01 mM calcium pantothenate, 0.01 mM para-amino benzoic acid, 0.01 mM para-hydroxy benzoic acid, 0.01 mM dihydroxyl benzoic acid, and 0.2% glucose (w/v).

FACS Counter-Selection of Improved Riboswitch Variants

Two separate sorting schemes were applied to isolate functional riboswitches from the anti-RBS library; sorting of cells with low fluorescence intensity under OFF induction, followed by sorting of highly fluorescent cells under ON induction (low > high). Conversely, the opposite scheme was also applied, where highly fluorescent cells were sorted under ON induction, followed by isolation of cells with low fluorescence when induced with OFF. Prior to FACS sorting, 500 μL of the cryopreserved library was inoculated into 50 mL of LB + carbenicillin 100 μg mL–1 and incubated for 16 h at 30 °C with shaking at 200 rpm. The culture was diluted 100× and incubated as before for 2.5 h (OD600 = 0.3). Cells were then induced with either 100 μM IPTG or 100 μM IPTG + 200 μM PPDA and incubated for 20 h at 30 °C, and cells (2 mL) were then harvested by centrifugation and resuspended in PBS prior to FACS sorting. The bottom 5% of the IPTG induced culture (low OFF expression) and the top 5% of the IPTG + PPDA (high on expression) were sorted (10 000 events). Following sorting of these two populations, 15 mL of LB + 0.2% glucose was added to the sort tubes. Cells were then pelleted by centrifugation (2500g for 10 min), resuspended in 1 mL of LB + 0.2% glucose, and incubated overnight at 30 °C, 200 rpm. The induction and sorting process was repeated as before, but each population was induced under the alternative condition (IPTG only > IPTG + PPDA, IPTG + PPDA > IPTG only). Sorted cells were resuspended and concentrated as before and plated onto LB + carbenicillin 100 μg mL–1 + 0.2% glucose plates. All FACS analysis was carried out using a Sony SH800. Data were analyzed using FlowJo Single Cell Analysis Software version 10.

Plate-Based Screening Protocol

Single colonies (90 from each FACS sort scheme) were inoculated into 500 μL of LB + carbenicillin 100 μg mL–1 + 0.2% glucose and incubated at 37 °C shaking at 1000 rpm for 16 h. This culture was then diluted 100× in LB + carbenicillin 100 μg mL–1 + 0.2% glucose in a 96 deep well block prior to incubation for 2 h at 37 °C shaking at 1000 rpm. The culture was then induced with 100 μM of IPTG. Half of the IPTG induced culture was then transferred to another plate and induced with 1000 μM of PPDA. Both plates were incubated for a further 3 h. The resulting culture was harvested by centrifugation at 2250g for 10 min at 4 °C. Cells were then resuspended in 1 mL of PBS and pelleted by centrifugation as before. Finally, cells were again resuspended and diluted 1:1 in PBS. A 200 μL aliquot was then transferred into a black, clear flat-bottomed microplate (Greiner), and the optical density (OD λ = 600 nm) and fluorescence (eGFP fluorescence was measured at λEx/λEm = 488/520 nm, while mKate2 fluorescence was measured at λEx/λEm = 588/633 nm) were recorded. Fluorescence was then normalized to optical density (RFU/OD).

Design of Experiments

The Design of Experiment data structure was generated with JMP 12 pro using the custom design tool. D-optimality was used to select the chosen design from a number of randomly generated designs (random starts = 20 000) to investigate all main effects and two-factor interactions. The factors tested by the D-optimal Design of Experiments are listed in Supplementary Table 4. This design was generated using JMP-12 Pro Custom Design tool based on the D-optimality criteria which scores designs based on reducing the generalized variance around the selected design points, thus giving a more precise estimate of the model effects. The design is a modified fractional factorial design consisting of 40 runs containing 4 center points. The choice of runs was determined by the JMP custom design algorithm. Modeling of the structured experimental data set will allow the prediction of all linear main effects and two-factor interactions. Unlike other fractional factorial DoE designs, the D-optimal custom design allows fractional reduction in the number of experiments required while also facilitating testing of multiple categorical factors (i.e., A or B or C) at more than two levels, making its use essential for DoE testing of the 4 N-terminal linker variants used in this study. An eight-run block was included to control for run order effects and to protect against user error. If human error prevented the use of any data points, this block would mean that only the runs in the respective block need be repeated. Confirmation that this repeated data did not significantly deviate from the rest of the data set is then confirmed when assessing the model effects during SLS regression. SLS model fitting did not assign the block effect as significant. The induction protocol was modified to align with the data structure shown in Supplementary Table 5. Data were collected and processed using the expression conditions described previously. Protein expression was measured after 3 and 24 h. The SLS model fitting and all subsequent experimental expression data (Figure D–F) show the eGFP production after 24 h.

Standard Least Squares Model Fitting

Modeling was carried out using the Fit Model function with JMP pro 12. The statistical model was built using the JMP 12 pro Standard Least Squares fit model algorithm using Residual Sum of Squares (RSS) reduction. Factor importance and factor effects were assessed using ANOVA. The underlying SLS model is represented in this study using the JMP 12 pro Prediction Profiler (Figure A and B, Supplementary Figure 6).

Stress Response Promoter Testing: Phosphate Starvation Coupled Riboswitch Function Assay

The PphoAD4-ORS-L35-eGFP and PphoA ΔRS-L35-eGFP constructs were transformed into E. coli DH10β TOP10 F′. Single colonies were inoculated into LB containing 100 μg mL–1 and incubated for 16 h at 37 °C with shaking at 200 rpm. Cells were then diluted 100× in EZ-rich media and grown to an optical density (λ = 600 nm) of 0.3 (∼2.5 h). The culture was centrifuged (2250g, 5 min) and resuspended in an equal volume of EZ-rich media supplemented with either 10, 1.32, 1, 0.1, 0.01, 0.001, or 0.0001 mM K2HPO4. The culture was then aliquoted (500 μL) into a 96-well deep-well block and induced with 0, 40, 200, and 1000 μM PPDA and incubated at 37 °C with shaking at 1000 rpm for 24 h. The OD600 and fluorescence was measured at 24 h. Cells were washed in PBS, and readings were taken as described previously, carried out in biological triplicate. Expression kinetics of the PphoA::D4-ORS-L35 was also assessed. Cells were treated as above, but following resuspension in K2HPO4 supplemented media, the cell suspension was transferred (1 mL) into a 48 well, baffled deep-well plate (m2p, M2P-48-B) containing either 1000, 400, 200, 40, 8, and 1.6 μM PPDA (PphoA::D4-ORS-L35 only) and incubated in the BioLector culture platform for 24 h at 37 °C with shaking at 1200 rpm. Optical density (λ = 620 nm) and fluorescence (λEx/λEm = 488/520 nm) was measured every 15 min. Carried out in biological triplicate.

Hyperosmotic Shock Activation of PosmY Assay

The PosmYD4-ORS-L35-eGFP and PphoA ΔRS-L35-eGFP constructs were transformed into E. coli DH10β TOP10 F′. Single colonies were inoculated into LB containing 100 μg mL–1 and incubated for 16 h at 37 °C with shaking at 200 rpm. Cells were then diluted 100× in LB-rich media and grown to an optical density (λ = 600 nm) of 0.3. The culture was centrifuged (2250 g, 5 min) and resuspended in an equal volume of LB media supplemented with 1, 0.8, 0.6, 0.4, 0.2, or 0.171 M (standard LB) or 0.1 M NaCl. The culture was then aliquoted (500 μL) into a 96-well deep-well block and induced with 0, 40, 200, and 1000 μM PPDA and incubated at 37 °C with shaking at 1000 rpm for 24 h. The OD600 and fluorescence was measured at 3 and 24 h. Cells were washed in PBS, and readings were taken as described previously. This experiment was carried out in biological triplicate.

Hydrogen Peroxide Induction of PsoxS Assay

The PsoxSD4-ORS-L35-eGFP and PsoxS ΔRS-L35-eGFP constructs were transformed into E. coli DH10β TOP10 F′. Single colonies were inoculated into LB containing 100 μg mL–1 and incubated for 16 h at 37 °C with shaking at 200 rpm. Cells were then diluted 100× in LB-rich media and grown to an optical density (λ = 600 nm) of 0.3. The culture was then aliquoted (500 μL) into a 96-well deep-well block and induced with hydrogen peroxide (0, 0.3, 0.03, 0.003, and 0.0003%) and PPDA (0, 40, 200, and 1000 μM) and incubated at 37 °C with shaking at 1000 rpm for 24 h. The OD600 and fluorescence was measured at 3 and 24 h as previously described. The assay was carried out in biological triplicate.

PcstA Induction Assay

The PcstAD4-ORS-L35-eGFP and PcstA ΔRS-L35-eGFP constructs were transformed into E. coli DH10β TOP10 F′. Single colonies were inoculated into LB containing 100 μg mL–1 and incubated for 16 h at 37 °C with shaking at 200 rpm. Cells were then diluted 100× in LB-rich media and grown to an optical density (λ = 600 nm) of 0.3. The culture was centrifuged at 2250g for 5 min. The supernatant was removed, and cells were resuspended in LB media ±5% glucose and aliquoted (500 μL) into a 96-well deep-well block containing PPDA (0, 40, 200, and 1000 μM) and incubated at 37 °C with shaking at 1000 rpm for 24 h. The OD600 and fluorescence was measured at 3 and 24 h as previously described. Samples were measured in biological triplicate.

82 in total

Systematic Evaluation of Genetic and Environmental Factors Affecting Performance of Translational Riboswitches.

Results and Discussion

Conclusion

Methods

Strains and Plasmids

Cultivation and Expression Conditions

FACS Counter-Selection of Improved Riboswitch Variants

Plate-Based Screening Protocol

Design of Experiments

Standard Least Squares Model Fitting

Stress Response Promoter Testing: Phosphate Starvation Coupled Riboswitch Function Assay

Hyperosmotic Shock Activation of PosmY Assay

Hydrogen Peroxide Induction of PsoxS Assay

PcstA Induction Assay

Review 1. Engineering genes for predictable protein expression.

Review 2. Overloaded and stressed: whole-cell considerations for bacterial synthetic biology.

3. Mapping of amino acid substitutions conferring herbicide resistance in wheat glutathione transferase.

4. Metabolomic analysis of riboswitch containing E. coli recombinant expression system.

5. Synthetic RNA devices to expedite the evolution of metabolite-producing microbes.

6. osmY, a new hyperosmotically inducible gene, encodes a periplasmic protein in Escherichia coli.

7. Tetracycline-aptamer-mediated translational regulation in yeast.

Review 8. Contextualizing context for synthetic biology--identifying causes of failure of synthetic biological systems.

9. D-Tailor: automated analysis and design of DNA sequences.

10. Reducing DNA context dependence in bacterial promoters.

Review 1. Application of Corynebacterium glutamicum engineering display system in three generations of biorefinery.

2. Aptamer-Array-Guided Protein Assembly Enhances Synthetic mRNA Switch Performance.

Review 3. Riboswitches for Controlled Expression of Therapeutic Transgenes Delivered by Adeno-Associated Viral Vectors.

4. Attenuator LRR - a regulatory tool for modulating gene expression in Gram-positive bacteria.