Literature DB >> 35174698

Improving the Robustness of Engineered Bacteria to Nutrient Stress Using Programmed Proteolysis.

Klara Szydlo¹, Zoya Ignatova¹, Thomas E Gorochowski².

Abstract

The use of short peptide tags in synthetic genetic circuits allows for the tuning of gene expression dynamics and release of amino acid resources through targeted protein degradation. Here, we use elements of the Escherichia coli and Mesoplasma florum transfer-mRNA (tmRNA) ribosome rescue systems to compare endogenous and foreign proteolysis systems in E. coli. We characterize the performance and burden of each and show that, while both greatly shorten the half-life of a tagged protein, the endogenous system is approximately 10 times more efficient. On the basis of these results we then demonstrate using mathematical modeling and experiments how proteolysis can improve cellular robustness through targeted degradation of a reporter protein in auxotrophic strains, providing a limited secondary source of essential amino acids that help partially restore growth when nutrients become scarce. These findings provide avenues for controlling the functional lifetime of engineered cells once deployed and increasing their tolerance to fluctuations in nutrient availability.

Entities: Chemical

Keywords: burden; genetic circuit; protein degradation; proteolysis; resource recycling

Mesh：

Substances：

Year: 2022 PMID： 35174698 PMCID： PMC9097571 DOI： 10.1021/acssynbio.1c00490

Source DB: PubMed Journal: ACS Synth Biol ISSN： 2161-5063 Impact factor: 5.249

Introduction

Prokaryotic protein degradation is an essential cellular quality control mechanism and plays a crucial role in eliminating damaged and/or nonfunctional proteins.[1−3] It is enabled by a network of ATP-dependent proteases and adaptors that recognize specific motifs in misfolded proteins, or degrons.[4,5] Protein degradation in bacteria is mediated by the prokaryotic transfer-mRNA (tmRNA) ribosome rescue system, where an SsrA peptide tag is added C-terminally to nascent polypeptides, targeting them for degradation by several endogenous proteases.[6] These include ClpXP, ClpAP, FtsH, and Lon, with ClpXP and ClpAP being the most active in Escherichia coli, degrading over 90% of SsrA-tagged proteins.[1,3,7] The tagging of proteins for degradation has gained interest in the field of synthetic biology as it allows for specific and controllable protein degradation and has been used to modulate protein turnover rates, investigate protein function by reducing intracellular concentrations, and for the tuning of dynamic processes (e.g., modulating the period of genetic oscillators).[8−11] The SsrA peptide-tag system is conserved across prokaryotic species, but the tags vary in their amino acid composition and length.[8,12−14] The E. coli SsrA tag is the most extensively characterized, and its last three amino acids, LAA, determine the tag strength and the rate of tagged protein degradation.[8] Variants of these critical residues such as LVA, AAV, and ASV result in different degradation rates, with LAA and LVA rendering tagged-GFP more unstable than the AAV or ASV variants.[8] The growing knowledge of E. coli proteases and their dependency on auxiliary adaptor proteins has also allowed for controllable modulation of protein half-lives and degradation.[2,15,16] For example, the degradation of proteins tagged with an E. coli tag variant “DAS” is mediated by the induction of the SspB adaptor protein in Bacillus subtilis.[14] Using SsrA tags from distinct species offers another level of control over protein degradation. The simultaneous use of multiple tags in parallel supports the construction of more complex systems with which degradation of multiple proteins can be independently controlled. Several SsrA tags from other species have been characterized,[13,14,17] including that of Mesoplasma florum.[12] This is targeted by the efficient M. florum Lon protease that acts orthogonally to the endogenous E. coli system, making it possible to use both systems simultaneously in E. coli cells.[12] Previous studies have identified regions of the M. florum tag which are crucial for recognition by E. coli and M. florum proteases, leading to the development of variants of the M. florum tag through the deletion of nonessential regions or replacement of residues with other amino acids.[10,18] Furthermore, the specificity of the endogenous M. florum Lon protease to the cognate M. florum SsrA tag has enabled the development of inducible orthogonal protein degradation systems in E. coli with diverse applications, including the ability to control the behavior of synthetic circuits such as toggle switches.[10−12,18] While targeted protein degradation has seen widespread use in tuning the function of genetic parts and circuits, much less attention has been placed on its use in a more native context. Specifically, using protein degradation to help recycle essential amino acid resources when nutrient stress occurs.[19,20] Although such capabilities are less important when cells are grown in the rich and carefully controlled conditions of the lab, when deploying an engineered system into real world environments like your gut or the soil, high variability in nutrient availability is inevitable and cells must be able to react.[21−24] Therefore, having programmable systems to help buffer cells from these effects is important and warrants further investigation. Here, we attempt to address this need by exploring how endogenous and heterologous protein degradation systems can be used to manage reservoirs of amino acids that are locked up in stable nonendogenous proteins that can then be subsequently released when needed. We explore the suitability of endogenous and heterologous proteolysis systems for implementing this type of system and show using auxotrophic strains how targeted release of amino acids from a reporter protein enables the partial recovery of growth when an essential amino acid becomes scarce in the growth media. Our proof-of-concept systems offer inspiration for developing new cellular chassis that are more robust to nutrient fluctuations, as well as opening avenues to constrain the functional “shelf-life” of a cell by providing an internal amino acid reservoir with a limited capacity, acting somewhat like a biological battery.

Results

Assessing the Proteolytic Activities of E. coli and M. florum SsrA Tags

To gain an insight into the effectiveness of different proteolytic tags, we compared the activities of the E. coli and M. florum proteolysis systems by assembling genetic constructs in which an eGFP (GFP) reporter gene was tagged with one of two proteolysis tags. Specifically, we used the E. coli (Ec; AANDENYALAA) and M. florum (Mf; AANKNEENTNEVPTFMLNAGQANYAFA) SsrA tag sequences which were codon optimized for expression in E. coli (Materials and Methods) and fused these to the C-terminus of GFP, the expression of which was under the control of an isopropyl β-d-1-thiogalactopyranoside (IPTG) inducible promoter (Plac). In this way, GFP was synthesized bearing one of two peptide tags, targeting it for proteolytic degradation by each of our chosen systems (Figure A). Because the Mf tag is specifically recognized by its cognate Lon protease from M. florum (Mf-Lon), which is not present in E. coli, we also constructed a separate plasmid where a codon-optimized lon gene from M. florum(12) was expressed under the control of an arabinose-inducible promoter (PBAD).

Figure 1

E. coli and M. florum proteolysis systems used for targeted protein degradation in E. coli. (A) Schematic of the proteolysis systems. GFP is expressed with E. coli (Ec) or M. florum (Mf) SsrA tags, which mark it for degradation by endogenous proteases, or the orthogonal plasmid-borne Mf-Lon protease, respectively. (B) GFP fluorescence normalized to cell density of E. coli BL21(DE3) cells expressing nontagged GFP (nt), GFP-Ec (Ec), or GFP-Mf without and with the coexpression of Mf-Lon (Mf and Mf + Lon, respectively). Arrow indicates time point of GFP induction. (C) GFP fluorescence normalized to cell density of cells expressing nontagged GFP (nt), GFP-Ec (Ec), or GFP-Mf (Mf) after removal of inducer, while maintaining Mf-Lon expression in the case of GFP-Mf. (D) Percent fluorescence normalized to the time of removal of the inducer of cells expressing nontagged GFP (nt), GFP-Ec (Ec), or GFP-Mf (Mf). Curves are fitted to first order exponential decay. Data are means ± SD (n = 3 independent biological replicates). To assess the performance of the two tags, we expressed untagged GFP, GFP-Ec, or GFP-Mf alone and simultaneously with Mf-Lon in E. coli BL21(DE3) cells and measured cell growth and fluorescence (Figure S1). We observed almost no fluorescence in cells expressing GFP-Ec compared to cells expressing untagged GFP (2.8% at 6 h), indicating that the E. coli tag was effective in targeting the tagged protein for degradation by endogenous proteases (Figure B). In contrast, GFP-Mf when expressed alone, saw reduced, though nevertheless substantial levels of GFP, suggesting that most, but not all, of this protein escaped the endogenous E. coli proteases (Figure B). This was confirmed with additional experiments in which the Mf-Lon expressing plasmid was both absent and present, corroborating previous findings[10,18] (Figure S2). As expected, further induction of Mf-Lon protease caused a 76% drop in GFP-Mf fluorescence, supporting the notion that the Mf tag is specifically recognized (Figure B). The fluorescence observed from cells expressing the untagged GFP remained largely the same upon induction of the Mf-Lon protease, demonstrating the specificity of the protease for the Mf tag (Figure S3). To further compare the efficiency of the Ec and Mf tags, we induced the expression of untagged GFP, GFP-Mf, or GFP-Ec and after 5 h removed the inducer. After allowing for GFP maturation,[25] we then monitored the degradation rate of each GFP variant by the drop in fluorescence and calculated their half-lives (Figure C,D; Materials and Methods). The fluorescence levels of cells containing GFP-Ec remained low throughout, indicating that even strong expression rates could not overcome the endogenous protein degradation. From this data, we found GFP, GFP-Ec, and GFP-Mf to have half-lives of 565, 6, and 56 min, respectively. These numbers support the high efficiency of the endogenous E. coli proteases with half-lives being almost 10 times shorter than when using the M. florum system. However, the Mf-tag did still cause an increased turnover rate, with GFP-Mf exhibiting a half-life less than a tenth of the untagged GFP. To assess how general these results were, we further tested if each system functioned similarly in the industrially relevant E. coli BL21 (DE3) star strain, in which RNase E has been knocked out for higher mRNA stability (Figure S4). As expected, we observed longer half-lives for each tagged GFP of 14 and 424 min for GFP-Ec and GFP-Mf, respectively, and virtually no measurable degradation of the untagged GFP, which was likely due to the increased stability of mRNA in this strain (Figure S4). We also explored the impact of amino acid recycling when a cell is further burdened by a large genetic regulatory circuit. We chose to use a large 3 input, 1 output genetic logic circuit called 0xF6 designed by the Cello software[26] and composed of nine transcription factors. Our existing strains containing the untagged GFP and GFP-Ec were cotransformed with the pAN3938 plasmid encoding this circuit and an assessment of growth carried out (Figure S5). As expected, growth was significantly affected by the addition of the 0xF6 genetic circuit. Following an exponential growth which was indistinguishable from that of untagged GFP expressing cells, the GFP-Ec expressing cells then entered a slower growth phase, presumably once the circuit components had been fully expressed. While the untagged GFP expressing cells ceased growth after the exponential phase, the GFP-Ec expressing cells continued growing to a higher density. This suggests that the 0xF6 circuit had started to affect the cells, and the expression of tagged GFP was able to provide some relief from this burden.

Dynamic and Targeted Control of Protein Degradation Using the M. florum SsrA System

A potential advantage of using the M. florum SsrA tag system in E. coli for the recycling of amino acids is the ability to dynamically control its expression to coincide with an increased demand for resources (e.g., during starvation conditions). This reduces the strength at which tagged proteins acting as a reservoir of amino acids need to be expressed, as their turnover rate can be kept low to ensure long-term protein stability when recycling is not required. Such a method is not possible with the endogenous system as it is continually active and needed by the host cell. Therefore, stronger and continual expression of the tagged protein is necessary to maintain a similar sized pool of reserve protein. We carried out several time-course experiments to investigate the precise dynamics of the Mf-tag system in this context where GFP-Mf expression was induced at t = 0 and Mf-Lon simultaneously induced or induced 1 or 2 h after GFP-Mf induction (Figure S6A). We found that only simultaneously inducing Mf-Lon with GFP-Mf resulted in increased degradation of GFP-Mf, while sequential induction of Mf-Lon after 1 or 2 h caused barely noticeable drops in fluorescence (4% and 3%, respectively). This result was unexpected given that Mf-Lon has been shown to function efficiently in E. coli,[10,18] but is likely due to the varying expression strengths of the GFP-Mf reporter and Mf-Lon protease, which reside on different plasmids and which are driven by different promoters (Figure ). To test this theory, we carried out additional experiments where Mf-Lon expression was induced 2 h before induction of GFP-Mf to allow further time for its accumulation (Figure S6A). We found that the initial increase in fluorescence when Mf-Lon was induced simultaneously with GFP-Mf was negated when Mf-Lon was induced 2 h prior, suggesting that expression and maturation of Mf-Lon occurs quickly, and efficient GFP degradation could occur. Nevertheless, the rate of fluorescence increase from 3 h after GFP-Mf induction was almost identical (Figure S6B), indicating that the concentration of Mf-Lon achieved when expressed from a PBAD promoter and medium-copy plasmid (p15A origin; ∼10 copies per cell) is unable to significantly reduce GFP-Mf levels.

Recovering Cell Growth by Amino Acid Recycling

A major challenge when developing genetic circuits is managing the burden they place on shared cellular resources.[27−32] The expression of a genetic construct will sequester key cellular machinery such as ribosomes and may exhaust amino acid supplies, which in turn can impact overall cell physiology and protein synthesis,[33−36] alter translation dynamics,[28] and trigger stress responses.[37,38] A reason for this large impact is that circuit components are often strongly expressed and designed to be highly stable, causing a large portion of the cell’s resources to become locked away from use in endogenous processes. Several studies have engineered ways to mitigate the burden placed on the cell; limiting recombinant protein expression via negative feedback loops, or reducing translational demand by splitting recombinant protein synthesis between endogenous and orthogonal ribosomes.[30,37,39] However, it has also been observed that supplementing the growth media of cells expressing recombinant proteins with amino acids can enhance growth rate and protein production.[38] Consequently, we hypothesized that by increasing amino acid turnover of heterologous protein products through targeted proteolysis, we would be able to help mitigate the burden a genetic circuit places on its host cell. To test this idea, we measured the growth rate of cells expressing tagged and untagged GFP under the control of the same strong Plac promoter (Figure A). We reasoned that the expression of the tagged GFP would place less of a burden on the host compared to the untagged version, due to increased recycling of amino acids.[27,38,40] Although the expression of any GFP protein will reduce cell growth rate, the reduction in growth rate for the first 3 h of induction was smaller for cells expressing GFP-Ec (41%) and GFP-Mf with Mf-Lon (37%), compared to cells expressing untagged GFP (51%) (Figure S7). This suggests that while the expression of a recombinant protein will always cause a burden, this burden is partially alleviated by more effective recycling of these products, making these resources accessible to endogenous processes. It is known that protein degradation is elevated under various stress conditions, possibly as a way to increase the availability of amino acids for synthesis of stress-related proteins.[19,41] Furthermore, as part of the E. coli stringent response to nutrient limitations, there is an increase in the level of amino acid biosynthesis enzymes, to meet the demand for amino acids.[42] Considering this, we asked whether the potential benefit of using tagged proteins might increase when the host cell experienced nutrient related stress. We reasoned that increased recycling of a heterologous pool of proteins could benefit a host cell for which nutrients to synthesize amino acids had become scarce in the environment. To assess the feasibility of this approach, we developed a simple mathematical model to capture the key flows of a hypothetical essential resource in the cell (e.g., an amino acid the cell is unable to synthesize) and its impact on cell growth (Figure A). The model consisted of three ordinary differential equations that track the concentrations of a shared resource that is either available for use within the cell (N), is actively in use by endogenous proteins (P), or is locked up in foreign heterologous proteins (P):Here, N is the external resource concentration outside the cell with a cellular import rate of r, re and r are the rates that available resources within the cell are converted into endogenous or heterologous proteins, respectively, and r is the recycling rate of the heterologous proteins (e.g., due to targeted proteolysis). Cellular growth and the associated dilution (by cell division) of all resources was captured by μ = 0.1P. Parameters were chosen such that overall growth rate of the cell was consistent with E. coli data (i.e., having a division time ∼25 min) and that relative internal transport, production and degradation rates were biologically realistic (Materials and Methods).

Figure 2

Model capturing the benefits of amino acid recycling. (A) Schematic of the model. N denotes the external resource concentration, N, P, and P denote the concentration of a key resource (i.e., an amino acid that cannot be natively produced), available within the cell, locked up in endogenous or in heterologous proteins, respectively. r denotes the cellular import rate of resources, which can be divided into r, the rate at which resources are converted into endogenous proteins, and r, the rate at which resources are converted into foreign heterologous proteins. μ = f(P) captures cell growth and dilution of resources by cell division. (B) Simulation of the normalized cell growth rate in a strain expressing recombinant proteins (i.e., with no amino acid recycling), and in a strain expressing tagged heterologous proteins (i.e., with amino acid recycling). (C) Simulation of the effects of nutrient stress on normalized cell growth rate expressing tagged proteins (i.e., with continual amino acid recycling; “AA recycling”), and in a strain expressing no heterologous protein (“wild-type”). The external resource (nutrient) is continually present for the first 500 min (gray shaded region), then either removed completely, or oscillating nutrient levels are applied after this time. In all cases, growth rate is normalized to the steady state growth rate when the external resource is present (i.e., N = 1). (D) Time-series of normalized growth rate (to wild-type cells when the external resource is present) of cells exposed to a cycling of external nutrient absence (white regions) and presence (gray shaded regions). Responses shown for wild-type cells (black dashed line), cells with continual amino acid recycling (red lines; r = 0.1 protein resource–1 min–1, light–dark: r = 0.1, 0.2, 0.4 resource protein–1 min–1), and cells where amino acid recycling is only active when the external nutrient is absent (blue lines; r = 0.1 protein resource–1 min–1, light–dark: r = 0.1, 0.2, 0.4 resource protein–1 min–1). Panels from top to bottom show varying lengths of period that nutrient is absent from the external environment. (E) Heat maps showing how varying heterologous protein production (r = 0, 0.05, 0.2 protein resource–1 min–1) and recycling (r = 0.001, 0.015, 0.029, 0.043, 0.057, 0.072, 0.086, 0.1 resource protein–1 min–1) rates, and period (25, 50, 75, 100, 125, 150, 175, 200 min) of the nutrient/resource availability cycling (see panel D) affect the maximum percentage drop in the initial steady state growth rate when external nutrient is present. Using this model, we simulated cells expressing tagged and untagged proteins (Figure B) and exposed these cells to several external environmental shifts to temporally vary the resources available (Figure C). In the first shift, we removed all resources from the environment at 500 min, and in the second, at the same time point, we applied an oscillating external nutrient concentration. In both cases, we compared cells not producing any heterologous protein (i.e., r = 0) to those producing a recombinant protein that is subsequently recycled for reuse by the cell. We then measured their response in terms of growth rate normalized to when the external nutrient was continually present (i.e., the steady state growth rate when N = 1). In both cases, the model showed a reduction in the relative impact on growth rate to changes in environmental availability (Figure C), demonstrating the ability for a recycled internal reservoir of a heterologous resource to act as a reserve that can help buffer the cell temporarily from environmental change. It should be noted that inclusion of a heterologous resource pool and its recycling does have an impact on cellular growth rate. However, for some applications (e.g., excitable systems that are sensitive to even minor fluctuations in cellular behaviors[43]), it may be preferable to have a more consistent performance when faced with environmental variability. Even though our previous experiments had shown a limited capacity to dynamically vary protein degradation rates, we also explored how future controllable amino acid recycling systems might compare to a simpler system in which heterologous proteins are continually recycled. We ran simulations of our model where no heterologous protein pool was present (“Wild-type”; r = 0) and where a heterologous protein pool was continually recycled (“AA recycling”) or only recycled upon removal of nutrient from the environment (“Controlled AA recycling”). We allowed each system to reach an initial steady state with the external nutrient present (i.e., N = 1 until t = 500 min), then provided alternating time periods in which the nutrient was completely removed from the environment (i.e., N = 0) and then made available again. We tracked the varying growth rate over time to assess the impact on the cells (Figure D). As expected, wild-type cells displayed large drops in growth rate upon removal of an external nutrient that was proportional to the length of the removal period and fast recovery was seen upon reintroduction of the external nutrient. Activation of constant recycling saw a minor reduction in growth rate compared to the wild-type cells. However, this allowed for the drop in growth rate upon nutrient removal to be a smaller fraction of the initial growth rate. In contrast, controlled amino acid recycling that was active only when the essential nutrient was removed from the environment showed two different features. First, because recycling was only active upon removal of the external nutrient, for normal conditions there was no recycling of the heterologous protein and so a larger impact was seen on the normal growth rate compared to when continual recycling was used (e.g., lower initial normalized growth rates for blue lines in Figure D). Second, removal of the external nutrient led to a transient increase in growth rate as recycling was activated. However, as the pool of heterologous protein was consumed the growth rate also then began to drop. If the period of nutrient switching was short enough though, it was possible for no reduction below the initial growth rate to be seen (e.g., blue lines in bottom panel of Figure D). We also generated heat maps showing the percentage drops in growth rate from an initial steady-state growth rate in which the essential nutrient was abundant in the environment (i.e., N = 1) for varying recycling rates (r) and periods of removal from the environment, across both low and high rates of heterologous protein production (r). These simulations revealed that as expected, when no heterologous protein production is present, growth rate drops with the length of time (period) the essential nutrient is removed from the environment (Figure E). The expression and continual recycling of a heterologous protein was able to reduce these drops in growth rate, and this effect was enhanced if the internal pool of heterologous protein was sufficiently large and recycled at a sufficiently high rate (i.e., high r and r). For the controlled amino acid recycling, we found that for particular combinations of heterologous protein production and recycling rates and shorter periods of nutrient removal, drops in growth rate could be completely eradicated (white regions in Figure E). This suggests that controlled amino acid recycling (if sufficiently rapid) is a feasible strategy for completely shielding a cell from environmental nutrient fluctuations. However, trade-offs in the size of the internal heterologous protein pool and the rate of recycling affect the ability to robustly respond to differing lengths of nutrient fluctuation.

Buffering Auxotrophic Cells from Environmental Amino Acid Fluctuations

To test some of the model predictions, we used auxotrophic E. coli strains RF10[44] (ΔlysA) and ML17[45] (ΔglnA), which are unable to synthesize lysine and glutamine, respectively. This allowed us to tightly control endogenous amino acid levels by modulating the external supply in the media. Furthermore, lysine and glutamine are among the most abundant amino acids in our GPF reporter (8.4% and 6.7% of the total amino acid composition, respectively) offering suitable reservoirs of these key resources. We initially tested the ability of the endogenous Ec tag system to enhance cell growth as we had previously found that it resulted in faster degradation of GFP compared to the orthogonal Mf tag system. We grew each of the strains expressing untagged GFP and GFP-Ec in nutrient-rich media to allow for a buildup of the recombinant protein. Following this, cells were switched to minimal media, effectively removing the source of all external amino acids, and for our auxotrophic strains, completely removing access to lysine and glutamine, respectively. Consistent with our model predictions, we found that both strains expressing GFP-Ec exhibited a higher growth rate than cells expressing untagged GFP; 0.128 and 0.15 h–1 for GFP-Ec versus 0.08 and 0.091 h–1 for GFP for the ΔlysA and ΔglnA strains, respectively. This equated to an increase in growth rate of 60% and 65% for the ΔlysA and ΔglnA strains, respectively (Figure A,B). We suspect the higher growth rates are due to the degradation of GFP-Ec, which is supported by the lower fluorescence levels (Figure C). With the addition of lysine or glutamine (7 mM) to the medium for the respective untagged GFP-expressing auxotrophic strains, there was a marked increase in cell growth rate from 0.008 to 0.13 h–1 for the ΔlysA strain when lysine was present, and from 0.091 to 0.193 h–1 for the ΔglnA strain when glutamine was present (Figure B). This indicated that glutamine and lysine were the major limiting factors for cell growth and that recycling of the internal heterologous protein reservoir was able to partially buffer this impact (37% and 31% recovery for ΔlysA and ΔglnA, respectively).

Figure 3

Targeted GFP degradation provides amino acids to auxotrophic strains upon nutrient limitation. (A) Growth of the RF10 (ΔlysA) and ML17 (ΔglnA) strains in minimal medium, expressing nontagged GFP (nt), GFP-Ec (Ec), or nontagged GFP with the addition of 7 mM lysine or glutamine supplement (+7 mM AA). Data are means ± SD. (B) Quantification of the exponential growth rates of cells. (*p < 0.05, as compared to nt condition for each strain, with 2-sample t-test). Data are means ± SE. (C) GFP fluorescence normalized to cell density of the RF10 (ΔlysA) and ML17 (ΔglnA) strains, expressing nontagged GFP (nt), GFP-Ec (Ec), or nontagged GFP with the addition of 7 mM lysine or glutamine supplement (+7 mM AA). Data are means ± SD (n = 5 independent biological replicates). Having shown that using the native E. coli tag could render cells more robust in the face of amino acid limitations, we next investigated whether the same effect could be seen when using the orthogonal Mf tag system. As mentioned previously and shown by our modeling, an orthogonal system would confer benefits over the endogenous one as it could target degradation in a dynamic and controllable manner. Again, we grew strains coexpressing untagged GFP and Mf-Lon, or GFP-Mf and Mf-Lon in rich media, before switching them to minimal media to eliminate external sources of nutrients. We observed a higher growth rate in both strains when expressing GFP-Mf compared to untagged GFP (0.076 and 0.06 h–1 for GFP-Mf versus 0.033 and 0.014 h–1 for untagged GFP for the ΔlysA and ΔglnA strains, respectively). The increase in growth of over 2-fold for the ΔlysA strain and over 4-fold for the ΔglnA strain corroborated our findings from the model and endogenous tag system (Figure A,B). This again could be attributed to the degradation of GFP-Mf as indicated by the lower fluorescence levels (Figure C). We also found that the addition of 10 mM lysine or glutamine to the medium recovered the growth of cells expressing untagged GFP and Mf-Lon; 0.033 h–1 increased to 0.068 h–1 in the ΔlysA strain, and 0.014 h–1 increased to 0.057 h–1 in the ΔglnA strain (Figure A,B). In contrast to the previous results, specifically in ΔglnA, the amino acid supplementation, even at fairly high concentrations of 10 mM did not result in an increased growth rate compared to cells with already induced GFP degradation, suggesting that maximum cell growth had already been achieved under these conditions.

Figure 4

Targeted GFP degradation using the foreign Mf tag system provides amino acids to auxotrophic strains upon nutrient limitation. (A) Growth of the RF10 (ΔlysA) and ML17 (ΔglnA) strains in minimal medium, coexpressing nontagged GFP and Mf-Lon (nt), GFP-Mf and Mf-Lon (Mf), or nontagged GFP and Mf-Lon with the addition of 10 mM lysine or glutamine supplement (+10 mM AA). Data are means ± SD. (B) Quantification of the exponential growth rates of cells (*p < 0.05, **p < 0.005, as compared to the nt condition for each strain, with 2-sample t-test). Data are means ± SE. (C) GFP fluorescence normalized to cell density of the RF10 (ΔlysA) and ML17 (ΔglnA) strains, coexpressing nontagged GFP and Mf-Lon (nt), GFP-Mf and Mf-Lon (Mf), or nontagged GFP and Mf-Lon with the addition of 10 mM lysine or glutamine supplement (+10 mM AA). Data are means ± SD (n = 5 independent biological replicates). We also found that cells expressing both GFP-Mf and Mf-Lon grew faster than cells expressing only GFP-Mf (Figure A,B). The induction of the Mf-Lon protease enhanced growth of both the ΔlysA and ΔglnA strains; 0.076 and 0.06 h–1, respectively, compared to where GFP-Mf alone was expressed: 0.057 and 0.026 h–1, respectively. This suggests that the benefits of increased protein degradation (Figure C), and therefore a higher level of amino acid recycling, outweigh the cost of expressing two recombinant proteins (the reporter and the protease). Indeed, upon Mf-Lon protease induction, we observed an increase in growth of 33% and 180% for the ΔlysA and ΔglnA strains, respectively, providing support for increased amino acid recycling within cells enhancing their robustness to nutrient stress. Interestingly, we observed that the ΔglnA strain grew slower than the ΔlysA strain, and that the benefits of amino acid recycling were more pronounced (Figures and 5). This may be due to the fact that glutamine is more common in the E. coli proteome than lysine.[46] Therefore, a lack of endogenous glutamine would have a greater effect on cellular growth when external nutrients were limited and cells were heavily burdened, than a lack of endogenous lysine.

Figure 5

Induction of the orthogonal Mf proteolysis system increases cell robustness against nutrient stress as a result of resource recycling. (A) Growth of the RF10 (ΔlysA) and ML17 (ΔglnA) strains in minimal medium expressing GFP-Mf (Mf), or GFP-Mf and Mf-Lon (Mf + Lon). Data are means ± SD. (B) Quantification of the exponential growth rates of cells (**p < 0.005, as compared to Mf condition, with 2-sample t-test). Data are means ± SE. (C) GFP fluorescence normalized to cell density of the RF10 (ΔlysA) and ML17 (ΔglnA) strains expressing GFP-Mf (Mf), or GFP-Mf and Mf-Lon (Mf + Lon). Data are means ± SD (n = 5 independent biological replicates).

Discussion

In this work, we have directly compared the effectiveness of the endogenous E. coli proteolysis system and a similar heterologous system from M. florum, and characterized them with the goal of using them as mechanisms for promoting targeted amino acid recycling within E. coli cells (Figure ).[8,12] We found that the endogenous system was approximately 10 times more effective than the M. florum system, shortening the half-life of untagged GFP almost 100-fold, and similar results were observed in a second E. coli BL21(DE3) star strain, indicating the transferability of this approach. We also observed some crosstalk between these systems, with the reporter protein containing the M. florum tag also seeing increased degradation compared to an untagged reporter when the cognate Mf-Lon protease was not present. While characterization of these systems has been performed independently,[8,10,11,18,47] we believe this study to be the first that directly compares these systems targeting an identical target protein and functioning within the same host cell context. In addition, we explored the option to activate targeted degradation by externally inducing expression of the M. florum system dynamically over time. However, we found that dynamic expression of the Mf-Lon protease in our system was unable to have a significant effect on GFP levels unless the protease was simultaneously induced or induced prior to the target protein (Figure S6). This was likely due to the use of different plasmid backbones with different plasmid copy numbers, resulting in strong expression of GFP-Mf that Mf-Lon could not overcome. We also studied the effect of proteolysis tags when the host cells expressed a large genetic circuit, and found that the Ec tag remained effective and supported slow cell growth after the burdensome circuit became established within them (Figure S5). Numerous strategies have been developed to mitigate the burden that genetic circuits and strong heterologous protein expression places on a host cell. Examples include the use of negative regulators, stress feedback sensors, and orthogonal ribosomes.[30,37,39] In our study, we asked whether the genetic circuit burden could be mitigated by using proteolysis tags to stimulate a higher turnover rate of amino acids, which could be used for the synthesis of endogenous proteins. We found that the reduction in growth rate of cells was indeed smaller when they expressed a tagged protein (Figure S7), providing evidence for the benefits of using proteolysis tags when expressing recombinant proteins. On the basis of these findings, we developed a model that further explored the role that proteolysis could play, specifically under nutrient stress. The model showed that benefits would be amplified when facing external amino acid shortages, especially when activation of proteolysis could be triggered only when necessary (Figure ). Finally, by using auxotrophic strains, we were able to show that recycled heterologous proteins could act as a limited reservoir of amino acid resources, both when using the endogenous and orthogonal tag systems, helping buffer the cell from fluctuations in nutrient availability and partially recovering cell growth (Figures and 4). We also found that the benefits of constant proteolysis outweighed the costs of expressing two recombinant proteins (Figure ), emphasizing the importance of resource recycling to cells exposed to nutrient stress. As our ambitions in synthetic biology grow and we begin to consider the construction of entire synthetic cells, understanding how resources flow and are recycled throughout these systems will become crucial. Our demonstration of the benefits of proteolysis tags as a means for amino acid recycling opens new avenues for other approaches controlling nutrient fluxes within cells and provides a fresh perspective on the use of internal pools of heterologous proteins (or other resources) that can be released when needed to alleviate potential environmental fluctuations. This methodology can be used to help buffer cells, and the engineered genetic systems they host, from the unavoidable variability that is present within real-world environments and paves the way for creating more reliable and robust biosystems.

Materials and Methods

Bacterial Strains, Media, and Cloning

The E. coli strain DH5α (φ80dlacZ ΔM15 Δ(lacZYA-argF)U169 deoR recA1 endA1 hsdR17rK-mK+ phoA supE44 λ– thi-1) was used for plasmid construction and cloning, and the strains BL21(DE3) (F–ompT hsdSB (rB– mB−) gal dcm (DE3)) and BL21(DE3) star (F–ompT hsdSB (rB– mB−) gal dcm rne131 (DE3)) were used for characterization of our genetic systems. Cells were grown in Luria–Bertani (LB) media (Roth, X968.4), or minimal media (12.8 g/L Na2HPO4·7H2O, 3 g/L KH2PO4, 1 g/L NH4Cl, 2 mM MgSO4, 0.1 mM CaCl2, 0.4% glucose). Also, 100 mg/mL ampicillin (Sigma-Aldrich, A9393), 50 mg/mL kanamycin (Sigma-Aldrich K1377), or 34 mg/mL chloramphenicol (Sigma-Aldrich, C0378) were used as selection markers for cloned plasmids. Enhanced green fluorescent protein (eGFP) in the pET16b vector under the IPTG-inducible Lac promoter system was C-terminally tagged with the E. coli (Ec) tag through site directed mutagenesis: overlap PCR primers were designed which contained the Ec tag sequence. These were phosphorylated and used for PCR with the plasmid backbone. The product was digested with DpnI (NEB, R0176S) overnight, and the resulting product was PCR purified. A ligation was carried out to circularize the vector, using 10–50 ng of DNA and T4 DNA ligase (Thermo Fisher, EL0011), according to the manufacturer’s instructions. The resulting plasmid was transformed into competent DH5α cells. The M. florum tag was codon optimized for expression in E. coli by selecting the most highly abundant codons in E. coli for the corresponding amino acids (codon sequence: GCT GCA AAC AAG AAC GAG GAA AAC ACC AAC GAA GTA CCG ACC TTC ATG CTG AAC GCA GGC CAG GCT AAC TAT GCA TTC GCA), and GFP was C-terminally tagged with the Mf tag using a digest-and-ligate approach: oligonucleotides were designed to contain the Mf tag sequence, annealed to create double-stranded DNA fragments, and then phosphorylated. The pET16b-eGFP vector was digested with fast digest BsrgI and XhoI (NEB, R0102S and R0146S) and used in a ligation reaction with the insets (3:1 ratio) using T4 DNA ligase (Thermo Fisher, EL0011), at 22 °C for 4–6 h. Competent DH5α cells were transformed with the resulting product. The M. florum Lon protease from the pBAD33 vector (a gift from Robert Sauer; Addgene plasmid 21867) was subcloned into the pSB3C5 plasmid under the araBAD promoter using Golden Gate assembly. The primers for the PCR reaction were designed to flank the Mf-Lon with BsmBI restriction sites and include them into the vector (pSB3C5). The Golden Gate assembly reaction was set up, which included insert/vector in a 4:1 ratio, BsmBI (NEB, R0739S), and 1 μL of T4 DNA Ligase (Thermo Fisher, EL0011). The following conditions were used for the reaction: 60 cycles of 42 °C for 3 min, then 16 °C for 4 min, followed by 50 °C for 5 min and 80 °C for 5 min. The resulting product was transformed into E. coli DH5α cells. The pAN3938 plasmid encoding the 0xF6 genetic logic circuit[26] was a gift from Christopher Voigt, and electrocompetent BL21 cells were cotransformed with either the GFP-Ec or GFP plasmids, and the pAN3938 plasmid.

Proteolysis Tag Activity Assays

Overnight cultures of BL21(DE3) or BL21(DE3) star cells transformed with pET16b-eGFP, pET16b-eGFP-Ec, or pET16b-eGFP-Mf and pSB3C5-mfLon were grown for 12–16 h at 37 °C 250 rpm and then resuspended in minimal media with appropriate antibiotics for selection. The cultures were grown to an OD600 of 0.4–0.6, then induced with 0.5 mM IPTG (Roth, 2316.3) or 0.2% (w/v) arabinose (Roth, 5118.2). Glucose (1%) was added to the cultures expressing untagged proteins to prevent basal expression from the pET16b vector.[48] For degradation assays, cells were pelleted 5 h after induction, washed twice in 1X phosphate-buffered saline (PBS) (137 mM NaCl, 2.7 mM KCl, 10 mM Na2HPO4, 1.8 mM KH2PO4), and then resuspended in minimal medium containing the relevant antibiotics, without IPTG. In cultures cotransformed with two plasmids, IPTG induction was stopped, but the second inducer, 0.2% (w/v) arabinose, was added to the medium to induce expression of Mf-Lon. Aliquots of 200 μL of the cultures were grown in a 96-well flat-bottom black plate with clear bottom (Corning, Sigma-Aldrich CLS3603-48EA) at 37 °C with orbital shaking in a multimode microplate reader (Tecan Spark). Optical density (at 600 nm) and fluorescence measurements (excitation and emission wavelengths of 472 and 517 nm, respectively, with a gain of 50) were recorded at discrete intervals. Fluorescence was normalized to the OD600 value (a.f.u./OD600). Untransformed BL21(DE3) cells were used as a negative control, and their normalized autofluorescence values (a.f.u/OD600) were subtracted from the normalized fluorescence values (a.f.u./OD600) of the cells in different conditions.

Auxotrophic Strain and Starvation Assays

The RF10 (ΔlysA) and ML17 strains (ΔglnA) (a gift from Robert Gennis and Toshio Iwasaki Addgene plasmids 62076 and 61912) were transformed with the plasmids developed in this work and grown in LB to an OD600 of ∼0.3. For cells transformed with GFP-nt and GFP-Ec, GFP expression was induced with 0.5 mM IPTG for 1 h. For cells transformed with GFP-Mf and Mf-Lon, or GFP-nt and Mf-Lon, GFP expression was induced with 0.5 mM IPTG and Lon expression induced with 0.2% (w/v) arabinose for 1 h. After this, cells were pelleted, washed in 1X PBS, and resuspended in minimal medium containing appropriate antibiotics for plasmid selection with additional 0.5 mM IPTG to maintain GFP expression, or 0.5 mM IPTG and 0.2% (w/v) arabinose to maintain GFP and Mf-Lon expression. Additionally, 7 or 10 mM of lysine (Sigma-Aldrich, L5501) or glutamine (Serva, 22942) was added to positive control samples. The OD600 value and fluorescence were then measured as described above using a microplate reader every 10 min over 12 h.

Data Analysis

Python version 3.9.5 and packages matplotlib version 3.3.2, NumPy version 1.19.2, and SciPy version 0.13 were used to fit the degradation data to a first order decay function of the form, N(t) = 100e – λt, where N(t) is the percentage of remaining fluorescence at time t post the start of the degradation curve, and λ is the decay constant. The half-live of GFP was then given by t1/2 = ln(2)/λ. When investigating the dynamics of the Mf-tag system, the rate of GFP production (F) was calculated as the gradient of fluorescence values normalized to OD600 between 3 and 7 h after induction (Figure S6). To obtain values for the growth rate of cells expressing tagged or untagged GFP, the slope of the linear fit to log transformed OD600 values was calculated when cells were in their exponential growth phase, between 5 and 9 h (Figures –5), or 1.5 and 2.5 h (Figure S5).[49] It is of note to consider that the cells do not always show standard growth curves with distinct growth phases, due to confounding factors such as the auxotrophic strains used in the experiments, nutrient stress placed on the cells, as well as the expression burden. Therefore, the exponential growth phase of the cells is not always distinct. The percent drop in growth rate of cells expressing recombinant proteins used growth rate values calculated between 1 and 4 h growth (Figure S7). To compare whether the growth rates of the auxotrophic strains were statistically significantly different, a 2-sample t-test was used. p < 0.05 was considered as statistically significant. The statistical analysis was performed, and all plots and slopes of best fit were generated, using OriginLab Pro software (2019 version 64 bit).

Model Parametrization and Simulation

Parameters for the model of resource allocation and use were selected based on the assumption that an external resource concentration of N = 1 would lead to a realistic cell doubling time (∼25 min) and that internal cellular rates would have biologically feasible relative values. This resulted in simulations with heterologous protein recycling present being simulated with parameters r = 0.015, r = 0.02, r = 0.2, r = 0.01, and with μ = 0.1 × P. In all simulations, initial conditions for all states were set to 0, and the dynamics were simulated for 500 min with N = 1.0 for the system to reach a steady state before any environmental fluctuations occurred. The model was simulated using Python version 3.9, and the SciPy version was 0.13. The code for all simulations is available as Supplementary Data 1.

40 in total

Improving the Robustness of Engineered Bacteria to Nutrient Stress Using Programmed Proteolysis.

Introduction

Results

Assessing the Proteolytic Activities of E. coli and M. florum SsrA Tags

Dynamic and Targeted Control of Protein Degradation Using the M. florum SsrA System

Recovering Cell Growth by Amino Acid Recycling

Buffering Auxotrophic Cells from Environmental Amino Acid Fluctuations

Discussion

Materials and Methods

Bacterial Strains, Media, and Cloning

Proteolysis Tag Activity Assays

Auxotrophic Strain and Starvation Assays

Data Analysis

Model Parametrization and Simulation

1. Dynamics of substrate denaturation and translocation by the ClpXP degradation machine.

Review 2. Proteolysis in bacterial regulatory circuits.

3. Growth rates made easy.

4. New unstable variants of green fluorescent protein for studies of transient gene expression in bacteria.

5. Quantifying cellular capacity identifies gene expression designs with reduced burden.

6. The ClpXP and ClpAP proteases degrade proteins with carboxy-terminal peptide tails added by the SsrA-tagging system.

Review 7. Cellular Biosensors with Engineered Genetic Circuits.

8. Cytoplasmic degradation of ssrA-tagged proteins.

Review 9. Proteases and protein degradation in Escherichia coli.

10. 'Deadman' and 'Passcode' microbial kill switches for bacterial containment.

Review 1. Bacterial degrons in synthetic circuits.