Literature DB >> 35729740

Control-Based Continuation: A New Approach to Prototype Synthetic Gene Networks.

Irene de Cesare^1,2, Davide Salzano^1,2, Mario di Bernardo², Ludovic Renson³, Lucia Marucci^1,4,5.

Abstract

Control-Based Continuation (CBC) is a general and systematic method to carry out the bifurcation analysis of physical experiments. CBC does not rely on a mathematical model and thus overcomes the uncertainty introduced when identifying bifurcation curves indirectly through modeling and parameter estimation. We demonstrate, in silico, CBC applicability to biochemical processes by tracking the equilibrium curve of a toggle switch, which includes additive process noise and exhibits bistability. We compare the results obtained when CBC uses a model-free and model-based control strategy and show that both can track stable and unstable solutions, revealing bistability. We then demonstrate CBC in conditions more representative of an in vivo experiment using an agent-based simulator describing cell growth and division, cell-to-cell variability, spatial distribution, and diffusion of chemicals. We further show how the identified curves can be used for parameter estimation and discuss how CBC can significantly accelerate the prototyping of synthetic gene regulatory networks.

Entities: Chemical

Keywords: bifurcations; control-based continuation; synthetic biology; toggle switch

Mesh：

Year: 2022 PMID： 35729740 PMCID： PMC9295158 DOI： 10.1021/acssynbio.1c00632

Source DB: PubMed Journal: ACS Synth Biol ISSN： 2161-5063 Impact factor: 5.249

The complexity of biological systems is widely acknowledged. In native organisms, multiscale intracellular interactions often result in complex nonlinear dynamics. Consider, for example, switches and oscillations in gene expression, which are used by cells to process external inputs and program specific cell outputs. Synthetic biology aims at engineering biological computation by recreating such dynamics using circuits embedded into living cells.[1−10] Mathematical modeling is widely used within synthetic biology design–build–test–learn cycles. In the context of engineered gene regulatory networks, models can both support the design phase (indicating the parameter space that allows the emergence of the desired dynamics, such as oscillations), and the testing upon experimental implementation. Moreover, modeling is currently the only way in synthetic biology to study the relationship between physical parameters variations and bifurcations; the latter represent stability boundaries where qualitative and quantitative changes to the system’s dynamics occur. For instance, saddle-node and Hopf bifurcations (responsible for hysteresis and oscillatory behaviors, respectively) that are commonly observed in native biological systems[11] can only be studied by nonlinear model analysis. This means that, first, a mathematical model needs to be derived and fitted to often ad hoc generated experimental data. Then, nonlinear model analysis can be performed to identify the bifurcation behavior being observed. The derivation of biochemical models can however be challenging, both in terms of model structure (which depends on the underlying hypotheses on the system), and parameter identification and validation (which can be troublesome in the case of incomplete/noisy experimental data sets). Model uncertainties will inevitably result in misleading conclusions.[12,13] As a consequence, the design and testing of engineering synthetic biochemical circuits that perform as intended is extremely difficult, unless various design, build, test and learn iterations are performed.[14] Control-based continuation (CBC), originally proposed by Sieber and Krauskopf,[15] is a general, systematic and model-free testing method that applies the principles of numerical continuation (a computational method for bifurcation analysis) to physical experiments. The fundamental principles of CBC are well established and the method has been applied to a wide range of mechanical systems. For instance, Barton et al.[16] studied the periodic responses of a bistable energy harvester; a similar system was studied by Renson et al.[17] The continuation method was also demonstrated on a multi-degree-of-freedom structure exhibiting isolated curves of periodic responses and quasi-periodic oscillations.[18] CBC exploits feedback control to explore the nonlinear dynamics of a system, detect bifurcations, and eventually trace their evolution with respect to adjustable parameters directly during experimental tests. Recently, external feedback controllers have been exploited for controlling gene expression in living cells by means of microfluidics/microscopy or optogenetics platforms.[19−27] From a control theory standpoint, genetic networks are the processes to be controlled, while the controller is implemented on a computer. Biosensors are used to measure the state of the process, usually by means of fluorescent reporter proteins. The fluorescence evaluation can be done either at single-cell or at cell population level; such measurement is fed back to a control algorithm that evaluates the appropriate input aiming to steer the process to a selected reference signal. Inputs are then actuated on the process via light or chemical compounds.[28] By employing external feedback controllers to steer gene expression, it should be possible to apply CBC to track nonlinear dynamics in biochemical systems, overcoming the need for a model identification step and defining a shorter way for bifurcation tracking that includes the isolation of unstable equilibria (Figure A).

Figure 1

(A) A schematic Toggle Switch from Lugagne et al.[6] Two different paths to the bifurcation diagram (- -): numerical continuation (path 1), and control-based continuation (path 2). Unstable branch of the bifurcation curve (—) and bifurcation points (*) are highlighted. (B) Steps of the CBC algorithm: at time t = 124 h the control pushes the system’s trajectory TetR(IPTG) (—) toward the new reference TetR* (blue line - -). At time t = 144 h transient dynamics are extinguished and a point is saved (●). Then, the reference signal is increased and the process is repeated. Here, we demonstrate, for the first time, the applicability of CBC to prototype the dynamics of a synthetic gene regulatory network and to estimate the parameters of a fixed model structure using data collected after the CBC routine. We use as a test bed the toggle switch, a bistable biological circuit, first embedded in Escherichia coli cells by Gardner et al.,[10] which is often used to benchmark new control strategies and is considered a fundamental tool for cell computation.[11,29] Specifically, we run in silico experiments on a recent version of the toggle switch, for which external feedback control was shown to be successful.[6] Moreover, we leveraged an agent-based simulator called BSim[30] to test the performance of the designed algorithms in conditions more representative of an in vivo experiment. Our in silico demonstration of CBC gives us the confidence that the method should be exploitable in vivo to fully explore the parameter space that allows the emergence of the desired dynamics (in our example, hysteresis), leading to rapid and model-free characterization, and also parameter estimation, of engineered synthetic modules.

Results and Discussion

CBC retrieves stable and unstable invariant solutions of a dynamical system through the application of an external control action. In order to acquire the bifurcation diagram, the controller should not modify the position in the parameter space of the uncontrolled system’s invariant solutions. For example, the equilibria of a controlled system are in general different from the one of the uncontrolled system and, to recover the response of the underlying uncontrolled system of interest, CBC seeks a control signal that decays asymptotically to zero, i.e.,where is the system’s state, is the control reference signal, and t is time. Although the control signal is asymptotically converging to zero, it is synthesized in order to stabilize the local dynamics of the system such that unstable equilibria become stable and hence detectable forward in time. A control strategy that satisfies eq is called noninvasive and does not modify the system’s equilibrium positions in the parameter space. Finding a noninvasive control signal usually requires finding the “right” reference input (xref) for the controller such that eq is satisfied, while the solution of interest is stabilized. When the control input can be chosen as the bifurcation parameter of interest, the methodology can be significantly simplified as the control signal is only required to settle to a constant value. Indeed, when this condition is achieved, the nonzero constant control signal can be viewed as a mere shift in the bifurcation parameter value. This simplified method is used in this paper, and more details about its implementation can be found in the Methods section. Furthermore, CBC can also be extended to characterize systems exhibiting oscillations and other bifurcations as described in the literature.[17,18,31,32] The applicability of CBC is demonstrated here in silico on a synthetic gene network, the toggle switch (Figure ). The mathematical model developed by Lugagne et al.[6] is used. From a control system standpoint, it is a 2-input (aTc and IPTG) 2-output (LacI and TetR) nonlinear dynamical system. The inputs correspond to drugs that can be added to the cell culture environment, while the outputs are proteins synthesized by cells and detected by tagging them with fluorescent reporters. Here the control signal (specifically IPTG) plays a dual role: it not only pushes the system into a new state, but also it keeps the system stable, allowing it to explore the nearby unstable dynamics that would not be collected otherwise (see the red branch of the bifurcation diagram in Figure A). The model derivation and further information about the network can be found in the Methods section. Figure B illustrates an experiment conducted with CBC; data points collected with CBC (blue points) are compared to the actual bifurcation curve obtained using standard model-based numerical continuation algorithms (gray dashed line). First, a particular control reference value (xref = TetR*) is selected (dashed-blue line). The reference signal is compared to the current expression of the gene of interest to compute a control error. This is fed to the controller, which evaluates a control signal (IPTG concentration), translated into an input to the toggle switch that will therefore change its state (the red line in Figure B shows the system transient dynamics). At every sampling time, a new measure is acquired and the entire process is repeated until a steady state value is reached. The steady state value of the state variables, together with the associated control signal, are then saved to define a point in the bifurcation diagram. Subsequently, a new reference value is picked (see Methods) and the entire process is repeated. To trace out the entire equilibrium curve, the continuation algorithm requires a set of control reference values, broadly covering the range of gene expression of interest (in our case the TetR fluorescence expression). CBC is a model-free method because it does not require the knowledge of a mathematical model of the system to be applied to, and the results’ accuracy does not depend on the structure and parameter values of a model. Furthermore, the controller within CBC is only required to be stabilizing and noninvasive. As such, CBC is not tied to a specific type of feedback controller, and control laws that do not require a mathematical model of the system can be used. However, the design, parameter tuning, and overall performance of the controller can be improved if some knowledge (like a model) of the system dynamics is available. Here, the use of a model-free proportional (P) controller and a model-predictive controller (MPC) is compared. The former controller was chosen as it is widely used in CBC applications, while the latter because it is commonly exploited in synthetic biology.[20] When using Model Predictive control strategies, simple linear models work well, showing that detailed mathematical representations are not needed for CBC to work. A sampling time of 5 min is considered in this work; this is a realistic time interval that we previously used to image and externally control bacterial cells.[22,25] Note that with the simplified CBC approach, the controller is not required to take the control error to zero, but only to stabilize the system to a nearby equilibrium point. This does not affect the accuracy of results, as further explained in the Methods.

CBC Can Reconstruct the Toggle Switch Bifurcation Diagram under Deterministic Simulation Settings

We first applied CBC on the deterministic model of the toggle switch. During the in silico experiment, 30 steady state points were acquired by varying the reference input to the control strategy between TetR* = 1800 [a.u.] (or 1200 [a.u.]) and TetR* = 0 [a.u.]. The maximum duration of a single equilibrium acquisition was constrained to 9 h and 55 min, after which the control reference was modified. The steady states values were computed by taking the average of the samples recorded in the last 60 min (12 samples) of the simulation carried out for each value of the reference signal. Averaging is not essential in a deterministic scenario, but becomes fundamental in a noisy environment as a way to filter out some noise. Points collected with the CBC perfectly overlap with the numerical continuation diagram (Figure A and 3A). Both control strategies have comparable tracking performances and proved able to stabilize the unstable branch of equilibria delimited by the Saddle-Node bifurcations, capturing the whole bifurcation curve.

Figure 2

Figure 3

CBC with MPC applied to the deterministic toggle switch model eq . (A) Equilibrium curve measured using CBC (●). TetR(IPTG) transient trajectories (···). Reference equilibrium curve obtained using numerical continuation (- -). (B) Time evolution of TetR and the control reference signal TetR*. (C) Time evolution of IPTG (i.e., control signal). Parameter values: γ = 0.3 (eq ) and aTc = 25 ng mL–1.

CBC with P controller applied to the deterministic toggle switch model eq . (A) Equilibrium curve measured using CBC (●). TetR(IPTG) transient trajectories (···). Reference equilibrium curve obtained using numerical continuation (- -). (B) Time evolution of TetR and the control reference signal TetR*. (C) Time evolution of IPTG (i.e., control signal). Parameter values: k = 0.0016 and aTc = 25 ng mL–1. CBC with MPC applied to the deterministic toggle switch model eq . (A) Equilibrium curve measured using CBC (●). TetR(IPTG) transient trajectories (···). Reference equilibrium curve obtained using numerical continuation (- -). (B) Time evolution of TetR and the control reference signal TetR*. (C) Time evolution of IPTG (i.e., control signal). Parameter values: γ = 0.3 (eq ) and aTc = 25 ng mL–1. The proportional control signal is linearly dependent on the error signal. Every time the reference is modified (Figure B), the error changes and consequently the control signal as well changes (Figure C). As the controller does not include an integral action, the error never nullifies, but it becomes constant once the system reaches the equilibrium. When this happens, the control signal and bifurcation parameter IPTG become constant as well (because the proportional contribution is constant), and the steady state can be collected. With MPC, the control action is computed as the optimal signal that minimizes the error, and therefore its contribution pushes the system’s trajectories toward the reference signal, generating some early oscillations (Figure B,C). As the bifurcation parameter is not directly proportional to the control error, the latter does not have to be different from zero. The steady state error introduced by the MPC algorithm is much lower than the proportional one (see Figure B and Figure B); thus, it is easier to define the range of reference values for the MPC as it corresponds to the effective dynamical range explored by the continuation algorithm. Instead, when using a proportional controller, the range of the control reference may need to be varied significantly (as in Figure B, where the maximum reference was set to TetR* = 1800 [a.u.] in order for the algorithm to get a full coverage of the bifurcation diagram). This should not be considered as a fault of the proportional algorithm, but only as a difference between the two control strategies. Results in Figure and Figure are also represented in Supplementary Movies S1 and S2. To decrease the total time of each simulation we also implemented a steady state detection routine that changes the reference each time the system reaches steady state (Methods). This, together with a smaller number of collected points, allowed to decrease the experimental time to approximately 55 h for both the controllers, see Figure S1 and S2.

CBC Can Reconstruct the Toggle Switch Bifurcation Diagram under a Stochastic Simulation Setting

CBC is now demonstrated on a stochastic toggle switch model to reproduce conditions more similar to an in vivo experiment. As noise can strongly modify results of a single continuation experiment, we consider 10 repetitions of the same experiment. In each we collect 30 points, with the steady state values computed as described for the ODE case. The resulting plot is a density map made of all the collected steady states over the 10 simulations, that we can compare with the numerically computed bifurcation diagram (Figure A and 5A). The results show dense clouds of points falling in the area where the branch of equilibria is, confirming the ability of the strategy to work also in a stochastic setting: both controllers were able to track the unstable branch of equilibria, preventing the system from jumping between the two stable steady states. For simplicity, we only show one out of 10 computed trajectories of TetR and IPTG in Figure B,C and 5B,C.

Figure 4

Figure 5

CBC with MPC applied to the stochastic toggle switch model eq . (A) Density plot of equilibrium curve measured using CBC. Reference equilibrium curve obtained using numerical continuation (- -). (B) Time evolution of one simulation of TetR and the control reference signal TetR*. (C) Time evolution of IPTG (i.e., control signal). Parameter values: γ = 0.3 (eq ) and aTc = 25 ng mL–1.

CBC with P controller applied to the stochastic toggle switch model eq . (A) Density plot of equilibrium curve measured using CBC. Reference equilibrium curve obtained using numerical continuation (- -). (B) Time evolution of one simulation of TetR and the control reference signal TetR* (—). (C) Time evolution of IPTG (i.e., control signal). Parameter values: k = 0.0016 and aTc = 25 ng mL–1. CBC with MPC applied to the stochastic toggle switch model eq . (A) Density plot of equilibrium curve measured using CBC. Reference equilibrium curve obtained using numerical continuation (- -). (B) Time evolution of one simulation of TetR and the control reference signal TetR*. (C) Time evolution of IPTG (i.e., control signal). Parameter values: γ = 0.3 (eq ) and aTc = 25 ng mL–1. For both the proportional controller and MPC, the output signal shows oscillations due to noise, which also affect the control signal u. However, these oscillations do not seem to strongly alter the average values we take as steady state points. Similar observations to the deterministic case can be made. For the proportional controller, the error does not nullify and the reference signal is chosen in a wider range of values than for the MPC in order to uncover the full bifurcation curve (Figure B). As in the deterministic scenario, the simulation time can be drastically decreased considering a variable step time associated with the computation of the steady states and a reduced amount of collected points. The resulting bifurcation curves can be seen in Figure S3A and S4A, while panels B and C show just one representative trajectory of TetR and IPTG, respectively. Furthermore, for the stochastic scenario, we also considered the case of a steady state check with the full amount of points (30). We found that the reference shift guided by steady state detection could only reduce the total time to 228 h for a proportional controller and 233 h in case of MPC algorithm (see Figure S5 and S6), i.e., a 23.4% and a 21.8% reduction, respectively. Reducing the total testing time requires to decrease the number of points collected, which can be achieved without losing accuracy in the diagram retrieved as shown in Figures S3 and S4. Nonetheless, the present method could be improved by developing an algorithm for automatic reference stepping with a variable number of points collected.

Single-Cell CBC Using Agent-Based Simulations

To further prove the applicability of CBC to synthetic biology, we validated the method in BSim, a Java-based bacteria simulator,[30,33] where it is possible to reproduce a microfluidics experiment including a mathematical representation of movement, growth and division of cells as well as spatial distribution and diffusion of chemicals. We implemented single cell control considering the same model used for the stochastic scenario (eq ). Simulation settings (i.e., sampling time, number of collected points, gains, etc.) were kept the same as above, with minor differences due to changes in the programming language. In Figures B, 7B, and Supplementary Movies S3 and S4 representative examples of TetR and IPTG trajectories are shown. The bifurcation curves that are obtained, shown in Figure A and 7A, are comparable with previous results obtained on the stochastic model. However, we observed worse control performances for the proportional controller in BSim. This is mainly due to additional factors introduced in simulations. Specifically, the cells biomechanics and their flush out from the microfluidic chamber are simulated, as well as the chemicals’ spatial distribution and diffusion that introduce additional delay in the control input delivery. Also, in BSim we explicitly simulate an actuation delay due to the time the media takes to be delivered to the cells within the microfluidic device. All these factors might contribute to the performances exhibited by the proportional controller. A similar result when using more realistic simulation environments was also found in ref (34), where the authors showed that the performance of a PI controller considerably deteriorated when the algorithm was tested on an agent based model, while an MPC algorithm was able to maintain similar performance. Further information about BSim can be found in the Methods section.

Figure 6

Figure 7

CBC with MPC applied to the stochastic toggle switch model eq in BSim. (A) Density plot of equilibrium curve measured using CBC. Reference equilibrium curve obtained using numerical continuation (- -). (B) Time evolution of one simulation of TetR and the control reference signal TetR*. (C) Time evolution of IPTG (i.e., control signal). Parameter values: γ = 0.3 (eq ) and aTc = 25 ng mL–1.

CBC with P controller applied to the stochastic toggle switch model eq in BSim. (A) Density plot of equilibrium curve measured using CBC. Reference equilibrium curve obtained using numerical continuation (- -). (B) Time evolution of one simulation of TetR and the control reference signal TetR*. (C) Time evolution of IPTG (i.e., control signal). Parameter values: k = 0.0016 and aTc = 25 ng mL–1. CBC with MPC applied to the stochastic toggle switch model eq in BSim. (A) Density plot of equilibrium curve measured using CBC. Reference equilibrium curve obtained using numerical continuation (- -). (B) Time evolution of one simulation of TetR and the control reference signal TetR*. (C) Time evolution of IPTG (i.e., control signal). Parameter values: γ = 0.3 (eq ) and aTc = 25 ng mL–1. Results with reduced points and steady state detection obtained in BSim (Figure S7 and Figure S8) were similar to short stochastic simulations in Matlab. Here, the increased complexity of simulations takes into account many sources of noise (such as spatial diffusion of chemicals); for this reason, results are more scattered around the numerically computed bifurcation diagram (see Figure S7A and Figure S8A). Nonetheless, implemented controllers manage to stabilize trajectories along the unstable branch of equilibria. Considering the stochasticity of the experiments, there might be variability in the amount of time needed to uncover the bifurcation curve but, in general, it can be reduced up to less than a third of the original simulations.

Model Calibration via Data Collected Using CBC in a Noisy Scenario

Methods for parameter identification in nonlinear systems often rely on optimization routines: given a model, parameter identification allows to calibrate its output in order to match experimental data (refer to the literature[35,36] for further information). Using the approach by Beregi et al.,[32] we show how the equilibria measured using CBC can be exploited to estimate the toggle switch model parameters, and how data corresponding to unstable equilibria help in retrieving a model that captures the bistable behavior of the system. Further information about the identification process can be found in the Methods. The equilibrium curves obtained from parameters identified using CBC data are shown in Figure S9D for the deterministic scenario and in Figure S9E,F for the stochastic scenario (in green and red data associated with the proportional controller and with the MPC, respectively). For the sake of comparison, model parameters were also estimated from points collected using a traditional, open-loop parameter sweep where different levels of input parameter (IPTG) were tested for both noise-free and noisy scenarios (Figure S9A and B,C, respectively). The equilibrium curve obtained in this way generally does not reproduce the bistable behavior of the system, except slightly in the ideal case of the deterministic simulations (Figure S9A). On the other hand, data collected with CBC still capture bistability even in stochastic simulations (Figure S9E,F), although without being able to reproduce the original numerical bifurcation curve. Parameter estimation results show that the use of data including unstable equilibria helps to estimate parameters that reproduce bistability. As such, it is thought that the MPC control approach is preferable compared to the proportional control as it offers greater control over the distribution of data points along the equilibrium curve. Estimated parameters are shown in Table S1 for all different collected data. Interestingly, the variation of some parameters does not significantly affect the resulting fitted curve. Take, for example, the red line in Figure S9D, which is in excellent agreement with the reference equilibrium curve but was calculated with some identified parameters values being up to 41% different from their nominal value, hinting to nonidentifiability of some parameters. We confirmed this by performing a structural identifiability analysis on the system, using the STRIKE-GOLDD toolbox,[37] which showed unidentifiability of 9 parameters out of 14 when we considered two measurable outputs (LacI and TetR), one known control input (IPTG), and one other input as a fixed constant (aTc). For further reference, structural unidentifiability of the 2 ODEs toggle switch system for fixed inputs was also highlighted by Villaverde et al.[38] We note that bifurcation curves estimated from sweeps can be improved to reproduce bistability if additional knowledge about the presence of horizontal asymptotes for the stable branches is introduced in the cost function used for parameter estimation (Figure S10A–C). However, obtaining qualitatively different results for similar estimation approaches would typically lead to poor confidence in the results. This modified approach is also not systematic, as the true behavior of the system is assumed to be known. Using this new approach with CBC data further improves the agreement between the reference and identified models (Figure S10D–F).

Concluding Remarks

This paper presents the first application, in silico, of CBC to biochemical systems. In particular, CBC can track the stable and unstable equilibria of a synthetic gene network behaving as a toggle switch. In the absence of noise, a perfect agreement between equilibrium curves measured using CBC and standard numerical continuation methods is found. A notable challenge in biological applications, as compared to the mechanical systems studied so far with CBC, is the significant presence of (process) noise. Our results on the stochastic models (Matlab and Bsim) clearly show that CBC is still able to uncover the bistable nature of the system, as recorded data points qualitatively follow the equilibrium curve of the underlying deterministic model. Besides exploring the nonlinear dynamics of the physical system directly in the experiment, CBC provides informative data for parameter estimation. Our results show that the measured data, especially those falling in the unstable region, was critical for the identification of model parameters that can reproduce bistability. CBC appears therefore as a valuable tool for uncovering the dynamics of potentially more complex synthetic gene networks even in the presence of significant levels of noise. CBC can also be used to characterize oscillations[39] and other bifurcations.[17,40] The information gathered with CBC has the potential to enable a more accurate understanding of biochemical interactions and thus a more precise prototyping of those behaviors into novel synthetic gene circuits. An important feature of CBC is that it is a model-free method, since it does not require a model of the system to work, and the accuracy of the results is independent from any modeling assumption. CBC also does not rely on a particular control strategy, provided the controller can be made noninvasive and is able to stabilize unstable trajectories. In this regard, the use of a mathematical model (even approximate) can improve the control performance. For instance, the initial estimate of a proportional controller gain can be obtained based on a model and then further refined by trial and error during experiments. We also showed that a model-predictive controller, based on a simple linear model, can provide a smaller steady-state control error, resulting in better discretization of the equilibrium curve and beneficial for parameter estimation. MPC is a controller broadly used in biology,[20] which generally performs best for fast varying references.[41] We hope that the use of MPC in the present paper paves the way for a wide range of applications of CBC in systems and synthetic biology.

Future Developments

In future studies we aim to apply the strategy in vivo, using the microfluidic experimental setup already described in the literature,[24,20,22] which proved to be successful for control experiments involving both mammalian and bacterial cells. Several challenges will have to be addressed to enable a wider uptake of CBC: Noise. The presence of noise can make stabilization more difficult, as observed here and also highlighted by Lugagne et al.[6] Moreover, the presence of noise affects the noninvasiveness associated with the control input. Indeed, noise introduces variations in the control signal that cannot be suppressed completely. While this appears to have a mild effect on the system considered here, this is arguably not the case in general. As of now, some noise-related effects can be compensated by considering signal processing techniques, but more rigorous methods are required. Control. As of now, there are no general and systematic methods to design the controller used in CBC. Currently, control parameters are found by trial and error. While this works for simple systems, this approach does not scale to more complex systems where more sophisticated control strategies including, for instance, multiple inputs and outputs, will be needed. A step in that direction has recently been made by Li and Dankowicz,[42] who proposed adaptive control design strategies for CBC. Experimental time. Our initial simulations were particularly long (298 h in Figures –7). While such experiments can be achieved with some experimental set-ups—Balagaddé et al.[43] performed experiments on an Escherichia coli prey–predator system that lasted between 200 and 500 h using a microchemostat platform—this is currently not possible using microscopy/microfluidic platform.[6,22] It is in principle possible to significantly reduce testing time by improving the performance of the controller to reduce the transient. However, this can prove difficult due to the significant presence of noise in biochemical systems. Nonetheless, we showed that, for the presented gene network, it is possible to adjust the number of recorded points and CBC parameters to significantly reduce experimental time (2–3 days in the deterministic case, and 3–4 days in the stochastic case) while preserving a fine discretization of the equilibrium curve.

Methods

Model of the Toggle Switch

The genetic toggle switch is a well-characterized bistable biological circuit, first embedded in Escherichia coli by Gardner et al.[10] The system consists of two repressors (LacI and TetR), which mutually repress each other, and two inducers (aTc and IPTG), which can externally tune the genes’ production (Figure A). In the absence of inducers, the circuit exhibits two stable equilibria. Levels of LacI and TetR can be measured via fluorescent reporters: for example, mKate2(RFP) and mEGFP(GFP) can be used to monitor LacI and TetR, respectively.[6] Here, we will refer to the Hill-type model developed by Lugagne et al.,[6] representing mRNA transcription, translation, and degradation/dilution due to growth. Mathematically, the system is described by a set of four ordinary differential equations:where the system states are given by the genes production LacI and TetR and the associated mRNAs (mRNA, mRNA), is the leakage transcription rate, is the transcription rate, the translation rate, the mRNA degradation rate, and is the protein degradation rate. Parameters are listed in Table .

Table 1

Parameters for the Toggle Switch Model

parameters	description	value
k_L^m0	Transcription rates (mRNA min^–1)	3.20 × 10^–2
k_T^m0	–	1.19 × 10^–1
k_L^m	–	8.30
k_T^m	–	2.06
k_L^p	Translation rates (a.u. mRNA^–1 min^–1)	9.726 × 10^–1
k_T^p	–	1.170
g_L^m	Degradation rates (min^–1)	1.386 × 10^–1
g_T^m	–	1.386 × 10^–1
g_L^p	–	1.65 × 10^–2
g_T^p	–	1.65 × 10^–2
θ_LacI	plac regulation by TetR (−)	31.94
η_LacI	–	2.00
θ_IPTG	–	9.06 × 10^–2
η_IPTG	–	2.00
θ_TetR	ptet regulation by LacI (−)	30.00
η_TetR	–	2.00
θ_aTc	–	11.65
η_aTc	–	2.00
k_IPTGⁱⁿ	IPTG exchange rate (min^–1)	2.75 × 10^–2
k_IPTG^out	–	1.11 × 10^–1
k_aTcⁱⁿ	aTc exchange rate (min^–1)	1.62 × 10^–1
k_aTc^out	–	2.00 × 10^–2

The authors assumed that the repressors could be modeled with Hill function (h(x, θ, η) = 1/(1 + x/θ)η), where θ represents a threshold parameter for the Hill function and η is the Hill coefficient. Two more equations were added to take into account inducers diffusion through the cell membrane. Formally:The deterministic version of the model does not capture the intrinsic stochasticity of biochemical processes. A more accurate description is provided by the stochastic modeling procedure, on the basis of the pseudoreactions listed in Table .

Table 2

Pseudoreactionsa

description	pseudoreaction
transcription

translation

degradation/dilution

Here and are gene regulation functions, and h(x, θ, η) = 1/(1 + x/θ)η is the decreasing Hill function. The other arrow superscripts are parameters that can be found in Table .

Here and are gene regulation functions, and h(x, θ, η) = 1/(1 + x/θ)η is the decreasing Hill function. The other arrow superscripts are parameters that can be found in Table . Specifically, we employed a SDE based model for the description of biochemical systems:[44]Here: is the stoichiometry matrix where each row is associated with a state variable and each column i includes stoichiometry coefficients associated with the reaction i. is a vector where each element i corresponds to a propensity function. The latter is a function describing the probability of a certain reaction given the concentration of the chemical species.[45] is a vector containing the Wiener Process increments. Pseudoreactions and propensity functions were all given in ref (6), and here are reported in Table . All simulations were performed using Matlab R2019b, unless stated otherwise. To solve the system eq we used the function ode45, while for the stochastic model in eq we implemented the Euler–Maruyama method,[44] which is known to be slightly less accurate than the Stochastic Simulation Algorithm,[45] but more computationally efficient. Throughout the paper we have assumed aTc to be fixed to 25 ng mL–1 for all simulations.

Control-Based Continuation Theory: Using Control to Track Equilibria of the Underlying Uncontrolled System

CBC seeks to define a bifurcation diagram by the application of a noninvasive control action, which does not modify the underlying uncontrolled system’s equilibria position in parameter space. To do so, the original method looks for a control action whose contribution vanishes asymptotically (eq ). However, the same noninvasiveness can be achieved with a slightly simplified method, provided that the control action enters the system as bifurcation parameter.[31] The fundamental principles of the simplified CBC method used in this paper can be explained using the scalar differential equation:where is the bifurcation parameter. Equation corresponds to the normal form of a fold (or saddle-node) bifurcation. The equilibria of the above system are given by the formula:To trace out the equilibrium curve of eq , including both stable and unstable equilibria, CBC relies on the presence of a stabilizing feedback controller. The equation of motion of the system including feedback control is given bywhere u is the control signal given by a suitable, i.e., stabilizing, control law. When the control signal and bifurcation parameter affect the system in the same way (as in eq ), the static component of the control signal can be interpreted as a shift of the bifurcation parameter μ. In this paper, a simple linear proportional (P) law will be considered such that u is given bywhere xref(t) is the control reference (or target) and Kp is the proportional gain. Using eq as control input, eq becomeswhich can be reordered asThus, we can write the equilibria of eq and the control signal at steady state asWriting xref as a function of the steady state control action u, we can substitute eq in eq to obtainSolving the squared term and simplifying the equation we can thus obtainEquation defines the same equilibrium curve as eq for a parameter . Therefore, the steady state contribution of the control signal does not have to be removed, and it can simply be considered as a shift in parameter space. The control strategy so defined is noninvasive (it does not modify the position of system’s equilibria in parameter space), and the requirement of eq can be translated into the following:which is equivalent to converging to a steady state value. Here we demonstrated how CBC can be applied to track the equilibria of a system by the application of a stabilizing proportional controller. However, the same results could be achieved with other control strategies as long as the controller stabilizes the system’s unstable equilibria. In fact, CBC is not restricted to a particular form of control, and thus the control law can in principle take a general form:

Main Steps of the Control-Based Continuation Algorithm

The algorithm applied for control-based continuation can be briefly summarized in the following steps: Set a new control reference. A reference value xref = TetR* is used to evaluate the error (the difference between the measured TetR and the reference target), and it is given to the controller. The range where the reference is picked normally covers the minimum and maximum expression of the protein of interest, but it can be modified according to the need (Figure B shows a maximum reference value of TetR* = 1800 [a.u.], although the maximum expression of TetR is 1200 [a.u.]). Compute the control action. Depending on the control strategy, either the MPC performs optimization and defines the optimal control input or the proportional controller evaluates the control action depending on the error value. Feed the control action to the system. The computed control action u is fed to the system for 5 min continuously, as it is considered to be the minimum sampling time for measurements and actuation in an hypothetical microfluidics/microscopy experimental setup. After 5 min, the system output is measured and a new error is computed. Steps 2 and 3 are repeated until the process is considered at steady state. Check for steady state. The initial implementation of the algorithm considers the system at steady state after a fixed amount of time (9 h and 55 min for experiments shown in the Results and Discussion section). The algorithm allows a variable time for steady state computation. In this case, to compute the steady state of TetR and IPTG, the slope of the linear curve fitting the last 12 samples after 3 h of simulation is calculated for both the error and the control signals (where the slope corresponds to the angle the curve makes with respect to the x-axis). If the computed value is below a user-defined tolerance, the system is assumed at steady state. Otherwise, the simulation continues and the steady state is checked every new sample, until the slope is sufficiently small. A threshold is set on both the error and the control slopes, which may vary between a deterministic and a stochastic simulation. Acquire the steady state. The steady state values of TetR and IPTG ( and ) are saved as the average value computed over the last 12 samples of each signal, and the algorithm goes back to step 1. All steps are repeated until enough steady states are collected, or until the range of output and parameter values has been covered. An initial guess is given as initial state for the algorithm to start with. A schematic of few steps can be seen in Figure B. Results of the variable stepping reference algorithm can be seen in Figures S1–S8 for both the deterministic and the stochastic scenarios.

Data Availability

The code used in this publication is available at: https://github.com/lrenson/cbc-synbio-paper, https://github.com/diBernardoGroup/Control-Based-Continuation-of-a-genetic-Toggle-Switch. Movies S1, S2, S3, and S4 are provided as Supporting Information.

Proportional Controller

The simplest controller that can satisfy CBC requirements is a proportional controller, formally described by the control law of eq . The error signal is computed by subtracting the measured output to the control reference (TetR* – TetR). The error is then fed to the controller to evaluate the control input IPTG given to the toggle switch system. A schematic of the control loop can be seen in Figure S11A. Despite its simplicity, the proportional controller has proved to be a solid choice for CBC. As long as the controller can stabilize the system, the control action is noninvasive, and thus the points collected through the CBC routine correspond to the equilibria of the underlying uncontrolled system. To tune the proportional gain Kp we linearized the system around an unstable equilibria and then we used the root locus to find the minimum gain able to stabilize the linearized system. Depending on the type of simulation, deterministic or stochastic, the Kp gain is then adjusted by trial and error.

Model Predictive Controller

Model predictive control is a control scheme based on two main features: prediction and optimization. At each step, a model reproducing the process behavior is used to predict the process outputs to given input signals. The input minimizing a cost function is computed using an optimization algorithm and fed to the controlled process. We chose a linear model to reproduce behaviors we expect from the real process. A further requirement for the prediction procedure is the process current state which, if not fully measurable, can be estimated using a Kalman filter. A schematic of the control loop can be seen in Figure S11B.

System Identification to Reproduce the Process Experimental Behavior through a Simplified Model

The MPC strategy requires a model to predict the process trajectories and carry out the optimization routine. We artificially reproduced the experimental behaviors using stochastic numerical simulations. Specifically, we measured the response of the system’s outputs (LacI and TetR) to different randomly generated pulses of aTc and IPTG, both with fixed and variable amplitude. This data set was used to identify a LTI system via noniterative subspace estimation algorithm, using the Matlab System Identification toolbox. The result was a continuous-time identified state space model of the following form:where x(t) is the state, u(t) is the input to the system, y(t) is the system output, d(t) is the state disturbance, and K is its associated matrix.[46] Here, A, B, C, and K are free parameters to estimate, while D was set to 0. Equation is called innovation form of the state-space model, and the matrix K corresponds to the Kalman gain matrix associated with the identified system.

Definition of the Kalman Predictor to Reconstruct Unmeasurable States

Considering the Toggle Switch system eq , we measure only LacI and TetR concentrations. To estimate the remaining states we opted for a Kalman filter, defined aswhere is the state estimate, is the output estimate, and all the matrices A, B, C, D, K correspond to those obtained through the identification process. Equation can be rewritten as follows: The addition of a correction based on the measured outputs LacI and TetR enables a better fit of the original process outputs.[47] The ability to predict the data recorded is evaluated using the percentage of the output variation that is reproduced by the model. This is defined aswhere y is the measured output, is the simulated predicted model output, and is the mean value of y. Fit percentages of LacI and TetR outputs for the system in eq are 96.4% and 92.2%, respectively.

Definition of a Cost Function to Compute an Optimal Control Action

To compute the optimal control action, we employed a genetic algorithm.[48] Commonly, MPC algorithms use quadratic cost functions weighting both the state of the system and the control action. Considering a scalar example, a quadratic cost function could be defined asHere, α and β correspond to weighting coefficients reflecting the relative importance of the error e and the control input u; N represents the prediction horizon, and i the current time. To adapt the cost function to CBC and guarantee noninvasiveness of the control input, we initially replaced the control input u by its variation Δu as follows:In this way, the optimization finds the best trade off between regulation accuracy and control variation. Although eq granted us good results in most of the simulations, the tuning of both α and β was difficult, as small changes into the CBC algorithm variables (such as the number of points collected during a single simulation) affected the quality of the results and demanded for retuning of the cost function parameters. To simplify the cost function, we set β = 0 and α = N – k + 1 and forced the optimizer to limit both the control input amplitude and variation at each iteration point. Formally we defined the following constraints:where uinit(t) is the control input during the first step of the algorithm, ucurr(t) is the control input for all other steps except for the first one, and is the control input steady state value, saved before stepping the reference in the algorithm. The presence of external constraints on the control action allows to have a noninvasive contribution without the need for the βΔu(i + k)2 term in the cost function. The parameter γ defines a percentage bandwidth that we consider acceptable for the control action to vary between. In other words, at each iteration of the CBC algorithm—by which we mean every time we change the reference signal—the control action cannot vary more than γ with respect to the previous registered steady state input. In our simulations, γ was set to 0.3 for 30 points (Figure , 5) and to 0.5 for 11 and 12 points (Figure S2, S4). Other settings for γ might be found in the figure captions. Fewer points result in larger control reference steps, which can be associated with bigger steps in the control input, and for this reason γ can be more easily calibrated just by considering the amount of points to be collected in a simulation.

Parameter Estimation Process

For the estimation process, we calculated the analytical solutions of our toggle switch model eq that is used to build the cost function for parameter estimation. In particular, we want to define the function:that has a unique real solution for each admissible value of TetR and for a chosen set of parameters Θ = (klm0, klm, θ, η, θ, η, ktm0, ktm, θ, η, θ, η, klp, ktp), which is the subject of the estimation process. The full eq is presented in the Supporting Information. Here we considered the parameters (gtm, glm, glp, gtp) to be fixed to the values computed in ref (6), and therefore we did not estimate them. Once we computed the analytical definition of g(TetR, Θ), we built the following cost function:where are the measured steady state values of control input and control output out of the CBC routine for the i point of the bifurcation curve. The k index looks for the estimated point with minimum distance (L2 norm) from the measured steady state considered by the index i. is normalized between 0 and 1 to be comparable with . Minimizing the cost function eq for all points collected during an experiment, it is possible to estimate the best set of parameters Θ that allows to characterize a model of the toggle switch reproducing the bistability behavior seen via experiments. Once a set of parameters is found (using a genetic algorithm), it is possible to build the bifurcation curve by solving eq for fixed values of TetR. Sets Θ that gave complex values of IPTG were directly discarded. Estimates of parameters for different experimental settings and percentage variation with respect to nominal values can be seen in Table S1. In order to validate the results of the estimation using data collected with the CBC algorithm, a comparison with the parameter sweeps method was made. Parameter sweeps is a simple experimental design to collect data at different values of a parameter of interest and is commonly used for parameter estimation. Considering the example of the toggle switch, parameter sweeps are implemented registering the steady state signal for fixed values of the input . Collected points fall on the stable branches of our bifurcation curve, as shown in Figure S9A−C. This method does not allow to collect unstable steady states, as there is no control over the system. We showed that CBC grants a more robust estimation by collecting data in the unstable branch, which is not possible through simple parameter sweeps, and allows to retrieve the bistability of the system (see Figure S9D,E,F). As highlighted in the main text, assuming an a priori knowledge of the system’s bistability, we could improve the performances of our calibration by releasing the constraint on sets Θ that generate complex values of IPTG. Furthermore, more complex cost function could be considered. Figure S10 shows this latter improved case, with very good fitting performances compared to the initial estimation. Although a noticeable improvement, the modified cost function requires a much longer computational time and knowledge of the system of interest’s dynamics, which is often not the case when CBC is applied experimentally.

Agent-Based Simulations

All agent based simulations were performed using BSim, an agent-based environment designed to simulate bacterial populations.[30,33] Here, cell biomechanics are explicitly modeled, together with cell growth and division. The spatial configuration and dynamics of the chemical inducers added in the culture environment are also simulated, introducing an extra time delay in the delivery of the inducer molecules to the cells. In addition, it is possible to mimic microfluidic-based experimental platforms, including related physical and technological constraints (e.g., dimensions and shape of microfluidic chambers, flush out of cells). Specifically, we can define the geometry and dimension of the microfluidic chamber, and the physical interactions of the cells with the device. We can account for cells’ movement and collision inside the growth chamber and simulate the flushing out of the cells from the chamber with their consequent removal from the set of analyzable cells. The CBC algorithm was implemented by adding to the simulation environment both the proportional and the MPC controllers. As previously done in Shannon et al.[22] and Salzano et al.,[49] the simulation environment has been complemented with the SDE solver based on the Euler–Maruyama method eq . BSim simulates experiments via a microfluidic-based system composed by a microfluidic device, a microscope, a computer, and an actuation system. Dimensions of the growth chamber were initially taken from Lugagne et al.,[6] but then set to 16 μM × 1.5 μM × 1 μM to reduce the simulation time. The proportional controller was implemented directly in BSim, while the MPC was implemented in Matlab and externally called by the agent-based simulator. All biomechanical parameters of the cells were set to the same values used by Matyjaszkiewicz et al.[30] Moreover, realistic constraints on the sampling and actuation time were considered, similarly to what was already done for Matlab CBC simulations. More precisely, we enforced the state of the cells to be measured every 5 min, and we assumed the input provided to be changed at most every 5 min. Finally, we assumed a 40 s delay on the actuation of the control inputs, accounting for the time the inducers would take to reach the microfluidics cell culture chambers in a physical experiment. All the factors taken into account in BSim added additional sources of variability to the simulations, sometimes affecting the performance of the control algorithms, as discussed in the Results and Discussion section.

31 in total

1. A synthetic oscillatory network of transcriptional regulators.

Authors: M B Elowitz; S Leibler
Journal: Nature Date: 2000-01-20 Impact factor: 49.962

2. Systematic experimental exploration of bifurcations with noninvasive control.

Authors: D A W Barton; J Sieber
Journal: Phys Rev E Stat Nonlin Soft Matter Phys Date: 2013-05-28

3. Long-term model predictive control of gene expression at the population and single-cell levels.

Authors: Jannis Uhlendorf; Agnès Miermont; Thierry Delaveau; Gilles Charvin; François Fages; Samuel Bottani; Gregory Batt; Pascal Hersen
Journal: Proc Natl Acad Sci U S A Date: 2012-08-14 Impact factor: 11.205

Review 4. Control theory meets synthetic biology.

Authors: Domitilla Del Vecchio; Aaron J Dy; Yili Qian
Journal: J R Soc Interface Date: 2016-07-20 Impact factor: 4.118

Review 5. Cellular decision making and biological noise: from microbes to mammals.

Authors: Gábor Balázsi; Alexander van Oudenaarden; James J Collins
Journal: Cell Date: 2011-03-18 Impact factor: 41.582

6. In Vivo Real-Time Control of Gene Expression: A Comparative Analysis of Feedback Control Strategies in Yeast.

Authors: Gianfranco Fiore; Giansimone Perrino; Mario di Bernardo; Diego di Bernardo
Journal: ACS Synth Biol Date: 2015-12-04 Impact factor: 5.110

7. Cheetah: A Computational Toolkit for Cybergenetic Control.

Authors: Elisa Pedone; Irene de Cesare; Criseida G Zamora-Chimal; David Haener; Lorena Postiglione; Antonella La Regina; Barbara Shannon; Nigel J Savery; Claire S Grierson; Mario di Bernardo; Thomas E Gorochowski; Lucia Marucci
Journal: ACS Synth Biol Date: 2021-04-27 Impact factor: 5.110