Gene expression is inherently stochastic and pervasively regulated. While substantial work combining theory and experiments has been carried out to study how noise propagates through transcriptional regulations, the stochastic behavior of genes regulated at the level of translation is poorly understood. Here, we engineered a synthetic genetic system in which a target gene is down-regulated by a protein translation factor, which in turn is regulated transcriptionally. By monitoring both the expression of the regulator and the regulated gene at the single-cell level, we quantified the stochasticity of the system. We found that with a protein translation factor a tight repression can be achieved in single cells, noise propagation from gene to gene is buffered, and the regulated gene is sensitive in a nonlinear way to global perturbations in translation. A suitable mathematical model was instrumental to predict the transfer functions of the system. We also showed that a Gamma distribution parameterized with mesoscopic parameters, such as the mean expression and coefficient of variation, provides a deep analytical explanation about the system, displaying enough versatility to capture the cell-to-cell variability in genes regulated both transcriptionally and translationally. Overall, these results contribute to enlarge our understanding on stochastic gene expression, at the same time they provide design principles for synthetic biology.
Gene expression is inherently stochastic and pervasively regulated. While substantial work combining theory and experiments has been carried out to study how noise propagates through transcriptional regulations, the stochastic behavior of genes regulated at the level of translation is poorly understood. Here, we engineered a synthetic genetic system in which a target gene is down-regulated by a protein translation factor, which in turn is regulated transcriptionally. By monitoring both the expression of the regulator and the regulated gene at the single-cell level, we quantified the stochasticity of the system. We found that with a protein translation factor a tight repression can be achieved in single cells, noise propagation from gene to gene is buffered, and the regulated gene is sensitive in a nonlinear way to global perturbations in translation. A suitable mathematical model was instrumental to predict the transfer functions of the system. We also showed that a Gamma distribution parameterized with mesoscopic parameters, such as the mean expression and coefficient of variation, provides a deep analytical explanation about the system, displaying enough versatility to capture the cell-to-cell variability in genes regulated both transcriptionally and translationally. Overall, these results contribute to enlarge our understanding on stochastic gene expression, at the same time they provide design principles for synthetic biology.
The ability to map a given genotype to its corresponding phenotype is perhaps the biggest pursuit in molecular biology [1], especially in the post-genomic era, as it can provide fundamental insight and predictive power on natural evolution, with clear applications in biomedicine and ecology. However, it is well established that the very same genotype can lead to phenotypic heterogeneity in a non-changing environment [2]. This is the consequence of the inherent stochasticity of the different biochemical reactions that are needed for gene expression [3]. While stochastic events are often seen as undesirable, as they are when the optimal gene expression levels are lost [4], we now realize that a noisy gene expression can also be advantageous for the cell population to face environmental changes or induce time-dependent behaviors [5]. In this regard, substantial progress has been made over the last years to quantitatively understand and model this non-genetic variability (noise). However, there are still numerous questions regarding the mechanisms that produce and regulate noise in gene expression.Motivated by the prevalence of transcriptional regulations in the cell [6], previous work focused on studying the emergence and propagation of noise in genes regulated transcriptionally [3,7]. For example, we now appreciate that some promoters can generate bursts of expression as a consequence of a stochastic switching in their activity [8], that the sign of the regulation determines the best way to extract information from the environment [9], and that the stochastic fluctuations can inform about the underlying regulation when time is considered [10]. In addition, recent studies also focused on post-transcriptional regulations implemented by small non-coding RNAs (in particular, by microRNAs in eukaryotic cells) [11,12]. These studies concluded that microRNAs, by controlling the messenger RNA (mRNA) abundance, can suppress part of the noise generated at the level of transcription, hence resulting in ideal genetic elements to engineer robust circuits. Nevertheless, studies on cell-to-cell variability when protein expression is regulated at the level of translation are scarce, especially when the regulation is exerted by a translation factor. We just know that structured 5’ untranslated regions (UTRs) can generate noise in protein expression [13], which can even be tuned by trans-acting small RNAs [14].The importance of studying how stochastic gene expression is generated and regulated at different levels in the genetic information flow lies in the fact that living cells implement highly intricate circuitries for multiple signal integration that allow displaying a variety of phenotypes. Certainly, this signal integration becomes easier and more scalable if different layers are exploited (e.g., transcriptional, translational, and post-translational), and this is precisely what has evolved in nature. Only by understanding the particularities of each regulatory mode, we can rationalize the impact of gene expression on the cell behavior. As highlighted before, more studies on stochastic gene expression regulated at a layer other than transcription are mandatory, especially because there are important phenotypes in nature that arise as a consequence of changing expression translationally.In this work, we exploited the bacteriophage MS2 coat protein (MS2CP) as a translation factor [15] to engineer a basic synthetic regulatory circuit from which to study stochastic gene expression when it is regulated translationally. In the natural context, in addition to be a structural protein to form the virion, MS2CP blocks the translation of the viral replicase upon binding to an RNA hairpin in the corresponding 5’ UTR that contains the ribosome binding site (RBS) and the start codon [16]. Over the years, MS2CP has been used for many applications due to its strong binding affinity to RNA, such as the subcellular tracking of mRNAs with time and space [17], the study of protein-RNA interactions in vivo [18], the development of CRISPR scaffold RNAs for programmable transcription regulation (CRISPR stands for clustered regularly interspaced short palindromic repeats) [19], and the construction of nanoscale architectures that can serve, for example, to improve enzymatic reactions [20]. With our engineered circuit, we examined gene expression in single cells by using a double reporter system to monitor both the regulator (MS2CP) and the regulated gene, and we also developed a mathematical model to provide a predictive quantitative foundation of the system.
Results
Regulation of translation with an RNA-binding protein in single cells
We engineered a synthetic genetic system in E. coli in which the RNA-binding protein MS2CP acts as a protein translation factor (). MS2CP was expressed from a synthetic PL-based promoter repressed by LacI (named as PLlac) [21] in a medium copy number plasmid (about 80 copies/cell). This allowed controlling the expression of the regulator (at the transcriptional level) with isopropyl β-D-1-thiogalactopyranoside (IPTG). In addition, we fused the enhanced blue fluorescent protein 2 (eBFP2) [22] to the N terminus of MS2CP (leading to eBFP2-MS2CP) in order to monitor its expression. As a regulated element, here we used the superfolder green fluorescent protein (sfGFP) [23], which was expressed from a constitutive promoter in a low copy number plasmid (about 15 copies/cell). The wild-type RNA motif recognized by MS2CP (with a dissociation constant of about 3 nM) [24] was placed in frame just after the start codon of sfGFP. In this way, MS2CP can block the progression of the ribosome on the regulated gene in the initial phase [15]. This mode of action differs from the natural one, in which MS2CP prevents translation initiation rather than elongation [16]. The resulting circuit behaves like an inverter considering IPTG as input and sfGFP as output, MS2CP being an internal regulator that operates at the level of translation.
Regulation with a protein translation factor.
a) Schematics of the gene regulatory system implemented in a bacterial cell. IPTG is the external molecule that controls the expression of the protein translation factor (eBFP2-MS2CP). sfGFP is the final output of the system. b) Histograms of single-cell fluorescence for eBFP2 (fused to the regulatory protein) for different induction conditions with IPTG. On top, sequence details of the cis-regulatory region (DNA level) for transcriptional regulation (PLlac promoter). c) Histograms of single-cell fluorescence for sfGFP (the regulated protein) for different induction conditions with IPTG. On top, sequence details of the cis-regulatory region (RNA level) for post-transcriptional regulation (MS2CP RNA motif). d) Mean and noise of expression for eBFP2 as a function of IPTG. e) Mean and noise of expression for sfGFP as a function of IPTG. f) Noise for eBFP2 as a function of mean expression. g) Noise for sfGFP as a function of mean expression. h) Transfer function of the post-transcriptional regulation in terms of mean expression. i) Transfer function of the post-transcriptional regulation in terms of noise. In plots d-i), points correspond to calculations from the experimental data, while solid lines to predictions with the mathematical model.We performed single-cell measurements of blue and green fluorescence by flow cytometry for a concentration gradient of IPTG (9 conditions) in order to quantitatively study the stochastic regulatory dynamics of this engineered system (). We found a substantial down-regulation of sfGFP (about 50-fold in expression) as a consequence of the action of MS2CP on the cognate mRNA. From these data, we calculated the mean and the noise of expression for both eBFP2-MS2CP and sfGFP (the noise as the square of the coefficient of variation) [3], which were represented as a function of IPTG (). The mean gives the average position of the population, and the noise is a measure of the cell-to-cell variability. These measurements were repeated for different populations, finding consistency in the results (). We then constructed a mathematical model relying on a series of algebraic equations from basics on the biochemistry of gene expression and molecular noise propagation [7]. To derive these mathematical expressions for the mean and the noise, we constructed a system of stochastic differential equations for mRNA and protein expression following the Langevin formalism. The rates of concentration changes were subject to stochastic fluctuations of intrinsic and extrinsic nature. This system was analytically solved in steady state with the mean-field approximation for the fluctuations. With a suitable parameterization, our model was able to recapitulate with reasonable agreement the values of mean expression and noise for both eBFP2-MS2CP and sfGFP, highlighting the functional form of the different dose-response curves. In particular, the mean expressions follow Hill-Langmuir equations and the noises non-monotonous curves presenting a maximum at an intermediate IPTG concentration. Indeed, the peak-like noise curve is a consequence of a sigmoidal dynamics at the population level. We also observed that the noise levels in sfGFP are lower than in eBFP2 for all IPTG concentrations. We also performed numerical simulations of the stochastic differential equations (), finding good agreement with the analytical results, as well as sensitivity analyses to reveal the effect of perturbations in the adjusted parameters (), highlighting how the curves of mean expression and noise shift in one direction and even change in form. In addition, we represented the noise versus the mean to show the stochastic expression scaling laws of the system (). The model was also explicative about the nonlinear transfer functions in terms of mean expression regulation () and noise propagation (i.e., how the noise of eBFP2-MS2CP impacts on the noise of sfGFP; ). Together, these results indicated that the protein translation factor is a suitable element to control expression and that the cell-to-cell variability emerged at this level can be predicted with certain accuracy.
Fig 1
Regulation with a protein translation factor.
a) Schematics of the gene regulatory system implemented in a bacterial cell. IPTG is the external molecule that controls the expression of the protein translation factor (eBFP2-MS2CP). sfGFP is the final output of the system. b) Histograms of single-cell fluorescence for eBFP2 (fused to the regulatory protein) for different induction conditions with IPTG. On top, sequence details of the cis-regulatory region (DNA level) for transcriptional regulation (PLlac promoter). c) Histograms of single-cell fluorescence for sfGFP (the regulated protein) for different induction conditions with IPTG. On top, sequence details of the cis-regulatory region (RNA level) for post-transcriptional regulation (MS2CP RNA motif). d) Mean and noise of expression for eBFP2 as a function of IPTG. e) Mean and noise of expression for sfGFP as a function of IPTG. f) Noise for eBFP2 as a function of mean expression. g) Noise for sfGFP as a function of mean expression. h) Transfer function of the post-transcriptional regulation in terms of mean expression. i) Transfer function of the post-transcriptional regulation in terms of noise. In plots d-i), points correspond to calculations from the experimental data, while solid lines to predictions with the mathematical model.
Noise analysis in transcription and translation regulation
To further analyze the stochastic behavior, we looked inside the noise. That is, we inspected how a particular noise level is achieved. For that, we first decomposed the total noise of both eBFP2-MS2CP and sfGFP into three fundamental components: extrinsic noise, intrinsic noise, and regulation noise. Extrinsic noise comes from replication and variability in the cellular machinery, intrinsic noise is a consequence of a low number of molecules, and regulation noise accounts for the noise that is propagated from the regulator to the regulated gene [3,7,25]. In previous work, the regulation noise has been considered as a part of the extrinsic noise. Here, we separate this component to study more in detail the stochastic gene expression when it is regulated. Assuming independence between the different stochastic sources, we were able to end with compact mathematical expressions for the noise in which the different components were identified, although at the cost of introducing some inaccuracies since the extrinsic noise may correlate responses. Along the IPTG gradient and according to our mathematical model, the extrinsic noise of the system is constant, the intrinsic noise decreases in the case of eBFP2-MS2CP and increases in the case of sfGFP (as this noise scales inversely with the expression level), and the regulation noise follows a peak-like curve (–). Even though for both eBFP2-MS2CP and sfGFP the functional form of the regulation noise is similar, peaking at 50–75 μM IPTG, the maximal noise level is much lower (about four times) in the case of sfGFP. This suggested that with a translational control the noise of the regulator is buffered, i.e., the fluctuations of MS2CP expression are manifested on sfGFP expression in lower extent than the fluctuations of LacI expression on MS2CP expression. This is because in a scenario of translational control the regulated gene is constantly transcribed at high levels, where fluctuations in the number of mRNAs per cell are small in comparison with the mean quantity that is produced. Thus, the transcriptional noise is not significant. In addition, the regulation enters at the level of translation, which prevents the typical amplification process of the noise of the regulator that occurs with a transcriptional control [26]. Indeed, in such a scenario, the transcription rate can be quite low when the promoter is repressed, thereby leading to substantial fluctuations in mRNA amount in comparison with the mean production. Furthermore, in the post-transcriptional case, the fluctuations in mRNA abundance can partly be absorbed by the effect of the translation factor, controlling the number of mRNAs available for translation. This has already been discussed in the case of regulatory RNAs [27], but it also applies to the case of a protein translation factor. Consequently, we can argue that the noise in the regulated gene is reduced when the regulation occurs at the level of translation.
Detailed analysis of stochastic gene expression.
a) Model-based calculation of the total noise in eBFP2 expression as a function of IPTG. On top, schematics of the amplification effect by the transcriptional regulation. b-d) Decomposition of the total eBFP2 noise into extrinsic, intrinsic, and regulation noise components. e) Model-based calculation of the total noise in sfGFP expression as a function of IPTG. On top, schematics of the buffering effect by the post-transcriptional regulation. f-h) Decomposition of the total sfGFP noise into extrinsic, intrinsic, and regulation noise components. Insets in c,g) show the scaling of the intrinsic noise with the mean expression. i) Predicted eBFP2 fluorescence distributions for different induction conditions with IPTG (Gamma distributions). j) Gamma shape and scale parameters for eBFP2. k) Predicted sfGFP fluorescence distributions for different induction conditions with IPTG (Gamma distributions). l) Gamma shape and scale parameters for sfGFP. In plots j,l), points correspond to calculations from the experimental values of mean and noise, while solid lines to predictions with the mathematical model.In addition, we aimed at predicting the shape of the whole distribution of protein expression and not only the particular noise value. To this end, we considered a Gamma distribution, which has been shown to describe quite well the stochasticity of genetic systems [28] and which emerges from ab initio calculations [29]. The distribution of protein expression is instrumental to appreciate the degree of heterogeneity in the production with time and from cell to cell (assuming ergodicity). Here, by defining the Gamma shape parameter as the inverse of the noise (equal to the mean square divided by the variance) and the Gamma scale parameter as the product between the mean and the noise (i.e., an effective Fano factor), we were able to predict the distributions for both eBFP2-MS2CP and sfGFP (–). This was done with the values of mean expression and noise given by the mathematical model. As a result of a transcriptional control, the Gamma scale parameter for eBFP2-MS2CP depends on the translation rate; but in the case of sfGFP, as this element is controlled at the level of translation, the Gamma scale parameter is nearly independent of that rate. Importantly, these theoretical distributions were very close to those fitted directly against the experimental data (), although some discrepancies were observed between the data and the model at intermediate IPTG concentrations. Overall, this highlighted the generality of the Gamma distribution to describe genetic systems regulated at both transcriptional and translational levels.
Examination of global effects on regulated gene expression
Subsequently, we decided to study how global perturbations can impact the single-cell response of the system. To this end, we considered the effect of a global signal affecting translation. Here, we used sublethal concentrations of tetracycline (TC), a bacteriostatic antibiotic known to inhibit the formation of active ribosomes () [30]. Paradoxically, this inhibition leads to an increase in translation rate as a result of an over-production of ribosomes (a global response mechanism in bacteria against this type of antibiotics) [31,32]. That is, the cell is able to sense that a substantial amount of ribosomes is being inhibited upon binding to TC and produces more. In particular, TC binds to the 30S subunit and interferes with the transfer RNAs (tRNAs). In turn, the cell growth rate is compromised due to the action of TC. Importantly, this parameter has been shown to modulate the mean and noise of gene expression [33,34], so we decided to exploit it as a predictor variable. Over a two-dimensional concentration gradient of IPTG and TC (81 conditions), we first generated growth curves (). Basically, only TC showed a significant impact on growth rate (), with a maximal reduction of almost 3-fold, which was well explained by a Michaelis-Menten function ().
Growth-dependent regulation with a protein translation factor.
a) Extended schematics of the gene regulatory system in which TC further modulates it through its negative impact on growth rate and positive impact on translation rate (global effects). b) Heatmap of the mean growth rate as a function of IPTG and TC. c) Dose-response curve between growth rate and TC. Points correspond to experimental data, while solid line comes from the mathematical model. d) Projected two-dimensional histograms of single-cell fluorescence for eBFP2 and sfGFP for different induction conditions with IPTG and TC. e) Heatmap of the mean eBFP2 fluorescence as a function of IPTG and TC. f) Heatmap of the mean sfGFP fluorescence as a function of IPTG and TC. g) Heatmap of the mean eBFP2 synthesis rate as a function of IPTG and TC. h) Heatmap of the mean sfGFP synthesis rate as a function of IPTG and TC. i) Heatmap of the eBFP2 noise as a function of IPTG and TC. j) Heatmap of the sfGFP noise as a function of IPTG and TC.In parallel, we performed single-cell measurements of blue and green fluorescence for each condition (). We observed that the mean expression levels of both eBFP2-MS2CP and sfGFP remained almost constant at low TC concentrations, but they increased significantly from 500 ng/mL TC, irrespective of the induction with IPTG (). Because protein expression comes from the ratio between the protein synthesis rate (accounting for both transcription and translation) and the growth rate (in the case of stable proteins, as it is the case here), this indicated that the protein synthesis rate of both eBFP2-MS2CP and sfGFP scales with the growth rate (). It was interesting to note here the logical NOR behavior of the sfGFP synthesis rate and the difference between protein expression and synthesis rate. In addition, we calculated the noise levels for each condition (). We observed that the regulation noise decreases with TC for both eBFP2-MS2CP and sfGFP, as well as that TC leads to a substantial increase in the sfGFP noise when this gene is fully repressed by MS2CP.
Fig 3
Growth-dependent regulation with a protein translation factor.
a) Extended schematics of the gene regulatory system in which TC further modulates it through its negative impact on growth rate and positive impact on translation rate (global effects). b) Heatmap of the mean growth rate as a function of IPTG and TC. c) Dose-response curve between growth rate and TC. Points correspond to experimental data, while solid line comes from the mathematical model. d) Projected two-dimensional histograms of single-cell fluorescence for eBFP2 and sfGFP for different induction conditions with IPTG and TC. e) Heatmap of the mean eBFP2 fluorescence as a function of IPTG and TC. f) Heatmap of the mean sfGFP fluorescence as a function of IPTG and TC. g) Heatmap of the mean eBFP2 synthesis rate as a function of IPTG and TC. h) Heatmap of the mean sfGFP synthesis rate as a function of IPTG and TC. i) Heatmap of the eBFP2 noise as a function of IPTG and TC. j) Heatmap of the sfGFP noise as a function of IPTG and TC.
Integrative modeling of the deterministic and stochastic dynamics
Importantly, we noted that the cell volume increases as a consequence of TC (; see also where we show an exponential trend), which agrees with previous observations [35]. This entails the necessity of a higher number of MS2CP molecules per cell to repress the target gene. Moreover, it is known that the number of total ribosomes decreases linearly with the growth rate when it is modulated by an antibiotic (i.e., the slower the replication, the larger the number of ribosomes) [31]. This leads to a reciprocal function with the growth rate to describe the translation rate of a given gene [32]. That is, the translation rate scales inversely with the growth rate. It is also known that the transcription rate (mRNA production) scales linearly with the growth rate [33]. In terms of number of proteins per cell, this effect is cancelled out by the effective dilution due to cell division. By introducing into our mathematical model these dependencies, we were able to predict with relatively good agreement the impact of TC on mean expression for both eBFP2-MS2CP and sfGFP ().
Fig 4
Detailed analysis of stochastic gene expression modulated by growth rate.
a) As TC increases, cells grow slower and bigger. b) Mean eBFP2 expression as a function of growth rate for each IPTG condition. c) Mean sfGFP expression as a function of growth rate for each IPTG condition. d) Relative mean eBFP2 synthesis rate as a function of growth rate for each IPTG condition. e) Relative mean sfGFP synthesis rate as a function of growth rate for each IPTG condition. In d,e), the values are relative to the case TC = 0, and the dashed line corresponds to a linear dependence (which comes from a null model in which the translation rate is not affected by the growth rate). f) Transfer function of the post-transcriptional regulation in terms of mean expression for each TC condition. The inset shows the effect of TC on the sfGFP mRNA synthesis rate (included in α, as the growth rate decreases) and the effective dissociation constant between eBFP2-MS2CP and sfGFP mRNA (θ, as the volume increases) according to the mathematical model. g) eBFP2 noise as a function of growth rate for each IPTG condition. h) sfGFP noise as a function of growth rate for each IPTG condition. i) Transfer function of the post-transcriptional regulation in terms of noise for each TC condition. In plots b-i), points correspond to calculations from the experimental data, while solid lines to predictions with the mathematical model.
Detailed analysis of stochastic gene expression modulated by growth rate.
a) As TC increases, cells grow slower and bigger. b) Mean eBFP2 expression as a function of growth rate for each IPTG condition. c) Mean sfGFP expression as a function of growth rate for each IPTG condition. d) Relative mean eBFP2 synthesis rate as a function of growth rate for each IPTG condition. e) Relative mean sfGFP synthesis rate as a function of growth rate for each IPTG condition. In d,e), the values are relative to the case TC = 0, and the dashed line corresponds to a linear dependence (which comes from a null model in which the translation rate is not affected by the growth rate). f) Transfer function of the post-transcriptional regulation in terms of mean expression for each TC condition. The inset shows the effect of TC on the sfGFP mRNA synthesis rate (included in α, as the growth rate decreases) and the effective dissociation constant between eBFP2-MS2CP and sfGFP mRNA (θ, as the volume increases) according to the mathematical model. g) eBFP2 noise as a function of growth rate for each IPTG condition. h) sfGFP noise as a function of growth rate for each IPTG condition. i) Transfer function of the post-transcriptional regulation in terms of noise for each TC condition. In plots b-i), points correspond to calculations from the experimental data, while solid lines to predictions with the mathematical model.To further comprehend the interplay between gene regulation and cell growth, we used the model to predict the protein synthesis rate. Remarkably, we found that for eBFP2-MS2CP the same curve is explicative for all induction conditions with IPTG, provided the values are relativized to the case of no TC (). This is because eBFP2-MS2CP is regulated transcriptionally, and in turn the transcription factor of the system (LacI) is expressed constitutively and modulated by IPTG in terms of activity at the post-translational level. Hence, there is a decoupling between the regulation and the effect of growth rate on expression. A minimum in eBFP2-MS2CP synthesis rate was found at a growth rate of about 0.45 h-1, which comes from the fact that the protein synthesis rate is modeled by a rational function with the growth rate. In essence, at low TC concentrations, the transcription rate is reduced, but the translation rate remains almost constant. However, at high TC concentrations, the translation rate very much increases and dominates over the transcription rate. By contrast, we found that the relative sfGFP synthesis rate strongly depends on IPTG and that this dependence is well captured by the model (). In this case, the number of MS2CP molecules per cell changes with IPTG, so there is a coupling between the translational regulation and the effect of growth rate on expression. While at low IPTG concentrations the relative sfGFP synthesis rate follows the aforementioned trend for eBFP2-MS2CP, at high IPTG concentrations there is a maximum at a growth rate of about 0.65 h-1 (it was particularly pronounced at the intermediate value of 100 μM). illustrates how the transfer function in terms of mean expression varies with the TC concentration (i.e., by increasing the maximal expression level and shifting the inflexion point towards the right).Finally, we applied the model to project the noise in protein expression. In this case, we needed to introduce a phenomenological dependence with the growth rate on three noise-related parameters to explain the data. In particular, we set that the noise in LacI expression scales with the square of the growth rate (i.e., LacI expression varies from cell to cell in greater extent when cells grow faster) and that the extrinsic noise of both eBFP2-MS2CP and sfGFP scales with the inverse of the growth rate by following the translation rate (i.e., the extrinsic noise is higher at lower growth rates, which seems in tune with recent experiments characterizing genome-wide noisy expression levels [36]). While for eBFP2-MS2CP the noise decreases or remains constant with the growth rate (), for sfGFP the noise presents a more complex trend (almost constant at low IPTG concentrations and with a maximum at high IPTG concentrations; ). In turn, illustrates how the transfer function in terms of noise varies with the TC concentration, showing how the belly shape is reduced with TC, which indicates that noise propagation through the translation factor is less significant (i.e., it is masked) when the growth rate is low. That is, the regulation noise term, which quantifies how much fluctuation sfGFP perceives from MS2CP, becomes smaller than the other noise terms (intrinsic and extrinsic) with TC. Arguably, at high growth rates, when global perturbations are small, a translation factor is superior to a transcription factor because it is able to regulate gene expression without transmitting much noise. However, this is not the case at low growth rates, when global perturbations become substantial, due to a poor signal-to-noise ratio. Together, these results highlight the complex impact of growth rate in the system and serve to appreciate how global and local regulatory mechanisms interplay in the cell.
Discussion
Our development follows previous work on exploiting RNA-binding proteins as translation factors to engineer gene circuits [15]. In this work, we focused on quantitatively studying the stochastic behavior of this type of circuits (i.e., noise generation and propagation in gene expression when it is regulated at the level of translation). Our results show that the protein-RNA interaction in this case leads to a significant down-regulation in expression of about 50-fold by blocking the progression of the ribosome on the target mRNA, which is comparable to transcriptional fold-changes. In addition, a general mathematical framework was shown suitable to describe the stochastic behavior in regulations exerted by both transcription and translation factors. Noise propagation from gene to gene is buffered when the regulator acts at the level of translation, as the amplification process by transcription is avoided, and a Gamma distribution properly parameterized can provide deep analytical explanations about the resulting cell-to-cell variability [28]. By modulating the cellular growth rate, we also reported an interplay between global and local regulatory mechanisms in the cell that affect both the mean expression and noise levels.It is important to notice that the growth rate that we measured here corresponds to an average of the population (as we calculate it having monitored absorbance with time for a culture). Nevertheless, each cell grows differently, especially when the culture is under the effect of TC. In this regard, a mathematical model incorporating such heterogeneous growth might explain better the observed noise patterns [37]. Another limitation of our work is the assumption that the noise sources are independent, which allowed us to derive a compact mathematical expression for the noise of sfGFP. This is not strictly true since eBFP2-MS2CP and sfGFP share extrinsic noise sources [7]. Yet, we expect reliability in the conclusions derived from this study.Here, we exploited the viral protein MS2CP to implement the regulatory system, but in principle other RNA-binding proteins might be used. For example, the bacteriophage PP7 coat protein or the Mycobacterium enzyme PyrR [16] are suitable elements from which to engineer orthogonal systems. In fact, given the plethora of RNA-binding proteins in nature, especially in eukaryotes [38], and noting that the regulatory mechanism only requires a tight protein-RNA interaction to interfere with the ribosome, multiple implementations might be achieved. Our mathematical model is general enough to describe these eventual implementations. We only expect to change the kinetic parameters for each particular protein, preserving the functional form. In principle, each RNA motif will lead to a different translation rate. In this regard, the predictability of the system might be strengthened by using the RBS calculator [39]. We also anticipate that the use of tandem repeats of the RNA motif might enhance the regulatory fold-change of the system. Furthermore, since RNA is a very versatile molecule, RNA-binding proteins can regulate gene expression through a variety of mechanisms acting post-transcriptionally, including the regulation of translation initiation, translation elongation, transcription termination, and RNA stability [40]. In prokaryotes, the regulation of translation initiation by controlling RBS accessibility is a widespread mechanism, but in eukaryotes the blockage of translation elongation has been observed in the case of Argonaute proteins [41]. Arguably, a blockage in the initial phase by MS2CP is key in our synthetic system. Further work should analyze how those other mechanisms generate and propagate noise.In sum, our work provides new quantitative insights about the stochastic behavior in genes regulated translationally by RNA-binding proteins. Protein translation factors can integrate some advantages distinctively attributed to proteins (as transcription factors), such as the ability to transduce small signals and achieve large dynamic ranges, or to small RNAs, such as the ability to produce rapid responses and buffer transcriptional noise [27]. Furthermore, our work paves the way for engineering gene regulatory circuits with greater integrability and then sophistication. Certainly, the combination of different layers within the genetic information flow (i.e., transcription and translation) leads to an easier integration of signals to achieve a given function [42]. Therefore, we envision that RNA-binding proteins will be of great utility in synthetic biology in the close future to face biotechnological and biomedical challenges.
Materials and methods
Strains, plasmids, and reagents
E. coli Dh5α was used for cloning purposes by following standard procedures. To express our genetic circuit, E. coli MG1655-Z1 (lacI+ and tetR+; kindly gifted by M.B. Elowitz) was used. This strain was co-transformed by electroporation with two plasmids, called pRKFR2 (kanR, pSC101-E93G ori) and pREP3 (camR, p15A ori; kindly gifted by J.A. Daròs). pRKFR2 contains the gene coding for MS2CP translationally fused to eBFP2 in its N terminus (eBFP2-MS2CP) under the transcriptional control of the inducible promoter PLlac. pREP3 contains the gene coding for sfGFP under the transcriptional control of the constitutive promoter J23119 and the translational control of the MS2CP-recognizing RNA motif. The genetic cassettes were synthesized by IDT. LB medium was used for both overnight and characterization cultures. Kanamycin and chloramphenicol were used at the concentration of 50 μg/mL and 34 μg/mL, respectively. IPTG and TC were used as inducers of the system. The concentration gradient of IPTG that we tested was 0, 5, 10, 20, 50, 100, 200, 500, and 1000 μM, and the concentration gradient of TC was 0, 10, 100, 200, 300, 400, 500, 700, and 1000 ng/mL. Compounds provided by Sigma.Note that LacI is overexpressed in MG1655-Z1 to efficiently regulate the PLlac promoter in the pRKFR2 plasmid. This overexpression adds to the wild-type expression of LacI. In addition, TetR is expressed in E. coli MG1655-Z1 (although it does not play any regulatory role), so it will bind to TC when this inducer is used. The titration effect will be more relevant at low concentrations of TC, although overall this will only mean an effective TC concentration slightly lower. The maximal TC concentration used here reduced significantly the growth rate of the cells, suggesting a marginal effect of TetR in this case.
Growth curves
Cultures (2 mL) inoculated from single colonies (three replicates) were grown overnight in LB medium at 37°C and 200 rpm. Cultures were then diluted 1:100 in fresh LB medium (2 mL) and were grown for 3 h at the same conditions to reach exponential phase (OD600 around 0.5). Cultures were then diluted 1:50 in fresh LB medium (200 μL) to load a microplate (96 wells, black, clear bottom; Corning) with appropriate concentrations of IPTG and TC. The microplate was then incubated for 10 h at 37°C and 1,000 rpm in a PST-60HL plate shaker (Biosan). Absorbance (600 nm) was measured every hour in a Varioskan Lux fluorometer (Thermo). The growth rate was calculated as the slope between absorbance (in log scale) and time during the exponential phase. Data analysis performed with MATLAB (MathWorks) and Python.
Flow cytometry
Cultures (2 mL) inoculated from single colonies (four replicates) were grown overnight in LB medium at 37°C and 200 rpm. Cultures were then diluted 1:100 in fresh LB medium (2 mL) and were grown for 3 h at the same conditions to reach exponential phase (OD600 around 0.5). Cultures were then diluted 1:50 in fresh LB medium (200 μL) to load a microplate (96 wells, black, clear bottom; Corning) with appropriate concentrations of IPTG and TC. The microplate was then incubated at 37°C and 1,000 rpm in a PST-60HL plate shaker (Biosan) until cultures reached a sufficient OD600 (a different incubation time for each TC concentration). Cultures (6 μL) were then diluted in PBS (1 mL). Fluorescence was measured in an LSRFortessa flow cytometer (BD); a 405 nm laser and a 450 nm filter for blue fluorescence, and a 488 nm laser and a 530 nm filter for green fluorescence. Events were gated by using the forward and side scatter signals and compensated (~104 events after this process). The mean value of the autofluorescence of the cells was subtracted in each channel to obtain a final estimate of expression (). Data analysis performed with MATLAB and Python. The mean and the variance were calculated for each distribution after removing outliers, which served to compute the noise in gene expression.
Deterministic mathematical modeling
We assumed that the cellular amount of regulatory protein (eBFP2-MS2CP) is proportional to the signal of blue fluorescence, and that the amount of regulated protein (sfGFP) is proportional to the signal of green fluorescence. Thus, with IPTG and TC being the two external molecules of control, the Hill-Langmuir equations that dictate average protein expression (population measure) are
where α is the maximal protein level from the PLlac promoter (in presence of IPTG), ρ the transcriptional repression fold by LacI, α the maximal protein level from the constitutive J23119 promoter, and ρ the translational repression fold by eBFP2-MS2CP. In addition, θ is the effective dissociation constant between LacI and IPTG, n the effective degree of cooperativity of LacI, θ the effective dissociation constant between eBFP2-MS2CP and the cognate RNA motif embedded within the sfGFP mRNA, and n the effective degree of cooperativity of eBFP2-MS2CP. By using our data, the adjusted parameter values are α = 1,890 AU, ρ = 0.225, θ = 116 μM, n = 2.38, α = 29,000 AU, ρ = 0.016, θ = 610 AU, and n = 5 (upon varying IPTG, with no TC).Moreover, if μ denotes the actual cell growth rate, which is modulated by TC and dictates the dilution rate of the proteins, it turns out that the following Michaelis-Menten equation
where μ0 is the maximal cell growth rate (in absence of TC) and θ the half maximal inhibitory concentration of TC. From the data, we obtained μ0 = 1.2 h-1 and θ = 526 ng/mL (upon varying TC, with no IPTG).To consider the impact of growth rate on protein expression, we noticed that protein expression is the ratio between the protein synthesis rate (SR) and the growth rate, and that the protein synthesis rate is the product between the mRNA amount and the translation rate (λ, for gene i). That is,
Because it is known that the mRNA amount per cell is proportional to the growth rate (i.e., the transcription rate increases as long as the cell grows faster, 〈mRNA〉∝μ) [33], the dependence of the protein synthesis rate on the growth rate is just given by the effect of the growth rate on the translation rate. Interestingly, the translation rate can be described by a Michaelis function of the number of active ribosomes [32], which is known to increase linearly when the growth rate decreases as a consequence of TC. We can then define two effective parameters (ε1 and ε2, for gene i) to model such a dependence as and then write for eBFP2 and sfGFP the following
Here, we adjusted ε1 = 0.196 h-1, ε1 = 0.299 h-1, ε1 = 0.246 h-1, and ε1 = 0.349 h-1.In addition, we noticed that the cellular volume changes with the growth rate as a consequence of TC. By using the cube of the forward scatter signal (median of the population) as a proxy of the volume, a negative exponential trend was identified (i.e., the volume increases with TC). Because the model considers as variables the amount of fluorescent proteins per cell and not the concentrations, the parameter θ needs to be corrected as
where here we took δ = 0.85 h. In essence, when the volume increases, the number of proteins required to regulate the target gene are higher. Besides, the impact of growth rate on the transcriptional regulation was assumed negligible, as LacI is a highly expressed protein from the chromosome whose activity is modulated by a chemical inducer, which enters into the model in terms of concentration (i.e., θ independent of μ).
Stochastic mathematical modeling
The expressions for the variances in gene expression (or fluorescence signal) can be derived by following some basic calculations (see for a detailed calculation) [3,7]. We followed a Langevin formalism, which consists in introducing in the right-hand side of the differential equation a series of stochastic processes, and the mean-field approximation for the analytical treatment, which consists in assuming that the different fluctuation amplitudes depend on the deterministic solution. In brief, the variance for a given gene and induction condition can be decomposed into three different variances according to the nature of the molecular noise source. The variance coming from extrinsic noise can be assumed to scale with the square of the expression level (then leading to a constant term in gene expression noise), the variance from intrinsic noise with the expression level, and the variance from the regulatory protein noise with the square of the derivative of the transfer function (in terms of protein synthesis rate). In particular, we can write
where and are two empirical constants that quantify the levels of noise of extrinsic nature on eBFP2-MS2CP and sfGFP, respectively. Also, is a constant that measures the noise in LacI expression, β and β the Fano factors of noise of intrinsic nature for eBFP2-MS2CP and sfGFP, respectively, and γ a constant that accounts for the difference between fluorescence and number of molecules. Note that 〈eBFP2〉 = 〈MS2CP〉 and . Note also that when a strong repression occurs at the level of translation the transcriptional noise can be neglected and then β can be considered constant (i.e., the Fano factor, in number of molecules per cell, can be approached by 1). By using our data, the adjusted parameter values are
β = 45.6 AU, μM2,
β = 61.9 AU, and γ = 0.0233 (upon varying IPTG, with no TC).It is important to recall that the parameters α, α, and θ depend on the growth rate. Moreover, the intrinsic noise Fano factor for eBFP2-MS2CP is proportional to the translation rate (β∝λ),
but not the factor for sfGFP (β nearly independent of λ), as sfGFP is regulated at the level of translation and then its mRNA is constitutively expressed. To explain our data, we introduced the following phenomenological expressions
In essence, this indicates that LacI expression varies from cell to cell in greater extent when cells grow faster, and that the extrinsic noise increases when the growth rate is very low.Finally, we assumed that the stochastic gene expression follows a Gamma distribution (see for a basic derivation) [28]. Then, the probability for a given expression level reads
where a and a are the Gamma shape parameters for eBFP2-MS2CP and sfGFP, respectively, and b and b the Gamma scale parameters. Importantly, by knowing that for a Gamma distribution ab is the mean and ab2 the variance, these two parameters can be defined as
This means that the Gamma shape parameter is directly the inverse of the noise, and that the Gamma scale parameter depends on the translation rate in the case of eBFP2-MS2CP (transcription regulation) and is nearly independent of it in the case of sfGFP (translation regulation).
Numerical simulations
The system of stochastic differential equations (see ) was solved numerically to obtain stochastic trajectories of mRNA and protein concentrations. For that, we followed an integration scheme previously described [43]. The colored stochastic processes (for extrinsic and regulation noise) were obtained from independent white stochastic processes. The system was solved in one time interval with the routine ode45s from MATLAB, considering constant the stochastic fluctuations in that interval. The values of the fluctuations were updated in each interval with the previous mRNA and protein concentrations. Negative concentration values were avoided.
Reliability of the dose-response curve.
a) Mean of eBFP2 expression as a function of IPTG. b) Mean of sfGFP expression as a function of IPTG. c) Noise of eBFP2 expression as a function of IPTG. d) Noise of sfGFP expression as a function of IPTG. Points correspond to the values of the population shown in the main figures. Error bars correspond to standard errors calculated from four different populations. Solid lines correspond to predictions with the mathematical model.(TIF)Click here for additional data file.
Numerical simulations of stochastic dynamics.
a-d) Stochastic trajectories with time of eBFP2 and sfGFP for two different IPTG concentrations. In red, deterministic trajectories. The initial condition corresponds to the uninduced state in all cases. e-h) Histograms of protein expression computed from long trajectories. The Gamma distributions fitted against the experimental data (blue lines) were also represented.(TIF)Click here for additional data file.
Sensitivity analysis of the model parameters.
Plots of mean and noise of expression as a function of IPTG, where solid lines correspond to the dynamics predicted with the adjusted parameter, dotted lines to the dynamics if the parameter increases 2-fold, and dashed lines to the dynamics if the parameter decreases 2-fold.(TIF)Click here for additional data file.
Stochastic gene expression described by a Gamma distribution.
Histograms of experimental single-cell fluorescence for both a) eBFP2 and b) sfGFP for different induction conditions with IPTG, together with fitted Gamma distributions against the data (blue lines) and predicted Gamma distributions obtained by using the model values of mean and noise (red lines).(TIF)Click here for additional data file.
Growth curves.
Three different populations (blue, red, and green) were monitored with time. Points correspond to absorbance values, while solid lines come from fitted exponential trends.(TIF)Click here for additional data file.
Relationship between cellular growth rate and volume.
a) Schematics to show that as TC increases, cells grow slower and are bigger. b) Scatter plot between the cube of the forward scattering signal (proxy of cellular volume) and the growth rate for the 81 IPTG and TC conditions (colored by TC condition). An exponential trend was adjusted (solid line).(TIF)Click here for additional data file.
Stochastic differential equations.
Derivation of the mathematical expressions of noise in eBFP2 and sfGFP having followed a Langevin formalism and the mean-field approximation.(DOCX)Click here for additional data file.
Gamma distribution.
Derivation of the Gamma distribution for protein expression from a general stochastic differential equation.(DOCX)Click here for additional data file.
Flow cytometry data.
Single-cell fluorescence data of eBFP2 and sfGFP for different induction conditions with IPTG and TC after filtering events.(XLSX)Click here for additional data file.12 Jul 2021Dear Dr Rodrigo:Thank you very much for submitting your manuscript "Gene regulation by a protein translation factor at the single-cell level" (PCOMPBIOL-D-21-00828) for review by PLOS Computational Biology.As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. Based on the reviews, we regret that we will not be pursuing this manuscript for publication at PLOS Computational Biology.Although overall the paper presents new findings, the integration and application of the quantitative analysis is lacking and doesn't follow norms and definitions from previously published work in stochastic gene expression.The reviews are attached below this email, and we hope you will find them helpful if you decide to revise the manuscript for submission elsewhere.While we cannot consider your manuscript further for publication in PLOS Computational Biology, we would like to offer you the option to transfer your submission, with reviews, to PLOS ONE https://www.editorialmanager.com/PONE/If you DO wish to transfer your submission, please click this link:27749731511488f0-8f5e-4815-a7a9-c79a1db60ee3297360PCOMPBIOL-D-21-00828d43998f3-b36c-436d-8eb1-98fe1a06688dIf you do NOT wish to transfer your submission, please click this link to decline:28749731511488f0-8f5e-4815-a7a9-c79a1db60ee3297360PCOMPBIOL-D-21-00828d43998f3-b36c-436d-8eb1-98fe1a06688dPlease note, all PLOS journals are editorially independent and vary in submission requirements.Should you choose to transfer, your manuscript files, along with the reviewers' comments and their identities will be transferred automatically, and you will receive a confirmation email within 24 hours. Once transferred, your submission will be returned to you so you can check over your record before completing the submission. You may be asked to provide additional information, such as a response to the reviewers' comments. If you have any questions, please contact the editorial office of PLOS ONE https://www.editorialmanager.com/PONE/We are sorry that the news is not more positive on this occasion, and we hope you will consider PLOS Computational Biology for future submissions. Thank you for your support of PLOS and of open-access publishing.Sincerely,David UmulisAssociate EditorPLOS Computational BiologyIlya IoshikhesDeputy EditorPLOS Computational Biology***********************Reviewer's Responses to QuestionsComments to the Authors:Please note here if the review is uploaded as an attachment.Reviewer #1: This manuscript aims to examine the signal transduction and noise-modulating properties of various signaling network structures comprising feedforward loops. All possible network variants are assembled, varying the type of regulation, dual input function and node duplication. The input signal varies in mean but has a constant Fano factor of 1.83. The mean and Fano factor at the output are compared to those at the input. The study identifies the best-performing network structures for signal transduction and noise reduction.The manuscript is interesting, timely, well executed and well written. It should be publishable after the following comments are addressed.(1) The Fano factor measures burstiness (how different a process is from the Poisson process). However, this is not the only measure of noise magnitude. To characterize the standard deviation in terms of the mean, the Coefficient of Variation (CV, defined as standard_deviation/mean) is very widely used. The CV may be preferable for characterizing the quality of signal transduction, whereas the FF is most appropriate to characterize the burstiness of gene expression. For this reason, all the plots showing Fano Factors versus slope should also be made for the CV, using a constant CV at the input.(2) Keeping the Fano factor constant at the input makes the reader wonder, what would have happened at higher/lower input FF? It would be interesting to explore what happens if the input FF is higher or lower. The same question refers to the input CV.(3) The manuscript examines signaling (post-translational) interactions. Similar network structures exist in transcriptional regulatory networks. It would be important to discuss how the results would apply to transcriptional regulatory interactions. A relevant reference for FFL clustering may be PMID:15018656.(4) “…better noise reduction (lower Fano) correlates with better signal transduction (larger slope).” More specifically, this is true only for slope > 0. However, for slope < 0, better noise reduction (lower Fano) seems to correlate with worse signal transduction. Best signal transduction for slope < 0 is near -1, where the FF is the highest. The same is probably true for coupled motifs.(5) Besides the FFL, negative feedback can effectively reduce noise, as the authors note. In addition to ref. [13], this is also described in PMID:10850721 and PMID:19279212, which may be worth citing.Reviewer #2: In this paper Dolcemascolo et al. study noise propagation in a syntheticgenetic network where protein expression is regulated at the level oftranslation. The authors used the MS2-MCP system to inhibit the translationof a super folding GFP (sfGFP) variant and monitored the expression of boththe regulator (MCP fused to enhanced blue fluorescent protein 2 or eBFP2)and the target (sfGFP) in E. coli. MCP was expressed under the control of alac inducible promoter so that the expression level of MCP and hence thetranslational repression of sfGFP could be controlled by IPTG treatment.The authors empirically determined the steady state distributions of eBFP2and sfGFP at different IPTG concentrations as well as under tetracyclinetreatment that modified the growth rate and various kinetic properties ofthe cells. The empirical data were analyzed by computing noise defined asthe CV^2, where the CV is the coefficient of variation.The empirical contributions of this paper are strong: a new synthetic genecircuit has been established and produces excellent quantitative data atsingle-cell resolution.The computational/theoretical contributions however are rather weak.Although the authors call it "stochastic modeling", equations 6 (inMaterials and Methods) do not fit the commonly accepted meaning of the termin the field of stochastic gene regulation and can best be described asdeterministic curve fitting. A few observations regarding Equations 6:1. The third term in the expressions for CV^2 in equation 6 performs errorpropagation where the standard deviation of a dependent variable islinearly approximated as a function of the standard deviations of itsindependent variables. This is a purely deterministic framework.2. The second term in these equations representing "intrinsic" noiseassumes a constant Fano factor. Based on Friedman et al. (PRL 97: 168302),cited by the authors, and also Thattai and van Oudenaarden (PNAS 98: 8614)one would expect that the Fano factor is given by b+1 (discrete masterequation) or b (continuous master equation). b is theaverage number of proteins produced per burst, which is proportional to thetranslation rate. If the translation rate of sfGFP is being regulated byMCP, the Fano factor (=b) should be dependent on the amount of MCP/eBFP2 andnot a constant as the authors assume.3. Based on the above two points, the authors are effectively carrying outa deterministic curve fitting exercise when modeling the noise.4. Another problem is the use of the terms "extrinsic" and "intrinsic".Barring a two-color experiment (two colors of the translationally repressedprotein), it is not possible to decompose the noise into extrinsic andintrinsic components. Although one may decompose the total noise into twoterms (intrinsic and extrinsic), decomposing further into three or moreterms (each representing averages over subsets of independent variables,see Swain et al. PNAS 99: 12795) is not possible as far as I can tell. Ifthe authors have done so, they should include a derivation. Third, thetransfer function noise is also extrinsic since it involves the randomnessof eBFP2 expression but not the inherent stochasticity of translation.The authors observed a lower level of noise in sfGFP but a clearmechanistic explanation has not been provided. The assumptions of a genetranscribed and translated at a constant rate at steady state match theexperimental setup and the Gamma distribution describes the empiricalhistograms well. Then according to the results of Friedman et al. (PRL 97:168302), CV^2 = 1/a (shown as Eq. 10 in Methods), where a is the Gammadistribution shape parameter that is proportional to the transcriptionrate (and not translation rate). This could imply that the lower noisein sfGFP is due to higher transcript numbers and not translationalregulation.The deterministic Hill equations used for determining the dependence ofmean eBFP2 and sfGFP concentrations on IPTG and eBFP2 concentrations(Eq. 1) are unusual. The Hill equation for activation that results frommass kinetics takes the form (X/K)^n/(1 + (X/K)^n) and the Hill equationfor repression takes the form 1/(1 + (X/K)^n). Equation 1 however combinesboth repression and activation:alpha * (sigma + (IPTG/theta)^n)/(1 + (IPTG/theta)^n)= alpha * [ sigma/(1 + (IPTG/theta)^n) +(IPTG/theta)^n/(1 + (IPTG/theta)^n) ]According to this equation IPTG both activates and represses eBFP2simultaneously and sigma is a weight to determine the balance betweenactivation and repression. This may have been done to achievenon-zero eBFP2 expression when there is no IPTG. A basal or leaky level ofeBFP2 expression in the absence of IPTG might be better represented byadding a constant term to the Hill equation for activation.In summary, the paper treats propagation of noise in a deterministicmanner and lacks any sort of stochastic treatment of translation noise undertranslational regulation.--------------------Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.Reviewer #1: YesReviewer #2: Yes--------------------PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.If you choose “no”, your identity will remain anonymous but your review may still be made public.Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.Reviewer #1: NoReviewer #2: No25 Oct 2021Submitted filename: cover letter PCOMPBIOL-D-21-00828.pdfClick here for additional data file.17 Dec 2021Dear Dr Rodrigo,Thank you very much for submitting your manuscript "Gene regulation by a protein translation factor at the single-cell level" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations.Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email.When you are ready to resubmit, please upload the following:[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).Important additional instructions are given below your reviewer comments.Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.Sincerely,David M. UmulisAssociate EditorPLOS Computational BiologyIlya IoshikhesDeputy EditorPLOS Computational Biology***********************A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:[LINK]Reviewer's Responses to QuestionsComments to the Authors:Please note here if the review is uploaded as an attachment.Reviewer #1: I would like to thank the Authors for responding to the earlier comments. Due to their edits, and due to their responses to both Reviewers’ comments, the manuscript has improved substantially, and it has become more understandable. Now I would like to make some further comments, addressing which could make the manuscript acceptable for publication.(1) Are there any natural systems what share the architecture of this synthetic system? It would be great to mention a couple of examples. That could increase the relevance of the research.(2) The manuscript would become much clearer and more readable if the Results is divided into subsections, each with a subtitle. Possible subsections could have subtitles like 1: Constructing the experimental system; 2: Single cell expression measurements; 3: Deterministic and stochastic mathematical modeling; 4: Noise decomposition; 5: Modeling distributions; 6: The effect of translational inhibition.(3) Is there a native LacI gene in this E. coli strain? The lacI+ feature suggests this. How would native LacI expression affect the system? Likewise, there seems to be a tetR gene in the cells, as suggested by the tetR+ notation in the Methods. If TetR is expressed in the cells, it will bind Tetracycline (TC) and it will prevent translational inhibition by TC. That effect will contribute to the growth rate vs. TC profile in Figure 3C. Does TetR regulate any gene in the cells?(4) Error bars are not shown in figures, with the justification that the noise (CV^2, standard deviation, etc.) represents the error bars. However, this is not a standard way to omit error bars. Error bars should arise from biological replicates instead of single cell values deviating from the average. Therefore, all the mean and noise curves should have error bars representing the deviations of biological replicates from a mean. In other words, there should be a mean of the mean, a mean of the noise, a standard deviation of the mean and a standard deviation of the noise calculated from at least 3 biological replicates. Likewise, error bars around simulated values could arise from repeated stochastic simulations. See Murphy et al., PNAS 104 (31) 12726-12731 (2007) and Murphy et al., Nucleic Acids Res. 38(8): 2712–2726 (2010).(5) While the theoretical approach is interesting, it would be helpful to also try some stochastic simulations of the system. Such simulations might predict at least the intrinsic noise behaviors without TC.(6) Adding TC will generate single-cell differences in cell division rate. The overall growth rate will be the average of all single cell division rates. In other words, there is a fitness average and a fitness noise. In the case of TC treatment, how large is the single-cell fitness noise? These concepts introduced in PLoS Comput. Biol. 8(4): e1002480 (2012) and then revisited in PLoS Comput Biol. 12(3): e1004825 (2016) and PNAS 115 (45) E10797-E10806 (2018) would be worth studying or at least discussing for TC. The fitness noise due to TC could be a source of extrinsic noise that correlates the extrinsic fluctuations of eBFP and sfGFP.(7) In Equations (6), the constants for extrinsic noise are strong assumptions. Are these constants assumed the same for all IPTG and TC conditions? It may be an issue if TC affects dilution mean & noise.(8) There are some minor grammar issues that would be helpful to address. For example, in line 51: “pursuit” may be better; line 60: “progress has been made”; in line 119: “MS2CP being” would be better. Also, some clarifications would be useful, such as on line 150: “the total noise of both eBFP and sfGFP”, on line 182: “inverse of the noise (equal to mean^2/variance)”.Reviewer #2: 1. In response to the criticism that the manuscript appeared not to beperforming stochastic modeling, the authors have clarified that equation 6has been derived by modeling mRNA/protein concentration dynamics using theLangevin equation. Expressions for the variance of eBFP2 and sfGFP havebeen determined analytically by computing the power spectra and using theWiener-Khintchine theorem.I went back to the original submission and looked again and "Langevin" isnot mentioned even once in the entire manuscript. The results state (lines115-117 of original ms):"We then constructed a mathematical model relying on a series ofalgebraic equations from basics on the biochemistry of gene expression andmolecular noise propagation [7]"This sentence can describe almost any model in computational biology. Evenin the Methods section, which should describe the *methods* in sufficientdetail so that the work may be reproduced, the sentence describing themodel states:"The expressions for the variances in gene expression (or fluorescencesignal) can be derived by following some basic calculations [3,7]."If the authors do not specify what "basic calculations" were performed, thereaders, including the reviewer, are free to use their imaginations. As ithappens, the same expressions for variance can be derived by errorpropagation, with very similar assumptions/approximations (sources of noiseare independent, linear approximation).The authors have included a derivation of equation 6 in Appendix S1 of therevised manuscript. The authors still fail to mention/describe the modelingmethod or its underlying assumptions, apart from the very genericdescriptions quoted above, in the results or methods. The reader should nothave to refer to other papers or read the appendices to know at a highlevel the method used to arrive at equation 6, which is central to theanalysis presented in the paper. This is also important for criticallyassessing the manuscript. For example, one crucial assumption is that thenoise sources are independent. This is not strictly true since eBFP2 andsfGFP share extrinsic noise sources and so the regulation noise from eBFP2is not independent of extrinsic noise. In this, the manuscript appears todiverge from the treatment in Pedraza and van Oudenaarden (Science,307:1965), who modeled transmitted noise as strongly correlated with global(equivalent to extrinsic in this ms's terminology). These assumptions were notmentioned at all in the original submission and are only mentioned in the appendixin the revised version.The authors should include a short description of the Langevin formalism and itsassumptions in the results and methods sections. They have alreadydescribed how they chose the variances for the different noise terms (lines418-424). A description of the analytical methodology and its assumptionsis missing. It would also be useful to discuss the limitations imposed bythese assumptions in the Discussion section.2. The lack of sufficient description of the modeling also explains theconfusion regarding the terms "intrinsic" and "extrinsic". The impressionone gets from the manuscript (lines 149-151 of revised ms):"For that, we first decomposed the total noise into three fundamentalcomponents: extrinsic noise, intrinsic noise, and regulation noise."is that it is *always* possible to decompose thenoise into three components, which is not correct. Without any furtherassumptions, noise can be decomposed into two components (Swain et al. PNAS99: 12795) but not three. To achieve the decomposition carried out in thems, it is necessary to make the additional assumption that "extrinsic","intrinsic", and regulatory random variables are independent. A betterwording might be something like:"We modeled stochastic mRNA/protein expression using Langevin equation.The rate of change of mRNA concentration was subject to fluctuations fromthree sources: .... Assuming that the sources are independent, thenoise can be decomposed into three contributions"The authors have cited Pedraza and van Oudenaarden (Science,307:1965), who defined the noise sources more carefully as global,intrinsic, and plasmid. The authors should consider following that examplesince "extrinsic" means all sources that are not inherent to thetranscription/translation process (Elowitz et al., Science 297:1183,Swain et al. PNAS99: 12795) and includes, among other things, transmitted noise.3. The authors have satisfactorily addressed all the other points raised inthe review.4. A general comment: part of the difficulty of understanding themanuscript has been that it has been written very tersely and is stingywith explanations, yet it shies away from technical/mathematicaldescriptions in the main text. Anything the authors can do to expandtheir explanations would help.For example, it is indeed true that the authors noted that the lower noisein sfGFP must be due to a high transcription rate (lines 146-148, originalsubmission), but it is stated in such an off-hand manner:"This suggested that with a translational control the noise of the regulator isbuffered, as the regulated gene is transcribed at high levels."and without reference to the theoretical results that it was easily missed.It would be better to expand this explanation a bit and point out that 1)the noise is inversely related to the shape parameter of the Gammadistribution and 2) that the shape parameter is proportional to thetranscription rate (ideally using the mathematical expressions).**********Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.Reviewer #1: YesReviewer #2: Yes**********PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.If you choose “no”, your identity will remain anonymous but your review may still be made public.Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.Reviewer #1: NoReviewer #2: NoFigure Files:While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.Data Requirements:Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.Reproducibility:To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocolsReferences:Review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript.If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.3 Feb 2022Submitted filename: cover letter final revison.pdfClick here for additional data file.7 Apr 2022Dear Dr Rodrigo,We are pleased to inform you that your manuscript 'Gene regulation by a protein translation factor at the single-cell level' has been provisionally accepted for publication in PLOS Computational Biology.Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology.Best regards,David M. UmulisAssociate EditorPLOS Computational BiologyIlya IoshikhesDeputy EditorPLOS Computational Biology***********************************************************Reviewer's Responses to QuestionsComments to the Authors:Please note here if the review is uploaded as an attachment.Reviewer #1: I would like to thank the Authors for addressing my comments. I would like to recommend the publication of the revised manuscript.Reviewer #2: All my concerns have been addressed.**********Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.Reviewer #1: YesReviewer #2: None**********PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.If you choose “no”, your identity will remain anonymous but your review may still be made public.Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.Reviewer #1: NoReviewer #2: No29 Apr 2022PCOMPBIOL-D-21-00828R2Gene regulation by a protein translation factor at the single-cell levelDear Dr Rodrigo,I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!With kind regards,Anita EstesPLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol
Authors: Jesse G Zalatan; Michael E Lee; Ricardo Almeida; Luke A Gilbert; Evan H Whitehead; Marie La Russa; Jordan C Tsai; Jonathan S Weissman; John E Dueber; Lei S Qi; Wendell A Lim Journal: Cell Date: 2014-12-18 Impact factor: 41.582