Literature DB >> 34919538

Genome-wide gene expression noise in Escherichia coli is condition-dependent and determined by propagation of noise through the regulatory network.

Arantxa Urchueguía1,2, Luca Galbusera1,2, Dany Chauvin1,2, Gwendoline Bellement1,2, Thomas Julou1,2, Erik van Nimwegen1,2.   

Abstract

Although it is well appreciated that gene expression is inherently noisy and that transcriptional noise is encoded in a promoter's sequence, little is known about the extent to which noise levels of individual promoters vary across growth conditions. Using flow cytometry, we here quantify transcriptional noise in Escherichia coli genome-wide across 8 growth conditions and find that noise levels systematically decrease with growth rate, with a condition-dependent lower bound on noise. Whereas constitutive promoters consistently exhibit low noise in all conditions, regulated promoters are both more noisy on average and more variable in noise across conditions. Moreover, individual promoters show highly distinct variation in noise across conditions. We show that a simple model of noise propagation from regulators to their targets can explain a significant fraction of the variation in relative noise levels and identifies TFs that most contribute to both condition-specific and condition-independent noise propagation. In addition, analysis of the genome-wide correlation structure of various gene properties shows that gene regulation, expression noise, and noise plasticity are all positively correlated genome-wide and vary independently of variations in absolute expression, codon bias, and evolutionary rate. Together, our results show that while absolute expression noise tends to decrease with growth rate, relative noise levels of genes are highly condition-dependent and determined by the propagation of noise through the gene regulatory network.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34919538      PMCID: PMC8719677          DOI: 10.1371/journal.pbio.3001491

Source DB:  PubMed          Journal:  PLoS Biol        ISSN: 1544-9173            Impact factor:   8.029


Introduction

It is by now well established that isogenic cells growing in a homogeneous environment show cell-to-cell fluctuations in gene expression (for example, [1-4]). This gene expression noise is not surprising from a biophysical perspective, given the inherent thermodynamic fluctuations in the molecular events underlying gene expression and the small numbers of molecules involved. In the simplest models of gene expression, where promoters are transcribed at a constant rate, the “intrinsic” noise in gene expression would simply grow in proportion to the square root of a gene’s absolute expression level (for example, [5]). However, even in bacteria where the gene expression process is considerably simpler than in eukaryotes, genes typically exhibit significantly higher levels of transcriptional noise, indicating that transcription rates fluctuate in time and across cells due to “extrinsic” factors [1]. Moreover, studies of genome-wide gene expression noise in bacteria have shown that genes with the same absolute expression can exhibit different noise levels and that the transcriptional noise of a gene is to a substantial extent encoded in its promoter sequence [6-9]. However, how the promoter sequence of a gene determines its transcriptional noise and what factors are the main drivers of differences in transcriptional noise remains largely unknown. In addition, because genome-wide studies have so far focused on gene expression noise in a single growth condition, it is currently not clear to what extent gene expression noise in bacteria is condition-dependent. That is, we do not know to what extent absolute noise levels vary across growth conditions and whether genes with the highest noise in one condition also exhibit the highest noise in other conditions. A systematic investigation into the condition dependence of genome-wide gene expression noise may provide important insights into what drives both absolute and relative noise levels of promoters. For example, it is possible that transcriptional noise is mostly driven by fluctuations in general factors, for example, the concentrations of RNA polymerases and nucleotides, and the overall state of the DNA. For example, it has been suggested that noise levels in yeast are mainly determined by basic promoter architecture and associated nucleosome positioning (see [10] and citations therein). Similarly, since supercoiling of the DNA has been reported to control the sizes of transcriptional bursts in Escherichia coli [11], it is conceivable that a promoter’s noise properties depend on its sensitivity to supercoiling. If differences in transcriptional noise across promoters result mainly from differences in the sensitivity of promoters to such global factors, then one would expect the same promoters to show highest noise across conditions. Alternatively, instead of a promoter’s noise level being an intrinsic feature of its architecture, a promoter’s noise might be determined by the way it is regulated in a given condition. Since the transcription rate of a promoter will generally depend on the binding of transcription factors (TFs), a promoter’s transcription rate will fluctuate as TFs stochastically bind and unbind to it. The rates of binding and unbinding of TFs in turn depend on average expression levels and fluctuations in expression levels of TFs across cells [8,12-14]. Consequently, fluctuations in both the expression levels of TFs and their binding to promoter regions will thus unavoidably propagate to fluctuations in expression of their target genes [15-19]. That noise propagation may play an important role for genome-wide gene expression noise was suggested by results we obtained in a previous study in which we measured genome-wide gene expression noise of E. coli promoters in a single growth condition and compared this with expression noise of synthetic promoters that were selected from a large library of 100 to 150 bp random sequence fragments [9]. We not only found that the synthetic promoters generally exhibited low expression noise, but also found that native promoters with high expression noise tended to have more known regulatory inputs from TFs than genes with low expression noise. To explain these observations, we developed an evolutionary theory in [9] explaining why natural selection may favor noisy gene regulation in many situations. However, to what extent genome-wide gene expression noise is indeed determined by noise propagation is currently unclear, and one of the motivations of this study is to systematically investigate this experimentally. As TFs change their expression levels across growth conditions, so will the fluctuations in their binding at their target promoters. Consequently, a key characteristic that distinguishes noise propagation from other sources of expression noise is that this noise will be highly condition-dependent. Therefore, a systematic investigation of how genome-wide noise levels of promoters vary across condition should directly provide insights into the role of noise propagation. To investigate the condition dependence of gene expression noise and elucidate the roles of both global factors and noise propagation, we systematically quantified genome-wide gene expression noise in E. coli across 8 different conditions that represent a wide range of growth rates and include different nutrients, different types of stress, and stationary phase.

Results

Expression noise levels vary substantially across conditions and systematically decrease with growth rate

Using methodology already employed in several previous studies [7,9,20], we used flow cytometry together with a library of fluorescent transcriptional reporters [21] to measure gene expression distributions of E. coli promoters genome-wide across a set of 8 different growth conditions (Fig 1A). The library of fluorescent reporters consists of most of E. coli’s intergenic regions inserted upstream of a strong ribosomal binding site and a fast-folding GFP on a low copy number plasmid. As we have shown previously [9], the GFP levels of these reporters reflect transcriptional activity, since translation and mRNA decay rates vary little across these reporters, which have almost identical mRNAs.
Fig 1

Genome-wide expression noise of E. coli promoters varies significantly with growth condition.

(A) For each growth condition and E. coli promoter, we used flow cytometry to measure the distribution of GFP levels across single cells of the corresponding fluorescent reporter. The 8 growth conditions comprised synthetic rich media, minimal media with different carbon sources, an osmotic and DNA damage stress, and 2 time points in stationary phase. (B) Mean (x-axis) and variance (y-axis) of log GFP levels for all promoters with expression above background level for growth in M9 0.2% lactose (see Fig G in S1 Text for results in all conditions). The blue line shows the fitted minimal variance as a function of mean expression and the corresponding noise floor a is indicated with an arrow. The insets show distributions of log-GFP levels for 2 example promoters. (C) The noise floor a as a function of the growth rate in the respective condition (stationary phase at 30 h not shown). The dotted line indicates a linear fit (with Pearson squared correlation coefficient R2 indicated). (D) To compare noise of promoters with different means, we defined the noise level of a promoter as the difference between its variance and the fitted minimal variance at its mean expression. Shown are noise levels versus mean for promoters in M9 0.2% lactose. (E) Noise level distributions of the full library in each of the measured conditions. The horizontal lines indicate the medians. The vertical scale is clipped at 0.35 for better visibility (Fig H in S1 Text has the full distributions). The underlying data for Fig 1 can be found in S1 Data and https://doi.org/10.5281/zenodo.4662163.

Genome-wide expression noise of E. coli promoters varies significantly with growth condition.

(A) For each growth condition and E. coli promoter, we used flow cytometry to measure the distribution of GFP levels across single cells of the corresponding fluorescent reporter. The 8 growth conditions comprised synthetic rich media, minimal media with different carbon sources, an osmotic and DNA damage stress, and 2 time points in stationary phase. (B) Mean (x-axis) and variance (y-axis) of log GFP levels for all promoters with expression above background level for growth in M9 0.2% lactose (see Fig G in S1 Text for results in all conditions). The blue line shows the fitted minimal variance as a function of mean expression and the corresponding noise floor a is indicated with an arrow. The insets show distributions of log-GFP levels for 2 example promoters. (C) The noise floor a as a function of the growth rate in the respective condition (stationary phase at 30 h not shown). The dotted line indicates a linear fit (with Pearson squared correlation coefficient R2 indicated). (D) To compare noise of promoters with different means, we defined the noise level of a promoter as the difference between its variance and the fitted minimal variance at its mean expression. Shown are noise levels versus mean for promoters in M9 0.2% lactose. (E) Noise level distributions of the full library in each of the measured conditions. The horizontal lines indicate the medians. The vertical scale is clipped at 0.35 for better visibility (Fig H in S1 Text has the full distributions). The underlying data for Fig 1 can be found in S1 Data and https://doi.org/10.5281/zenodo.4662163. The growth conditions (see SI Methods and Texts in S1 Text) were chosen to span a wide range of growth rates (Fig A in S1 Text), cell physiologies (Fig B in S1 Text), and regulatory states. They consist of MOPS synthetic rich media, M9 minimal media with 3 different carbon sources (0.2% glucose, 0.2% glycerol, and 0.2% lactose), 2 stresses (sub-MIC antibiotic: ciprofloxacin 1.5 ng/ml + 0.2% glucose and osmotic: 0.4 M NaCl + 0.2% glucose), and 2 time points in stationary phase (after 16 h and 30 h of growth in 0.2% glucose, respectively). We used microscopy to image cells from each growth condition and found that, consistent with the known relationship between growth rate and cell physiology [22], cell size generally increased with growth rate (Fig C in S1 Text). For each condition and each promoter, we used high-throughput flow cytometry to measure GFP levels for thousands of single cells. Apart from the 2 stationary phase conditions, all measurements were taken during mid-exponential phase. In total, we gathered 50′000 single-cell measurements for each of the 1,810 promoters in the library across 8 conditions, including some conditions in replicate. As observed previously [9], the fluorescence distributions can be well fitted with log-normal distributions, and we thus characterized each fluorescence distribution by the mean and variance of log-fluorescence. We note that, since flow cytometry measurements are themselves noisy, inferring means and variances from the raw measurements requires careful computational procedures, and we here use a set of procedures that we recently developed [23]. These include using forward and side scatter to identify events corresponding to cells and fits the log-fluorescence distribution by a mixture of a Gaussian and uniform distributions to remove possible outliers (for example, contaminants and nongrowing cells), as described in [23]. Replicate measurements performed on different days were highly reproducible, with Pearson squared correlations R2>0.99 for the mean between replicates in all conditions and squared correlations for the variance ranging from R2 = 0.85 to R2 = 0.95 (Fig D in S1 Text). In order to determine whether this variability derived mainly from biological variation from day to day or from measurement noise, we performed a time course experiment where we repeatedly measured the same culture at different time points during exponential growth and found that both the mean and variance measurements were extremely reproducible in these experiments (Fig E and F in S1 Text). This implies that variation in measurements from different days are mostly due to uncontrolled biological variation, and not measurement noise. This also implies that genes exhibit more biological variation in their noise levels across days than in their mean expression. To illustrate the typical form that the distribution of means and variances of log-expression across promoters takes, Fig 1B shows the variance as a function of mean for each promoter measured in M9 minimal media + 0.2% lactose (see Fig G in S1 Text for all conditions). Note that the variance in log-expression is equal to the square of the coefficient of variation (CV2) whenever fluctuations are small relative to the mean [9]. This approximation applies in our data, as the majority of promoters (approximately 75% across all conditions) have a variance smaller than 0.3 (Fig G in S1 Text). As has been observed in previous studies [6,9,24,25], we find that there is a clear lower bound on noise as a function of the mean expression level of the promoter (Fig 1B), which decreases with mean, and asymptotes to a fixed lower bound at high mean expression. A qualitatively similar curve is observed in all growth conditions (Fig G in S1 Text). As derived previously [9] and explained in the S1 Text, the functional form of the minimal variance as a function of mean expression can be derived, assuming that GFP variance is the sum of 2 terms: one “multiplicative” contribution with variance proportional to the square of the mean expression, and one “Poissonian” contribution with variance proportional to mean expression. The Poissonian term, whose magnitude we denote by b and is often referred to as the “intrinsic noise” term, could in principle derive from intrinsic expression noise whose magnitude scales proportional to mean expression [6,26]. However, by comparing microscopy and flow cytometry measurements we have recently shown that, at these expression levels, the component b derives almost entirely from the measurement noise of the flow cytometer [23]. We will refer to the multiplicative term as the “noise floor” a, which is often referred to as an “extrinsic noise” contribution. In contrast to the Poissonian term, whose contribution decreases with increasing mean and is negligible for highly expressed promoters, the contribution of the noise floor is independent of expression mean and corresponds to the minimal variance for highly expressed promoters. As shown in Fig G in S1 Text, the same functional form describes the minimal variance in all conditions, and we estimated the noise floor a for each condition. We observed that the noise floor a systematically decreases with growth rate over the entire range of growth rates. Although we currently lack a theoretical model for how this noise floor depends on growth rate, we noted that the dependence is well fit by a simple linearly decreasing function (R2 = 0.96; Fig 1C). However, we stress that this is only a phenomenological observation valid for the growth conditions considered here and that it is currently unclear whether this relationship generalizes to other conditions, for example, when growth rate is modulated by subinhibitory levels of antibiotics. The noise floor a likely reflects the minimal noise that every promoter is subject to due to general fluctuations in the physiological state of the cell including overall transcription, translation, mRNA decay, and growth [1,6]. Since we are measuring total protein levels per cell, one possible contribution to the noise floor is the variation in cell sizes. Although average cell size increases systematically with growth rate (Fig C in S1 Text), we find that the coefficient of variation of cell size does not vary much across conditions and shows no correlation with either the growth rate or the noise floor (Fig I in S1 Text). Therefore, changes in the cell size distribution do not explain the decrease of the noise floor with growth rate. Since our reporter constructs use a low copy number plasmid, some of the observed variation in expression levels may derive from plasmid copy number fluctuations. We note that, since the only differences between the reporter constructs are the short promoter sequences upstream of the GFP gene, all differences in the log-expression means and variances of different promoters within a given condition must be due to the differences in their promoter sequences. However, it is conceivable that plasmid copy number variations contribute significantly to the noise floor across conditions. As detailed in the S1 Text, we tested this hypothesis by selecting a set of promoters that were observed to have noise near the noise floor across all conditions, created chromosomal constructs for these promoters, and systematically compared mean and variance in log-expression of these chromosomal constructs with the corresponding plasmid-based reporters. As shown in Fig J in S1 Text, we find that whereas mean expression levels of the plasmid reporters are consistently about 6.5 times higher than the corresponding chromosomal reporters, the noise levels of the plasmid and chromosomal constructs are very similar, with differences generally within the error bars. These results show that plasmid copy number noise is either similar to the chromosomal copy number noise or that the copy number noise is small compared to other factors that determine the noise floor. An anticorrelation between noise and growth rate, similar to the one we observe here, has previously been observed in eukaryotes but was proposed to derive from heterogeneity in cell cycle stage [27]. However, our results show that this general anticorrelation between noise and growth rate also occurs in prokaryotes that do not have analogous cell cycle stages. In order to have a measure of the relative levels of noise of genes that is not confounded by the systematic dependence on mean expression, we defined the noise level N of promoter p in condition c as the difference between its variance in log-fluorescence and the noise floor, that is, the minimal variance at its mean expression level (see S1 Text, equation (3)). As shown in Fig 1D, the noise levels N indeed no longer show any systematic dependence on mean expression, and this is observed across all conditions (Fig G in S1 Text). Fig 1E shows the distribution of noise levels N in each of the conditions, sorted from high to low growth rate. We see that not only the noise floor, but also the distribution of noise levels on top of this noise floor varies substantially across conditions. Moreover, like the noise floor, both the median of the noise levels N as well as the variability in noise levels increase as the growth rate decreases, for example, the noise levels are lowest in synthetic rich conditions (p = 3×10−30, Wilcoxon rank-sum test) and highest at 30 h of stationary phase (p = 5×10−68, Wilcoxon rank sum test). That is, not only do minimal noise levels increase as growth rate decreases, the variability in noise levels across genes increases as well. The only exception to this general trend is the osmotic stress condition M9 + 0.4 M NaCl, which has relatively low variability in noise levels N compared to other conditions with similar growth rate (Fig 1E), even though its noise floor is not deviating from the general dependence on growth rate. These results show that the physiological state of the cell has a major influence on the distribution of absolute noise levels and that both the mean and variation in noise levels generally decreases with growth rate. We now turn to investigating how the relative noise levels of different promoters vary across the measured conditions.

Individual promoters show highly diverse changes in noise across conditions

If changes in noise levels across conditions were mostly driven by fluctuations in global factors such as concentrations of RNA polymerase, we would expect different genes to exhibit coherent changes in noise across conditions. For example, relative noise levels of different genes may remain relatively unchanged across conditions, or alternatively, noise levels might rescale across conditions as a function of the mean expression of the gene in the condition. However, this is not what we observe. Instead, different promoters show highly diverse changes in their noise levels across conditions (Fig 2).
Fig 2

Individual promoters show diverse patterns of variation in noise levels across conditions.

(A) Scatter plot showing the expression plasticity (variance across conditions, horizontal axis) and noise (variance in noise across conditions) of all measured promoters. (B-G) Examples of condition-dependent mean and noise of individual promoters. Each panel shows the noise level as a function of mean across conditions (colors; see legend) for one promoter, with the gene regulated by the promoter indicated in each panel. Error bars denote standard errors of the estimates based on biological replicate measurements. Each of the 3 pairs of panels indicate different types of behavior in mean and noise across conditions, as described at the top of each pair of panels. The underlying data for Fig 2 can be found in S1 Data and https://doi.org/10.5281/zenodo.4662163.

Individual promoters show diverse patterns of variation in noise levels across conditions.

(A) Scatter plot showing the expression plasticity (variance across conditions, horizontal axis) and noise (variance in noise across conditions) of all measured promoters. (B-G) Examples of condition-dependent mean and noise of individual promoters. Each panel shows the noise level as a function of mean across conditions (colors; see legend) for one promoter, with the gene regulated by the promoter indicated in each panel. Error bars denote standard errors of the estimates based on biological replicate measurements. Each of the 3 pairs of panels indicate different types of behavior in mean and noise across conditions, as described at the top of each pair of panels. The underlying data for Fig 2 can be found in S1 Data and https://doi.org/10.5281/zenodo.4662163. Following the general usage of the word plasticity to refer to the adaptability of the phenotype to changes in the environment, we will refer to the variance of a promoter’s mean and noise level across conditions as the plasticity of its mean and noise. The plasticity of both mean and noise vary over a substantial range across promoters, without any clear systematic dependence between these quantities. Analogous scatter plots for the variation and dependence between average expression, average noise, and the plasticities in mean and noise show that all these quantities vary substantially across promoters (Fig K in S1 Text). That is, individual promoters show highly distinct variation in their mean and noise across conditions, and Fig 2B–2G shows some examples of the different behaviors we observe. Note that all observations in these panels have error bars that show the standard error of measured mean and noise across biological replicates. We observe promoters that are low noise in almost all conditions, either with high plasticity in mean (Fig 2B) or low plasticity in mean (Fig 2C). Other promoters show high noise with plasticity in both the mean and noise level, without clear correlation between mean and noise level (Fig 2D and 2E). But many other patterns of behavior can be observed, such as promoters that show only low noise when the promoter has high mean (Fig 2F) or only low noise when the promoter has low mean (Fig 2G). The growth media were not predictive for how individual genes were going to change their mean and noise. For example, while overall the whole library is shifted towards lower noise in synthetic rich media, individual genes can show higher noise in this condition compared to other conditions (for example, Fig 2B and 2F). We highlighted this particular condition as an example, but the same observation applies to others. These observations indicate that global changes in the cell physiology or in the expression level cannot explain how the noise of a promoter varies across conditions. This implies that there is a promoter-specific source of noise shaping condition-dependent gene expression variability. Just as the plasticity in mean expression derives from gene regulation, one obvious hypothesis is that this promoter-dependent source of condition-dependent noise derives from gene regulation as well.

Noise propagation predicts that relative noise levels are condition-dependent

As mentioned in the introduction, the mechanistic basis for gene expression regulation is that the binding and unbinding of TFs to a promoter causes the transcription rate from this promoter to change. Consequently, fluctuations in the expression levels of TFs and their binding to promoter regions will unavoidably propagate to fluctuations in the expression of their target genes [8,12-19]. While the general decrease of absolute noise levels with growth rate (Fig 1C and 1E) is likely due to general physiological fluctuations that affect all promoters, the highly diverse changes in the relative noise levels of different promoters across conditions (Fig 2) is exactly what is expected to occur under a noise propagation scenario (Fig 3).
Fig 3

Signatures of condition-dependent noise propagation.

(A) We imagine a scenario in which 2 promoters are each regulated by a single transcription factor (TF1 or TF2). In growth condition 1, TF2 shows a higher variability in its activity (orange distribution) than TF1 (blue distribution). As a result, its target (gene B, yellow) will show higher expression variability than the target of TF1 (gene A, pink). (B) If the relative levels of variability in the activities are reversed in a different condition, the relative noise levels of target genes A and B will likewise be reversed. That is, noise propagation can explain why transcriptional noise is highly condition-dependent. (C) Because the noise of a target gene depends on fluctuations in activities of all of the TFs that regulate it, promoters that are more regulated will typically show higher noise levels in all conditions. The illustration shows a promoter controlling the expression of gene C (green) which is regulated both by TF1 (blue) and TF2 (orange). Since at least one of these TFs is highly variable in each condition, gene C will exhibit high noise levels in both conditions.

Signatures of condition-dependent noise propagation.

(A) We imagine a scenario in which 2 promoters are each regulated by a single transcription factor (TF1 or TF2). In growth condition 1, TF2 shows a higher variability in its activity (orange distribution) than TF1 (blue distribution). As a result, its target (gene B, yellow) will show higher expression variability than the target of TF1 (gene A, pink). (B) If the relative levels of variability in the activities are reversed in a different condition, the relative noise levels of target genes A and B will likewise be reversed. That is, noise propagation can explain why transcriptional noise is highly condition-dependent. (C) Because the noise of a target gene depends on fluctuations in activities of all of the TFs that regulate it, promoters that are more regulated will typically show higher noise levels in all conditions. The illustration shows a promoter controlling the expression of gene C (green) which is regulated both by TF1 (blue) and TF2 (orange). Since at least one of these TFs is highly variable in each condition, gene C will exhibit high noise levels in both conditions. Let us consider a simple scenario in which 2 individual genes are each regulated by one TF, that is, gene A is regulated by TF1 and gene B by TF2 (Fig 3A). As the activities of these TFs fluctuate within a given condition, these fluctuations can propagate to their respective targets. For example, in a condition where TF1 exhibits less variation in activity from cell to cell than TF2, gene A will generally exhibit less expression noise than gene B (Fig 3A). In anticipation of analysis presented below, it is important to stress that the distribution of “TF activity” shown in Fig 3A is only a schematic representation of a much more complicated biophysical process at the molecular level, and different target promoters of the same TF might respond very differently to fluctuations in the TF’s “activity.” Roughly speaking, the extent to which a TF X will propagate noise to a given target promoter Y depends on how much the binding of TF X to promoter Y fluctuates in time and across cells and how much the transcription rate of promoter Y depends on these fluctuations in binding of TF X. For example, if promoter Y is already strongly repressed or activated by another TF, the binding of TF X may be irrelevant for its transcription, and TF X will not propagate noise to promoter Y. Even if the transcription rate of Y is sensitive to binding of TF X, it may still be that binding affinities of the sites in promoter Y are so weak that the promoter is essentially never bound or so strong that it is essentially always bound, even if the concentration of TF X fluctuates from cell to cell. Only those target promoters of X for which the transcription rate is both sensitive to the binding of TF X, and for which the binding of TF X fluctuates significantly, will experience significant increase in their noise levels. Thus, the amount of noise propagation from a given TF X to a given target promoter Y is a complex context-dependent function, and only a subset of the promoters that are targeted by TF X will indeed respond to fluctuations in the activity of TF X in a given condition. These considerations make clear that, in general, we expect the extent to which different TFs propagate noise to different target promoters to be highly condition-dependent. For example, for the simple scenario imagined in Fig 3A, we can easily imagine that, in another condition, TF1 may show higher variability than TF2, such that the noise levels of their targets would change accordingly (Fig 3B). In other words, if gene expression noise is to a large extent determined by noise propagation from regulators to their targets, then this would explain why relative noise levels of genes can vary in a complex manner across conditions, because we expect both the noise levels of different regulators and the sensitivity to this noise at different promoters to vary across conditions. In summary, we propose that the qualitative patterns in expression noise across conditions that we observed in Fig 2 and Fig K in S1 Text can be explained by assuming that noise levels are to a large extent determined by propagation of noise from regulators to their targets. The hypothesis that noise propagation is responsible for the observed condition-dependent relative noise levels makes a number of additional predictions. First, constitutive promoters, that is, promoters that are not targeted by any TF other than the sigma factor of the RNA polymerase, should exhibit low noise in each condition and relatively little plasticity in their noise levels. Second, the larger the number of regulators that target a given promoter, the larger the chance that the promoter will be sensitive to fluctuations in the activities of at least one of these TFs (Fig 3C). Thus, more noisy promoters are in general expected to have more regulatory inputs. In addition, because all regulatory inputs of a promoter can change their noise levels in a condition-dependent manner, we also expect that, the more regulatory inputs a promoter has, the higher the plasticity of its noise level will be. Finally, to the extent that the regulatory inputs of each promoter are known, it should be possible to explain why some promoters are more noisy in one condition, and other promoters more noisy in another condition, and identify which TFs are most responsible for noise propagation in different conditions. In the next section, we investigate whether our data indeed exhibit these properties.

Noise propagation explains the condition-dependent noise levels of genes

In a previous work [9], we found that, for cells growing in minimal media with glucose, more noisy genes generally have more regulatory inputs, and we here checked whether these observations generalize to multiple growth conditions. We sorted promoters by their noise levels and used the regulatory site annotation from RegulonDB [28] to calculate the average number of known regulatory inputs of genes with noise levels N above a certain cutoff level, as a function of the cutoff level (Materials and methods). We find that in all 8 conditions, the number of known regulatory inputs systematically increases with noise levels (Fig 4A and Fig L in S1 Text). Notably, these differences are highly statistically significant with t-statistics of 4 or higher for the difference between known regulatory inputs for promoters above and below a given noise cutoff across a wide range of cutoffs in each condition (Fig M in S1 Text).
Fig 4

Noise propagation explains condition-dependent noise levels.

(A) More noisy promoters tend to have more regulatory inputs. We sorted promoters by their average noise across the 8 conditions and calculated the mean (y-axis) and standard error (gray area) of the average number of TFs known to regulate the promoters with noise level above , as a function of (x-axis). (B) The fraction of regulated promoters increases with higher levels of noise. We sorted promoters as in panel A and calculated the fraction (y-axis) and standard error (gray area) of the number of promoters with at least 1 regulatory input with noise level above , as a function of (x-axis). (C) The noise plasticity increases with number of regulatory inputs of the promoter. Shown are the cumulative distributions of the variance in noise across the 8 conditions for promoters with no known regulatory inputs (blue), 1 or 2 known regulators (yellow), and 3 or more known regulators (red). (D) The Motif Activity Response Analysis model explains a significant fraction of the variation in noise levels. Shown is the percentage of explained variance (FOV %, y-axis) in each of the 8 conditions (x-axis) after running the model on the real dataset (gray bars) and on randomized data (orange bars). Randomized data were generated by shuffling the association between regulatory inputs and expression noise multiple times and shown is the average value obtained +/− its standard error. (E) Table of TFs predicted by the model to significantly propagate noise in a condition-specific manner, that is, with A>δA in only one condition. (F) Average noise propagation activities (, y-axis) and their error bars (, vertical lines) of the strongest 6 noise propagators (with ), sorted by significance (, x-axis), which consistently propagate noise across all 8 conditions. The underlying data for Fig 4 can be found in S1 Data and https://doi.org/10.5281/zenodo.4662163.

Next, we wanted to test whether constitutive promoters exhibit consistently low noise levels. This analysis is complicated by the fact that our knowledge of E. coli’s regulatory network is extremely incomplete, with no known target promoters for almost two-thirds of E. coli’s TFs. Thus, although no known regulatory input is known for almost 60% of E. coli promoters (Fig N in S1 Text), a substantial fraction of these promoters are likely regulated by TFs for which we currently lack information. To obtain a set of promoters that are very likely constitutive we took a random selection of synthetic promoters that we obtained previously by screening a library of 100 to 150 bp random sequence fragments for sequences that drive expression in M9 minimal media with glucose [9] (see Supplementary Methods in S1 Text). We measured mean expression and expression noise of these synthetic promoters across 4 growth conditions and compared their expression plasticity, average noise, and noise plasticity with those of native promoters that have at least one known regulatory input. We found that the synthetic promoters not only have lower expression plasticity (p-value = 1.545e-09, two-sided Welch’s t test), confirming that they are likely constitutive but that both their average noise (p < 2.2e-16, two-sided Welch’s t test) and noise plasticity (p = 6.209e-05, two-sided Welch’s t test) are systematically low in comparison with regulated promoters (Fig O in S1 Text). To test whether all high noise promoters have at least one regulatory input, we calculated what fraction of promoters with noise level over a given cutoff have at least one known regulatory input (Fig 4B and Fig P in S1 Text) and found that 70% to 90% of high noise promoters in each condition have at least one known regulatory input. Given that our current knowledge of the regulatory network only represents one-third of E. coli’s TFs, this strongly suggests that most, if not all, of the high noise promoters are indeed regulated. We next tested to what extent noise plasticity increases with the amount of known regulatory inputs of a promoter. As shown in Fig 4C, we indeed observe that genes with more regulatory inputs show larger noise plasticity compared to genes with few or no known regulatory inputs (p<3.7×10−10, two-sided Welch’s t test). That is, regulated genes are not only more noisy on average, their noise levels are also more regulated across conditions.

Noise propagation explains condition-dependent noise levels.

(A) More noisy promoters tend to have more regulatory inputs. We sorted promoters by their average noise across the 8 conditions and calculated the mean (y-axis) and standard error (gray area) of the average number of TFs known to regulate the promoters with noise level above , as a function of (x-axis). (B) The fraction of regulated promoters increases with higher levels of noise. We sorted promoters as in panel A and calculated the fraction (y-axis) and standard error (gray area) of the number of promoters with at least 1 regulatory input with noise level above , as a function of (x-axis). (C) The noise plasticity increases with number of regulatory inputs of the promoter. Shown are the cumulative distributions of the variance in noise across the 8 conditions for promoters with no known regulatory inputs (blue), 1 or 2 known regulators (yellow), and 3 or more known regulators (red). (D) The Motif Activity Response Analysis model explains a significant fraction of the variation in noise levels. Shown is the percentage of explained variance (FOV %, y-axis) in each of the 8 conditions (x-axis) after running the model on the real dataset (gray bars) and on randomized data (orange bars). Randomized data were generated by shuffling the association between regulatory inputs and expression noise multiple times and shown is the average value obtained +/− its standard error. (E) Table of TFs predicted by the model to significantly propagate noise in a condition-specific manner, that is, with A>δA in only one condition. (F) Average noise propagation activities (, y-axis) and their error bars (, vertical lines) of the strongest 6 noise propagators (with ), sorted by significance (, x-axis), which consistently propagate noise across all 8 conditions. The underlying data for Fig 4 can be found in S1 Data and https://doi.org/10.5281/zenodo.4662163. If noise propagation is responsible for the high condition dependence of the relative noise levels across conditions, then it should in principle be possible to explain changes in the relative noise levels of promoters in terms of their regulatory inputs, and changes in the amount of noise that different TFs are propagating in different conditions. We have previously developed a model, called Motif Activity Response Analysis [29,30], which models gene expression in terms of computationally predicted regulatory sites in promoters genome-wide using a simple linear model, to identify which TFs are most important for driving observed gene expression changes across a set of conditions. We here adapted this approach to investigate whether changes in relative noise levels of promoters across conditions can be explained in terms of changes in the “noise propagating activities” of regulators and to identify which TFs are most important for propagating noise in different conditions. In particular, we used the RegulonDB database [28] to set a binary matrix of known regulatory inputs, that is, S is 1 when promoter p is known to be regulated by TF r and 0 otherwise. We then model the noise N of each promoter p in each condition c as a simple linear function of its known regulatory inputs S and the unknown noise propagating activities A of each regulator r in each condition c: where is the average noise level of all promoters in condition c, is the average of S across all promoters, and ϵ is a noise term that is assumed Gaussian distributed with mean 0 and unknown variance. For each condition c, we then inferred the noise propagating activities A by fitting the model (1) using a Gaussian prior on the activities A to avoid overfitting, which allows us to calculate a full posterior probability distribution over the activities A [30]. There are many reasons why the crude model (1) is extremely unlikely to provide a good quantitative model for the measured noise levels. First, as already mentioned above, our current knowledge of E. coli’s regulatory network is very incomplete with no targets known for almost two-thirds of its TFs, that is, there may well be significantly more regulatory interactions that we do not know about than those that we happen to know about. Second, as discussed in the previous section, the extent to which noise from a given TF propagates to a given target is likely a complex function of the combination of TFs that target a given promoter, the numbers, positions, and affinities of the binding sites for each of these TFs, the concentrations of all these TFs in a given condition, and so on. In particular, it is likely that of all promoters that a given TF targets, only a fraction will be sensitive to the noise in the TF binding in a given condition. However, we currently have no knowledge whatsoever about the extent to which different targets may respond to noise in the TFs that regulate them in a given condition. In absence of such knowledge, Eq (1) makes the crude assumption that each TF will propagate the same amount of noise to all its (known) target promoters and that the total noise of a promoter is simply the sum of the noise propagated by each of the regulators. Note that the latter effectively assumes that the fluctuations in the binding of all TFs are mutually independent, which is also unlikely to be true. Consequently, the aim of the model (1) is not to explain noise levels of individual promoters or to quantify the amount of noise propagated by each TF. Rather, the aim is to test whether this crude model of noise propagation can explain a significant fraction of the variation in noise levels across promoters and to identify which TFs are most responsible for noise propagation in each condition. As shown in Fig 4D (gray bars), in spite of our highly incomplete and rudimentary knowledge of E. coli’s regulatory network, the simple model explains between 10% and 30% of the variance in noise levels across conditions. To confirm the significance of these results, we fit the same model to data in which the association between regulatory inputs and noise levels were randomized by randomly shuffling the rows of the noise matrix N and observed that the fraction of explained variance on the randomized data was always much lower than on the real data (Fig 4D, orange bars). The model of Eq (1) also calculates error bars δA for the estimated noise propagation activities A of each regulator r in each condition c, allowing us to infer which TFs are most significantly propagating noise in each condition and Fig Q in S1 Text shows, for each condition, all TFs for which the noise propagating activity was larger than its error bar, that is, A>δA. Note that, while activity A corresponds to the average amount of additional noise per target that regulator r is predicted to cause in condition c, this should not be interpreted as the typical amount of noise per target. As discussed above, different target promoters will have very different sensitivities to the noise of regulator r, so that the A reflects an average between weak or no noise propagation at many targets and much stronger noise propagation at a subset of the targets of r. Focusing first on TFs that propagate noise in a highly condition-specific manner, Fig 4E lists the 5 TFs that had significant noise propagating activity in only one condition. For several of these TFs, their known functional role is consistent with the prediction that they propagate noise in these specific conditions. To mention the most obvious case, the TF LexA is predicted to propagate noise only in the sub-MIC ciprofloxacin condition. LexA is a repressor of the SOS response genes, and it is known that ciprofloxacin causes DNA damage and induces the SOS response [31]. Since we employed ciprofloxacin at a concentration well below the minimal inhibitory concentration, DNA damage likely only occurred in a subset of the cells, leading to heterogeneity in LexA activity across the cells. Similarly, the model predicted that FlhDC, the master regulator of flagellar biosynthesis [32], significantly propagates noise only in early stationary phase. It is known that flagellar synthesis peaks toward the end of exponential phase and decreases shortly after entry into stationary phase [33]. Since the 16-h condition is a transition between late exponential growth and entry into stationary phase, it seems plausible that some cells had entered growth arrest and were no longer expressing components of the flagellar machinery, while others had not yet transitioned, causing heterogeneity in the expression of targets of FlhDC. The other examples of condition-specific noise propagators are discussed in the S1 Text. In addition to condition-specific noise propagators, we noted that many of the most significant noise propagators were found in multiple conditions (Fig Q in S1 Text). To identify regulators that were consistently contributing to noise propagation in all conditions, we calculated, for each regulator r, its average noise propagating activity averaged over all conditions (SI Methods and Texts in S1 Text). Fig 4F shows the 6 TFs that were most significantly propagating noise in all conditions. As discussed in more detail in the S1 Text, the appearance of many of these TFs likely reflects our experimental setup, that is, growth in minimal media in microtiter plates. For example, the early stationary phase and stress regulator Sigma38 (rpoS) has been shown to have heterogeneous activity across single cells in M9 media with glucose [34]. Similarly, limiting oxygen levels in microtiter plates can lead to production of fermentation products [35,36], which are known to acidify the medium [37], explaining the appearance of GadW and GadX, which are involved in the response to acid stress [38]. The prediction that the histone-like TF H.NS is the most significant noise propagating TF is interesting, given that in eukaryotes, noise properties of different genes have been related to nucleosome organization in their promoters [39]. Although the predicted condition-dependent role of these TFs in propagating noise are, at this point, just hypotheses that require in-depth experimental follow-up to confirm, for several cases, the predicted role in noise propagation by these TFs is highly plausible, given their known functional role, and highlights that the simple model can make concrete predictions about which TFs are most involved in driving gene expression noise in different conditions. In summary, we have presented multiple lines of evidence to confirm that noise propagation plays an important role in determining condition-dependent expression noise genome-wide. Constitutive promoters have consistently low noise and low noise plasticity across conditions. In contrast, across all conditions, we find that the higher the expression noise, the higher the number of known regulatory inputs promoters tend to have. Although almost 60% of promoters have no known regulatory input, 70% to 90% of high noise promoters have at least one known regulatory input. In addition, promoters with more known regulatory inputs also exhibit higher noise plasticity across conditions, indicating that gene regulation causes noise levels to be regulated as well. And finally, in spite of our very limited knowledge of E. coli’s regulatory network, a crude model of noise propagation explains 10% to 30% of the variance in relative noise levels across conditions. Together, these results imply that propagation of noise through the regulatory network is a major determinant of condition-dependent expression noise. That is, not only the mean expression levels of genes are determined by gene regulation, the noise levels of genes are to a substantial extent determined by the structure of the gene regulatory network as well.

Gene features are organized along 2 major axes reflecting average expression and regulation

Previous studies of the genome-wide correlation structure of gene features have uncovered that genes are organized along a one-dimensional axis that relates evolutionary rates, codon bias, and gene expression level [40-43], that is, highly expressed genes tend to have strong codon bias and slowly evolving coding regions, whereas lowly expressed genes tend to have weak codon bias and evolve more rapidly. We next set out to extent such analysis of the genome-wide correlation structure of gene properties, including gene properties associated with gene regulation and expression noise into the analysis, and investigate the interdependence of absolute gene expression, regulation of expression, expression noise, codon bias, and evolutionary rates. We collected a set of features for E. coli genes on a genome-wide scale from the literature including the absolute expression levels at both the RNA [6] and protein level [44], sequence properties such as codon bias and the evolutionary rates at both synonymous and nonsynonymous sites (denoted by dN and dS, repectively) [42], and the number of regulatory inputs of each gene [28]. We then complemented these features with gene expression features that we measured here, including mean expression level, expression plasticity across the 8 growth conditions, the mean expression noise level, and noise plasticity across the 8 growth conditions. In total, we gathered 10 different gene features and then calculated an overall normalized correlation matrix R of correlations between these features, that is, with R the Pearson correlation between features i and j. We then performed Principal Component Analysis (PCA) of the matrix R to characterize the overall genome-wide correlation structure of these gene features. As shown in Fig R in S1 Text, the first 2 principal components capture significantly more of the total variance than the other 8 components, capturing more than 50% of the total variance. That is, the plane spanned by these first 2 PCA axes captures the majority of the variation in the 10-dimensional space of gene features. Moreover, each of these axes corresponds to a weighted average of the 10 gene features, and the fact that the axes are (by construction) orthogonal implies that these combinations of gene features vary independently of each other. We thus next investigated which gene features are associated with these first 2 PCA axes. We find that the first PCA axis corresponds precisely to the previously observed organization of genes by their absolute expression levels, codon bias, and evolutionary rates [40-43] (Fig 5A). That is, 94% of the weight along this first PCA component is accounted for by mean RNA and protein levels, codon bias, and evolutionary rates at synonymous and nonsynonymous sites (Fig 5A). As observed previously, the absolute expression levels and codon bias are positively correlated with each other, while the 2 evolutionary rates dS and dN are negatively correlated with these features (Fig 5C).
Fig 5

Principal component analysis shows gene features are distributed along 2 major axes associated with absolute expression level and gene regulation, respectively.

(A) Relative contribution of the 10 gene features to the first PCA component, sorted from bottom to top. The features in bold together account for 94% of the first component. In green are expression measurements obtained from previous studies, sequence features are in blue, and features measured in this study are in red. (B) As in panel A but now for the second PCA component. (C) Correlation structure of the features contributing to the first PCA component. Negative correlations are in blue and positive correlations in orange. (D) As in panel C but now for the second PCA component. The underlying data for Fig 5 can be found in S1 Data.

Principal component analysis shows gene features are distributed along 2 major axes associated with absolute expression level and gene regulation, respectively.

(A) Relative contribution of the 10 gene features to the first PCA component, sorted from bottom to top. The features in bold together account for 94% of the first component. In green are expression measurements obtained from previous studies, sequence features are in blue, and features measured in this study are in red. (B) As in panel A but now for the second PCA component. (C) Correlation structure of the features contributing to the first PCA component. Negative correlations are in blue and positive correlations in orange. (D) As in panel C but now for the second PCA component. The underlying data for Fig 5 can be found in S1 Data. Strikingly, the second PCA axis corresponds almost entirely to gene features associated with gene regulation and expression noise. Around 94% of this vector’s weight is accounted for by gene expression noise, noise plasticity, plasticity in mean expression, and number of regulatory inputs (Fig 5B). Moreover, we find that these 4 features are all positively correlated with each other on a genome-wide scale (Fig 5D). That is, this second PCA axis organizes genes by their regulation and expression noise. On one end of this axis are constitutively expressed genes that do not change their mean expression level across conditions and have low noise in all conditions, whereas on the other end of the axis are highly regulated genes that have a high number of regulatory inputs, are highly plastic in expression, and have high and varying expression noise across conditions. This result not only shows that gene regulation and expression noise are intimately coupled on a genome-wide scale, confirming the importance of noise propagation for condition-dependent expression noise, it also shows that these gene regulatory features are varying independently of the absolute expression and evolutionary rate features of the first principal axis.

Discussion

Although it is now well established that gene expression is an inherently noisy process, so far little is known in bacteria about how noise levels of genes vary across growth conditions. Here, we used high-throughput flow cytometry in combination with a library of fluorescent transcriptional reporters to quantify expression noise of E. coli promoters genome-wide. The general picture that emerges from our study is that the expression noise of a given gene in a given condition is the sum of two contributions: a minimal amount of noise that derives from global physiological fluctuations and that is approximately equal for all genes, and a highly gene- and condition-specific component that is substantially due to propagation of noise through the regulatory network. Constitutively expressed genes have least expression noise in each condition and consistently exhibit low noise. Consequently, constitutively expressed genes also exhibit least variation in noise levels across conditions. In contrast, regulated genes exhibit additional noise due to noise propagation. As the regulatory network changes its state across conditions, so does the propagation of noise through the regulatory network, causing regulated genes to change their noise levels in a highly condition-dependent manner. That is, our results suggest that the cell’s regulatory network does not only control the mean expression levels of genes across conditions, but also controls the amount of expression noise of each gene, making gene expression noise a regulated quantity. This intimate coupling of expression noise and regulation was underscored by our analysis of the genome-wide correlation structure of various gene features. We found that number of regulatory inputs, expression plasticity, expression noise, and noise plasticity are all positively correlated on a genome-wide scale and that variations in these quantities are indepedent of the correlated variations in average absolute expression, codon bias, and evolutionary rate that has been observed previously [40-43]. We also observed that both the noise floor and the total amount of variation in noise levels systematically decrease with the growth rate of the cells and is highest in the stationary phase (Fig 1). Both its dependence on growth rate, and the fact that this noise floor appears to affect all promoters equally, strongly suggest that the noise floor is driven by global physiological fluctuations, although it is currently not clear which physiological variables contribute most to the noise floor. Our analysis shows fluctuations in cell sizes are similar in all conditions, and our comparison of plasmid-based and chromosomally integrated reporters shows that plasmid and chromosome copy number fluctuations are either similar in size or do not contribute substantially to the noise floor. However, fluctuations in RNA polymerase concentration, ribosome and charged tRNA concentrations, mRNA decay rates, and fluctuations in growth rate itself are all plausible contributors to the noise floor. In addition, the fact that not only the noise floor but also the total variance in noise levels decreases with growth rate suggests that increased growth may dampen the propagation of noise through the regulatory network. To gain further insight into which fluctuations set the noise floor, and why the noise floor decreases with growth rate, will likely require quantitative time course data, for example, from approaches that combine microfluidics with time-lapse microscopy [45,46]. Although our modeling of noise levels in terms of known regulatory interactions showed that noise propagation can explain a significant fraction of the condition-dependent variation in noise levels genome-wide, there are many questions that remain for future work. Our modeling identified both TFs that appear important noise propagators in all conditions, for example, the histone-like H.NS and sigma factor Sigma38, as well as TFs that significantly propagate noise in one condition only, for example, LexA under treatment with ciprofloxacin and FlhDC in early stationary phase. Therefore, the most obvious direction for detailed experimental follow-up is to investigate the precise role of these TFs in noise propagation. For example, it is currently not clear what the main biophysical mechanism is through which noise is propagated from regulators to their targets. Both fluctuations in TF concentration across cells and the stochastic binding and unbinding of TFs to promoters will contribute to noise propagation, but the relative contribution of these are currently not known. In addition, it is also not clear what sets the sensitivity of different target promoters to fluctuations in an upstream regulator. To quantitatively understand the sensitivities of different target promoters to noise in the activities of their regulators will likely require much more realistic biophysical models of promoter function, which take into account that different TFs compete for binding to the promoter, that binding rates depend on TF concentrations, that interactions between bound TFs and RNA polymerase depend on the relative positioning of sites, and so on. Developing such quantitative models will likely require detailed data on the expression dynamics of different promoter architectures as growth conditions are varied. Lastly, since the structure of the regulatory network is a major determinant of genome-wide noise levels, this raises the question of how natural selection has acted on noise propagation. One might expect that by making gene regulation less accurate, the effects of noise regulation are mainly deleterious, so that natural selection would be expected to act to minimize noise propagation. However, our previous theoretical work has shown that, by effectively implementing a targeted bet hedging strategy, noise propagation can in fact be beneficial in many circumstances where perfect regulation is difficult to achieve [9]. It is thus conceivable that the way noise propagates through the regulatory network has been tuned by natural selection. It will be interesting to investigate to what extent the condition-dependent noise properties that we have measured contribute to growth and survival of the population in these conditions. For example, it is conceivable that the systematic increase of expression noise as growth rate decreases might be an adaptive strategy by which cells more actively explore different phenotypes when they grow more slowly. Similarly, it would be very interesting to investigate to what extent the noise propagation patterns that we observed in our lab strain of E. coli are conserved in related wild bacterial strains or related species.

Materials and methods

Strains

All 1,810 strains used in this study were taken from [21] and have been previously described [7]. In short, each strain carries a transcriptional fusion of a given native E. coli promoter followed by a strong ribosomal binding site and gfp-mut2 on a low copy number plasmid (SI Methods and Texts in S1 Text).

Growth conditions

The library of strains was grown in a total of 8 different conditions: minimal media, M9 (0.1 mM CaCl2, 1 mM MgSo4, 1 × M9 salts [Sigma M6030]) supplemented with either 0.2% glucose (w/v), 0.2% glycerol (v/v), 0.2% lactose (w/v), 0.4 M NaCl (+0.2% glucose [w/v]), or 1.5 ng/ml ciprofloxacin (+0.2% glucose [w/v]); a MOPS based synthetic rich media (Teknova, M2105) supplemented with 0.2% glucose, and 2 stationary phase conditions, where plates were grown for either 16 h or 30 h in M9 minimal media + 0.2% glucose (w/v) (SI Methods and Texts in S1 Text).

Flow cytometry quantification of fluorescence

We measured the distribution of GFP fluorescence levels in single cells using a FACSCanto II (BD Biosciences) with a high-throughput sampler (HTS), fluorescence excitation at 488 nm and a 530/30-nm filter for emission. We used a Bayesian procedure that removes outliers to extract the mean and variance of the log-fluorescence distributions as described in [23] (SI Methods and Texts in S1 Text).

Minimal variance as a function of mean and noise estimation

Flow cytometry data show a clear lower bound on noise levels (variance of log-fluorescence) that depends on the mean of expression. In previous work [9], we derived a functional form for this noise floor as a function of mean expression and used it to correct for the dependency in each condition (see S1 Text). We define a promoter’s noise, N, as the difference between the measured variance and the fitted minimal variance.

Noise propagation features

We sorted all annotated genes by their average noise across all conditions () and as a function of a cutoff in , we calculated the mean and standard error of the number of regulatory inputs of all genes with values above the cutoff and the fraction of genes with at least one known regulatory input. As a measure of noise plasticity of each promoter p, we calculated the variance of the noise levels N across conditions. We used the same promoter annotation as in [9], where the promoter fragments had been reannotated by mapping the primer pairs used to construct the library to the E. coli K12 MG1655 genome (SI Methods and Texts in S1 Text).

Fitting noise in terms of regulatory inputs

To model noise in terms of regulatory inputs, we adapted a previously developed method, called Motif Activity Response Analysis, which models gene expression levels in terms of computationally predicted regulatory sites in promoters and condition-dependent activities of regulators [29,30]. In particular, we model the noise N of each promoter p in each condition c as a linear function of the condition-dependent noise-propagating activities A of the regulators known to regulate promoter p, that is, Eq (1). Details of the approach are in the SI Methods and Texts in S1 Text.

Principal component analysis

For each promoter, we gathered a list of 10 features associated with the immediately downstream gene using both the measurements in this study, as well as previously published data. We calculated a covariance matrix containing all the variances of each of the features across genes, and the covariances of each pair of features. We then transformed this covariance matrix into a matrix of correlation coefficients and performed PCA (SI Methods and Texts in S1 Text).

Supporting information file containing supplementary methods, supplementary text, and supplementary figures and tables.

(PDF) Click here for additional data file.

Excel table containing the underlying data for results presented in the main and supplementary figures.

(XLSX) Click here for additional data file. 4 May 2021 Dear Dr van Nimwegen, Thank you for submitting your manuscript entitled "Genome-wide gene expression noise in Escherichia coli is condition-dependent and determined by propagation of noise through the regulatory network" for consideration as a Research Article by PLOS Biology. Please accept my apologies for the delay in getting back to you as we consulted with an academic editor about your submission. Your manuscript has now been evaluated by the PLOS Biology editorial staff as well as by an academic editor with relevant expertise and I am writing to let you know that we would like to send your submission out for external peer review. However, before we can send your manuscript to reviewers, we need you to complete your submission by providing the metadata that is required for full assessment. To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire. Please re-submit your manuscript within two working days, i.e. by May 06 2021 11:59PM. Login to Editorial Manager here: https://www.editorialmanager.com/pbiology During resubmission, you will be invited to opt-in to posting your pre-review manuscript as a bioRxiv preprint. Visit http://journals.plos.org/plosbiology/s/preprints for full details. If you consent to posting your current manuscript as a preprint, please upload a single Preprint PDF when you re-submit. Once your full submission is complete, your paper will undergo a series of checks in preparation for peer review. Once your manuscript has passed all checks it will be sent out for review. Given the disruptions resulting from the ongoing COVID-19 pandemic, please expect delays in the editorial process. We apologise in advance for any inconvenience caused and will do our best to minimize impact as far as possible. Feel free to email us at plosbiology@plos.org if you have any queries relating to your submission. Kind regards, Richard Richard Hodge, PhD Associate Editor, PLOS Biology rhodge@plos.org PLOS Empowering researchers to transform science Carlyle House, Carlyle Road, Cambridge, CB4 3DN, United Kingdom California (U.S.) corporation #C2354500, based in San Francisco 26 May 2021 Dear Dr van Nimwegen, Thank you very much for submitting your manuscript "Genome-wide gene expression noise in Escherichia coli is condition-dependent and determined by propagation of noise through the regulatory network" for consideration as a Research Article at PLOS Biology. Your manuscript has been evaluated by the PLOS Biology editors, an Academic Editor with relevant expertise, and by three independent reviewers. The reviews are attached below. You will see that the reviewers find your manuscript interesting and well done, but also raise concerns with the interpretation of the data in several instances and ask that the conclusions are toned down. In addition, Reviewer #1 raises concerns with the strength of advance over your previously published study (Wolf et al, 2015, eLife). We ask that the manuscript is re-written so that the novelty and advance of the current work is made clear. In light of the reviews, we will not be able to accept the current version of the manuscript, but we would welcome re-submission of a much-revised version that takes into account the reviewers' comments. We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent for further evaluation by the reviewers. We expect to receive your revised manuscript within 3 months. Please email us (plosbiology@plos.org) if you have any questions or concerns, or would like to request an extension. At this stage, your manuscript remains formally under active consideration at our journal; please notify us by email if you do not intend to submit a revision so that we may end consideration of the manuscript at PLOS Biology. **IMPORTANT - SUBMITTING YOUR REVISION** Your revisions should address the specific points made by each reviewer. Please submit the following files along with your revised manuscript: 1. A 'Response to Reviewers' file - this should detail your responses to the editorial requests, present a point-by-point response to all of the reviewers' comments, and indicate the changes made to the manuscript. *NOTE: In your point by point response to the reviewers, please provide the full context of each review. Do not selectively quote paragraphs or sentences to reply to. The entire set of reviewer comments should be present in full and each specific point should be responded to individually, point by point. You should also cite any additional relevant literature that has been published since the original submission and mention any additional citations in your response. 2. In addition to a clean copy of the manuscript, please also upload a 'track-changes' version of your manuscript that specifies the edits made. This should be uploaded as a "Related" file type. *Re-submission Checklist* When you are ready to resubmit your revised manuscript, please refer to this re-submission checklist: https://plos.io/Biology_Checklist To submit a revised version of your manuscript, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' where you will find your submission record. Please make sure to read the following important policies and guidelines while preparing your revision: *Published Peer Review* Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details: https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/ *PLOS Data Policy* Please note that as a condition of publication PLOS' data policy (http://journals.plos.org/plosbiology/s/data-availability) requires that you make available all data used to draw the conclusions arrived at in your manuscript. If you have not already done so, you must include any data used in your manuscript either in appropriate repositories, within the body of the manuscript, or as supporting information (N.B. this includes any numerical values that were used to generate graphs, histograms etc.). For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5 *Blot and Gel Data Policy* We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare them now, if you have not already uploaded them. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements *Protocols deposition* To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols Thank you again for your submission to our journal. We hope that our editorial process has been constructive thus far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Richard Richard Hodge, PhD Associate Editor, PLOS Biology rhodge@plos.org PLOS Empowering researchers to transform science Carlyle House, Carlyle Road, Cambridge, CB4 3DN, United Kingdom California (U.S.) corporation #C2354500, based in San Francisco ***************************************************** REVIEWS: Reviewer #1: This is a well-reasoned and well-presented manuscript studying the role of environmental conditions in expression variability in E.coli. The authors hypothesize that a key factor in expression noise is that genes regulated by transcription factors will propagate the noise of the factors and will therefore be more noisy the factors or the number of factors. The ultimate conclusion, that environment affects noise through transcription factors and that it affects different genes differently, is not surprising. This manuscript builds heavily upon this group's previous work. The manuscript is well written, and the clarity of data analysis and presentation of supporting evidence is exemplary. Overall, the manuscript is impressive in my opinion has the potential to support many interesting developments, both theoretical and experimental. I do have, however, several important concerns that should affect the editorial decision. Once these major issues are addressed, I'll be happy to review the manuscript again. 1. Some of the results presented are quite similar to the senior author's older work "Expression noise facilitates the evolution of gene regulation" (Wolf et al., eLife 2015). Indeed, the following quote is from that eLife 2015 publication digest entry: "Wolf et al. also found that noisy promoters tend to be highly regulated by transcription factors: the proteins that control gene expression by binding to promoter regions." This sounds much like the take-home message of this manuscript, which raises questions regarding the contribution in this manuscript. Unfortunately, the authors have made my life very difficult as a reviewer, by blending old and new methods in their manuscript in a way that makes it difficult to assess the novelty in this work. For example: most of Fig. 1 in this manuscript is very similar to Fig. 1 of Wolf. Fig. 2B in this manuscript is very similar to Fig. 1E in Wolf. Fig. 4A in this manuscript is very similar to Fig. 2A in Wolf. Fig. 4B in this manuscript is very similar to Fig. 2C in Wolf. There are more examples. The fact that as deep as Fig. 4 in this manuscript I still see analyses that are very similar to this group's work from 6 years ago, is worrying. It suggests to me that the authors need to rewrite the manuscript in such a way that makes it utterly clear what the novelty in this work. This would be a major edit since it is expected that non-novel methods/analyses belong either in the introduction or in the supplementary material, and that the results section contains only the novel results for this work. Only after the authors have clearly separated the old from the new, will I be able to properly judge the significance of the new manuscript. As it stands currently, my impression is that the major breakthrough was in the 2015 paper and that this followup work simply implements those methods in more experimental conditions. 2. I think the manuscript can be made more robust by softening some of the interpretations the authors make about the data. The data as they are speak eloquently and I fear these particular interpretations may detract and even may misguide and so should be removed. * Fig 1C - there not enough data to justify a linear fit. These data could be explained by a number of plausible models, for example a Hill function. Similarly, the interpretation in the text is not necessary. Unless the authors suggest a plausible linear relation, and then extract the slope of the linear fit and discuss its implications (adjusting for the correct units and considering fit confidence levels) then I suggest the authors do away with this fit altogether. * Since Growth rate ~ size and noise floor ~ 1/sqrt(size), doesn't that imply that the faster the growth the lower the noise? (e.g. Fig. 1C). This seems like an obvious statement, and so should be made clear. Again it questions the validity of the linear fit in Fig. 1C. * Fig 2E-F - I'm not sure the evidence supports the statements - "Noise/Expression plasticity". How different are they from A-B? One outlier is hardly convincing. Do the authors really want to go into the statistical analysis of whether they are or aren't different from the A-B case? Please remove/adjust or otherwise support the statements with a rigorous statistical analysis. Reviewer #2: Review: In the manuscript Urchueguía et al study gene expression noise in E. coli using a plasmid-based reporter system and flow cytometry. Using their system, the authors recapitulated what is known about noisy gene expression (variation among genes, increase of noise with lower expression levels, and a minimal noise floor) and expanded it to multiple growth conditions (faster growth reduces the overall noise level). While the choice of methods is tricky, both plasmids copy number is inherently noisy, and so is flow cytometry, the analysis appears well done. Fig. 2 is a nice illustration of the behavior of different promoters and Fig. 3 states the authors hypothesis that noise propagation increases noise clearly. The analysis of the effect of regulation on noise (Fig 4) yielded interesting results. Noisy genes generally had more regulatory input, and highly regulated promoters showed higher noise plasticity. The authors interpret their findings in Fig. 4 as 'Noise propagation explains condition-dependent noise levels', which at this stage I cannot see that the authors have shown. In Fig. 4C the authors show the fraction of explained variance of their simple noise propagation model, which captures around 10 to 30% of the observed noise. With 90-70% of the observed noise unaccounted for, it is hard to argue that noise propagation has explained variations in noise levels. I believe that is OK for the authors to only explain noise levels partially, and I also believe that Fig. 4A&B are interesting new data. Fig. 4D-E are accompanied by a large section in the text (p. 10) explaining which transcription factor is important and noisy for certain reasons in each condition. While some of the claims could be true (flagellar master regulator being turned on at end of growth -> noisy in early stationary phase), most others are not trivial (ArcA upregulated at high salt, because cell turns fermentative. But why the noise?). This section is very speculative, and could leave the reader more confused than enlightened. I recommend removing it from the main text. The authors should make clearer what the analysis of regulators (transcription factors and sigma factors) found means for the noise levels. In Fig. 4E the average noise propagation strength for a 'hit' is 0.01, but it was not clear to me if this is little or a lot. After going through the manuscript again I now understood that a gene regulated by this TF/sigma increases its noise level by approx. 0.01 (Eq. 1). Because the typical scale is 0-0.15 in noise level for a gene (Fig. 1D), regulation increases noise by around 10%. A proper estimation of what the numbers mean could benefit the reader. Finally, Fig. 5 seems ok, but very confusing and needs more explanation. What are dS, dN? I assume codon bias of synonymous and non-synonymous sites, but it is not clear to me what the metric is. What does it mean that genes cluster along the two principal components? Is this trivial, because several noise metrics were included in a data set where the other metrics are highly correlated (protein, RNA expression, dS, dN), or is it non-trivial? Is the conclusion that noise is independent of expression level? What does it mean that regulatory input is only a small component of PCA2? Overall, the manuscript is fine and can be published after revisions. In addition to the above comments, I want to express a concern that that the claims of the authors, especially in title and abstract, are not fully supported by the data. Expression noise, as far as I can see, is only 'partially determined' by noise propagation (10 to 30%, if I understand the paper right), so the title should be adapted to reflect that. The abstract should include that instead of a 'significant fraction' of the variation in noise levels, about 10-30% can be explained by the authors. Being transparent on the findings is important, because it will leave other authors (or the same) the opportunity to continue to work on what determines the remaining 70-90% of the noise. Furthermore, I want to authors to state in the abstract that their system is plasmid-based, because plasmid themselves are highly noisy, and the reader needs to be aware of this limitation of the study. In the main text, the authors should include a discussion of plasmid copy number variations and compare the noise from plasmid copy numbers to their observed noise (e.g. see https://www.nature.com/articles/s41467-021-21734-y). A discussion of the impact of cell size, would also benefit the reader. In exponential growth, cells post- and pre-division vary by a factor of ~2, which might impact the noise recorded by the authors. The impact of the flow cytometer on noise levels would also be important for the readers to judge noise levels. The statement: "Whereas constitutive promoters exhibit noise near this `noise floor' in all conditions […]" confused me. Does it refer to Fig. 4A, which shows that less noisy promoters are regulated on average by fewer inputs? If so, then the authors have only shown that promoters with lower noise are usually constitutive promoters. Replotting the existing data as number of regulators vs. noise, and then specifically discussing constitutive promoters should be sufficient to test if indeed unregulated promotors are near the noise-floor for in all conditions. Similarly, the statement "regulated promoters are systematically more noisy" should be changed to 'noisier promoters are regulated by an average higher amount of inputs', or alternatively, the can replot their data. In addition, a correlation plot, ideally as part of Fig. 4, between noise/noise plasticity and number of regulatory inputs would greatly benefit the readers. A major issue with the claims of the authors is that it is not clear whether the claims are statistically significant. Inclusion of statistical significance tests concerning correlations between noise and regulatory inputs, should be included in the revision. The statement: "Our results show that expression noise levels are themselves regulated and determined by the propagation of noise through the gene regulatory network", is, as far as I can see, not shown in the manuscript. Same claim in the discussion: "[…] gene expression noise is […] a regulated quantity". If the authors want to make this statement, then they should include a direct test of this hypothesis, e.g. by measuring the noise level in synthetic constructs with a varying number of regulatory inputs, or any other experimental verification. There are several statements in the manuscript that claim that condition-dependent gene expression noise is explained by propagation of noise through the regulatory network, e.g. "First, all promoters exhibit a condition-dependent minimal amount of noise, and unregulated promoters typically exhibit noise near this noise floor. Regulated promoters exhibit additional noise that results from propagation of noise through the regulatory network, causing different promoters to show condition-dependent noise levels depending on their regulatory inputs. These results show that not only the mean expression levels of genes are determined by gene regulation, the noise levels of genes are also to a large extent determined by the structure of the gene regulatory network as well." p. 11 The authors should be careful with this claim. If the authors want to make this statement, then they need to show that 1) unregulated promoters are at the noise-floor. 2) only regulated promoters are above the noise floor. I presume that this is based on Fig. 4, but I cannot see which data supports this claim. It could well be that the existing analysis proves the authors point and that I am missing something. Either way, the logic needs to be clearly spelled out and comprehensible. Finally, it is not clear to me how the authors claim that 'that condition-dependent gene expression noise is explained by propagation of noise through'. This is the major claim of the paper, and is found throughout the paper, including title and abstract. What is the evidence that noise propagation is responsible for the condition-dependent expression noise? Reviewer #3: This work is a tour de force in characterizing the mean and the variability of E. coli gene expression across many promoters and many experimental conditions. Bravo! This will be a very useful resource for researchers in the field, especially since all of the data have been uploaded by the authors. I am not an experimentalist, and hence cannot assess the experimental parts of the work, which are its cornerstone. The modeling/analysis is rather straightforward, based on previous work from the same group and from other authors, and was easy to follow. I don't have any specific comments there. Based on my ability to understand the computational analysis, and on the beauty of the dataset, I think the manuscript should be accepted to the journal with minimal changes. 1. As mentioned above, I've been able to follow most of the analysis. However, one argument made me pause. Specifically, the authors say that "Noise propagation predicts that relative noise levels are condition-dependent", so that "genes that are regulated by more TFs are expected to generally exhibit more expression noise than genes that are regulated by fewer TFs". This contradicts the model and the experiments on eukaryotic systems, such as in https://elifesciences.org/articles/59351, which argues that multiple enhancers decrease the promoter variability. There are certainly many differences between eukaryotes and prokaryotes -- and it would be worthwhile, I think, for the authors to explain why different results are expected for them under seemingly very similar conditions. 2. Towards the end of the manuscript, I got overwhelmed by the data, and was looking for a theoretical framework to incorporate them into, to help internalize them. Why is the organization of regulation as discovered her? Alas, the Discussion section was very short, and, to a large extent, it simply restated the findings of the Results section, without speculations about the framework to understand the data. It might be that the authors do not have a theoretical framework in mind, which I would understand. However, if they do have some ideas, I would strongly urge the authors to put them on paper, to help us summarize the data within some explanatory framework. 1 Oct 2021 Submitted filename: response_reviews_25sep2021.pdf Click here for additional data file. 4 Nov 2021 Dear Erik, Thank you for submitting your revised Research Article entitled "Genome-wide gene expression noise in Escherichia coli is condition-dependent and determined by propagation of noise through the regulatory network" for publication in PLOS Biology. I'm handling this paper temporarily while my colleague Dr Richard Hodge is out of the office. We have now obtained advice from the original reviewers and have discussed their comments with the Academic Editor. Based on the reviews, we will probably accept this manuscript for publication, provided you satisfactorily address the remaining points raised by the reviewers. Please also make sure to address the following data and other policy-related requests. IMPORTANT: Please attend to the following: a) Please address the remaining request from reviewer #1. b) Please supply a blurb according to the instructions on the submission form. c) Please address my Data Policy requests below; specifically, please supply numerical values underlying Figs 1BCDE, 2ABCDEFG, 4ABCDF, 5ABCD, S1AB, S2AB, S3, S4AB, S5A, S6A, S7AB, S8, S9AB, S10AB, S11ABCD,S12, S13AB, S14, S15ABC, S16, S17, S18, and cite the location of the data clearly in each relevant Fig legend (I note that the raw data and code are already provided, but we’ll need the above output values too). As you address these items, please take this last chance to review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the cover letter that accompanies your revised manuscript. We expect to receive your revised manuscript within two weeks. To submit your revision, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' to find your submission record. Your revised submission must include the following: -  a cover letter that should detail your responses to any editorial requests, if applicable, and whether changes have been made to the reference list -  a Response to Reviewers file that provides a detailed response to the reviewers' comments (if applicable) -  a track-changes file indicating any changes that you have made to the manuscript. NOTE: If Supporting Information files are included with your article, note that these are not copyedited and will be published as they are submitted. Please ensure that these files are legible and of high quality (at least 300 dpi) in an easily accessible file format. For this reason, please be aware that any references listed in an SI file will not be indexed. For more information, see our Supporting Information guidelines: https://journals.plos.org/plosbiology/s/supporting-information *Published Peer Review History* Please note that you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details: https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/ *Early Version* Please note that an uncorrected proof of your manuscript will be published online ahead of the final version, unless you opted out when submitting your manuscript. If, for any reason, you do not want an earlier version of your manuscript published online, uncheck the box. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us as soon as possible if you or your institution is planning to press release the article. *Protocols deposition* To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols Please do not hesitate to contact me should you have any questions. Best wishes, Roli Roland Roberts PhD Senior Editor PLOS Biology rroberts@plos.org on behalf of Richard Hodge, Associate Editor, rhodge@plos.org, PLOS Biology ------------------------------------------------------------------------ DATA POLICY: You may be aware of the PLOS Data Policy, which requires that all data be made available without restriction: http://journals.plos.org/plosbiology/s/data-availability. For more information, please also see this editorial: http://dx.doi.org/10.1371/journal.pbio.1001797 We note that raw data and code are presented in the Zenodo deposition. However, we also need the numerical values presented in the figures to be made available in one of the following forms: 1) Supplementary files (e.g., excel). Please ensure that all data files are uploaded as 'Supporting Information' and are invariably referred to (in the manuscript, figure legends, and the Description field when uploading your files) using the following format verbatim: S1 Data, S2 Data, etc. Multiple panels of a single or even several figures can be included as multiple sheets in one excel file that is saved using exactly the following convention: S1_Data.xlsx (using an underscore). 2) Deposition in a publicly available repository. Please also provide the accession code or a reviewer link so that we may view your data before publication. Regardless of the method selected, please ensure that you provide the individual numerical values that underlie the summary data displayed in the following figure panels as they are essential for readers to assess your analysis and to reproduce it: Figs 1BCDE, 2ABCDEFG, 4ABCDF, 5ABCD, S1AB, S2AB, S3, S4AB, S5A, S6A, S7AB, S8, S9AB, S10AB, S11ABCD,S12, S13AB, S14, S15ABC, S16, S17, S18 NOTE: the numerical data provided should include all replicates AND the way in which the plotted mean and errors were derived (it should not present only the mean/average values). IMPORTANT: Please also ensure that figure legends in your manuscript include information on where the underlying data can be found, and ensure your supplemental data file/s has a legend. Please ensure that your Data Statement in the submission system accurately describes where your data can be found. ------------------------------------------------------------------------ BLOT AND GEL REPORTING REQUIREMENTS: We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare and upload them now. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements ------------------------------------------------------------------------ DATA NOT SHOWN? - Please note that per journal policy, we do not allow the mention of "data not shown", "personal communication", "manuscript in preparation" or other references to data that is not publicly available or contained within this manuscript. Please either remove mention of these data or provide figures presenting the results and the data underlying the figure(s). ------------------------------------------------------------------------ REVIEWERS' COMMENTS: Reviewer #1: The authors have addressed my concerns well. I appreciate the edits made and personally I find the manuscript much more readable now, and the contribution in this work clearer. The manuscript remains impressive with exemplary analysis practices. I am happy to recommend publication but I do have one comment - I do not need to see the manuscript again - I leave its final form up to the authors and editor. The authors insist on fitting a straight line to Fig.1C despite acknowledging that they have no theory for it. They then ask me (in their rebuttal) why would I suggest a more complicated form instead. The answer is, that the straight line would go negative for a fast enough growth rate, and simply saying that it's linear within the relevant range for E.coli is hiding that fact under the rug. Together with very sparse data, lacking a reasoned model, such a fit is damaging to the field. This is a good paper in a respected journal - imagine the number of people who would implicitly assume a linear dependence here with little to no justification. Once such a view becomes "lore" it is very difficult to uproot. I request that they remove the ill-argued and quite possibly misleading linear fit from Fig1C. It is not necessary for the main messages of the paper. It is quite enough to state "decreasing" without stating a precise relation. You may even state "approximately linear" or "appears linear" without actually putting the fit with confidence intervals on the figure. Reviewer #2: The authors have put considerable effort into improving the manuscript. The revised manuscript is a great contribution to the field and can be published without further changes. Reviewer #3: My comments on the previous version were pretty minor to start with, and they have been addressed now. I recommend publication. 16 Nov 2021 Submitted filename: response_reviews.pdf Click here for additional data file. 23 Nov 2021 Dear Erik, On behalf of my colleagues and the Academic Editor, Nathalie Balaban, I am pleased to say that we can in principle accept your Research Article "Genome-wide gene expression noise in Escherichia coli is condition-dependent and determined by propagation of noise through the regulatory network" for publication in PLOS Biology, provided you address any remaining formatting and reporting issues. These will be detailed in an email that will follow this letter and that you will usually receive within 2-3 business days, during which time no action is required from you. Please note that we will not be able to formally accept your manuscript and schedule it for publication until you have any requested changes. Please take a minute to log into Editorial Manager at http://www.editorialmanager.com/pbiology/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production process. PRESS We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with biologypress@plos.org. If you have not yet opted out of the early version process, we ask that you notify us immediately of any press plans so that we may do so on your behalf. We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/. Thank you again for choosing PLOS Biology for publication and supporting Open Access publishing. We look forward to publishing your study. Best wishes, Richard Richard Hodge, PhD Associate Editor, PLOS Biology rhodge@plos.org PLOS Empowering researchers to transform science Carlyle House, Carlyle Road, Cambridge, CB4 3DN, United Kingdom ORCiD I plosbio.org I @PLOSBiology I Blog California (U.S.) corporation #C2354500, based in San Francisco
  45 in total

1.  Noise in eukaryotic gene expression.

Authors:  William J Blake; Mads KAErn; Charles R Cantor; J J Collins
Journal:  Nature       Date:  2003-04-10       Impact factor: 49.962

2.  A single determinant dominates the rate of yeast protein evolution.

Authors:  D Allan Drummond; Alpan Raval; Claus O Wilke
Journal:  Mol Biol Evol       Date:  2005-10-19       Impact factor: 16.240

3.  Transcriptional control of noise in gene expression.

Authors:  Alvaro Sánchez; Jané Kondev
Journal:  Proc Natl Acad Sci U S A       Date:  2008-03-19       Impact factor: 11.205

4.  Two strategies for gene regulation by promoter nucleosomes.

Authors:  Itay Tirosh; Naama Barkai
Journal:  Genome Res       Date:  2008-04-30       Impact factor: 9.043

5.  Promoter architecture dictates cell-to-cell variability in gene expression.

Authors:  Daniel L Jones; Robert C Brewster; Rob Phillips
Journal:  Science       Date:  2014-12-18       Impact factor: 47.728

6.  Acetate metabolism by Escherichia coli in high-cell-density fermentation.

Authors:  G L Kleman; W R Strohl
Journal:  Appl Environ Microbiol       Date:  1994-11       Impact factor: 4.792

7.  Mechanism of transcriptional bursting in bacteria.

Authors:  Shasha Chong; Chongyi Chen; Hao Ge; X Sunney Xie
Journal:  Cell       Date:  2014-07-17       Impact factor: 41.582

8.  RegulonDB v 10.5: tackling challenges to unify classic and high throughput knowledge of gene regulation in E. coli K-12.

Authors:  Alberto Santos-Zavaleta; Heladia Salgado; Socorro Gama-Castro; Mishael Sánchez-Pérez; Laura Gómez-Romero; Daniela Ledezma-Tejeida; Jair Santiago García-Sotelo; Kevin Alquicira-Hernández; Luis José Muñiz-Rascado; Pablo Peña-Loredo; Cecilia Ishida-Gutiérrez; David A Velázquez-Ramírez; Víctor Del Moral-Chávez; César Bonavides-Martínez; Carlos-Francisco Méndez-Cruz; James Galagan; Julio Collado-Vides
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

9.  The transcriptional network that controls growth arrest and differentiation in a human myeloid leukemia cell line.

Authors:  Harukazu Suzuki; Alistair R R Forrest; Erik van Nimwegen; Carsten O Daub; Piotr J Balwierz; Katharine M Irvine; Timo Lassmann; Timothy Ravasi; Yuki Hasegawa; Michiel J L de Hoon; Shintaro Katayama; Kate Schroder; Piero Carninci; Yasuhiro Tomaru; Mutsumi Kanamori-Katayama; Atsutaka Kubosaki; Altuna Akalin; Yoshinari Ando; Erik Arner; Maki Asada; Hiroshi Asahara; Timothy Bailey; Vladimir B Bajic; Denis Bauer; Anthony G Beckhouse; Nicolas Bertin; Johan Björkegren; Frank Brombacher; Erika Bulger; Alistair M Chalk; Joe Chiba; Nicole Cloonan; Adam Dawe; Josee Dostie; Pär G Engström; Magbubah Essack; Geoffrey J Faulkner; J Lynn Fink; David Fredman; Ko Fujimori; Masaaki Furuno; Takashi Gojobori; Julian Gough; Sean M Grimmond; Mika Gustafsson; Megumi Hashimoto; Takehiro Hashimoto; Mariko Hatakeyama; Susanne Heinzel; Winston Hide; Oliver Hofmann; Michael Hörnquist; Lukasz Huminiecki; Kazuho Ikeo; Naoko Imamoto; Satoshi Inoue; Yusuke Inoue; Ryoko Ishihara; Takao Iwayanagi; Anders Jacobsen; Mandeep Kaur; Hideya Kawaji; Markus C Kerr; Ryuichiro Kimura; Syuhei Kimura; Yasumasa Kimura; Hiroaki Kitano; Hisashi Koga; Toshio Kojima; Shinji Kondo; Takeshi Konno; Anders Krogh; Adele Kruger; Ajit Kumar; Boris Lenhard; Andreas Lennartsson; Morten Lindow; Marina Lizio; Cameron Macpherson; Norihiro Maeda; Christopher A Maher; Monique Maqungo; Jessica Mar; Nicholas A Matigian; Hideo Matsuda; John S Mattick; Stuart Meier; Sei Miyamoto; Etsuko Miyamoto-Sato; Kazuhiko Nakabayashi; Yutaka Nakachi; Mika Nakano; Sanne Nygaard; Toshitsugu Okayama; Yasushi Okazaki; Haruka Okuda-Yabukami; Valerio Orlando; Jun Otomo; Mikhail Pachkov; Nikolai Petrovsky; Charles Plessy; John Quackenbush; Aleksandar Radovanovic; Michael Rehli; Rintaro Saito; Albin Sandelin; Sebastian Schmeier; Christian Schönbach; Ariel S Schwartz; Colin A Semple; Miho Sera; Jessica Severin; Katsuhiko Shirahige; Cas Simons; George St Laurent; Masanori Suzuki; Takahiro Suzuki; Matthew J Sweet; Ryan J Taft; Shizu Takeda; Yoichi Takenaka; Kai Tan; Martin S Taylor; Rohan D Teasdale; Jesper Tegnér; Sarah Teichmann; Eivind Valen; Claes Wahlestedt; Kazunori Waki; Andrew Waterhouse; Christine A Wells; Ole Winther; Linda Wu; Kazumi Yamaguchi; Hiroshi Yanagawa; Jun Yasuda; Mihaela Zavolan; David A Hume; Takahiro Arakawa; Shiro Fukuda; Kengo Imamura; Chikatoshi Kai; Ai Kaiho; Tsugumi Kawashima; Chika Kawazu; Yayoi Kitazume; Miki Kojima; Hisashi Miura; Kayoko Murakami; Mitsuyoshi Murata; Noriko Ninomiya; Hiromi Nishiyori; Shohei Noma; Chihiro Ogawa; Takuma Sano; Christophe Simon; Michihira Tagami; Yukari Takahashi; Jun Kawai; Yoshihide Hayashizaki
Journal:  Nat Genet       Date:  2009-04-19       Impact factor: 38.330

10.  Promoter sequence determines the relationship between expression level and noise.

Authors:  Lucas B Carey; David van Dijk; Peter M A Sloot; Jaap A Kaandorp; Eran Segal
Journal:  PLoS Biol       Date:  2013-04-02       Impact factor: 8.029

View more
  4 in total

1.  The transcription factor network of E. coli steers global responses to shifts in RNAP concentration.

Authors:  Bilena L B Almeida; Mohamed N M Bahrudeen; Vatsala Chauhan; Suchintak Dash; Vinodh Kandavalli; Antti Häkkinen; Jason Lloyd-Price; Palma S D Cristina; Ines S C Baptista; Abhishekh Gupta; Juha Kesseli; Eric Dufour; Olli-Pekka Smolander; Matti Nykter; Petri Auvinen; Howard T Jacobs; Samuel M D Oliveira; Andre S Ribeiro
Journal:  Nucleic Acids Res       Date:  2022-06-24       Impact factor: 19.160

2.  Probabilistic edge weights fine-tune Boolean network dynamics.

Authors:  Dávid Deritei; Nina Kunšič; Péter Csermely
Journal:  PLoS Comput Biol       Date:  2022-10-10       Impact factor: 4.779

3.  Gene regulation in Escherichia coli is commonly selected for both high plasticity and low noise.

Authors:  Markéta Vlková; Olin K Silander
Journal:  Nat Ecol Evol       Date:  2022-06-20       Impact factor: 19.100

4.  Gene regulation by a protein translation factor at the single-cell level.

Authors:  Roswitha Dolcemascolo; Lucas Goiriz; Roser Montagud-Martínez; Guillermo Rodrigo
Journal:  PLoS Comput Biol       Date:  2022-05-06       Impact factor: 4.475

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.