| Literature DB >> 29203832 |
Ching-Cher Sanders Yan1, Surendhar Reddy Chepyala1,2,3, Chao-Ming Yen1,4,5, Chao-Ping Hsu6,7.
Abstract
Gene expression involves bursts of production of both mRNA and protein, and the fluctuations in their number are increased due to such bursts. The Langevin equation is an efficient and versatile means to simulate such number fluctuation. However, how to include these mRNA and protein bursts in the Langevin equation is not intuitively clear. In this work, we estimated the variance in burst production from a general gene expression model and introduced such variation in the Langevin equation. Our approach offers different Langevin expressions for either or both transcriptional and translational bursts considered and saves computer time by including many production events at once in a short burst time. The errors can be controlled to be rather precise (<2%) for the mean and <10% for the standard deviation of the steady-state distribution. Our scheme allows for high-quality stochastic simulations with the Langevin equation for gene expression, which is useful in analysis of biological networks.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29203832 PMCID: PMC5715166 DOI: 10.1038/s41598-017-16835-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1A general model of gene expression with burst productions and its stochastic dynamics of protein number. (a) The scheme of reactions for gene expression. (b) Shown are a stochastic trajectory (green) from the Gillespie algorithm, with the protein’s intermittent burst production indicated by red bars in time steps of 0.2 protein lifetime (1/γ ). Under the conditions applied, and , rapid rises in the trajectory are seen, and protein production can be described as in bursts. Parameters used are k = 5, γ = 95, k = 200, γ = 10, k = 100 and γ = 1, which correspond to , average mRNA burst size and protein average burst size .
Figure 2Four cases of gene expression dynamics and errors from different Langevin equations. (a) Four possible cases of gene expression. When an activated gene state is short-lived, the Langevin equation skips the tracking for the gene state, and a burst production following the statistics is used for mRNA. Similarly, when the mRNA’s lifetime is short, burst production of protein is introduced, instead of tracking the mRNA. (b) Shown are normalized errors (%) of a steady-state protein’s standard deviation () from the burst Langevin equations compared to the squared root of exact variance expression as in equation (26), as a function of gene deactivation rate γ and mRNA degradation rates γ . Red lines are the boundaries for the four different cases in the burst models. Other parameters are , k = 5, mRNA burst size , protein burst size , and γ = 1.
Summary of statistics for three different cases of burst in gene expression.
| mRNA burst ( | both bursts ( | protein burst ( | |
|---|---|---|---|
| condition |
|
|
|
| simulated subjects |
|
|
|
| burst event distribution: | |||
|
|
|
|
|
|
|
|
|
|
| burst size† distribution: | |||
|
|
|
|
|
|
|
|
|
|
| burst production in Langevin equation: | |||
|
|
|
|
|
|
|
|
|
|
| steady-state distribution: exact expression* | |||
|
| same as exact | — | — |
|
|
| — | — |
|
| same as exact | same as exact | same as exact |
|
|
|
|
|
†Definition for burst sizes are: , , .
‡The mean and variance of mRNA production with mRNA burst are the mean and variance of the burst events of protein in the case of both bursts.
*Definitions for the fractions are: , , .
§With the conditions of and , replaced by with mRNA burst.
¶With the conditions of and , replaced by with mRNA burst.
||With the conditions of and , replaced by with both bursts; with , replaced by .
#With the conditions of , replaced by with protein burst; and with , replaced by .
Figure 3Comparison of and steady-state distributions with the burst Langevin simulation and Gillespie simulation. Shown in (a) are the difference (in %) with the burst Langevin simulation and Gillespie simulation and in (b) steady-state distributions with the burst Langevin simulation (red) and Gillespie simulation (green) with different gene activation rates k = 3,10,100 and burst size . Statistics were taken at the steady state of 10,000 independent points for the model as defined in equations (1) to (3) with parameters k = 100, γ = 100, γ = 10 and γ = 1.
Figure 4Comparison of simulation time with the burst Langevin simulation and Gillespie simulation. Shown are the percentage of computer time used by the burst Langevin simulation compared to that by the Gillespie simulation for different k and burst size .
Figure 5Comparison between the burst Langevin simulation and Gillespie simulation with different and . Shown in (a) are the difference (in %) with the burst Langevin simulation and Gillespie simulation and in (b) steady-state distributions with the burst Langevin simulation (red) and Gillespie simulation (green) with different and . k and k were determined by equations (5) and (16) with given and , respectively, with the other parameters k = 5, γ = 100, γ = 10 and γ = 1.
Figure 6A test for simulation error of gene expression under non-linear repressive regulation. Shown in (a) is the steady-state with the burst Langevin simulation and in (b) the error in from the burst Langevin simulation comparing to that from the Gillespie simulation. Here the p 1’s burst frequency, , and burst size, , are varied over a range. Other parameters for p 1 are k = 5, γ = 100, γ = 10 and γ = 1. For p 2, the parameters are k = 5, γ = 100, k = 200, k = 60, γ = 10, k = 100 and γ = 1, K = 200 and n = 3. The red line in (b) corresponds to .
Figure 7Comparison of different algorithms for genetic switching dynamics. Shown are average protein numbers with the standard deviation of the distribution at different times from 10,000 independent stochastic trajectories with the burst Langevin algorithm (red) and Gillespie simulation (green) for the model defined in equations (1) to (3) with parameters k = 30 for t = 7 to 14; otherwise k = 3 and other parameters γ = 100, k = 200, γ = 10, k = 100 and γ = 1.
Figure 8A representative burst production distribution and the effect of negative burst on a protein’s fluctuation. Shown in (a) is a typical burst production distribution as defined in equation (31) with τ = 0.03 and colored area as the negative production and in (b) are two stochastic trajectories from the algorithm following equation (31), with (blue) and removing (red) negative burst production. Parameters are k = 5, γ = 95, k = 200, γ = 10, k = 100 and γ = 1, corresponding to with .