Literature DB >> 26783763

Evaluation of predictions of the stochastic model of organelle production based on exact distributions.

Abstract

We present a reanalysis of the stochastic model of organelle production and show that the equilibrium distributions for the organelle numbers predicted by this model can be readily calculated in three different scenarios. These three distributions can be identified as standard distributions, and the corresponding exact formulae for their mean and variance can therefore be used in further analysis. This removes the need to rely on stochastic simulations or approximate formulae (derived using the fluctuation dissipation theorem). These calculations allow for further analysis of the predictions of the model. On the basis of this we question the extent to which the model can be used to conclude that peroxisome biogenesis is dominated by de novo production when Saccharomyces cerevisiae cells are grown on glucose medium.

Entities: Chemical Disease Gene Species

Keywords: S. cerevisiae; cell biology; cell modelling; computational biology; mathematical analysis; organelle biogenesis; peroxisome; stochastic; systems biology

Mesh：

Substances：
Culture Media
Glucose

Year: 2016 PMID： 26783763 PMCID： PMC4733043 DOI： 10.7554/eLife.10167

Source DB: PubMed Journal: Elife ISSN： 2050-084X Impact factor: 8.140

Introduction

Recently a model was presented in which the variation of numbers of a particular type of organelle (Golgi apparatus, vacuoles or peroxisomes) observed in cells was proposed as a diagnostic indicator of the relative importance of different processes by which organelles can be formed and destroyed (Mukherji and O’Shea, 2014; see Mukherji and O’Shea, 2015 for a correction). Here we re-examine the mathematical analysis of this model and show that further insight can be gained from considering exact calculations of the equilibrium distributions. For conciseness we will refer to the model, and the analysis in the associated paper (Mukherji and O’Shea, 2014), by the abbreviation SMOP (stochastic model of organelle production).

Analysis

The SMOP model in the context of “birth and death” models

In the SMOP model, four processes are envisaged for the production and destruction of organelles: de novo synthesis, fission, fusion and decay. These four processes are characterised by one rate constant each, defined in the SMOP paper as k, and γ. Following the definitions in the SMOP paper, the probabilities of each of the four processes occurring in the next small time period δt are given in Table 1. We also include in this table the total rate of each process that would be observed instantaneously in a large population of N cells.

Table 1.

Definition of terms in the SMOP model

DOI: http://dx.doi.org/10.7554/eLife.10167.003

Process	Probability of process occurring in next δt in a particular cell containing n organelles¹	Process changes n by	Rate of process in sample of N cells²
De novo	k_de novoδt	1	k_{de novo}f_nN
Fission	k_fissionnδt	1	k_fissionnf_nN
Decay	γnδt	-1	γnf_nN
Fusion	k_fusionn(n-1)δt	-1	k_fusionn(n-1)f_nN

For example, if k= 0.02 then the probability of a cell with n = 2 organelles undergoing a fission event in the next 0.1 time units = 0.02x2x0.1 = 0.004.

is the fraction of cells having n organelles. For example, if 23% of the cells have 2 organelles then f= 0.23. If, in a population of 1000 cells, f= 0.23 and k= 0.02 then the rate of cells changing from having n = 2 to n = 3 organelles at any one moment due to fission would be 0.02x2x0.23x1000 = 9.2 cells per time unit.

Definition of terms in the SMOP model DOI: http://dx.doi.org/10.7554/eLife.10167.003 For example, if k= 0.02 then the probability of a cell with n = 2 organelles undergoing a fission event in the next 0.1 time units = 0.02x2x0.1 = 0.004. is the fraction of cells having n organelles. For example, if 23% of the cells have 2 organelles then f= 0.23. If, in a population of 1000 cells, f= 0.23 and k= 0.02 then the rate of cells changing from having n = 2 to n = 3 organelles at any one moment due to fission would be 0.02x2x0.23x1000 = 9.2 cells per time unit. Models involving processes of this type are generically termed “birth and death” processes and have a very long history of analysis in the context both of the life sciences (e.g. evolution; Yule, 1924) and in the context of physical processes (e.g. detection of cosmic rays; Furry, 1937). Accessible discussions can be found in several books (Bailey, 1990, which is a reissue of the classic text from 1964; Taylor and Karlin, 1998). In such analyses, the three processes of de novo production, production by fission, and loss by first order decay are often termed immigration, birth and death, respectively. Immigration is used for a process that increases the number of individuals but does not require any other individuals already to be present. Birth is the process by which one individual gives rise to a second individual. Death is a process by which a particular individual is lost from a population with a probability that is independent of any other members of the population. Analyses including a fusion term are much less common. As there are a considerable number of possible combinations of the four processes that might be active, we will use a notation here to define a model by listing in curly brackets the active production processes, followed by the active destruction processes, separated by a semi-colon. Any process that is not mentioned has a rate constant of zero. Thus the model with de novo, fission and decay terms would be denoted {de novo, fission; decay}. During single cell simulations based upon the equations in Table 1 the number of organelles will fluctuate (Figure 1A) and one can ask what fraction of time, f, does a cell spend having n = 0, n = 1, n = 2, etc. organelles. This is equivalent to asking what fraction of a large ensemble of cells have n = 0, n = 1, n = 2, etc. organelles at one moment in time. In treatments of stochastic systems, the values of f would normally be described as the probability distribution for a cell having n organelles. In terms of a population of cells it can be described as a population distribution.

Figure 1.

The concept of the limiting distribution in a stochastic system.

DOI: http://dx.doi.org/10.7554/eLife.10167.004

The concept of the limiting distribution in a stochastic system.

(A) The traces show simulations run with the parameters {k= 2.0, k= 0.9; γ = 1.0, k= 0.02}, starting from n = 0 (black trace) and n = 50 (green trace). Both simulations “settle down” to stochastic fluctuations about a mean value of = 7.1 (B) Schematic representation of a set of cells that are all initialised to n = 1 at time t = 0, and are observed at a time t = τ. (C) Distributions calculated with the same parameters as in (A) for a set of 1000 cells as in (B), calculated for τ = 0.2 (magenta), 0.4 (yellow), 1.0 (green), 2.0 (blue), 5.0 (black), 10.0 (cyan), 15.0 (red). The curves for τ = 10.0 and τ = 15.0 become very similar as they approach the limiting distribution. These two curves match closely the result (filled black circles) of applying the recurrence relation (Equation 2; Appendix 2). (D) The red trace is a time course for parameters {k= 2.0, k= 1.1; γ = 1.0, k= 0}. Since k> γ, and k= 0, then n diverges; in such a case there is no limiting distribution. DOI: http://dx.doi.org/10.7554/eLife.10167.004 The simulations in Figure 1A illustrate the important point that a simulation is always started from some arbitrary starting point, and that a period of time must elapse before the simulations can be considered to be independent of this starting point. If the distribution f is evaluated at different times after the starting point of simulations (Figure 1C) then different distributions are obtained; hence the distribution is “time dependent”. As the probability (or population) distribution varies in time (and since the system can in principle be started from any state) then it is not strictly possible to talk of “the distribution” for a stochastic system. However in many birth and death models the system will settle down to a limiting distribution, independent of the starting states of the cell(s) as in the cyan and red curves in Figure 1C. Such a situation corresponds to a state of dynamic equilibrium that is familiar from chemical kinetics. Thus the terms limiting, equilibrium (or steady state) distributions can be used in this context interchangeably. The conditions for the models studied here to have limiting distributions is discussed further below; an example of a set of parameters for which a SMOP model does not yield a limiting distribution is shown in Figure 1D. Assuming that a limiting probability distribution does exist there are two basic sampling methods by which it can be measured, irrespective of whether one is making experimental observations or performing simulations. One method is to note n at a set of time points for a single cell (such as points drawn from the trajectories in Figure 1A), and the other is to take a large number of cells at one point in time, and measure n across this ensemble of cells (Figure 1B,C). The former would require a time dependent set of observations that may be difficult to obtain experimentally. The latter approach is equivalent to making experimental observations on a large field of view of cells, or of making repeated simulations. However there is a large caveat that when the measurements are made one must be convinced that the cells have had “long enough to reach equilibrium” since the last significant perturbation to the system. If this is not the case then the distribution measured will be contaminated with contributions from non-equilibrium distributions (such as from the magenta, yellow, green and blue curves in Figure 1C). Perturbations include the choice of an arbitrary starting point in simulations, and effects such as cell division and change of growth conditions in experimental data.

Does the SMOP analysis imply equilibrium distributions?

It is implicitly assumed in the SMOP paper that the populations are to be considered to be at equilibrium (or that the probability distributions are in their limiting form) for all three cases of the analysis via simulations, from experimental data, or via the fluctuation dissipation theorem; a comment has been added to the original articles to clarify this assumption for the simulations (see the comment dated November 23, 2015 on Mukherji and O'Shea, 2014). For the experimental data this seems a reasonable assumption, although dynamic population data are really required to fully settle this issue. The fluctuation dissipation theorem method implicitly assumes a steady state (Paulsson, 2005).

Derivation of recurrence relation for the distribution of organelles in the {de novo, fission; decay, fusion} model

By applying an equilibrium condition it is straightforward to derive precise relations for the distributions in the three scenarios considered in the original SMOP paper, and hence avoid the approximations introduced by the use of the fluctuation dissipation theorem. At equilibrium, the rate at which the population gains cells with n + 1 organelles due to cells with n organelles gaining one organelle must equal the rate at which the cells with n + 1 organelles lose one organelle. The reasoning is the same as for standard treatments of dynamic equilibrium between two states (as in a chemical reaction), and the complete justification of this when there are multiple states (i.e. cells with n = 0, n = 1, n = 2, etc., organelles) is given in Appendix 1. Thus at equilibrium which gives From Equation 2 the exact distribution of organelle numbers at equilibrium can be calculated for a model involving any combination of the four processes, without recourse to random number based simulations and the attendant issues of ensuring adequate sampling precision. An explicit numerical example of the use of Equation 2 to generate a distribution is given in Appendix 2. Briefly, an arbitrary value for fis chosen; fis then calculated from f; f is calculated from f is calculated from f; etc. Finally the entire distribution is normalised, which removes any dependence on the initial choice for f. Equation 2 is often termed a recurrence relation (or sometimes recursion relation or difference equation) as it allows successive terms in a distribution to be calculated from earlier terms.

Application of recurrence method to Golgi and vacuole models

The recurrence relation readily allows the derivation of precise distributions for the case of the model applied to Golgi ({de novo; decay}, Appendix 3) and vacuoles ({fission; fusion}, Appendix 4). For the Golgi, a Poisson distribution is obtained as the limiting distribution in accord with the SMOP analysis. However for vacuoles a truncated Poisson distribution is obtained, and not the shifted Poisson distribution that is reported in the SMOP analysis. Although the difference between these distributions is quite subtle (Appendix 4), the variation of Fano factor with is significantly different: the Fano factor for the truncated Poisson approaches 1 much more rapidly (Figure 2, green curve) than for the shifted Poisson (Figure 2, black curve).

Figure 2.

Comparison of reported Fano factors for vacuole populations, compared to two different theoretical expectations.

DOI: http://dx.doi.org/10.7554/eLife.10167.005

Comparison of reported Fano factors for vacuole populations, compared to two different theoretical expectations.

Three data points quoted in the SMOP paper are plotted: □ Haploid (glucose); ○ Diploid (glucose); ∆ Haploid (oleate). The solid black curve is the expectation from the shifted Poisson distribution (which is the incorrect distribution given the {fission; fusion} model) and the solid green curve is the expectation from the truncated Poisson distribution (which is the correct distribution given the model). The dashed line is for a Fano factor of 1. The solid curves were constructed by calculating a family of distributions and evaluating the mean and Fano factor. DOI: http://dx.doi.org/10.7554/eLife.10167.005 In Figure 2, it can be seen that the experimental values quoted in the SMOP analysis are in excellent agreement with the incorrect prediction, whilst the agreement with the corrected theoretical prediction is much less good. This greatly weakens the argument that the SMOP model makes “quantitatively accurate predictions” or that it therefore correctly accounts for the behaviour of the vacuole population.

Application of recurrence method to peroxisome models

Having established the value of analysing the SMOP model with the recurrence method, we move on to the case of peroxisomes.

Recurrence relation demonstrates that {de novo, fission; decay} yields a negative binomial distribution

For peroxisomes, the primary model discussed in the SMOP analysis is a model in which peroxisomes can potentially form both de novo and by fission, and fusion is considered to be negligible. We show in Appendix 5 that such a {de novo, fission; decay} model has a limiting distribution that is a negative binomial distribution. As a result, exact expressions (for all parameter values) are readily obtained (Appendix 5) for the mean and Fano factor for this model, Combining Equations 3,4 gives an alternative form for the Fano factor This latter equation is the form given in the SMOP analysis as the k= 0 limit of the approximate Equation 1 of the SMOP paper. By explicitly applying the equilibrium assumption we have therefore shown that this expression is exact in this case for all values of the parameters.

Evaluation of Fano factor for the boundary marking equal de novo and fission rates

An attractive claim made for the SMOP analysis is that the value for the Fano factor can be used to distinguish between different modes of organelle production. In Figure 3D of the original SMOP paper, a green dashed line was placed at and stated as “marking the boundary between de novo synthesis and fission dominated organelle production”. Thus, as presented in Figure 3D of the original version of the SMOP paper, the Fano factor for cells grown on oleate ( appears to lie significantly above the line where the rates are equal, and thus terms such as “fission dominated biogenesis” are used widely in the SMOP paper.

Figure 3.

The percentage contribution to the total production rate from de novo production and from fission as a function of Fano factor in the {de novo, fission; decay} model.

DOI: http://dx.doi.org/10.7554/eLife.10167.006

The percentage contribution to the total production rate from de novo production and from fission as a function of Fano factor in the {de novo, fission; decay} model.

For each value of the Fano factor shown, a bar is drawn to represent the total production rate. The filled part of the bar represents the contribution from de novo production inferred from the model and the open part of the bar represents the inferred rate from fission. Bars are shown for Fano factors of: 1.0 (100% de novo); 1.1 (experimentally observed for glucose growth (G) in SMOP paper; 9% de novo, 91% fission); 2.0 (boundary value, where de novo and fission contributions are equal); 2.4 (experimentally reported value for oleate growth (O) in SMOP paper; 42% de novo, 58% fission); 3.0 (value required for fission rate to be double the de novo rate). The total production rate from de novo processes is simply k. The total rate from fission processes is k. The relative proportions of the two processes were calculated using Equation 5. DOI: http://dx.doi.org/10.7554/eLife.10167.006 However, according to Equation 5 the correct value with equal contributions from the two processes is . A correction to this effect has now been issued. The value of corresponds to the extreme case of zero fission and production solely by de novo production (Figure 3). In the corrected version of Figure 3D in the SMOP paper it is clear that the inferred fission rate for growth on oleate is only barely greater than the de novo synthesis rate and the term “dominance” does not seem to be appropriate. It is clear from Figure 3 that the inferred contribution from fission grows only rather slowly as the Fano factor increases above a value of two. In other words, for fission to be truly dominant a Fano factor would have to be observed that was far greater than any reported in the SMOP paper.

Restrictions on the relationships between parameters in order for limiting distributions to exist

Since it is inherent in the SMOP analysis that the distributions are limiting/equilibrium distributions, it is important to consider whether a limiting distribution will ever be reached. In the case of the peroxisome model, {de novo, fission; decay}, for the system to have a limiting or equilibrium distribution there is a strong restriction on parameters, namely that k To see why this restriction exists, consider first the more simple {de novo; decay} model. Considering a single cell, the number of organelles will be limited by the fact that as n grows larger then the decay process (whose rate increases proportionally to n) will become increasingly more likely than the de novo formation process (which is independent of n). If the fission process is introduced then there is now a production term that also increases proportionally to n. If k exceeds γ then the fission process will always exceed the decay process and the number of organelles will grow without limit. The behaviour as k becomes similar to γ can also be seen from Equations 3,4 since if k is increased from zero until it becomes equal to γ then and both diverge. Thus at first sight there appear to be two different ways in which the Fano factor can become very large. One way, as extensively discussed in the SMOP analysis and above, is if the mean rate k becomes large compared to the mean rate k. The other way, that becomes clear from our analysis, is if the rate constant k becomes similar to the rate constant γ. The connection between these two relations is that is a function of all three rate constants. This emphasises the hidden complexity of the interplay of the parameters in this model.

Discussion

Our motivation for this in depth analysis of the SMOP method was sparked by the claim that it could differentiate between fission and fusion dominated mechanisms of peroxisome biogenesis. That the distribution of numbers of organelles can give a clue to the mechanisms by which organelles are formed is a very elegant idea, and the SMOP analysis combines this idea with a very simple kinetic model. As we explored the system further we realised that the fluctuation dissipation theorem result was not necessary for analysis of the system, and that enforcing the equilibrium condition, that was implicit in the work already, greatly simplified the analysis. There are a number of factors that cause us to question the utility of the SMOP model. A main piece of evidence for the correctness of the model was the agreement of the experimentally observed Fano factors for the vacuole data with those from the model. We have shown this agreement to be much less perfect than originally demonstrated. We have also shown that there is a strong interplay between different parameters in the model. This means that the agreement of experimental data with the model is not as compelling as originally presented and that the interpretation of experimental observations back to mechanistic conclusions is open to question. We hope that our analysis will stimulate discussion as to whether, for instance, the SMOP model captures the key features of the underlying processes and is just lacking some details; or whether the model fundamentally lacks key aspects of feedback. The assumption that observations of cells grown in batch culture faithfully report equilibrium distributions also requires further verification. A key conclusion of the SMOP analysis is that the contribution of fission to peroxisome biogenesis is negligible (<10%) when yeast cells are grown on glucose, but "dominant" when they are grown on oleate. This is an area of some contention (Hoepfner et al, 2005; Motley and Hettema, 2007), and a recent model relied on fission of peroxisomes during organelle inheritance as the proliferation mechanism (Knoblach et al, 2013). Our analysis has shown that the term “dominant” is misleading, and that the data reported for haploid cells grown on oleate indicates approximately equal contributions from the two processes. Nevertheless the model does suggest that the proportion of production by fission increases by about a factor of five on switching to oleate growth. Supporting evidence for this was provided by the observation of the reduction in the inferred fission contribution in cells grown on oleate in which the fission factors Vps1 or Dnm1 (or Fis1, an accessory factor of Dnm1) were deleted. On the other hand, no data were shown for glucose grown cells harbouring the same deletions. On glucose the Fano factor is reported in the SMOP analysis as 1.1, with = 3, and from Equations 3,4 one obtains k= 2.7γ and k= 0.1γ. The model then implies that on deletion of the fission pathway (i.e. setting k= 0) then would only drop by 10%. Peroxisome count data has been reported recently (Fig S4, Motley ), with values of = 4.9 in WT cells, = 1.5 in vps1Δ cells and = 1.2 in vps1Δdnm1Δ cells. The drops in peroxisome numbers in vps1Δ and vps1Δdnm1Δ cells are much greater than the 10% estimated above from the SMOP model. There is also peroxisome count data in Kuravi et al. (2006), which gives = 1.6 (WT), = 1.2 (vps1Δ), = 1.7 (dnm1Δ), = 0.9 (dnm1Δvps1Δ). The drop in for the dnm1Δvps1Δ again conflicts with the idea that fission is such a small contributor to the biogenesis process. The discrepancies between these various data possibly arise from difficulties in quantifying peroxisome numbers, especially when cells contain a large number of small (and therefore low fluorescence) peroxisomes as may be the case when fission is a strong contributor to biogenesis. The problem of peroxisome counts depending on the brightness of fluorescent markers has been commented on by Jung . There is a continuing push for cell biology to become more quantitative, and to be subject to the use of rigorous models as are common in the physical sciences. Such a push raises significant challenges not only in terms of developing tractable models and justifying the underlying assumptions, but also in terms of the application of the model to complex experimental data. In particular this work highlights the need for greater accounting for detection limits and intensity distributions, as well of time dependent issues, in the reporting and analysis of organelle count data, if these are to be used to infer details of organelle biogenesis mechanisms.

Materials and methods

All calculations were performed in python 2.7, running either via Cygwin under Windows 8.1, or Linux Mint 17.0. The code used for running stochastic simulations and for calculating distributions via Equation 2 is given in Source code 1,2. Time units are arbitrary. The time step for the simulations in Figure 1 was 0.0001 units. Equation 2 can be reformulated in terms of three parameters, e.g. k/γ, k/γ, k/γ; thus for example the parameters {k= 2.0, k= 0.9; γ = 1.0, k= 0.02} and any uniformly scaled set of parameters (e.g. {k= 4.0, k= 1.8; γ = 2.0, k= 0.04}) yield the same limiting distribution. As in the SMOP paper, the Fano factor is defined here as . In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included. Thank you for submitting your work entitled "A reanalysis of the stochastic model of organelle production" for peer review at eLife. Your submission has been favorably evaluated by Vivek Malhotra (Senior Editor) and two reviewers. The reviewers have discussed the reviews with one another and the Senior Editor has drafted this decision to help you prepare a revised submission. Specifically, the reviewers have raised a concern that your arguments are too aggressive and it would be best to tone down your statements. For example, you refer to "a number of errors" in the previous paper. The reviewers feel that this is incorrect or at least overstates the case. The reviewers agree that there was a major error in the O'Shea paper, namely the use of the shifted rather than the truncated Poisson distribution. Other than this, the mathematical results reported in the previous paper are correct as far as they go. Your manuscript shows that certain expressions in the Mukherji and O'Shea paper are exact, even though O'Shea and colleagues had reported them as approximate. This cannot be considered an error. The use of the word "dominant" is misleading, but is not a mathematical error. An abbreviated summary of the reviews follows. A) Assessment of the manuscript: First, I can find nothing mathematically incorrect about the present manuscript. The arguments are sound. Unlike the Mukherji and O'Shea paper, results are explicitly derived and all steps are detailed so readers are convinced of the correctness of these results. The paper is written in a rather pedagogical manner, the various results presented here are basically textbook material. For instance, deriving the detailed balance condition, explaining the stochastic simulation code, deriving the Poisson distribution, etc. would be standard graduate-level material typically not reported in a physics paper. I do think a good point made in the present manuscript is the issue of identifiability, i.e. is it possible to identify an underlying microscopic model based on certain macroscopic measurements? It is clear from the new analysis that many combinations of parameter values can "mix together" to produce indistinguishable Fano factors, etc. However, on its own I don't think this analysis would have been considered a strong contribution. It is only as a critique of the Mukherji and O'Shea paper that this manuscript might be considered. B) The contribution of the Mukherji and O'Shea paper: The Mukherji and O'Shea paper made three major contributions: 1) It put forward the idea that heterogeneity in organelle number could be used to distinguish competing models of organelle biogenesis. 2) It made predictions from different models of biogenesis. 3) It reported experiments and compared them with predictions. C) The critique of the Mukherji and O'Shea paper: There are two major concerns raised about the Mukherji and O'Shea paper: 1) Were the simulations and the experiments truly done so the equilibrium assumption is correct? 2) The use of the shifted rather than the truncated Poisson distribution. There are also two minor concerns: 3) What is the basis for various approximate Fano factors reported in the Mukherji and O'Shea paper? 4) The use of the term "dominance" in many places gives the reader the wrong impression. Let's analyse each of these in turn. 1) During the review of the Mukherji and O'Shea paper, the issue of cell division as a confounding factor was explicitly raised. The authors' response was to provide more detailed simulations in which they showed that the Fano factors did not vary greatly when partitioning of organelles during division was taken into account, and was approximately as observed when the only mode of organelle reduction was a first-order decay term. This argument was convincing, assuming that equilibrium had truly been reached. I will add that the idea mentioned in the present manuscript (subsection “Does the SMOP analysis imply equilibrium distributions?”) that only if the time to equilibrium is short compared to the cell cycle time will the limiting distribution be valid, is a matter of scope. If the model explicitly includes cell division as one of the processes, that larger model will itself reach a limiting distribution. I agree that Mukherji and O'Shea provided no evidence that equilibrium had been reached, either in the simulations or the experiments, and were not requested to by the reviewers. This condition is typically assumed to be correct, and rarely are authors asked for evidence. 2) Is there any evidence that the simulations of the Mukherji and O'Shea paper had not reached equilibrium? I would assume that Mukherji and O'Shea would have done the correct validation steps, and there is some evidence of that in their Figure 1 for example. However, it is then difficult to understand how the error about the truncated vs. the shifted Poisson distribution could have arisen. I admit that this point could have been discovered by the reviewers if they had explicitly repeated some of the reported calculations in the Mukherji and O'Shea paper, and I for one overlooked it. It is unclear how the exact stochastic simulations could be in agreement with an incorrect shifted Poisson prediction. In the Mukherji and O'Shea paper no simulation matching the shifted Poisson prediction is given. Perhaps this simulation was never done, otherwise Mukherji and O'Shea would have discovered the discrepancy. 3) For certain expressions in the Mukherji and O'Shea paper, which arise out of an approximate calculation using the fluctuation-dissipation theorem (subsection “Incorrect analysis of cases with k ≠ 0” of this manuscript), it is not explained how they were derived. This is a valid but minor concern; Mukherji and O'Shea would likely be able to provide the derivation. The point that the approximations were not necessary given the existence of an exact result is worth reporting but does not on its own detract from the original findings. In particular, many expressions derived in the Mukherji and O'Shea paper are correct, as shown in the present manuscript. 4) The misleading use of the word "dominant". There are scientific contexts where "dominance" is interchangeable with "greater than". E.g. in game theory, the word "dominance" is applied when the payoff from one strategy is just greater than from other strategy. Mukherji and O'Shea used the word "dominance" or "dominates" throughout their paper, but "contributed greater than" or some such term might have been more appropriate. I agree that this gives the reader the wrong impression, but does not, on its own, invalidate the results of Mukherji and O'Shea. D) Is the critique valid? I had said in my original review and I reiterate there: the ideas in the Mukherji and O'Shea paper were worth reporting, and I had hoped the original Mukherji and O'Shea paper would stimulate discussion and be improved upon with new analyses and measurements. The present manuscript in a sense validates this hope: discussion has indeed been stimulated. However, it is true that the original paper contained one major error not caught in the review process: use of the shifted rather than the truncated Poisson distribution. The present author argues that this weakens the central attraction of the previous paper, which was the strong agreement of theory and experiment. I would argue that the agreement itself, which was admittedly a striking feature, was not the central point on which the Mukherji and O'Shea paper stood. If the original paper had reported the correct distribution and highlighted the discrepancy with measured data, that would have also been food for thought. Overall, I feel this is a more accessible and comprehensive theoretical analysis than the Mukherji and O'Shea paper provided, and serves to continue the original discussion. Specifically, the reviewers have raised a concern that your arguments are too aggressive and it would be best to tone down your statements. I have softened the language throughout the text. In addition, some of the references to parts of Figure 1 and Appendix 4 Figure 1 have also been improved.

Appendix 4 Figure 1.

Comparison of Poisson (black filled circles), shifted Poisson (red filled squares), and truncated Poisson (green open circles) distributions calculated for λ = 2.5.

The values in a shifted Poisson distribution are the same as in the corresponding Poisson distribution, however the whole distribution is shifted to the right by one unit in n. The value at n = 0 is set to zero. In a truncated Poisson the value at n = 0 is set to zero, and the rest of the distribution has the same functional form as the corresponding Poisson distribution. Due to the loss of the n = 0 point then normalisation causes the remaining values to be slightly higher in the truncated Poisson distribution than in the corresponding Poisson distribution.

DOI: http://dx.doi.org/10.7554/eLife.10167.009

I have also taken the opportunity to add one sentence where I explain how I hope this will stimulate discussion of the underlying premises of the model (Discussion: “Our analysis has shown that the term […] contributions from the two processes”). For example, you refer to "a number of errors" in the previous paper. The reviewers feel that this is incorrect or at least overstates the case. The reviewers agree that there was a major error in the O'Shea paper, namely the use of the shifted rather than the truncated Poisson distribution. Other than this, the mathematical results reported in the previous paper are correct as far as they go. I have toned down some of these statements. However the fact that the value of the Fano factor given by the expression 1 kfission/kde novo is equal to 2 when the terms kfission and kde novo are equal and not to 1 as stated in the SMOP paper is a clear second mathematical error and was the reason why the kfission term appeared so “dominant” in Figure 3D, since the value for oleate growth is so close to 2 (almost within the stated error). Your manuscript shows that certain expressions in the Mukherji and O'Shea paper are exact, even though O'Shea and colleagues had reported them as approximate. This cannot be considered an error. The use of the word "dominant" is misleading, but is not a mathematical error. The detailed comments of the reviewers are pasted below. I have used the term “misleading” now so that I hope that will lead people to look more closely at the original arguments – and that that can lead to some of the healthy discussion that the reviewers refer to. I am particularly keen to know the origin of the expression since I cannot find a way that it follows from their Equation 1 but maybe they have derived it via a different route. I am very happy to remove that comment if necessary, as the peroxisome model plus fusion does not seem to have much relevance given prevalent models of peroxisome biogenesis (detailed query is below). A) Assessment of the manuscript: First, I can find nothing mathematically incorrect about the present manuscript. The arguments are sound. Unlike the Mukherji and O'Shea paper, results are explicitly derived and all steps are detailed so readers are convinced of the correctness of these results. The paper is written in a rather pedagogical manner, the various results presented here are basically textbook material. For instance, deriving the detailed balance condition, explaining the stochastic simulation code, deriving the Poisson distribution, etc. would be standard graduate-level material typically not reported in a physics paper. This area is so multi-disciplinary that I felt I should make the Discussion as accessible to as many workers in the field as possible. In addition, giving explicit examples help remove as much ambiguity as possible from definitions. I give the derivation of the Poisson distribution to show that it does arise from the recurrence relation: many people will be much more familiar with the Poisson distribution in the context of the number of successes of a large number of low-probability-of-success trials, which is really rather different from the birth-death type situation. I also give the Poisson (Golgi) case to show that my analysis does agree with the SMOP analysis in that case. I give a full explanation of the principle of detailed balance as used here because the application here seems to me to be different from the thermodynamic principle of detailed balance. In a cellular system it seems there is no objection to a cycle having stable populations and yet sustaining a net flux round the cycle, whereas that is forbidden in closed thermodynamic systems. In the case of the present model it is the linear (and n>=0) nature of the set of linked reversible processes that lead to detailed balance. If there are accessible presentations of this material readily available and couched in sufficiently identical terms then I am more than happy to direct readers to those instead. I do think a good point made in the present manuscript is the issue of identifiability, i.e. is it possible to identify an underlying microscopic model based on certain macroscopic measurements? It is clear from the new analysis that many combinations of parameter values can "mix together" to produce indistinguishable Fano factors, etc. However, on its own I don't think this analysis would have been considered a strong contribution. It is only as a critique of the MO paper that this manuscript might be considered. In light of this comment, and the encouragement that this paper can form part of a healthy discussion, I have added a short paragraph discussing the implications of the underlying assumptions of the model (Discussion: “Our analysis has shown that the term […] contributions from the two processes”). […] C) The critique of the Mukherji and O'Shea paper:There are two major concerns raised about the Mukherji and O'Shea paper: 1) Were the simulations and the experiments truly done so the equilibrium assumption is correct? 2) The use of the shifted rather than the truncated Poisson distribution. There are also two minor concerns: 3) What is the basis for various approximate Fano factors reported in the Mukherji and O'Shea paper? 4) The use of the term "dominance" in many places gives the reader the wrong impression. I feel this list omits the crucial miscalculation of the value of the Fano factor for the case where the fission and de novo rates are equal in the peroxisome model, as I have discussed above. Let's analyse each of these in turn. 1) During the review of the Mukherji and O'Shea paper, the issue of cell division as a confounding factor was explicitly raised. The authors' response was to provide more detailed simulations in which they showed that the Fano factors did not vary greatly when partitioning of organelles during division was taken into account, and was approximately as observed when the only mode of organelle reduction was a first-order decay term. This argument was convincing, assuming that equilibrium had truly been reached. I will add that the idea mentioned in the present manuscript (subsection “Does the SMOP analysis imply equilibrium distributions?”) that only if the time to equilibrium is short compared to the cell cycle time will the limiting distribution be valid, is a matter of scope.If the model explicitly includes cell division as one of the processes, that larger model will itself reach a limiting distribution. I agree that Mukherji and O'Shea provided no evidence that equilibrium had been reached, either in the simulations or the experiments, and were not requested to by the reviewers. This condition is typically assumed to be correct, and rarely are authors asked for evidence. The cell division model is only included as a brief supplement to Figure 1 of the SMOP paper, and only for the Golgi ({de novo; fission}) model. Almost no details are given and that is significant because the issue of timescale becomes relevant in such a model. How does the “regular time interval” relate to the timescale of the various processes? 2) Is there any evidence that the simulations of the Mukherji and O'Shea paper had not reached equilibrium? I would assume that Mukherji and O'Shea would have done the correct validation steps, and there is some evidence of that in their 3) For certain expressions in the Mukherji and O'Shea paper, which arise out of an approximate calculation using the fluctuation-dissipation theorem (subsection “Incorrect analysis of cases with k If it can be obtained I am very happy to drop this point. However I cannot see that it can follow from Equation 1. Equation 1 is If the limit is looked for under the condition kfusion<(n(n-1))>) ~ γ , it is not clear how to proceed because one immediately has terms in . (It appears that one can make some progress by then using variance=-2 but that does not seem to lead to the quoted result). If one instead writes the first fraction in the denominator of the left hand side of the above equation as X and try to solve simultaneously, along with the quoted result then one obtains an expression for X that depends on kfission and kde novo so again this route does not seem to lead to anywhere. However as is inextricably linked to all parameters then maybe there is a hidden relationship that leads to the quoted result. The point that the approximations were not necessary given the existence of an exact result is worth reporting but does not on its own detract from the original findings. In particular, many expressions derived in the Mukherji and O'Shea paper are correct, as shown in the present manuscript. I agree, but if quantitative methods are to be adopted more within the mainstream then I think it behoves the modelers to make sure they keep things as simple as possible, otherwise it becomes the preserve of applied mathematician “wizards”. The recurrence method is much simpler than the FDT approach and should be accessible to more of the community. 4) The misleading use of the word "dominant". There are scientific contexts where "dominance" is interchangeable with "greater than". E.g. in game theory, the word "dominance" is applied when the payoff from one strategy is just greater than from other strategy. Mukherji and O'Shea used the word "dominance" or "dominates" throughout their paper, but "contributed greater than" or some such term might have been more appropriate. I agree that this gives the reader the wrong impression, but does not, on its own, invalidate the results of Mukherji and O'Shea. I wrestled with this and sought the opinions of colleagues about what they would understand by “dominant”. If one rate were dominant over another then some people responded by saying it would imply a condition as extreme as differing by an order of magnitude. Personally I would want to see a factor of about three. Since the SMOP paper is a quantitative paper one would expect to see a statement of the ratio of the two rates, rather than the loose word dominant. In the present context the argument is made more complex by the use of a Fano factor of 1 rather than 2 as the boundary between equal rates. This is a significant difference when trying to evaluate the meaning of a Fano factor of 2.4 /-0.2. According to my analysis the rates are in the ratio 0.58:0.42, or roughly 1.4:1. Given the experimental and theoretical uncertainties this does not look to me like a secure case of dominance. I agree that there are cases where dominant can mean “one just greater than the other”, and the game theory use is such a case. In politics an election vote distributed 0.58:0.42 would tend to be seen as indicating the dominance of a party. Indeed if two rates act against each other then only a tiny “dominance” can lead to run away behaviour. But here we are evaluating the relative magnitude of two rates that act in the “same direction” so I think that the requirement that dominance indicates a clear cut difference is valid. I have amended my wording to “misleading” as suggested. D) Is the critique valid? A4F1I had said in my original review and I reiterate there: the ideas in the Mukherji and O'Shea paper were worth reporting, and I had hoped the original Mukherji and O'Shea paper would stimulate discussion and be improved upon with new analyses and measurements. The present manuscript in a sense validates this hope: discussion has indeed been stimulated. However, it is true that the original paper contained one major error not caught in the review process: use of the shifted rather than the truncated Poisson distribution. The present author argues that this weakens the central attraction of the previous paper, which was the strong agreement of theory and experiment. I would argue that the agreement itself, which was admittedly a striking feature, was not the central point on which the Mukherji and O'Shea paper stood. If the original paper had reported the correct distribution and highlighted the discrepancy with measured data, that would have also been food for thought. The extremely close agreement of the observed and (incorrectly) predicted Fano factors seems to have provided a key link in the argument that the underlying model is a good one. That, in itself, is part of a fairly standard inductive scientific method with its attendant strengths and weaknesses. There was very little justification otherwise of the underlying model, and certain aspects of the model deserve more discussion. For instance the use of a rate for fusion proportional to n(n-1) would seem appropriate if the organelles could be considered to be freely moving and well mixed as the molecules within a liquid. But whether it is appropriate for objects such as vacuoles seems harder to justify. It implies that the likelihood (within the next small delta-t) of a vacuole fusing with another when n = 3 is double that when n = 2. One could envisage that when n = 2 there could be a very similar degree of total contact area with other vacuoles as when n = 3, and so the fusion probability would be very similar. It would depend on vacuole size, composition and “packing”. I am not aware that sufficiently detailed analysis of such factors exists: it is certainly not quoted in the SMOP paper. The observation of very good agreement with the model would reduce the impact of such an objection. However, in the absence of such agreement, then, such discussion is vital: is the model basically a good one but just lacking a little detail or is the model fundamentally lacking key aspects of feedback? I hope this will stimulate more discussion on these issues.

	R_n=1_→n=0		R_n=2_→n=1		R_n=3_→n=2		R_n=4_→n=3		R_n=5_→n=4
n = 0	⇆	n = 1	⇆	n = 2	⇆	n = 3	⇆	n = 4	⇆	etc.
	R_n=0_→n=1		R_n=1_→n=2		R_n=2_→n=3		R_n=3_→n=4		R_n=4_→n=5

8 in total

2 in total

Review 1. Glycosome biogenesis in trypanosomes and the de novo dilemma.

Authors: Sarah Bauer; Meredith T Morris
Journal: PLoS Negl Trop Dis Date: 2017-04-20

Review 2. Evolving mtDNA populations within cells.

Authors: Iain G Johnston; Joerg P Burgstaller
Journal: Biochem Soc Trans Date: 2019-10-31 Impact factor: 5.407