| Literature DB >> 30430330 |
C D Bayliss1, C Fallaize2, R Howitt2, M V Tretyakov3.
Abstract
Temporal evolution of a clonal bacterial population is modelled taking into account reversible mutation and selection mechanisms. For the mutation model, an efficient algorithm is proposed to verify whether experimental data can be explained by this model. The selection-mutation model has unobservable fitness parameters, and, to estimate them, we use an Approximate Bayesian Computation algorithm. The algorithms are illustrated using in vitro data for phase variable genes of Campylobacter jejuni.Entities:
Keywords: Approximate Bayesian computation; Phase variable genes; Population genetics; Stochastic modelling
Mesh:
Year: 2018 PMID: 30430330 PMCID: PMC6373360 DOI: 10.1007/s11538-018-0529-9
Source DB: PubMed Journal: Bull Math Biol ISSN: 0092-8240 Impact factor: 1.758
Fig. 1The domain (top right), which is obtained from the (p, q) domain (top left), and the corresponding example of the domain (bottom) (Color figure online)
Fig. 2Application of Algorithm 3.1 to the data for gene cj0617. The domain is shown by black dashed lines; the blue dashed curves are and ; the solid blue curve is (Color figure online)
Fig. 3Application of Algorithm 3.1 to the data for cj01342 gene. The domain is shown by black dashed lines; the blue dashed curve is ; the solid blue curve is . The blue cross-hatched region shows the domain covered by y(x; u, v) as described in the text (Color figure online)
Single-gene data, estimates and results for the independence of fitness parameters investigation
| Gene |
|
|
|
|---|---|---|---|
| (0.9433, 0.0567) | (0.2621, 0.7379) | (1, 1.016) | |
| (0.0567, 0.9433) | (0.8267, 0.1733) | (1.02, 1) | |
| (0.0533, 0.9467) | (0.8288, 0.1712) | (1.02, 1) |
Three-gene model input (fitness parameters) and results, with and without application of Assumption 2.3
|
|
|
|
|
|---|---|---|---|
| (1.040400, 1.020000, | (0.099859, 0.000256, | (1.018, 1.007, | (0.143176, 0.011395, |
| 1.020000, 1.000000, | 0.002181, 0.000143, | 1.009, 1.000, | 0.009522, 0.056227, |
| 1.057046, 1.036320, | 0.877841, 0.000756, | 1.026, 1.027, | 0.685888, 0.033098, |
| 1.036320, 1.016000) | 0.018654, 0.000311) | 1.019, 1.004) | 0.036405, 0.024289) |
Here, and for the single-gene results in Table 1, , and n are fixed at the values , and given in Table 5, where the prior settings for the fitness parameters can also be found. The values of () and () required for the three-gene runs are as in Table 6. We obtain the distances , and ,; since , we reject the independence assumption
Prior settings for dataset 1
| Gene |
|
|
| |
|---|---|---|---|---|
| 12.30 [9.1, 22.2] | 17.88 [11.0, 40.2] | 220 [110, 275] | [1, 1.04] | |
| 4.23 [3.0, 5.7] | 2.15 [1.4, 2.8] | [1, 1.04] | ||
|
|
| [1, 1.04] | ||
| [1, 1] | ||||
| [1.005, 1.06] | ||||
| [1.005, 1.06] | ||||
| [1, 1.04] | ||||
| [1, 1.04] |
Sample data for dataset 1
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 300 | 0.0766 | (0.00333, 0.01, 0.00667, | 141 | 0.112 | (0.15603, 0.00709, 0.01418, |
| 0.92333, 0.04333, | 0.09220, 0.63121, 0.04255, | ||||
| 0, 0, 0.01333) | 0.04255, 0.01418) |
Inputs for the synthetic data experiment
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| 300 | 0.0766 | (0.003, 0.010, 0.007, | [1.005, 1.04] | 150 | 0.108 | (0.13013, 0.01044, 0.01129, |
| 0.924, 0.043, | [1, 1] | 0.13676, 0.63192, 0.00608, | ||||
| 0, 0, 0.013) | [1, 1] | 0.03386, 0.03951) | ||||
| [1, 1] | ||||||
| [1.005, 1.04] | ||||||
| [1.005, 1.04] | ||||||
| [1.005, 1.04] | ||||||
| [1.005, 1.04] |
Results for the synthetic data experiment
| True |
|
|
|---|---|---|
| (1.014, 1.002, 1.007, 1, 1.022 | (1.0162, 1, 1, 1, 1.0252, | (0.12607, 0.00664, 0.00495, 0.11870, |
| 1.01, 1.015, 1.001) | 1.0164, 1.0175, 1) | 0.67638, 0.00745, 0.03145, 0.02837) |
The distance
Sample data and prior settings for dataset 2. Also, , , ,
|
|
|
| |||
|---|---|---|---|---|---|
| 17.88 | 12.30 | 20 [10, 25] | [1, 1.6] | (0.0119, 0.0476, | (0.0115, 0.0230, |
| [11.0, 40.2] | [9.1, 22.2] | [1, 1.6] | 0.0000, 0.7738, | 0.0230, 0.0690, | |
| [1, 2] | 0.1548, 0.0000, | 0.7586, 0.0805, | |||
| [1, 1] | 0.0119, 0.0000) | 0.0345, 0.0000) | |||
| [1.1, 1.8] | |||||
| [1.05, 2.2] | |||||
| [1, 2.2] | |||||
| [1, 1.6] |
Results for the time-dependence experiment. Here
|
|
|
|
|---|---|---|
| (0.15603, 0.00709, 0.01418, 0.09220, | (0.15034, 0.02545, 0.03451, 0.07191, | 0.0830 |
| 0.63121, 0.04255, 0.04255, 0.01418) | 0.59554, 0.05478, 0.04457, 0.02289) |
The minimum, maximum and 95% posterior probability intervals for fitness parameters from time-dependence experiment
|
|
|
| 95% posterior probability intervals for |
|---|---|---|---|
| 1.004021 | 1 | 1.00998 | [1, 1.00925] |
| 1.001056 | 1 | 1.00869 | [1, 1.00618] |
| 1.000296 | 1 | 1.00586 | [1, 1.00410] |
| 1.006620 | 1.00228 | 1.00994 | [1.00425, 1.00961] |
| 1.007894 | 1.00610 | 1.00999 | [1.00708, 1.00982] |
| 1.000000 | 1 | 1.00552 | [1, 1.00341] |
| 1.002977 | 1 | 1.00986 | [1, 1.00895] |
| 1.002558 | 1 | 1.00941 | [1, 1.008791] |
The minimum, maximum and 95% posterior probability intervals () for mutation rates from time-dependence experiment
| Gene |
|
|
| 95% interval ( |
|
|
| 95% interval ( |
|---|---|---|---|---|---|---|---|---|
| 12.308 | 9.135 | 17.580 | 9.534, 15.762 | 16.257 | 11.084 | 25.958 | [11.727, 21.948] | |
| 4.126 | 3.002 | 5.619 | 3.112, 5.248 | 2.152 | 1.405 | 2.800 | [1.580, 2.723] | |
| 0.0724 | 0.0389 | 0.127 | 0.0423, 0.109 | 0.00453 | 0.00294 | 0.00775 | [0.00310, 0.00627] |
The minimum, maximum and 95% posterior probability interval for the number of generations from the time-dependence experiment
|
|
|
| 2.5/97.5 percentiles from |
|---|---|---|---|
| 212 | 145 | 250 | 168, 246 |