| Literature DB >> 26451161 |
Wendi Liu1, Sanyi Tang1, Yanni Xiao2.
Abstract
The aim of the present study is to apply simple ODE models in the area of modeling the spread of emerging infectious diseases and show the importance of model selection in estimating parameters, the basic reproduction number, turning point, and final size. To quantify the plausibility of each model, given the data and the set of four models including Logistic, Gompertz, Rosenzweg, and Richards models, the Bayes factors are calculated and the precise estimates of the best fitted model parameters and key epidemic characteristics have been obtained. In particular, for Ebola the basic reproduction numbers are 1.3522 (95% CI (1.3506, 1.3537)), 1.2101 (95% CI (1.2084, 1.2119)), 3.0234 (95% CI (2.6063, 3.4881)), and 1.9018 (95% CI (1.8565, 1.9478)), the turning points are November 7,November 17, October 2, and November 3, 2014, and the final sizes until December 2015 are 25794 (95% CI (25630, 25958)), 3916 (95% CI (3865, 3967)), 9886 (95% CI (9740, 10031)), and 12633 (95% CI (12515, 12750)) for West Africa, Guinea, Liberia, and Sierra Leone, respectively. The main results confirm that model selection is crucial in evaluating and predicting the important quantities describing the emerging infectious diseases, and arbitrarily picking a model without any consideration of alternatives is problematic.Entities:
Mesh:
Year: 2015 PMID: 26451161 PMCID: PMC4586906 DOI: 10.1155/2015/207105
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Evidence categories for the Bayes factor (Jeffreys, 1961 [10]).
| Bayes factor | Interpretation |
|---|---|
|
| Decisive evidence for model |
| 1/100 < | Very strong evidence for model |
| 1/30 < | Strong evidence for model |
| 1/10 < | Substantial evidence for model |
| 1/3 < | Anecdotal evidence for model |
|
| No evidence |
| 1 < | Anecdotal evidence for model |
| 3 < | Substantial evidence for model |
| 10 < | Strong evidence for model |
| 30 < | Very strong evidence for model |
|
| Decisive evidence for model |
Figure 1Model fitting of simulated data generated from Richards model and Gompertz model using four candidate models. The data in (a) and (b) are produced from Richards model; the data in (c) and (d) are produced from Gompertz model. The simulated data points are shown as black dot points. The curves represent the fitting to the data points for four models, respectively. The grey areas are the 95% confidence intervals of each lines.
The corresponding Bayes factor and AIC about four models based on A/H1N1 and Ebola data.
| Data source |
|
| |||
|---|---|---|---|---|---|
| Logistic | Gompertz | Rosenzweig | Richards | ||
| H1N1 | Logistic | 1 | Inf | Inf | 1.34 |
| Gompertz0 | 0 | 1 | / | — | |
| Rosenzweig0 | 0 | / | 1 | — | |
| Richards model | 0.75 | Inf | Inf | 1 | |
| AIC | 249 | 362 | 592 | 254 | |
|
| |||||
| West Africa | Logistic | 1 | Inf | Inf | 2.1528 |
| Gompertz0 | 0 | 1 | / | — | |
| Rosenzweig0 | 0 | / | 1 | — | |
| Richards model | 0.4645 | Inf | Inf | 1 | |
| AIC | 5200 | 49500 | 1872800 | 5400 | |
|
| |||||
| Guinea | Logistic | 1 | Inf | Inf | 1.25 |
| Gompertz0 | 0 | 1 | / | — | |
| Rosenzweig0 | 0 | / | 1 | — | |
| Richards model | 0.8 | Inf | Inf | 1 | |
| AIC | 1991 | 3427 | 18476 | 1998 | |
|
| |||||
| Liberia | Logistic | 1 | Inf | Inf | — |
| Gompertz0 | 0 | 1 | / | — | |
| Rosenzweig0 | 0 | / | 1 | — | |
| Richards model | 5 | Inf | Inf | 1 | |
| AIC | 6308 | 6547 | 7980 | 2559 | |
|
| |||||
| Sierra Leona | Logistic | 1 | — | — | — |
| Gompertz | 102310 | 1 | 2.96 | 0.28 | |
| Rosenzweig | 34750 | 0.34 | 1 | 0.095 | |
| Richards model | 362940 | 3.55 | 10.48 | 1 | |
| AIC | 15432 | 6251 | 7038 | 5400 | |
— means a very small number.
Inf means a sufficiently big number.
0 means the probability of being chosen for model is zero.
/ means a no number (i.e., 0/0).
Figure 2(a) Model selection based on the accumulate cases data from the 8th Hospital of Xi'an from 3 September to 21 September with the last 2000-group parameters of Markov chain; (b) model fitting of A/H1N1 data in Xi'an, 2009. The curves represent the fitting to the data for four models, respectively. The grey areas are the 95% confidence intervals of each curve. Here, cyan curve represents Logistic model; blue curve represents Gompertz model; red curve represents Rosenzweig model; black curve represents Richards model. Note that the cyan curve and black curve almost coincide.
The estimations of R 0, turning point t , and final size for the best model.
| Data source |
| 95% CI |
| 95% CI | Final size | 95% CI |
|---|---|---|---|---|---|---|
| H1N1 | 1.9005 | (1.8869, 1.9142) | 231 | (22, 24) | 1013 | (996, 1030) |
| West Africa | 1.3522 | (1.3506, 1.3537) | 2272 | (226, 228) | 257943 | (25630, 25958) |
| Guinea | 1.2101 | (1.2084, 1.2119) | 2394 | (237, 241) | 39165 | (3865, 3967) |
| Liberia | 3.0234 | (2.6063, 3.4881) | 1306 | (121, 149) | 98867 | (9740, 10031) |
| Sierra Leona | 1.9018 | (1.8565, 1.9478) | 1658 | (157, 174) | 126339 | (12515, 12750) |
1Denoting turning point during Sep. 25–Sep. 27, 2009.
2Denoting turning point during Nov. 6–Nov. 8, 2014.
3Denoting the final time during Sep. 13–Sep. 17, 2015.
4Denoting turning point during Nov. 15–Nov. 19, 2014.
5Denoting the final time during Dec. 24–Dec. 31, 2015.
6Denoting turning point during Sep. 23–Oct. 21, 2014.
7Denoting the final time during Sep. 19–Sep. 26, 2015.
8Denoting turning point during Oct. 27–Nov. 12, 2014.
9Denoting the final time during Dec. 15–Dec. 22, 2015.
the first stage cannot reach final size because of the beginning of the second stage.
R 0 was computed using the mean generation interval of T = 4 days [11] about A/H1N1 and T = 12 days [12] about Ebola.
The estimations of all parameters with respect to the best model.
| Data source | Parameter | Mean | Std. | MC_err | Tau | Geweke |
|---|---|---|---|---|---|---|
| H1N1 |
| 0.1605 | 9.1570 | 3.3003 | 6.6007 | 0.9999 |
|
| 1013 | 8.6356 | 0.0341 | 6.6724 | 0.9997 | |
|
| ||||||
| West Africa |
| 0.0251 | 4.8634 | 1.5757 | 6.6144 | 0.9999 |
|
| 25794 | 83.712 | 0.2631 | 6.6658 | 0.9999 | |
|
| ||||||
| Guinea |
| 0.0159 | 6.1289 | 2.1429 | 6.7164 | 0.9999 |
|
| 3916 | 26.131 | 0.1143 | 6.6456 | 0.9999 | |
|
| ||||||
| Liberia |
| 0.0919 | 6.19 | 6.1022 | 44.625 | 0.9973 |
|
| 9886 | 74.03 | 0.5253 | 29.833 | 0.9999 | |
|
| 0.2333 | 0.0225 | 2.0514 | 39.52 | 0.9963 | |
|
| ||||||
| Sierra Leona |
| 0.0536 | 1.0186 | 4.0645 | 12.058 | 0.9999 |
|
| 12633 | 59.697 | 0.2866 | 11.61 | 0.9999 | |
|
| 0.3985 | 0.0149 | 6.4121 | 12.063 | 0.9997 | |
Figure 3Model selection based on the accumulate Ebola cases for (a) West Africa, (b) Guinea, (c) Liberia, and (d) Sierra Leone with the last 2000-group parameters of Markov chain. The Logistic model and Richards model are selected in (a) and (b), and Richards model is selected in (c) and (d).
Comparison of the reported and model predicted cases of Ebola based in Richards model on June 14, 2015.
| Source of data | Reported cases (number)† | Predicted cases (number) | The rate of underestimated or overestimated model |
|---|---|---|---|
| West Africa | 27305 | 25693 | −5.9% |
| Guinea | 3674 | 3778 | +2.8% |
| Liberia | 10666 | 9842 | −7.7% |
| Sierra Leone | 12965 | 12515 | −3.5% |
†Source: World Health Organization (http://apps.who.int/ebola/current-situation/ebola-situation-report-17-june-2015).
Figure 4Model fitting of 2014-2015 Ebola outbreaks in (a) West Africa, (b) Guinea, (c) Liberia, and (d) Sierra Leone. Data of the cumulative numbers of infected cases are shown as black dots. The curves represent the fitting to the data for four models, respectively. The grey areas are the 95% confidence interval of each curves. Cyan curve represents Logistic model; blue curve represents Gompertz model; red curve represents Rosenzweig model; black curve represents Richards model.
Figure 5The effects of different generation time in West Africa, Sierra Leone, Liberia, and Guinea on the basic reproduction number of Ebola. Dotted lines represent the 95% confidence interval of R 0 generated by the 95% confidence interval of r.
The selection of model about different data.
| Data | Xi'an | West Africa | Guinea | Liberia | Sierra Leone |
|---|---|---|---|---|---|
| (H1N1) | (Ebola) | (Ebola) | (Ebola) | (Ebola) | |
| Model | L (R) | L (R) | L (R) | R | R |
L denotes Logistic model.
R represents Richards model.
L (R) means both Logistic model and Richards model.
The estimation of R 0 and turning point t , 95% conditional confidence (95% CI) for each dataset and candidate model.
| Model |
| 95% CI |
| 95% CI |
|---|---|---|---|---|
| Liberia (Ebola) | ||||
|
| 3.0223 | (2.6026, 3.4855) | 130 | (111, 159) |
|
| 1.784 | (1.7661, 1.802) | 137 | (135, 140) |
|
| 1.197 | (1.1935, 1.2005) | 128 | (126, 130) |
|
| ||||
| Sierra Leone (Ebola) | ||||
|
| 1.9016 | (1.8563, 1.9475) | 165 | (157, 174) |
|
| 1.1507 | (1.1478, 1.1536) | 164 | (162, 168) |
|
| 1.5894 | (1.582, 1.5967) | 171 | (169, 173) |
The models are sorted from the best to the worst.
Figure 6The estimation of R 0 and turning point t . Here the 95% CI and the deviation between estimated values and correct values of Logistic model, Richards model, and Gompertz model for Sierra Leone and Liberia dataset have been given.
| Data source |
|
| |||
|---|---|---|---|---|---|
| Logistic | Gompertz | Rosenzweig | Richards | ||
| Richards model | Logistic | 1 | — | — | — |
| Gompertz | Inf | 1 | Inf | — | |
| Rosenzweig | Inf | — | 1 | — | |
| Richards model | 8866.4 | Inf | 144.8 | 1 | |
| AIC | 260 | 241 | 245 | 231 | |
|
| |||||
| Gompertz model | Logistic | 1 | — | — | — |
| Gompertz | Inf | 1 | 19.8855 | 13.0241 | |
| Rosenzweig | Inf | — | 1 | — | |
| Richards model | Inf | — | Inf | 1 | |
| AIC | 618 | 440 | 601 | 556 | |
| Data source | Parameter | Mean | Std. | MC_err | Tau | Geweke |
|---|---|---|---|---|---|---|
| Richards model |
| 0.3095 | 0.0335 | 3.9372 | 95.36 | 0.9923 |
|
| 100.26 | 2.8313 | 0.0255 | 85.563 | 0.9991 | |
|
| 0.3914 | 0.0579 | 6.7027 | 95.533 | 0.9919 | |
|
| ||||||
| Gompertz model |
| 0.1504 | 0.0062 | 9.2746 | 60.727 | 0.9983 |
|
| 100 | 1.8169 | 0.0249 | 62.058 | 0.9991 | |
— means a very small number.
Inf indicates a sufficiently big number.