| Literature DB >> 23785448 |
Olivier J T Briët1, Priyanie H Amerasinghe, Penelope Vounatsou.
Abstract
INTRODUCTION: With the renewed drive towards malaria elimination, there is a need for improved surveillance tools. While time series analysis is an important tool for surveillance, prediction and for measuring interventions' impact, approximations by commonly used Gaussian methods are prone to inaccuracies when case counts are low. Therefore, statistical methods appropriate for count data are required, especially during "consolidation" and "pre-elimination" phases.Entities:
Mesh:
Year: 2013 PMID: 23785448 PMCID: PMC3681978 DOI: 10.1371/journal.pone.0065761
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Monthly malaria case counts and rainfall in Gampaha District over time.
Panel A shows monthly malaria case counts and panel B shows monthly rainfall.
Akaike’s information criterion (AIC) for selected (Gaussian) models on Box-Cox transformed data.
| Model | Excluding rainfall | Including rainfall |
| SARIMA(3 | 1638.61 | 1640.35 |
| SARIMA(3 | 1638.95 | 1640.74 |
| SARIMA(0,1,3 | 1638.44 | 1640.36 |
| SARIMA(0,1,3 | 1638.79 | 1640.74 |
| ARIMA(3 | 1632.2 | 1630.68 |
| ARIMA(0,1,3 | 1631.27 | 1630.07 |
Legend: SOH: second order harmonics. For all these models, where applicable, the autoregrdessive ( and ) or moving average parameters ( and ) corresponding to the first two lags were omitted.
Selection criteria statistics for selected negative binomial models.
| Model | nep | DIC on full series | MARE | DIC on first half | MARE out of sample* |
| GSARIMA(3 | 3 | 4637.8 | 0.4054 | 2266.6 | 0.3970 |
| GSARIMA(3 | 3 |
| 0.3883 | 2243.2 |
|
| GSARIMA(3 | 3 | 4351.1 | 0.3898 | 2243.0 | 0.3684 |
| GSARIMA(0,1,3 | 3 | 4352.4 | 0.3883 | 2243.7 | 0.3661 |
| GSARIMA(0,1,3 | 3 | 4352.8 | 0.3882 | 2243.1 | 0.3669 |
| GSARIMA(3 | 4 | 4352.5 | 0.3876 | 2240.0 | 0.3795 |
| GSARIMA(3 | 4 | 4352.9 | 0.3896 | 2240.6 | 0.3726 |
| GSARIMA(0,1,3 | 4 | 4354.3 | 0.3869 | 4354.2 | 0.3721 |
| GSARIMA(0,1,3 | 4 | 4355.0 | 0.3893 | 2241.1 | 0.3775 |
| GARIMA(3 | 6 | 4335.7 | 0.3933 | 2246.2 | 0.3796 |
| GARIMA(0,1,3 | 6 | 4336.5 | 0.3910 | 2246.1 | 0.3750 |
| GARIMA(3 | 7 | 4399.8 | 0.4000 | 2212.3 | 0.3979 |
| GARIMA(3 | 7 |
|
| 2236.7 | 0.3859 |
| GARIMA(0,1,3 | 7 | 4333.8 | 0.3899 | 2237.1 | 0.3845 |
Legend: IL: identity link; LL: logarithmic link function with transformation method “ZQ1” corresponding to equation 2.2 in Zeger and Qaqish [24] and with ; nep: number of estimated parameters; DIC: Deviance Information Criterion; MARE: mean absolute relative error of fitted values; RF: with rainfall lagged at two months; SOH: second order harmonics; *: The ‘MARE out of sample’ was calculated for the second half of the series, with the model fitted to the first half of the series only. For all models, where applicable, the autoregressive ( and ) or moving average parameters ( and ) corresponding to the first two lags were omitted.
Parameter estimates (mean and 95% credible interval) of selected negative binomial models.
| Parameter | GARIMA(3′,1,0)-SOH-RF | GSARIMA(3′,1,0)×(1,0,0)12 |
|
| −0.34 (−0.66, −0.02) | |
|
| −0.10 (−0.23, 0.02) | |
|
| −0.15 (−0.28, −0.02) | |
|
| 0.14 (0.06, 0.21) | |
|
| 0.16 (0.07, 0.24) | |
|
| −0.10 (−0.19, 0.00) | −0.13 (−0.23, −0.04) |
|
| 0.12 (0.03, 0.22) | |
|
| 4.54 (3.87, 5.27) | 4.32 (3.69, 5.04) |
|
| 0.19 (0.07, 0.32) | |
|
| 0.21 (0.13, 0.29) | |
|
| 4.83 (3.30, 6.35) | |
|
| −0.69 (−1.05, −0.34) |
Legend: GARIMA(3′,1,0)-SOH-RF = GARIMA(3,1,0) model with parameters for the first two lags ( and ) omitted, second order harmonics and rainfall lagged at 2 months (in m); GSARIMA(3′,1,0)×(1,0,0)12 = GSARIMA(3,1,0)×(1,0,0)12 model with parameters for the first two lags ( and ) fixed to zero; AH = annual harmonic, SAH = semi-annual harmonic; $ = derived parameter, phase shift = phase shift of the cosine function expressed in months.
Figure 2Posterior predictive distributions for the last 12 months of the Gampaha malaria case count series.
In each panel, representing each a month in the last year of the series, the black and the red lines are the outline histogram of the density of the posterior predictive distribution of the negative binomial model and a (Bayesian) Gaussian model on Box-Cox transformed data, respectively. Models were fitted to the entire data set. In each panel, the observed case count is represented by a blue dot.
Figure 3Cumulative distribution function of randomized cumulative probabilities.
The black line represents the cumulative distribution function of randomized cumulative probabilities of the model on monthly numbers of malaria cases in Gampaha, Sri Lanka. The red line represents the cumulative distribution function of randomized residual probabilities of the Gaussian model on Box-Cox transformed data. The light grey diagonal line (cumulative distribution equals randomized probability) represents on average appropriate predictive distributions. Dotted lines represent 95% confidence boundaries for proportions equalling probability. A: for the last 392 months in the series. B: for the last fifty months in the series.
Figure 4Normal Q-Q plot of normalized randomized quantile residuals of the selected model.
Figure 5Plot of normalized randomized quantile residuals of the model against the logarithm of relative change.
Monthly malaria case counts were logarithmically transformed after adding one. Then for each month, the difference between this value and the value for the previous month was taken. The diagonal is the fitted regression line.
Figure 6Plot of the autocorrelation function of normalized randomized quantile residuals of the selected model.