| Literature DB >> 30586416 |
Yongbin Wang1, Chunjie Xu2, Zhende Wang1, Shengkui Zhang1, Ying Zhu1, Juxiang Yuan1.
Abstract
BACKGROUND: It is a daunting task to discontinue pertussis completely in China owing to its growing increase in the incidence. While basic to any formulation of prevention and control measures is early response for future epidemic trends. Discrete wavelet transform(DWT) has been emerged as a powerful tool in decomposing time series into different constituents, which facilitates better improvement in prediction accuracy. Thus we aim to integrate modeling approaches as a decision-making supportive tool for formulating health resources.Entities:
Mesh:
Year: 2018 PMID: 30586416 PMCID: PMC6306235 DOI: 10.1371/journal.pone.0208404
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Ljung-Box Q tests of the residuals for the elected four models.
| Lags | SARIMA | NAR | Traditional | Wavelet based SARIMA-NAR | ||||
|---|---|---|---|---|---|---|---|---|
| Box-Ljung | Box-Ljung | Box-Ljung | Box-Ljung | |||||
| 1 | 0.249 | 0.618 | 1.809 | 0.179 | 1.095 | 0.295 | 0.025 | 0.874 |
| 3 | 0.787 | 0.853 | 2.409 | 0.492 | 1.180 | 0.758 | 1.047 | 0.790 |
| 6 | 4.499 | 0.610 | 5.870 | 0.438 | 7.532 | 0.274 | 5.561 | 0.474 |
| 9 | 6.051 | 0.735 | 7.010 | 0.636 | 10.359 | 0.322 | 10.062 | 0.345 |
| 12 | 9.532 | 0.657 | 8.343 | 0.758 | 11.176 | 0.514 | 18.761 | 0.094 |
| 15 | 15.066 | 0.447 | 9.869 | 0.828 | 14.973 | 0.453 | 19.779 | 0.181 |
| 18 | 21.814 | 0.240 | 11.815 | 0.857 | 19.217 | 0.379 | 25.382 | 0.115 |
| 21 | 23.275 | 0.330 | 18.371 | 0.625 | 19.950 | 0.524 | 28.123 | 0.137 |
| 24 | 32.254 | 0.121 | 31.425 | 0.142 | 24.973 | 0.407 | 28.979 | 0.221 |
| 27 | 33.317 | 0.187 | 31.847 | 0.238 | 26.080 | 0.514 | 29.594 | 0.333 |
| 30 | 33.556 | 0.299 | 36.061 | 0.206 | 27.704 | 0.586 | 31.058 | 0.413 |
| 33 | 35.564 | 0.348 | 37.148 | 0.284 | 28.561 | 0.688 | 33.566 | 0.440 |
| 36 | 42.281 | 0.218 | 47.161 | 0.101 | 34.031 | 0.563 | 34.576 | 0.536 |
SARIMA, seasonal autoregressive integrated moving average model; NAR, nonlinear auto-regressive neural network model; Wavelet based SARIMA-NAR, integrating a seasonal autoregressive integrated moving model with a nonlinear autoregressive network model at level 2 of db2 wavelet.
Estimated parameters of the SARIMA(2,1,0)(0,1,1)12 model.
| Parameters | Coefficient | Standard error | ||
|---|---|---|---|---|
| AR1 | -0.448 | 0.079 | -5.662 | <0.001 |
| AR2 | -0.201 | 0.079 | -2.551 | 0.012 |
| SMA1 | 0.678 | 0.072 | 9.427 | <0.001 |
AR1, moving average, lag1; AR2, moving average, lag2; SMA1, seasonal moving average, lag12.
The preferred SARIMA models’ parameters for various target series.
| Target | Approximation | SARIMA model | R2 | Normalized BIC | Ljung-Box | |
|---|---|---|---|---|---|---|
| Statistics | ||||||
| Original | - | SARIMA(2,1,0)(0,1,1)12 | 0.940 | 8.348 | 21.814 | 0.113 |
| db2 | a1 | SARIMA(0,1,1)(0,1,1)12 | 0.976 | 7.362 | 22.307 | 0.134 |
| db2 | a2 | SARIMA(0,1,3)(1,0,0)12 | 0.975 | 7.338 | 15.661 | 0.334 |
| coif1 | a1 | SARIMA(0,1,2)(0,1,1)12 | 0.975 | 7.433 | 22.042 | 0.107 |
| coif1 | a2 | SARIMA(1,1,4)(0,1,1)12 | 0.987 | 6.680 | 17.172 | 0.309 |
| coif1 | a3 | SARIMA(0,1,3)(2,0,1)12 | 0.990 | 6.381 | 12.323 | 0.420 |
Original, pertussis incidence cases time series; db2, Daubechies wavelet; coif1, Coiflets wavelet; a1, a2 and a3, approximate components of the pertussis incidence cases time series generated by Daubechies or Coiflets wavelet; SARIMA, seasonal autoregressive integrated moving average model; R2, determination coefficient; BIC, Bayesian information criterion.
The preferred NAR models’ parameters of various target series.
| Target | Levels | Hidden units | Delays | MSE | R | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Training | Validation | Testing | Training | Validation | Testing | Overall | ||||
| Original | - | 16 | 6 | 3930.094 | 7460.696 | 5977.939 | 0.969 | 0.962 | 0.834 | 0.964 |
| db2 | 1 | 16 | 5 | 104.066 | 404.740 | 96.919 | 0.953 | 0.953 | 0.936 | 0.947 |
| db2 | 2(d1) | 12 | 4 | 106.794 | 310.947 | 602.364 | 0.941 | 0.977 | 0.927 | 0.927 |
| db2 | 2(d2) | 11 | 5 | 41.833 | 218.312 | 1541.972 | 0.998 | 0.982 | 0.936 | 0.988 |
| coif1 | 1 | 12 | 5 | 154.467 | 208.055 | 681.115 | 0.969 | 0.956 | 0.978 | 0.966 |
| coif1 | 2(d1) | 13 | 5 | 141.550 | 256.188 | 170.761 | 0.977 | 0.971 | 0.937 | 0.974 |
| coif1 | 2(d2) | 13 | 5 | 29.498 | 77.925 | 166.755 | 0.995 | 0.970 | 0.900 | 0.991 |
| coif1 | 3(d1) | 14 | 5 | 124.798 | 235.986 | 1627.252 | 0.970 | 0.954 | 0.946 | 0.954 |
| coif1 | 3(d2) | 12 | 5 | 56.408 | 266.313 | 199.657 | 0.990 | 0.958 | 0.938 | 0.982 |
| coif1 | 3(d3) | 15 | 5 | 341.436 | 932.159 | 279.393 | 0.984 | 0.953 | 0.992 | 0.982 |
Original, pertussis incidence cases time series; db2, Daubechies wavelet; coif1, Coiflets wavelet; d1, d2 and d3, detailed components of the pertussis incidence cases time series generated by Daubechies or Coiflets wavelet; MSE, mean square error; R, correlation coefficient.
Performance assessment of various SARIMA-NAR hybrid models based on the DWT.
| Mother wavelet | Levels | Mimic part | Predicted part | ||||
|---|---|---|---|---|---|---|---|
| MAPE | MAE | RMSE | MAPE | MAE | RMSE | ||
| db2 | 1 | 0.109 | 25.342 | 39.744 | 0.303 | 436.416 | 621.934 |
| db2 | 2 | 0.085 | 22.854 | 38.097 | 0.067 | 76.006 | 92.015 |
| coif1 | 1 | 0.135 | 29.659 | 41.363 | 0.493 | 538.029 | 617.032 |
| coif1 | 2 | 0.080 | 19.571 | 28.081 | 0.219 | 345.398 | 621.438 |
| coif1 | 3 | 0.094 | 23.682 | 34.031 | 0.189 | 243.336 | 348.098 |
DWT, discrete wavelet transform; db2, Daubechies wavelet; coif1, Coiflets wavelet; MAPE, mean absolute percentage error; MAE, mean absolute error; RMSE, root mean square error.
Estimated parameters of the SARIMA(0,1,3)(1,0,0)12modelfor the approximation yielded by db2 wavelet.
| Parameters | Coefficient | Standard error | ||
|---|---|---|---|---|
| MA1 | -0.574 | 0.079 | -7.240 | <0.001 |
| MA2 | -0.282 | 0.091 | -3.120 | 0.002 |
| MA3 | -0.339 | 0.081 | -4.200 | <0.001 |
| SAR1 | 0.901 | 0.031 | 28.602 | <0.001 |
MA1,moving average, lag1; MA2, moving average, lag2; MA3, moving average, lag3; SAR1, seasonal moving average, lag1.
Predicting morbidity numbers of pertussis from December 2017 to May 2018 with the selected four models.
| Time | Actual | SARIMA | NAR | Traditional SARIMA-NAR | Wavelet based SARIMA-NAR | ||||
|---|---|---|---|---|---|---|---|---|---|
| Forecasted | AE | Forecasted | AE | Forecasted | AE | Forecasted | AE | ||
| 627 | 957 | 0.459 | 915 | 0.459 | 1287 | 0.513 | 613 | 0.022 | |
| January | 649 | 819 | 0.313 | 446 | 0.313 | 863 | 0.248 | 742 | 0.143 |
| February | 743 | 955 | 0.448 | 410 | 0.448 | 1503 | 0.506 | 783 | 0.054 |
| March | 1602 | 1427 | 0.451 | 880 | 0.451 | 1527 | 0.049 | 1757 | 0.097 |
| April | 1758 | 1267 | 0.577 | 743 | 0.577 | 2130 | 0.175 | 1726 | 0.018 |
| May | 1764 | 1589 | 0.296 | 1242 | 0.296 | 1431 | 0.233 | 1889 | 0.071 |
SARIMA, seasonal autoregressive integrated moving average model; NAR, nonlinear auto-regressive neural network model; Wavelet based SARIMA-NAR, integrating a seasonal autoregressive integrated moving model with a nonlinear autoregressive network model at level 2 of db2 wavelet; AE, absolute error.
Comparison results of in-sample fitting and out-of-sample predicted performance using the selected four models.
| Mimic performance | Forecasted performance | |||||||
|---|---|---|---|---|---|---|---|---|
| MAPE | MAE | RMSE | MSE | MAPE | MAE | RMSE | MSE | |
| SARIMA | 0.178 | 43.169 | 61.245 | 3750.922 | 0.260 | 258.833 | 284.334 | 80845.833 |
| NAR | 0.234 | 47.362 | 66.966 | 4484.444 | 0.424 | 541.106 | 586.197 | 343627.309 |
| Traditional SARIMA-NAR | 0.122 | 26.075 | 40.190 | 1615.269 | 0.475 | 401.989 | 476.637 | 218684.000 |
| Wavelet based SARIMA-NAR | 0.085 | 22.854 | 38.097 | 1451.398 | 0.067 | 76.006 | 92.015 | 8466.756 |
| Novel vs. SARIMA | 52.247 | 47.059 | 37.796 | 61.306 | 74.231 | 70.635 | 67.638 | 89.527 |
| Novel vs. NAR | 63.675 | 51.746 | 43.110 | 67.635 | 84.198 | 85.954 | 84.303 | 97.536 |
| Novel vs. Traditional | 30.328 | 12.353 | 5.208 | 10.145 | 85.895 | 81.093 | 80.695 | 96.128 |
SARIMA, seasonal autoregressive integrated moving average model; NAR, nonlinear auto-regressive neural network model; Wavelet based SARIMA-NAR, integrating a seasonal autoregressive integrated moving model with a nonlinear autoregressive network model at level 2 of db2 wavelet; MAPE, mean absolute percentage error; MAE, mean absolute error; RMSE, root mean square error; MSE, mean square error.