| Literature DB >> 34825092 |
Dominik Nann1, Mark Walker2, Leonie Frauenfeld1, Tamás Ferenci3,4, Mihály Sulyok1,5.
Abstract
BACKGROUND: Alternative methods could be used to enhance the monitoring and forecasting of re-emerging conditions such as pertussis. Here, whether data on the volume of Internet searching on pertussis could complement traditional modeling based solely on reported case numbers was assessed.Entities:
Keywords: ARIMA; Forecasting; Infodemiology; Pertussis; Surveillance
Year: 2021 PMID: 34825092 PMCID: PMC8605298 DOI: 10.1016/j.heliyon.2021.e08386
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Figure 1Weekly pertussis case numbers and Google Trends data; weekly pertussis case numbers are shown with a blue curve; relative Google Trends search volumes are shown with a red curve. LOESS smoothers representing trend lines with 95% CI-s (shown with the same colour).
Figure 2STL time-series decomposition of weekly pertussis case numbers; trend, seasonal and random components of the reported case numbers are shown.
Figure 3Autocorrelation (ACF) and partial autocorrelation function (PACF) plots of the reported weekly pertussis case numbers. Upper left: Undifferentiated ACF; Upper right: Undifferentiated PACF; Bottom left: Differentiated ACF; Bottom right: Differentiated PACF.
Summary of the model using case numbers only, compared with the GTD-enhanced model.
| Model without GTD | Model with GTD | |
|---|---|---|
| Model formula | ARIMA (1,1,3) (0,1,1)[52] | ARIMA (1,1,3) (0,1,1)[52] with errors |
| AICc | 1750.98 | 1746.73 |
| Training set RMSE | 57.55 | 54.86 |
| Validation set RMSE (for 52 weeks; for 2 weeks) | 192.65; 207.8 | 144.22, 201.78 |
| Validation set MAPE (for 52 weeks; for 2 weeks) | 58.59, 72.1 | 43.86, 68.54 |
| Validation set MAE (for 52 weeks; for 2 weeks) | 169.53, 190.53 | 124.46, 178.96 |
Figure 4Forecasting of the optimal SARIMA model with traditional data (top); the Google Trends Data extended SARIMA (middle), and both models (bottom) compared to the reported weekly Pertussis case numbers (black curve). Shaded areas illustrate +/- 80 and 95% prediction error bounds.
Figure 5Evaluation on a rolling forecast origin-comparison of the SARIMA models. Note the peak in errors caused by an outlier at the beginning of 2017, and favorable effect of smoothing (less distance from the horizontal 0 error line).