| Literature DB >> 30824756 |
Yuzhou Zhang1, Laith Yakob2, Michael B Bonsall3, Wenbiao Hu4.
Abstract
Can early warning systems be developed to predict influenza epidemics? Using Australian influenza surveillance and local internet search query data, this study investigated whether seasonal influenza epidemics in China, the US and the UK can be predicted using empirical time series analysis. Weekly national number of respiratory cases positive for influenza virus infection that were reported to the FluNet surveillance system in Australia, China, the US and the UK were obtained from World Health Organization FluNet surveillance between week 1, 2010, and week 9, 2018. We collected combined search query data for the US and the UK from Google Trends, and for China from Baidu Index. A multivariate seasonal autoregressive integrated moving average model was developed to track influenza epidemics using Australian influenza and local search data. Parameter estimates for this model were generally consistent with the observed values. The inclusion of search metrics improved the performance of the model with high correlation coefficients (China = 0.96, the US = 0.97, the UK = 0.96, p < 0.01) and low Maximum Absolute Percent Error (MAPE) values (China = 16.76, the US = 96.97, the UK = 125.42). This study demonstrates the feasibility of combining (Australia) influenza and local search query data to predict influenza epidemics a different (northern hemisphere) scales.Entities:
Year: 2019 PMID: 30824756 PMCID: PMC6397245 DOI: 10.1038/s41598-019-39871-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The trends (upper panel) and systematic seasonal factors (lower panel) of positive influenza virological number of Australia, China, the US and the UK between week 1 2010 and week 9 2018. (X axis: date (week), Systematic seasonal factors were generated by seasonal decomposition procedures).
Figure 2Cross-correlation between Australian influenza surveillance with Chinese, the US and the UK influenza surveillance data. Confidence intervals (95%) are indicated by the black lines (X axis: lag value, Y axis: CCF value).
The goodness-of –fit results of SARIMA models.
| China (1,1,1) (1,0,2) | US (2,2,2) (2,0,0) | UK (3,0,2) (1,0,0) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| R2 | BIC | RMSE | R2 | BIC | RMSE | R2 | BIC | RMSE | |
| Model 1 | 92.80 | 13.17 | 301.55 | 94.20 | 14.33 | 557.46 | 91.60 | 12.27 | 139.23 |
| Model 2 | 94.10 | 11.42 | 292.37 | 96.70 | 12.62 | 530.75 | 93.30 | 9.73 | 124.99 |
| Model 3 | 93.90 | 11.87 | 297.41 | 96.40 | 12.94 | 536.18 | 92.90 | 10.06 | 128.37 |
| Model 4 | 94.40 | 11.12 | 245.18 | 96.80 | 12.16 | 405.13 | 93.90 | 9.52 | 111.95 |
Model 1: Australian influenza data and local search data excluded model; Model 2: Australian influenza data included model; Model 3: Local search data included model; Model 4: Australian influenza data and local search data included model.
Parameters estimates (and significance testing) associated with the SARIMA models for China, the US and the UK.
| Parameters | Coefficients | Standard error | t | P value | |
|---|---|---|---|---|---|
| China | AR | 0.75 | 0.10 | 7.68 | 0.000 |
| MA | 0.52 | 0.13 | 4.12 | 0.000 | |
| SAR | 0.71 | 0.32 | 2.21 | 0.028 | |
| SMA | 0.73 | 0.33 | 2.21 | 0.028 | |
| Search | 0.16 | 0.04 | 4.57 | 0.001 | |
| Influenza | 0.01 | 0.01 | 2.17 | 0.031 | |
| The US | AR | 1.65 | 0.04 | 40.91 | 0.000 |
| MA | 1.99 | 0.02 | 81.24 | 0.000 | |
| SAR | 0.04 | 0.07 | 0.52 | 0.602 | |
| Search | 3.24 | 0.38 | 8.44 | 0.000 | |
| Influenza | 0.22 | 0.07 | 3.12 | 0.002 | |
| The UK | AR | 0.95 | 0.02 | 54.60 | 0.000 |
| MA | 0.85 | 0.08 | 6.13 | 0.000 | |
| SAR | 0.27 | 0.06 | 4.57 | 0.001 | |
| Search | 0.10 | 0.02 | 8.47 | 0.000 | |
| Influenza | 0.01 | 0.01 | 2.33 | 0.020 |
AR: autoregressive, MA: moving average, SAR: seasonal autoregressive, SMA: seasonal moving average, Search: local internet search metrics, Influenza: Australian influenza infection.
Figure 3Weekly observed and 1-week ahead predicted positive influenza virological number using SARIMA model in China, the US and the UK from week 1, 2015 to week 9, 2018 (X axis: date (week), Y axis: positive influenza virological number, LCL: the lower control limit, UCL: the upper control limit).