| Literature DB >> 28874880 |
Yuzhou Zhang1, Gabriel Milinovich1, Zhiwei Xu1, Hilary Bambrick1, Kerrie Mengersen2, Shilu Tong1,3,4, Wenbiao Hu5.
Abstract
This study aims to assess the utility of internet search query analysis in pertussis surveillance. This study uses an empirical time series model based on internet search metrics to detect the pertussis incidence in Australia. Our research demonstrates a clear seasonal pattern of both pertussis infections and Google Trends (GT) with specific search terms in time series seasonal decomposition analysis. The cross-correlation function showed significant correlations between GT and pertussis incidences in Australia and each state at the lag of 0 and 1 months, with the variation of correlations between 0.17 and 0.76 (p < 0.05). A multivariate seasonal autoregressive integrated moving average (SARIMA) model was developed to track pertussis epidemics pattern using GT data. Reflected values for this model were generally consistent with the observed values. The inclusion of GT metrics improved detective performance of the model (β = 0.058, p < 0.001). The validation analysis indicated that the overall agreement was 81% (sensitivity: 77% and specificity: 83%). This study demonstrates the feasibility of using internet search metrics for the detection of pertussis epidemics in real-time, which can be considered as a pre-requisite for constructing early warning systems for pertussis surveillance using internet search metrics.Entities:
Mesh:
Year: 2017 PMID: 28874880 PMCID: PMC5585203 DOI: 10.1038/s41598-017-11195-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Systematic seasonal variations and trends for pertussis incidence rates and GT at national level from 2004 to 2015.
Figure 2Coloured bars show time-series cross correlation results for pertussis incidence rates with GT metrics (2004–15) in Australia. Blue bars indicate the value of search term pertussis, the values of search queries whooping and whooping cough are indicated by green bars and grey bars separately. Confidence intervals (95%) are indicated by the solid red lines (X axis: lag value, Y axis: CCF value).
Parameters estimates and their testing results of the SARIMA (2,0,2) (1,0,0) model.
| Parameters | Coefficients | Standard error | t | P value |
|---|---|---|---|---|
| AR | 0.91 | 0.23 | 4.01 | 0.000 |
| SAR | 0.31 | 0.11 | 2.95 | 0.004 |
| GT | 0.06 | 0.01 | 5.66 | 0.000 |
SARIMA: Seasonal Autoregressive Integrated Moving Average Model, AR: autoregressive, GT: Google Trends metrics for search term whooping cough.
Figure 3(A) Observed and fitted value of the SARIMA models from 2004 to 2012. UCL and LCL are presented for the GT included model. (B) Reflected pertussis incidence rates between 2013 and 2015 based on the SARIMA model.