| Literature DB >> 32489174 |
Paul P Schneider1,2, Christel Jaw van Gool3, Peter Spreeuwenberg1, Mariëtte Hooiveld1, Gé A Donker1, David J Barnett4, John Paget1.
Abstract
BackgroundDespite the early development of Google Flu Trends in 2009, standards for digital epidemiology methods have not been established and research from European countries is scarce.AimIn this article, we study the use of web search queries to monitor influenza-like illness (ILI) rates in the Netherlands in real time.MethodsIn this retrospective analysis, we simulated the weekly use of a prediction model for estimating the then-current ILI incidence across the 2017/18 influenza season solely based on Google search query data. We used weekly ILI data as reported to The European Surveillance System (TESSY) each week, and we removed the then-last 4 weeks from our dataset. We then fitted a prediction model based on the then-most-recent search query data from Google Trends to fill the 4-week gap ('Nowcasting'). Lasso regression, in combination with cross-validation, was applied to select predictors and to fit the 52 models, one for each week of the season.ResultsThe models provided accurate predictions with a mean and maximum absolute error of 1.40 (95% confidence interval: 1.09-1.75) and 6.36 per 10,000 population. The onset, peak and end of the epidemic were predicted with an error of 1, 3 and 2 weeks, respectively. The number of search terms retained as predictors ranged from three to five, with one keyword, 'griep' ('flu'), having the most weight in all models.DiscussionThis study demonstrates the feasibility of accurate, real-time ILI incidence predictions in the Netherlands using Google search query data.Entities:
Keywords: digital epidemiology; infectious diseases; influenza-like illness; machine learning; surveillance
Mesh:
Year: 2020 PMID: 32489174 PMCID: PMC7268271 DOI: 10.2807/1560-7917.ES.2020.25.21.1900221
Source DB: PubMed Journal: Euro Surveill ISSN: 1025-496X
Dutch search terms retrieved from Google Trends and their English translation, Netherlands, 17 August 2013–4 August 2018 (n = 26 search terms)
| Dutch search term | English translation |
|---|---|
| Griep | Flu |
| Symptomen | Symptoms |
| Symptomen griep | Symptoms flu |
| De griep | The flu |
| Griep 2018a | Flu 2018a |
| Griep koorts | Flu fever |
| Griep 2016a | Flu 2016a |
| Koorts | Fever |
| Tegen griep | Against flu |
| Griep hoe lang | Flu how long |
| Griep 2015a | Flu 2015a |
| Griep 2017a | Flu 2017a |
| Griep heerst | Flu going around |
| Symptomen griep 2018a | Symptoms flu 2018a |
| Verkoudheid | Common cold |
| Hoe lang duurt griep? | How long does flu last? |
| Ziek | Ill |
| Griep 2014a | Flu 2014a |
| Griep hoofdpijn | Flu headache |
| Griep wat te doen | Flu what to do |
| Heerst er griep | Is there flu going around |
| Griep zwanger | Flu pregnant |
| Verschijnselen griep | Symptoms flu |
| Griep kind | Flu child |
| Griep spierpijn | Flu muscle strain |
| Griep hoesten | Flu cough |
a Terms with a year in them were removed from further analysis because they were unlikely to be useful predictors in any other year.
Figure 1Bivariate associations between Google search terms and influenza-like illness incidence in the training dataset, Netherlands, weeks 33/2013–30/2017 (n = 20 search terms)
Figure 2Time series plot showing observed influenza-like illness incidence against predictions of 52 final lasso regression models, weeks 31/2017–31/2018 (A) and overview of training and validation, weeks 33/2013–31/2018 (B), Netherlands
Figure 3Observed vs predicted influenza-like illness incidence at five time points (A–E), Netherlands, influenza season 2017/18
Figure 4Predictors retained in the final lasso regression models throughout the 52 iterations, Netherlands, weeks 31/2017–31/2018