Kenichiro Sato1, Tatsuo Mano2, Atsushi Iwata3,4, Tatsushi Toda2. 1. Department of Neurology, Graduate School of Medicine, University of Tokyo, Tokyo, Japan. kenisatou-tky@umin.ac.jp. 2. Department of Neurology, Graduate School of Medicine, University of Tokyo, Tokyo, Japan. 3. Department of Neurology, Graduate School of Medicine, University of Tokyo, Tokyo, Japan. iwata@m.u-tokyo.ac.jp. 4. Department of Neurology, Tokyo Metropolitan Geriatric Medical Center Hospital, Tokyo, Japan. iwata@m.u-tokyo.ac.jp.
Abstract
BACKGROUND: Google Trends (GT) is being used as an epidemiological tool to study coronavirus disease (COVID-19) by identifying keywords in search trends that are predictive for the COVID-19 epidemiological burden. However, many of the earlier GT-based studies include potential statistical fallacies by measuring the correlation between non-stationary time sequences without adjusting for multiple comparisons or the confounding of media coverage, leading to concerns about the increased risk of obtaining false-positive results. In this study, we aimed to apply statistically more favorable methods to validate the earlier GT-based COVID-19 study results. METHODS: We extracted the relative GT search volume for keywords associated with COVID-19 symptoms, and evaluated their Granger-causality to weekly COVID-19 positivity in eight English-speaking countries and Japan. In addition, the impact of media coverage on keywords with significant Granger-causality was further evaluated using Japanese regional data. RESULTS: Our Granger causality-based approach largely decreased (by up to approximately one-third) the number of keywords identified as having a significant temporal relationship with the COVID-19 trend when compared to those identified by Pearson or Spearman's rank correlation-based approach. "Sense of smell" and "loss of smell" were the most reliable GT keywords across all the evaluated countries; however, when adjusted with their media coverage, these keyword trends did not Granger-cause the COVID-19 positivity trends (in Japan). CONCLUSIONS: Our results suggest that some of the search keywords reported as candidate predictive measures in earlier GT-based COVID-19 studies may potentially be unreliable; therefore, caution is necessary when interpreting published GT-based study results.
BACKGROUND: Google Trends (GT) is being used as an epidemiological tool to study coronavirus disease (COVID-19) by identifying keywords in search trends that are predictive for the COVID-19 epidemiological burden. However, many of the earlier GT-based studies include potential statistical fallacies by measuring the correlation between non-stationary time sequences without adjusting for multiple comparisons or the confounding of media coverage, leading to concerns about the increased risk of obtaining false-positive results. In this study, we aimed to apply statistically more favorable methods to validate the earlier GT-based COVID-19 study results. METHODS: We extracted the relative GT search volume for keywords associated with COVID-19 symptoms, and evaluated their Granger-causality to weekly COVID-19 positivity in eight English-speaking countries and Japan. In addition, the impact of media coverage on keywords with significant Granger-causality was further evaluated using Japanese regional data. RESULTS: Our Granger causality-based approach largely decreased (by up to approximately one-third) the number of keywords identified as having a significant temporal relationship with the COVID-19 trend when compared to those identified by Pearson or Spearman's rank correlation-based approach. "Sense of smell" and "loss of smell" were the most reliable GT keywords across all the evaluated countries; however, when adjusted with their media coverage, these keyword trends did not Granger-cause the COVID-19 positivity trends (in Japan). CONCLUSIONS: Our results suggest that some of the search keywords reported as candidate predictive measures in earlier GT-based COVID-19 studies may potentially be unreliable; therefore, caution is necessary when interpreting published GT-based study results.
Authors: Kory D Johnson; Mathias Beiglböck; Manuel Eder; Annemarie Grass; Joachim Hermisson; Gudmund Pammer; Jitka Polechová; Daniel Toneian; Benjamin Wölfl Journal: Infect Dis Model Date: 2021-04-02
Authors: Michael S Deiner; Gurbani Kaur; Stephen D McLeod; Julie M Schallhorn; James Chodosh; Daniel H Hwang; Thomas M Lietman; Travis C Porco Journal: J Med Internet Res Date: 2022-07-05 Impact factor: 7.076
Authors: Alexander Fulk; Daniel Romero-Alvarez; Qays Abu-Saymeh; Jarron M Saint Onge; A Townsend Peterson; Folashade B Agusto Journal: PLoS One Date: 2022-06-07 Impact factor: 3.752
Authors: Milla Mukka; Samuli Pesälä; Charlotte Hammer; Pekka Mustonen; Vesa Jormanainen; Hanna Pelttari; Minna Kaila; Otto Helve Journal: JMIR Public Health Surveill Date: 2021-12-07