| Literature DB >> 32692691 |
Henry C Cousins1, Clara C Cousins2,3,4, Alon Harris5, Louis R Pasquale5.
Abstract
BACKGROUND: Timely allocation of medical resources for coronavirus disease (COVID-19) requires early detection of regional outbreaks. Internet browsing data may predict case outbreaks in local populations that are yet to be confirmed.Entities:
Keywords: COVID-19; Google Trends; epidemiology; infectious disease; infoveillance; internet activity; public health; surveillance
Mesh:
Year: 2020 PMID: 32692691 PMCID: PMC7394521 DOI: 10.2196/19483
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Characteristics of query topics screened for fit with coronavirus disease (COVID-19) case data.
| Search query category | Unique queriesa, n (%) | Correlation with national case rateb, mean | Associated lag time (days)c, mean | Activity weightingd |
| COVID-19 guidance | 32 (6.9) | 0.96 | 9.1 | 0.38 |
| COVID-19 news | 57 (12.3) | 0.96 | 8.3 | 1.00 |
| COVID-19 symptoms | 91 (19.7) | 0.94 | 8.9 | 0.41 |
| Medical treatments | 34 (7.3) | 0.93 | 10.1 | 0.31 |
| COVID-19 testing | 58 (12.5) | 0.89 | 5.4 | 0.11 |
| Medical care | 33 (7.1) | 0.89 | 7.2 | 0.60 |
| Nonspecific symptoms | 62 (13.4) | 0.89 | 6.8 | 0.57 |
| Economic effects | 28 (6.0) | 0.86 | 5.9 | 0.12 |
| Unrelated to illness | 51 (11.0) | 0.86 | 6.6 | 0.76 |
| Symptoms of other illnesses | 17 (3.7) | 0.84 | 8.3 | 0.77 |
aNumber of queries of each type in the query library (eg, the category “COVID-19 testing” would include the specific query “coronavirus test near me,” and the category “nonspecific symptoms” would include the query “cough”).
bExpressed as the inverse z-transformation of the averaged z-transformed correlations with in-sample national data.
cMean lag time between best-fitting query activity and confirmed case rate, in days.
dRelative mean search activity levels, normalized.
Figure 1Correlation of query predictions with regional coronavirus disease (COVID-19) confirmed case rates. (A) Correlation of predicted case rates with actual case rates for the 50 states. Values are Pearson correlation coefficients. * indicates significance at α=.05; ** at α=.01; *** at α=.005. (B) Root-mean-square error (RMSE) between predicted case rates and actual case rates for the 50 states, in units of daily new cases per 100,000 population. (C) Prediction correlations at the state level do not depend on outbreak timing, as measured by the date of the first confirmed case. Circle size indicates the relative population of the state. Color indicates US census-designated region (blue: Northeast; orange: Midwest; gray: South; green: West). (D) Prediction correlations at the designated market area (DMA) level do not depend on outbreak timing, as measured by the date of the first confirmed case. Circle size indicates the relative population of DMA. Color indicates the US census-designated region, as described. n.s.: not significant.
Figure 2Correlation of query predictions (red) with regional coronavirus disease (COVID-19) case rates (black) at the state and designated market area (DMA) levels, February 20 to April 2, 2020. (A) Comparison of predicted case rates (red) with actual case rates (black) at the state level, with Arizona shown as an example. Dashed lines indicate 95% CIs. (B) Comparison at the DMA level, with the Butte-Bozeman area shown as an example of predictions in a low-population region.