| Literature DB >> 34273513 |
Atina Husnayain1, Ting-Wu Chuang2, Anis Fuad3, Emily Chia-Yu Su4.
Abstract
OBJECTIVE: Incorporating spatial analyses and online health information queries may be beneficial in understanding the role of Google relative search volume (RSV) data as a secondary public health surveillance tool during pandemics. This study identified coronavirus disease 2019 (COVID-19) clustering and defined the predictability performance of Google RSV models in clustered and non-clustered areas of the USA.Entities:
Keywords: COVID-19; Google Trends; Infodemiology; Predictability performance; Spatial analysis; United States
Mesh:
Year: 2021 PMID: 34273513 PMCID: PMC8922685 DOI: 10.1016/j.ijid.2021.07.031
Source DB: PubMed Journal: Int J Infect Dis ISSN: 1201-9712 Impact factor: 3.623
Dataset description.
| Dataset | Data description | Data unit | Source | Utilization |
|---|---|---|---|---|
| Case data | Cumulative daily cases New daily cases | County-level data State-level data | Hot spot analysis State-level correlation and prediction analysis | |
| Spatial data | Spatial features and population numbers (of 48 contiguous states and the District of Columbia) | County-level data | Spatial analysis and visualization | |
| Google RSV data | Google RSV data ranged from 0∼100 | Country- and state-level data | Country- and state-level correlations, state-level prediction analysis | |
| Mobility data | Changes in time spent in six categorized places (retail and recreation, grocery and pharmacy, parks, transit stations, workplaces and residential) compared with baseline days (median value from 3 January to 6 February 2020) | State-level data | State-level prediction analysis |
RSC, relative search volume.
Monthly incidence rates of coronavirus disease 2019 in the USA.
| Month, 2020 | Country-level incidence rate | Counties with the highest incidence rate (state) | County-level incidence rate |
|---|---|---|---|
| January | 0.002 | Suffolk (MA) | 0.129 |
| Santa Clara (CA) | 0.051 | ||
| King (WA) | 0.046 | ||
| Cook (IL) | 0.038 | ||
| Orange (CA) | 0.031 | ||
| February | 0.005 | San Benito (CA) | 3.370 |
| Humboldt (CA) | 0.720 | ||
| King (WA) | 0.231 | ||
| Washington (OR) | 0.170 | ||
| Sacramento (CA) | 0.132 | ||
| March | 57.110 | Westchester (NY) | 1011.124 |
| Blaine (ID) | 886.508 | ||
| Rockland (NY) | 872.079 | ||
| Nassau (NY) | 624.511 | ||
| Richmond (NY) | 596.974 | ||
| April | 271.981 | Lincoln (AR) | 5764.018 |
| Bledsoe (TN) | 3951.936 | ||
| Nobles (MN) | 3403.514 | ||
| Marion (OH) | 3324.470 | ||
| Dakota (NE) | 3081.114 | ||
| May | 218.057 | Trousdale (TN) | 15,385.550 |
| Colfax (NE) | 5159.589 | ||
| Dakota (NE) | 4729.698 | ||
| Lake (TN) | 4616.770 | ||
| Buena Vista (IA) | 3776.787 | ||
| June | 253.575 | Lee (AR) | 6528.712 |
| Buena Vista (IA) | 4421.604 | ||
| East Carroll (LA) | 3797.139 | ||
| Lake (TN) | 3549.383 | ||
| Chattahoochee (GA) | 3132.424 | ||
| July | 581.399 | La Salle (TX) | 4373.808 |
| Madison (TX) | 4331.901 | ||
| Crockett (TX) | 3847.181 | ||
| Chicot (AR) | 3320.243 | ||
| Columbia (FL) | 3153.315 | ||
| August | 440.344 | Lafayette (FL) | 13,001.420 |
| Wayne (TN) | 6234.385 | ||
| Issaquena (MS) | 5811.321 | ||
| Chattahoochee (GA) | 4909.811 | ||
| Chicot (AR) | 4069.975 | ||
| September | 362.574 | Emmons (ND) | 4612.707 |
| Woodward (OK) | 4565.296 | ||
| Chattahoochee (GA) | 4522.657 | ||
| Rosebud (MT) | 4075.067 | ||
| Pawnee (KS) | 3717.633 | ||
| October | 577.555 | Bon Homme (SD) | 12,994.680 |
| Norton (KS) | 12,112.020 | ||
| Sheridan (KS) | 6856.455 | ||
| Faulk (SD) | 6250.000 | ||
| Buffalo (SD) | 6200.787 | ||
| November | 1351.011 | Crowley (CO) | 20,244.420 |
| Lee (KY) | 10,434.420 | ||
| Childress (TX) | 10,114.060 | ||
| Foster (ND) | 8712.459 | ||
| Jones (IA) | 8642.276 | ||
| December | 1926.729 | Bent (CO) | 14,336.190 |
| Lincoln (CO) | 11,687.610 | ||
| Pershing (NV) | 11,075.760 | ||
| Alfalfa (OK) | 9086.337 | ||
| Lassen (CA) | 8743.610 |
Incidence rate per 100,000 population.
Results of the global spatial autocorrelation test.
| Month, 2020 | Observed General G | z-score | Result | |
|---|---|---|---|---|
| January | 0.002 | 0.659 | 0.510 | Random |
| February | 0.011 | 3.247 | 0.001 | Clustered |
| March | 0.002 | 49.168 | <0.001 | |
| April | 0.001 | 30.914 | <0.001 | |
| May | 0.001 | 14.150 | <0.001 | |
| June | 0.001 | 28.119 | <0.001 | |
| July | 0.001 | 51.984 | <0.001 | |
| August | 0.000 | 36.725 | <0.001 | |
| September | 0.000 | 34.670 | <0.001 | |
| October | 0.000 | 49.418 | <0.001 | |
| November | 0.000 | 49.154 | <0.001 | |
| December | 0.000 | 27.123 | <0.001 |
Figure 1Distribution of coronavirus disease 2019 hot and cold spots in the USA.
Correlations between new daily cases of coronavirus disease 2019 (COVID-19) and Google relative search volumes of county-level data in the USA.
| Term | Month, 2020 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Jan | Feb | Mar | Apr | May | June | July | Aug | Sept | Oct | Nov | Dec | |
| A | 0.568 | 0.497 | 0.476 | 0.909 | 0.547 | |||||||
| B | 0.698 | 0.848 | 0.442 | 0.401 | 0.907 | 0.543 | 0.539 | |||||
| C | 0.568 | 0.512 | 0.484 | 0.897 | 0.568 | -0.386 | 0.433 | |||||
| D | 0.831 | 0.945 | 0.429 | 0.370 | 0.910 | 0.478 | 0.524 | 0.376 | ||||
| E | 0.671 | 0.714 | 0.471 | 0.874 | 0.430 | 0.615 | ||||||
| F | 0.568 | 0.512 | 0.458 | 0.902 | 0.522 | 0.422 | ||||||
| G | 0.768 | 0.902 | 0.408 | 0.405 | ||||||||
| H | 0.773 | -0.460 | 0.929 | 0.531 | 0.509 | 0.747 | ||||||
| A: | Coronavirus (virus) | Weak correlation ( | ||||||||||
| B: | ‘Coronavirus disease 2019’ (disease) | Moderate correlation ( | ||||||||||
| C: | coronavirus (search term) | Strong correlation ( | ||||||||||
| D: | covid (search term) | All reported correlations were significant at | ||||||||||
| E: | covid-19 (search term) | |||||||||||
| F: | coronavirus + ‘coronavirus update’ + ‘coronavirus | |||||||||||
| G: | symptoms’ (search terms) | |||||||||||
| H: | ‘covid symptoms’ (search term) ‘covid testing’ (search term) | |||||||||||
Correlations between new daily cases of coronavirus disease 2019 and Google relative search volumes of state-level data in the USA.
| State | Month, 2020 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Jan | Feb | Mar | Apr | May | June | July | Aug | Sept | Oct | Nov | Dec | |
| Alabama | 0.443 | |||||||||||
| Arizona | 0.567 | 0.557 | 0.440 | |||||||||
| Arkansas | 0.461 | 0.374 | ||||||||||
| California | 0.704 | 0.577 | 0.525 | 0.421 | ||||||||
| Colorado | 0.590 | -0.470 | 0.462 | |||||||||
| Connecticut | 0.427 | |||||||||||
| Delaware | 0.480 | -0.369 | ||||||||||
| Florida | 0.746 | 0.592 | ||||||||||
| Georgia | 0.461 | 0.442 | ||||||||||
| Idaho | 0.383 | |||||||||||
| Illinois | 0.716 | 0.448 | ||||||||||
| Indiana | 0.430 | 0.449 | ||||||||||
| Iowa | ||||||||||||
| Kansas | 0.445 | |||||||||||
| Kentucky | 0.383 | |||||||||||
| Louisiana | 0.547 | |||||||||||
| Maine | ||||||||||||
| Maryland | 0.668 | |||||||||||
| Massachusetts | 0.405 | 0.427 | ||||||||||
| Michigan | 0.502 | -0.361 | 0.541 | |||||||||
| Minnesota | ||||||||||||
| Mississippi | 0.572 | |||||||||||
| Missouri | 0.443 | |||||||||||
| Montana | ||||||||||||
| Nebraska | 0.402 | |||||||||||
| Nevada | 0.612 | |||||||||||
| New Hampshire | ||||||||||||
| New Jersey | 0.435 | 0.509 | ||||||||||
| New Mexico | ||||||||||||
| New York | 0.753 | 0.590 | ||||||||||
| North Carolina | 0.480 | 0.366 | -0.383 | |||||||||
| North Dakota | ||||||||||||
| Ohio | 0.544 | 0.418 | 0.409 | 0.525 | ||||||||
| Oklahoma | -0.377 | 0.649 | ||||||||||
| Oregon | -0.463 | 0.385 | ||||||||||
| Pennsylvania | 0.643 | 0.373 | 0.532 | |||||||||
| Rhode Island | ||||||||||||
| South Carolina | 0.740 | |||||||||||
| South Dakota | ||||||||||||
| Tennessee | 0.470 | |||||||||||
| Texas | 0.756 | 0.769 | 0.400 | 0.377 | 0.401 | |||||||
| Utah | 0.570 | |||||||||||
| Vermont | ||||||||||||
| Virginia | 0.500 | 0.699 | ||||||||||
| Washington | 0.408 | |||||||||||
| West Virginia | 0.416 | |||||||||||
| Wisconsin | 0.392 | 0.380 | 0.416 | |||||||||
| Wyoming | -0.365 | |||||||||||
| District of Columbia | -0.358 | -0.436 | ||||||||||
| Number of states with a significant correlation [ | 0 (0.000) | 0 (0.000) | 19 (38.776) | 2 (4.082) | 3 (6.122) | 11 (22.449) | 5 (10.204) | 5 (10.204) | 3 (6.122) | 9 (18.367) | 13 (26.531) | 4 (8.163) |
| Number of states with clustered counties | 1 (2.041) | 17 (34.694) | 35 (71.429) | 30 (61.224) | 36 (73.469) | 49 (100) | 44 (89.796) | 48 (97.959) | 45 (91.837) | 46 (93.878) | 48 (97.959) | |
| Number of states with a significant correlation and clustered counties | 0 (0.000) | 6 (12.245) | 2 (4.082) | 3 (6.122) | 10 (20.408) | 5 (10.204) | 4 (8.163) | 3 (6.122) | 8 (16.327) | 12 (24.490) | 4 (8.163) | |
| Note: | Hot spot areas. | |||||||||||
| Cold spot areas. | ||||||||||||
| Hot and cold spot areas. | ||||||||||||
| Non-significant areas at | ||||||||||||
States with hot spot counties, cold spot counties and both.
Figure 2Correlations between new daily cases of coronavirus disease 2019 and Google relative search volumes in clustered and unclustered areas in the USA.
Figure 3Time series of new daily cases of coronavirus disease 2019 (COVID-19) per 100,000 population in Florida, Illinois and Maryland.
Performance of the Google relative search volume (RSV) models in strongly correlated areas (with Spearman's rank correlation coefficients of ≥0.7).
| Model | Coef. (95% CI) | RMSE | AIC | BIC | |
|---|---|---|---|---|---|
| February | |||||
| California | |||||
| Intercept | 3.582 (3.518∼3.647) | <0.001 | 81.942 | 1244.475 | 1248.777 |
| Google RSVs | 0.056 (0.029∼0.083) | <0.001 | |||
| Mobility (transit stations) | -0.063 (-0.065∼-0.063) | <0.001 | |||
| Florida | |||||
| Intercept | 3.673 (3.611∼3.734) | <0.001 | 61.920 | 1295.194 | 1299.496 |
| Google RSVs | -0.235 (-0.284∼-0.185) | <0.001 | |||
| Mobility (transit stations) | -0.072 (-0.074∼-0.070) | <0.001 | |||
| Illinois | |||||
| Intercept | 3.822 (3.764∼3.880) | <0.001 | 95.865 | 2010.341 | 2014.643 |
| Google RSVs | 0.193 (0.151∼0.235) | <0.001 | |||
| Mobility (transit stations) | -0.050 (-0.051∼-0.048) | <0.001 | |||
| New York | |||||
| Intercept | 6.259 (6.242∼6.276) | <0.001 | 1629.921 | 27386.572 | 27390.874 |
| Google RSVs | -0.152 (-0.160∼-0.145) | <0.001 | |||
| Mobility (transit stations) | -0.055 (-0.056∼-0.055) | <0.001 | |||
| Texas | |||||
| Intercept | 2.665 (2.565∼2.766) | <0.001 | 84.144 | 1367.539 | 1371.841 |
| Google RSVs | 0.325 (0.285∼0.366) | <0.001 | |||
| Mobility (workplaces) | -0.086 (-0.089∼-0.082) | <0.001 | |||
| June | |||||
| South Carolina | |||||
| Intercept | 5.876 (5.839∼5.914) | <0.001 | 294.305 | 3182.487 | 3186.691 |
| Google RSVs | 0.030 (0.028∼0.032) | <0.001 | |||
| Mobility (parks) | 0.012 (0.011∼0.013) | <0.001 | |||
| Texas | |||||
| Intercept | 10.458 (10.401∼10.514) | <0.001 | 961.395 | 8381.994 | 8386.198 |
| Google RSVs | 0.020 (0.019∼0.020) | <0.001 | |||
| Mobility (transit stations) | 0.101 (0.099∼0.103) | <0.001 |
Coef., coefficient; RMSE, root mean squared error; AIC, Akaike information criterion; BIC, Bayesian information criterion.
Non-significant areas.
Hot spot areas.
Hot and cold spot areas.