| Literature DB >> 32335404 |
Abolfazl Mollalo1, Behzad Vahedi2, Kiara M Rivera3.
Abstract
During the first 90 days of the COVID-19 outbreak in the United States, over 675,000 confirmed cases of the disease have been reported, posing unprecedented socioeconomic burden to the country. Due to inadequate research on geographic modeling of COVID-19, we investigated county-level variations of disease incidence across the continental United States. We compiled a geodatabase of 35 environmental, socioeconomic, topographic, and demographic variables that could explain the spatial variability of disease incidence. Further, we employed spatial lag and spatial error models to investigate spatial dependence and geographically weighted regression (GWR) and multiscale GWR (MGWR) models to locally examine spatial non-stationarity. The results suggested that even though incorporating spatial autocorrelation could significantly improve the performance of the global ordinary least square model, these models still represent a significantly poor performance compared to the local models. Moreover, MGWR could explain the highest variations (adj. R2: 68.1%) with the lowest AICc compared to the others. Mapping the effects of significant explanatory variables (i.e., income inequality, median household income, the proportion of black females, and the proportion of nurse practitioners) on spatial variability of COVID-19 incidence rates using MGWR could provide useful insights to policymakers for targeted interventions.Entities:
Keywords: COVID-19; GIS; Multiscale GWR; Spatial non-stationarity
Mesh:
Year: 2020 PMID: 32335404 PMCID: PMC7175907 DOI: 10.1016/j.scitotenv.2020.138884
Source DB: PubMed Journal: Sci Total Environ ISSN: 0048-9697 Impact factor: 7.963
Explanatory variables used in this study together with definitions and sources.
| Theme | Variable Name | Description | Source |
|---|---|---|---|
| Socioeconomic | (1) Median household income | (2) The ratio of household income at the 80th percentile to income at the 20th percentile (2018) | (1–2) Small Area Income and Poverty Estimates, American Community Survey, five-year Estimates |
| Behavioral | Adult smoking | Percentage of adults that reported currently smoking (2018) | Behavioral Risk Factor Surveillance System (BRFSS) |
| Environmental | (1) Road density | (1) The total length of primary and secondary roads for each county calculated/area of the corresponding county | (1) US Census Bureau TIGER/Line |
| Topographic | (1) Minimum, maximum, and average | (1) Digital elevation model of the United States (1 km spatial resolution) | United States Geological Survey (USGS) |
| Demographic | (1) Percent of 65 years and over | *Assumed proportion to the fraction of state population living in the county | (1–7) US Census Bureau Population Estimates (2018) |
Summary statistics of the OLS model on selected variables in modeling COVID-19 incidence rates, continental United States.
| Variable | Coefficient | T-statistic | P-value | VIF |
|---|---|---|---|---|
| Intercept | 0.0007 | 0.0397 | 0.968338 | – |
| Income inequality | 0.2021 | 9.9015 | 0.000000* | 1.4657 |
| Median household income | 0.2449 | 12.2474 | 0.000000* | 1.4066 |
| % of nurse practitioner | 0.1365 | 7.4003 | 0.000000* | 1.1963 |
| % of black females | 0.1095 | 5.7726 | 0.000000* | 1.2667 |
Summary statistics of SLM and SEM in modeling COVID-19 incidence rates, continental United States.
| Variable | Coefficient | Std. error | P-value | |||||
|---|---|---|---|---|---|---|---|---|
| SLM | SEM | SLM | SEM | SLM | SEM | SLM | SEM | |
| Intercept | −0.002 | −0.003 | 0.016 | 0.027 | −0.134 | −0.098 | 0.893 | 0.922 |
| Income inequality | 0.172 | 0.189 | 0.019 | 0.021 | 8.98 | 9.158 | 0.000 | 0.000 |
| Median household income | 0.183 | 0.237 | 0.019 | 0.023 | 9.58 | 10.396 | 0.000 | 0.000 |
| % of nurse practitioner | 0.078 | 0.066 | 0.017 | 0.019 | 4.54 | 3.446 | 0.000 | 0.001 |
| % of black females | 0.064 | 0.123 | 0.018 | 0.0251 | 3.57 | 4.905 | 0.000 | 0.000 |
| Rho | 0.0402 | – | 0.024 | – | 16.99 | – | 0.000 | – |
| Lambda | – | 0.415 | – | 0.024 | – | 17.099 | – | 0.000 |
Measures of goodness-of-fit for OLS, SEM, SLM, GWR, and MGWR in modeling COVID-19 incidence rate, continental United States.
| Criterion | OLS | SEM | SLM | GWR | MGWR |
|---|---|---|---|---|---|
| Adj. R2 | 0.127 | 0.238 | 0.242 | 0.674 | 0.681 |
| AICc | 8304.98 | 8063.52 | 8045.70 | 6134.19 | `5796.53 |
Fig. 1The effects of median household income (above) and income inequality (below) in describing COVID-19 incidence rates using GWR (left) and MGWR (right) models, continental United States.
Fig. 2The effects of % of nurse practitioners (above) and % of black females (below) in describing COVID-19 incidence rates using GWR (left) and MGWR (right), continental United States.
Fig. 3Geographic distribution of local R2 of GWR and MGWR models for COVID-19 incidence rate associated with income inequality, median household income, % of nurse practitioners, and % of black females across the continental United States.