| Literature DB >> 32882867 |
Quynh C Nguyen1, Yuru Huang1, Abhinav Kumar2, Haoshu Duan3, Jessica M Keralis1, Pallavi Dwivedi1, Hsien-Wen Meng1, Kimberly D Brunisholz4, Jonathan Jay5, Mehran Javanmardi6, Tolga Tasdizen6.
Abstract
The spread of COVID-19 is not evenly distributed. Neighborhood environments may structure risks and resources that produce COVID-19 disparities. Neighborhood built environments that allow greater flow of people into an area or impede social distancing practices may increase residents' risk for contracting the virus. We leveraged Google Street View (GSV) images and computer vision to detect built environment features (presence of a crosswalk, non-single family home, single-lane roads, dilapidated building and visible wires). We utilized Poisson regression models to determine associations of built environment characteristics with COVID-19 cases. Indicators of mixed land use (non-single family home), walkability (sidewalks), and physical disorder (dilapidated buildings and visible wires) were connected with higher COVID-19 cases. Indicators of lower urban development (single lane roads and green streets) were connected with fewer COVID-19 cases. Percent black and percent with less than a high school education were associated with more COVID-19 cases. Our findings suggest that built environment characteristics can help characterize community-level COVID-19 risk. Sociodemographic disparities also highlight differential COVID-19 risk across groups of people. Computer vision and big data image sources make national studies of built environment effects on COVID-19 risk possible, to inform local area decision-making.Entities:
Keywords: COVID-19; GIS; big data; built environment; computer vision; machine learning
Mesh:
Year: 2020 PMID: 32882867 PMCID: PMC7504319 DOI: 10.3390/ijerph17176359
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Example processed Google Street View images for green street, presence of crosswalks, and “not single family home” indicators. Predictions were algorithm-derived labels for neighborhood features. “True” labels were manual annotations provided by the research team. (a) presents a residential scene with single family homes, ample street landscaping, and no crosswalks present. (b) presents a mixed-use neighborhood with ample street landscaping, and a crosswalk present.
Descriptive statistics, zip code level.
| Characteristic | Number of Images | Number of Zip Codes | Mean (Standard Deviation) |
|---|---|---|---|
| Google Street View | |||
| Non-single family home | 164,443,190 | 30,556 | 25.62% (21.10) |
| Sidewalks | 164,443,190 | 30,556 | 19.50% (24.31) |
| Crosswalks | 164,443,190 | 30,556 | 1.56% (3.17) |
| Visible wires | 164,443,190 | 30,556 | 44.14% (16.81) |
| Dilapidated building | 164,443,190 | 30,556 | 18.04% (11.40) |
| Single lane road | 164,443,190 | 30,556 | 65.47% (14.31) |
| Green street | 164,443,190 | 30,556 | 87.08% (15.70) |
| COVID-19 outcomes | |||
| Cases per 100,000 | 8171 | 545.86 (1353.86) |
Associations between built environment characteristics and zip code level coronavirus cases, 20 States.
| Characteristic | Rate Ratio (95% CI) | Rate Ratio (95% CI) | Rate Ratio (95% CI) | Rate Ratio (95% CI) | Rate Ratio (95% CI) | Rate Ratio (95% CI) | Rate Ratio (95% CI) |
|---|---|---|---|---|---|---|---|
| GSV indicators | |||||||
| Non-single family home | 1.21 | ||||||
| Sidewalks | 1.40 | ||||||
| Crosswalks | 1.14 | ||||||
| Visible wires | 1.08 | ||||||
| Dilapidated building | 1.03 | ||||||
| Single lane roads | 0.90 | ||||||
| Green streets | 0.96 | ||||||
| Covariates | |||||||
| Household size | 1.03 | 1.02 | 1.03 | 0.99 | 0.98 | 1.00 | 0.98 |
| Median household income | 1.17 | 1.12 | 1.15 | 1.18 | 1.17 | 1.16 | 1.17 |
| Poverty rate | 1.11 | 1.09 | 1.16 | 1.20 | 1.21 | 1.16 | 1.20 |
| % Less than H.S. education | 1.42 | 1.54 | 1.47 | 1.46 | 1.49 | 1.43 | 1.47 |
| Civilian employment | 1.07 | 1.12 | 1.07 | 1.05 | 1.05 | 1.03 | 1.05 |
| % Asian | 1.04 | 0.98 | 1.05 | 1.07 | 1.07 | 1.07 | 1.07 |
| % Black | 1.25 | 1.17 | 1.26 | 1.29 | 1.29 | 1.29 | 1.29 |
| % Hispanic | 1.13 | 1.02 | 1.13 | 1.19 | 1.19 | 1.19 | 1.20 |
| Population density | 1.01 | 1.01 | 1.02 | 1.04 | 1.04 | 1.03 | 1.04 |
| Median age | 1.07 | 1.01 | 1.05 | 1.04 | 1.03 | 1.06 | 1.04 |
| Adjusted R-square | 0.4416 | 0.4818 | 0.4370 | 0.4223 | 0.4202 | 0.4253 | 0.4207 |
All variables were standardized with a mean of zero and a standard deviation of 1. Adjusted Poisson regression controlled for the following zip code level demographics: population density, median age, household income, poverty rate, unemployment, percent with less than a high school education, percent Asian, percent black, percent Hispanic. Log of total population was used as the offset. Zip code coronavirus cases obtained for Arizona, California, Florida, Georgia, Illinois, Maryland, Michigan, Missouri, New York, New Mexico, North Carolina, Ohio, Oklahoma, Pennsylvania, Rhode Island, Texas, Utah, Virginia, Washington, Oregon. N = 7625 zip codes.