| Literature DB >> 36231394 |
Xiaohe Yue1, Anne Antonietti2, Mitra Alirezaei3, Tolga Tasdizen3, Dapeng Li4, Leah Nguyen5, Heran Mane1, Abby Sun6, Ming Hu7, Ross T Whitaker8, Quynh C Nguyen1.
Abstract
Built environment neighborhood characteristics are difficult to measure and assess on a large scale. Consequently, there is a lack of sufficient data that can help us investigate neighborhood characteristics as structural determinants of health on a national level. The objective of this study is to utilize publicly available Google Street View images as a data source for characterizing built environments and to examine the influence of built environments on chronic diseases and health behaviors in the United States. Data were collected by processing 164 million Google Street View images from November 2019 across the United States. Convolutional Neural Networks, a class of multi-layer deep neural networks, were used to extract features of the built environment. Validation analyses found accuracies of 82% or higher across neighborhood characteristics. In regression analyses controlling for census tract sociodemographics, we find that single-lane roads (an indicator of lower urban development) were linked with chronic conditions and worse mental health. Walkability and urbanicity indicators such as crosswalks, sidewalks, and two or more cars were associated with better health, including reduction in depression, obesity, high blood pressure, and high cholesterol. Street signs and streetlights were also found to be associated with decreased chronic conditions. Chain link fence (physical disorder indicator) was generally associated with poorer mental health. Living in neighborhoods with a built environment that supports social interaction and physical activity can lead to positive health outcomes. Computer vision models using manually annotated Google Street View images as a training dataset were able to accurately identify neighborhood built environment characteristics. These methods increases the feasibility, scale, and efficiency of neighborhood studies on health.Entities:
Keywords: GIS; big data; built environment; computer vision; machine learning; structural determinants of health
Mesh:
Substances:
Year: 2022 PMID: 36231394 PMCID: PMC9564970 DOI: 10.3390/ijerph191912095
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Figure 1Computer vision model. Each sample is a single image accompanied by labels corresponding to each neighborhood feature (e.g., crosswalk). The feature extractor is VGG-19 or ResNet18 (depending on the feature analyzed) and is pretrained with ImageNet data. Each feature classifier is a single fully connected layer and the losses are cross entropy. The final loss for optimization is a summation of losses.
Figure 2Geoportal interface. Darker colors signal higher prevalence of a neighborhood characteristic.
Descriptive statistics of neighborhood characteristics and health outcomes, census tract.
| N | Mean (SD) | |
|---|---|---|
|
| ||
| Crosswalks | 70,359 | 3.63 (4.37) |
| Sidewalks | 70,359 | 43.96 (30.72) |
| Single lane road | 70,359 | 67.11 (14.57) |
| Presence of apartment/commercial building | 70,359 | 29.80 (23.69) |
| Streetlights | 70,319 | 15.66 (14.96) |
| Street signs | 70,344 | 24.28 (15.08) |
| 2 or more cars | 70,288 | 36.10 (20.53) |
| Chain Link fence | 70,311 | 7.63 (13.79) |
|
| ||
| Population size | 72,864 | 4237.29 (1972.52) |
| Percent 65 years+ | 72,578 | 13.63 (7.39) |
| Percent male | 72,578 | 49.18 (4.05) |
| Percent Black | 72,578 | 13.83 (22.29) |
| Percent Hispanic | 72,578 | 15.27 (20.82) |
| Percent single female headed households | 72,472 | 13.65 (8.17) |
| Percent owner-occupied housing | 72,472 | 64.32 (22.50) |
| Percent college educated | 72,436 | 27.67 (18.50) |
| Median household income | 72,048 | 67,432.68 (32,960.44) |
| Percent unemployed | 72,330 | 10.36 (6.34) |
| Child opportunity index, range 0 to 100 | 72,213 | 49.15 (28.61) |
|
| ||
| Obesity | 70,338 | 32.63 (6.82) |
| Diabetes | 70,338 | 10.96 (3.73) |
| High Blood Pressure | 70,338 | 32.49 (7.36) |
| High Cholesterol | 70,338 | 31.83 (4.79) |
| Cancer | 70,338 | 6.73 (1.94) |
| Poor mental health days | 70,338 | 15.21 (3.57) |
| Depression | 72,337 | 36.77 (5.23) |
| Sleep less than 7 h a night | 70,338 | 17.61 (3.45) |
| Current Smoking | 70,338 | 17.98 (5.76) |
This table presents summary statistics, mean and standard deviation (SD) for variables included in the analysis. Data were aggregated to the census tract level. N = number of census tracts. Built environment characteristics were derived from Google Street View images. Sociodemographic characteristics came from the American Community Survey 2018 5-year estimates and the 2010 US Census. Health outcomes data came from PLACES 2021.
Figure 3Census tract level distribution of GSV-derived neighborhood characteristics. (a) single lane roads, (b) chain-link fence, (c) crosswalk, (d) sidewalk. Darker colors signal higher prevalence of a neighborhood characteristic.
Figure 4Census tract level distribution of GSV-derived neighborhood characteristics. (a) non-single family home, (b) street light, (c) street sign, (d) two or more cars. Darker colors signal higher prevalence of a neighborhood characteristic.
Figure 5Geographic distribution of sidewalks across census tracts in three metropolitan areas. (A) Washington DC, (B) San Diego, California, (C) Jacksonville, Florida. Darker colors signal higher prevalence of a neighborhood characteristic.
Built environment predictors of adult health outcomes a.
| Obese | High Blood Pressure | High Cholesterol | Diabetes | Cancer | |
|---|---|---|---|---|---|
|
| Crude Odds Ratio (95% CI) | Crude Odds Ratio (95% CI) | Crude Odds Ratio (95% CI) | Crude Odds Ratio (95% CI) | Crude Odds Ratio (95% CI) |
| Single lane road | |||||
| 3rd tertile (highest) | 2.19 (2.06, 2.31) | 3.23 (3.09, 3.36) | 2.18 (2.09, 2.26) | 0.75 (0.69, 0.82) | 0.69 (0.66, 0.73) |
| 2nd tertile | 1.11 (0.99, 1.24) | 1.71 (1.58, 1.84) | 1.41 (1.33, 1.50) | 0.21 (0.14, 0.28) | 0.50 (0.46, 0.53) |
| 2 or more cars | |||||
| 3rd tertile (highest) | −1.97 (−2.09, −1.84) | −3.46 (−3.60, −3.33) | −4.43 (−4.51, −4.34) | 0.34 (0.27, 0.40) | −1.80 (−1.84, −1.77) |
| 2nd tertile | −1.46 (−1.58, −1.33) | −2.15 (−2.29, −2.02) | −2.20 (−2.28, −2.12) | −0.33 (−0.40, −0.26) | −0.71 (−0.75, −0.68) |
| Street signs | |||||
| 3rd tertile (highest) | −2.44 (−2.56, −2.31) | −4.45 (−4.58, −4.32) | −4.68 (−4.76, −4.60) | −0.05 (−0.12, 0.02) | −1.96 (−2.00, −1.93) |
| 2nd tertile | −1.02 (−1.14, −0.89) | −2.04 (−2.17, −1.91) | −2.55 (−2.63, −2.47) | −0.27 (−0.34, −0.20) | −0.81 (−0.84, −0.77) |
| Street lights | |||||
| 3rd tertile (highest) | −1.49 (−1.62, −1.37) | −3.04 (−3.17, −2.91) | −3.89 (−3.97, −3.81) | 0.35 (0.28, 0.42) | −1.64 (−1.68, −1.61) |
| 2nd tertile | −0.86 (−0.99, −0.74) | −2.74 (−2.87, −2.60) | −2.82 (−2.91, −2.74) | −0.54 (−0.61, −0.47) | −0.89 (−0.92, −0.86) |
| Non-single family home | |||||
| 3rd tertile (highest) | −1.58 (−1.70, −1.45) | −3.56 (−3.70, −3.43) | −3.77 (−3.85, −3.69) | 0.12 (0.05, 0.19) | −1.60 (−1.63, −1.56) |
| 2nd tertile | −0.08 (−0.21, 0.04) | −1.20 (−1.33, −1.07) | −1.43 (−1.51, −1.34) | 0.06 (−0.01, 0.13) | −0.59 (−0.62, −0.55) |
| Sidewalks | |||||
| 3rd tertile (highest) | −4.09 (−4.21, −3.96) | −5.83 (−5.95, −5.70) | −5.06 (−5.13, −4.98) | −0.94 (−1.01, −0.87) | −1.82 (−1.85, −1.79) |
| 2nd tertile | −2.33 (−2.45, −2.21) | −3.23 (−3.36, −3.10) | −2.85 (−2.92, −2.77) | −0.78 (−0.85, −0.71) | −0.77 (−0.81, −0.74) |
| Crosswalks | |||||
| 3rd tertile (highest) | −4.49 (−4.61, −4.37) | −5.99 (−6.12, −5.86) | −4.86 (−4.94, −4.78) | −1.25 (−1.32, −1.18) | −1.57 (−1.61, −1.54) |
| 2nd tertile | −1.84 (−1.96, −1.72) | −2.68 (−2.81, −2.55) | −2.40 (−2.48, −2.32) | −0.58 (−0.65, −0.51) | −0.68 (−0.71, −0.65) |
| N | 67,445 | 67,445 | 67,445 | 67,445 | 67,445 |
|
|
|
|
|
| |
| Single lane road | |||||
| 3rd tertile (highest) | 1.34 (1.26, 1.42) | 1.15 (1.08, 1.21) | 0.65 (0.60, 0.70) | 0.34 (0.31, 0.38) | 0.11 (0.10, 0.12) |
| 2nd tertile | 0.76 (0.68, 0.83) | 0.67 (0.60, 0.73) | 0.35 (0.30, 0.40) | 0.14 (0.11, 0.18) | 0.08 (0.07, 0.09) |
| 2 or more cars | |||||
| 3rd tertile (highest) | −3.39 (−3.48, −3.30) | −2.90 (−2.98, −2.82) | −1.67 (−1.74, −1.61) | −1.23 (−1.28, −1.19) | −0.37 (−0.38, −0.36) |
| 2nd tertile | −0.98 (−1.06, −0.90) | −1.55 (−1.61, −1.48) | −1.05 (−1.10, −0.99) | −0.72 (−0.76, −0.69) | −0.18 (−0.19, −0.17) |
| Street signs | |||||
| 3rd tertile (highest) | −2.71 (−2.81, −2.62) | −2.34 (−2.42, −2.26) | −1.32 (−1.39, −1.26) | −0.92 (−0.97, −0.88) | −0.31 (−0.32, −0.30) |
| 2nd tertile | −1.11 (−1.19, −1.03) | −1.44 (−1.51, −1.38) | −0.87 (−0.92, −0.81) | −0.68 (−0.72, −0.65) | −0.15 (−0.16, −0.14) |
| Street lights | |||||
| 3rd tertile (highest) | −1.56 (−1.65, −1.48) | −0.83 (−0.87, −0.80) | −1.99 (−2.07, −1.92) | −1.36 (−1.42, −1.30) | −0.28 (−0.29, −0.27) |
| 2nd tertile | −0.69 (−0.77, −0.61) | −0.62 (−0.65, −0.58) | −1.37 (−1.44, −1.30) | −1.00 (−1.05, −0.94) | −0.15 (−0.16, −0.14) |
| Non-single family home | |||||
| 3rd tertile (highest) | −1.90 (−1.99, −1.81) | −1.59 (−1.67, −1.52) | −1.00 (−1.06, −0.94) | −0.60 (−0.64, −0.56) | −0.19 (−0.20, −0.18) |
| 2nd tertile | −0.38 (−0.46, −0.31) | −0.67 (−0.74, −0.61) | −0.45 (−0.50, −0.40) | −0.27 (−0.30, −0.23) | −0.09 (−0.10, −0.08) |
| Sidewalks | |||||
| 3rd tertile (highest) | −3.07 (−3.16, −2.97) | −3.12 (−3.20, −3.04) | −1.85 (−1.91, −1.79) | −1.13 (−1.17, −1.09) | −0.34 (−0.35, −0.32) |
| 2nd tertile | −1.07 (−1.15, −0.98) | −1.71 (−1.78, −1.64) | −1.20 (−1.25, −1.14) | −0.75 (−0.79, −0.71) | −0.14 (−0.15, −0.13) |
| Crosswalks | |||||
| 3rd tertile (highest) | −2.99 (−3.08, −2.90) | −1.29 (−1.33, −1.25) | −3.07 (−3.14, −2.99) | −1.85 (−1.91, −1.79) | −0.28 (−0.29, −0.27) |
| 2nd tertile | −0.80 (−0.88, −0.72) | −0.63 (−0.67, −0.60) | −1.46 (−1.52, −1.39) | −0.96 (−1.01, −0.90) | −0.10 (−0.11, −0.09) |
| N | 67,167 | 67,167 | 67,167 | 67,167 | 67,167 |
a Data source for health outcome: CDC PLACES 2021. b Adjusted Linear regression models were run for each outcome separately. Models controlled for census tract population size, percent of the population 65 years and older, percent male, percent Hispanic, percent black, median household income, percent female headed households, and percent owner occupied housing, percent with a college degree, percent employed, and child opportunity index. Built environment characteristics were categorized into tertiles, with the lowest tertile serving as the referent group. Standard errors adjusted for clustering of values within a census tract.
Built environment predictors of adult mental health and risk behaviors a.
| Poor Mental Health Days | Depression | Inadequate Sleep (<7 h a Night) | Current Smoking | |
|---|---|---|---|---|
|
| Adjusted Odds Ratio (95% CI) b | Adjusted Odds Ratio (95% CI) b | Adjusted Odds Ratio (95% CI) b | Adjusted Odds Ratio (95% CI) b |
| Single lane road | ||||
| 3rd tertile (highest) | 0.51 (0.48, 0.55) | 0.82 (0.78, 0.87) | 0.19 (0.13, 0.24) | 0.82 (0.76, 0.87) |
| 2nd tertile | 0.32 (0.28, 0.35) | 0.60 (0.55, 0.64) | −0.19 (−0.25, −0.14) | 0.35 (0.30, 0.41) |
| Chain-linked fence | ||||
| 3rd tertile (highest) | 0.17 (0.12, 0.21) | 0.43 (0.37, 0.48) | −0.30 (−0.37, −0.24) | −0.58 (−0.65, −0.52) |
| 2nd tertile | −0.14 (−0.17, −0.10) | 0.10 (0.05, 0.14) | −0.40 (−0.45, −0.35) | −0.80 (−0.85, −0.75) |
| Crosswalks | ||||
| 3rd tertile (highest) | −0.80 (−0.84, −0.76) | −1.29 (−1.35, −1.23) | −0.56 (−0.62, −0.49) | −2.04 (−2.10, −1.97) |
| 2nd tertile | −0.16 (−0.19, −0.12) | −0.35 (−0.40, −0.30) | −0.15 (−0.21, −0.09) | −0.68 (−0.74, −0.63) |
| Sidewalks | ||||
| 3rd tertile (highest) | −0.89 (−0.93, −0.85) | −1.46 (−1.52, −1.40) | 0.51 (0.44, 0.57) | −1.68 (−1.74, −1.61) |
| 2nd tertile | −0.19 (−0.23, −0.16) | −0.37 (−0.42, −0.32) | 0.01 (−0.05, 0.07) | −0.65 (−0.71, −0.60) |
| Non-single family home | ||||
| 3rd tertile (highest) | −0.68 (−0.72, −0.64) | −1.37 (−1.43, −1.32) | −0.67 (−0.73, −0.60) | −1.11 (−1.17, −1.04) |
| 2nd tertile | −0.31 (−0.35, −0.28) | −0.44 (−0.48, −0.39) | −0.82 (−0.88, −0.77) | −0.51 (−0.56, −0.45) |
| Street lights | ||||
| 3rd tertile (highest) | −0.28 (−0.32, −0.25) | −0.80 (−0.86, −0.75) | −0.01 (−0.07, 0.05) | −1.02 (−1.09, −0.96) |
| 2nd tertile | −0.18 (−0.21, −0.14) | −0.25 (−0.30, −0.20) | −0.11 (−0.16, −0.05) | −0.57 (−0.63, −0.52) |
| Street signs | ||||
| 3rd tertile (highest) | −0.42 (−0.46, −0.38) | −0.81 (−0.87, −0.75) | 0.57 (0.50, 0.64) | −1.23 (−1.30, −1.16) |
| 2nd tertile | 0.18 (−0.22, −0.15) | −0.30 (−0.35, −0.25) | −0.02 (−0.07, 0.04) | −0.72 (−0.77, −0.66) |
| 2 or more cars | ||||
| 3rd tertile (highest) | −0.67 (−0.72, −0.63) | −1.18 (−1.24, −1.12) | 0.17 (0.10, 0.24) | −1.69 (−1.75, −1.62) |
| 2nd tertile | −0.17 (−0.20, −0.13) | −0.34 (−0.39, −0.29) | 0.04 (−0.02, 0.09) | −0.64 (−0.69, −0.58) |
| N | 67,167 | 67,167 | 67,167 | 67,167 |
a Data source for health outcome: CDC PLACES 2021. b Adjusted Linear regression models were run for each outcome separately. Models controlled for census tract population size, percent of the population 65 years and older, percent male, percent Hispanic, percent black, median household income, percent female headed households, and percent owner occupied housing, percent with a college degree, percent employed, and child opportunity index. Built environment characteristics were categorized into tertiles, with the lowest tertile serving as the referent group. Standard errors adjusted for clustering of values within a census tract.