| Literature DB >> 35457416 |
Marc A Adams1, Christine B Phillips2, Akshar Patel1, Ariane Middel3.
Abstract
The study purpose was to train and validate a deep learning approach to detect microscale streetscape features related to pedestrian physical activity. This work innovates by combining computer vision techniques with Google Street View (GSV) images to overcome impediments to conducting audits (e.g., time, safety, and expert labor cost). The EfficientNETB5 architecture was used to build deep learning models for eight microscale features guided by the Microscale Audit of Pedestrian Streetscapes Mini tool: sidewalks, sidewalk buffers, curb cuts, zebra and line crosswalks, walk signals, bike symbols, and streetlights. We used a train-correct loop, whereby images were trained on a training dataset, evaluated using a separate validation dataset, and trained further until acceptable performance metrics were achieved. Further, we used trained models to audit participant (N = 512) neighborhoods in the WalkIT Arizona trial. Correlations were explored between microscale features and GIS-measured and participant-reported neighborhood macroscale walkability. Classifier precision, recall, and overall accuracy were all over >84%. Total microscale was associated with overall macroscale walkability (r = 0.30, p < 0.001). Positive associations were found between model-detected and self-reported sidewalks (r = 0.41, p < 0.001) and sidewalk buffers (r = 0.26, p < 0.001). The computer vision model results suggest an alternative to trained human raters, allowing for audits of hundreds or thousands of neighborhoods for population surveillance or hypothesis testing.Entities:
Keywords: Google Street View; built environment; computer vision; deep learning; microscale; walkability
Mesh:
Year: 2022 PMID: 35457416 PMCID: PMC9028816 DOI: 10.3390/ijerph19084548
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Example of individual-level buffers and macroscale walkability components and index values for a single participant’s neighborhood. Note: In the example in Figure 1, land-use mix shows residential, recreational, and civic uses. Other land uses such as office, food, entertainment, and retail were possible but not present in this example.
Figure 2Classifier training and evaluation process.
Summary of the number of images used for training and validation datasets.
| Street Feature | Image Counts | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Present | Absent | Total | ||||||||
| All | All | Phoenix Only Training | Phoenix Only | All Training | All | Phoenix Only Training | Phoenix Only | Training | Validation | |
| Sidewalk | 8868 | 2851 | 5177 | 1745 | 3702 | 1254 | 2298 | 429 | 12570 | 4105 |
| Sidewalk buffer | 3530 | 629 | 1519 | 347 | 6066 | 1773 | 4461 | 1567 | 9596 | 2402 |
| Curb cuts | 5947 | 599 | 2406 | 268 | 6059 | 767 | 2459 | 599 | 12006 | 1366 |
| Zebra | 1687 | 2456 | 412 | 100 | 5604 | 6121 | 2971 | 879 | 7291 | 8577 |
| Line | 1762 | 1053 | 1693 | 758 | 4057 | 2462 | 3798 | 2257 | 5819 | 3515 |
| Walk Signal | 3126 | 509 | 1951 | 216 | 4722 | 1221 | 2747 | 1014 | 7848 | 1730 |
| Bike Symbol | 1127 | 152 | 853 | 132 | 9306 | 2138 | 6908 | 2078 | 10433 | 2290 |
| Streetlight | 1380 | 288 | 808 | 170 | 1213 | 273 | 761 | 171 | 2593 | 561 |
Validation performance of image classifiers for Phoenix, AZ.
| Street Feature | Performance | ||||
|---|---|---|---|---|---|
| Precision | Recall | Negative Predictive Value | Specificity | Accuracy | |
| Sidewalk | 97.93% | 97.48% | 89.93% | 91.61% | 96.32% |
| Sidewalk buffer | 86.73% | 84.73% | 96.63% | 97.13% | 94.88% |
| Curb cut | 95.38% | 92.54% | 96.71% | 98.00% | 96.31% |
| Zebra crosswalk | 100% | 96.00% | 99.55% | 100% | 99.59% |
| Line crosswalk | 95.97% | 94.20% | 98.06% | 98.67% | 97.55% |
| Walk signals | 96.77% | 97.22% | 99.41% | 99.31% | 98.94% |
| Bike symbols | 93.28% | 94.70% | 99.66% | 99.57% | 99.28% |
| Streetlight | 88.64% | 91.76% | 91.52% | 88.30% | 90.03% |
Model-detected microscale feature correlations with GIS-measured macro-level walkability and perceived NEWS scales.
| Model- | GIS-Measured Macroscale Neighborhood | Perceived Neighborhood Walkability | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Residential | Land-Use Mix | Intersection | Transit Density | Overall | Residential Density | Land-Use Mix | Street | Walking and | Aesthetics | Pedestrian Safety | Crime Safety | |
| Sidewalks | 0.12 ** | 0.05 | 0.18 ** | −0.06 | 0.02 | −0.06 | −0.02 | −0.03 | 0.11 * | −0.24 *** | 0.01 | −0.02 |
| Sidewalk Buffers | 0.18 *** | 0.30 *** | −0.14 ** | 0.01 | 0.17 *** | 0.07 † | −0.01 | 0.05 | 0.17 *** | 0.19 ** | −0.08 † | 0.01 |
| Curb Cuts | 0.04 | 0.16 * | −0.16 *** | −0.20 *** | −0.11 * | −0.19 *** | −0.06 | 0.06 | 0.17 *** | −0.03 | 0.08 † | 0.04 |
| Zebra crosswalks | 0.16 *** | −0.07 | 0.04 | 0.37 *** | 0.02 | 0.15 ** | 0.04 | −0.01 | −0.04 | −0.06 | −0.04 | −0.07 |
| Line crosswalks | 0.06 | 0.42 *** | −0.14 ** | 0.13 ** | 0.39 *** | 0.28 *** | 0.24 *** | 0.01 | 0.02 | 0.03 | −0.01 | −0.02 |
| All crosswalks | 0.07 † | 0.39 *** | −0.12 ** | 0.38 ** | 0.38 *** | 0.30 *** | 0.23 *** | 0.00 | 0.01 | 0.01 | −0.01 | −0.03 |
| Walk Signals | 0.09 * | 0.37 *** | −0.10 * | 0.52 *** | 0.46 *** | 0.31 *** | 0.23 ** | 0.02 | 0.00 | 0.07 | −0.07 † | −0.07 |
| Bike Symbols | 0.17 ** | 0.22 *** | 0.06 | 0.20 *** | 0.28 *** | 0.25 *** | 0.15 ** | −0.01 | 0.02 | −0.03 | −0.03 | −0.05 |
| Streetlights | 0.23 *** | 0.38 *** | 0.00 | 0.12 ** | 0.35 *** | 0.17 *** | 0.07 | −0.00 | 0.14 ** | −0.03 | −0.06 | −0.07 |
| Total Microscale | 0.19 *** | 0.38 *** | −0.12 * | 0.11 * | 0.30 *** | 0.13 ** | 0.07 † | 0.02 | 0.21 *** | 0.04 | −0.02 | −0.02 |
Notes: Spearman rank correlation coefficients. † p < 0.10, * p < 0.05, ** p < 0.01, *** p < 0.001. Model-detected features were assessed by z-scoring the average count of positive feature instances for coordinates within a 500 m street network buffer around participants’ homes. Perceived neighborhood features were assessed by the Neighborhood Environment Walkability Scale (NEWS). All crosswalks = sum of zebra and line crosswalks. Total microscale score = sum of z-score averages for bike symbols, all crosswalks, curb cuts, walk signals, sidewalks, sidewalk buffers, and streetlights within each participant’s 500 m neighborhood buffer.
Validation performance of image classifiers in pooled dataset including Phoenix AZ, San Diego CA, Washington D.C., Seattle WA, and Baltimore MD.
| Street Feature | Performance | ||||
|---|---|---|---|---|---|
| Precision | Recall | Negative Predictive Value | Specificity | Accuracy | |
| Sidewalk | 97.25% | 96.81% | 92.82% | 93.78% | 95.88% |
| Sidewalk buffer | 87.10% | 85.85% | 95.01% | 95.49% | 92.96% |
| Curb cut | 83.21% | 65.86% | 52.32% | 73.81% | 68.54% |
| Zebra crosswalk | 97.33% | 84.97% | 93.61% | 98.95% | 94.62% |
| Line crosswalk | 89.20% | 75.59% | 71.20% | 86.83% | 80.20% |
| Walk signals | 86.00% | 73.38% | 68.80% | 83.09% | 77.40% |
| Bike symbols | 95.00% | 95.00% | 98.33% | 98.33% | 97.50% |
| Streetlight | 84.30% | 86.44% | 83.84% | 81.37% | 84.09% |