| Literature DB >> 29718953 |
Rahul Goel1, Leandro M T Garcia1, Anna Goodman2, Rob Johnson1, Rachel Aldred3, Manoradhan Murugesan4, Soren Brage5, Kavi Bhalla4, James Woodcock1.
Abstract
BACKGROUND: Street imagery is a promising and growing big data source providing current and historical images in more than 100 countries. Studies have reported using this data to audit road infrastructure and other built environment features. Here we explore a novel application, using Google Street View (GSV) to predict travel patterns at the city level.Entities:
Mesh:
Year: 2018 PMID: 29718953 PMCID: PMC5931639 DOI: 10.1371/journal.pone.0196521
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Snapshot of the web-based questionnaire (image is illustrative and not from GSV to comply with copyright issues).
Fig 2Pearson correlation among GSV, APS and Census variables.
Descriptive statistics.
| Mean | Std. Deviation | Minimum | 25th Percentile | Median | 75th Percentile | Maximum | |
|---|---|---|---|---|---|---|---|
| Population | 498,758 | 532,382 | 107,053 | 211,826 | 339,864 | 578,604 | 2,422,818 |
| Number of local authorities per PUA | 2.2 | 1.7 | 1 | 1 | 5 | 2.8 | 9 |
| Census Walk (%) | 12.7 | 3.3 | 8.7 | 10.1 | 11.7 | 15.1 | 19.8 |
| Census Cycle (%) | 4.8 | 6.1 | 1.1 | 1.8 | 2.7 | 4.8 | 32.5 |
| Census MC (%) | 0.8 | 0.3 | 0.2 | 0.6 | 0.7 | 1.0 | 1.4 |
| Census Bus (%) | 10.6 | 5.0 | 5.0 | 7.2 | 8.3 | 13.6 | 28.6 |
| Census PT+Walk (%) | 28.0 | 7.6 | 16.2 | 23.2 | 27.9 | 30.7 | 49.4 |
| Census Car (%) | 65.9 | 10.8 | 36.8 | 62.6 | 67.4 | 72.5 | 81.3 |
| Census Cycle M/F (ratio) | 2.49 | 1.05 | 1.02 | 1.78 | 2.33 | 2.98 | 5.61 |
| APS Prev All Cycle (%) | 16.4 | 9.4 | 8.1 | 11.7 | 13.9 | 17.9 | 53.4 |
| APS Days All Cycle (per week) | 0.42 | 0.40 | 0.15 | 0.24 | 0.29 | 0.40 | 2.11 |
| APS Duration All Cycle (h per day) | 0.40 | 0.27 | 0.19 | 0.28 | 0.32 | 0.39 | 1.59 |
| APS Prev Utly Cycle (%) | 8.9 | 9.5 | 1.7 | 4.3 | 5.7 | 8.3 | 47.5 |
| APS Days Utly Cycle (per week) | 0.27 | 0.36 | 0.05 | 0.10 | 0.14 | 0.23 | 1.80 |
| APS Duration Utly Cycle (h per day) | 0.21 | 0.24 | 0.04 | 0.11 | 0.14 | 0.18 | 1.29 |
| APS Prev All Walk (%) | 85.2 | 2.4 | 79.7 | 83.7 | 85.3 | 86.9 | 90.8 |
| APS Days All Walk (per week) | 3.56 | 0.18 | 3.27 | 3.42 | 3.53 | 3.67 | 4.13 |
| APS Duration All Walk (h per day) | 4.14 | 0.37 | 3.49 | 3.93 | 4.19 | 4.38 | 5.04 |
| APS Prev Utly Walk (%) | 58.4 | 5.4 | 47.6 | 53.8 | 58.8 | 61.6 | 73.5 |
| APS Days Utly Walk (per week) | 2.06 | 0.24 | 1.66 | 1.87 | 2.03 | 2.17 | 2.54 |
| APS Duration Utly Walk (h per day) | 2.65 | 0.29 | 2.09 | 2.49 | 2.70 | 2.81 | 3.23 |
| APS Prev All Cycle M/F (ratio) | 2.36 | 0.61 | 1.16 | 2.06 | 2.34 | 2.71 | 3.70 |
| APS Prev Utly Cycle M/F (ratio) | 3.66 | 2.42 | 1.18 | 2.22 | 2.92 | 4.22 | 11.29 |
| GSV Cycle | 18 | 20 | 3 | 6 | 14 | 19 | 94 |
| GSV P-Cycle | 16 | 32 | 0 | 3 | 6 | 10 | 132 |
| GSV Walk | 209 | 56 | 138 | 170 | 192 | 238 | 371 |
| GSV Car | 1438 | 115 | 1111 | 1372 | 1430 | 1532 | 1620 |
| GSV Bus | 27 | 14 | 11 | 17 | 23 | 34 | 74 |
| GSV MC | 11 | 7 | 1 | 6 | 10 | 14 | 42 |
| GSV Autumn (%) | 26.9 | 24.9 | 0 | 7 | 24.7 | 35.5 | 94.2 |
| GSV Spring (%) | 30.3 | 28.9 | 0 | 1.9 | 26.1 | 45.9 | 94.4 |
| GSV Summer (%) | 42.7 | 27.5 | 5.2 | 26.4 | 34.3 | 65.3 | 99.6 |
| GSV Winter (%) | 0.03 | 0.1 | 0 | 0 | 0 | 0 | 0.7 |
Fig 3Linear relationships of GSV observations with commute share and prevalence of walking and cycling (3(a) and 3(b) include R2 using log transformed variables in parentheses to reduce the effect of outliers).
Fig 4Linear relationships of GSV observations with APS measures of walking and cycling (4(a) includes R2 using log transformed variables in parentheses to reduce the effect of outliers).
Regression models.
| Beta regression models | Linear regression models | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Model 1 | Model 2 | Model 3 | Model 4 | Model 5 | Model 6 | Model 7 | Model 8 | Model 9 | Model 10 | |
| Census PT+Walk | Census Cycle | Census MC | Census Car | APS Prev All Cycl | APS Prev Utly Cycl | APS Prev All Walk | APS Prev Utly Walk | APS Days Utly Cycl | APS Days Utly Walk | |
| -1.356 | -4.877 | -4.759 | 1.223 | -1.995 | -3.662 | 1.571 | 0.192 | -0.033 | 1.562 | |
| -0.004 | 0.002 | |||||||||
| -0.004 | 0.408 | 0.006 | -0.01 | 0.026 | 0.467 | 0.007 | 0.008 | 0.015 | ||
| 0.027 | ||||||||||
| 0.020 | -0.018 | -0.016 | -0.008 | |||||||
| 0.553 | ||||||||||
| -0.013 | 0.010 | 0.009 | ||||||||
| 0.04 | 0.015 | 0.002 | 0.04 | 0.03 | 0.02 | 0.02 | 0.03 | 0.11 | 0.17 | |
| 0.03 | 0.008 | 0.001 | 0.04 | 0.02 | 0.01 | 0.01 | 0.02 | 0.07 | 0.16 | |
| 1.076 | 0.867 | 1.415 | 1.089 | 0.921 | 0.965 | 1.38 | 0.908 | - | - | |
| 0.865 | 0.653 | 0.712 | 0.959 | 0.735 | 0.679 | 0.933 | 0.629 | - | - | |
*Model 9: robust linear regression;
^ For square root of GSV Cycle.
Fig 5Observed and predicted mode shares using LOOCV with mean absolute error (MAE) and median absolute error (MDAE) for cycling measures (a: Model 1; b: Model 2; c: Model 3; d: Model 4).
Fig 7Observed and predicted average number of days using LOOCV with mean absolute error (MAE) and median absolute error (MDAE) for cycling measure (a: Model 9; b: Model 10).
Fig 6Observed and predicted prevalence measures using LOOCV with mean absolute error (MAE) and median absolute error (MDAE) for cycling measures (a: Model 5; b: Model 6; c: Model 7; d: Model 8).
Comparison of GSV, Census and APS estimates of gender split of cyclists.
| Groups | Number of GSV observations | Ratios of male to female | ||||
|---|---|---|---|---|---|---|
| Females | Males | GSV Observations | Commute cycling share | APS Prevalence of all-purpose cycling | APS Prevalence of utility cycling | |
| All | 82 | 172 | 2.10 | 1.63 | 1.82 | 2.21 |
| Group 1 | 28 | 33 | 1.18 | 1.02 | 1.28 | 1.31 |
| Group 2 | 18 | 42 | 2.33 | 1.16 | 1.33 | 1.29 |
| Group 3 | 18 | 39 | 2.17 | 1.51 | 2.07 | 1.81 |
| Group 4 | 18 | 58 | 3.22 | 2.60 | 2.45 | 3.98 |