| Literature DB >> 32831580 |
Jorge A Huertas1, Alejandro Palacio1, Marcelo Botero1, Germán A Carvajal1, Thomas van Laake2, Diana Higuera-Mendieta3, Sergio A Cabrales1, Luis A Guzman4, Olga L Sarmiento3, Andrés L Medaglia1.
Abstract
The Level of Traffic Stress (LTS) is an indicator that quantifies the stress experienced by a cyclist on the segments of a road network. We propose an LTS-based classification with two components: a clustering component and an interpretative component. Our methodology is comprised of four steps: (i) compilation of a set of variables for road segments, (ii) generation of clusters of segments within a subset of the road network, (iii) classification of all segments of the road network into these clusters using a predictive model, and (iv) assignment of an LTS category to each cluster. At the core of the methodology, we couple a classifier (unsupervised clustering algorithm) with a predictive model (multinomial logistic regression) to make our approach scalable to massive data sets. Our methodology is a useful tool for policy-making, as it identifies suitable areas for interventions; and can estimate their impact on the LTS classification, according to probable changes to the input variables (e.g., traffic density). We applied our methodology on the road network of Bogotá, Colombia, a city with a history of implementing innovative policies to promote biking. To classify road segments, we combined government data with open-access repositories using geographic information systems (GIS). Comparing our LTS classification with city reports, we found that the number of bicyclists' fatal and non-fatal collisions per kilometer is positively correlated with higher LTS. Finally, to support policy making, we developed a web-enabled dashboard to visualize and analyze the LTS classification and its underlying variables.Entities:
Keywords: Bogotá; Cluster Analysis; Cycling; Latin America; Level of Traffic Stress
Year: 2020 PMID: 32831580 PMCID: PMC7437968 DOI: 10.1016/j.trd.2020.102420
Source DB: PubMed Journal: Transp Res D Transp Environ ISSN: 1361-9209 Impact factor: 5.495
Fig. 1General overview of the data-informed LTS-based classification methodology.
Variables used in the LTS-based methodology.
| Variable | Factor | Type |
|---|---|---|
| Roadway width | Physical | Continuous |
| Number of lanes | Physical | Discrete |
| Presence of cycling infrastructure | Physical | Categorical |
| Presence of heavy vehicles | Traffic mix | Categorical |
| Vehicles' speed | Traffic | Continuous |
| Traffic density | Traffic | Continuous |
| Traffic flow | Traffic | Continuous |
| Congestion index | Traffic | Continuous |
Fig. 2Cluster analysis intuition.
Fig. 3Cluster analysis overview.
Fig. 4Coupling between the PAM algorithm and the multinomial logistic regression.
Fig. 5Transforming raw data into input variables.
Fig. 6Visual display of input variables for the neighborhood of Ciudad Salitre.
Fig. 7Silhouette width of the PAM algorithm after clustering segments in the locality of Usaquén.
Results of the multinomial logistic regression: p-values of the variables’ coefficients.
| Variable | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 |
|---|---|---|---|---|
| Road width | Base category | 0.298 | < 0.001 | < 0.001 |
| Number of lanes | < 0.001 | < 0.001 | < 0.001 | |
| Vehicles' speed | < 0.001 | < 0.001 | < 0.001 | |
| Traffic density | < 0.001 | < 0.001 | < 0.001 | |
| Traffic flow | < 0.001 | < 0.001 | < 0.001 | |
| Congestion | < 0.001 | < 0.001 | < 0.001 | |
| Cycling infrastructure | 0.021 | 0.013 | 0.015 | |
| Public transport lines | 0.001 | 0.001 | 0.001 |
Average probability of belonging to the classified cluster and number of classified observations.
| Average probability of belonging to cluster | ||||||
|---|---|---|---|---|---|---|
| Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 | |||
| 0.000 | 0.000 | 0.000 | 48,675 | |||
| 0.000 | 0.002 | 0.000 | 11,309 | |||
| 0.000 | 0.001 | 0.009 | 46,297 | |||
| 0.000 | 0.000 | 0.007 | 61,237 | |||
Descriptive statistics for each cluster.
| Metric | Clusters | |||||||
|---|---|---|---|---|---|---|---|---|
| 3 | 1 | 4 | 2 | |||||
| Number of segments | 61,237 | 46,297 | 11,309 | 48,675 | ||||
| Kilometers per cluster | 2,670.19 | 1,906.63 | 473.58 | 1,680.30 | ||||
| Road width (m) | 6.45 | 1.65 | 7.11 | 1.74 | 8.97 | 2.90 | 7.92 | 2.03 |
| Number of lanes | 1.96 | 0.21 | 1.95 | 0.33 | 2.31 | 0.83 | 2.07 | 0.50 |
| Vehicles speed (km/h) | 15.02 | 6.51 | 17.16 | 7.43 | 25.27 | 11.34 | 21.00 | 9.54 |
| Traffic density (cars/km) | 43.66 | 34.98 | 159.66 | 47.29 | 157.20 | 52.01 | 140.60 | 61.96 |
| Traffic flow (cars/h) | 611.70 | 568.43 | 2,633.70 | 1,127.97 | 3,844.00 | 1,922.41 | 2,862.00 | 1673.66 |
| Congestion | 0.04 | 0.04 | 0.41 | 0.26 | 0.42 | 0.26 | 0.36 | 0.27 |
| Cycling infrastructure | 0.02 | 0.98 | 0.03 | 0.97 | 0.99 | 0.00 | 0.00 | 1.00 |
| Public transport lines | 0.00 | 1.00 | 0.00 | 1.00 | 0.89 | 0.12 | 1.00 | 0.00 |
Fig. 8Radar plot with the average values of every variable per cluster.
Assigning LTS labels to clusters. Semaphore colors shows stress from green (less stress) to red (high stress).
Fig. 9LTS intersection classification in Ciudad Salitre.
Results of the pairwise comparison of LTS classifiers by the Mantel tests (p-values).
| Segments in | LTS using PAM algorithm classification | LTS using the multinomial logistic regression trained by the PAM algorithm over the segments in | ||
|---|---|---|---|---|
| Usaquén | Suba | Kennedy | ||
| Usaquén | LTS Low | – | 0.401 | 0.709 |
| LTS Medium | – | 0.257 | 0.912 | |
| LTS High | – | 0.912 | 0.559 | |
| LTS Extremely high | – | 0.257 | 0.709 | |
| Suba | LTS Low | 0.257 | – | 0.401 |
| LTS Medium | 0.401 | – | 0.401 | |
| LTS High | 0.146 | – | 0.559 | |
| LTS Extremely high | 0.709 | – | 0.912 | |
| Kennedy | LTS Low | 0.829 | 0.257 | – |
| LTS Medium | 0.559 | 0.146 | – | |
| LTS High | 0.829 | 0.829 | – | |
| LTS Extremely high | 0.401 | 0.912 | – | |
Standardized collisions per kilometer of LTS.
| Non-fatal collisions / kilometer | Fatal collisions / kilometer | |||||||
|---|---|---|---|---|---|---|---|---|
| LTS Low | LTS Medium | LTS High | LTS Extremely High | LTS Low | LTS Medium | LTS High | LTS Extremely High | |
| Mean | 0.01505 | 0.09448 | 0.48436 | 0.50431 | 0.00094 | 0.00126 | 0.01060 | 0.02636 |
| Standard deviation | 0.01793 | 0.04718 | 0.35576 | 0.24492 | 0.00293 | 0.00311 | 0.01907 | 0.02329 |
| Half width (95%) | 0.00864 | 0.02274 | 0.17147 | 0.11805 | 0.00141 | 0.00150 | 0.00919 | 0.01122 |
| Localities (data points) | 19 | 19 | 19 | 19 | 19 | 19 | 19 | 19 |
Fig. 10Confidence intervals of the number of injuries per kilometer of LTS.