| Literature DB >> 35009547 |
Tianjian Yu1, Fan Gao1, Xinyuan Liu1, Jinjun Tang1.
Abstract
Spatial autocorrelation and skewed distribution are the most frequent issues in crash rate modelling analysis. Previous studies commonly focus on the spatial autocorrelation between adjacent regions or the relationships between crash rate and potentially risky factors across different quantiles of crash rate distribution, but rarely both. To overcome the research gap, this study utilizes the spatial autoregressive quantile (SARQ) model to estimate how contributing factors influence the total and fatal-plus-injury crash rates and how modelling relationships change across the distribution of crash rates considering the effects of spatial autocorrelation. Three types of explanatory variables, i.e., demographic, traffic networks and volumes, and land-use patterns, were considered. Using data collected in New York City from 2017 to 2019, the results show that: (1) the SARQ model outperforms the traditional quantile regression model in prediction and fitting performance; (2) the effects of variables vary with the quantiles, mainly classifying three types: increasing, unchanged, and U-shaped; (3) at the high tail of crash rate distribution, the effects commonly have sudden increases/decrease. The findings are expected to provide strategies for reducing the crash rate and improving road traffic safety.Entities:
Keywords: crash rate modelling; quantile effects; quantile regression; spatial autocorrelation; spatial autoregressive
Mesh:
Year: 2021 PMID: 35009547 PMCID: PMC8747712 DOI: 10.3390/s22010005
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Study Area.
Figure 2Spatial distribution of crash rates.
Variables explanation and statistics.
| Variables | Descriptions | Min | Average | Max | S.D. |
|---|---|---|---|---|---|
| Dependent Variables | |||||
| TCR | Total crash rate | 0.000 | 0.449 | 15.352 | 0.769 |
| I-F_CR | Injury and fatal crash rate | 0.000 | 2.231 | 100.001 | 4.353 |
| Independent Variables | |||||
| Education | Percent graduate high school or higher aged over 16 years old in each CT | 0.000 | 80.058 | 100.000 | 13.922 |
| PD | The number of people per km2 in each CT (in thousands) | 0.000 | 20.870 | 98.924 | 14.092 |
| P_YOU | Percent of youth (aged under 19) in each CT | 0.000 | 22.779 | 67.100 | 8.196 |
| P_ELD | Percent of elderly (aged over 60) in each CT | 0.000 | 19.139 | 100.000 | 8.444 |
| MHC | Median household incomes in each CT (in thousands / dollars) | 0.000 | 62.479 | 250.000 | 32.923 |
| N_VHU | The number of vacant housing units in each CT | 0.000 | 8.539 | 100.000 | 6.375 |
| CWC | Percent of people who commute to work by car in each CT | 0.000 | 27.976 | 100.000 | 17.969 |
| CWPT | Percent of people who commute to work by public transit in each CT | 0.000 | 55.601 | 100.000 | 16.317 |
| CWF | Percent of people who commute to work by foot in each CT | 0.000 | 9.157 | 100.000 | 9.563 |
| MCT | Mean commute time in each CTs (minutes) | 0.000 | 40.923 | 73.900 | 8.833 |
| P_COM | Percent of area used for commercial purpose in each CT | 0.000 | 0.234 | 1.000 | 0.202 |
| P_RES_L | Percent of area used for residential purpose in each CT | 0.000 | 0.726 | 1.000 | 0.204 |
| P_GAR | Percent of area used for garage purpose in each CT | 0.000 | 0.017 | 0.462 | 0.035 |
| P_IND | Percent of area used for industrial purpose in each CT | 0.000 | 0.017 | 0.856 | 0.057 |
| P_ENT_I | The entropy index used to measure the land use diversity in each CT | 0.000 | 0.547 | 1.234 | 0.240 |
| DKMT | Daily vehicle kilometer travelled in each CT (106 vehicle.km) | 0.000 | 0.411 | 5.489 | 0.580 |
| RD | The road length per km2 in each CT (km−1) | 2.435 | 96.239 | 319.955 | 29.254 |
| P_SL_20 | Percent of segment length posted speed 20 mph to total length in each CT | 0.000 | 0.032 | 1.000 | 0.136 |
| P_SL_25 | Percent of segment length posted speed 25 mph to total length in each CT | 0.000 | 0.904 | 1.000 | 0.185 |
| P_SL_30 | Percent of segment length with posted speed 30 mph to total length in each C | 0.000 | 0.012 | 0.508 | 0.042 |
| P_SL_35 | Percent of segment length with posted speed 35 mph to total length in each CT | 0.000 | 0.003 | 0.396 | 0.024 |
| P_SL_40 | Percent of segment length with posted speed 40 mph to total length in each CT | 0.000 | 0.005 | 0.530 | 0.037 |
| P_SL_45 | Percent of segment length with posted speed 45 mph to total length in each CT | 0.000 | 0.005 | 0.482 | 0.030 |
| P_SL_50 | Percent of segment length with posted speed 50 mph to total length in each CT | 0.000 | 0.023 | 0.527 | 0.064 |
Figure 3Comparison results of RMSE and MAE.
Coefficients estimation of Spatial AR quantile model.
| Variables | Total Crash Rate | Fatal-Plus-Injury Crash Rate | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| q = 0.1 | q = 0.3 | q = 0.5 | q = 0.7 | q = 0.9 | q = 0.1 | q = 0.3 | q = 0.5 | q = 0.7 | q = 0.9 | |
| PD | 0.554 ** | 1.131 *** (0.002) | 1.571 *** (0.0026) | 2.240 *** (0.004) | 3.390 ** (0.004) | 0.076 * (0.0003) | 0.174 ** (0.0004) | 0.312 *** (0.0006) | 0.431 *** (0.0009) | 0.680 * (0.003) |
| P_ELD | −0.001 (0.002) | −0.003 * (0.002) | −0.003 (0.0034) | −0.003 (0.006) | −0.006 (0.007) | 0.000 (0.0003) | 0.000 (0.0006) | −0.000 (0.008) | 0.001 (0.001) | 0.002 (0.002) |
| CWC | −0.007 ** (0.001) | −0.013 *** (0.0017) | −0.014 *** (0.0023) | −0.014 * (0.0034) | −0.004 (0.004) | −0.001 *** (0.0003) | −0.002 *** (0.0004) | −0.003 *** (0.0005) | −0.002 (0.0009) | −0.004 (0.002) |
| CWF | 0.002 * (0.002) | 0.003 (0.0018) | 0.004 ** (0.0033) | 0.007 ** (0.0044) | −0.001 (0.0048) | −0.000 (0.0004) | 0.000 (0.0004) | 0.000 (0.0008) | −0.000 (0.0012) | −0.004 (0.002) |
| P_RES_L | −0.003 ** (0.086) | 0.001 (0.108) | 0.000 (0.161) | −0.003 (0.272) | −0.03 ** (0.303) | 0.001 (0.016) | 0.000 (0.028) | 0.000 (0.0416) | −0.001 (0.069) | −0.004 ** (0.191) |
| P_GAR | 0.170 ** (0.608) | 0.314 ** (0.847) | 0.300 (0.992) | 0.712 ** (1.723) | 0.942 (1.615) | 0.040 ** (0.119) | 0.065 ** (0.249) | 0.100 ** (0.191) | 0.141 * (0.318) | 0.231 (1.095) |
| DVMT | −1.145 ** (0.058) | −1.712 * (0.065) | −3.252 *** (0.073) | −2.482 (0.086) | −2.756 (0.086) | −0.150 (0.011) | −0.413 * (0.014) | −0.458 ** (0.014) | −0.264 (0.019) | −0.789 (0.056) |
| P_SL_25 | −0.633 *** (0.105) | −0.834 *** (0.116) | −1.034 *** (0.214) | −1.323 *** (0.287) | −1.530 *** (0.271) | −0.102 *** (0.027) | −0.156 *** (0.035) | −0.199 *** (0.032) | −0.242 *** (0.056) | −0.274 *** (0.183) |
| P_S_45 | 0.321 ** (0.561) | 0.316 ** (0.474) | 0.178 (0.649) | 0.163 (0.779) | 0.532 (0.946) | 0.087 ** (0.098) | 0.081 ** (0.125) | 0.069 ** (0.113) | 0.008 (0.226) | 0.166 (0.754) |
Note: Min, Average, Max, S.D. refer to minimum, average, maximum, and standard deviation values, respectively. *, ** and *** mean that the estimated coefficients are statistically significant at the 90%, 95%, and 99% confidential interval, respectively.
Figure 4Coefficient estimates in SARQ model with total crash rate as the dependent variable.
Figure 5Coefficient estimates in SARQ model with fatal-plus-injury crash rate as the dependent variable.