| Literature DB >> 30951535 |
Chunjiao Dong1, Kun Xie2, Xubin Sun3, Miaomiao Lyu4, Hao Yue1.
Abstract
Conventional traffic crash analyzing methods focus on identifying the relationship between traffic crash outcomes and impact risk factors and explaining the effects of risk factors, which ignore the changes of roadway systems and can lead to inaccurate results in traffic crash predictions. To address this issue, an innovative two-step method is proposed and a support vector regression (SVR) model is formulated into state-space model (SSM) framework for traffic crash prediction. The SSM was developed in the first step to identify the dynamic evolution process of the roadway systems that are caused by the changes of traffic flow and predict the changes of impact factors in roadway systems. Using the predicted impact factors, the SVR model was incorporated in the second step to perform the traffic crash prediction. A five-year dataset that obtained from 1152 roadway segments in Tennessee was employed to validate the model effectiveness. The proposed models result in an average prediction MAPE of 7.59%, a MAE of 0.11, and a RMSD of 0.32. For the performance comparison, a SVR model and a multivariate negative binomial (MVNB) model were developed to do the same task. The results show that the proposed model has superior performances in terms of prediction accuracy compared to the SVR and MVNB models. Compared to the SVR and MVNB models, the benefit of incorporating a state-space model to identify the changes of roadway systems is significant evident in the proposed models for all crash types, and the prediction accuracy that measured by MAPE can be improved by 4.360% and 6.445% on average, respectively. Apart from accuracy improvement, the proposed models are more robust and the predictions can retain a smoother pattern. Furthermore, the results show that the proposed model has a more precise and synchronized response behavior to the high variations of the observed data, especially for the phenomenon of extra zeros.Entities:
Mesh:
Year: 2019 PMID: 30951535 PMCID: PMC6450638 DOI: 10.1371/journal.pone.0214866
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Flowchart of the proposed approach for traffic crash prediction.
Fig 2Graphical illustration of the proposed SSM-SVR approach for traffic crash prediction.
Summary statistics of analyzed continuous variables.
| Variable | Mean | Std. Dev. | Min. | Max. |
|---|---|---|---|---|
| The number of major injury crashes per year per roadway segment | 0.06 | 0.28 | 0 | 3 |
| The number of minor injury crashes per year per roadway segment | 0.50 | 1.19 | 0 | 12 |
| The number of no injury crashes per year per roadway segment | 1.47 | 2.98 | 0 | 24 |
| Thousand passenger car AADT per lane | 3.25 | 1.36 | 0.38 | 35.24 |
| Thousand truck AADT per lane | 0.25 | 0.13 | 0.03 | 6.09 |
| Segment length (miles) | 0.82 | 1.30 | 0.09 | 12.57 |
| Degree of horizontal curvature | 1.67 | 3.55 | 0.00 | 14.42 |
| Median widths | 1.74 | 2.34 | 0.00 | 11.92 |
| Outside shoulder widths | 3.29 | 2.58 | 1.68 | 10.12 |
| International roughness index | 76.88 | 34.11 | 25.35 | 195.90 |
| Rut depth (in.) | 0.15 | 0.07 | 0.05 | 0.66 |
Summary statistics of analyzed categorical variables.
| Variable | Category | Frequency | Percent |
|---|---|---|---|
| Posted speed limits | <55 mph | 2870 | 49.83 |
| ≥55 mph | 2890 | 50.17 | |
| Number of through lanes | 6 | 1465 | 25.43 |
| 4 | 2730 | 47.40 | |
| 2 | 1565 | 27.17 | |
| Lane widths (ft) | 12 | 1170 | 20.31 |
| 11 | 3340 | 57.99 | |
| 10 | 1250 | 21.70 | |
| Median type | Two-way left turn lanes (TWLTL) | 785 | 13.63 |
| Raised median | 1375 | 23.87 | |
| No medians | 3600 | 62.50 | |
| Shoulder type | Pavement | 1995 | 34.64 |
| Gravel | 2510 | 43.58 | |
| Dirt | 1255 | 21.79 | |
| Terrain type | Mountainous | 2410 | 41.84 |
| Rolling | 3350 | 58.16 | |
| Land use type | Residential | 2770 | 48.09 |
| Commercial | 1695 | 29.43 | |
| Rural | 1295 | 22.48 | |
| Indicator for lighting | Lighting exists on the roadway segments | 3160 | 54.86 |
| No lighting devices | 2600 | 45.14 |
Fig 3Searching process and results based on the 4-fold cross validation: (a) first search; (b) contour plot corresponding to first search; (c) second search; (d) contour plot corresponding to second search.
Comparison between the observation in 2014 and the inputs for the developed models.
| Variable | Observed value in 2014 | Input for SVR and MVNB models | MAPE (%) | Input for the proposed models | MAPE (%) |
|---|---|---|---|---|---|
| Thousand passenger car AADT per lane | 3.67 | 3.30 | 10.08 | 3.83 | 4.33 |
| Thousand truck AADT per lane | 0.28 | 0.27 | 5.36 | 0.29 | 2.86 |
| Segment length (miles) | 0.82 | 0.82 | 0.11 | 0.54 | |
| Degree of horizontal curvature | 1.67 | 1.67 | 0.05 | 1.63 | 2.58 |
| Median widths | 1.66 | 1.87 | 12.79 | 1.70 | 2.36 |
| Outside shoulder widths | 3.00 | 3.14 | 4.89 | 3.05 | 1.78 |
| International roughness index | 69.87 | 73.38 | 5.02 | 68.91 | 1.37 |
| Rut depth (in.) | 0.14 | 0.15 | 4.20 | 0.15 | 3.96 |
| Posted speed limits <55 mph | 534 | 466 | 12.73 | 573 | 7.30 |
| ≥55 mph | 618 | 686 | 11.00 | 579 | 6.31 |
| Number of through lanes = 6 | 325 | 381 | 17.23 | 355 | 9.23 |
| = 4 | 511 | 490 | 4.11 | 470 | 8.02 |
| = 2 | 316 | 281 | 11.08 | 327 | 3.48 |
| Lane widths (ft) = 12 | 210 | 191 | 9.05 | 202 | 3.81 |
| = 11 | 668 | 641 | 4.04 | 677 | 1.35 |
| = 10 | 274 | 320 | 16.79 | 0.36 | |
| Median type = two-way left turn lanes (TWLTL) | 157 | 169 | 7.64 | 165 | 5.10 |
| = raised median | 249 | 206 | 17.27 | 260 | 4.42 |
| = no medians | 746 | 777 | 4.16 | 727 | 2.55 |
| Shoulder type = pavement | 333 | 281 | 15.62 | 326 | 2.10 |
| = gravel | 442 | 432 | 2.26 | 406 | 8.14 |
| = dirt | 377 | 439 | 16.45 | 420 | 11.41 |
| Terrain type = mountainous | 528 | 452 | 14.39 | 495 | 6.25 |
| = rolling | 624 | 700 | 12.18 | 657 | 5.29 |
| Land use type = residential | 630 | 647 | 2.70 | 624 | 0.95 |
| = commercial | 273 | 235 | 13.92 | 300 | 9.89 |
| = rural | 249 | 270 | 8.43 | 228 | 8.43 |
| Indicator for lighting = lighting exists on the roadway segments | 638 | 702 | 10.03 | 606 | 5.02 |
| = no lighting devices | 514 | 450 | 12.45 | 546 | 6.23 |
Results of traffic crash prediction.
| Major injury crashes | Minor injury crashes | No-injury crashes | Total | |
|---|---|---|---|---|
| Observed mean | 0.049 | 0.385 | 0.997 | 1.430 |
| Observed Std. Dev. | 0.302 | 1.102 | 2.095 | 2.350 |
| Observed min | 0 | 0 | 0 | 0 |
| Observed max | 3 | 10 | 19 | 19 |
| Observed counts | 56 | 443 | 1148 | 1647 |
| Predicted mean | 0.043 | 0.417 | 0.919 | 1.379 |
| Predicted Std. Dev. | 0.295 | 1.115 | 2.092 | 2.343 |
| Predicted min | 0 | 0 | 0 | 0 |
| Predicted max | 3 | 10 | 19 | 19 |
| Predicted counts | 50 | 480 | 1059 | 1589 |
| MAPE (%) | 10.714 | 8.352 | 7.753 | 3.522 |
| MAE | 0.005 | 0.060 | 0.173 | 0.215 |
| RMSD | 0.072 | 0.245 | 0.467 | 0.512 |
| Predicted mean | 0.056 | 0.432 | 1.110 | 1.599 |
| Predicted Std. Dev. | 0.314 | 1.134 | 2.222 | 2.472 |
| Predicted min | 0 | 0 | 0 | 0 |
| Predicted max | 3 | 9 | 21 | 22 |
| Predicted counts | 65 | 498 | 1279 | 1842 |
| MAPE (%) | 16.071 | 12.415 | 11.411 | 11.840 |
| MAE | 0.018 | 0.195 | 0.496 | 0.593 |
| RMSD | 0.135 | 0.442 | 0.728 | 0.851 |
| Predicted mean | 0.057 | 0.440 | 1.135 | 1.632 |
| Predicted Std. Dev. | 0.404 | 1.108 | 2.167 | 2.445 |
| Predicted min | 0 | 0 | 0 | 0 |
| Predicted max | 5 | 10 | 21 | 21 |
| Predicted counts | 66 | 507 | 1307 | 1880 |
| MAPE (%) | 17.857 | 14.447 | 13.850 | 14.147 |
| MAE | 0.021 | 0.398 | 0.548 | 0.780 |
| RMSD | 0.177 | 0.700 | 0.771 | 1.123 |
Fig 4Comparison of model performances. (a) Predicted value of proposed SSM-SVR model vs. observed value. (b) Predicted value of SVR model vs. observed value. (c) Predicted value of MVNB model vs. observed value.