| Literature DB >> 35627404 |
Nengchao Lyu1,2, Jiaqiang Wen1,2, Wei Hao3.
Abstract
Real-time regional risk prediction can play a crucial role in preventing traffic accidents. Thus, this study established a lane-level real-time regional risk prediction model. Based on observed data, the least squares-support vector machines (LS-SVM) algorithm was used to identify each lane region of the mainline, and the initial traffic parameters and surrogate safety measures (SSMs) were extracted and aggregated. The negative samples that characterized normal traffic and the positive samples that characterized regional risk were identified. Mutual information (MI) was used to determine the information gain of various feature variables in the samples, and the key feature variables affecting the regional conditions were tested and screened by means of binary logit regression analysis. Upon screening the variables and corresponding labels, the construction and verification of a lane-level regional risk prediction model was completed using the catastrophe theory. The results showed that lane difference is an important parameter to reduce the uncertainty of regional risk, and its odds ratio (OR) was 16.30 at the 95% confidence level. The 10%-quantile modified time to collision (MTTC) inverse, the speed difference between lanes, and 10%-quantile headway (DHW) had an obvious influence on regional status. The model achieved an overall accuracy of 86.50%, predicting 84.78% of regional risks with a false positive rate of 13.37% and 86.63% of normal traffic with a false positive rate of 15.22%. The proposed model can provide a basis for formulating individualized active traffic control strategies for different lanes.Entities:
Keywords: catastrophe theory; feature analysis; regional risk prediction; roadside observation data; surrogate safety measure
Mesh:
Year: 2022 PMID: 35627404 PMCID: PMC9141005 DOI: 10.3390/ijerph19105867
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Figure 1The overall framework of this study.
Figure 2The installation point and observation area of the microwave detector.
Figure 3Roadside data collection system.
Figure 4The processing flow of point traces.
Figure 5Lane identification in research area.
Figure 6Outer lane—speed distribution of conflict events and non-conflict events.
Figure 7Inner lane–speed distribution of conflict events and non-conflict events.
Figure 8Speed distribution of conflict events on the outer and inner lanes.
Feature variables and their meanings.
| Category | Variable Name * |
|---|---|
| Regional Macro-Traffic Parameters | Flow mean/standard deviation/range |
| Speed mean/standard deviation/range | |
| Occupancy mean/standard deviation/range | |
| Acceleration mean/standard deviation/range | |
| Lane number | |
| Regional Micro-SSMs | THW mean/standard deviation/10% quantile/5% quantile |
| DHW mean/standard deviation/10% quantile/5% quantile | |
| SSMs mean/standard deviation/10% quantile/5% quantile | |
| Inter-Regional Parameter Differences | THW/DHW difference between lanes |
| SSMs difference between lanes | |
| Flow/speed/occupancy difference between lanes |
Variable name *: All variables were statistically obtained based on the average value of a parameter within the 30 s time window of the area.
Figure 9Information gain of feature variables (only the top 10 were counted).
Impact Analysis of feature variables on regional status.
| Term | B | S.E. | Wald | Sig. | Exp(B) | 95% CI for Exp(B) | |
|---|---|---|---|---|---|---|---|
| Lower | Upper | ||||||
| Lane Number | 2.791 | 0.399 | 48.815 | 0.000 | 16.295 | 7.448 | 35.650 |
| 1/MTTC 10% Quantile | 0.019 | 0.010 | 4.079 | 0.043 | 1.019 | 1.001 | 1.039 |
| Speed Difference | −0.026 | 0.013 | 4.008 | 0.045 | 0.974 | 0.949 | 0.999 |
| DHW 10% Quantile | −0.013 | 0.004 | 11.741 | 0.001 | 0.987 | 0.980 | 0.995 |
| THW 5% Quantile | 0.059 | 0.066 | 0.794 | 0.373 | 1.061 | 0.932 | 1.207 |
| Intercept | −2.826 | 0.459 | 37.856 | 0.000 | 0.059 | - | - |
Figure 10ORs of feature variables.
Results of regional risk prediction model.
| Prediction Results | True Labels | |||
|---|---|---|---|---|
| Risk Area (Outer Lane) | Risk Area (Inner Lane) | Normal Area | ||
| Prediction Labels | Risk Area (Outer Lane) | 35 | - | 81 |
| Risk Area (Inner Lane) | - | 4 | 0 | |
| Normal Area | 5 | 2 | 525 | |
Regional risk prediction results of different models.
| Model | Risk Area | Normal Area | Accuracy | ||
|---|---|---|---|---|---|
| TPR | FPR | TPR | FPR | ||
| Proposed Catastrophe Theory | 84.78% | 13.37% | 86.63% | 15.22% | 86.50% |
| NB | 67.39% | 21.12% | 78.88% | 32.61% | 78.07% |
| KNN | 73.91% | 18.15% | 81.85% | 26.09% | 81.29% |
| SVM | 86.96% | 25.41% | 74.59% | 13.04% | 75.46% |
| DT | 76.09% | 16.17% | 83.83% | 23.91% | 83.28% |
Figure 11Prediction accuracy of different models for regional status.
Comparison of risk prediction effects in different studies.
| Author | Sensitivity | Specificity | Accuracy |
|---|---|---|---|
| Peng et al., 2020 [ | 84.21% | 81.62% | - |
| Basso et al., 2018 [ | 75. 03% | 77.53% | - |
| You et al., 2017 [ | 87.52% | 73.22% | 80.29% |
| Sun et al., 2016 [ | 77.90% | 79.30% | 79.20% |
| This Study | 84.78% | 86.63% | 86.50% |