| Literature DB >> 35409685 |
Ming Sun1, Ronggui Zhou1, Chengwu Jiao1, Xiaoduan Sun2.
Abstract
Although crashes involving hazardous materials (HAZMAT) are rare events compared with other types of traffic crashes, they often cause tremendous loss of life and property, as well as severe hazards to the environment and public safety. Using five-year (2013-2017) crash data (N = 1610) from the Highway Safety Information System database, a two-step machine learning-based approach was proposed to investigate and quantify the statistical relationship between three HAZMAT crash severity outcomes (fatal and severe injury, injury, and no injury) and contributing factors, including the driver, road, vehicle, crash, and environmental characteristics. Random forest ranked the importance of risk factors, and then Bayesian networks were developed to provide probabilistic inference. The results show that fatal and severe HAZMAT crashes are closely associated with younger drivers (age less than 25), driver fatigue, violation, distraction, special roadway locations (such as intersections, ramps, and bridges), higher speed limits (over 66 mph), midnight and early morning (12:00-5:59 a.m.), head-on crashes, more than four vehicles, fire/explosion/spill, poor lighting conditions, and adverse weather conditions. The overall prediction accuracy of 85.8% suggests the effectiveness of the proposed method. This study extends machine learning applications in a HAZMAT crash analysis, which would help develop targeted countermeasures and strategies to improve HAZMAT road transportation safety.Entities:
Keywords: Bayesian network; crash severity; hazardous material road transportation; random forest
Mesh:
Substances:
Year: 2022 PMID: 35409685 PMCID: PMC8998538 DOI: 10.3390/ijerph19074002
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Characteristics of HAZMAT road transportation crashes by severity.
| Variables | Number of Crashes | Fatal and Severe | Injury | No Injury Crashes | Percentage of |
|---|---|---|---|---|---|
|
| |||||
|
| |||||
| Less than 25 | 49 | 10.20% | 30.61% | 59.19% | 100% |
| 25–35 | 262 | 8.02% | 21.76% | 70.22% | 100% |
| 36–45 | 415 | 5.78% | 27.23% | 66.99% | 100% |
| 46–60 | 682 | 5.87% | 25.22% | 68.91% | 100% |
| over 60 | 202 | 7.92% | 23.76% | 68.32% | 100% |
|
| |||||
| Male | 1546 | 6.79% | 24.97% | 68.24% | 100% |
| Female | 64 | 1.56% | 29.69% | 68.75% | 100% |
|
| |||||
| Yes | 62 | 1.61% | 43.55% | 54.84% | 100% |
| No | 1548 | 6.78% | 24.42% | 68.80% | 100% |
|
| |||||
| Yes | 10 | 10.00% | 40.00% | 50.00% | 100% |
| No | 1600 | 6.56% | 25.06% | 68.38% | 100% |
|
| |||||
| Yes | 379 | 5.01% | 24.27% | 70.72% | 100% |
| No | 1231 | 7.07% | 25.43% | 67.50% | 100% |
|
| |||||
| Yes | 140 | 6.43% | 35.71% | 57.86% | 100% |
| No | 1470 | 6.60% | 24.15% | 69.25% | 100% |
|
| |||||
| Yes | 77 | 5.19% | 42.86% | 51.95% | 100% |
| No | 1533 | 6.65% | 24.27% | 69.08% | 100% |
|
| |||||
|
| |||||
| Less than or equal to 15,000 | 588 | 10.03% | 28.57% | 61.40% | 100% |
| 15,001–50,000 | 587 | 5.28% | 23.68% | 71.04% | 100% |
| 50,001–100,000 | 211 | 3.79% | 25.12% | 71.09% | 100% |
| Over 100,000 | 224 | 3.57% | 20.09% | 76.34% | 100% |
|
| |||||
| Straight-level | 1148 | 5.92% | 24.13% | 69.95% | 100% |
| Straight-grade | 240 | 7.50% | 25.83% | 66.67% | 100% |
| Curve-level | 94 | 10.64% | 29.79% | 59.57% | 100% |
| Curve-grade | 128 | 7.81% | 29.69% | 62.50% | 100% |
|
| |||||
| Highway Section | 1103 | 6.71% | 23.84% | 69.45% | 100% |
| Intersection | 342 | 6.43% | 30.70% | 62.87% | 100% |
| Ramp | 43 | 4.65% | 16.28% | 79.07% | 100% |
| Bridge | 41 | 9.76% | 31.71% | 58.53% | 100% |
| Other | 81 | 4.94% | 20.99% | 74.07% | 100% |
|
| |||||
| Yes | 884 | 5.66% | 24.10% | 70.24% | 100% |
| No | 726 | 7.71% | 26.45% | 65.84% | 100% |
|
| |||||
| Less than or equal to 2 | 478 | 10.88% | 30.13% | 58.99% | 100% |
| Less than or equal to 4 | 717 | 4.88% | 24.27% | 70.85% | 100% |
| Over 4 | 415 | 4.58% | 20.96% | 74.46% | 100% |
|
| |||||
| Dry | 1246 | 7.06% | 24.24% | 68.70% | 100% |
| Wet | 271 | 4.43% | 32.10% | 63.47% | 100% |
| Ice | 86 | 6.98% | 17.44% | 75.58% | 100% |
| Other | 7 | 0.00% | 14.29% | 85.71% | 100% |
|
| |||||
| Less than 30 | 76 | 0.00% | 11.84% | 88.16% | 100% |
| 30–45 | 438 | 3.88% | 22.83% | 73.29% | 100% |
| 46–55 | 470 | 9.36% | 31.91% | 58.73% | 100% |
| 56–65 | 450 | 6.67% | 24.67% | 68.66% | 100% |
| over 66 | 176 | 8.52% | 19.89% | 71.59% | 100% |
|
| |||||
| US route | 346 | 7.80% | 23.99% | 68.21% | 100% |
| Interstate | 620 | 4.84% | 21.94% | 73.22% | 100% |
| State route | 453 | 7.51% | 28.04% | 64.45% | 100% |
| Non-state route | 191 | 7.85% | 30.89% | 61.26% | 100% |
|
| |||||
| Urban | 784 | 9.18% | 26.66% | 64.16% | 100% |
| Rural | 826 | 4.12% | 23.73% | 72.15% | 100% |
|
| |||||
|
| |||||
| Less than or equal to 10,000 | 58 | 3.45% | 15.52% | 81.03% | 100% |
| 10,001–26,000 | 168 | 4.17% | 26.79% | 69.04% | 100% |
| over 26,000 | 1384 | 7.01% | 25.36% | 67.63% | 100% |
|
| |||||
| Single-unit truck | 389 | 5.14% | 29.31% | 65.55% | 100% |
| Truck/trailer | 199 | 5.53% | 23.12% | 71.35% | 100% |
| Truck/tractor | 24 | 8.33% | 20.83% | 70.84% | 100% |
| Tractor/semi-trailer | 758 | 8.71% | 23.22% | 68.07% | 100% |
| Tractor/doubles | 35 | 2.86% | 25.71% | 71.43% | 100% |
| Other | 205 | 2.93% | 26.83% | 70.24% | 100% |
|
| |||||
| Yes | 596 | 5.54% | 21.64% | 72.82% | 100% |
| No | 1014 | 7.20% | 27.22% | 65.58% | 100% |
|
| |||||
| 1 | 385 | 5.19% | 24.68% | 70.13% | 100% |
| 2 | 1093 | 5.95% | 23.15% | 70.90% | 100% |
| 3 | 98 | 14.29% | 42.86% | 42.85% | 100% |
| Greater than or equal to 4 | 34 | 20.59% | 44.12% | 35.29% | 100% |
|
| |||||
|
| |||||
| Head-on | 31 | 41.94% | 38.71% | 19.35% | 100% |
| Rear-end | 344 | 5.81% | 35.47% | 58.72% | 100% |
| Angle | 224 | 9.38% | 30.80% | 59.82% | 100% |
| Sideswipe | 433 | 3.70% | 16.86% | 79.44% | 100% |
| Single vehicle | 385 | 5.19% | 24.68% | 70.13% | 100% |
| Other | 193 | 8.29% | 17.62% | 74.09% | 100% |
|
| |||||
| 12:00–5:59 a.m.0:00–5:59 | 174 | 8.62% | 31.03% | 60.35% | 100% |
| 6:00–11:59 a.m. | 569 | 7.03% | 24.43% | 68.54% | 100% |
| 12:00–5:59 p.m. | 631 | 5.86% | 25.20% | 68.94% | 100% |
| 6:00–11:59 p.m. | 236 | 5.93% | 22.46% | 71.61% | 100% |
|
| |||||
| Yes | 124 | 11.29% | 37.90% | 50.81% | 100% |
| No | 1486 | 6.19% | 24.09% | 69.72% | 100% |
|
| |||||
|
| |||||
| Daylight | 1159 | 5.87% | 24.50% | 69.63% | 100% |
| Dusk/dawn | 72 | 12.50% | 26.39% | 61.11% | 100% |
| Dark—street lights | 123 | 4.88% | 33.33% | 61.79% | 100% |
| Dark—no street lights | 256 | 8.98% | 23.83% | 67.19% | 100% |
|
| |||||
| Clear | 1125 | 6.67% | 23.91% | 69.42% | 100% |
| Cloudy | 218 | 6.88% | 29.36% | 63.76% | 100% |
| Rain | 164 | 1.83% | 30.49% | 67.68% | 100% |
| Fog | 18 | 16.67% | 38.89% | 44.44% | 100% |
| Snow | 85 | 11.76% | 17.65% | 70.59% | 100% |
Bold and italic texts represent the five major categories. Bold texts represent the subcategories in each major group.
Figure 1Variable correlation matrix.
Figure 2Variable importance ranking using random forests.
Figure 3The Bayesian network structure development.
Figure 4The Bayesian network parameter learning.
Bayesian network probability inference results for HAZMAT road transportation crash severity.
| Variables | Probabilities When Setting Evidence | ||
|---|---|---|---|
| Fatal and Severe Injury Crashes | Injury | No Injury Crashes | |
| Proportion distribution | 0.1239 | 0.2602 | 0.6158 |
|
| |||
|
| |||
| less than 25 | 0.2187 | 0.3348 | 0.4464 |
| 25–35 | 0.1423 | 0.2329 | 0.6248 |
| 36–45 | 0.1169 | 0.2760 | 0.6071 |
| 46–60 | 0.1058 | 0.2596 | 0.6345 |
| over 60 | 0.1527 | 0.2472 | 0.6001 |
|
| |||
| Yes | 0.1620 | 0.2761 | 0.5619 |
| No | 0.1224 | 0.2596 | 0.6180 |
|
| |||
| Yes | 0.2112 | 0.2865 | 0.5022 |
| No | 0.1234 | 0.2601 | 0.6165 |
|
| |||
| Yes | 0.1240 | 0.2622 | 0.6225 |
| No | 0.1236 | 0.2539 | 0.6138 |
|
| |||
| Yes | 0.1572 | 0.3017 | 0.5410 |
| No | 0.1208 | 0.2563 | 0.6230 |
|
| |||
| Yes | 0.1920 | 0.2877 | 0.5203 |
| No | 0.1205 | 0.2589 | 0.6206 |
|
| |||
|
| |||
| Less than or equal to 15,000 | 0.1201 | 0.2684 | 0.6114 |
| 15,001–50,000 | 0.1263 | 0.2445 | 0.6292 |
| 50,001–100,000 | 0.1271 | 0.2660 | 0.6069 |
| over 100,000 | 0.1246 | 0.2745 | 0.6009 |
|
| |||
| Straight-level | 0.1210 | 0.2585 | 0.6204 |
| Straight-grade | 0.1232 | 0.2595 | 0.6174 |
| Curve-level | 0.1245 | 0.2604 | 0.6152 |
| Curve-grade | 0.1318 | 0.2683 | 0.5999 |
|
| |||
| Highway Section | 0.1111 | 0.2492 | 0.6397 |
| Intersection | 0.1503 | 0.2896 | 0.5601 |
| Ramp | 0.1639 | 0.2786 | 0.5575 |
| Bridge | 0.1582 | 0.2548 | 0.5870 |
| Other | 0.1490 | 0.2791 | 0.5719 |
|
| |||
| less than 30 | 0.1399 | 0.1834 | 0.6767 |
| 30–45 | 0.1111 | 0.2273 | 0.6616 |
| 46–55 | 0.1216 | 0.3022 | 0.5761 |
| 56–65 | 0.1247 | 0.2732 | 0.6021 |
| over 66 | 0.1533 | 0.2301 | 0.6166 |
|
| |||
|
| |||
| Single-unit truck | 0.1223 | 0.2732 | 0.6045 |
| Truck/trailer | 0.1179 | 0.2494 | 0.6327 |
| Truck/tractor | 0.1317 | 0.2605 | 0.6078 |
| Tractor/semi-trailer | 0.1274 | 0.2529 | 0.6197 |
| Tractor/doubles | 0.1215 | 0.2636 | 0.6149 |
| Other | 0.1195 | 0.2726 | 0.6079 |
|
| |||
| Yes | 0.1266 | 0.2557 | 0.6177 |
| No | 0.1223 | 0.2629 | 0.6147 |
|
| |||
| 1 | 0.1149 | 0.2435 | 0.6416 |
| 2 | 0.1116 | 0.2512 | 0.6372 |
| 3 | 0.2357 | 0.3883 | 0.3760 |
| Greater than or equal to 4 | 0.2995 | 0.3720 | 0.3285 |
|
| |||
|
| |||
| Head-on | 0.3997 | 0.3028 | 0.2975 |
| Rear-end | 0.1157 | 0.3431 | 0.5412 |
| Angle | 0.1948 | 0.3311 | 0.4741 |
| Sideswipe | 0.0772 | 0.1954 | 0.7274 |
| Single vehicle | 0.0881 | 0.2325 | 0.6794 |
| Other | 0.1375 | 0.2223 | 0.6402 |
|
| |||
| 12:00–5:59 a.m. | 0.1736 | 0.2992 | 0.5272 |
| 6:00–11:59 a.m. | 0.1133 | 0.2510 | 0.6357 |
| 12:00–5:59 p.m. | 0.1061 | 0.2470 | 0.6469 |
| 6:00–11:59 p.m. | 0.1605 | 0.2892 | 0.5503 |
|
| |||
| Yes | 0.2396 | 0.3347 | 0.4258 |
| No | 0.1143 | 0.2540 | 0.6317 |
|
| |||
|
| |||
| Daylight | 0.1016 | 0.2447 | 0.6537 |
| Dusk/dawn | 0.2239 | 0.2937 | 0.4823 |
| Dark—street lights | 0.1650 | 0.3425 | 0.4926 |
| Dark—no street lights | 0.1770 | 0.2818 | 0.5412 |
|
| |||
| Clear | 0.1216 | 0.2592 | 0.6192 |
| Cloudy | 0.1259 | 0.2571 | 0.6170 |
| Rain | 0.1292 | 0.2634 | 0.6073 |
| Fog | 0.1294 | 0.2683 | 0.6023 |
| Snow | 0.1364 | 0.2700 | 0.5936 |
Bold and italic texts represent the five major categories. Bold texts represent the subcategories in each major group.
Figure 5Percentage of probability change for HAZMAT crash severity when setting evidence for driver behavior factors.
Figure 6HAZMAT road transportation collision type proportions by crash severity.
Figure 7The number of HAZMAT road transportation crashes by collision type and crash hour.
Confusion matrix for the Bayesian network.
| Confusion Matrix | Predicted | |||||
|---|---|---|---|---|---|---|
| Class 1 | Class 2 | Class 3 | False Negative (FN) | Sensitivity | ||
|
| Class 1 | T11 | F21 | F31 | F21 + F31 |
|
| Class 2 | F12 | T22 | F32 | F12 + F32 |
| |
| Class 3 | F13 | F23 | T33 | F13 + F23 |
| |
| False positive (FP) | F12 + F13 | F21 + F23 | F31 + F32 | Overall Accuracy | ||
| Precision |
|
|
| |||
Performance measurements for the Bayesian network model.
| Performance Measurements | Fatal and Severe | Injury | No Injury Crashes |
|---|---|---|---|
| Accuracy | 96.4% | 85.9% | 85.0% |
| Precision | 67.5% | 78.5% | 88.1% |
| Sensitivity | 82.4% | 69.4% | 89.8% |
| Specificity | 97.1% | 92.4% | 75.4% |
| F-score | 74.2% | 73.7% | 88.9% |
| Overall accuracy = 85.8% | |||
Figure 8ROC curve for the Bayesian network model evaluation.