| Literature DB >> 33086567 |
Abstract
The accurate prediction of road traffic crash (RTC) severity contributes to generating crucial information, which can be used to adopt appropriate measures to reduce the aftermath of crashes. This study aims to develop a hybrid system using principal component analysis (PCA) with multilayer perceptron neural networks (MLP-NN) and support vector machines (SVM) in predicting RTC severity. PCA shows that the first nine components have an eigenvalue greater than one. The cumulative variance percentage explained by these principal components was found to be 67%. The prediction accuracies of the models developed using the original attributes were compared with those of the models developed using principal components. It was found that the testing accuracies of MLP-NN and SVM increased from 64.50% and 62.70% to 82.70% and 80.70%, respectively, after using principal components. The proposed models would be beneficial to trauma centers in predicting crash severity with high accuracy so that they would be able to prepare for appropriate and prompt medical treatment.Entities:
Keywords: emergency management; neural networks (NN); principal component analysis (PCA); support vector machine (SVM); traffic crash severity; vehicle crashes
Mesh:
Year: 2020 PMID: 33086567 PMCID: PMC7589286 DOI: 10.3390/ijerph17207598
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Crash, driver, vehicle, and environmental characteristics.
| Category | Variable | Description |
|---|---|---|
| Crash Characteristics | Crash type | 1. Collision with a vehicle; 2. Struck pedestrian; 3. Struck animal; 4. Collision with a fixed object; 5. Collision with some other object; 6. The vehicle overturned; 7. Fall from or in moving vehicle; 8. Other crash. |
| Day of the week | 1. Monday; 2. Tuesday; 3. Wednesday; 4. Thursday; 5. Friday; 6. Saturday; 7. Sunday | |
| Number of vehicles involved | Integer value (with a maximum of two vehicles) | |
| Number of persons involved | Integer value | |
| Roadway Characteristics | Roadway median separation | 0. Undivided, 1. Divided |
| Roadway geometry | 1. Cross intersection; 2. T intersection; 3. Y intersection; 4. Multiple intersections; 5. Not at an intersection; 6. Dead end; 7. Road closure | |
| Roadway speed | Integer value | |
| Roadway surface condition | 1. Dry; 2. Wet; 3. Muddy; 4. Snowy; 5. Icy | |
| Roadway surface type | 1. Paved; 2. Unpaved; 3. Gravel | |
| Traffic control | 0. No control; 1. Stop-go lights; 2. Pedestrian lights; 3. Pedestrian crossing; 4. Roundabout; 5. Stop sign; 6. Give Way sign; 7. other | |
| Environmental Characteristics | Weather condition | 1. Clear; 2. Raining; 3. Snowing; 4. Fog; 5. Smoke; 6. Dust; 7. Strong wind |
| Light condition | 1. Day; 2. Dusk/Dawn; 3. Dark: streetlight on; 4. Dark: streetlight off; 5. Dark/no streetlights; 6. Dark/street lights unknown | |
| Driver Characteristics * | Driver’s gender | 0. Female; 1. Male |
| Driver’s age | Integer value | |
| Vehicle Characteristics * | Vehicle’s age | Integer value |
| Vehicle type | 1. Car; 2. A station wagon; 3. Utility vehicle; 4. Panel van; 5. Bus; 6. Motorcycle; 7. Moped, 8. Bicycle; 9. Quad bike |
* For all drivers and vehicles involved in the crash.
Figure 1Simplified structure of MLP-NN.
Figure 2Data separation by hyperplanes.
Figure 3Eigenvalues for all components considered (scree plot).
Figure 4Cumulative variance plot.
A highly correlated original feature for each principal component.
| Principle Component No. | The Highly Correlated Original Feature |
|---|---|
| 1 | Crash Type [ |
| 2 | Road Surface Condition |
| 3 | Traffic Control Type |
| 4 | Drivers’ Gender [ |
| 5 | Vehicle Type [ |
| 6 | Road Surface Type |
| 7 | Roadway Speed [ |
| 8 | Road Geometry [ |
| 9 | Driver’s Age [ |
Figure 5Confusion matrices for the MLP-NN model using original crash attributes (training and testing data).
Figure 6Confusion matrices for the SVM model using original crash attributes (training and testing data).
Figure 7Confusion matrices for the MLP-NN model using principal components (training and testing data).
Figure 8Confusion matrices for the SVM model using principal components (training and testing data).
Performance measures of the developed models (serious/fatal injury crashes).
| Model | Training Accuracy | Testing Accuracy | Sensitivity | Precision | F1 Score |
|---|---|---|---|---|---|
| MLP-NN with original attributes | 65.5% | 64.5% | 41.1% | 56.6% | 47.6% |
| SVM with original attributes | 65.9% | 62.7% | 34.7% | 55.0% | 42.6% |
| MLP-NN with principal components | 82.7% | 82.7% | 65.1% | 87.10% | 74.5% |
| SVM with principal components | 81.1% | 80.7% | 58.4% | 89.1% | 70.6% |
Performance measures of the developed models (slight injury crashes).
| Model | Training Accuracy | Testing Accuracy | Sensitivity | Precision | F1 Score |
|---|---|---|---|---|---|
| MLP-NN with original attributes | 65.5% | 64.5% | 79.6% | 67.7% | 73.1% |
| SVM with original attributes | 65.9% | 62.7% | 81.8% | 65.2% | 72.6% |
| MLP-NN with principal components | 82.7% | 82.7% | 93.9% | 81.0% | 87.0% |
| SVM with principal components | 81.1% | 80.7% | 95.3% | 77.8% | 85.6% |
Figure 9Testing classification accuracies for the developed models.
Figure 10F1 scores for the developed models (serious/fatal injury and slight injury).
Findings of previous studies.
| Study | Models | Prediction Accuracy |
|---|---|---|
| Abdelwahab and Abdel-Aty [ | NN | 60.4% |
| Alkheder et al. [ | k-means clustering based NN | 74.6% |
| Zeng and Haung [ | NN trained by the convex combination algorithm | 54.8% |
| Iranitalab and Khattak [ | SVM | 61.5% |
| Zhang et al. [ | SVM | 53.9% |
| Li et al. [ | SVM | 48.8% |
| Assi et al. [ | Fuzzy c-means clustering-based SVM | 74% |
| Fuzzy c-means clustering-based NN | 71% |