Classification of motor vehicle crash injury severity: A hybrid approach for imbalanced data.

Heejin Jeong, Youngchan Jang, Patrick J Bowman, Neda Masoud.

Abstract

This study aims to classify injury severity in motor-vehicle crashes with both high accuracy and high sensitivity. The dataset contains 297,113 vehicle crashes from 2016 to 2017, obtained from the Michigan Traffic Crash Facts (MTCF) dataset. As in other crash datasets, the accident severity classes are not equally represented in MTCF. To account for the imbalanced classes, several techniques are used, including under-sampling and over-sampling. Using five classification models (logistic regression, decision tree, neural network, gradient boosting model, and naïve Bayes classifier), we classify the levels of injury severity and attempt to improve classification performance with two training-testing methods: bootstrap aggregation (bagging) and majority voting. Because the dataset is imbalanced, we evaluate classification performance with the geometric mean (G-mean). We show that classification performance is highest when bagging is used with decision trees and over-sampling is applied to the imbalanced data. The effect of the treatments for imbalanced data is largest when under-sampling is combined with bagging. In addition to the original five injury-severity classes in the MTCF dataset, we consider two additional classification problems, one with two classes and one with three, to (1) investigate the impact of the number of classes on the performance of classification models and (2) enable comparison of our results with the literature.
Copyright © 2018 Elsevier Ltd. All rights reserved.
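
The abstract names G-mean and over-sampling but gives no formulas, so the following is only a minimal sketch under stated assumptions. It takes G-mean to be the geometric mean of per-class recall, a common multi-class definition that may differ from the one used in the paper, and it uses plain random over-sampling, which is one of several possible over-sampling schemes. The function names `g_mean` and `random_oversample` and the toy labels are hypothetical and are not taken from the study.

```python
import math
import random
from collections import Counter, defaultdict


def g_mean(y_true, y_pred):
    """Geometric mean of per-class recall (assumed definition of G-mean)."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    recalls = [hits[c] / totals[c] for c in totals]
    return math.prod(recalls) ** (1 / len(recalls))


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen minority rows until all classes are the same size."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for row, label in zip(rows, labels):
        by_class[label].append(row)
    target = max(len(members) for members in by_class.values())
    out_rows, out_labels = [], []
    for label, members in by_class.items():
        extra = [rng.choice(members) for _ in range(target - len(members))]
        for row in members + extra:
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels


# Toy, made-up labels: 8 crashes without injury and 2 with injury.
y_true = ["none"] * 8 + ["injury"] * 2

# A classifier that always predicts the majority class is 80% accurate,
# yet its recall on "injury" is 0, so its G-mean is 0.0.
always_none = ["none"] * 10

# A classifier with recall 0.75 on "none" and 0.5 on "injury".
y_mixed = ["none"] * 6 + ["injury"] * 2 + ["injury", "none"]

balanced_rows, balanced_labels = random_oversample(list(range(10)), y_true)
```

On the toy labels, the always-majority predictor scores a G-mean of zero despite its high accuracy, which illustrates why a recall-based metric is a reasonable choice for imbalanced severity classes. After `random_oversample`, both classes contain eight rows.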

Keywords:  Automated vehicle safety; Data analytics; Imbalanced data; Injury severity classification; Machine learning; Vehicle crashes

Year:  2018        PMID: 30173007     DOI: 10.1016/j.aap.2018.08.025

Source DB:  PubMed          Journal:  Accid Anal Prev        ISSN: 0001-4575


  3 in total

1.  Exploring the mechanism of crashes with automated vehicles using statistical modeling approaches.

Authors:  Song Wang; Zhixia Li
Journal:  PLoS One       Date:  2019-03-28       Impact factor: 3.240

2.  Comparison of Prediction Models for Mortality Related to Injuries from Road Traffic Accidents after Correcting for Undersampling.

Authors:  Yookyung Boo; Youngjin Choi
Journal:  Int J Environ Res Public Health       Date:  2021-05-24       Impact factor: 3.390

3.  Hybrid feature selection-based machine learning Classification system for the prediction of injury severity in single and multiple-vehicle accidents.

Authors:  Shuguang Zhang; Afaq Khattak; Caroline Mongina Matara; Arshad Hussain; Asim Farooq
Journal:  PLoS One       Date:  2022-02-02       Impact factor: 3.240

