| Literature DB >> 32197384 |
Iván Silva1,2, José Eugenio Naranjo1.
Abstract
Identifying driving styles using classification models with in-vehicle data can provide automated feedback to drivers on their driving behavior, particularly if they are driving safely. Although several classification models have been developed for this purpose, there is no consensus on which classifier performs better at identifying driving styles. Therefore, more research is needed to evaluate classification models by comparing performance metrics. In this paper, a data-driven machine-learning methodology for classifying driving styles is introduced. This methodology is grounded in well-established machine-learning (ML) methods and literature related to driving-styles research. The methodology is illustrated through a study involving data collected from 50 drivers from two different cities in a naturalistic setting. Five features were extracted from the raw data. Fifteen experts were involved in the data labeling to derive the ground truth of the dataset. The dataset fed five different models (Support Vector Machines (SVM), Artificial Neural Networks (ANN), fuzzy logic, k-Nearest Neighbor (kNN), and Random Forests (RF)). These models were evaluated in terms of a set of performance metrics and statistical tests. The experimental results from performance metrics showed that SVM outperformed the other four models, achieving an average accuracy of 0.96, F1-Score of 0.9595, Area Under the Curve (AUC) of 0.9730, and Kappa of 0.9375. In addition, Wilcoxon tests indicated that ANN predicts differently to the other four models. These promising results demonstrate that the proposed methodology may support researchers in making informed decisions about which ML model performs better for driving-styles classification.Entities:
Keywords: driving safety; driving styles; driving styles classification; driving styles methodology; intelligent vehicle control; machine learning
Mesh:
Year: 2020 PMID: 32197384 PMCID: PMC7146739 DOI: 10.3390/s20061692
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Proposed six-step methodology for driving-style classification.
Thresholds defined for sudden acceleration and sudden brake events depending on the traffic-flow level.
| Event Types | Traffic-Flow Levels | ||
|---|---|---|---|
| Low | Medium | High | |
| EventAcc | a >= 2 m/s2 | a >= 1.5 m/s2 | a >= 1 m/s2 |
| EventBrake | a <= 2 m/s2 | a <= 1.5 m/s2 | a <= 1 m/s2 |
Figure 2Input variables description with their linguistic labels and membership functions.
Rules defined from inputs variables.
| Rules | Antecedents | Consequent | Weight | |
|---|---|---|---|---|
| 1 | EventAcc | High | aggressive | 1 |
| 2 | EventBrake | High | aggressive | 1 |
| 3 | AvgAcc | Low | calm | 0.8 |
| 4 | EventAcc | High | aggressive | 1 |
| 5 | AvgDec | Low | calm | 0.8 |
| 6 | AvgAcc | Normal | normal | 0.9 |
| 7 | Traffic violations | High | aggressive | 1 |
| 8 | Traffic violations | Low | calm | 0.8 |
| 9 | Traffic violations | Normal | normal | 0.9 |
| 10 | EventAcc | Normal | normal | 0.9 |
Grid search values for Artificial Neural Networks (ANN).
| Equations | Number of Neurons | Accuracy | Reference |
|---|---|---|---|
| 3 | 0.84 | [ | |
| 4 | 0.86 | [ | |
|
| 10 | 0.86 | [ |
|
| 12 | 0.84 | [ |
Grid search values for Support Vector Machines (SVM).
| Parameters | Values |
|---|---|
| Kernel function | linear, polynomial, radial basis function (RBF), sigmoid |
| C | [2−5, 210] |
| γ | [2−5, 210] |
Evaluation metrics (accuracy, F1-score, Area Under the Curve (AUC), Kappa) for each model. ANN = Artificial Neural Networks; SVM = Support Vector Machines; RF = Random Forests; kNN = k-Nearest Neighbor.
| Model | Accuracy | F1-Score | AUC | Kappa |
|---|---|---|---|---|
| Fuzzy Logic | 0.8800 | 0.8840 | 0.9072 | 0.8106 |
| ANN (sigmoid; lr = 0.4; N = 4) | 0.8600 | 0.8663 | 0.9030 | 0.7807 |
| SVM (RBF, C = 25; γ = 2−2 ) | 0.9600 | 0.9595 | 0.9730 | 0.9375 |
| RF (n = 100; m = 4) | 0.9200 | 0.9253 | 0.9451 | 0.8750 |
| kNN (k = 3) | 0.9200 | 0.9253 | 0.9451 | 0.8750 |
F1-score and AUC per driving style.
| Fuzzy Logic | ANN | SVM | KNN | RF | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Class | F1 | AUC | F1 | AUC | F1 | AUC | F1 | AUC | F1 | AUC |
| Calm | 0.9 | 0.9375 | 0.9091 | 0.9167 | 0.9524 | 0.9875 | 0.9524 | 0.9875 | 0.9524 | 0.9875 |
| Normal | 0.8696 | 0.8831 | 0.8511 | 0.8600 | 0.9545 | 0.9594 | 0.9091 | 0.9189 | 0.9091 | 0.9189 |
| Aggressive | 0.8823 | 0.9011 | 0.8387 | 0.8387 | 0.9714 | 0.9722 | 0.9143 | 0.9289 | 0.9143 | 0.9289 |
Wilcoxon pair tests with corresponding p-value.
| Classifier 1 | Classifier 2 | |
|---|---|---|
| SVM | ANN | 0.025 |
| SVM | Fuzzy | 1 |
| SVM | kNN | 1 |
| SVM | RF | 1 |
| ANN | Fuzzy | 0.025 |
| ANN | kNN | 0.025 |
| ANN | RF | 0.025 |
| Fuzzy | kNN | 1 |
| Fuzzy | RF | 1 |
| kNN | RF | 1 |