| Literature DB >> 31795080 |
Donato Impedovo1, Fabrizio Balducci1, Vincenzo Dentamaro1, Giuseppe Pirlo1.
Abstract
Automatic traffic flow classification is useful to reveal road congestions and accidents. Nowadays, roads and highways are equipped with a huge amount of surveillance cameras, which can be used for real-time vehicle identification, and thus providing traffic flow estimation. This research provides a comparative analysis of state-of-the-art object detectors, visual features, and classification models useful to implement traffic state estimations. More specifically, three different object detectors are compared to identify vehicles. Four machine learning techniques are successively employed to explore five visual features for classification aims. These classic machine learning approaches are compared with the deep learning techniques. This research demonstrates that, when methods and resources are properly implemented and tested, results are very encouraging for both methods, but the deep learning method is the most accurately performing one reaching an accuracy of 99.9% for binary traffic state classification and 98.6% for multiclass classification.Entities:
Keywords: benchmark; deep learning; vehicular traffic congestion; vehicular traffic flow classification; vehicular traffic flow detection; video classification
Year: 2019 PMID: 31795080 PMCID: PMC6929094 DOI: 10.3390/s19235213
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1The morphological operator applied to the frame of the vehicular traffic video.
Figure 2GRAM dataset: (left) a pixel mask that highlights the Region of Interest about vehicles present, annotated (and subsequently trackable) in each video frame (right).
Figure 3Three images from Trafficdb that depict a traffic state classified as Light, Medium, and Heavy.
Performance comparison according to the processing time and vehicle detection accuracy of the four object detectors on the three videos in the GRAM dataset.
| M-30 | M-30-HD | Urban1 | ||
|---|---|---|---|---|
|
| Time [s] | 0.08–0.13 | 0.3–0.44 | 0.02–0.06 |
| Accuracy | 43% | 75% | 40% | |
|
| Time [s] | 4–7 | 11–14 | 2.6–5.6 |
| Accuracy | 22% | 70% | 69% | |
|
| Time [s] | 1.0–1.8 | 1.0–1.8 | 1.0–1.8 |
| Accuracy | 82% | 86% | 91% | |
|
| Time [s] | 2.4–3.0 | 2.4–3.0 | 2.4–3.0 |
| Accuracy | 89% | 91% | 46% |
Traffic state classification accuracy on the Trafficdb dataset.
| KNN | SVM (Linear) | SVM (rbf) | Random Forest | |
|---|---|---|---|---|
|
| 0.81 ± 0.10 | 0.78 ± 0.12 | 0.79 ± 0.16 |
|
|
| 0.66 ± 0.21 | 0.64 ± 0.11 | 0.64 ± 0.08 | 0.68 ± 0.21 |
Normalized confusion matrix of the traffic state classification reached by the Random Forest classifier on the Trafficdb video dataset.
| Random Forest | Light (Pred) | Medium (Pred) | Heavy (Pred) |
|---|---|---|---|
|
|
| 0.02 | 0.04 |
|
| 0.42 |
| 0.11 |
|
| 0.20 | 0.11 |
|
Normalized confusion matrix about the binary traffic state classification performed by the deep neural network of Kurniawan et al. [53] on the Trafficdb video dataset.
| Deep Learning Architecture [ | Light (Pred) | Heavy (Pred) |
|---|---|---|
|
|
| 0.004 |
|
| 0 |
|
Normalized confusion matrix about the multiclass traffic state classification performed by the deep neural network of Kurniawan et al. [53] on the Trafficdb video dataset.
| Deep Learning Architecture [ | Light (Pred) | Medium (Pred) | Heavy (Pred) |
|---|---|---|---|
|
|
| 0.003 | 0. |
|
| 0.004 |
| 0.025 |
|
| 0. | 0.040 |
|