| Literature DB >> 27801822 |
Alberto Fernández1, Rubén Usamentiaga2, Juan Luis Carús3, Rubén Casado4.
Abstract
Driver distraction, defined as the diversion of attention away from activities critical for safe driving toward a competing activity, is increasingly recognized as a significant source of injuries and fatalities on the roadway. Additionally, the trend towards increasing the use of in-vehicle information systems is critical because they induce visual, biomechanical and cognitive distraction and may affect driving performance in qualitatively different ways. Non-intrusive methods are strongly preferred for monitoring distraction, and vision-based systems have appeared to be attractive for both drivers and researchers. Biomechanical, visual and cognitive distractions are the most commonly detected types in video-based algorithms. Many distraction detection systems only use a single visual cue and therefore, they may be easily disturbed when occlusion or illumination changes appear. Moreover, the combination of these visual cues is a key and challenging aspect in the development of robust distraction detection systems. These visual cues can be extracted mainly by using face monitoring systems but they should be completed with more visual cues (e.g., hands or body information) or even, distraction detection from specific actions (e.g., phone usage). Additionally, these algorithms should be included in an embedded device or system inside a car. This is not a trivial task and several requirements must be taken into account: reliability, real-time performance, low cost, small size, low power consumption, flexibility and short time-to-market. The key points for the development and implementation of sensors to carry out the detection of distraction will also be reviewed. This paper shows a review of the role of computer vision technology applied to the development of monitoring systems to detect distraction. Some key points considered as both future work and challenges ahead yet to be solved will also be addressed.Entities:
Keywords: driver distraction detection; image processing; visual-based sensors
Mesh:
Year: 2016 PMID: 27801822 PMCID: PMC5134464 DOI: 10.3390/s16111805
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Scope of the present work.
Figure 2Common steps in most distraction monitoring systems.
Figure 3Head pose can be decomposed in pitch, yaw and roll angles.
Mean Absolute Error (MAE) (in degrees) of face tracking algorithms comparison working in an automobile environment.
| Algorithm | Roll( | Yaw( | Pitch( |
|---|---|---|---|
| La Cascia et al. [ | 9.8 | 4.7 | 2.4 |
| Oyini et al. [ | 5.3 | 3.9 | 5.2 |
| Oyini et al. [ | 4.8 | 3.8 | 3.9 |
| Oyini et al. [ | 5.3 | 5.1 | 6.3 |
| Vicente et al. [ | 3.2 | 4.3 | 6.2 |
| Pelaez et al. [ | 2.7 | 3.8 | 2.5 |
| Murphy et al. [ | 2.4 | 4.7 | 3.4 |
| Tawari et al. [ | 3.0 | 8.2 | 7.6 |
| Tawari et al. [ | 3.8 | 7.0 | 8.6 |
| Tawari et al. [ | 3.5 | 5.9 | 9.0 |
| Tawari et al. [ | 3.4 | 6.9 | 9.3 |
| Tawari et al. [ | 3.6 | 5.7 | 8.8 |
| Tawari et al. [ | 2.7 | 5.5 | 8.5 |
Classification accuracy evaluated on the Southeast University (SEU) driving posture dataset [88].
| Algorithm | Features | Classifier | Average Accuracy (%) |
|---|---|---|---|
| Zhao et al. [ | Homomorphic filtering, skin-like regions segmentation and Contourlet Transform (CT) | RF | 90.63 |
| Zhao et al. [ | Geronimo-Hardin-Massopust (GHM) multiwavelet transform | Multiwavelet Transform | 89.23 |
| Zhao et al. [ | Histogram-based feature description by Pyramid Histogram of Oriented Gradients (PHOG) and spatial scale-based feature description | Perceptron classifiers | 94.20 |
| Zhao et al. [ | Homomorphic filter, skin-like regions segmentation, canny edge detection, connected regions detection, small connected regions deletion and spatial scale ratio calculation | Bayes classifier | 95.11 |
| Bosch et al. approach [ | PHOG | SVM | 91.56 |
| Lowe et al. approach [ | SIFT | SVM | 96.12 |
| Yan et al. [ | CNN | 99.78 |
Computer vision algorithms to detect cell phone usage. High recognition rates are usually obtained using very different approaches.
| Algorithm | Features | Classifier | Recognition Rate (%) |
|---|---|---|---|
| Zhang et al. [ | Features from the driver’s face, mouth and hand | Hidden Conditional Random Fields (HCRF) | 91.20 |
| Artan et al. [ | Image descriptors extracted from a region of interest around the face | SVM | 86.19 |
| Berri et al. [ | Percentage of the Hand and Moment of Inertia | FV | 91.57 |
| Xu et al. [ | DPM | FV | 95 |
| Seshadri et al. [ | Raw pixels and HOG features | Real AdaBoost, SVM, RF | 93.86 |
Hands recognition in different regions inside the car using CVRR-HANDS 3D dataset [106].
| Algorithm | Features | Classifier | Regions | Recognition Rate (%) |
|---|---|---|---|---|
| Ohn et al. [ | RGB data | SVM | 5 | 52.1 |
| Ohn et al. [ | RGB combined with depth data | SVM | 5 | 69.4 |
| Martin et al. [ | Hands cues | SVM | 3 | 83 |
| Martin et al. [ | Hands and head cues | SVM | 3 | 91 |
| Ohn et al. [ | Hands cues | SVM | 3 | 90 |
| Ohn et al. [ | Hands and head cues | SVM | 3 | 94 |
Figure 4Visual distraction algorithms categorization.
AUC comparisons by algorithm across tasks.
| Task | Algorithm | |||
|---|---|---|---|---|
| RVSP | EOFR | AttenD | MDD | |
| Arrows | 0.67 | 0.75 | 0.71 | 0.87 |
| Bug | 0.78 | 0.87 | 0.80 | 0.86 |
Figure 5Classification of main types and subtypes of cognitive load while driving.
Supervised algorithms for cognitive distraction detection.
| Algorithm | Features | Classifier | Accuracy (%) |
|---|---|---|---|
| Zhang et al. [ | Eye gaze-related features and driving performance | Decistion Tree | 81 |
| Zhang et al. [ | Eye gaze-related features | Decistion Tree | 80 |
| Zhang et al. [ | Pupil-diameter features | Decistion Tree | 61 |
| Zhang et al. [ | Driving performance | Decistion Tree | 60 |
| Liang, Reyes, et al. [ | Eye gaze-related features and driving performance | SVM | 83.15 |
| Liang, Reyes, et al. [ | Eye gaze-related features | SVM | 81.38 |
| Liang, Reyes, et al. [ | driving performance | SVM | 54.37 |
| Liang, Lee, et al. [ | Eye gaze-related features and driving performance data | DBNs | 80.1 |
| Miyaji et al. [ | Heart rate, Eye gaze-related features and pupil diameter | AdaBoost | 91.5 |
| Miyaji et al. [ | Eye gaze-related features | SVM | 77.1 (arithmetic task) |
| Miyaji et al. [ | Eye gaze-related features | SVM | 84.2 (conversation task) |
| Miyaji et al. [ | Eye gaze-related features | AdaBoost | 81.6 (arithmetic task) |
| Miyaji et al. [ | Eye gaze-related features | AdaBoost | 86.1 (conversation task) |
| Yang et al. [ | Eye gaze-related features and driving performance data | ELM | 87.0 |
| Yang et al. [ | Eye gaze-related features and driving performance data | SVM | 82.9 |
Mixing types of distraction detection algorithms.
| Algorithm | Features | Classifier | Average Accuracy (%) |
|---|---|---|---|
| Li et al. [ | AU and head pose | LDC (visual distraction) and SVM (cognitive distraction) | 80.8 (LDC), 73.8 (SVM) |
| Craye et al. [ | eye behaviour, arm position, head orientation and facial expressions using both color and depth images | Adaboot and HMM | 89.84 (Adaboot), 89.64 (HMM) |
| Liu et al. [ | Head and eye movements | SVM, ELM and CR-ELM | 85.65 (SVM), 85.98 (ELM), 86.95 (CR-ELM) |
| Ragab et al. [ | arm position, eye closure, eye gaze, facial expressions and head orientation using depth images | Adaboost, HMM, RF, SVM, CRF, NN | 82.9 (RF—type of distraction detection), 90 (RF—distraction detection) |
Summary of visual-based approaches to detect different types of driver distraction.
| Approach | Distraction Detection Approaches | Real Conditions | Operation | |||
|---|---|---|---|---|---|---|
| Manual | Visual | Cognitive | Daytime | Nighttime | ||
| Zhao et al. [ | ||||||
| Zhao et al. [ | ||||||
| Zhao et al. [ | ||||||
| Zhao et al. [ | ||||||
| Bosch et al. [ | ||||||
| Lowe et al. [ | ||||||
| Yan et al. [ | ||||||
| Yan et al. [ | ||||||
| Zhang et al. [ | ||||||
| Artan et al. [ | ||||||
| Berri et al. [ | ||||||
| Xu et al. [ | ||||||
| Seshadri et al. [ | ||||||
| Ohn et al. [ | ||||||
| Martin et al. [ | ||||||
| Ohn et al. [ | ||||||
| Morimoto et al. [ | ||||||
| Ji et al. [ | ||||||
| Ji et al. [ | ||||||
| Ji et al. [ | ||||||
| Gu et al. [ | ||||||
| Batista el al. [ | ||||||
| Bergasa et al. [ | ||||||
| Lee et al. [ | ||||||
| Vicente et al. [ | ||||||
| Cyganek et al. [ | ||||||
| Donmez et al. [ | ||||||
| Klauer et al. [ | ||||||
| Kircher et al. [ | ||||||
| Kircher et al. [ | ||||||
| Kircher et al. [ | ||||||
| Victor et al. [ | ||||||
| Zhang et al. [ | ||||||
| Liang et al. [ | ||||||
| Liang et al. [ | ||||||
| Liang et al. [ | ||||||
| Liang et al. [ | ||||||
| Liang et al. [ | ||||||
| Liang et al. [ | ||||||
| Miyaji et al. [ | ||||||
| Miyaji et al. [ | ||||||
| Yang et al. [ | ||||||
| Li et al. [ | ||||||
| Craye et al. [ | ||||||
| Liu et al. [ | ||||||
| Ragab et al. [ | ||||||