| Literature DB >> 31847432 |
Muhammad Qasim Khan1, Sukhan Lee1.
Abstract
Tracking drivers' eyes and gazes is a topic of great interest in the research of advanced driving assistance systems (ADAS). It is especially a matter of serious discussion among the road safety researchers' community, as visual distraction is considered among the major causes of road accidents. In this paper, techniques for eye and gaze tracking are first comprehensively reviewed while discussing their major categories. The advantages and limitations of each category are explained with respect to their requirements and practical uses. In another section of the paper, the applications of eyes and gaze tracking systems in ADAS are discussed. The process of acquisition of driver's eyes and gaze data and the algorithms used to process this data are explained. It is explained how the data related to a driver's eyes and gaze can be used in ADAS to reduce the losses associated with road accidents occurring due to visual distraction of the driver. A discussion on the required features of current and future eye and gaze trackers is also presented.Entities:
Keywords: advanced driving assistance systems (ADAS); eye tracking; gaze tracking; line of sight (LoS); point of regard (PoR); road safety
Mesh:
Year: 2019 PMID: 31847432 PMCID: PMC6960643 DOI: 10.3390/s19245540
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1The process of tracking eye position and gaze coordinates.
Figure 2The appearances of eyes and eye parts change with head and eye movements. (a) Variability in eye appearance when eye position is fixed but head position varies. (b) Variability in gaze direction when head position is fixed but eyeball rotates.
Summary and comparison of eye detection techniques.
| Technique | Information | Illumination | Robustness | Requirements | References | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pupil | Iris | Corner | Eye | Between-the-Eyes | Indoor | Outdoor | Infrared | Scale | Head Pose | Occlusion | High Resolution | High Contrast | Temporal Dependent | Good Initialization | ||
| Shape-based (circular) | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | [ | |||||||||
| Shape-based (elliptical) | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | [ | ||||||||
| Shape-based (elliptical) | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | [ | |||||||||
| Shape-based (complex) | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | [ | ||||||
| Feature-based | ✓ | ✓ | ✓ | [ | ||||||||||||
| Feature-based | ✓ | ✓ | ✓ | [ | ||||||||||||
| Feature-based | ✓ | ✓ | ✓ | ✓ | [ | |||||||||||
| Feature-based | ✓ | ✓ | ✓ | [ | ||||||||||||
| Feature-based | ✓ | ✓ | ✓ | ✓ | ✓ | [ | ||||||||||
| Feature-based | ✓ | ✓ | ✓ | [ | ||||||||||||
| Appearance-based | ✓ | ✓ | ✓ | ✓ | ✓ | [ | ||||||||||
| Symmetry | ✓ | ✓ | ✓ | [ | ||||||||||||
| Eye motion | ✓ | ✓ | ✓ | ✓ | [ | |||||||||||
| Hybrid | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | [ | |||||
Figure 3Structure of human eye.
Comparison of gaze estimation methods.
| No. of Cameras | No. of Lights | Gaze Information | Head Pose Invariant? | Calibration | Accuracy (Degrees) | Comments | References |
|---|---|---|---|---|---|---|---|
| 1 | 0 | PoR | No. Needs extra unit. | 2–4 | webcam | [ | |
| 1 | 0 | LoS/LoG | No. Needs extra unit. | Fully | 1–2 | [ | |
| 1 | 0 | LoG | Approximate solution | < 1 | additional markers, iris radius, parallel with screen | [ | |
| 1 | 1 | PoR | No. Needs extra unit. | 1–2 | Polynomial approximation | [ | |
| 1 | 2 | PoR | Yes | Fully | 1–3 | [ | |
| 1 + 1 PT camera | 1 | PoR | Yes | Fully | 3 | Mirrors | [ |
| 1 + 1 PT camera | 4 | PoR | Yes | <2.5 | PT camera used during implementation | [ | |
| 2 | 0 | PoR | Yes | 1 | 3D face model | [ | |
| 2 + 1 PT camera | 1 | LoG | Yes | 0.7–1 | [ | ||
| 2 + 2 PT cameras | 2 | PoR | Yes | Fully | 0.6 | [ | |
| 2 | 2(3) | PoR | Yes | Fully | <2 | extra lights used during implementation, experimentation conducted with three glints | [ |
| 3 | 2 | PoR | Yes | Fully | not reported | [ | |
| 1 | 1 | PoR | No. Needs extra unit. | 0.5–1.5 | Appearance-based | [ |
Figure 4The stages of visual data in typical advanced driving assistance systems (ADAS) algorithms.
Figure 5Driving process.
Summary of measurement techniques.
| Measurement | Ability to Detect Distraction | Pros | Cons | ||
|---|---|---|---|---|---|
| Visual | Cognitive | Visual and Cognitive | |||
| Driving Performance | Y | N | N |
Ability to indicate the effect of driving distraction |
Requirement of complementary subjective reports to obtain high accuracy results |
| Physical Measurements | Y | Y | N |
Ability to distinguish between distraction types |
Unable to distinguish a combined type of distraction |
| Biological Measurements | Y | Y | Y |
Ability to measure cognitive and visual distraction |
Intrusiveness |
| Subjective Reports | N | Y | N |
Ability to distinguish underlying mechanism of distraction |
Requires input of an expert |
| Hybrid Measurements | Y | Y | Y |
Higher accuracy for discriminating types of distractions Able to complement the blind spots of other methods |
Synchronization of multiple source of data with different sampling rate |
Figure 6Flowchart of a generic eye tracking algorithm.
Summary of various eye tracking algorithms.
| Eye Detection | Tracking Method | Used Features | Algorithm for Distraction/Fatigue Detection | Performance | References |
|---|---|---|---|---|---|
| Imaging in the IR spectrum and verification by SVM | Combination of Kalman filter and mean shift | PERCLOS, Head nodding, Head orientation, Eye blink speed, Gaze direction Eye saccadic movement, Yawning | Probability theory (Bayesian network) | Very good | [ |
| Imaging in the IR Spectrum | Adaptive filters (Kalman filter) | PERCLOS, Eye blink speed, Gaze direction, Head rotation | Probability theory (Bayesian network) | Very Good | [ |
| Imaging in the IR Spectrum | Adaptive filters (Kalman filter) | PERCLOS, Eye blink rate, Eye saccadic movement, Head nodding, Head orientation | Knowledge-based (Fuzzy expert system) | [ | |
| Feature-based (binarization) | Combination of 4 hierarchical tracking method | PERCLOS, Eye blink rate, Gaze direction, Yawning, Head orientation | Knowledge-based (Finite State Machine) | Average | [ |
| Explicitly by Feature-based (projection) | Search window (based on face template matching) | PERCLOS Distance between eyelids, Eye blink rate, Head orientation | Knowledge-based (Fuzzy expert System) | Good | [ |
| Other methods (elliptical model in daylight and IR imaging in nightlight) | Combination of NN and condensation algorithm | PERCLOS, Eye blink rate, Head orientation | Thresholding | Good | [ |
| Feature-based (projection) | Search window (based on face template matching) | PERCLOS, Distance between eyelids | Thresholding | Good | [ |
| Feature-based (projection) | Adaptive filters (UKF) | Continuous eye closure | Thresholding | Average | [ |
| Feature-based (projection and connected component analysis) | Search window (eye template matching) | Eyelid distance | Thresholding | Very good | [ |
| Feature-based (projection) | Adaptive filters (Kalman filter) | Eye blink rate | Poor | [ | |
| Feature-based (variance projection and face model) | Adaptive filters (Kalman filter) | PERCLOS, Eye blink speed, Head rotation | Poor | [ |
A summary of features offered in modern vehicles.
| Make | Technology Brand | Description | Alarm Type | Reference |
|---|---|---|---|---|
| Audi | Rest recommendation system + Audi pre sense | Uses features extracted with the help of far infrared system, camera, radar, thermal camera, lane position, proximity detection to offer features such as collision avoidance assist sunroof and windows closinghigh beam assistturn assistrear cross-path assistexit assist (to warn door opening when a nearby car passes) traffic jam assistnight vision | Audio, display, vibration | [ |
| BMW | Active Driving Assistant with Attention Assistant | Uses features extracted with the help of radar, camera, thermal camera, lane position, proximity detection to offer features such as lane change warning, night vision, steering and lane control system for semi-automated driving, crossroad warning, assistive parking | Audio, display, vibration | [ |
| Cadillac | Cadillac Super Cruise | System based on FOVIO vision technology developed by Seeing Machines IR camera on the steering wheel column to accurately determine the driver’s attention state | Audio and visual | [ |
| Ford | Ford Safe and Smart | Uses features extracted with the help of radar, camera, steering sensors, lane position, proximity detection to offer features such as lane-keeping system, adaptive cruise control, forward collision warning with brake support, front rain-sensing windshield wipers, auto high-beam headlamps, blind spot information system, reverse steering | Audio, display, vibration | [ |
| Mercedez-Benz | MB Pre-safe Technology | Uses features extracted with the help of radar, camera, sensors on the steering column, steering wheel movement and speed to offer features such as driver’s profile and behaviour, accident investigation, pre-safe brake and distronic plus technology, night view assist plus, active lane keeping assist and active blind spot monitoring, adaptive high beam assist, attention assist | Audio, display | [ |
| Toyota | Toyota Safety Sense | Uses features extracted with the help of radar, charge-coupled camera, eye tracking and head motion, audio, display advanced obstacle detection system, pre-collision system, lane departure alert, automatic high beams, dynamic radar cruise control, pedestrian detection, | Audio, display | [ |