| Literature DB >> 35351904 |
Zihan Yan1, Yue Wu1, Yifei Shan1, Wenqian Chen1, Xiangdong Li2.
Abstract
Eye tracking is a widely used technique. To enhance eye gaze estimation in different contexts, many eye tracking datasets have been proposed. However, these datasets depend on calibrations in data capture and its applications. We seek to construct a dataset that enables the design of a calibration-free eye tracking device irrespective of users and scenes. To reach this goal, we present ARGaze, a dataset with 1,321,968 pairs of eye gaze images at 32 × 32 pixel resolution and 50 corresponding videos of world views based on a replicable augmented reality headset. The dataset was captured from 25 participants who completed eye gaze tasks for 30 min in both real-world and augmented reality scenes. To validate the dataset, we compared it against state-of-the-art eye gaze datasets in terms of effectiveness and accuracy and report that the ARGaze dataset achieved record low gaze estimation error by 3.70 degrees on average and 1.56 degrees on specific participants without calibrations to the two scenes. Implications for generalising the use of the dataset are discussed.Entities:
Year: 2022 PMID: 35351904 PMCID: PMC8964812 DOI: 10.1038/s41597-022-01200-0
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Schematic overview of the study and assay design.
| Step | Description | Procedures |
|---|---|---|
| Data acquisition | Capture images of left eye, right eye, and world view | • Record eye images and corresponding world view images in the real-world and augmented reality scenes |
| Data processing | Locate and annotate markers in the real-world and augmented reality scenes, and remove unqualified images | • Identify eye areas • Remove eye blink images • Transform video footages into image sequences • Extract marker coordinates from the images |
| Data verification | Propose the models with which to validate the dataset in terms of eye gaze estimation (without calibration) effectiveness and accuracy | • Assemble the dataset with eye images and related coordinates • Validate the dataset with the SIFTNet- and ALSTM-FCN- hybrid model |
Participant demography.
| Gender | Age | Visual acuity on average (LogMAR) | Backgrounds |
|---|---|---|---|
| 13 females, 12 males | M = 22, SD = 2.53 | left eyes (M = 0.22, SD = 0.34) right eyes (M = 0.23, SD = 0.324) Note: 10 out of 25 participants had myopia and wore contact lenses during the experiments. | 9 computer engineering 6 computer science 2 fine art 5 social science 3 medicine |
Characteristics of the participants’ eyes.
| Feature Name | Values |
|---|---|
| Iris Colour (under IR-enabled camera) | 7 dark, 6 medium, 12 bright |
| Single or double eyelids | 10 single-eyelid, 15 double-eyelid |
| Eye shape | 4 round, 21 almond; 2 protruding, 23 monolid; 5 upturned, 20 downturned |
Fig. 1Devices and scenarios.
Fig. 2Fixtures and views.
Fig. 3Dataset structure and procedures of the chessboard calibration and calculation.
Fig. 4Procedural flows of the experiments.
Fig. 5Network structure and training method of the model.
Fig. 6Examples of data processing.
Fig. 7Result of estimation error with different models and inputs.
Fig. 8Result of estimation error with different dataset parameters.
Comparisons of the current eye tracking datasets.
| Participants | On-Screen Gaze Targets | Images | Resolution | Scenes | Single User Angle Error (Lowest) | Multiuser Angle Error (Average) | |
|---|---|---|---|---|---|---|---|
| ARGaze | 25 | continuous | 1,321,968 | 32 × 32 | AR, real world | 0.78(AR) & 0.90(RW) | 3.70(mixed) |
| MPIIGaze[ | 15 | continuous | 213,659 | real world | 4.56 | ||
| Eyediap[ | 16 | continuous | videos | 640 × 480 | real world | 5.84 | |
| ShanghaiTechGaze + [ | 218 | continuous | 165,231 | 1920 × 1080 | real world | 4.8 | |
| UT Multiview[ | 50 | discrete | 64,000 | real world | 4.41(left eye) & 4.24(right eye) | ||
| InvisibleEye[ | 17 | discrete | 28,000 | 250 × 250 | real world | 1.79 | |
| NVGaze[ | 30 | continuous | 2,500,000 | 480 × 640 | VR, AR | 0.84 | 2.06 |
| Stephan | 152 | continuous | 356,649 | 400 × 640 | VR | ||
| Magic Eyes[ | 587 | continuous | 880,000 | 480 × 640 | MR | 4.22 | |
| TEyeD[ | 132 | continuous | 29,867,973 | 480 × 640, 288 × 384, 240 × 320, 360 × 640 | VR, AR, real world |
| Measurement(s) | Eye Movement Measurement |
| Technology Type(s) | eye tracking device |
| Factor Type(s) | Target Movement |
| Sample Characteristic - Organism | Homo sapiens |