| Literature DB >> 33828688 |
Birgitta Burger1, Anna Puupponen1, Tommi Jantunen1.
Abstract
Both eye tracking and motion capture technologies are nowadays frequently used in human sciences, although both technologies are usually used separately. However, measuring both eye and body movements simultaneously would offer great potential for investigating crossmodal interaction in human (e.g. music and language-related) behavior. Here we combined an Ergoneers Dikablis head mounted eye tracker with a Qualisys Oqus optical motion capture system. In order to synchronize the recordings of both devices, we developed a generalizable solution that does not rely on any (cost-intensive) ready-made / company-provided synchronization solution. At the beginning of each recording, the participant nods quickly while fixing on a target while keeping the eyes open - a motion yielding a sharp vertical displacement in both mocap and eye data. This displacement can be reliably detected with a peak-picking algorithm and used for accurately aligning the mocap and eye data. This method produces accurate synchronization results in the case of clean data and therefore provides an attractive alternative to costly plug-ins, as well as a solution in case ready-made synchronization solutions are unavailable.Entities:
Keywords: Eye movement; eye tracking; intermodal processing; methodology; motion capture; new media; synchronization; technology
Year: 2018 PMID: 33828688 PMCID: PMC7733527 DOI: 10.16910/jemr.11.2.5
Source DB: PubMed Journal: J Eye Mov Res ISSN: 1995-8692 Impact factor: 0.957
“Ground truth” sync points (“QTM” and “D-Lab”) and computationally derived sync points (“Matlab”) as well as their respective difference (“Diff”). All values are in seconds.
| Mocap data | Pupil data | |||||
| Trial | QTM | Matlab | Diff | D-Lab | Matlab | Diff |
| 1 | 4.7500 | 4.7500 | 0 | 3.48 | 3.46 | -0.02 |
| 2 | 4.9000 | 4.9000 | 0 | 3.00 | 3.02 | 0.02 |
| 3 | 4.7830 | 4.7830 | 0 | 3.16 | 3.14 | -0.02 |
| 4 | 4.7500 | 4.7583 | 0.0083 | 3.50 | 3.50 | 0 |
| 5 | 3.8830 | 3.8830 | 0 | 2.70 | 2.70 | 0 |
| 6 | 4.1083 | 4.1083 | 0 | 2.76 | 2.76 | 0.02 |
| 7 | 4.6916 | 4.6916 | 0 | 2.88 | 2.88 | 0 |
| 8 | 3.7500 | 3.7500 | 0 | 2.78 | 2.78 | 0 |
| 9 | 3.5916 | 3.5916 | 0 | 2.54 | 2.52 | -0.02 |
| 10 | 4.5916 | 4.5916 | 0 | 3.02 | 3.02 | 0 |
Differences in frames between the computational sync points and the “ground truth data” of both mocap and eye tracker for all 40 recordings. One mocap frame equals 5 ms, while one eye tracker frame equals 20 ms.
| mocap | eye tracker | |||
| start nod | end nod | start nod | end nod | |
| P1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| P2 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | -1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| P3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| P4 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| P5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| P6 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| P7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| P8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 |
| P9 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| P10 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 0 |
Differences of recording durations in seconds of mocap system and eye tracker between beginning and end nod per trial and average per participant.
| Duration differences per trial in seconds | |||||
| T1 | T2 | T3 | T4 | Mean | |
| P1 | 0.010 | 0 | 0.005 | 0.005 | 0.0050 |
| P2 | 0 | 0.005 | 0 | -0.030 | 0.0088 |
| P3 | 0.005 | -0.010 | 0 | 0.015 | 0.0075 |
| P4 | -0.020 | 0 | -0.010 | -0.020 | 0.0125 |
| P5 | 0.005 | -0.005 | -0.005 | 0 | 0.0038 |
| P6 | 0 | 0 | 0 | 0 | 0 |
| P7 | -0.005 | 0.010 | 0.020 | 0.010 | 0.0112 |
| P8 | -0.010 | 0.005 | -0.005 | 0.005 | 0.0063 |
| P9 | 0.005 | 0 | 0 | -0.005 | 0.0025 |
| P10 | -0.010 | -0.005 | 0 | 0 | 0.0037 |
Differences of recording durations in seconds of mocap and eye tracker for long recordings.
| Durations in seconds | |||
| Mocap | Eye | Difference | |
| R1 | 42.700 | 42.700 | 0 |
| R2 | 43.010 | 43.000 | -0.010 |
| R3 | 48.895 | 48.880 | -0.015 |
| R4 | 83.895 | 83.860 | -0.035 |
| R5 | 65.815 | 65.800 | -0.015 |
Rating results of participants’ subjective experiences. A 7-step scale (1=not at all – 7=very much, reversed for last question) was used.
| Standard deviation | ||
| How easy was it to keep the eyes open during the nods? | 5.6 | 1.65 |
| How comfortable did you feel during the nod? | 5.0 | 1.89 |
| How clear was it when to produce the nod? | 6.4 | 0.97 |
| How disturbing was it to perform the nod? | 2.3 | 1.83 |