| Literature DB >> 34982104 |
Candace E Peacock1,2, Ting Zhang1,3, Brendan David-John1,4, T Scott Murdison1,5, Matthew J Boring1,6, Hrvoje Benko1,7, Tanya R Jonker1,8.
Abstract
Numerous studies have demonstrated that visuospatial attention is a requirement for successful working memory encoding. It is unknown, however, whether this established relationship manifests in consistent gaze dynamics as people orient their visuospatial attention toward an encoding target when searching for information in naturalistic environments. To test this hypothesis, participants' eye movements were recorded while they searched for and encoded objects in a virtual apartment (Experiment 1). We decomposed gaze into 61 features that capture gaze dynamics and a trained sliding window logistic regression model that has potential for use in real-time systems to predict when participants found target objects for working memory encoding. A model trained on group data successfully predicted when people oriented to a target for encoding for the trained task (Experiment 1) and for a novel task (Experiment 2), where a new set of participants found objects and encoded an associated nonword in a cluttered virtual kitchen. Six of these features were predictive of target orienting for encoding, even during the novel task, including decreased distances between subsequent fixation/saccade events, increased fixation probabilities, and slower saccade decelerations before encoding. This suggests that as people orient toward a target to encode new information at the end of search, they decrease task-irrelevant, exploratory sampling behaviors. This behavior was common across the two studies. Together, this research demonstrates how gaze dynamics can be used to capture target orienting for working memory encoding and has implications for real-world use in technology and special populations.Entities:
Mesh:
Year: 2022 PMID: 34982104 PMCID: PMC8742516 DOI: 10.1167/jov.22.1.2
Source DB: PubMed Journal: J Vis ISSN: 1534-7362 Impact factor: 2.240
Figure 1.Example of an experiment 1 trial. During each trial, a text prompt was accompanied by a yellow arrow to guide navigation; a blue arrow was used to indicate the object to be recalled. The blue arrow later disappeared once participants’ gaze intersected the target object; after encoding five or nine objects, participants were asked to verbally recall all the objects.
Figure 2.Visualization of the sliding window framework. (a) A hypothetical eye movement feature (blue line) that increases just before the onset of WM encoding. The predictive window for this feature slides along sample by sample to produce multiple windows of data. (b) An example of how features were temporally downsampled. If feature 1 (or 2) contains 5 (or 10) samples in the optimal predictive window, then these 5 (or 10) samples would be downsampled via averaging to generate 1 (or 2) beta parameter without sacrificing accuracy. (c) An example of how the feature concatenation was performed. The class would be determined by the y value of the last time stamp that was concatenated.
Figure 3.The modeling framework. The gaze data was first preprocessed (a–d) before modeling. Following the pre-processing, we then modeled the data (e–g) and tested it on a held-out test set (h).
A description of the features and the window sizes for those features that were retained from recursive feature selection.
| Feature name | Window size (ms) | Description |
|---|---|---|
| Fixation detection | 1,000 | An index of cognitive processing computed as a binary variable to describe if a fixation occurred (1) or not (0). If a fixation was detected, then a participant's eyes were stationary. If a fixation was not detected, then their eyes were moving. |
| Gaze velocity | 916 | An index of gaze exploration that was computed as the angular distance between two gaze samples divided by the change in time. Smaller gaze velocities indicated that the eyes were moving slower, whereas larger gaze velocities indicated that the eyes were moving faster. |
| Angular displacement between previous and current saccade centroid | 125 | A measure of the distance between subsequent saccade centroids that was computed as the smallest angle needed to rotate the centroid of a saccade (i.e., 3D gaze vector direction) overtop the previous saccade centroid. Saccade centroids were defined as the center position of all samples in a saccade. Smaller angular displacements indicated there were shorter distances between subsequent saccade centroids, whereas larger angular displacements indicated that subsequent saccade centroids were farther apart. |
| Angular displacement between previous and current saccade landing points | 125 | A measure of the distance between subsequent saccade landing points computed as the smallest angle needed to rotate the landing point of a saccade (i.e., 3D gaze vector direction) on top of the previous saccade landing point. Smaller angular displacements indicated that saccade landing points were closer together whereas larger angular displacement indicates that saccade landing points were farther apart. This feature is likely correlated with the angular displacement between saccade centroids when subsequent saccades are small because the centroids are close to landing points. Therefore, the angular displacement between saccade landing points provides a unique measure of the variance relative to the angular displacement between saccade centroids when one saccade was large and the next one was small because there would be less distance between the end points than the centroids of those saccades. Conversely, when one saccade was small and the next was large, there would be a greater distance between the end points than the centroids. |
| Angular displacement between previous and current fixation centroid | 83 | A measure of the distance between fixation centroids. This was computed as the smallest angle needed to rotate the centroid of a saccade (3D gaze vector direction) overtop the previous fixation centroid. Smaller angular displacements indicated that fixation centroids were closer together whereas larger angular displacements indicated that fixation centroids were further apart. The angular displacement between fixation centroids is a unique event compared with the angular displacement between saccade centroid/landing points and, therefore, the fixation centers would be in different locations than the saccade centroids/landing points. |
| Midlevel dispersion | 916 | An index of exploration that was defined as the maximum angular displacement of all the samples from the centroid of a 1000 ms period/window. Increased dispersion indicated that subsequent gaze samples were further apart (i.e., more exploration), whereas decreased dispersion indicated that subsequent gaze samples were closer together (i.e., less exploration). |
| Saccade velocity: standard deviation | 1,000 | The standard deviation of saccade velocity. Increased standard deviations of saccade velocity indicated that there was more variability in saccade velocities whereas decreased standard deviation in saccade velocity indicated that there was less variability in saccade velocities. |
| Midlevel K-coefficient | 1,000 | A coefficient that described the ambient focal phenomenon 500 ms before the current sample. This measure was derived by computing the z-score for each saccade amplitude and subtracting the z-score of fixation duration from the fixation that proceeded it. The coefficient corresponded to the average value over all saccades and fixations in the past 500 ms. Larger values resulted from large fixation durations with small saccade amplitudes (i.e., focal) whereas small or negative values indicated shorter fixation durations and larger saccade amplitudes (i.e., ambient). |
| Saccade acceleration: skew | 125 | A measure of how skewed the saccade acceleration distribution was. This measure was described by computing the skew of saccade acceleration samples. No skew indicated that the tails of the distribution were balanced, whereas positive skew indicated that the tail was on the right (i.e., faster saccade accelerations) and negative skew indicated that the tail was on the left (i.e., slower saccade accelerations). Skew in saccade accelerations typically occur when saccade decelerations are slower than saccade accelerations ( |
| Vertical component of saccade acceleration: skew | 167 | A measure of the skew of the vertical component of the saccade acceleration distribution. As the saccade acceleration skew feature measures both the horizontal and vertical angles in tandem, the vertical component of saccade acceleration skew only measured the vertical component of saccade acceleration. No skew indicated that the tails of the distribution were balanced, whereas positive skew indicated that the tail was on the right (i.e., faster saccade accelerations in the vertical component) and negative skew indicated that the tail was on the left (i.e., slower saccade accelerations in the vertical component). |
| Fixation duration | 83 | An index of cognitive processing that was defined as the end time minus the start time of a fixation. Longer fixation durations indicated increased cognitive processing whereas shorter fixation durations indicated decreased cognitive processing. |
| Vertical component of saccade velocity: mean | 125 | The mean of the vertical component of the saccade velocity. A larger mean indicated that saccades were directed more in the vertical direction whereas a smaller mean indicated that saccades were directed less in the vertical direction. |
Figure 4.Results of the stability selection and group feature selection processes. The top panel depicts the proportion of participants that retained a given feature after the stability selection process. The bottom panel visualizes the features that were retained from the group recursive feature selection process as features were iteratively added, from most retained to least retained. Each point refers to the average AUC-PR from the cross-validation procedure and the error bars refer to 95% confidence intervals. Asterisks correspond with features that increased the AUC-PR relative to the previous benchmark and were thus used in the group models.
Figure 5.Partitioning of the train and test sets for an example participant. The group model was trained on 20% of the data (purple circle) with 80% of the data held out (green circle). The within-participant model was trained on 90% of the data (grey circle) with 10% of the data held out (yellow circle). Samples that overlapped between the group and within-participant test sets were then identified (red box) for the final model evaluation. This process produced test samples that were independent of both training sets.
Figure 6.The group model tested on individual participants. The gray bars represent the H1 results, and the white bars represent the results corresponding to the filtered test data preceding fixations. The top panel represents the AUC-PR results, and the bottom panel represents the AUC-ROC results. The red, dashed line represents chance performance.
Figure 7.Fixation detector analysis description. The time series of gaze velocity corresponds with the samples where saccades (green) and fixations (brown) were detected. To test whether the model was simply detecting fixations, the null class windows that ended just before the onset of a fixation (blue) were filtered. This temporally matched the filtered null classes to the true classes which also ended in no fixation and were followed by the onset of a fixation on the target object (red).
Design Differences between Experiments 1 and 2.
| Features |
|
|
|---|---|---|
| Participant set | Unique group of participants | Unique group of participants |
| Room type | Bedroom, patio, kitchen, bathroom, living room | Kitchen |
| Clutter | Low | High |
| Task | Find cue and encode target object 5 or 9 times | Find 1 or 3 target objects and encode 1 or 3 associated nonwords |
| Encoded stimulus | Semantic label of target object | Associated nonword |
| Memory test | Verbal recall | Nonword recognition |
Figure 8.Trial sequence for One or three to-be-remembered objects were presented on a gray background; participants were to find these objects in a cluttered room and then remember the nonword that was located above the target object. After all the objects were found, participants were asked to recall the associated nonwords from an array of target and distractor nonwords.
Figure 9.Group model tested on individual participants from The gray bars represent the AUC-PR (top panel) or AUC-ROC (bottom panel) and the red, dashed line represents chance in Experiment 2.
Figure 10.Features ranked from the most generalizable to the least generalizable. The top panel visualizes the features ranked based upon AUC-PR, which was the metric used to rank feature performance. The models were trained on the data collected during Experiment 1 and tested on each participant's data collected during Experiment 2. Each data point represents the mean AUC-PR. The bottom panel represents the corresponding mean AUC-ROC for each ranked feature. Asterisks depict whether the false discovery rate–corrected p value for each feature was significant (p < 0.05). The red, dashed lines represent chance. Error bars represent 95% confidence intervals.
Statistics about the generalizability of individual features for the AUC-PR and AUC-ROC metrics.
| AUC-PR | AUC-ROC | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Feature names | M (SD) | T | False discovery rate | Cohen's D | Generalized? | M (SD) | t | False discovery rate | Cohen's D | Generalized? |
| Fixation detection | 0.054 (0.09) | 2.49 | 0.04 | 0.50 | Yes | 0.69 (0.09) | 10.20 | <0.001 | 2.11 | Yes |
| Angular displacement between previous and current fixation centroid | 0.030 (0.03) | 3.59 | 0.004 | 0.70 | Yes | 0.54 (0.09) | 2.31 | 0.06 | 0.44 | No |
| Midlevel dispersion | 0.022 (0.04) | 1.66 | 0.13 | 0.33 | No | 0.50 (0.10) | 0.20 | 0.84 | 0.00 | No |
| Fixation duration | 0.019 (0.02) | 2.30 | 0.05 | 0.50 | No | 0.51 (0.10) | 0.36 | 0.79 | 0.10 | No |
| Angular displacement between previous and current saccade centroid | 0.017 (0.01) | 5.16 | <0.001 | 0.80 | Yes | 0.57 (0.08) | 4.22 | 0.002 | 0.88 | Yes |
| Saccade acceleration: skew | 0.017 (0.02) | 2.53 | 0.04 | 0.40 | Yes | 0.53 (0.09) | 1.88 | 0.10 | 0.33 | No |
| Angular displacement between previous and current saccade landing points | 0.016 (0.01) | 4.67 | 0.001 | 0.70 | Yes | 0.56 (0.10) | 2.83 | 0.03 | 0.60 | Yes |
| Midlevel K-coefficient | 0.016 (0.02) | 1.69 | 0.13 | 0.35 | No | 0.44 (0.11) | 2.69 | 0.03 | –0.54 | No |
| Vertical component of saccade acceleration: skew | 0.015 (0.01) | 3.82 | 0.003 | 0.60 | Yes | 0.54 (0.10) | 1.87 | 0.10 | 0.40 | No |
| Vertical component of saccade velocity: mean | 0.015 (0.02) | 1.91 | 0.10 | 0.30 | No | 0.46 (0.06) | 3.61 | 0.006 | –0.66 | No |
| Saccade velocity: standard deviation | 0.010 (0.003) | 1.43 | 0.18 | 0.33 | No | 0.47 (0.14) | 1.21 | 0.29 | –0.21 | No |
| Gaze velocity | 0.009 (0.003) | 0.43 | 0.67 | 0.00 | No | 0.45 (0.13) | 2.07 | 0.09 | –0.38 | No |
Figure 11.Comparison of best-generalizing features in This figure depicts the value of the features that generalized relative to the encoding onset across Experiments 1 and 2 based on the AUC-PR results. The red line (solid line, square dots) represents true classes, and the blue line (dotted line, circles) represents null classes. The error bars represent 95% confidence intervals.
Figure 12.Saccade deceleration versus acceleration in at the end of search when people oriented to the target (i.e., true classes) and early during search (null classes). The saccade deceleration time (red) and acceleration time (blue) during null classes in Experiment 1 (a) and Experiment 2 (d). The saccade deceleration time (red) and acceleration time (blue) during true classes in Experiment 1 and (b) Experiment 2 (e). The difference between deceleration and acceleration for the true classes (red) versus the null classes (blue) for (c) Experiments 1 (c) and 2 (f).