| Literature DB >> 34928486 |
Benjamin Wolfe1, Anna Kosovicheva2, Simon Stent3, Ruth Rosenholtz4,5.
Abstract
While driving, dangerous situations can occur quickly, and giving drivers extra time to respond may make the road safer for everyone. Extensive research on attentional cueing in cognitive psychology has shown that targets are detected faster when preceded by a spatially valid cue, and slower when preceded by an invalid cue. However, it is unknown how these standard laboratory-based cueing effects may translate to dynamic, real-world situations like driving, where potential targets (i.e., hazardous events) are inherently more complex and variable. Observers in our study were required to correctly localize hazards in dynamic road scenes across three cue conditions (temporal, spatiotemporal valid and spatiotemporal invalid), and a no-cue baseline. All cues were presented at the first moment the hazardous situation began. Both types of valid cues reduced reaction time (by 58 and 60 ms, respectively, with no significant difference between them, a larger effect than in many classic studies). In addition, observers' ability to accurately localize hazards dropped 11% in the spatiotemporal invalid condition, a result with dangerous implications on the road. This work demonstrates that, in spite of this added complexity, classic cueing effects persist-and may even be enhanced-for the detection of real-world hazards, and that valid cues have the potential to benefit drivers on the road.Entities:
Keywords: Attentional cueing; Driving; Hazard detection; Peripheral vision; Scene perception
Mesh:
Year: 2021 PMID: 34928486 PMCID: PMC8688617 DOI: 10.1186/s41235-021-00348-4
Source DB: PubMed Journal: Cogn Res Princ Implic ISSN: 2365-7464
Fig. 1Visualization of trial sequence and cues. a Each trial began with a random noise mask for 250 ms, and consisted of a video that lasted between 2034 and 5034 ms. In a given video, the hazard could appear between 1034 and 4000 ms after the video began. The portion of the video containing the hazard was always 1000 ms long, and was followed by a 250 ms random noise mask. Participants were instructed to use the left and right arrow keys to indicate the lateral location of the hazard, and could respond any time after the video started. b The video contained either no cue or one of three cues: a temporal cue (red bar at the bottom of the video), a spatiotemporal valid cue (expanding ring superimposed on the hazard, represented by the solid red circle and the larger dashed red circle) and a spatiotemporal invalid cue (expanding ring superimposed on a nonhazardous object in the scene). The no-cue, temporal cue, and spatiotemporal conditions were blocked, and block order was randomized across participants. Within the spatiotemporal cue block, valid and invalid spatiotemporal cues were randomly interleaved and appeared with equal frequency (50% cue validity). The expanding ring spatiotemporal cue was chosen using the procedure described in the “Cue selection” section
Table describing the six cues used in the cue selection experiments and the reasoning behind their inclusion in the set of candidate cues
| Cue type | Reasoning | Description |
|---|---|---|
| Red bounding box | Likely highly salient as compared to the larger road environment (e.g., green plants or grey/black roadways); (Bauer et al., | Red rectangular outline shown for 33 ms (1 frame), corresponding to the annotated location, drawn with a 5 px stroke width |
| Static zebra-striped bounding box | Maximizes luminance contrast of the cue versus background for arbitrary background luminance; increases cue salience (Engmann et al., | Black and white frame shown for 167 ms (5 frames), corresponding to the annotated location, with adjacent concentric alternating black and white outlines (white-black-white-black at 5 px each, for a total stroke width of 20 px) |
| Flashing zebra-striped bounding box | Maximizes luminance contrast of the cue versus background for arbitrary background luminance, may capture attention better than a static version (Stolte & Ansorge, | As above, but the black and white elements reversed on every frame (white-to-black and black-to-white), with the cue shown for 167 ms (5 frames) |
| Expanding ring (spatiotemporal cue used in main experiment) | Looming stimuli are known to capture attention and may be more salient superimposed on a complex, dynamic scene (Franconeri & Simons, | Red circular outline shown for 167 ms (5 frames). The diameter of the ring on the first frame was equal to the average of half the annotated height and width of the hazard (or distractor) and increased by 40% on each consecutive video frame |
| Contracting ring | Some evidence suggest that motion onsets alone may capture attention and may be sufficient for a cue (Abrams & Christ, | Red circular outline shown for 167 ms (5 frames). The diameter of the ring on the |
| Flashing red dot | Used on grounds of being perhaps the easiest to implement, but likely the least salient or prone to capture attention of any cue in the set | Red filled dot on the center of the annotated hazard location (22 pixels in diameter), appeared and disappeared on alternating frames for 7 frames (4 on-frames) |
Fig. 2Diagram of the set of five cue selection experiments used to evaluate different cue types. There were 100 participants in total (n = 20 per cue selection experiment); each pair of bars represents one experiment, each with a different group of participants. The y-axis on each bar graph indicates the difference in reaction time (in seconds) between the invalid and valid cue conditions, with positive values indicating slower reaction times in the invalid cue condition. Note that in the main experiment, the RT difference between valid and invalid spatiotemporal cues was approximately 120 ms. Error bars indicate ± 1 SEM. a The first set of three experiments measured the cue validity effect for the static versus flashing zebra cues, the expanding versus contracting rings, and the red bounding box versus flashing red dot. b The three cue types with the larger RT difference within each paired comparison (the static zebra, the expanding ring, and the red bounding box) were used in two additional paired comparisons: static zebra versus expanding ring and expanding ring versus red bounding box. In all 5 experiments, all pairwise differences between cue types conditions were not statistically significant, but the cue with the largest invalid-to-valid RT difference within the second set of experiments—the expanding ring—was chosen for the main experiment
Fig. 3Reaction time results. Each individual dot represents one participant’s median reaction time for that condition, and bars represent the group mean within the corresponding condition. Mean reaction time in the absence of a cue was 542 ms (magenta bar); invalid spatiotemporal cue (cyan bar) was 600 ms (+ 58 ms vs no cue), valid spatiotemporal cue (navy bar) was 479 ms (− 63 ms) and temporal cue only (violet bar) was 481 ms (− 61 ms). A single asterisk represents p values < 0.05; three asterisks represent p values < .0001. Error bars are standard error of the mean
Fig. 4Mean proportion correct. Each individual dot represents one participant’s mean proportion correct for a given condition. Mean proportion correct in the absence of a cue (no-cue, magenta bar) was 87.1%; invalid spatiotemporal cue (cyan bar), 76.1% (− 11% vs no cue), valid spatiotemporal cue (navy bar), 88.2% (+ 1.1%) and with a temporal cue (violet bar) 85.2% (− 1.9%). A single asterisk represents p values < 0.05; three asterisks represent p values < .0001. Error bars are standard error of the mean