| Literature DB >> 30684203 |
Yanik Sterchi1, Nicole Hättenschwiler2, Adrian Schwaninger2.
Abstract
In visual inspection tasks, such as airport security and medical screening, researchers often use the detection measures d' or A' to analyze detection performance independent of response tendency. However, recent studies that manipulated the frequency of targets (target prevalence) indicate that da with a slope parameter of 0.6 is more valid for such tasks than d' or A'. We investigated the validity of detection measures (d', A', and da) using two experiments. In the first experiment, 31 security officers completed a simulated X-ray baggage inspection task while response tendency was manipulated directly through instruction. The participants knew half of the prohibited items used in the study from training, whereas the other half were novel, thereby establishing two levels of task difficulty. The results demonstrated that for both levels, d' and A' decreased when the criterion became more liberal, whereas da with a slope parameter of 0.6 remained constant. Eye-tracking data indicated that manipulating response tendency affected the decision component of the inspection task rather than search errors. In the second experiment, 124 security officers completed another simulated X-ray baggage inspection task. Receiver operating characteristic (ROC) curves based on confidence ratings provided further support for da, and the estimated slope parameter was 0.5. Consistent with previous findings, our results imply that d' and A' are not valid measures of detection performance in X-ray image inspection. We recommend always calculating da with a slope parameter of 0.5 in addition to d' to avoid potentially wrong conclusions if ROC curves are not available.Entities:
Keywords: Detection measures; Signal detection theory; Visual search; X-ray image inspection
Mesh:
Year: 2019 PMID: 30684203 PMCID: PMC6647488 DOI: 10.3758/s13414-018-01654-8
Source DB: PubMed Journal: Atten Percept Psychophys ISSN: 1943-3921 Impact factor: 2.199
Outcome of decisions depending on stimulus using the terminology of visual search, signal detection theory, and X-ray baggage inspection
| Decision | ||
|---|---|---|
| Stimulus | Target absent | Target present |
Target absent Noise No prohibited item present | Correct rejection | False alarm |
Target present Signal plus noise Prohibited item present | Miss | Hit |
Note. Target present and target absent are terms used in visual search studies (Biggs & Mitroff, 2015; Eckstein, 2011; Wolfe, 2007, p. 99). Noise, no signal, signal plus noise, signal, hit, miss, false alarm, and correct rejection are terms used in signal detection theory (Gescheider, 1997, p. 106; Green & Swets, 1966, p. 34). The other terms have been used in X-ray baggage inspection studies (Cooke & Winner, 2007; Schwaninger, Hardmeier, & Hofer, 2004)
Fig. 1Illustration of noise and signal-plus-noise distribution (first column), receiver operating characteristic (ROC) curves (second column), and ROC curves in z-transformed space (zROC; third column) corresponding to d' (first row), d (second row), and A' (third row)
Fig. 2Two examples of the prohibited item category knife: (a) example of a known target item and (b) example of a novel target item (Asian combat knife)
Mean (SD) of the normal and liberal decision condition and the effect size (standardized difference) of the decision condition for hit rate (HR), false alarm rate (FAR), and detection measures d', A', d, and A
| Decision condition | HR | FAR |
|
|
|
|
|---|---|---|---|---|---|---|
| Known targets | ||||||
| Normal decision | .79 (.10) | .09 (.08) | 2.25 (0.61) | 2.03 (0.57) | .916 (.044) | .894 (.072) |
| Liberal decision | .90 (.10) | .25 (.13) | 2.01 (0.58) | 2.08 (0.61) | .899 (.049) | .906 (.073) |
| Effect size | -0.40 | -0.08 | -0.42 | 0.23 | ||
| Novel targets | ||||||
| Normal decision | .58 (0.14) | .09 (.08) | 1.63 (0.41) | 1.28 (0.38) | .851 (.040) | .799 (.082) |
| Liberal decision | .71 (0.13) | .25 (.13) | 1.27 (0.44) | 1.19 (0.43) | .817 (.074) | .793 (.076) |
| Effect size | -0.70 | -0.19 | -0.50 | -0.07 | ||
Response times [ms] for correct responses
| Normal decision | Liberal decision | |||
|---|---|---|---|---|
|
|
| |||
| Target-present | 6,000 (2,407) | 4,295 | 8,018 (4,331) | 6,291 |
| Target-absent | 6,813 (2,798) | 5,873 | 11,162 (6,872) | 9,464 |
Note. The reported means and standard deviations are based on individual mean response times, and the reported medians on individual median response times
Mean (SD) share of images per subject with a recorded fixation within the area of interest
| Share AOI fixations | ||
|---|---|---|
| Image type | Normal decision | Liberal decision |
| Known target | .713 (.237) | .740 (.258) |
| Novel target | .742 (.165) | .730 (.180) |
Effect size (standardized difference) [and 95% confidence intervals] of target novelty (known vs. novel targets)
|
|
| |||||
|---|---|---|---|---|---|---|
| Normal decision | 1.60 | [1.21, 2.10] | 1.72 | [1.34, 2.15] | 1.24 | [0.84, 1.64] |
| Liberal decision | 1.98 | [1.20, 3.02] | 1.73 | [1.11, 2.48] | 2.20 | [1.35, 3.04] |
Fig. 3Receiver operating characteristic (ROC) curves implied by d', A', and d estimated by the pooled hit rate (HR) and false alarm rate (FAR) of the normal decision condition for known prohibited items (higher HR) and novel prohibited items (lower HR)
Response times [ms] for correct responses
|
| ||
|---|---|---|
| Target-present | 4,781 (1,087) | 3,816 |
| Target-absent | 5,079 (1,959) | 4,008 |
Note. The reported group means and standard deviations are based on individual mean response times, and the reported medians on individual median response times
Fig. 4Individual (grey; jittered) and pooled (black) empirical zROC curves, the lines corresponding to the mean A', d', and d with a slope of 0.6, and the chance line (dashed)
Fig. 5Distribution, mean (red dashed line), and median (solid blue line) of arctan-transformed individual slope parameters