| Literature DB >> 30837917 |
Alain Chavaillaz1, Adrian Schwaninger2, Stefan Michel2, Juergen Sauer1.
Abstract
X-ray screening of passenger baggage is a key component in aviation security. The current study investigated how experts and novices performed in an X-ray baggage screening task while being assisted by an adaptable diagnostic aid. Furthermore, it examined how both groups operated and trusted this automated system. 30 experts (certified screeners) and 31 novices (students) had to indicate whether a target item (either a knife or a gun) was present in a series of X-ray images of cabin baggage. Half of the participants could choose between three different support levels of the diagnostic aid (DA): (1) no support, (2) a cue indicating the presence of a potential target without locating it, or (3) a cue indicating the presence of a potential target by surrounding it with a red frame. As expected, experts achieved higher detection performance (d'), were more self-confident and felt more competent in achieving the task than novices. Furthermore, experts experienced less time pressure and fatigue. Although both groups used the DA in a comparable way (in terms of support level used and frequency of level switches), results showed a performance increase for novices working with the DA compared to novices without support. This benefit of DA was not observed for experts. Interestingly, despite no difference in perceived trust ratings, experts were more compliant (i.e., following DA recommendations when it indicated the presence of a target) and reliant (i.e., following DA recommendations when it indicated the absence of a target) than novices. Altogether, the results of the present study suggested that novices benefited more from a DA than experts. Furthermore, compliance and reliance on DA seemed to depend on expertise with the task. Since experts should be better at assessing the reliability of the DA than novices, they may have used the DA as 'back-up' to confirm their decisions based on expertise (confirmatory function), while novices may have used it as a guide to base their decisions on (support function). Finally, trust towards a DA was associated with the degree to which participants found the DA useful.Entities:
Keywords: airport security; automation; detection performance; expertise; trust
Year: 2019 PMID: 30837917 PMCID: PMC6382685 DOI: 10.3389/fpsyg.2019.00256
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
FIGURE 1Interface of Luggage Inspection Simulation (LIS) depicting the levels of automations (LOA): (A) trial without support (LOA 1), (B) indirect cue (LOA 2), (C) direct cue (LOA 3), and (D) valid cue for a target-absent trial (LOA 3).
F-value, significance level and effect size for the main and interaction effects for expertise and presence of the diagnostic aid.
| Expertise | Diagnostic aid | Expertise X Diagnostic aid | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Variable | |||||||||
| Detection | 3.680 | 0.060 | 0.062 | ||||||
| Response bias | 0.180 | 0.673 | 0.003 | ||||||
| Target localization | |||||||||
| Self-confidence | 1.723 | 0.195 | 0.030 | 0.191 | 0.663 | 0.003 | |||
| Perceived workload | 3.463 | 0.068 | 0.058 | 0.000 | 0.985 | 0.000 | 0.092 | 0.762 | 0.002 |
| Mental load | 3.201 | 0.079 | 0.054 | 0.661 | 0.420 | 0.012 | 0.026 | 0.871 | 0.000 |
| Physical load | 0.318 | 0.570 | 0.006 | 0.654 | 0.422 | 0.012 | 1.010 | 0.319 | 0.018 |
| Time pressure | 0.026 | 0.873 | 0.000 | 0.373 | 0.540 | 0.007 | |||
| Performance | 0.386 | 0.537 | 0.007 | 1.220 | 0.274 | 0.021 | |||
| Fatigue | 0.158 | 0.693 | 0.003 | 0.000 | 1 | 0.000 | |||
| Frustration | 0.266 | 0.608 | 0.005 | 0.266 | 0.608 | 0.005 | |||
Mean scores (and standard deviations) for participants’ performance use of automation, and subjective measures as a function of expertise and presence of the diagnostic aid.
| Professional screeners | Novices | |||
|---|---|---|---|---|
| Score | Without DA | DA | Without DA | DA |
| Detection [d’] | 2.29 (0.30) | 2.33 (0.29) | 1.15 (0.31) | 1.49 (0.29) |
| Response bias [c] | 0.43 (0.19) | 0.17 (0.32) | 0.63 (.36) | 0.30 (0.28) |
| Target localization [%] | 74.06 (5.78) | 77.81 (7.38) | 41.98 (9.10) | 56.72 (8.61) |
| Median LOA [1–3] | - | 2.23 (0.79) | - | 2.67 (0.52) |
| LOA switches per trial | - | 0.12 (0.14) | - | 0.34 (0.56) |
| Self-confidence [1–10] | 7.73 (0.88) | 7.20 (0.94) | 4.53 (1.51) | 4.27 (1.28) |
| Subjective workload [1–20] | ||||
| Mental load | 13.27 (3.69) | 13.80 (3.005) | 14.60 (3.25) | 15.40 (2.67) |
| Physical load | 6.53 (4.98) | 6.80 (6.18) | 7.13 (5.10) | 4.67 (4.69) |
| Time pressure | 6.80 (4.55) | 6.33 (4.15) | 11.47 (3.60) | 12.27 (3.69) |
| Performance | 15.40 (2.32) | 13.73 (4.01) | 9.67 (3.77) | 10.13 (4.50) |
| Fatigue | 12.40 (3.29) | 12.73 (4.18) | 15.00 (2.75) | 15.33 (2.53) |
| Frustration | 4.93 (4.28) | 4.93 (3.52) | 6.93 (5.22) | 8.13 (4.82) |
| Trust perception [1–7] | - | 4.27 (1.34) | - | 3.95 (0.79) |
| Compliance [%] | - | 76.06 (10.78)* | - | 57.93 (11.22) |
| Reliance [%] | - | 96.20 (3.28)* | - | 88.43 (7.60) |
FIGURE 2Mean detection performance as a function of expertise and presence of the diagnostic aid. Error bars correspond to the 95% confidence interval around the mean.
FIGURE 3Mean percentage of correct target localization as a function of expertise and presence of the diagnostic aid. Error bars correspond to the 95% confidence interval around the mean.
Correlation table between perceived trust and other measures as a function of expertise level (experts vs. novices).
| Perceived trust | ||||||
|---|---|---|---|---|---|---|
| Experts | Novices | |||||
| Compliance | -0.057 | 0.860 | 12 | 0.559 | 0.030 | 15 |
| Reliance | 0.007 | 0.983 | 12 | 0.002 | 0.994 | 15 |
| Experience | 0.270 | 0.330 | 15 | - | ||