| Literature DB >> 24009594 |
Sebastian Loth1, Kerstin Huth, Jan P De Ruiter.
Abstract
Recognizing the intention of others is important in all social interactions, especially in the service domain. Enabling a bartending robot to serve customers is particularly challenging as the system has to recognize the social signals produced by customers and respond appropriately. Detecting whether a customer would like to order is essential for the service encounter to succeed. This detection is particularly challenging in a noisy environment with multiple customers. Thus, a bartending robot has to be able to distinguish between customers intending to order, chatting with friends or just passing by. In order to study which signals customers use to initiate a service interaction in a bar, we recorded real-life customer-staff interactions in several German bars. These recordings were used to generate initial hypotheses about the signals customers produce when bidding for the attention of bar staff. Two experiments using snapshots and short video sequences then tested the validity of these hypothesized candidate signals. The results revealed that bar staff responded to a set of two non-verbal signals: first, customers position themselves directly at the bar counter and, secondly, they look at a member of staff. Both signals were necessary and, when occurring together, sufficient. The participants also showed a strong agreement about when these cues occurred in the videos. Finally, a signal detection analysis revealed that ignoring a potential order is deemed worse than erroneously inviting customers to order. We conclude that (a) these two easily recognizable actions are sufficient for recognizing the intention of customers to initiate a service interaction, but other actions such as gestures and speech were not necessary, and (b) the use of reaction time experiments using natural materials is feasible and provides ecologically valid results.Entities:
Keywords: action recognition; human robot interaction; intention recognition; social robotics; social signal processing; social signaling
Year: 2013 PMID: 24009594 PMCID: PMC3757322 DOI: 10.3389/fpsyg.2013.00557
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Summary of customer behavior when bidding for attention.
| Body to bar | 95 | 210 |
| Head to bar | 93 | 157 |
| Being directly at bar | 92 | 92 |
| Approaching bar | 44 | 44 |
| Leaning on bar | 12 | 12 |
| Turning to bar | 11 | 11 |
| Further away from bar | 4 | 4 |
| Looking at bartender | 86 | 246 |
| Head gesture | 11 | 14 |
| Looking at money | 7 | 13 |
| Looking at assortment | 3 | 3 |
| Looking at menu | 1 | 2 |
| Raising eyebrows | 5 | 9 |
| Smiling | 1 | 1 |
| Attention to bartender | 91 | 231 |
| Attention to human | 32 | 93 |
| Attention to object | 49 | 89 |
| Holding object/bottle | 17 | 17 |
| Hand gesture to others | 7 | 9 |
| Hand gesture to bartender | 4 | 5 |
| Speaking to bartender | 10 | 11 |
| Speaking to others | 21 | 57 |
The table shows the number of interactions that included a particular behavior and its absolute frequency in the attention bids.
Figure 1A grid of example snapshots recorded in the “Movie,” Bielefeld. Top left: Being at bar (no-response expected); Top right: Looking at bar (no-response expected); Bottom left: Ordering (yes-response expected); Bottom right: Not ordering (yes-response expected).
Results of Experiment 1.
| Being directly at bar | No | 14 (1.2%) | 1195 (98.8%) | −0.51 | 292 (24%) | 903 (76%) |
| Looking at bar | No | 16 (1.3%) | 1193 (98.7%) | −0.47 | 319 (27%) | 874 (73%) |
| Ordering | Yes | 16 (1.3%) | 1255 (98.7%) | +0.65 | 1034 (82%) | 221 (18%) |
| Not ordering | Yes | 21 (1.8%) | 1126 (98.2%) | +0.68 | 947 (84%) | 179 (16%) |
The response scores (yes-responses were scored as +1 and no-responses as −1) and times were computed for valid responses.
Proportions of .
| Signals present ( | Hit | Miss |
| 0.832 (1981) | 0.168 (400) | |
| Signals absent ( | False alarm | Correct rejection |
| 0.256 (611) | 0.744 (1777) |
The numbers in brackets show the absolute number of responses.
Results of Experiment 2.
| Being directly at bar | −0.58 | 95 (21%) | 387 (79%) | 450 |
| Looking at bar | −0.72 | 63 (14%) | 355 (86%) | 450 |
| Ordering | +0.94 | 871 (97%) | 28 (3%) | 899 |
Maximum entropy and entropy of correct go-responses in Experiment 2.
| 250 | 3.53 | 6.89 |
| 500 | 2.99 | 5.89 |
| 1000 | 2.37 | 4.89 |
Frequency of the signals mentioned in the interview responses.
| No order | 29 (21%) | “Kein Blickkontakt, kein Bestellungswunsch” [no eye contact, no interest to order] |
| Eye contact/gaze to bar/bartender | 38 (27%) | “Mann dreht sich nach vorne und schaut über den Tresen” [A man turns forward and looks behind the bar] |
| Moving to bar | 21 (15%) | “Die Frau kommt zur Theke.” [The woman walks to the counter] |
| Changing body posture | 19 (13%) | “Der Man wendet sich in der Richtung des Barmers” [The man turns around to the bartender] |
| Reading menu | 16 (11%) | “Ich habe das sofort erkannt, weil der Gast die Karte gelessen hat. Das heisst, dass er etwas bestehlen wollte.” [I recognized this immediately, because the customer read the menu. That means that he intended to order] |
| Looking for/at money, holding wallet | 11 (8%) | “Bringt Geld aus der Portmone raus” [Produces money out of wallet] |
| Being at bar | 10 (7%) | “Die Gaeste sind yur Bar gekommen und haben sich die Getraenkekarte angesehen” [The customers arrived at the bar and looked at the menu] |
| Customers had no drink | 8 (6%) | “Der Mann hat noch nichts zu trinken, gruenes shirt” [The man has nothing to drink yet, green shirt] |
| Pointing | 4 (3%) | “Zeigebewegung der Frau” [The woman's pointing gesture] |
| Others | 18 (13%) | “Die Frau schaut sich um.” [The woman looks about.] “Die Frau sieht suchend aus, als ob sie neu dazu gekommen ist” [The woman appears to be looking for something. As if she joined recently] |
The proportion of responses mentioning this signal is shown in brackets. The German examples are presented as the participants typed them. The classification and English translation were based on the most likely interpretation.