| Literature DB >> 25793289 |
Rosa S Gisladottir1, Dorothee J Chwilla2, Stephen C Levinson1.
Abstract
The ability to recognize speech acts (verbal actions) in conversation is critical for everyday interaction. However, utterances are often underspecified for the speech act they perform, requiring listeners to rely on the context to recognize the action. The goal of this study was to investigate the time-course of auditory speech act recognition in action-underspecified utterances and explore how sequential context (the prior action) impacts this process. We hypothesized that speech acts are recognized early in the utterance to allow for quick transitions between turns in conversation. Event-related potentials (ERPs) were recorded while participants listened to spoken dialogues and performed an action categorization task. The dialogues contained target utterances that each of which could deliver three distinct speech acts depending on the prior turn. The targets were identical across conditions, but differed in the type of speech act performed and how it fit into the larger action sequence. The ERP results show an early effect of action type, reflected by frontal positivities as early as 200 ms after target utterance onset. This indicates that speech act recognition begins early in the turn when the utterance has only been partially processed. Providing further support for early speech act recognition, actions in highly constraining contexts did not elicit an ERP effect to the utterance-final word. We take this to show that listeners can recognize the action before the final word through predictions at the speech act level. However, additional processing based on the complete utterance is required in more complex actions, as reflected by a posterior negativity at the final word when the speech act is in a less constraining context and a new action sequence is initiated. These findings demonstrate that sentence comprehension in conversational contexts crucially involves recognition of verbal action which begins as soon as it can.Entities:
Mesh:
Year: 2015 PMID: 25793289 PMCID: PMC4368040 DOI: 10.1371/journal.pone.0120068
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Stimuli in Dutch and English translations.
| Set 1 | Set 2 | |||
|---|---|---|---|---|
| Condition | Context | Target Utterance | Context | Target Utterance |
| Answer | Hoe ga je voor het ticket betalen? | Ik heb een creditcard. | Waar koop je je shampoo? | Ik ga naar de Kruidvat. |
|
|
|
|
| |
| Declination | Ik kan je wat geld lenen voor het ticket. | Ik heb een creditcard. | Ik kan wel shampoo voor je meenemen? | Ik ga naar de Kruidvat. |
|
|
|
|
| |
| Pre-offer | Ik heb geen geld om het ticket te betalen. | Ik heb een creditcard. | Mijn shampoo is op. | Ik ga naar de Kruidvat. |
|
|
|
|
| |
Behavioural results.
| Answer | Declination | Pre-offer | |
|---|---|---|---|
| Overall accuracy | 98.5% (2%) | 96.5% (3.8%) | 94.8% (8%) |
| Set 1 accuracy | 98.0% (3.7%) | 96.2% (4.5%) | 97.2% (5.2%) |
| Set 2 accuracy | 99.1% (2.2%) | 96.9% (4.9%) | 92.4% (11.9%) |
Mean accuracy (and standard deviation) in the categorization task for all items (overall) and for each stimulus Set.
Fig 1Timeline.
Mean onset of the first word, verb and final word in target utterances and a rough timeline for the early and late utterance time-windows.
Fig 2Regions used for analyses of EEG data.
Fig 3Early utterance time-window.
A) Grand-averaged ERPs time-locked to the onset of the first word. Representative electrodes showing the relevant effects are highlighted in dashed boxes. B) Scalp distribution of the ERP effects in the early utterance time-window. All waveforms were low-pass filtered (10 Hz) for illustration purposes only. Negativity is plotted upwards.
Regional omnibus analyses and pairwise comparisons for the early utterance time-window.
| Analysis | Source | DF | 100–200 | 200–300 | 300–400 | 400–500 | 500–600 | ||
|---|---|---|---|---|---|---|---|---|---|
| REGIONAL OMNIBUS | Lat | Action×AntPost×Hem | 2, 82 | 4.14 | 7.08 | 3.69 | 4.08 | 5.64 | |
| Action×Hem×Site | 6, 246 | 3.42 | 2.81 | 2.55 | |||||
| Med | Action×AntPost | 2, 82 | 4.11 | ||||||
| PAIRWISE COMPARISONS | D vs. A | Lat | Action×Hem ×Site | 3, 123 | 3.88 | 3.63 | 3.94 | ||
| Action×AntPost×Hem | 1, 41 | 6.77 | 7.08 | ||||||
| Action×AntPost×Site | 3, 123 | 2.89 | |||||||
| Lat Ant | Action | 1, 41 | 6.70 | ||||||
| Action×Hem | 1, 41 | 5.40 | 4.59 | ||||||
| Action×Hem×Site | 3, 123 | 3.28 | 3.06 | ||||||
| Right Ant | Action | 1, 41 | 8.29 | ||||||
| Med | Action×AntPost | 1, 41 | 6.86 | ||||||
| Med Ant | Action | 1, 41 | 6.90 | ||||||
| P vs. A | Lat | Action×Hem | 1, 41 | 6.47 | 7.99 | ||||
| Action×AntPost×Hem | 1, 41 | 9.63 | 15.18 | 7.10 | 5.46 | 10.37 | |||
| Action×AntPost×Hem×Site | 3, 123 | 2.96 | 3.40 | 2.88 | |||||
| Action×Hem×Site | 3, 123 | 7.68 | 4.02 | 3.21 | |||||
| Lat Ant | Action×Hem | 1, 41 | 4.77 | 10.40 | 8.56 | 11.57 | |||
| Action×Hem×Site | 3, 123 | 6.93 | 4.84 | 4.08 | 3.55 | ||||
| Right Ant | Action | 1, 41 | 7.94 | 6.08 | 10.61 | 6.66 | |||
| P vs. D | Lat | Action×AntPost×Hem | 1, 41 | 5.18 | |||||
| Med | Action×AntPost | 1, 41 | 6.07 | ||||||
D = Declination, A = Answer, P = Pre-offer, Lat = lateral regions, Med = medial regions, Ant = anterior regions, Hem = hemisphere, AntPost = Anteriority.
*p <. 05
**p<.01
Fig 4Late utterance time-window.
A) Grand-averaged ERPs time-locked to the onset of the final word. Representative electrodes showing the relevant effects are highlighted in dashed boxes. B) Scalp distribution of the ERP effects in the late utterance time-window.
Regional omnibus analyses and pairwise comparisons for the late utterance time-window, for those epochs that showed significant effects.
| Analysis | Source | DF | 100–200 | 600–700 | 700–800 | 800–900 | 900–1000 | ||
|---|---|---|---|---|---|---|---|---|---|
| REGIONAL OMNIBUS | Lat | Action×AntPost | 2, 82 | 3.78 | 3.43 | 4.17 | 4.21 | ||
| Action×Site | 6, 246 | 2.65 | |||||||
| Med | Action×Site | 8, 328 | 3.47 | 2.56 | |||||
| Action×AntPost | 2, 82 | 3.57 | |||||||
| PAIRWISE COMPARISONS | D vs. A | Med | Action×AntPost | 1, 41 | 6.32 | ||||
| P vs. A | Lat | Action×AntPost | 1, 41 | 4.41 | 6.79 | 8.75 | |||
| Lat Post | Action | 1, 41 | 5.11 | ||||||
| Med | Action×AntPost | 1, 41 | 5.56 | ||||||
| Med Post | Action | 1, 41 | 4.97 | ||||||
| P vs. D | Lat | Action×AntPost | 1, 41 | 7.64 | 6.45 | 9.04 | 8.52 | ||
| Lat Post | Action | 1, 41 | 9.26 | 4.24 | 4.40 | 5.93 | |||
| Med | Action×AntPost | 1, 41 | 4.28 | ||||||
| Med Post | Action | 1, 41 | 6.31 | ||||||
D = Declination, A = Answer, P = Pre-offer, Lat = lateral regions, Med = medial regions, Post = posterior regions, AntPost = Anteriority.
*p <. 05
**p<.01