| Literature DB >> 22065961 |
Chiara Gambi1, Martin J Pickering.
Abstract
Dialog partners coordinate with each other to reach a common goal. The analogy with other joint activities has sparked interesting observations (e.g., about the norms governing turn-taking) and has informed studies of linguistic alignment in dialog. However, the parallels between language and action have not been fully explored, especially with regard to the mechanisms that support moment-by-moment coordination during language use in conversation. We review the literature on joint actions to show (i) what sorts of mechanisms allow coordination and (ii) which types of experimental paradigms can be informative of the nature of such mechanisms. Regarding (i), there is converging evidence that the actions of others can be represented in the same format as one's own actions. Furthermore, the predicted actions of others are taken into account in the planning of one's own actions. Similarly, we propose that interlocutors are able to coordinate their acts of production because they can represent their partner's utterances. They can then use these representations to build predictions, which they take into account when planning self-generated utterances. Regarding (ii), we propose a new methodology to study interactive language. Psycholinguistic tasks that have traditionally been used to study individual language production are distributed across two participants, who either produce two utterances simultaneously or complete each other's utterances.Entities:
Keywords: coordination; joint action; prediction; shared representations
Year: 2011 PMID: 22065961 PMCID: PMC3206582 DOI: 10.3389/fpsyg.2011.00275
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1Simultaneous production. A produces the word kite in response to the picture of a kite. sem, syn, phon are semantic, syntactic, and phonological representations for A’s utterance. t(sem), t(syn), t(phon) indicate the actual time elapsed from picture onset (in ms) when processing is completed at each stage and the corresponding representation has been built (based on Indefrey and Levelt, 2004); t(art) marks the onset of A’s utterance. (sem), (syn), (phon), and (art) are timing estimates computed by A for her own utterance. (sem), (syn), (phon), (art) are the content predictions for A’s own utterance, on which the timing estimates are based. A believes that B is speaking in response to the same picture. Dotted lines refer to representations of the other. (sem), (syn), (phon), and (art) are timing estimates computed by A for B’s utterance. (sem), (syn), (phon), (art) are A’s content predictions, at the various processing stages, for B’s utterance. Horizontal arrows [from (sem) to (sem), from (syn) to (syn), etc.] indicate that estimates of the timing at each level are based on content predictions at the same level. Timing estimates at one level could also be directly based on content estimates at other levels, but we ignore this here for simplicity. Vertical arrows from self- and other-predictions to planning represent the integration stage.
Figure 2Consecutive utterances: pictures of a wig and a carrot appear simultaneously. (A) JOINT: A names the left picture, then B names the right picture. (B) SOLO: A names the left picture, then A names the right picture. (C) NO: A names the left picture. Where two utterances are produced, we indicate the temporal relation between them by way of number subscripts (1 for the first utterance, 2 for the second utterance). In (A) art stands for the articulation stage of B’s utterance and (sem) is the semantic content prediction that A generates in relation to B’s utterance. Time in ms. All other details as in Figure 1.