| Literature DB >> 26029125 |
Seán G Roberts1, Francisco Torreira1, Stephen C Levinson1.
Abstract
The timing of turn taking in conversation is extremely rapid given the cognitive demands on speakers to comprehend, plan and execute turns in real time. Findings from psycholinguistics predict that the timing of turn taking is influenced by demands on processing, such as word frequency or syntactic complexity. An alternative view comes from the field of conversation analysis, which predicts that the rules of turn-taking and sequence organization may dictate the variation in gap durations (e.g., the functional role of each turn in communication). In this paper, we estimate the role of these two different kinds of factors in determining the speed of turn-taking in conversation. We use the Switchboard corpus of English telephone conversation, already richly annotated for syntactic structure speech act sequences, and segmental alignment. To this we add further information including Floor Transfer Offset (the amount of time between the end of one turn and the beginning of the next), word frequency, concreteness, and surprisal values. We then apply a novel statistical framework ("random forests") to show that these two dimensions are interwoven together with indexical properties of the speakers as explanatory factors determining the speed of response. We conclude that an explanation of the of the timing of turn taking will require insights from both processing and sequence organization.Entities:
Keywords: concreteness; frequency; processing; random forests; sequence organization; surprisal; turn-taking
Year: 2015 PMID: 26029125 PMCID: PMC4429583 DOI: 10.3389/fpsyg.2015.00509
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
The NXT dialog act categories and how they map onto sequence organization types.
| decl-q | Declarative Wh-Question | answer,statement | Y | |||
| open | Conventional-opening | Y | ||||
| open-q | Open-Question | neg,affirm,no,yes,statement,reject | Y | |||
| or | Or-Clause | neg,affirm,no,yes,statement,reject | Y | |||
| repeat-q | Signal-non-understanding | Y | ||||
| sum | Summarize/Reformulate | Y | ||||
| tag-q | Tag-Question | neg,affirm,no,yes,statement,reject | Y | |||
| wh-q | Wh-Question | answer,statement,reject | Y | |||
| yn-q | Yes-No-Question | yes,no,affirm,neg,statement | Y | |||
| yn-decl-q | Declarative Yes-No-Question | yes,affirm,statement | Y | |||
| acknowledge | Response Acknowledgment | Y | Y | |||
| backchannel | Backchannel | Y | Y | |||
| backchannel-q | Backchannel as question | Y | Y | |||
| ans-dispref | Dispreferred answers | Y | Neg | |||
| hedge | Hedge | Y | Neg | |||
| maybe | Maybe/Accept-part | Y | Neg | |||
| neg | Negative non-no answers | Y | Neg | |||
| no | No answers | Y | Neg | |||
| reject | Reject | Y | Neg | |||
| affirm | Affirmative non-yes answers | Y | Pos | |||
| agree | Agree/Accept | Y | Pos | |||
| answer | Other answers | Y | Pos | |||
| yes | Yes answers | accept | Y | Pos | ||
| apprec | Appreciation | Y | ||||
| abandon | Abandoned or Turn-Exit | |||||
| apology | Apology | agree,downplay | ||||
| close | Conventional-closing | close | ||||
| commit | Offers, Options, and Commits | |||||
| completion | Collaborative Completion | |||||
| directive | Action-directive | |||||
| downplay | Downplayer | |||||
| excluded | Excluded - bad segmentation | |||||
| hold | Hold before response | |||||
| opinion | Statement-opinion | agree,opinion,disagree,accept | ||||
| other | Other | |||||
| third-pty | 3rd-party-talk | |||||
| quote | Quotation | |||||
| repeat | Repeat-phrase | agree | ||||
| rhet-q | Rhetorical-Questions | agree | ||||
| self-talk | Self-Talk | |||||
| statement | Statement-non-opinion | statement | ||||
| thank | Thanking | downplay | ||||
| uninterp | Uninterpretable | |||||
Figure 1A decision tree splitting FTO data into groups by various measures of sequence organization.
Figure 2A decision tree splitting data into gaps and overlaps by measures of sequence organization and processing.
Figure 3The distribution of floor transfer offsets (the gap between two turns) for the Switchboard data.
Figure 4Variable importance in a random forest analysis of floor transfer offset. The dotted red line shows the absolute smallest value, which can be used as a baseline for spurious effects. Measures of processing appear as circles (black labels) and measures of sequence organization appear as triangles (orange labels). Other measures appear as crosses (purple labels).
The Pearson correlation between processing measures and FTO for T1 and T2.
| Concreteness | 0.028 | −0.004 |
| Mean frequency | −0.010 | 0.024 |
| Speech rate | −0.091 | −0.008 |
| Information uniformity | −0.009 | −0.004 |
| Turn duration | 0.043 | 0.025 |
| Surprisal | −0.003 | −0.012 |
| Number of clauses | 0.026 | −0.019 |
| Syntax tree height | 0.065 | 0.012 |
| Contextual diversity | −0.027 | −0.014 |
Figure 5The relationship between FTO and T1 and T2 duration. The data is grouped into 500 ms bins. Circles represent the mean of the bin, with bars showing the 95% confidence intervals.
Figure 6Mean FTOs between turns with different kind of sequential actions (responding and initiating).