| Literature DB >> 23060839 |
Laurence White1, Sven L Mattys, Lukas Wiget.
Abstract
Multiple cues influence listeners' segmentation of connected speech into words, but most previous studies have used stimuli elicited in careful readings rather than natural conversation. Discerning word boundaries in conversational speech may differ from the laboratory setting. In particular, a speaker's articulatory effort - hyperarticulation vs. hypoarticulation (H&H) - may vary according to communicative demands, suggesting a compensatory relationship whereby acoustic-phonetic cues are attenuated when other information sources strongly guide segmentation. We examined how listeners' interpretation of segmentation cues is affected by speech style (spontaneous conversation vs. read), using cross-modal identity priming. To elicit spontaneous stimuli, we used a map task in which speakers discussed routes around stylized landmarks. These landmarks were two-word phrases in which the strength of potential segmentation cues - semantic likelihood and cross-boundary diphone phonotactics - was systematically varied. Landmark-carrying utterances were transcribed and later re-recorded as read speech. Independent of speech style, we found an interaction between cue valence (favorable/unfavorable) and cue type (phonotactics/semantics). Thus, there was an effect of semantic plausibility, but no effect of cross-boundary phonotactics, indicating that the importance of phonotactic segmentation may have been overstated in studies where lexical information was artificially suppressed. These patterns were unaffected by whether the stimuli were elicited in a spontaneous or read context, even though the difference in speech styles was evident in a main effect. Durational analyses suggested speaker-driven cue trade-offs congruent with an H&H account, but these modulations did not impact on listener behavior. We conclude that previous research exploiting read speech is reliable in indicating the primacy of lexically based cues in the segmentation of natural conversational speech.Entities:
Keywords: conversational speech; cross-modal priming; phonotactics; semantics; speech segmentation
Year: 2012 PMID: 23060839 PMCID: PMC3464055 DOI: 10.3389/fpsyg.2012.00375
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Phrase pairs used in the semantics condition.
| Low predictability | High predictability | ||||
|---|---|---|---|---|---|
| BNC count | BNC count | ||||
| 0 | – | 36 | 186 | ||
| 0 | – | 24 | 397 | ||
| 0 | – | 41 | 55 | ||
| 0 | – | 207 | 465 | ||
| 0 | – | 36 | 134 | ||
| 0 | – | 51 | 298 | ||
| 0 | – | 42 | 109 | ||
The number of occurrences and the .
Within-word and between-word diphone statistics calculated from the Buckeye corpus for the segmentation-favorable and segmentation-unfavorable diphone pairs in the phonotactics condition.
| Diphone | Buckeye counts | Between/within ratio | Favorable/unfavorable index | ||||
|---|---|---|---|---|---|---|---|
| Within-word | Between-word | ||||||
| Raw | Norm | Raw | Norm | ||||
| /mr/ | 0 | 0.00 | 108 | 0.57 | 323 | 12285 | |
| /br/ | 455 | 0.81 | 4 | 0.02 | 0.03 | ||
| /vt/ | 0 | 0.00 | 623 | 3.31 | 1862 | 11789 | |
| /st/ | 8137 | 14.48 | 430 | 2.29 | 0.16 | ||
| /sr/ | 20 | 0.04 | 250 | 1.33 | 37 | 2492 | |
| /fr/ | 595 | 2.84 | 8 | 0.04 | 0.02 | ||
| /mk/ | 0 | 0.00 | 143 | 0.76 | 427 | 2193 | |
| /Nk/ | 2132 | 3.79 | 139 | 0.74 | 0.20 | ||
| /ml/ | 27 | 0.05 | 200 | 1.06 | 22 | 1781 | |
| /bl/ | 962 | 1.71 | 4 | 0.02 | 0.01 | ||
| /tk/ | 2 | 0.00 | 822 | 4.37 | 1229 | 1618 | |
| /sk/ | 1248 | 2.22 | 317 | 1.69 | 0.76 | ||
| /tS/ | 0 | 0.00 | 289 | 1.54 | 864 | 1243 | |
| /kS/ | 215 | 0.38 | 50 | 0.27 | 0.70 | ||
The first landmark in each pair contains the segmentation-favorable diphone. Diphones are given in SAMPA transcription.
Figure 1Examples of the landmark pictograms used in the spontaneous speech map task.
Overview of the 136 cross-modal identity priming trials.
| Priming: | “TARGET” lexical status | |
|---|---|---|
| Word | Non-word | |
| Related | 28 Experimental trials, e.g.,: | 44 Fillers, e.g.,: |
| 16 Fillers, e.g.,: | ||
| Unrelated | 24 Fillers, e.g.,: | 24 Fillers, e.g.,: |
Specific examples are from the semantics condition. For related trials, examples illustrate the variable overlap between landmark phrase and target.
Proportion of correct lexical decisions (%) by condition.
| Spontaneous | Read | |||
|---|---|---|---|---|
| Favorable | Unfavorable | Favorable | Unfavorable | |
| Semantics | 99 | 99 | 98 | 97 |
Figure 2Lexical-decision latencies (and standard errors) in the semantic segmentation condition for the two speech styles: spontaneous map dialogs vs. read sentences (101 participants). Favorable semantic cues: High word bigram frequency (e.g., oil tanker). Unfavorable semantic cues: Low word bigram frequency (e.g., seal tanker).
Figure 3Lexical-decision latencies (and standard errors) in the phonotactic segmentation condition for the two speech styles: spontaneous map dialogs vs. read sentences (117 participants). Favorable phonotactic cues: low within-word frequency of cross-boundary diphone (e.g., cream rickshaw). Unfavorable phonotactic cues: high within-word frequency of cross-boundary diphone (e.g., drab rickshaw).
Mean durations in milliseconds (standard errors in brackets) for constituents of the second word of landmark phrases in the semantics condition.
| Spontaneous | Read | |||
|---|---|---|---|---|
| Favorable | Unfavorable | Favorable | Unfavorable | |
| Onset consonant | 78 (5) | 85 (6) | 87 (6) | 94 (6) |
| Stressed vowel | 143 (12) | 143 (10) | 141 (14) | 150 (14) |
| Whole word | 389 (43) | 392 (38) | 386 (39) | 400 (32) |
E.g., .
Mean durations in milliseconds (standard errors in brackets) for constituents of the second word of landmark phrases in the phonotactics condition: e.g., .
| Spontaneous | Read | |||
|---|---|---|---|---|
| Favorable | Unfavorable | Favorable | Unfavorable | |
| Onset consonant | 88 (5) | 92 (5) | 90 (5) | 97 (6) |
| Stressed vowel | 91 (7) | 103 (8) | 98 (7) | 98 (7) |
| Whole word | 433 (22) | 449 (21) | 428 (19) | 437 (19) |