| Literature DB >> 28870139 |
Marco van de Ven1, Mirjam Ernestus1.
Abstract
In natural conversations, words are generally shorter and they often lack segments. It is unclear to what extent such durational and segmental reductions affect word recognition. The present study investigates to what extent reduction in the initial syllable hinders word comprehension, which types of segments listeners mostly rely on, and whether listeners use word duration as a cue in word recognition. We conducted three experiments in Dutch, in which we adapted the gating paradigm to study the comprehension of spontaneously uttered conversational speech by aligning the gates with the edges of consonant clusters or vowels. Participants heard the context and some segmental and/or durational information from reduced target words with unstressed initial syllables. The initial syllable varied in its degree of reduction, and in half of the stimuli the vowel was not clearly present. Participants gave too short answers if they were only provided with durational information from the target words, which shows that listeners are unaware of the reductions that can occur in spontaneous speech. More importantly, listeners required fewer segments to recognize target words if the vowel in the initial syllable was absent. This result strongly suggests that this vowel hardly plays a role in word comprehension, and that its presence may even delay this process. More important are the consonants and the stressed vowel.Entities:
Keywords: Acoustic reduction; gating; phonetic detail; speech perception; word recognition
Mesh:
Year: 2017 PMID: 28870139 PMCID: PMC6099978 DOI: 10.1177/0023830917727774
Source DB: PubMed Journal: Lang Speech ISSN: 0023-8309 Impact factor: 1.500
An overview of the segments provided in gates 1–4 (top line), and their average durations (bottom two lines), for tokens in which the first unstressed vowel was acoustically present or absent (exemplified by two tokens for the target word principe: [pə’sipə] and [p’sipə].
| Stimulus type | Gate 1 | Gate 2 | Gate 3 | Gate 4 |
|---|---|---|---|---|
| Vowel present | ∅ | 74.33 ([p]) | 109.31 ([pə]) | 176.37 ([pə’s]) |
| Vowel absent | ∅ | 128.84 ([p’s])[ | 209.48 ([p’si])[ | 271.64 ([p’sip])[ |
C, consonant cluster; V, vowel. Significance values were obtained by applying t-tests comparing the durations of the target words with and without first vowel presence. * = p < 0.05 ** = p < 0.01 *** = p < 0.001.
An overview of the distribution of tokens across speakers for the stimuli used in this study.
| Number of tokens | Number of speakers |
|---|---|
| 1 | 2 |
| 2 | 5 |
| 3 | 7 |
| 4 | 3 |
| 5 | 2 |
| 8 | 1 |
Absolute numbers (and percentages) of target words with different types of reduction in the initial consonant cluster, broken for the phonotactic well-formedness of this cluster.
| Missing segments | Phonotactically legal | Phonotactically illegal |
|---|---|---|
| None | 13 (20.63%) | 6 (9.52%) |
| Vowel only | 10 (15.87%) | 2 (3.17%) |
| Consonants only | 5 (7.94%) | – |
| Vowel + consonants | 18 (28.57%) | 9 (14.29%) |
Figure 1.Spectrograms and transcriptions for two tokens of the word principe “principle.” (a) with the first unstressed vowel; (b) without the first unstressed vowel.
Results for the statistical analysis for Experiment 1.
| Predictor |
|
|
| Variance explained |
|
| |
|---|---|---|---|---|---|---|---|
| Fixed effects | |||||||
| Intercept | 3.122 | 6.21 | <.0001 | ||||
| Gate (gate 2) | −1.978 | −6.81 | <.0001 | ||||
| Preceding bigram frequency | −0.004 | −2.29 | <.05 | ||||
| Random effects | |||||||
| Participant | 0.178 | 6.59 | <.05 | ||||
| Target token | 9.085 | 519.48 | <.0001 | ||||
| Target token: gate | 1.953 | 13.98 | <.001 | ||||
Percentage of contextually appropriate incorrect responses, and percentages of correct first segments, final segments, and number of syllables for the incorrect responses in Experiment 1, broken down by the gate and whether the initial unstressed vowel was realized.
| Gate | Vowel realized | Contextually appropriate | 1st segment | 1st and 2nd segment | 1st–3rd segment | Final segment | Number of syllables |
|---|---|---|---|---|---|---|---|
| Gate 1 | No | 34.56% | 8.05% | 1.68% | 0.34% | 19.13% | 11.74% |
| Gate 1 | Yes | 59.79% | 6.70% | 1.03% | 0.00% | 22.68% | 14.43% |
| Gate 2 | No | 32.48% | 36.31% | 14.01% | 7.01% | 15.29% | 12.74% |
| Gate 2 | Yes | 41.44% | 34.25% | 6.08% | 2.76% | 25.41% | 20.44% |
The percentages correct for target words in Experiment 2, broken down by whether the initial unstressed vowel was realized, and by gate.
| Gate | Vowel realized | Vowel absent |
|---|---|---|
| Gate 1 | 18.99% | 18.91% |
| Gate 2 | 25.66% | 45.02% |
Results for the statistical analysis comparing Experiments 1 and 2.
| Predictor |
|
|
| Variance explained |
|
| |
|---|---|---|---|---|---|---|---|
| Fixed effects | |||||||
| Intercept | 3.884 | 6.66 | <.0001 | ||||
| Experiment (experiment 2) | 0.561 | 2.51 | <.05 | ||||
| Gate (gate 2) | −2.288 | −7.51 | <.0001 | ||||
| Preceding bigram frequency | −0.004 | −1.97 | <.05 | ||||
| Experiment (experiment 2): preceding bigram frequency | 0.001 | 2.32 | <.05 | ||||
| Random effects | |||||||
| Participant | 0.244 | 20.02 | <.0001 | ||||
| Target type | 3.835 | 4.87 | <.05 | ||||
| Target token | 6.941 | 336.79 | <.0001 | ||||
| Target token: gate | 3.081 | 67.57 | <.0001 | ||||
Percentage of contextually appropriate incorrect responses, and percentages of correct first segments, final segments, and number of syllables for the incorrect responses in Experiment 2, broken down by the gate and whether the initial unstressed vowel was realized.
| Gate | Vowel realized | Contextually appropriate | 1st segment | 1st and 2nd segment | 1st–3rd segment | Final segment | Number of syllables |
|---|---|---|---|---|---|---|---|
| Gate 1 | No | 25.90% | 7.19% | 2.88% | 2.16% | 18.71% | 8.99% |
| Gate 1 | Yes | 39.85% | 8.81% | 2.30% | 0.00% | 16.86% | 17.24% |
| Gate 2 | No | 28.24% | 30.59% | 12.94% | 11.18% | 19.41% | 8.82% |
| Gate 2 | Yes | 43.98% | 27.78% | 6.48% | 1.85% | 23.15% | 15.28% |
The percentages correct for tokens in which listeners either heard the unstressed or stressed vowel from the target words in gates 3 and 4.
| Gate | Unstressed vowel | Stressed vowel |
|---|---|---|
| Gate 3 | 40.19% | 61.03% |
| Gate 4 | 55.59% | 81.10% |
Statistical results for gate 2 and gate 3 in Experiments 1 and 3 respectively.
| Predictor |
|
|
| Variance explained |
|
| |
|---|---|---|---|---|---|---|---|
| Fixed effects | |||||||
| Intercept | 2.530 | 4.46 | <.001 | ||||
| Gate (gate 3) | −1.199 | −3.20 | <.01[ | ||||
| Preceding bigram frequency | −0.008 | −3.23 | <.01[ | ||||
| First vowel presence (no.) | −2.308 | −3.70 | <.001 | ||||
| Gate (gate 3): preceding bigram frequency | 0.004 | 2.93 | <.01 | ||||
| Preceding bigram frequency: first vowel presence (no.) | 0.006 | 2.43 | <.05 | ||||
| Random effects | |||||||
| Participant | 0.209 | 8.43 | <.01 | ||||
| Target token | 7.173 | 339.54 | <.0001 | ||||
| Target token: gate | 3.071 | 39.74 | <.0001 | ||||
Statistical results for Experiment 3.
| Predictor |
|
|
| Variance explained |
|
| |
|---|---|---|---|---|---|---|---|
| Fixed effects | |||||||
| Intercept | 1.280 | 2.55 | <.01 | ||||
| Gate (gate 4) | −1.409 | −3.89 | <.001 | ||||
| First vowel presence (no.) | −2.368 | −3.61 | <.001 | ||||
| Preceding bigram frequency | −0.004 | −1.94 | n.s. | ||||
| Gate (gate 4): first vowel presence | 0.781 | 1.56 | n.s. | ||||
| Gate (gate 4): preceding bigram frequency | 0.002 | 0.93 | n.s. | ||||
| First vowel presence (no.): preceding bigram frequency | 0.006 | 2.45 | <.05 | ||||
| Gate (gate 4): first vowel presence (no.): preceding bigram frequency | −0.006 | −2.63 | <.01 | ||||
| Random effects | |||||||
| Participant | 0.015 | 8.45 | <.01 | ||||
| Participant: first vowel presence | 0.294 | 8.27 | <.05 | ||||
| Target token | 3.740 | 229.90 | <.0001 | ||||
| Target token: gate | 1.198 | 13.69 | <.01 | ||||
Percentage of contextually appropriate incorrect responses, and percentages of correct first segments, final segments, and number of syllables for the incorrect responses in Experiment 3, broken down by the gate and whether the initial unstressed vowel was realized.
| Gate | Vowel realized | Contextually appropriate | 1st segment | 1st and 2nd segment | 1st–3rd segment | Final segment | Number of syllables |
|---|---|---|---|---|---|---|---|
| Gate 3 | No | 14.53% | 20.51% | 9.40% | 4.27% | 21.37% | 13.68% |
| Gate 3 | Yes | 48.08% | 35.10% | 11.54% | 7.21% | 21.15% | 22.12% |
| Gate 4 | No | 18.87% | 13.21% | 13.21% | 11.32% | 30.19% | 15.09% |
| Gate 4 | Yes | 43.07% | 32.12% | 14.60% | 8.76% | 25.55% | 18.98% |