| Literature DB >> 25019156 |
Xiaohong Yang1, Xiangrong Shen2, Weijun Li1, Yufang Yang1.
Abstract
The presence of an intonational phrase boundary is often marked by three major acoustic cues: pause, final lengthening, and pitch reset. The present study investigates how these three acoustic cues are weighted in the perception of intonational phrase boundaries in two experiments. Sentences that contained two intonational phrases with a critical boundary between them were used as the experimental stimuli. The roles of the three acoustic cues at the critical boundary were manipulated in five conditions. The first condition featured none of the acoustic cues. The following three conditions featured only one cue each: pause, final lengthening, and pitch reset, respectively. The fifth condition featured both pause duration and pre-final lengthening. A baseline condition was also included in which all three acoustic cues were preserved intact. Listeners were asked to detect the presence of the critical boundaries in Experiment 1 and judge the strength of the critical boundaries in Experiment 2. The results of both experiments showed that listeners used all three acoustic cues in the perception of prosodic boundaries. More importantly, these acoustic cues were weighted differently across the two experiments: Pause was a more powerful perceptual cue than both final lengthening and pitch reset, with the latter two cues perceptually equivalent; the effect of pause and the effects of the other two acoustic cues were not additive. These results suggest that the weighting of acoustic cues contributes significantly to the perceptual differences of intonational phrase boundary.Entities:
Mesh:
Year: 2014 PMID: 25019156 PMCID: PMC4096911 DOI: 10.1371/journal.pone.0102166
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The procedures used to create the experimental conditions.
Means of the acoustic parameters for the six conditions (with standard deviations in parentheses).
| Conditions | Pause duration(s) | Syllable duration(s) | F0 reset (st) |
| No-cue | ---- | 0.22 (0.05) | -0.42 (4.82)— |
| Pause | 0.27 (0.11) | 0.22 (0.05) | -0.42 (4.82) |
| Final lengthening | ---- | 0.26 (0.04) | —0.42 (4.82) |
| Pitch reset | ---- | 0.22 (0.05) | 4.55 (4.68) |
| Pause + final lengthening | 0.27 (0.11) | 0.26 (0.04) | -0.42 (4.82) |
| Baseline | 0.27 (0.11) | 0.26 (0.04) | 4.55 (4.68) |
Note: Pause duration was measured as the duration of the silent interval at the IPBs. Final lengthening was measured as the duration of the pre-boundary syllable. Pitch reset was measured as the mean f0 differences between the two words before and after the boundaries. Pitch values were transformed into semitones through the following equation: St = 12 log2 (f0/f0ref). F0ref was determined to be 70 Hz since the speaker for the experimental material was male in the present study [39].
Figure 2Proportions of the boundaries detected in the six experimental conditions (Error bars represent standard error of the mean).
Naturalness rating scores for the six conditions.
| Conditions | Means | Standard deviations |
| No-cue | 3.49 | 1.23 |
| Pause | 5.04 | 0.77 |
| Final lengthening | 4.12 | 0.97 |
| Pitch reset | 4.17 | 1.03 |
| Pause + final lengthening | 5.20 | 0.63 |
| Baseline | 5.45 | 0.81 |
Figure 3Proportions of the boundaries detected with naturalness rating as covariate.
Figure 4Perceived boundary strength across the six experimental conditions.
Figure 5Perceived boundary strength with naturalness rating as covariate.