| Literature DB >> 24416026 |
Laura C Dilley1, Tuuli H Morrill1, Elina Banzina2.
Abstract
Recent findings [Dilley and Pitt, 2010. Psych. Science. 21, 1664-1670] have shown that manipulating context speech rate in English can cause entire syllables to disappear or appear perceptually. The current studies tested two rate-based explanations of this phenomenon while attempting to replicate and extend these findings to another language, Russian. In Experiment 1, native Russian speakers listened to Russian sentences which had been subjected to rate manipulations and performed a lexical report task. Experiment 2 investigated speech rate effects in cross-language speech perception; non-native speakers of Russian of both high and low proficiency were tested on the same Russian sentences as in Experiment 1. They decided between two lexical interpretations of a critical portion of the sentence, where one choice contained more phonological material than the other (e.g., /str'na/ "side" vs. /str'na/ "country"). In both experiments, with native and non-native speakers of Russian, context speech rate and the relative duration of the critical sentence portion were found to influence the amount of phonological material perceived. The results support the generalized rate normalization hypothesis, according to which the content perceived in a spectrally ambiguous stretch of speech depends on the duration of that content relative to the surrounding speech, while showing that the findings of Dilley and Pitt (2010) extend to a variety of morphosyntactic contexts and a new language, Russian. Findings indicate that relative timing cues across an utterance can be critical to accurate lexical perception by both native and non-native speakers.Entities:
Keywords: distal speech rate; lexical perception; non-native perception; speech recognition; word segmentation
Year: 2013 PMID: 24416026 PMCID: PMC3875230 DOI: 10.3389/fpsyg.2013.01002
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Stimulus sentences.
Figure 1Spectrograms of an example stimulus item for each of the Time Manipulation conditions. (A) Represents the Unaltered condition. Vertical white lines in spectrograms delineate Target and Context portions of speech material. Arrows beneath schematic waveforms positioned below spectrograms indicate which portion of the stimulus item was subject to time alteration for Time Manipulation conditions (B–E). Arrows pointing outward indicate time-expansion, while arrows pointing inward indicate time-compression.
Figure 2Rate of “Long” responses to lexical sequences in each experimental stimulus in Experiment 1.
A log mixed-effects model with coefficient estimates, standard errors, Wald's .
| (Intercept— | 2.05 | 0.49 | 4.23 | |
| Context expanded | −3.19 | 0.48 | −6.60 | |
| Target compressed | −2.48 | 0.46 | −5.38 | |
| Context compressed | 0.32 | 0.51 | 0.63 | |
| Target expanded | 0.75 | 0.56 | 1.34 |
Figure 3Rate of “Long” responses to lexical sequences in each experimental stimulus for three Proficiency groups in the two-alternative task of Experiment 2.
Figure 4Rate of “Long” responses to lexical sequences for three Proficiency groups in the Unaltered condition of the two-alternative task in Experiment 2.
A log mixed-effects model with coefficient estimates, standard errors, Wald's .
| (Intercept— | 1.36 | 0.42 | 3.26 | |
| Context expanded | −1.90 | 0.43 | −4.37 | |
| Target compressed | −1.84 | 0.43 | −4.27 | |
| Context compressed | 0.46 | 0.47 | 0.98 | 0.33 |
| Target expanded | 0.10 | 0.44 | −0.22 | 0.82 |
| High proficiency | 0.95 | 0.56 | 1.70 | 0.09 |
| Native proficiency | 1.98 | 0.70 | 2.84 | |
| −0.36 | 0.70 | −0.51 | 0.61 | |
| 0.31 | 0.70 | −0.45 | 0.66 | |
| −0.56 | 0.79 | −0.71 | 0.48 | |
| −0.56 | 0.74 | −0.76 | 0.45 | |
| −2.30 | 0.82 | −2.82 | ||
| −1.03 | 0.81 | −1.28 | 0.20062 | |
| −0.52 | 1.00 | −0.53 | 0.59883 | |
| −1.29 | 0.84 | −1.52 | 0.12750 | |
The Estimates for the Time Manipulation conditions represent the baseline (Low Proficiency).