| Literature DB >> 22970141 |
Kristin J Van Engen1, Bharath Chandrasekaran, Rajka Smiljanic.
Abstract
Extensive research shows that inter-talker variability (i.e., changing the talker) affects recognition memory for speech signals. However, relatively little is known about the consequences of intra-talker variability (i.e. changes in speaking style within a talker) on the encoding of speech signals in memory. It is well established that speakers can modulate the characteristics of their own speech and produce a listener-oriented, intelligibility-enhancing speaking style in response to communication demands (e.g., when speaking to listeners with hearing impairment or non-native speakers of the language). Here we conducted two experiments to examine the role of speaking style variation in spoken language processing. First, we examined the extent to which clear speech provided benefits in challenging listening environments (i.e. speech-in-noise). Second, we compared recognition memory for sentences produced in conversational and clear speaking styles. In both experiments, semantically normal and anomalous sentences were included to investigate the role of higher-level linguistic information in the processing of speaking style variability. The results show that acoustic-phonetic modifications implemented in listener-oriented speech lead to improved speech recognition in challenging listening conditions and, crucially, to a substantial enhancement in recognition memory for sentences.Entities:
Mesh:
Year: 2012 PMID: 22970141 PMCID: PMC3436755 DOI: 10.1371/journal.pone.0043753
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Waveforms and spectrograms of one meaningful sentence (top panels) and one anomalous sentences (bottom panels), each produced in both conversational (left panels) and clear (right panels) speaking styles.
Each panel display represents 2.5 seconds.
Acoustic measures of sentence materials by speaking style and material type.
| Mean (SD) | Clear speech: Anomalous | Conversational Speech: Anomalous | Clear speech: Meaningful | Conversational Speech: Meaningful |
|
| 2.87 (.44) | 1.42 (.12) | 3.21 (.35) | 1.55 (.14) |
|
| 170.85 (7.91) | 160.92 (8.68) | 167.42 (7.90) | 161.24 (8.85) |
|
| 157.90 (91.02) | 124.02 (100.97) | 215.63 (122.69) | 136.79 (108.25) |
|
| 23.22 (2.20) | 23.10 (2.19) | 22.17 (2.63) | 22.61 (2.52) |
Figure 2Average proportion of keywords identified from semantically anomalous and meaningful sentences produced in clear and conversational speaking styles.
Error bars represent standard error.
Results of the linear mixed effects logistic regression on intelligibility data for all sentences.
| Fixed effects: | Estimate | Std. Error | z value | Pr(>|z|) |
|
| 3.3184 | 0.4619 | 7.184 | 6.75e-13 *** |
|
| 3.9740 | 0.9123 | 4.356 | 1.33e-05 *** |
|
| −1.4941 | 0.1758 | −8.496 | <2e-16 *** |
|
| −1.0556 | 0.3284 | −3.214 | 0.00131 ** |
Results of the linear mixed effects logistic regression on intelligibility data for anomalous sentences.
| Anomalous Sentences Fixed effects: | Estimate | Std. Error | z value | Pr(>|z|) |
|
| 1.3805 | 0.1364 | 10.122 | <2e-16 *** |
|
| −0.9903 | 0.2259 | −4.383 | 1.17e-05 *** |
Results of the linear mixed effects logistic regression on intelligibility data for meaningful sentences.
| Meaningful Sentences Fixed effects: | Estimate | Std. Error | z value | Pr(>|z|) |
|
| 8.0046 | 1.7574 | 4.555 | 5.24e-06 *** |
|
| −1.8616 | 0.3759 | −4.952 | 7.34e-07 *** |
Calculated hit rates, false alarm rates, d′, and C values for the recognition memory test.
| Conversational Speech | Clear speech | |||||||
| Hit Rate | False Alarm Rate | d′ | C | Hit Rate | False Alarm Rate | d′ | C | |
|
| 0.67 | 0.31 | 0.96 | 0.02 | 0.64 | 0.21 | 1.19 | 0.23 |
|
| 0.63 | 0.34 | 0.95 | 0.04 | 0.56 | 0.22 | 1.24 | 0.27 |
|
| 0.69 | 0.25 | 1.16 | 0.08 | 0.70 | 0.15 | 1.56 | 0.26 |
|
| 0.73 | 0.25 | 1.39 | 0.04 | 0.64 | 0.13 | 1.56 | 0.39 |
Figure 3Average d′ scores in both testing blocks for semantically anomalous and meaningful sentences produced in clear and conversational speaking styles.
Error bars represent standard error.