| Literature DB >> 26134279 |
William L Schuerman1, Antje Meyer1, James M McQueen2.
Abstract
In different tasks involving action perception, performance has been found to be facilitated when the presented stimuli were produced by the participants themselves rather than by another participant. These results suggest that the same mental representations are accessed during both production and perception. However, with regard to spoken word perception, evidence also suggests that listeners' representations for speech reflect the input from their surrounding linguistic community rather than their own idiosyncratic productions. Furthermore, speech perception is heavily influenced by indexical cues that may lead listeners to frame their interpretations of incoming speech signals with regard to speaker identity. In order to determine whether word recognition evinces similar self-advantages as found in action perception, it was necessary to eliminate indexical cues from the speech signal. We therefore asked participants to identify noise-vocoded versions of Dutch words that were based on either their own recordings or those of a statistically average speaker. The majority of participants were more accurate for the average speaker than for themselves, even after taking into account differences in intelligibility. These results suggest that the speech representations accessed during perception of noise-vocoded speech are more reflective of the input of the speech community, and hence that speech perception is not necessarily based on representations of one's own speech.Entities:
Mesh:
Year: 2015 PMID: 26134279 PMCID: PMC4489924 DOI: 10.1371/journal.pone.0129731
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Average proportion of correct answers (Accuracy) and Levenshtein Distance for each word list, according to Talker, with standard error bars.
For Levenshtein Distance, higher scores indicate greater inaccuracy.
Fig 2Average Levenshtein Distance for stimuli produced by the participant (Self) and by the Average Speaker.
Higher scores indicate greater inaccuracy.
Results of the binomial and truncated Poisson mixed-effects regression analyses.
| Binomial | Truncated Poisson | |||||||
|---|---|---|---|---|---|---|---|---|
| Fixed Effects: | Estimate | Std. Error | Z-value | Pr(> ∣ | Estimate | Std. Error | Z-value | Pr(> ∣ |
| (Intercept) | -0.530 | 0.194 | -2.73 | 0.0063 | 0.874 | 0.066 | 13.24 | <2e 16 |
| Word Difficulty (Hard) | 1.15 | 0.253 | 4.55 | <5.3e-06 | -0.312 | 0.080 | -3.79 | 0.0001 |
| Talker (Self) | 0.264 | 0.114 | 2.32 | 0.0203 | 0.075 | 0.036 | 2.08 | 0.0379 |
Fixed effects reported in reference to Talker (Average Speaker) and Word Difficulty (Easy).
* p <.05,
** p <.01,
*** p <.001.
Fig 3Intelligibility ratings by Levenshtein Distance for each speaker (Experiment 2) plotted against that speaker’s inaccuracy for Self-produced stimuli (Experiment 1), with regression line.
Results of the binomial and truncated Poisson mixed-effects regression analyses.
| Binomial | Truncated Poisson | |||||||
|---|---|---|---|---|---|---|---|---|
| Fixed Effects: | Estimate | Std. Error | Z-value | Pr(> ∣ | Estimate | Std. Error | Z-value | Pr(> ∣ |
| (Intercept) | -1.722 | 0.396 | -4.35 | <1.42e 05 | 0.579 | 0.148 | 3.93 | <8.64e 05 |
| Intelligibility | 0.734 | 0.213 | 3.45 | 0.00057 | 0.181 | 0.080 | 2.25 | 0.02 |
| Word Difficulty (Hard) | 1.156 | 0.254 | 4.56 | <5.2e-06 | -0.310 | 0.080 | -3.87 | 0.00011 |
| Talker (Self) | 0.267 | 0.105 | 2.54 | 0.011 | 0.073 | 0.036 | 2.02 | 0.04 |
Intelligibility ratings are based on Levenshtein Distance, therefore greater values indicate that the speaker is less intelligible.
* p <.05,
** p <.01,
*** p <.001.