| Literature DB >> 26283946 |
Briony Banks1, Emma Gowen2, Kevin J Munro1, Patti Adank3.
Abstract
Perceptual adaptation allows humans to recognize different varieties of accented speech. We investigated whether perceptual adaptation to accented speech is facilitated if listeners can see a speaker's facial and mouth movements. In Study 1, participants listened to sentences in a novel accent and underwent a period of training with audiovisual or audio-only speech cues, presented in quiet or in background noise. A control group also underwent training with visual-only (speech-reading) cues. We observed no significant difference in perceptual adaptation between any of the groups. To address a number of remaining questions, we carried out a second study using a different accent, speaker and experimental design, in which participants listened to sentences in a non-native (Japanese) accent with audiovisual or audio-only cues, without separate training. Participants' eye gaze was recorded to verify that they looked at the speaker's face during audiovisual trials. Recognition accuracy was significantly better for audiovisual than for audio-only stimuli; however, no statistical difference in perceptual adaptation was observed between the two modalities. Furthermore, Bayesian analysis suggested that the data supported the null hypothesis. Our results suggest that although the availability of visual speech cues may be immediately beneficial for recognition of unfamiliar accented speech in noise, it does not improve perceptual adaptation.Entities:
Keywords: accented speech; audiovisual speech; multisensory perception; perceptual adaptation; speech perception
Year: 2015 PMID: 26283946 PMCID: PMC4522556 DOI: 10.3389/fnhum.2015.00422
Source DB: PubMed Journal: Front Hum Neurosci ISSN: 1662-5161 Impact factor: 3.169
Phonetic description of the novel accent.
| IPA | Example |
|---|---|
| ɪ → ɛ | s |
| ɛ → ɪ | b |
| æ → ɛ | h |
| ∧ → ʊ | c |
| зː → ɛə | g |
| aː → ɔː | d |
| ɒ → ɔː | h |
| ɔː | door |
| uː | f |
| ʊ | g |
| ə | moth |
| iː | tr |
| ɛə → зː | h |
| əʊ → aʊ | v |
| aʊ → uː | h |
| ɛɪ → aɪ | w |
| aɪ → ɔɪ | m |
| ɪə | h |
| ɔɪ | j |
Mean SRTs in dB per training group (Study 1).
| Familiar accent | Novel accent | |||||||
|---|---|---|---|---|---|---|---|---|
| Baseline SRT | Pre-training SRT | Post-training SRT | ||||||
| Training group | SD | SD | SD | |||||
| Audiovisual | 0.4 | 1.68 | 7.6 | 2.33 | 5.0 | 2.66 | ||
| Audio-only | 0.2 | 1.42 | 7.9 | 2.36 | 5.0 | 1.88 | ||
| Visual | 0.6 | 1.75 | 7.8 | 2.56 | 6.0 | 1.89 | ||
| Audiovisual + noise | 0.9 | 2.19 | 7.4 | 3.13 | 5.9 | 3.46 | ||
| Audio-only + noise | 0.7 | 1.45 | 7.9 | 2.28 | 5.0 | 2.08 | ||
| All groups | 0.6 | 1.70 | 7.7 | 2.50 | 5.4 | 2.47 | ||
Bayes factor (B) for comparisons of adaptation between groups in Study 1.
| Group | AV | Audio | Visual | AV + noise |
|---|---|---|---|---|
| AV | – | |||
| Audio | 0.21* | – | ||
| Visual | 0.43 | 0.90 | – | |
| AV + noise | 0.85 | 1.68 | 0.24* | – |
| Audio + noise | 0.20* | 0.14* | 1.10 | 1.97 |