Literature DB >> 25844873

Robust speech perception: recognize the familiar, generalize to the similar, and adapt to the novel.

Dave F Kleinschmidt1, T Florian Jaeger2.   

Abstract

Successful speech perception requires that listeners map the acoustic signal to linguistic categories. These mappings are not only probabilistic, but change depending on the situation. For example, one talker's /p/ might be physically indistinguishable from another talker's /b/ (cf. lack of invariance). We characterize the computational problem posed by such a subjectively nonstationary world and propose that the speech perception system overcomes this challenge by (a) recognizing previously encountered situations, (b) generalizing to other situations based on previous similar experience, and (c) adapting to novel situations. We formalize this proposal in the ideal adapter framework: (a) to (c) can be understood as inference under uncertainty about the appropriate generative model for the current talker, thereby facilitating robust speech perception despite the lack of invariance. We focus on 2 critical aspects of the ideal adapter. First, in situations that clearly deviate from previous experience, listeners need to adapt. We develop a distributional (belief-updating) learning model of incremental adaptation. The model provides a good fit against known and novel phonetic adaptation data, including perceptual recalibration and selective adaptation. Second, robust speech recognition requires that listeners learn to represent the structured component of cross-situation variability in the speech signal. We discuss how these 2 aspects of the ideal adapter provide a unifying explanation for adaptation, talker-specificity, and generalization across talkers and groups of talkers (e.g., accents and dialects). The ideal adapter provides a guiding framework for future investigations into speech perception and adaptation, and more broadly language comprehension. (c) 2015 APA, all rights reserved).

Mesh:

Year:  2015        PMID: 25844873      PMCID: PMC4744792          DOI: 10.1037/a0038695

Source DB:  PubMed          Journal:  Psychol Rev        ISSN: 0033-295X            Impact factor:   8.934


  149 in total

Review 1.  Stimulus-specific adaptation, habituation and change detection in the gaze control system.

Authors:  Yoram Gutfreund
Journal:  Biol Cybern       Date:  2012-06-19       Impact factor: 2.086

2.  The role of variation in the perception of accented speech.

Authors:  Meghan Sumner
Journal:  Cognition       Date:  2010-12-08

Review 3.  Evolving theories of vowel perception.

Authors:  W Strange
Journal:  J Acoust Soc Am       Date:  1989-05       Impact factor: 1.840

4.  Rapid adaptation to foreign-accented English.

Authors:  Constance M Clarke; Merrill F Garrett
Journal:  J Acoust Soc Am       Date:  2004-12       Impact factor: 1.840

5.  A web-based interface to calculate phonotactic probability for words and nonwords in English.

Authors:  Michael S Vitevitch; Paul A Luce
Journal:  Behav Res Methods Instrum Comput       Date:  2004-08

6.  Expectation-based syntactic comprehension.

Authors:  Roger Levy
Journal:  Cognition       Date:  2007-07-30

Review 7.  Whatever next? Predictive brains, situated agents, and the future of cognitive science.

Authors:  Andy Clark
Journal:  Behav Brain Sci       Date:  2013-05-10       Impact factor: 12.579

8.  Implicit schemata and categories in memory-based language processing.

Authors:  Antal van den Bosch; Walter Daelemans
Journal:  Lang Speech       Date:  2013-09       Impact factor: 1.500

9.  The TRACE model of speech perception.

Authors:  J L McClelland; J L Elman
Journal:  Cogn Psychol       Date:  1986-01       Impact factor: 3.468

10.  Visual recalibration and selective adaptation in auditory-visual speech perception: Contrasting build-up courses.

Authors:  Jean Vroomen; Sabine van Linden; Béatrice de Gelder; Paul Bertelson
Journal:  Neuropsychologia       Date:  2006-03-10       Impact factor: 3.139

View more
  115 in total

1.  The role of training structure in perceptual learning of accented speech.

Authors:  Christina Y Tzeng; Jessica E D Alexander; Sabrina K Sidaras; Lynne C Nygaard
Journal:  J Exp Psychol Hum Percept Perform       Date:  2016-07-11       Impact factor: 3.332

2.  Evaluating the sources and functions of gradiency in phoneme categorization: An individual differences approach.

Authors:  Efthymia C Kapnoula; Matthew B Winn; Eun Jong Kong; Jan Edwards; Bob McMurray
Journal:  J Exp Psychol Hum Percept Perform       Date:  2017-04-13       Impact factor: 3.332

3.  Time and information in perceptual adaptation to speech.

Authors:  Ja Young Choi; Tyler K Perrachione
Journal:  Cognition       Date:  2019-06-21

4.  Nonnative Accent Discrimination with Words and Sentences.

Authors:  Eriko Atagi; Tessa Bent
Journal:  Phonetica       Date:  2017-03-07       Impact factor: 1.759

5.  Separate streams or probabilistic inference? What the N400 can tell us about the comprehension of events.

Authors:  Gina R Kuperberg
Journal:  Lang Cogn Neurosci       Date:  2016-01-20       Impact factor: 2.331

6.  Distinct Neural Networks Relate to Common and Speaker-Specific Language Priors.

Authors:  Leon O H Kroczek; Thomas C Gunter
Journal:  Cereb Cortex Commun       Date:  2020-05-29

7.  Developmental Timescale of Rapid Adaptation to Conflicting Cues in Real-Time Sentence Processing.

Authors:  Angele Yazbec; Michael P Kaschak; Arielle Borovsky
Journal:  Cogn Sci       Date:  2019-01

8.  Sociolinguistic Perception as Inference Under Uncertainty.

Authors:  Dave F Kleinschmidt; Kodi Weatherholtz; T Florian Jaeger
Journal:  Top Cogn Sci       Date:  2018-03-15

9.  Reversing expectations during discourse comprehension.

Authors:  Ming Xiang; Gina Kuperberg
Journal:  Lang Cogn Neurosci       Date:  2015-07-01       Impact factor: 2.331

10.  Dimension-Based Statistical Learning Affects Both Speech Perception and Production.

Authors:  Matthew Lehet; Lori L Holt
Journal:  Cogn Sci       Date:  2016-09-25
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.