Literature DB >> 30619905

Structure in talker variability: How much is there and how much can it help?

Dave F Kleinschmidt1,2.   

Abstract

One of the persistent puzzles in understanding human speech perception is how listeners cope with talker variability. One thing that might help listeners is structure in talker variability: rather than varying randomly, talkers of the same gender, dialect, age, etc. tend to produce language in similar ways. Listeners are sensitive to this covariation between linguistic variation and socio-indexical variables. In this paper I present new techniques based on ideal observer models to quantify (1) the amount and type of structure in talker variation (informativity of a grouping variable), and (2) how useful such structure can be for robust speech recognition in the face of talker variability (the utility of a grouping variable). I demonstrate these techniques in two phonetic domains-word-initial stop voicing and vowel identity-and show that these domains have different amounts and types of talker variability, consistent with previous, impressionistic findings. An R package (phondisttools) accompanies this paper, and the source and data are available from osf.io/zv6e3.

Entities:  

Keywords:  Speech perception; computational modelling; variability

Year:  2018        PMID: 30619905      PMCID: PMC6320234          DOI: 10.1080/23273798.2018.1500698

Source DB:  PubMed          Journal:  Lang Cogn Neurosci        ISSN: 2327-3798            Impact factor:   2.331


  11 in total

1.  Time and information in perceptual adaptation to speech.

Authors:  Ja Young Choi; Tyler K Perrachione
Journal:  Cognition       Date:  2019-06-21

2.  Lexical Information Guides Retuning of Neural Patterns in Perceptual Learning for Speech.

Authors:  Sahil Luthra; João M Correia; Dave F Kleinschmidt; Laura Mesite; Emily B Myers
Journal:  J Cogn Neurosci       Date:  2020-07-14       Impact factor: 3.225

3.  Perception of local and non-local vowels by adults and children in the South.

Authors:  Ewa Jacewicz; Robert Allen Fox
Journal:  J Acoust Soc Am       Date:  2020-01       Impact factor: 1.840

4.  Social Priming in Speech Perception: Revisiting Kangaroo/Kiwi Priming in New Zealand English.

Authors:  Gia Hurring; Jennifer Hay; Katie Drager; Ryan Podlubny; Laura Manhire; Alix Ellis
Journal:  Brain Sci       Date:  2022-05-24

5.  Distributional learning for speech reflects cumulative exposure to a talker's phonetic distributions.

Authors:  Rachel M Theodore; Nicholas R Monto
Journal:  Psychon Bull Rev       Date:  2019-06

6.  Boosting lexical support does not enhance lexically guided perceptual learning.

Authors:  Sahil Luthra; James S Magnuson; Emily B Myers
Journal:  J Exp Psychol Learn Mem Cogn       Date:  2020-10-15       Impact factor: 3.051

7.  Children track probabilistic distributions of facial cues across individuals.

Authors:  Kristina Woodard; Rista C Plate; Seth D Pollak
Journal:  J Exp Psychol Gen       Date:  2021-09-27

8.  Perceptual learning of multiple talkers requires additional exposure.

Authors:  Sahil Luthra; Hannah Mechtenberg; Emily B Myers
Journal:  Atten Percept Psychophys       Date:  2021-03-22       Impact factor: 2.157

9.  Categorization of Vocal Emotion Cues Depends on Distributions of Input.

Authors:  Kristina Woodard; Rista C Plate; Michele Morningstar; Adrienne Wood; Seth D Pollak
Journal:  Affect Sci       Date:  2021-04-10

10.  Toward "English" Phonetics: Variability in the Pre-consonantal Voicing Effect Across English Dialects and Speakers.

Authors:  James Tanner; Morgan Sonderegger; Jane Stuart-Smith; Josef Fruehwald
Journal:  Front Artif Intell       Date:  2020-05-29
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.