Structure in talker variability: How much is there and how much can it help?

Abstract

One of the persistent puzzles in understanding human speech perception is how listeners cope with talker variability. One thing that might help listeners is structure in talker variability: rather than varying randomly, talkers of the same gender, dialect, age, etc. tend to produce language in similar ways. Listeners are sensitive to this covariation between linguistic variation and socio-indexical variables. In this paper I present new techniques based on ideal observer models to quantify (1) the amount and type of structure in talker variation (informativity of a grouping variable), and (2) how useful such structure can be for robust speech recognition in the face of talker variability (the utility of a grouping variable). I demonstrate these techniques in two phonetic domains, word-initial stop voicing and vowel identity, and show that these domains have different amounts and types of talker variability, consistent with previous, impressionistic findings. An R package (phondisttools) accompanies this paper, and the source and data are available from osf.io/zv6e3.
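The abstract names the two quantities but does not give their formulas, so the following is only a rough sketch of the general idea, not the paper's method and not the phondisttools API. It assumes one Gaussian cue distribution per category, takes informativity to be the mean KL divergence between a talker group's distributions and the pooled distributions, and takes utility to be the gain in expected categorization accuracy when the ideal observer uses the group's own distributions. All numbers and names (`MARGINAL`, `GROUP_A`, `expected_accuracy`) are invented for illustration.

```python
import math

# Hypothetical (mean, sd) of voice onset time in ms; invented for illustration.
MARGINAL = {"b": (10.0, 10.0), "p": (60.0, 20.0)}  # pooled over all talkers
GROUP_A = {"b": (5.0, 6.0), "p": (50.0, 12.0)}     # one talker group


def normal_pdf(x, mean, sd):
    """Density of a univariate normal distribution."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def posterior(x, model):
    """Ideal observer: p(c | x) by Bayes' rule, assuming equal priors."""
    like = {c: normal_pdf(x, m, s) for c, (m, s) in model.items()}
    total = sum(like.values())
    return {c: v / total for c, v in like.items()}


def kl_normal(p, q):
    """KL(p || q) in nats for univariate normals given as (mean, sd)."""
    (m1, s1), (m2, s2) = p, q
    return math.log(s2 / s1) + (s1 ** 2 + (m1 - m2) ** 2) / (2 * s2 ** 2) - 0.5


def expected_accuracy(true_model, observer_model, lo=-100.0, hi=200.0, step=0.1):
    """Probability that the observer picks the intended category when tokens
    come from true_model (equal priors), by a Riemann sum over the cue."""
    n = int(round((hi - lo) / step))
    total = 0.0
    for i in range(n + 1):
        x = lo + i * step
        post = posterior(x, observer_model)
        choice = max(post, key=post.get)
        mean, sd = true_model[choice]
        # Only the chosen category contributes correct responses at this x.
        total += normal_pdf(x, mean, sd) * step
    return total / len(true_model)


# Assumed "informativity": how far the group's distributions sit from the pooled ones.
informativity = sum(kl_normal(GROUP_A[c], MARGINAL[c]) for c in GROUP_A) / len(GROUP_A)

# Assumed "utility": accuracy gained by conditioning on the group.
acc_marginal = expected_accuracy(GROUP_A, MARGINAL)
acc_group = expected_accuracy(GROUP_A, GROUP_A)
utility = acc_group - acc_marginal

print(f"informativity (nats): {informativity:.3f}")
print(f"accuracy, pooled model: {acc_marginal:.3f}")
print(f"accuracy, group model:  {acc_group:.3f}")
print(f"utility (accuracy gain): {utility:.3f}")
```

In this toy setup the two measures can come apart: a group may differ a lot from the pooled distributions (high informativity) while the category boundary barely moves (low utility), which is one reason to measure them separately.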

Keywords: Speech perception; computational modelling; variability

Year: 2018; PMID: 30619905; PMCID: PMC6320234; DOI: 10.1080/23273798.2018.1500698

Source DB: PubMed; Journal: Lang Cogn Neurosci; ISSN: 2327-3798; Impact factor: 2.331

Cited

11 in total

1. Time and information in perceptual adaptation to speech.

2. Lexical Information Guides Retuning of Neural Patterns in Perceptual Learning for Speech.

3. Perception of local and non-local vowels by adults and children in the South.

4. Social Priming in Speech Perception: Revisiting Kangaroo/Kiwi Priming in New Zealand English.

5. Distributional learning for speech reflects cumulative exposure to a talker's phonetic distributions.

6. Boosting lexical support does not enhance lexically guided perceptual learning.

7. Children track probabilistic distributions of facial cues across individuals.

8. Perceptual learning of multiple talkers requires additional exposure.

9. Categorization of Vocal Emotion Cues Depends on Distributions of Input.

10. Toward "English" Phonetics: Variability in the Pre-consonantal Voicing Effect Across English Dialects and Speakers.