Human phoneme recognition depending on speech-intrinsic variability.

Bernd T Meyer, Tim Jürgens, Thorsten Wesker, Thomas Brand, Birger Kollmeier.

Abstract

The influence of different sources of speech-intrinsic variation (speaking rate, effort, style and dialect or accent) on human speech perception was investigated. In listening experiments with 16 listeners, confusions of consonant-vowel-consonant (CVC) and vowel-consonant-vowel (VCV) sounds in speech-weighted noise were analyzed. Experiments were based on the OLLO logatome speech database, which was designed for a man-machine comparison. It contains utterances spoken by 50 speakers from five dialect/accent regions and covers several intrinsic variations. By comparing results depending on intrinsic and extrinsic variations (i.e., different levels of masking noise), the degradation induced by variabilities can be expressed in terms of the SNR. The spectral level distance between the respective speech segment and the long-term spectrum of the masking noise was found to be a good predictor for recognition rates, while phoneme confusions were influenced by the distance to spectrally close phonemes. An analysis based on transmitted information of articulatory features showed that voicing and manner of articulation are comparatively robust cues in the presence of intrinsic variations, whereas the coding of place is more degraded. The database and detailed results have been made available for comparisons between human speech recognition (HSR) and automatic speech recognizers (ASR).
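The abstract does not state how the transmitted information of articulatory features was computed. A common approach, assumed here, is to pool the phoneme confusion matrix by feature class and take the mutual information between stimulus and response, normalized by the stimulus entropy. The sketch below illustrates that calculation only; the function names, the four-consonant set, and the confusion counts are invented for illustration and are not taken from the study.

```python
import math


def transmitted_information(confusions):
    """Return (T, T_rel) for a confusion-count matrix.

    confusions[i][j] is the number of times stimulus class i was
    reported as response class j. T is the mutual information in bits;
    T_rel is T divided by the stimulus entropy.
    """
    total = sum(sum(row) for row in confusions)
    if total == 0:
        raise ValueError("confusion matrix contains no responses")
    row_sums = [sum(row) for row in confusions]
    col_sums = [sum(col) for col in zip(*confusions)]

    t = 0.0
    for i, row in enumerate(confusions):
        for j, n in enumerate(row):
            if n > 0:  # empty cells contribute nothing
                t += (n / total) * math.log2(n * total / (row_sums[i] * col_sums[j]))

    h_stim = -sum((r / total) * math.log2(r / total) for r in row_sums if r > 0)
    t_rel = t / h_stim if h_stim > 0 else 0.0
    return t, t_rel


def collapse_by_feature(confusions, labels, feature_of):
    """Pool a phoneme confusion matrix into a feature-class matrix."""
    classes = sorted({feature_of[p] for p in labels})
    index = {c: k for k, c in enumerate(classes)}
    pooled = [[0] * len(classes) for _ in classes]
    for i, stim in enumerate(labels):
        for j, resp in enumerate(labels):
            pooled[index[feature_of[stim]]][index[feature_of[resp]]] += confusions[i][j]
    return pooled


# Hypothetical data: voicing is never confused, place often is.
labels = ["p", "b", "t", "d"]
voicing = {"p": "voiceless", "b": "voiced", "t": "voiceless", "d": "voiced"}
place = {"p": "labial", "b": "labial", "t": "alveolar", "d": "alveolar"}
counts = [
    [6, 0, 4, 0],  # stimulus p
    [0, 6, 0, 4],  # stimulus b
    [4, 0, 6, 0],  # stimulus t
    [0, 4, 0, 6],  # stimulus d
]

_, voicing_rel = transmitted_information(collapse_by_feature(counts, labels, voicing))
_, place_rel = transmitted_information(collapse_by_feature(counts, labels, place))
print(f"voicing: {voicing_rel:.3f}, place: {place_rel:.3f}")
```

With these made-up counts, the relative transmitted information for voicing is much higher than for place, which is the kind of contrast the abstract reports for intrinsic variations.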

Year: 2010 PMID: 21110608 DOI: 10.1121/1.3493450

Source DB: PubMed Journal: J Acoust Soc Am ISSN: 0001-4966 Impact factor: 1.840

Cited

3 in total

1. Event-Related Potentials Measured From In and Around the Ear Electrodes Integrated in a Live Hearing Device for Monitoring Sound Perception.

Authors: Florian Denk; Marleen Grzybowski; Stephan M A Ernst; Birger Kollmeier; Stefan Debener; Martin G Bleichner
Journal: Trends Hear Date: 2018 Jan-Dec Impact factor: 3.293

2. Auditory Nerve Fiber Discrimination and Representation of Naturally-Spoken Vowels in Noise.

Authors: Amarins N Heeringa; Christine Köppl
Journal: eNeuro Date: 2022-02-14

3. Speech recognition in natural background noise.

Authors: Julien Meyer; Laure Dentel; Fanny Meunier
Journal: PLoS One Date: 2013-11-19 Impact factor: 3.240
