Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing.

Literature DB >> 21895088

Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing.

Abstract

A model for predicting the intelligibility of processed noisy speech is proposed. The speech-based envelope power spectrum model has a similar structure as the model of Ewert and Dau [(2000). J. Acoust. Soc. Am. 108, 1181-1196], developed to account for modulation detection and masking data. The model estimates the speech-to-noise envelope power ratio, SNR(env), at the output of a modulation filterbank and relates this metric to speech intelligibility using the concept of an ideal observer. Predictions were compared to data on the intelligibility of speech presented in stationary speech-shaped noise. The model was further tested in conditions with noisy speech subjected to reverberation and spectral subtraction. Good agreement between predictions and data was found in all cases. For spectral subtraction, an analysis of the model's internal representation of the stimuli revealed that the predicted decrease of intelligibility was caused by the estimated noise envelope power exceeding that of the speech. The classical concept of the speech transmission index fails in this condition. The results strongly suggest that the signal-to-noise ratio at the output of a modulation frequency selective process provides a key measure of speech intelligibility.

Entities: Gene

Mesh：

Year: 2011 PMID： 21895088 DOI： 10.1121/1.3621502

Source DB: PubMed Journal: J Acoust Soc Am ISSN： 0001-4966 Impact factor: 1.840

Keyword Cloud
Cited

36 in total

1. Channel selection in the modulation domain for improved speech intelligibility in noise.

Authors: Kamil K Wójcicki; Philipos C Loizou
Journal: J Acoust Soc Am Date: 2012-04 Impact factor: 1.840

2. Comparing the information conveyed by envelope modulation for speech intelligibility, speech quality, and music quality.

Authors: James M Kates; Kathryn H Arehart
Journal: J Acoust Soc Am Date: 2015-10 Impact factor: 1.840

3. Psychometric functions for sentence recognition in sinusoidally amplitude-modulated noises.

Authors: Yi Shen; Nicole K Manzano; Virginia M Richards
Journal: J Acoust Soc Am Date: 2015-12 Impact factor: 1.840

4. Auditory brainstem response latency in forward masking, a marker of sensory deficits in listeners with normal hearing thresholds.

Authors: Golbarg Mehraei; Andreu Paredes Gallardo; Barbara G Shinn-Cunningham; Torsten Dau
Journal: Hear Res Date: 2017-02-01 Impact factor: 3.208

5. Suboptimal use of neural information in a mammalian auditory system.

Authors: Laurel H Carney; Muhammad S A Zilany; Nicholas J Huang; Kristina S Abrams; Fabio Idrobo
Journal: J Neurosci Date: 2014-01-22 Impact factor: 6.167

6. Top-down or bottom up: decreased stimulus salience increases responses to predictable stimuli of auditory thalamic neurons.

Authors: Srinivasa P Kommajosyula; Rui Cai; Edward Bartlett; Donald M Caspary
Journal: J Physiol Date: 2019-04-21 Impact factor: 5.182

7. Efficiency in glimpsing vowel sequences in fluctuating makers: Effects of temporal fine structure and temporal regularity.

Authors: Yi Shen; Dylan V Pearson
Journal: J Acoust Soc Am Date: 2019-04 Impact factor: 1.840

8. Exploring the Role of Medial Olivocochlear Efferents on the Detection of Amplitude Modulation for Tones Presented in Noise.

Authors: Magdalena Wojtczak; Alix M Klang; Nathan T Torunsky
Journal: J Assoc Res Otolaryngol Date: 2019-05-28

9. Effects of Expanding Envelope Fluctuations on Consonant Perception in Hearing-Impaired Listeners.

Authors: Alan Wiinberg; Johannes Zaar; Torsten Dau
Journal: Trends Hear Date: 2018 Jan-Dec Impact factor: 3.293

10. Adaptive temporal encoding leads to a background-insensitive cortical representation of speech.

Authors: Nai Ding; Jonathan Z Simon
Journal: J Neurosci Date: 2013-03-27 Impact factor: 6.167