The role of short-time intensity and envelope power for speech intelligibility and psychoacoustic masking.

Abstract

The generalized power spectrum model [GPSM; Biberger and Ewert (2016). J. Acoust. Soc. Am. 140, 1023-1038], combining the "classical" concept of the power-spectrum model (PSM) and the envelope power spectrum-model (EPSM), was demonstrated to account for several psychoacoustic and speech intelligibility (SI) experiments. The PSM path of the model uses long-time power signal-to-noise ratios (SNRs), while the EPSM path uses short-time envelope power SNRs. A systematic comparison of existing SI models for several spectro-temporal manipulations of speech maskers and gender combinations of target and masker speakers [Schubotz et al. (2016). J. Acoust. Soc. Am. 140, 524-540] showed the importance of short-time power features. Conversely, Jørgensen et al. [(2013). J. Acoust. Soc. Am. 134, 436-446] demonstrated a higher predictive power of short-time envelope power SNRs than power SNRs using reverberation and spectral subtraction. Here the GPSM was extended to utilize short-time power SNRs and was shown to account for all psychoacoustic and SI data of the three mentioned studies. The best processing strategy was to exclusively use either power or envelope-power SNRs, depending on the experimental task. By analyzing both domains, the suggested model might provide a useful tool for clarifying the contribution of amplitude modulation masking and energetic masking.
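The distinction the abstract draws between long-time power SNRs, short-time power SNRs, and envelope power can be illustrated with a minimal sketch. This is not the GPSM implementation: the function names, the non-overlapping rectangular frames, and the `eps` floor are assumptions made for illustration, and the sketch omits any peripheral or modulation filtering stages that the actual models apply. The envelope power is normalized as AC power divided by squared DC level, which is the convention usually associated with EPSM-type models; treat that choice as an assumption here as well.

```python
import numpy as np

EPS = 1e-12  # assumed floor to avoid log(0) and division by zero


def long_time_power_snr_db(target, masker):
    """Power SNR in dB computed over the whole signal duration."""
    p_t = np.mean(np.asarray(target, dtype=float) ** 2)
    p_m = np.mean(np.asarray(masker, dtype=float) ** 2)
    return 10.0 * np.log10((p_t + EPS) / (p_m + EPS))


def short_time_power_snr_db(target, masker, frame_len):
    """Power SNR in dB for consecutive non-overlapping frames.

    Trailing samples that do not fill a whole frame are dropped.
    """
    target = np.asarray(target, dtype=float)
    masker = np.asarray(masker, dtype=float)
    n_frames = min(len(target), len(masker)) // frame_len
    n = n_frames * frame_len
    t = target[:n].reshape(n_frames, frame_len)
    m = masker[:n].reshape(n_frames, frame_len)
    p_t = np.mean(t ** 2, axis=1)
    p_m = np.mean(m ** 2, axis=1)
    return 10.0 * np.log10((p_t + EPS) / (p_m + EPS))


def normalized_envelope_power(envelope):
    """AC-coupled envelope power divided by the squared mean (DC) level."""
    envelope = np.asarray(envelope, dtype=float)
    dc = np.mean(envelope)
    return np.var(envelope) / (dc ** 2 + EPS)
```

For a target that is present only in the first half of a steady masker, the long-time SNR averages over the silent half, while the short-time SNRs show one favorable frame and one very unfavorable frame. For a sinusoidally modulated envelope `1 + m*sin(...)` sampled over whole modulation cycles, `normalized_envelope_power` returns `m**2 / 2`.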


Year: 2017 PMID: 28863616 DOI: 10.1121/1.4999059

Source DB: PubMed Journal: J Acoust Soc Am ISSN: 0001-4966 Impact factor: 1.840

Cited

5 in total

1. Efficiency in glimpsing vowel sequences in fluctuating maskers: Effects of temporal fine structure and temporal regularity.

Authors: Yi Shen; Dylan V Pearson
Journal: J Acoust Soc Am Date: 2019-04 Impact factor: 1.840

2. Noise-Sensitive But More Precise Subcortical Representations Coexist with Robust Cortical Encoding of Natural Vocalizations.

Authors: Samira Souffi; Christian Lorenzi; Léo Varnet; Chloé Huetz; Jean-Marc Edeline
Journal: J Neurosci Date: 2020-05-22 Impact factor: 6.167

3. Identifying cues for tone-in-noise detection using decision variable correlation in the budgerigar (Melopsittacus undulatus).

Authors: Kenneth S Henry; Kassidy N Amburgey; Kristina S Abrams; Laurel H Carney
Journal: J Acoust Soc Am Date: 2020-02 Impact factor: 1.840

4. Instrumental Quality Predictions and Analysis of Auditory Cues for Algorithms in Modern Headphone Technology.

Authors: Thomas Biberger; Henning Schepker; Florian Denk; Stephan D Ewert
Journal: Trends Hear Date: 2021 Jan-Dec Impact factor: 3.293

5. Objective Prediction of Hearing Aid Benefit Across Listener Groups Using Machine Learning: Speech Recognition Performance With Binaural Noise-Reduction Algorithms.

Authors: Marc R Schädler; Anna Warzybok; Birger Kollmeier
Journal: Trends Hear Date: 2018 Jan-Dec Impact factor: 3.293
