Literature DB >> 28863616

The role of short-time intensity and envelope power for speech intelligibility and psychoacoustic masking.

Thomas Biberger1, Stephan D Ewert1.   

Abstract

The generalized power spectrum model [GPSM; Biberger and Ewert (2016). J. Acoust. Soc. Am. 140, 1023-1038], combining the "classical" concept of the power-spectrum model (PSM) and the envelope power spectrum-model (EPSM), was demonstrated to account for several psychoacoustic and speech intelligibility (SI) experiments. The PSM path of the model uses long-time power signal-to-noise ratios (SNRs), while the EPSM path uses short-time envelope power SNRs. A systematic comparison of existing SI models for several spectro-temporal manipulations of speech maskers and gender combinations of target and masker speakers [Schubotz et al. (2016). J. Acoust. Soc. Am. 140, 524-540] showed the importance of short-time power features. Conversely, Jørgensen et al. [(2013). J. Acoust. Soc. Am. 134, 436-446] demonstrated a higher predictive power of short-time envelope power SNRs than power SNRs using reverberation and spectral subtraction. Here the GPSM was extended to utilize short-time power SNRs and was shown to account for all psychoacoustic and SI data of the three mentioned studies. The best processing strategy was to exclusively use either power or envelope-power SNRs, depending on the experimental task. By analyzing both domains, the suggested model might provide a useful tool for clarifying the contribution of amplitude modulation masking and energetic masking.

Mesh:

Year:  2017        PMID: 28863616     DOI: 10.1121/1.4999059

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  5 in total

1.  Efficiency in glimpsing vowel sequences in fluctuating makers: Effects of temporal fine structure and temporal regularity.

Authors:  Yi Shen; Dylan V Pearson
Journal:  J Acoust Soc Am       Date:  2019-04       Impact factor: 1.840

2.  Noise-Sensitive But More Precise Subcortical Representations Coexist with Robust Cortical Encoding of Natural Vocalizations.

Authors:  Samira Souffi; Christian Lorenzi; Léo Varnet; Chloé Huetz; Jean-Marc Edeline
Journal:  J Neurosci       Date:  2020-05-22       Impact factor: 6.167

3.  Identifying cues for tone-in-noise detection using decision variable correlation in the budgerigar (Melopsittacus undulatus).

Authors:  Kenneth S Henry; Kassidy N Amburgey; Kristina S Abrams; Laurel H Carney
Journal:  J Acoust Soc Am       Date:  2020-02       Impact factor: 1.840

4.  Instrumental Quality Predictions and Analysis of Auditory Cues for Algorithms in Modern Headphone Technology.

Authors:  Thomas Biberger; Henning Schepker; Florian Denk; Stephan D Ewert
Journal:  Trends Hear       Date:  2021 Jan-Dec       Impact factor: 3.293

5.  Objective Prediction of Hearing Aid Benefit Across Listener Groups Using Machine Learning: Speech Recognition Performance With Binaural Noise-Reduction Algorithms.

Authors:  Marc R Schädler; Anna Warzybok; Birger Kollmeier
Journal:  Trends Hear       Date:  2018 Jan-Dec       Impact factor: 3.293

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.