Literature DB >> 31671986

Predicting the effects of periodicity on the intelligibility of masked speech: An evaluation of different modelling approaches and their limitations.

Kurt Steinmetzger1, Johannes Zaar2, Helia Relaño-Iborra2, Stuart Rosen1, Torsten Dau2.   

Abstract

Four existing speech intelligibility models with different theoretical assumptions were used to predict previously published behavioural data. Those data showed that complex tones with pitch-related periodicity are far less effective maskers of speech than aperiodic noise. This so-called masker-periodicity benefit (MPB) far exceeded the fluctuating-masker benefit (FMB) obtained from slow masker envelope fluctuations. In contrast, the normal-hearing listeners hardly benefitted from periodicity in the target speech. All tested models consistently underestimated MPB and FMB, while most of them also overestimated the intelligibility of vocoded speech. To understand these shortcomings, the internal signal representations of the models were analysed in detail. The best-performing model, the correlation-based version of the speech-based envelope power spectrum model (sEPSMcorr), combined an auditory processing front end with a modulation filterbank and a correlation-based back end. This model was then modified to further improve the predictions. The resulting second version of the sEPSMcorr outperformed the original model with all tested maskers and accounted for about half the MPB, which can be attributed to reduced modulation masking caused by the periodic maskers. However, as the sEPSMcorr2 failed to account for the other half of the MPB, the results also indicate that future models should consider the contribution of pitch-related effects, such as enhanced stream segregation, to further improve their predictive power.

Mesh:

Year:  2019        PMID: 31671986     DOI: 10.1121/1.5129050

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  4 in total

1.  A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker.

Authors:  Luna Prud'homme; Mathieu Lavandier; Virginia Best
Journal:  J Acoust Soc Am       Date:  2020-11       Impact factor: 1.840

2.  Modulation masking and fine structure shape neural envelope coding to predict speech intelligibility across diverse listening conditions.

Authors:  Vibha Viswanathan; Hari M Bharadwaj; Barbara G Shinn-Cunningham; Michael G Heinz
Journal:  J Acoust Soc Am       Date:  2021-09       Impact factor: 2.482

3.  Speech Categorization Reveals the Role of Early-Stage Temporal-Coherence Processing in Auditory Scene Analysis.

Authors:  Vibha Viswanathan; Barbara G Shinn-Cunningham; Michael G Heinz
Journal:  J Neurosci       Date:  2021-11-11       Impact factor: 6.709

4.  Speech Intelligibility Prediction using Spectro-Temporal Modulation Analysis.

Authors:  Amin Edraki; Wai-Yip Chan; Jesper Jensen; Daniel Fogerty
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2020-11-24
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.