Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A metric for predicting binaural speech intelligibility in stationary noise and competing speech maskers.

Literature DB >> 27914424

A metric for predicting binaural speech intelligibility in stationary noise and competing speech maskers.

Yan Tang¹, Martin Cooke², Bruno M Fazenda¹, Trevor J Cox¹.

Abstract

One criterion in the design of binaural sound scenes in audio production is the extent to which the intended speech message is correctly understood. Object-based audio broadcasting systems have permitted sound editors to gain more access to the metadata (e.g., intensity and location) of each sound source, providing better control over speech intelligibility. The current study describes and evaluates a binaural distortion-weighted glimpse proportion metric-BiDWGP-which is motivated by better-ear glimpsing and binaural masking level differences. BiDWGP predicts intelligibility from two alternative input forms: either binaural recordings or monophonic recordings from each sound source along with their locations. Two listening experiments were performed with stationary noise and competing speech, one in the presence of a single masker, the other with multiple maskers, for a variety of spatial configurations. Overall, BiDWGP with both input forms predicts listener keyword scores with correlations of 0.95 and 0.91 for single- and multi-masker conditions, respectively. When considering masker type separately, correlations rise to 0.95 and above for both types of maskers. Predictions using the two input forms are very similar, suggesting that BiDWGP can be applied to the design of sound scenes where only individual sound sources and their locations are available.

Mesh：

Year: 2016 PMID： 27914424 DOI： 10.1121/1.4962484

Source DB: PubMed Journal: J Acoust Soc Am ISSN： 0001-4966 Impact factor: 1.840

Keyword Cloud
Cited

3 in total

1. The importance of a broad bandwidth for understanding "glimpsed" speech.

Authors: Virginia Best; Elin Roverud; Lucas Baltzell; Jan Rennies; Mathieu Lavandier
Journal: J Acoust Soc Am Date: 2019-11 Impact factor: 1.840

2. Spectro-temporal glimpsing of speech in noise: Regularity and coherence of masking patterns reduces uncertainty and increases intelligibility.

Authors: Daniel Fogerty; Victoria A Sevich; Eric W Healy
Journal: J Acoust Soc Am Date: 2020-09 Impact factor: 1.840

3. Neurocognitive dynamics of near-threshold voice signal detection and affective voice evaluation.

Authors: Huw Swanborough; Matthias Staib; Sascha Frühholz
Journal: Sci Adv Date: 2020-12-11 Impact factor: 14.136

3 in total