Literature DB >> 19739755

Role of mask pattern in intelligibility of ideal binary-masked noisy speech.

Ulrik Kjems1, Jesper B Boldt, Michael S Pedersen, Thomas Lunner, Deliang Wang.   

Abstract

Intelligibility of ideal binary masked noisy speech was measured on a group of normal hearing individuals across mixture signal to noise ratio (SNR) levels, masker types, and local criteria for forming the binary mask. The binary mask is computed from time-frequency decompositions of target and masker signals using two different schemes: an ideal binary mask computed by thresholding the local SNR within time-frequency units and a target binary mask computed by comparing the local target energy against the long-term average speech spectrum. By depicting intelligibility scores as a function of the difference between mixture SNR and local SNR threshold, alignment of the performance curves is obtained for a large range of mixture SNR levels. Large intelligibility benefits are obtained for both sparse and dense binary masks. When an ideal mask is dense with many ones, the effect of changing mixture SNR level while fixing the mask is significant, whereas for more sparse masks the effect is small or insignificant.

Mesh:

Year:  2009        PMID: 19739755     DOI: 10.1121/1.3179673

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  19 in total

1.  Channel selection in the modulation domain for improved speech intelligibility in noise.

Authors:  Kamil K Wójcicki; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2012-04       Impact factor: 1.840

2.  Comparing the information conveyed by envelope modulation for speech intelligibility, speech quality, and music quality.

Authors:  James M Kates; Kathryn H Arehart
Journal:  J Acoust Soc Am       Date:  2015-10       Impact factor: 1.840

3.  The intelligibility of pointillistic speech.

Authors:  Gerald Kidd; Timothy M Streeter; Antje Ihlefeld; Ross K Maddox; Christine R Mason
Journal:  J Acoust Soc Am       Date:  2009-12       Impact factor: 1.840

4.  Speech-cue transmission by an algorithm to increase consonant recognition in noise for hearing-impaired listeners.

Authors:  Eric W Healy; Sarah E Yoho; Yuxuan Wang; Frédéric Apoux; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2014-12       Impact factor: 1.840

5.  The optimal threshold for removing noise from speech is similar across normal and impaired hearing-a time-frequency masking study.

Authors:  Eric W Healy; Jordan L Vasko; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2019-06       Impact factor: 1.840

6.  Modulation masking and glimpsing of natural and vocoded speech during single-talker modulated noise: Effect of the modulation spectrum.

Authors:  Daniel Fogerty; Jiaqian Xu; Bobby E Gibbs
Journal:  J Acoust Soc Am       Date:  2016-09       Impact factor: 1.840

7.  An ideal quantized mask to increase intelligibility and quality of speech in noise.

Authors:  Eric W Healy; Jordan L Vasko
Journal:  J Acoust Soc Am       Date:  2018-09       Impact factor: 1.840

8.  Perceptual learning for speech in noise after application of binary time-frequency masks.

Authors:  Mahnaz Ahmadi; Vauna L Gross; Donal G Sinex
Journal:  J Acoust Soc Am       Date:  2013-03       Impact factor: 1.840

9.  Comparison of two channel selection criteria for noise suppression in cochlear implants.

Authors:  Oldooz Hazrati; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2013-03       Impact factor: 1.840

10.  Improving Robustness of Deep Neural Network Acoustic Models via Speech Separation and Joint Adaptive Training.

Authors:  Arun Narayanan; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2015-01-14
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.