Literature DB >> 18345855

Factors influencing intelligibility of ideal binary-masked speech: implications for noise reduction.

Ning Li1, Philipos C Loizou.   

Abstract

The application of the ideal binary mask to an auditory mixture has been shown to yield substantial improvements in intelligibility. This mask is commonly applied to the time-frequency (T-F) representation of a mixture signal and eliminates portions of a signal below a signal-to-noise-ratio (SNR) threshold while allowing others to pass through intact. The factors influencing intelligibility of ideal binary-masked speech are not well understood and are examined in the present study. Specifically, the effects of the local SNR threshold, input SNR level, masker type, and errors introduced in estimating the ideal mask are examined. Consistent with previous studies, intelligibility of binary-masked stimuli is quite high even at -10 dB SNR for all maskers tested. Performance was affected the most when the masker dominated T-F units were wrongly labeled as target-dominated T-F units. Performance plateaued near 100% correct for SNR thresholds ranging from -20 to 5 dB. The existence of the plateau region suggests that it is the pattern of the ideal binary mask that matters the most rather than the local SNR of each T-F unit. This pattern directs the listener's attention to where the target is and enables them to segregate speech effectively in multitalker environments.

Mesh:

Year:  2008        PMID: 18345855      PMCID: PMC2696360          DOI: 10.1121/1.2832617

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  12 in total

1.  Informational and energetic masking effects in the perception of two simultaneous talkers.

Authors:  D S Brungart
Journal:  J Acoust Soc Am       Date:  2001-03       Impact factor: 1.840

2.  A speech corpus for multitalker communications research.

Authors:  R S Bolia; W T Nelson; M A Ericson; B D Simpson
Journal:  J Acoust Soc Am       Date:  2000-02       Impact factor: 1.840

3.  Informational and energetic masking effects in the perception of multiple simultaneous talkers.

Authors:  D S Brungart; B D Simpson; M A Ericson; K R Scott
Journal:  J Acoust Soc Am       Date:  2001-11       Impact factor: 1.840

4.  Speech segregation based on sound localization.

Authors:  Nicoleta Roman; DeLiang Wang; Guy J Brown
Journal:  J Acoust Soc Am       Date:  2003-10       Impact factor: 1.840

5.  Pitch-based monaural segregation of reverberant speech.

Authors:  Nicoleta Roman; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2006-07       Impact factor: 1.840

6.  A glimpsing model of speech perception in noise.

Authors:  Martin Cooke
Journal:  J Acoust Soc Am       Date:  2006-03       Impact factor: 1.840

7.  Determination of the potential benefit of time-frequency gain manipulation.

Authors:  Michael C Anzalone; Lauren Calandruccio; Karen A Doherty; Laurel H Carney
Journal:  Ear Hear       Date:  2006-10       Impact factor: 3.570

8.  Factors influencing glimpsing of speech in noise.

Authors:  Ning Li; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2007-08       Impact factor: 1.840

9.  Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing.

Authors:  J M Festen; R Plomp
Journal:  J Acoust Soc Am       Date:  1990-10       Impact factor: 1.840

10.  Subjective comparison and evaluation of speech enhancement algorithms.

Authors:  Yi Hu; Philipos C Loizou
Journal:  Speech Commun       Date:  2007-07       Impact factor: 2.017

View more
  40 in total

1.  The relative phonetic contributions of a cochlear implant and residual acoustic hearing to bimodal speech perception.

Authors:  Benjamin M Sheffield; Fan-Gang Zeng
Journal:  J Acoust Soc Am       Date:  2012-01       Impact factor: 1.840

2.  Channel selection in the modulation domain for improved speech intelligibility in noise.

Authors:  Kamil K Wójcicki; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2012-04       Impact factor: 1.840

3.  Comparing the information conveyed by envelope modulation for speech intelligibility, speech quality, and music quality.

Authors:  James M Kates; Kathryn H Arehart
Journal:  J Acoust Soc Am       Date:  2015-10       Impact factor: 1.840

Review 4.  Time-frequency masking for speech separation and its potential for hearing aid design.

Authors: 
Journal:  Trends Amplif       Date:  2008-10-30

5.  An algorithm to improve speech recognition in noise for hearing-impaired listeners.

Authors:  Eric W Healy; Sarah E Yoho; Yuxuan Wang; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2013-10       Impact factor: 1.840

6.  An algorithm that improves speech intelligibility in noise for normal-hearing listeners.

Authors:  Gibak Kim; Yang Lu; Yi Hu; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2009-09       Impact factor: 1.840

7.  Effect of spectral resolution on the intelligibility of ideal binary masked speech.

Authors:  Ning Li; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2008-04       Impact factor: 1.840

8.  Estimators of The Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty.

Authors:  Yang Lu; Philipos C Loizou
Journal:  IEEE Trans Audio Speech Lang Process       Date:  2011-07-01

9.  Noise Perturbation for Supervised Speech Separation.

Authors:  Jitong Chen; Yuxuan Wang; DeLiang Wang
Journal:  Speech Commun       Date:  2016-04-01       Impact factor: 2.017

10.  Reasons why current speech-enhancement algorithms do not improve speech intelligibility and suggested solutions.

Authors:  Philipos C Loizou; Gibak Kim
Journal:  IEEE Trans Audio Speech Lang Process       Date:  2011
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.