Literature DB >> 21886543

Estimators of The Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty.

Yang Lu1, Philipos C Loizou.   

Abstract

Statistical estimators of the magnitude-squared spectrum are derived based on the assumption that the magnitude-squared spectrum of the noisy speech signal can be computed as the sum of the (clean) signal and noise magnitude-squared spectra. Maximum a posterior (MAP) and minimum mean square error (MMSE) estimators are derived based on a Gaussian statistical model. The gain function of the MAP estimator was found to be identical to the gain function used in the ideal binary mask (IdBM) that is widely used in computational auditory scene analysis (CASA). As such, it was binary and assumed the value of 1 if the local SNR exceeded 0 dB, and assumed the value of 0 otherwise. By modeling the local instantaneous SNR as an F-distributed random variable, soft masking methods were derived incorporating SNR uncertainty. The soft masking method, in particular, which weighted the noisy magnitude-squared spectrum by the a priori probability that the local SNR exceeds 0 dB was shown to be identical to the Wiener gain function. Results indicated that the proposed estimators yielded significantly better speech quality than the conventional MMSE spectral power estimators, in terms of yielding lower residual noise and lower speech distortion.

Entities:  

Year:  2011        PMID: 21886543      PMCID: PMC3163489          DOI: 10.1109/TASL.2010.2082531

Source DB:  PubMed          Journal:  IEEE Trans Audio Speech Lang Process        ISSN: 1558-7916


  5 in total

1.  Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation.

Authors:  Douglas S Brungart; Peter S Chang; Brian D Simpson; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2006-12       Impact factor: 1.840

2.  Subjective comparison and evaluation of speech enhancement algorithms.

Authors:  Yi Hu; Philipos C Loizou
Journal:  Speech Commun       Date:  2007-07       Impact factor: 2.017

3.  Factors influencing intelligibility of ideal binary-masked speech: implications for noise reduction.

Authors:  Ning Li; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2008-03       Impact factor: 1.840

4.  A geometric approach to spectral subtraction.

Authors:  Yang Lu; Philipos C Loizou
Journal:  Speech Commun       Date:  2008       Impact factor: 2.017

5.  An algorithm that improves speech intelligibility in noise for normal-hearing listeners.

Authors:  Gibak Kim; Yang Lu; Yi Hu; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2009-09       Impact factor: 1.840

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.