Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Estimators of The Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty.

Literature DB >> 21886543

Estimators of The Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty.

Abstract

Statistical estimators of the magnitude-squared spectrum are derived based on the assumption that the magnitude-squared spectrum of the noisy speech signal can be computed as the sum of the (clean) signal and noise magnitude-squared spectra. Maximum a posterior (MAP) and minimum mean square error (MMSE) estimators are derived based on a Gaussian statistical model. The gain function of the MAP estimator was found to be identical to the gain function used in the ideal binary mask (IdBM) that is widely used in computational auditory scene analysis (CASA). As such, it was binary and assumed the value of 1 if the local SNR exceeded 0 dB, and assumed the value of 0 otherwise. By modeling the local instantaneous SNR as an F-distributed random variable, soft masking methods were derived incorporating SNR uncertainty. The soft masking method, in particular, which weighted the noisy magnitude-squared spectrum by the a priori probability that the local SNR exceeds 0 dB was shown to be identical to the Wiener gain function. Results indicated that the proposed estimators yielded significantly better speech quality than the conventional MMSE spectral power estimators, in terms of yielding lower residual noise and lower speech distortion.

Entities: Chemical Gene

Year: 2011 PMID： 21886543 PMCID： PMC3163489 DOI： 10.1109/TASL.2010.2082531

Source DB: PubMed Journal: IEEE Trans Audio Speech Lang Process ISSN： 1558-7916

Keyword Cloud
References

5 in total

Estimators of The Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty.

1. Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation.

2. Subjective comparison and evaluation of speech enhancement algorithms.

3. Factors influencing intelligibility of ideal binary-masked speech: implications for noise reduction.

4. A geometric approach to spectral subtraction.

5. An algorithm that improves speech intelligibility in noise for normal-hearing listeners.