Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Reconstruction techniques for improving the perceptual quality of binary masked speech.

Literature DB >> 25096123

Reconstruction techniques for improving the perceptual quality of binary masked speech.

Donald S Williamson¹, Yuxuan Wang¹, DeLiang Wang².

Abstract

This study proposes an approach to improve the perceptual quality of speech separated by binary masking through the use of reconstruction in the time-frequency domain. Non-negative matrix factorization and sparse reconstruction approaches are investigated, both using a linear combination of basis vectors to represent a signal. In this approach, the short-time Fourier transform (STFT) of separated speech is represented as a linear combination of STFTs from a clean speech dictionary. Binary masking for separation is performed using deep neural networks or Bayesian classifiers. The perceptual evaluation of speech quality, which is a standard objective speech quality measure, is used to evaluate the performance of the proposed approach. The results show that the proposed techniques improve the perceptual quality of binary masked speech, and outperform traditional time-frequency reconstruction approaches.

Mesh：

Year: 2014 PMID： 25096123 PMCID： PMC5392053 DOI： 10.1121/1.4884759

Source DB: PubMed Journal: J Acoust Soc Am ISSN： 0001-4966 Impact factor: 1.840

11 in total

Reconstruction techniques for improving the perceptual quality of binary masked speech.

1. Learning the parts of objects by non-negative matrix factorization.

2. Image denoising via sparse and redundant representations over learned dictionaries.

3. Determination of the potential benefit of time-frequency gain manipulation.

4. Factors influencing intelligibility of ideal binary-masked speech: implications for noise reduction.

5. Sparse representation for color image restoration.

Review 6. Time-frequency masking for speech separation and its potential for hearing aid design.

7. An algorithm to improve speech recognition in noise for hearing-impaired listeners.

8. An algorithm that improves speech intelligibility in noise for normal-hearing listeners.

9. Improvement of intelligibility of ideal binary-masked noisy speech by adding background noise.

10. Speech intelligibility in background noise with ideal binary time-frequency masking.

1. Estimating nonnegative matrix model activations with deep neural networks to increase perceptual speech quality.

2. Impact of phase estimation on single-channel speech separation based on time-frequency masking.

3. Complex Ratio Masking for Monaural Speech Separation.