Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deep Learning Based Target Cancellation for Speech Dereverberation.

Literature DB >> 33748324

Deep Learning Based Target Cancellation for Speech Dereverberation.

Abstract

This study investigates deep learning based single- and multi-channel speech dereverberation. For single-channel processing, we extend magnitude-domain masking and mapping based dereverberation to complex-domain mapping, where deep neural networks (DNNs) are trained to predict the real and imaginary (RI) components of the direct-path signal from reverberant (and noisy) ones. For multi-channel processing, we first compute a minimum variance distortionless response (MVDR) beamformer to cancel the direct-path signal, and then feed the RI components of the cancelled signal, which is expected to be a filtered version of non-target signals, as additional features to perform dereverberation. Trained on a large dataset of simulated room impulse responses, our models show excellent speech dereverberation and recognition performance on the test set of the REVERB challenge, consistently better than single- and multi-channel weighted prediction error (WPE) algorithms.

Entities: Chemical Disease Species

Keywords: complex spectral mapping; deep learning; microphone array processing; phase estimation; speech dereverberation

Year: 2020 PMID： 33748324 PMCID： PMC7977279 DOI： 10.1109/taslp.2020.2975902

Source DB: PubMed Journal: IEEE/ACM Trans Audio Speech Lang Process

6 in total

1 in total

1. Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation.

Authors: Zhong-Qiu Wang; Peidong Wang; DeLiang Wang
Journal: IEEE/ACM Trans Audio Speech Lang Process Date: 2021-05-26

1 in total

Deep Learning Based Target Cancellation for Speech Dereverberation.

1. Binaural segregation in multisource reverberant environments.

2. Supervised Speech Separation Based on Deep Learning: An Overview.

3. Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising.

4. On Training Targets for Supervised Speech Separation.

5. Complex Ratio Masking for Monaural Speech Separation.

6. Deep Learning Based Binaural Speech Separation in Reverberant Environments.

1. Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation.