Literature DB >> 33748324

Deep Learning Based Target Cancellation for Speech Dereverberation.

Zhong-Qiu Wang1, DeLiang Wang2.   

Abstract

This study investigates deep learning based single- and multi-channel speech dereverberation. For single-channel processing, we extend magnitude-domain masking and mapping based dereverberation to complex-domain mapping, where deep neural networks (DNNs) are trained to predict the real and imaginary (RI) components of the direct-path signal from reverberant (and noisy) ones. For multi-channel processing, we first compute a minimum variance distortionless response (MVDR) beamformer to cancel the direct-path signal, and then feed the RI components of the cancelled signal, which is expected to be a filtered version of non-target signals, as additional features to perform dereverberation. Trained on a large dataset of simulated room impulse responses, our models show excellent speech dereverberation and recognition performance on the test set of the REVERB challenge, consistently better than single- and multi-channel weighted prediction error (WPE) algorithms.

Entities:  

Keywords:  complex spectral mapping; deep learning; microphone array processing; phase estimation; speech dereverberation

Year:  2020        PMID: 33748324      PMCID: PMC7977279          DOI: 10.1109/taslp.2020.2975902

Source DB:  PubMed          Journal:  IEEE/ACM Trans Audio Speech Lang Process


  6 in total

1.  Binaural segregation in multisource reverberant environments.

Authors:  Nicoleta Roman; Soundararajan Srinivasan; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2006-12       Impact factor: 1.840

2.  Supervised Speech Separation Based on Deep Learning: An Overview.

Authors:  DeLiang Wang; Jitong Chen
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2018-05-30

3.  Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising.

Authors:  Donald S Williamson; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2017-04-20

4.  On Training Targets for Supervised Speech Separation.

Authors:  Yuxuan Wang; Arun Narayanan; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2014-12

5.  Complex Ratio Masking for Monaural Speech Separation.

Authors:  Donald S Williamson; Yuxuan Wang; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2015-12-23

6.  Deep Learning Based Binaural Speech Separation in Reverberant Environments.

Authors:  Xueliang Zhang; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2017-03-24
  6 in total
  1 in total

1.  Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation.

Authors:  Zhong-Qiu Wang; Peidong Wang; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2021-05-26
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.