Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deep Learning Based Real-time Speech Enhancement for Dual-microphone Mobile Phones.

Literature DB >> 34179221

Deep Learning Based Real-time Speech Enhancement for Dual-microphone Mobile Phones.

Ke Tan¹, Xueliang Zhang², DeLiang Wang³.

Abstract

In mobile speech communication, speech signals can be severely corrupted by background noise when the far-end talker is in a noisy acoustic environment. To suppress background noise, speech enhancement systems are typically integrated into mobile phones, in which one or more microphones are deployed. In this study, we propose a novel deep learning based approach to real-time speech enhancement for dual-microphone mobile phones. The proposed approach employs a new densely-connected convolutional recurrent network to perform dual-channel complex spectral mapping. We utilize a structured pruning technique to compress the model without significantly degrading the enhancement performance, which yields a low-latency and memory-efficient enhancement system for real-time processing. Experimental results suggest that the proposed approach consistently outperforms an earlier approach to dual-channel speech enhancement for mobile phone communication, as well as a deep learning based beamformer.

Entities: Chemical

Keywords: Real-time speech enhancement; complex spectral mapping; densely-connected convolutional recurrent network; dual-microphone mobile phones

Year: 2021 PMID： 34179221 PMCID： PMC8224499 DOI： 10.1109/taslp.2021.3082318

Source DB: PubMed Journal: IEEE/ACM Trans Audio Speech Lang Process

Keyword Cloud
References

8 in total

8. Speech Enhancement of Mobile Devices Based on the Integration of a Dual Microphone Array and a Background Noise Elimination Algorithm.

Authors: Yung-Yue Chen
Journal: Sensors (Basel) Date: 2018-05-08 Impact factor: 3.576

8 in total

Deep Learning Based Real-time Speech Enhancement for Dual-microphone Mobile Phones.

1. Supervised Speech Separation Based on Deep Learning: An Overview.

2. Gated Residual Networks with Dilated Convolutions for Monaural Speech Enhancement.

3. Learning Complex Spectral Mapping with Gated Convolutional Recurrent Networks for Monaural Speech Enhancement.

4. On Training Targets for Supervised Speech Separation.

5. UNet++: A Nested U-Net Architecture for Medical Image Segmentation.

6. Complex Ratio Masking for Monaural Speech Separation.

7. Deep Learning Based Binaural Speech Separation in Reverberant Environments.

8. Speech Enhancement of Mobile Devices Based on the Integration of a Dual Microphone Array and a Background Noise Elimination Algorithm.