Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 RESIDUAL RECURRENT NEURAL NETWORK FOR SPEECH ENHANCEMENT.

Literature DB >> 33716575

RESIDUAL RECURRENT NEURAL NETWORK FOR SPEECH ENHANCEMENT.

Jalal Abdulbaqi¹, Yue Gu¹, Shuhong Chen¹, Ivan Marsic¹.

Abstract

Most current speech enhancement models use spectrogram features that require an expensive transformation and result in phase information loss. Previous work has overcome these issues by using convolutional networks to learn the temporal correlations across high-resolution waveforms. These models, however, are limited by memory-intensive dilated convolution and aliasing artifacts from upsampling. We introduce an end-to-end fully recurrent neural network for single-channel speech enhancement. The network structured as an hourglass-shape that can efficiently capture long-range temporal dependencies by reducing the features resolution without information loss. Also, we use residual connections to prevent gradient decay over layers and improve the model generalization. Experimental results show that our model outperforms state-of-the-art approaches in six quantitative evaluation metrics.

Entities: Chemical Gene Species

Keywords: Speech enhancement; recurrent neural network; residual connection; speech denoising; waveform

Year: 2020 PMID： 33716575 PMCID： PMC7954533 DOI： 10.1109/icassp40776.2020.9053544

Source DB: PubMed Journal: Proc IEEE Int Conf Acoust Speech Signal Process ISSN： 1520-6149

Keyword Cloud
Cited

1 in total

1. A Waveform Mapping-Based Approach for Enhancement of Trunk Borers' Vibration Signals Using Deep Learning Model.

Authors: Haopeng Shi; Zhibo Chen; Haiyan Zhang; Juhu Li; Xuanxin Liu; Lili Ren; Youqing Luo
Journal: Insects Date: 2022-06-29 Impact factor: 3.139

1 in total