Literature DB >> 33898608

Real-time single-channel deep neural network-based speech enhancement on edge devices.

Nikhil Shankar1, Gautam Shreedhar Bhat1, Issa M S Panahi1.   

Abstract

In this paper, we present a deep neural network architecture comprising of both convolutional neural network (CNN) and recurrent neural network (RNN) layers for real-time single-channel speech enhancement (SE). The proposed neural network model focuses on enhancing the noisy speech magnitude spectrum on a frame-by-frame process. The developed model is implemented on the smartphone (edge device), to demonstrate the real-time usability of the proposed method. Perceptual evaluation of speech quality (PESQ) and short-time objective intelligibility (STOI) test results are used to compare the proposed algorithm to previously published conventional and deep learning-based SE methods. Subjective ratings show the performance improvement of the proposed model over the other baseline SE methods.

Entities:  

Keywords:  neural networks; real-time; smartphone; speech enhancement

Year:  2020        PMID: 33898608      PMCID: PMC8064406          DOI: 10.21437/Interspeech.2020-1901

Source DB:  PubMed          Journal:  Interspeech        ISSN: 2308-457X


  1 in total

1.  Smartphone-based single-channel speech enhancement application for hearing aids.

Authors:  Nikhil Shankar; Gautam Shreedhar Bhat; Issa M S Panahi; Stephanie Tittle; Linda M Thibodeau
Journal:  J Acoust Soc Am       Date:  2021-09       Impact factor: 2.482

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.