
Linear versus deep learning methods for noisy speech separation for EEG-informed attention decoding.

Neetha Das, Jeroen Zegers, Hugo Van Hamme, Tom Francart, Alexander Bertrand

Abstract

OBJECTIVE: A hearing aid's noise reduction algorithm cannot infer to which speaker the user intends to listen. Auditory attention decoding (AAD) algorithms make it possible to infer this information from neural signals, which leads to the concept of neuro-steered hearing aids. We aim to evaluate and demonstrate the feasibility of AAD-supported speech enhancement in challenging noisy conditions based on electroencephalography recordings.

APPROACH: AAD performance with linear versus deep neural network (DNN) based speaker separation was evaluated for same-gender speaker mixtures using three different speaker positions and three different noise conditions.

MAIN RESULTS: AAD results based on the linear approach were found to be at least on par with, and sometimes better than, purely DNN-based approaches in terms of AAD accuracy in all tested conditions. However, when the DNN was used to support a linear data-driven beamformer, a performance improvement over the purely linear approach was obtained in the most challenging scenarios. The use of multiple microphones was also found to improve speaker separation and AAD performance over single-microphone systems.

SIGNIFICANCE: Recent proof-of-concept studies in this context each focus on a different method in a different experimental setting, which makes them hard to compare. Furthermore, they are tested in highly idealized experimental conditions that are still far from a realistic hearing aid setting. This work provides a systematic comparison of linear and non-linear neuro-steered speech enhancement models, as well as a more realistic validation in challenging conditions.
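
Note: the "linear approach" above refers to the stimulus-reconstruction scheme that is standard in the AAD literature, where a linear spatio-temporal decoder reconstructs the attended speech envelope from the EEG and attention is decided by correlating the reconstruction with the candidate speakers' envelopes. The sketch below illustrates that general scheme only; it is not the authors' exact pipeline, and the sampling rate, lag range, regularization value, and function names are illustrative assumptions.

```python
# Minimal sketch of linear stimulus-reconstruction AAD.
# All parameter values are illustrative, not taken from the paper.
import numpy as np

def lagged_matrix(eeg, n_lags):
    """Stack time-lagged copies of the EEG (samples x channels)
    into a design matrix of shape (samples, channels * n_lags)."""
    n_samples, n_channels = eeg.shape
    X = np.zeros((n_samples, n_channels * n_lags))
    for lag in range(n_lags):
        X[lag:, lag * n_channels:(lag + 1) * n_channels] = eeg[:n_samples - lag]
    return X

def train_decoder(eeg, attended_env, n_lags=25, reg=1e3):
    """Ridge-regression decoder mapping lagged EEG to the attended
    speech envelope (least squares with Tikhonov regularization)."""
    X = lagged_matrix(eeg, n_lags)
    XtX = X.T @ X + reg * np.eye(X.shape[1])
    return np.linalg.solve(XtX, X.T @ attended_env)

def decode_attention(decoder, eeg, env1, env2, n_lags=25):
    """Reconstruct the envelope from EEG and pick the speaker whose
    envelope correlates best with the reconstruction."""
    rec = lagged_matrix(eeg, n_lags) @ decoder
    r1 = np.corrcoef(rec, env1)[0, 1]
    r2 = np.corrcoef(rec, env2)[0, 1]
    return 1 if r1 >= r2 else 2

# Toy usage with random data (shapes only; real use needs band-passed,
# downsampled EEG and speech envelopes at the same rate):
fs = 20                                  # Hz after downsampling (assumed)
eeg = np.random.randn(60 * fs, 64)       # 60 s, 64 channels
env_a = np.random.randn(60 * fs)         # attended-speaker envelope
env_b = np.random.randn(60 * fs)         # competing-speaker envelope
w = train_decoder(eeg, env_a)
print(decode_attention(w, eeg, env_a, env_b))
```

In a real evaluation, the decoder is trained on held-out data and the correlation-based decision is made per trial window; in the noisy multi-microphone setting studied here, the candidate envelopes must first be extracted by a speaker-separation front end (linear beamformer or DNN), which is exactly the component the paper compares.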


Year: 2020        PMID: 32679578        DOI: 10.1088/1741-2552/aba6f8

Source DB: PubMed        Journal: J Neural Eng        ISSN: 1741-2552        Impact factor: 5.379


  4 in total

1. [Review]  Harnessing the Power of Artificial Intelligence in Otolaryngology and the Communication Sciences.

Authors:  Blake S Wilson; Debara L Tucci; David A Moses; Edward F Chang; Nancy M Young; Fan-Gang Zeng; Nicholas A Lesica; Andrés M Bur; Hannah Kavookjian; Caroline Mussatto; Joseph Penn; Sara Goodwin; Shannon Kraft; Guanghui Wang; Jonathan M Cohen; Geoffrey S Ginsburg; Geraldine Dawson; Howard W Francis
Journal:  J Assoc Res Otolaryngol       Date:  2022-04-20

2.  A Speech-Level-Based Segmented Model to Decode the Dynamic Auditory Attention States in the Competing Speaker Scenes.

Authors:  Lei Wang; Yihan Wang; Zhixing Liu; Ed X Wu; Fei Chen
Journal:  Front Neurosci       Date:  2022-02-10       Impact factor: 4.677

3.  Synchronization of ear-EEG and audio streams in a portable research hearing device.

Authors:  Steffen Dasenbrock; Sarah Blum; Paul Maanen; Stefan Debener; Volker Hohmann; Hendrik Kayser
Journal:  Front Neurosci       Date:  2022-09-01       Impact factor: 5.152

4.  A particle swarm optimization improved BP neural network intelligent model for electrocardiogram classification.

Authors:  Guixiang Li; Zhongwei Tan; Weikang Xu; Fei Xu; Lei Wang; Jun Chen; Kai Wu
Journal:  BMC Med Inform Decis Mak       Date:  2021-07-30       Impact factor: 2.796

