Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Audio enhancement and intelligent classification of household sound events using a sparsely deployed array.

Literature DB >> 32006977

Audio enhancement and intelligent classification of household sound events using a sparsely deployed array.

Mingsian R Bai¹, Shih-Syuan Lan¹, Jong-Yi Huang¹, Yi-Cheng Hsu¹, Hing-Cheung So².

Abstract

A household sound event classification system consisting of an audio localization and enhancement front-end cascaded with an intelligent classification back-end is presented. The front-end is composed of a sparsely deployed microphone array and a preprocessing unit to localize the source and extract the associated signal. In the front-end, a two-stage method and a direct method are compared for localization. The two-stage method introduces a subspace algorithm to estimate the time difference of arrival, followed by a constrained least squares algorithm to determine the source location. The direct localization methods, the delay-and-sum beamformer, the minimum power distortionless response beamformer, and the multiple signal classification algorithm are compared in terms of localization performance for sparse array configuration. A modified particle swarm optimization algorithm enabled an efficient grid-search. A minimum variance distortionless response beamformer in conjunction with a minimum-mean-square-error postfilter is exploited to extract the source signals for sound event classification tasks that follow. The back-end of the system is a sound event classifier that is based on convolutional neural networks (CNNs), and convolutional long short-term memory networks Mel-spectrograms are used as the input features to the CNNs. Simulations and experiments conducted in a live room have demonstrated the strength and weakness of the direct and two-stage methods. Signal quality enhancement using the array-based front-end proves beneficial for improved classification accuracy over a single microphone.

Year: 2020 PMID： 32006977 DOI： 10.1121/10.0000492

Source DB: PubMed Journal: J Acoust Soc Am ISSN： 0001-4966 Impact factor: 1.840

Keyword Cloud
Cited

2 in total

1. Acoustic- and Radio-Frequency-Based Human Activity Recognition.

Authors: Masoud Mohtadifar; Michael Cheffena; Alireza Pourafzal
Journal: Sensors (Basel) Date: 2022-04-19 Impact factor: 3.847

2. Intelligent Sports Video Classification Based on Deep Neural Network (DNN) Algorithm and Transfer Learning.

Authors: Xiaoping Guo
Journal: Comput Intell Neurosci Date: 2021-11-24

2 in total