Environmental sound classification using temporal-frequency attention based convolutional neural network.

Wenjie Mu, Bo Yin, Xianqing Huang, Jiali Xu, Zehua Du.

Abstract

Environmental sound classification is an important problem in audio recognition. Compared with structured sounds such as speech and music, environmental sounds have a more complicated time-frequency structure. To learn time and frequency features from the Log-Mel spectrogram more effectively, this paper proposes a temporal-frequency attention based convolutional neural network model (TFCNN). First, a motivating experiment is designed to verify the effect of a specific frequency band in the spectrogram on model classification. Second, two new attention mechanisms are proposed: a temporal attention mechanism and a frequency attention mechanism. These mechanisms focus on key frequency bands and semantically related time frames in the spectrogram, reducing the influence of background noise and irrelevant frequency bands. The two mechanisms are then combined so that their feature information is complementary, which captures the critical time-frequency features more accurately. In this way, the representation ability of the network model can be greatly improved. Finally, experiments on two public data sets, UrbanSound8K and ESC-50, demonstrate the effectiveness of the proposed method.
© 2021. The Author(s).
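
The idea of weighting frequency bands and time frames separately, as described in the abstract, can be illustrated with a small sketch. This is not the TFCNN architecture: the abstract does not specify how the attention weights are computed or combined, so the mean pooling, the softmax scoring, and the additive combination below are all assumptions chosen for simplicity, and every function name is hypothetical. The code uses only the Python standard library and a toy spectrogram patch.

```python
import math

def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

def frequency_attention(spec):
    """One weight per frequency band (row), scored by its mean over time.
    Assumption: mean pooling + softmax stands in for a learned scorer."""
    return softmax([sum(row) / len(row) for row in spec])

def temporal_attention(spec):
    """One weight per time frame (column), scored by its mean over frequency."""
    n_bands = len(spec)
    n_frames = len(spec[0])
    return softmax([sum(row[t] for row in spec) / n_bands for t in range(n_frames)])

def apply_tf_attention(spec):
    """Scale each bin by (band weight + frame weight).
    Assumption: the additive combination is illustrative only."""
    f_w = frequency_attention(spec)
    t_w = temporal_attention(spec)
    return [[x * (f_w[f] + t_w[t]) for t, x in enumerate(row)]
            for f, row in enumerate(spec)]

# Toy 3-band x 4-frame patch (made-up values): band 1 and frame 2 carry the event.
spec = [
    [0.1, 0.1, 0.2, 0.1],
    [0.2, 0.3, 2.0, 0.3],
    [0.1, 0.1, 0.3, 0.1],
]
f_w = frequency_attention(spec)
t_w = temporal_attention(spec)
attended = apply_tf_attention(spec)
```

In this toy patch the largest frequency weight falls on the middle band and the largest temporal weight on the third frame, so the bin where the event sits is emphasized relative to the background bins. In a trained network the scores would come from learned layers rather than from plain averages.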

Year: 2021    PMID: 34732762    PMCID: PMC8566500    DOI: 10.1038/s41598-021-01045-4

Source DB: PubMed    Journal: Sci Rep    ISSN: 2045-2322    Impact factor: 4.379
