Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Evaluating deep learning architectures for Speech Emotion Recognition.

Literature DB >> 28396068

Evaluating deep learning architectures for Speech Emotion Recognition.

Haytham M Fayek¹, Margaret Lech², Lawrence Cavedon³.

Abstract

Speech Emotion Recognition (SER) can be regarded as a static or dynamic classification problem, which makes SER an excellent test bed for investigating and comparing various deep learning architectures. We describe a frame-based formulation to SER that relies on minimal speech processing and end-to-end deep learning to model intra-utterance dynamics. We use the proposed SER system to empirically explore feed-forward and recurrent neural network architectures and their variants. Experiments conducted illuminate the advantages and limitations of these architectures in paralinguistic speech recognition and emotion recognition in particular. As a result of our exploration, we report state-of-the-art results on the IEMOCAP database for speaker-independent SER and present quantitative and qualitative assessments of the models' performances.

Entities: Chemical Disease

Keywords: Affective computing; Deep learning; Emotion recognition; Neural networks; Speech recognition

Mesh：

Year: 2017 PMID： 28396068 DOI： 10.1016/j.neunet.2017.02.013

Source DB: PubMed Journal: Neural Netw ISSN： 0893-6080

Keyword Cloud
Cited

24 in total

1. Wearable Cardiorespiratory Sensors for Aerospace Applications.

Authors: Nichakorn Pongsakornsathien; Alessandro Gardi; Yixiang Lim; Roberto Sabatini; Trevor Kistan
Journal: Sensors (Basel) Date: 2022-06-21 Impact factor: 3.847

2. An Urdu speech corpus for emotion recognition.

Authors: Awais Asghar; Sarmad Sohaib; Saman Iftikhar; Muhammad Shafi; Kiran Fatima
Journal: PeerJ Comput Sci Date: 2022-05-09

3. Emotional sounds of crowds: spectrogram-based analysis using deep learning.

Authors: Valentina Franzoni; Giulio Biondi; Alfredo Milani
Journal: Multimed Tools Appl Date: 2020-08-17 Impact factor: 2.757

4. A Survey of Deep Network Techniques All Classifiers Can Adopt.

Authors: Alireza Ghods; Diane J Cook
Journal: Data Min Knowl Discov Date: 2020-11-17 Impact factor: 3.670

5. Sensor Networks for Aerospace Human-Machine Systems.

Authors: Nichakorn Pongsakornsathien; Yixiang Lim; Alessandro Gardi; Samuel Hilton; Lars Planke; Roberto Sabatini; Trevor Kistan; Neta Ezer
Journal: Sensors (Basel) Date: 2019-08-08 Impact factor: 3.576

6. Speech Emotion Recognition Based on Selective Interpolation Synthetic Minority Over-Sampling Technique in Small Sample Environment.

Authors: Zhen-Tao Liu; Bao-Han Wu; Dan-Yun Li; Peng Xiao; Jun-Wei Mao
Journal: Sensors (Basel) Date: 2020-04-17 Impact factor: 3.576

Evaluating deep learning architectures for Speech Emotion Recognition.

1. Wearable Cardiorespiratory Sensors for Aerospace Applications.

2. An Urdu speech corpus for emotion recognition.

3. Emotional sounds of crowds: spectrogram-based analysis using deep learning.

4. A Survey of Deep Network Techniques All Classifiers Can Adopt.

5. Sensor Networks for Aerospace Human-Machine Systems.

6. Speech Emotion Recognition Based on Selective Interpolation Synthetic Minority Over-Sampling Technique in Small Sample Environment.

7. Machine learning to assist clinical decision-making during the COVID-19 pandemic.

8. A CNN-Assisted Enhanced Audio Signal Processing for Speech Emotion Recognition.

9. Impact of Feature Selection Algorithm on Speech Emotion Recognition Using Deep Convolutional Neural Network.

10. Deep-Net: A Lightweight CNN-Based Speech Emotion Recognition System Using Deep Frequency Features.