Literature DB >> 28396068

Evaluating deep learning architectures for Speech Emotion Recognition.

Haytham M Fayek1, Margaret Lech2, Lawrence Cavedon3.   

Abstract

Speech Emotion Recognition (SER) can be regarded as a static or dynamic classification problem, which makes SER an excellent test bed for investigating and comparing various deep learning architectures. We describe a frame-based formulation to SER that relies on minimal speech processing and end-to-end deep learning to model intra-utterance dynamics. We use the proposed SER system to empirically explore feed-forward and recurrent neural network architectures and their variants. Experiments conducted illuminate the advantages and limitations of these architectures in paralinguistic speech recognition and emotion recognition in particular. As a result of our exploration, we report state-of-the-art results on the IEMOCAP database for speaker-independent SER and present quantitative and qualitative assessments of the models' performances.
Copyright © 2017 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Affective computing; Deep learning; Emotion recognition; Neural networks; Speech recognition

Mesh:

Year:  2017        PMID: 28396068     DOI: 10.1016/j.neunet.2017.02.013

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  24 in total

1.  Wearable Cardiorespiratory Sensors for Aerospace Applications.

Authors:  Nichakorn Pongsakornsathien; Alessandro Gardi; Yixiang Lim; Roberto Sabatini; Trevor Kistan
Journal:  Sensors (Basel)       Date:  2022-06-21       Impact factor: 3.847

2.  An Urdu speech corpus for emotion recognition.

Authors:  Awais Asghar; Sarmad Sohaib; Saman Iftikhar; Muhammad Shafi; Kiran Fatima
Journal:  PeerJ Comput Sci       Date:  2022-05-09

3.  Emotional sounds of crowds: spectrogram-based analysis using deep learning.

Authors:  Valentina Franzoni; Giulio Biondi; Alfredo Milani
Journal:  Multimed Tools Appl       Date:  2020-08-17       Impact factor: 2.757

4.  A Survey of Deep Network Techniques All Classifiers Can Adopt.

Authors:  Alireza Ghods; Diane J Cook
Journal:  Data Min Knowl Discov       Date:  2020-11-17       Impact factor: 3.670

5.  Sensor Networks for Aerospace Human-Machine Systems.

Authors:  Nichakorn Pongsakornsathien; Yixiang Lim; Alessandro Gardi; Samuel Hilton; Lars Planke; Roberto Sabatini; Trevor Kistan; Neta Ezer
Journal:  Sensors (Basel)       Date:  2019-08-08       Impact factor: 3.576

6.  Speech Emotion Recognition Based on Selective Interpolation Synthetic Minority Over-Sampling Technique in Small Sample Environment.

Authors:  Zhen-Tao Liu; Bao-Han Wu; Dan-Yun Li; Peng Xiao; Jun-Wei Mao
Journal:  Sensors (Basel)       Date:  2020-04-17       Impact factor: 3.576

7.  Machine learning to assist clinical decision-making during the COVID-19 pandemic.

Authors:  Shubham Debnath; Douglas P Barnaby; Kevin Coppa; Alexander Makhnevich; Eun Ji Kim; Saurav Chatterjee; Viktor Tóth; Todd J Levy; Marc D Paradis; Stuart L Cohen; Jamie S Hirsch; Theodoros P Zanos
Journal:  Bioelectron Med       Date:  2020-07-10

8.  A CNN-Assisted Enhanced Audio Signal Processing for Speech Emotion Recognition.

Authors:  Soonil Kwon
Journal:  Sensors (Basel)       Date:  2019-12-28       Impact factor: 3.576

9.  Impact of Feature Selection Algorithm on Speech Emotion Recognition Using Deep Convolutional Neural Network.

Authors:  Misbah Farooq; Fawad Hussain; Naveed Khan Baloch; Fawad Riasat Raja; Heejung Yu; Yousaf Bin Zikria
Journal:  Sensors (Basel)       Date:  2020-10-23       Impact factor: 3.576

10.  Deep-Net: A Lightweight CNN-Based Speech Emotion Recognition System Using Deep Frequency Features.

Authors:  Tursunov Anvarjon; Soonil Kwon
Journal:  Sensors (Basel)       Date:  2020-09-12       Impact factor: 3.576

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.