Literature DB >> 33578714

Deep Learning Techniques for Speech Emotion Recognition, from Databases to Models.

Babak Joze Abbaschian1, Daniel Sierra-Sosa1, Adel Elmaghraby1.   

Abstract

The advancements in neural networks and the on-demand need for accurate and near real-time Speech Emotion Recognition (SER) in human-computer interactions make it mandatory to compare available methods and databases in SER to achieve feasible solutions and a firmer understanding of this open-ended problem. The current study reviews deep learning approaches for SER with available datasets, followed by conventional machine learning techniques for speech emotion recognition. Ultimately, we present a multi-aspect comparison between practical neural network approaches in speech emotion recognition. The goal of this study is to provide a survey of the field of discrete speech emotion recognition.

Entities:  

Keywords:  CNN; GAN; LSTM; attention mechanism; autoencoders; deep learning; emotional speech database; machine learning; speech emotion recognition

Year:  2021        PMID: 33578714     DOI: 10.3390/s21041249

Source DB:  PubMed          Journal:  Sensors (Basel)        ISSN: 1424-8220            Impact factor:   3.576


  9 in total

1.  Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features.

Authors:  Ming-Che Lee; Sheng-Cheng Yeh; Jia-Wei Chang; Zhen-Yi Chen
Journal:  Sensors (Basel)       Date:  2022-06-23       Impact factor: 3.847

2.  Investigation of Methods to Create Future Multimodal Emotional Data for Robot Interactions in Patients with Schizophrenia: A Case Study.

Authors:  Kyoko Osaka; Kazuyuki Matsumoto; Toshiya Akiyama; Ryuichi Tanioka; Feni Betriana; Yueren Zhao; Yoshihiro Kai; Misao Miyagawa; Tetsuya Tanioka; Rozzano C Locsin
Journal:  Healthcare (Basel)       Date:  2022-05-05

3.  Construction and Research of Constructive English Teaching Model Applying Multimodal Neural Network Algorithm.

Authors:  Nan Zhang; Hao Wang
Journal:  Comput Intell Neurosci       Date:  2022-05-26

4.  Application of Educational Psychology-Based Dance Therapy in College Students' Life Education.

Authors:  Haiyan Zhong; Chunhui Zhao; Fengrui Zhang; Ruizhi Zhang
Journal:  Front Psychol       Date:  2022-03-21

5.  Frequency, Time, Representation and Modeling Aspects for Major Speech and Audio Processing Applications.

Authors:  Juraj Kacur; Boris Puterka; Jarmila Pavlovicova; Milos Oravec
Journal:  Sensors (Basel)       Date:  2022-08-22       Impact factor: 3.847

6.  Global and local feature fusion via long and short-term memory mechanism for dance emotion recognition in robot.

Authors:  Yin Lyu; Yang Sun
Journal:  Front Neurorobot       Date:  2022-08-24       Impact factor: 3.493

7.  A Comparison of Machine Learning Algorithms and Feature Sets for Automatic Vocal Emotion Recognition in Speech.

Authors:  Cem Doğdu; Thomas Kessler; Dana Schneider; Maha Shadaydeh; Stefan R Schweinberger
Journal:  Sensors (Basel)       Date:  2022-10-06       Impact factor: 3.847

8.  An improved multi-input deep convolutional neural network for automatic emotion recognition.

Authors:  Peiji Chen; Bochao Zou; Abdelkader Nasreddine Belkacem; Xiangwen Lyu; Xixi Zhao; Weibo Yi; Zhaoyang Huang; Jun Liang; Chao Chen
Journal:  Front Neurosci       Date:  2022-10-04       Impact factor: 5.152

9.  Cascaded Convolutional Neural Network Architecture for Speech Emotion Recognition in Noisy Conditions.

Authors:  Youngja Nam; Chankyu Lee
Journal:  Sensors (Basel)       Date:  2021-06-27       Impact factor: 3.576

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.