Literature DB >> 23794771

Class-Level Spectral Features for Emotion Recognition.

Dmitri Bitouk1, Ragini Verma, Ani Nenkova.   

Abstract

The most common approaches to automatic emotion recognition rely on utterance level prosodic features. Recent studies have shown that utterance level statistics of segmental spectral features also contain rich information about expressivity and emotion. In our work we introduce a more fine-grained yet robust set of spectral features: statistics of Mel-Frequency Cepstral Coefficients computed over three phoneme type classes of interest-stressed vowels, unstressed vowels and consonants in the utterance. We investigate performance of our features in the task of speaker-independent emotion recognition using two publicly available datasets. Our experimental results clearly indicate that indeed both the richer set of spectral features and the differentiation between phoneme type classes are beneficial for the task. Classification accuracies are consistently higher for our features compared to prosodic or utterance-level spectral features. Combination of our phoneme class features with prosodic features leads to even further improvement. Given the large number of class-level spectral features, we expected feature selection will improve results even further, but none of several selection methods led to clear gains. Further analyses reveal that spectral features computed from consonant regions of the utterance contain more information about emotion than either stressed or unstressed vowel features. We also explore how emotion recognition accuracy depends on utterance length. We show that, while there is no significant dependence for utterance-level prosodic features, accuracy of emotion recognition using class-level spectral features increases with the utterance length.

Entities:  

Year:  2010        PMID: 23794771      PMCID: PMC3686526          DOI: 10.1016/j.specom.2010.02.010

Source DB:  PubMed          Journal:  Speech Commun        ISSN: 0167-6393            Impact factor:   2.017


  1 in total

1.  Acoustic profiles in vocal emotion expression.

Authors:  R Banse; K R Scherer
Journal:  J Pers Soc Psychol       Date:  1996-03
  1 in total
  5 in total

1.  Acoustic and Lexical Representations for Affect Prediction in Spontaneous Conversations.

Authors:  Houwei Cao; Arman Savran; Ragini Verma; Ani Nenkova
Journal:  Comput Speech Lang       Date:  2015-01-01       Impact factor: 1.899

2.  Speaker-sensitive emotion recognition via ranking: Studies on acted and spontaneous speech

Authors:  Houwei Cao; Ragini Verma; Ani Nenkova
Journal:  Comput Speech Lang       Date:  2015-01       Impact factor: 1.899

3.  Combining Video, Audio and Lexical Indicators of Affect in Spontaneous Conversation via Particle Filtering.

Authors:  Arman Savran; Houwei Cao; Miraj Shah; Ani Nenkova; Ragini Verma
Journal:  Proc ACM Int Conf Multimodal Interact       Date:  2012

4.  Predicting couple therapy outcomes based on speech acoustic features.

Authors:  Md Nasir; Brian Robert Baucom; Panayiotis Georgiou; Shrikanth Narayanan
Journal:  PLoS One       Date:  2017-09-21       Impact factor: 3.240

5.  A Comparison of Machine Learning Algorithms and Feature Sets for Automatic Vocal Emotion Recognition in Speech.

Authors:  Cem Doğdu; Thomas Kessler; Dana Schneider; Maha Shadaydeh; Stefan R Schweinberger
Journal:  Sensors (Basel)       Date:  2022-10-06       Impact factor: 3.847

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.