| Literature DB >> 33855126 |
Houwei Cao1, Štefan Beňuš2,3, Ruben C Gur1, Ragini Verma1, Ani Nenkova1.
Abstract
In this paper we study the relationship between acted perceptually unambiguous emotion and prosody. Unlike most contemporary approaches which base the analysis of emotion in voice solely on continuous features extracted automatically from the acoustic signal, we analyze the predictive power of discrete characterizations of intonations in the ToBI framework. The goal of our work is to test if particular discrete prosodic events provide significant discriminative power for emotion recognition. Our experiments provide strong evidence that patterns in breaks, boundary tones and type of pitch accent are highly informative of the emotional content of speech. We also present results from automatic prediction of emotion based on ToBI-derived features and compare their prediction power with state-of-the-art bag-of-frame acoustic features. Our results indicate their similar performance in the sentence-dependent emotion prediction tasks, while acoustic features are more robust for the sentence-independent tasks. Finally, we combine ToBI features and acoustic features together and further achieve modest improvements in sentence-independent emotion prediction, particularly in differentiating fear and neutral from other emotion.Entities:
Keywords: ToBI; automatic prediction; emotion
Year: 2014 PMID: 33855126 PMCID: PMC8042645 DOI: 10.21437/SpeechProsody.2014-14
Source DB: PubMed Journal: Speech Prosody ISSN: 2333-2042