Literature DB >> 19132136

MODELING THE INTONATION OF DISCOURSE SEGMENTS FOR IMPROVED ONLINE DIALOG ACT TAGGING.

Sridhar Vivek Kumar Rangarajan1, Shrikanth Narayanan, Srinivas Bangalore.   

Abstract

Prosody is an important cue for identifying dialog acts. In this paper, we show that modeling the sequence of acoustic-prosodic values as n-gram features with a maximum entropy model for dialog act (DA) tagging can perform better than conventional approaches that use coarse representation of the prosodic contour through acoustic correlates of prosody. We also propose a discriminative framework that exploits preceding context in the form of lexical and prosodic cues from previous discourse segments. Such a scheme facilitates online DA tagging and offers robustness in the decoding process, unlike greedy decoding schemes that can potentially propagate errors. Using only lexical and prosodic cues from 3 previous utterances, we achieve a DA tagging accuracy of 72% compared to the best case scenario with accurate knowledge of previous DA tag, which results in 74% accuracy.

Year:  2008        PMID: 19132136      PMCID: PMC2614672          DOI: 10.1109/ICASSP.2008.4518789

Source DB:  PubMed          Journal:  Proc IEEE Int Conf Acoust Speech Signal Process        ISSN: 1520-6149


  1 in total

1.  Can prosody aid the automatic classification of dialog acts in conversational speech?

Authors:  E Shriberg; R Bates; A Stolcke; P Taylor; D Jurafsky; K Ries; N Coccaro; R Martin; M Meteer; C van Ess-Dykema
Journal:  Lang Speech       Date:  1998 Jul-Dec       Impact factor: 1.500

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.