Literature DB >> 19603083

Exploiting Acoustic and Syntactic Features for Automatic Prosody Labeling in a Maximum Entropy Framework.

Vivek Kumar Rangarajan Sridhar1, Srinivas Bangalore, Shrikanth S Narayanan.   

Abstract

In this paper, we describe a maximum entropy-based automatic prosody labeling framework that exploits both language and speech information. We apply the proposed framework to both prominence and phrase structure detection within the Tones and Break Indices (ToBI) annotation scheme. Our framework utilizes novel syntactic features in the form of supertags and a quantized acoustic-prosodic feature representation that is similar to linear parameterizations of the prosodic contour. The proposed model is trained discriminatively and is robust in the selection of appropriate features for the task of prosody detection. The proposed maximum entropy acoustic-syntactic model achieves pitch accent and boundary tone detection accuracies of 86.0% and 93.1% on the Boston University Radio News corpus, and, 79.8% and 90.3% on the Boston Directions corpus. The phrase structure detection through prosodic break index labeling provides accuracies of 84% and 87% on the two corpora, respectively. The reported results are significantly better than previously reported results and demonstrate the strength of maximum entropy model in jointly modeling simple lexical, syntactic, and acoustic features for automatic prosody labeling.

Year:  2008        PMID: 19603083      PMCID: PMC2709295          DOI: 10.1109/TASL.2008.917071

Source DB:  PubMed          Journal:  IEEE Trans Audio Speech Lang Process        ISSN: 1558-7916


  5 in total

1.  Intonation and dialog context as constraints for speech recognition.

Authors:  P Taylor; S King; S Isard; H Wright
Journal:  Lang Speech       Date:  1998 Jul-Dec       Impact factor: 1.500

2.  Can prosody aid the automatic classification of dialog acts in conversational speech?

Authors:  E Shriberg; R Bates; A Stolcke; P Taylor; D Jurafsky; K Ries; N Coccaro; R Martin; M Meteer; C van Ess-Dykema
Journal:  Lang Speech       Date:  1998 Jul-Dec       Impact factor: 1.500

3.  Segmental durations in the vicinity of prosodic phrase boundaries.

Authors:  C W Wightman; S Shattuck-Hufnagel; M Ostendorf; P J Price
Journal:  J Acoust Soc Am       Date:  1992-03       Impact factor: 1.840

4.  The use of prosody in syntactic disambiguation.

Authors:  P J Price; M Ostendorf; S Shattuck-Hufnagel; C Fong
Journal:  J Acoust Soc Am       Date:  1991-12       Impact factor: 1.840

5.  Exploiting Acoustic and Syntactic Features for Automatic Prosody Labeling in a Maximum Entropy Framework.

Authors:  Vivek Kumar Rangarajan Sridhar; Srinivas Bangalore; Shrikanth S Narayanan
Journal:  IEEE Trans Audio Speech Lang Process       Date:  2008
  5 in total
  3 in total

1.  Exploiting Acoustic and Syntactic Features for Automatic Prosody Labeling in a Maximum Entropy Framework.

Authors:  Vivek Kumar Rangarajan Sridhar; Srinivas Bangalore; Shrikanth S Narayanan
Journal:  IEEE Trans Audio Speech Lang Process       Date:  2008

2.  A Comparison of Natural Language Processing Methods for Automated Coding of Motivational Interviewing.

Authors:  Michael Tanana; Kevin A Hallgren; Zac E Imel; David C Atkins; Vivek Srikumar
Journal:  J Subst Abuse Treat       Date:  2016-01-28

3.  Behavioral Signal Processing: Deriving Human Behavioral Informatics From Speech and Language: Computational techniques are presented to analyze and model expressed and perceived human behavior-variedly characterized as typical, atypical, distressed, and disordered-from speech and language cues and their applications in health, commerce, education, and beyond.

Authors:  Shrikanth Narayanan; Panayiotis G Georgiou
Journal:  Proc IEEE Inst Electr Electron Eng       Date:  2013-02-07       Impact factor: 10.961

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.