Literature DB >> 19122857

Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence.

Sankaranarayanan Ananthakrishnan1, Shrikanth S Narayanan.   

Abstract

With the advent of prosody annotation standards such as tones and break indices (ToBI), speech technologists and linguists alike have been interested in automatically detecting prosodic events in speech. This is because the prosodic tier provides an additional layer of information over the short-term segment-level features and lexical representation of an utterance. As the prosody of an utterance is closely tied to its syntactic and semantic content in addition to its lexical content, knowledge of the prosodic events within and across utterances can assist spoken language applications such as automatic speech recognition and translation. On the other hand, corpora annotated with prosodic events are useful for building natural-sounding speech synthesizers. In this paper, we build an automatic detector and classifier for prosodic events in American English, based on their acoustic, lexical, and syntactic correlates. Following previous work in this area, we focus on accent (prominence, or "stress") and prosodic phrase boundary detection at the syllable level. Our experiments achieved a performance rate of 86.75% agreement on the accent detection task, and 91.61% agreement on the phrase boundary detection task on the Boston University Radio News Corpus.

Year:  2008        PMID: 19122857      PMCID: PMC2600436          DOI: 10.1109/TASL.2007.907570

Source DB:  PubMed          Journal:  IEEE Trans Audio Speech Lang Process        ISSN: 1558-7916


  1 in total

1.  Segmental durations in the vicinity of prosodic phrase boundaries.

Authors:  C W Wightman; S Shattuck-Hufnagel; M Ostendorf; P J Price
Journal:  J Acoust Soc Am       Date:  1992-03       Impact factor: 1.840

  1 in total
  5 in total

1.  FINE-GRAINED PITCH ACCENT AND BOUNDARY TONE LABELING WITH PARAMETRIC F0 FEATURES.

Authors:  Sankaranarayanan Ananthakrishnan; Shrikanth Narayanan
Journal:  Proc IEEE Int Conf Acoust Speech Signal Process       Date:  2008

2.  A NOVEL ALGORITHM FOR UNSUPERVISED PROSODIC LANGUAGE MODEL ADAPTATION.

Authors:  Sankaranarayanan Ananthakrishnan; Shrikanth Narayanan
Journal:  Proc IEEE Int Conf Acoust Speech Signal Process       Date:  2008

3.  Prominence Detection Using Auditory Attention Cues and Task-Dependent High Level Information.

Authors:  Ozlem Kalinli; Shrikanth Narayanan
Journal:  IEEE Trans Audio Speech Lang Process       Date:  2009-07-01

4.  Unsupervised Adaptation of Categorical Prosody Models for Prosody Labeling and Speech Recognition.

Authors:  Sankaranarayanan Ananthakrishnan; Shrikanth Narayanan
Journal:  IEEE Trans Audio Speech Lang Process       Date:  2009-01-01

5.  Behavioral Signal Processing: Deriving Human Behavioral Informatics From Speech and Language: Computational techniques are presented to analyze and model expressed and perceived human behavior-variedly characterized as typical, atypical, distressed, and disordered-from speech and language cues and their applications in health, commerce, education, and beyond.

Authors:  Shrikanth Narayanan; Panayiotis G Georgiou
Journal:  Proc IEEE Inst Electr Electron Eng       Date:  2013-02-07       Impact factor: 10.961

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.