Literature DB >> 17974321

Connecting intonation labels to mathematical descriptions of fundamental frequency.

Esther Grabe1, Greg Kochanski, John Coleman.   

Abstract

The mathematical models of intonation used in speech technology are often inaccessible to linguists. By the same token, phonological descriptions of intonation are rarely used by speech technologists, as they cannot be implemented directly in applications. Consequently, these research communities do not benefit much from each other's insights. In this paper, we explore the interface between the disciplines, in search of bridges between intonational phonology and speech technology. In a corpus of speech data from seven dialects of English, we hand-labeled over 700 sentences and identified seven nuclear accent types. Then we fitted a third-order polynomial to the fundamental frequency (F0) contour in the region around the accent mark. The polynomial captures the local shape (time-dependence) of F0 in a few numbers, in our case, four coefficients. The coefficients were subjected to statistical analysis. Nineteen of the 21 pairs of accent types differed significantly in one or more coefficients. Our approach bridges the gap between intonational phonology and speech technology. It provides quantitative, empirically testable models of intonation labels that can be implemented in applications.

Mesh:

Year:  2007        PMID: 17974321     DOI: 10.1177/00238309070500030101

Source DB:  PubMed          Journal:  Lang Speech        ISSN: 0023-8309            Impact factor:   1.500


  5 in total

1.  Accent detection is a slippery slope: Direction and rate of F0 change drives listeners' comprehension.

Authors:  Angela M Isaacs; Duane G Watson
Journal:  Lang Cogn Process       Date:  2010-01-01

2.  Emotion identification using extremely low frequency components of speech feature contours.

Authors:  Chang-Hong Lin; Wei-Kai Liao; Wen-Chi Hsieh; Wei-Jiun Liao; Jia-Ching Wang
Journal:  ScientificWorldJournal       Date:  2014-05-20

3.  Unifying Amplitude and Phase Analysis: A Compositional Data Approach to Functional Multivariate Mixed-Effects Modeling of Mandarin Chinese.

Authors:  P Z Hadjipantelis; J A D Aston; H G Müller; J P Evans
Journal:  J Am Stat Assoc       Date:  2015-07-06       Impact factor: 5.033

4.  Correction by Focus: Cleft Constructions and the Cross-Linguistic Variation in Phonological Form.

Authors:  Markus Greif; Stavros Skopeteas
Journal:  Front Psychol       Date:  2021-11-29

5.  Identification, Analysis and Characterization of Base Units of Bird Vocal Communication: The White Spectacled Bulbul (Pycnonotus xanthopygos) as a Case Study.

Authors:  Aya Marck; Yoni Vortman; Oren Kolodny; Yizhar Lavner
Journal:  Front Behav Neurosci       Date:  2022-02-14       Impact factor: 3.558

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.