
Automatic measurement of voice onset time using discriminative structured prediction.

Morgan Sonderegger, Joseph Keshet.

Abstract

A discriminative large-margin algorithm for automatic measurement of voice onset time (VOT) is described, considered as a case of predicting structured output from speech. Manually labeled data are used to train a function that takes as input a speech segment of an arbitrary length containing a voiceless stop, and outputs its VOT. The function is explicitly trained to minimize the difference between predicted and manually measured VOT; it operates on a set of acoustic feature functions designed based on spectral and temporal cues used by human VOT annotators. The algorithm is applied to initial voiceless stops from four corpora, representing different types of speech. Using several evaluation methods, the algorithm's performance is near human intertranscriber reliability, and compares favorably with previous work. Furthermore, the algorithm's performance is minimally affected by training and testing on different corpora, and remains essentially constant as the amount of training data is reduced to 50-250 manually labeled examples, demonstrating the method's practical applicability to new datasets.
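As a rough illustration of the setup the abstract describes, the sketch below shows one common way a structured predictor of this kind can be arranged: a linear score over feature functions of the signal and a candidate pair of onsets, maximized by exhaustive search, with the error measured as the difference between predicted and manually measured VOT. It is not taken from the paper. The feature map `toy_features`, the frame-energy input, the search bounds `min_vot` and `max_vot`, and the weight vector are all hypothetical stand-ins, and the training step that would learn the weights is omitted.

```python
def toy_features(energy, t_burst, t_voice):
    """Hypothetical feature map phi(x, t_burst, t_voice); illustration only.

    `energy` is a list of per-frame energies. The paper's actual features
    are based on spectral and temporal cues and are not reproduced here.
    """
    burst_jump = energy[t_burst] - energy[t_burst - 1]  # rise at burst onset
    voice_jump = energy[t_voice] - energy[t_voice - 1]  # rise at voicing onset
    return [burst_jump, voice_jump]


def predict_onsets(energy, w, min_vot=1, max_vot=50):
    """Return the (t_burst, t_voice) pair that maximizes w . phi.

    Exhaustive search over all pairs whose separation lies within
    [min_vot, max_vot] frames (assumed bounds).
    """
    n = len(energy)
    best_pair, best_score = None, float("-inf")
    for t_burst in range(1, n - min_vot):
        last_voice = min(t_burst + max_vot, n - 1)
        for t_voice in range(t_burst + min_vot, last_voice + 1):
            phi = toy_features(energy, t_burst, t_voice)
            score = sum(wi * fi for wi, fi in zip(w, phi))
            if score > best_score:
                best_pair, best_score = (t_burst, t_voice), score
    return best_pair


def vot_loss(pred, gold):
    """Absolute difference, in frames, between predicted and manual VOT."""
    return abs((pred[1] - pred[0]) - (gold[1] - gold[0]))


# Toy signal: silence, a burst at frame 3, voicing at frame 7.
energy = [0.0, 0.0, 0.0, 5.0, 5.0, 5.0, 5.0, 20.0, 20.0, 20.0]
w = [1.0, 1.0]  # assumed weights; in practice these would be learned
pred = predict_onsets(energy, w)
```

In a large-margin training scheme, the weights would be adjusted so that the manually labeled onset pair scores higher than competing pairs by a margin that grows with `vot_loss`. That step is left out of this sketch.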

Year:  2012        PMID: 23231126     DOI: 10.1121/1.4763995

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  6 in total

1.  Automatic measurement of vowel duration via structured prediction.

Authors:  Yossi Adi; Joseph Keshet; Emily Cibelli; Erin Gustafson; Cynthia Clopper; Matthew Goldrick
Journal:  J Acoust Soc Am       Date:  2016-12       Impact factor: 1.840

2.  Sequence segmentation using joint RNN and structured prediction models.

Authors:  Yossi Adi; Joseph Keshet; Emily Cibelli; Matthew Goldrick
Journal:  Proc IEEE Int Conf Acoust Speech Signal Process       Date:  2017-06-19

3.  Using automated acoustic analysis to explore the link between planning and articulation in second language speech production.

Authors:  Matthew Goldrick; Yosi Shrem; Oriana Kilbourn-Ceron; Cristina Baus; Joseph Keshet
Journal:  Lang Cogn Neurosci       Date:  2020-08-19       Impact factor: 2.331

4.  Automatic analysis of slips of the tongue: Insights into the cognitive architecture of speech production.

Authors:  Matthew Goldrick; Joseph Keshet; Erin Gustafson; Jordana Heller; Jeremy Needle
Journal:  Cognition       Date:  2016-01-09

5.  Voice Onset Time (VOT) at 50: Theoretical and practical issues in measuring voicing distinctions.

Authors:  Arthur S Abramson; D H Whalen
Journal:  J Phon       Date:  2017-05-23

6.  Chronset: An automated tool for detecting speech onset.

Authors:  Frédéric Roux; Blair C Armstrong; Manuel Carreiras
Journal:  Behav Res Methods       Date:  2017-10