Literature DB >> 23231127

A procedure for estimating gestural scores from speech acoustics.

Hosung Nam1, Vikramjit Mitra, Mark Tiede, Mark Hasegawa-Johnson, Carol Espy-Wilson, Elliot Saltzman, Louis Goldstein.   

Abstract

Speech can be represented as a constellation of constricting vocal tract actions called gestures, whose temporal patterning with respect to one another is expressed in a gestural score. Current speech datasets do not come with gestural annotation and no formal gestural annotation procedure exists at present. This paper describes an iterative analysis-by-synthesis landmark-based time-warping architecture to perform gestural annotation of natural speech. For a given utterance, the Haskins Laboratories Task Dynamics and Application (TADA) model is employed to generate a corresponding prototype gestural score. The gestural score is temporally optimized through an iterative timing-warping process such that the acoustic distance between the original and TADA-synthesized speech is minimized. This paper demonstrates that the proposed iterative approach is superior to conventional acoustically-referenced dynamic timing-warping procedures and provides reliable gestural annotation for speech datasets.

Mesh:

Year:  2012        PMID: 23231127      PMCID: PMC3528686          DOI: 10.1121/1.4763545

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  13 in total

1.  An overlapping-feature-based phonological model incorporating linguistic constraints: applications to speech recognition.

Authors:  Jiping Sun; Li Deng
Journal:  J Acoust Soc Am       Date:  2002-02       Impact factor: 1.840

Review 2.  Articulatory phonology: an overview.

Authors:  C P Browman; L Goldstein
Journal:  Phonetica       Date:  1992       Impact factor: 1.759

3.  Prosodic strengthening and featural enhancement: evidence from acoustic and articulatory realizations of /a,i/ in English.

Authors:  Taehong Cho
Journal:  J Acoust Soc Am       Date:  2005-06       Impact factor: 1.840

4.  A quasiarticulatory approach to controlling acoustic source parameters in a Klatt-type formant synthesizer using HLsyn.

Authors:  Helen M Hanson; Kenneth N Stevens
Journal:  J Acoust Soc Am       Date:  2002-09       Impact factor: 1.840

5.  A probabilistic framework for landmark detection based on phonetic features for automatic speech recognition.

Authors:  Amit Juneja; Carol Espy-Wilson
Journal:  J Acoust Soc Am       Date:  2008-02       Impact factor: 1.840

6.  Articulatory strengthening at edges of prosodic domains.

Authors:  C Fougeron; P A Keating
Journal:  J Acoust Soc Am       Date:  1997-06       Impact factor: 1.840

7.  A precursor of language acquisition in young infants.

Authors:  J Mehler; P Jusczyk; G Lambertz; N Halsted; J Bertoncini; C Amiel-Tison
Journal:  Cognition       Date:  1988-07

8.  On the role of spectral transition for speech perception.

Authors:  S Furui
Journal:  J Acoust Soc Am       Date:  1986-10       Impact factor: 1.840

9.  The supraglottal articulation of prominence in English: linguistic stress as localized hyperarticulation.

Authors:  K J de Jong
Journal:  J Acoust Soc Am       Date:  1995-01       Impact factor: 1.840

10.  Timing effects of syllable structure and stress on nasals: a real-time MRI examination.

Authors:  Dani Byrd; Stephen Tobin; Erik Bresch; Shrikanth Narayanan
Journal:  J Phon       Date:  2009-01-01
View more
  4 in total

1.  Spatio-temporal articulatory movement primitives during speech production: extraction, interpretation, and validation.

Authors:  Vikram Ramanarayanan; Louis Goldstein; Shrikanth S Narayanan
Journal:  J Acoust Soc Am       Date:  2013-08       Impact factor: 1.840

2.  Statistical Methods for Estimation of Direct and Differential Kinematics of the Vocal Tract.

Authors:  Adam Lammert; Louis Goldstein; Shrikanth Narayanan; Khalil Iskarous
Journal:  Speech Commun       Date:  2013-01       Impact factor: 2.017

3.  Differential Representation of Articulatory Gestures and Phonemes in Precentral and Inferior Frontal Gyri.

Authors:  Emily M Mugler; Matthew C Tate; Karen Livescu; Jessica W Templer; Matthew A Goldrick; Marc W Slutzky
Journal:  J Neurosci       Date:  2018-09-26       Impact factor: 6.167

4.  The FACTS model of speech motor control: Fusing state estimation and task-based control.

Authors:  Benjamin Parrell; Vikram Ramanarayanan; Srikantan Nagarajan; John Houde
Journal:  PLoS Comput Biol       Date:  2019-09-03       Impact factor: 4.475

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.