Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A procedure for estimating gestural scores from speech acoustics.

Literature DB >> 23231127

A procedure for estimating gestural scores from speech acoustics.

Hosung Nam¹, Vikramjit Mitra, Mark Tiede, Mark Hasegawa-Johnson, Carol Espy-Wilson, Elliot Saltzman, Louis Goldstein.

Abstract

Speech can be represented as a constellation of constricting vocal tract actions called gestures, whose temporal patterning with respect to one another is expressed in a gestural score. Current speech datasets do not come with gestural annotation and no formal gestural annotation procedure exists at present. This paper describes an iterative analysis-by-synthesis landmark-based time-warping architecture to perform gestural annotation of natural speech. For a given utterance, the Haskins Laboratories Task Dynamics and Application (TADA) model is employed to generate a corresponding prototype gestural score. The gestural score is temporally optimized through an iterative timing-warping process such that the acoustic distance between the original and TADA-synthesized speech is minimized. This paper demonstrates that the proposed iterative approach is superior to conventional acoustically-referenced dynamic timing-warping procedures and provides reliable gestural annotation for speech datasets.

Mesh：

Year: 2012 PMID： 23231127 PMCID： PMC3528686 DOI： 10.1121/1.4763545

Source DB: PubMed Journal: J Acoust Soc Am ISSN： 0001-4966 Impact factor: 1.840

13 in total

1. An overlapping-feature-based phonological model incorporating linguistic constraints: applications to speech recognition.

Authors: Jiping Sun; Li Deng
Journal: J Acoust Soc Am Date: 2002-02 Impact factor: 1.840

Review 2. Articulatory phonology: an overview.

Authors: C P Browman; L Goldstein
Journal: Phonetica Date: 1992 Impact factor: 1.759

3. Prosodic strengthening and featural enhancement: evidence from acoustic and articulatory realizations of /a,i/ in English.

Authors: Taehong Cho
Journal: J Acoust Soc Am Date: 2005-06 Impact factor: 1.840

4. A quasiarticulatory approach to controlling acoustic source parameters in a Klatt-type formant synthesizer using HLsyn.

Authors: Helen M Hanson; Kenneth N Stevens
Journal: J Acoust Soc Am Date: 2002-09 Impact factor: 1.840

5. A probabilistic framework for landmark detection based on phonetic features for automatic speech recognition.

Authors: Amit Juneja; Carol Espy-Wilson
Journal: J Acoust Soc Am Date: 2008-02 Impact factor: 1.840

6. Articulatory strengthening at edges of prosodic domains.

Authors: C Fougeron; P A Keating
Journal: J Acoust Soc Am Date: 1997-06 Impact factor: 1.840

7. A precursor of language acquisition in young infants.

Authors: J Mehler; P Jusczyk; G Lambertz; N Halsted; J Bertoncini; C Amiel-Tison
Journal: Cognition Date: 1988-07

8. On the role of spectral transition for speech perception.

Authors: S Furui
Journal: J Acoust Soc Am Date: 1986-10 Impact factor: 1.840

9. The supraglottal articulation of prominence in English: linguistic stress as localized hyperarticulation.

Authors: K J de Jong
Journal: J Acoust Soc Am Date: 1995-01 Impact factor: 1.840

10. Timing effects of syllable structure and stress on nasals: a real-time MRI examination.

Authors: Dani Byrd; Stephen Tobin; Erik Bresch; Shrikanth Narayanan
Journal: J Phon Date: 2009-01-01

4 in total

1. Spatio-temporal articulatory movement primitives during speech production: extraction, interpretation, and validation.

Authors: Vikram Ramanarayanan; Louis Goldstein; Shrikanth S Narayanan
Journal: J Acoust Soc Am Date: 2013-08 Impact factor: 1.840