
Vocal tract normalization for midsagittal articulatory recovery with analysis-by-synthesis.

R S McGowan, S Cushing.

Abstract

A method is presented that accounts for differences in the acoustics of vowel production caused by human talkers' vocal-tract anatomies and postural settings. Such a method is needed by an analysis-by-synthesis procedure designed to recover midsagittal articulatory movement from speech acoustics because the procedure employs an articulatory model as an internal model. The normalization procedure involves the adjustment of parameters of the articulatory model that are not of interest for the midsagittal movement recovery procedure. These parameters are adjusted so that acoustic signals produced by the human and the articulatory model match as closely as possible over an initial set of pairs of corresponding human and model midsagittal shapes. Further, these initial midsagittal shape correspondences need to be generalized so that all midsagittal shapes of the human can be obtained from midsagittal shapes of the model. Once these procedures are complete, the midsagittal articulatory movement recovery algorithm can be used to derive model articulatory trajectories that, subsequently, can be transformed into human articulatory trajectories. In this paper the proposed normalization procedure is outlined and the results of experiments with data from two talkers contained in the X-ray Microbeam Speech Production Database are presented. It was found to be possible to characterize these vocal tracts during vowel production with the proposed procedure and to generalize the initial midsagittal correspondences over a set of vowels to other vowels. The procedure was also found to aid in midsagittal articulatory movement recovery from speech acoustics in a vowel-to-vowel production for the two subjects.
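
The parameter-adjustment step described in the abstract can be illustrated with a deliberately simplified Python sketch. It is not the authors' articulatory model or fitting procedure: as a stand-in, it assumes a uniform tube (closed at the glottis, open at the lips) whose only adjustable parameter is its length, and it fits that length so the tube's resonances match a talker's formants in the least-squares sense. The function names, the speed-of-sound value, and the formant numbers are all hypothetical.

```python
# Toy stand-in for an articulatory model (an assumption, not the paper's model):
# a uniform tube closed at one end, with resonances F_n = (2n - 1) * c / (4 * L).
SPEED_OF_SOUND_CM_S = 35000.0  # assumed value, in cm/s


def tube_formants(length_cm, n_formants=3):
    """Resonances (Hz) of a uniform quarter-wavelength tube of the given length."""
    return [(2 * n - 1) * SPEED_OF_SOUND_CM_S / (4.0 * length_cm)
            for n in range(1, n_formants + 1)]


def fit_tube_length(observed_formants):
    """Least-squares tube length (cm) for observed formants [F1, F2, ...].

    With s = 1/L the model is linear, F_n = k_n * s, so the value of s that
    minimizes sum((F_obs_n - k_n * s) ** 2) is sum(k_n * F_obs_n) / sum(k_n ** 2).
    """
    ks = [(2 * n - 1) * SPEED_OF_SOUND_CM_S / 4.0
          for n in range(1, len(observed_formants) + 1)]
    s = sum(k * f for k, f in zip(ks, observed_formants)) / sum(k * k for k in ks)
    return 1.0 / s


# Hypothetical talker formants for a neutral vowel (made-up numbers, in Hz).
observed = [520.0, 1540.0, 2580.0]
fitted_length = fit_tube_length(observed)
```

In the procedure the abstract describes, several model parameters would be adjusted jointly over many paired human and model shapes; the single-parameter closed form here only shows the general shape of such an acoustic-matching fit.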

Year:  1999        PMID: 10462814     DOI: 10.1121/1.427117

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  3 in total

1.  Acoustic-articulatory mapping in vowels by locally weighted regression.

Authors:  Richard S McGowan; Michael A Berger
Journal:  J Acoust Soc Am       Date:  2009-10       Impact factor: 1.840

2.  Listening for the norm: adaptive coding in speech categorization.

Authors:  Jingyuan Huang; Lori L Holt
Journal:  Front Psychol       Date:  2012-02-01

3.  Tuned with a Tune: Talker Normalization via General Auditory Processes.

Authors:  Erika J C Laing; Ran Liu; Andrew J Lotto; Lori L Holt
Journal:  Front Psychol       Date:  2012-06-22
