Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A generalized smoothness criterion for acoustic-to-articulatory inversion.

Literature DB >> 20968386

A generalized smoothness criterion for acoustic-to-articulatory inversion.

Prasanta Kumar Ghosh¹, Shrikanth Narayanan.

Abstract

The many-to-one mapping from representations in the speech articulatory space to acoustic space renders the associated acoustic-to-articulatory inverse mapping non-unique. Among various techniques, imposing smoothness constraints on the articulator trajectories is one of the common approaches to handle the non-uniqueness in the acoustic-to-articulatory inversion problem. This is because, articulators typically move smoothly during speech production. A standard smoothness constraint is to minimize the energy of the difference of the articulatory position sequence so that the articulator trajectory is smooth and low-pass in nature. Such a fixed definition of smoothness is not always realistic or adequate for all articulators because different articulators have different degrees of smoothness. In this paper, an optimization formulation is proposed for the inversion problem, which includes a generalized smoothness criterion. Under such generalized smoothness settings, the smoothness parameter can be chosen depending on the specific articulator in a data-driven fashion. In addition, this formulation allows estimation of articulatory positions recursively over time without any loss in performance. Experiments with the MOCHA TIMIT database show that the estimated articulator trajectories obtained using such a generalized smoothness criterion have lower RMS error and higher correlation with the actual measured trajectories compared to those obtained using a fixed smoothness constraint.

Entities: Disease

Mesh：

Year: 2010 PMID： 20968386 PMCID： PMC2981125 DOI： 10.1121/1.3455847

Source DB: PubMed Journal: J Acoust Soc Am ISSN： 0001-4966 Impact factor: 1.840

2 in total

1. Accurate recovery of articulator positions from acoustics: new conclusions based on human data.

Authors: J Hogden; A Lofqvist; V Gracco; I Zlokarnik; P Rubin; E Saltzman
Journal: J Acoust Soc Am Date: 1996-09 Impact factor: 1.840

2. Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique.

Authors: B S Atal; J J Chang; M V Mathews; J W Tukey
Journal: J Acoust Soc Am Date: 1978-05 Impact factor: 1.840

2 in total

8 in total

1. Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion.

Authors: Prasanta Kumar Ghosh; Shrikanth Narayanan
Journal: J Acoust Soc Am Date: 2011-10 Impact factor: 1.840

2. Spatio-temporal articulatory movement primitives during speech production: extraction, interpretation, and validation.

Authors: Vikram Ramanarayanan; Louis Goldstein; Shrikanth S Narayanan
Journal: J Acoust Soc Am Date: 2013-08 Impact factor: 1.840

3. On smoothing articulatory trajectories obtained from Gaussian mixture model based acoustic-to-articulatory inversion.

Authors: Prasanta K Ghosh; Shrikanth S Narayanan
Journal: J Acoust Soc Am Date: 2013-08 Impact factor: 1.840

4. Statistical Methods for Estimation of Direct and Differential Kinematics of the Vocal Tract.

Authors: Adam Lammert; Louis Goldstein; Shrikanth Narayanan; Khalil Iskarous
Journal: Speech Commun Date: 2013-01 Impact factor: 2.017

5. Speaker verification based on the fusion of speech acoustics and inverted articulatory signals.

Authors: Ming Li; Jangwon Kim; Adam Lammert; Prasanta Kumar Ghosh; Vikram Ramanarayanan; Shrikanth Narayanan
Journal: Comput Speech Lang Date: 2015-05-22 Impact factor: 1.899

6. Directly data-derived articulatory gesture-like representations retain discriminatory information about phone categories.

Authors: Vikram Ramanarayanan; Maarten Van Segbroeck; Shrikanth S Narayanan
Journal: Comput Speech Lang Date: 2015-03-21 Impact factor: 1.899

7. Advances in real-time magnetic resonance imaging of the vocal tract for speech science and technology research.

Authors: Asterios Toutios; Shrikanth S Narayanan
Journal: APSIPA Trans Signal Inf Process Date: 2016-03-31

8. Articulation constrained learning with application to speech emotion recognition.

Authors: Mohit Shah; Ming Tu; Visar Berisha; Chaitali Chakrabarti; Andreas Spanias
Journal: EURASIP J Audio Speech Music Process Date: 2019-08-20

8 in total