Literature DB >> 16119364

Modeling the articulatory space using a hypercube codebook for acoustic-to-articulatory inversion.

Slim Ouni1, Yves Laprie.   

Abstract

Acoustic-to-articulatory inversion is a difficult problem mainly because of the nonlinearity between the articulatory and acoustic spaces and the nonuniqueness of this relationship. To resolve this problem, we have developed an inversion method that provides a complete description of the possible solutions without excessive constraints and retrieves realistic temporal dynamics of the vocal tract shapes. We present an adaptive sampling algorithm to ensure that the acoustical resolution is almost independent of the region in the articulatory space under consideration. This leads to a codebook that is organized in the form of a hierarchy of hypercubes, and ensures that, within each hypercube, the articulatory-to-acoustic mapping can be approximated by means of a linear transform. The inversion procedure retrieves articulatory vectors corresponding to acoustic entries from the hypercube codebook. A nonlinear smoothing algorithm together with a regularization technique is then used to recover the best articulatory trajectory. The inversion ensures that inverse articulatory parameters generate original formant trajectories with high precision and a realistic sequence of the vocal tract shapes.

Mesh:

Year:  2005        PMID: 16119364     DOI: 10.1121/1.1921448

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  4 in total

1.  A study of acoustic-to-articulatory inversion of speech by analysis-by-synthesis using chain matrices and the Maeda articulatory model.

Authors:  Sankaran Panchapagesan; Abeer Alwan
Journal:  J Acoust Soc Am       Date:  2011-04       Impact factor: 1.840

2.  A modular architecture for articulatory synthesis from gestural specification.

Authors:  Rachel Alexander; Tanner Sorensen; Asterios Toutios; Shrikanth Narayanan
Journal:  J Acoust Soc Am       Date:  2019-12       Impact factor: 1.840

3.  A Neuromotor to Acoustical Jaw-Tongue Projection Model With Application in Parkinson's Disease Hypokinetic Dysarthria.

Authors:  Andrés Gómez; Pedro Gómez; Daniel Palacios; Victoria Rodellar; Víctor Nieto; Agustín Álvarez; Athanasios Tsanas
Journal:  Front Hum Neurosci       Date:  2021-03-15       Impact factor: 3.169

4.  Discrete constriction locations describe a comprehensive range of vocal tract shapes in the Maeda model.

Authors:  Jessica L Gaines; Kwang S Kim; Benjamin Parrell; Vikram Ramanarayanan; Srikantan S Nagarajan; John F Houde
Journal:  JASA Express Lett       Date:  2021-12-28
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.