Literature DB >> 23326297

Retrieving Tract Variables From Acoustics: A Comparison of Different Machine Learning Strategies.

Vikramjit Mitra1, Hosung Nam, Carol Y Espy-Wilson, Elliot Saltzman, Louis Goldstein.   

Abstract

Many different studies have claimed that articulatory information can be used to improve the performance of automatic speech recognition systems. Unfortunately, such articulatory information is not readily available in typical speaker-listener situations. Consequently, such information has to be estimated from the acoustic signal in a process which is usually termed "speech-inversion." This study aims to propose and compare various machine learning strategies for speech inversion: Trajectory mixture density networks (TMDNs), feedforward artificial neural networks (FF-ANN), support vector regression (SVR), autoregressive artificial neural network (AR-ANN), and distal supervised learning (DSL). Further, using a database generated by the Haskins Laboratories speech production model, we test the claim that information regarding constrictions produced by the distinct organs of the vocal tract (vocal tract variables) is superior to flesh-point information (articulatory pellet trajectories) for the inversion process.

Entities:  

Year:  2010        PMID: 23326297      PMCID: PMC3544523          DOI: 10.1109/JSTSP.2010.2076013

Source DB:  PubMed          Journal:  IEEE J Sel Top Signal Process        ISSN: 1932-4553            Impact factor:   6.856


  17 in total

1.  An overlapping-feature-based phonological model incorporating linguistic constraints: applications to speech recognition.

Authors:  Jiping Sun; Li Deng
Journal:  J Acoust Soc Am       Date:  2002-02       Impact factor: 1.840

2.  Toward a model for lexical access based on acoustic landmarks and distinctive features.

Authors:  Kenneth N Stevens
Journal:  J Acoust Soc Am       Date:  2002-04       Impact factor: 1.840

3.  Inferring articulation and recognizing gestures from acoustics with a neural network trained on x-ray microbeam data.

Authors:  G Papcun; J Hochberg; T R Thomas; F Laroche; J Zacks; S Levy
Journal:  J Acoust Soc Am       Date:  1992-08       Impact factor: 1.840

Review 4.  Articulatory phonology: an overview.

Authors:  C P Browman; L Goldstein
Journal:  Phonetica       Date:  1992       Impact factor: 1.759

5.  Accurate recovery of articulator positions from acoustics: new conclusions based on human data.

Authors:  J Hogden; A Lofqvist; V Gracco; I Zlokarnik; P Rubin; E Saltzman
Journal:  J Acoust Soc Am       Date:  1996-09       Impact factor: 1.840

Review 6.  Coordination and coarticulation in speech production.

Authors:  C A Fowler; E Saltzman
Journal:  Lang Speech       Date:  1993 Apr-Sep       Impact factor: 1.500

7.  Timing constraints and coarticulation: alveolo-palatals and sequences of alveolar + [j] in Catalan.

Authors:  D Recasens
Journal:  Phonetica       Date:  1984       Impact factor: 1.759

8.  Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique.

Authors:  B S Atal; J J Chang; M V Mathews; J W Tukey
Journal:  J Acoust Soc Am       Date:  1978-05       Impact factor: 1.840

9.  Generating vocal tract shapes from formant frequencies.

Authors:  P Ladefoged; R Harshman; L Goldstein; L Rice
Journal:  J Acoust Soc Am       Date:  1978-10       Impact factor: 1.840

10.  Coarticulation in VCV utterances: spectrographic measurements.

Authors:  S E Ohman
Journal:  J Acoust Soc Am       Date:  1966-01       Impact factor: 1.840

View more
  4 in total

1.  A procedure for estimating gestural scores from speech acoustics.

Authors:  Hosung Nam; Vikramjit Mitra; Mark Tiede; Mark Hasegawa-Johnson; Carol Espy-Wilson; Elliot Saltzman; Louis Goldstein
Journal:  J Acoust Soc Am       Date:  2012-12       Impact factor: 1.840

2.  Vocal generalization depends on gesture identity and sequence.

Authors:  Lukas A Hoffmann; Samuel J Sober
Journal:  J Neurosci       Date:  2014-04-16       Impact factor: 6.167

3.  Statistical Methods for Estimation of Direct and Differential Kinematics of the Vocal Tract.

Authors:  Adam Lammert; Louis Goldstein; Shrikanth Narayanan; Khalil Iskarous
Journal:  Speech Commun       Date:  2013-01       Impact factor: 2.017

4.  Voice Feature Selection to Improve Performance of Machine Learning Models for Voice Production Inversion.

Authors:  Zhaoyan Zhang
Journal:  J Voice       Date:  2021-04-10       Impact factor: 2.300

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.