Literature DB >> 23927134

Spatio-temporal articulatory movement primitives during speech production: extraction, interpretation, and validation.

Vikram Ramanarayanan1, Louis Goldstein, Shrikanth S Narayanan.   

Abstract

This paper presents a computational approach to derive interpretable movement primitives from speech articulation data. It puts forth a convolutive Nonnegative Matrix Factorization algorithm with sparseness constraints (cNMFsc) to decompose a given data matrix into a set of spatiotemporal basis sequences and an activation matrix. The algorithm optimizes a cost function that trades off the mismatch between the proposed model and the input data against the number of primitives that are active at any given instant. The method is applied to both measured articulatory data obtained through electromagnetic articulography as well as synthetic data generated using an articulatory synthesizer. The paper then describes how to evaluate the algorithm performance quantitatively and further performs a qualitative assessment of the algorithm's ability to recover compositional structure from data. This is done using pseudo ground-truth primitives generated by the articulatory synthesizer based on an Articulatory Phonology frame-work [Browman and Goldstein (1995). "Dynamics and articulatory phonology," in Mind as motion: Explorations in the dynamics of cognition, edited by R. F. Port and T.van Gelder (MIT Press, Cambridge, MA), pp. 175-194]. The results suggest that the proposed algorithm extracts movement primitives from human speech production data that are linguistically interpretable. Such a framework might aid the understanding of longstanding issues in speech production such as motor control and coarticulation.

Entities:  

Mesh:

Year:  2013        PMID: 23927134      PMCID: PMC3745549          DOI: 10.1121/1.4812765

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  22 in total

1.  Computational neuroscience. Think positive to find parts.

Authors:  B W Mel
Journal:  Nature       Date:  1999-10-21       Impact factor: 49.962

2.  When practice leads to co-articulation: the evolution of geometrically defined movement primitives.

Authors:  Ronen Sosnik; Bjoern Hauptmann; Avi Karni; Tamar Flash
Journal:  Exp Brain Res       Date:  2004-02-26       Impact factor: 1.972

Review 3.  Sparse coding of sensory inputs.

Authors:  Bruno A Olshausen; David J Field
Journal:  Curr Opin Neurobiol       Date:  2004-08       Impact factor: 6.627

Review 4.  Articulatory phonology: an overview.

Authors:  C P Browman; L Goldstein
Journal:  Phonetica       Date:  1992       Impact factor: 1.759

Review 5.  Coordination.

Authors:  M T Turvey
Journal:  Am Psychol       Date:  1990-08

6.  A neural basis for motor primitives in the spinal cord.

Authors:  Corey B Hart; Simon F Giszter
Journal:  J Neurosci       Date:  2010-01-27       Impact factor: 6.167

7.  Sparse coding with an overcomplete basis set: a strategy employed by V1?

Authors:  B A Olshausen; D J Field
Journal:  Vision Res       Date:  1997-12       Impact factor: 1.886

8.  Coarticulation of jaw movements in speech production: is context sensitivity in speech kinematics centrally planned?

Authors:  D J Ostry; P L Gribble; V L Gracco
Journal:  J Neurosci       Date:  1996-02-15       Impact factor: 6.167

9.  A theoretical model of phase transitions in human hand movements.

Authors:  H Haken; J A Kelso; H Bunz
Journal:  Biol Cybern       Date:  1985       Impact factor: 2.086

10.  Two functionally different synergies during arm reaching movements involving the trunk.

Authors:  S Ma; A G Feldman
Journal:  J Neurophysiol       Date:  1995-05       Impact factor: 2.714

View more
  14 in total

1.  Differentiating post-cancer from healthy tongue muscle coordination patterns during speech using deep learning.

Authors:  Jonghye Woo; Fangxu Xing; Jerry L Prince; Maureen Stone; Jordan R Green; Tessa Goldsmith; Timothy G Reese; Van J Wedeen; Georges El Fakhri
Journal:  J Acoust Soc Am       Date:  2019-05       Impact factor: 1.840

2.  Quantal biomechanical effects in speech postures of the lips.

Authors:  Bryan Gick; Connor Mayer; Chenhao Chiu; Erik Widing; François Roewer-Després; Sidney Fels; Ian Stavness
Journal:  J Neurophysiol       Date:  2020-07-29       Impact factor: 2.714

3.  A Sparse Non-Negative Matrix Factorization Framework for Identifying Functional Units of Tongue Behavior From MRI.

Authors:  Jerry L Prince; Maureen Stone; Arnold D Gomez; Jordan R Green; Christopher J Hartnick; Thomas J Brady; Timothy G Reese; Van J Wedeen; Georges El Fakhri
Journal:  IEEE Trans Med Imaging       Date:  2018-09-18       Impact factor: 10.048

4.  Unsupervised discovery of temporal sequences in high-dimensional datasets, with applications to neuroscience.

Authors:  Emily L Mackevicius; Andrew H Bahle; Alex H Williams; Shijie Gu; Natalia I Denisenko; Mark S Goldman; Michale S Fee
Journal:  Elife       Date:  2019-02-05       Impact factor: 8.140

5.  MUPET-Mouse Ultrasonic Profile ExTraction: A Signal Processing Tool for Rapid and Unsupervised Analysis of Ultrasonic Vocalizations.

Authors:  Maarten Van Segbroeck; Allison T Knoll; Pat Levitt; Shrikanth Narayanan
Journal:  Neuron       Date:  2017-05-03       Impact factor: 17.173

6.  Directly data-derived articulatory gesture-like representations retain discriminatory information about phone categories.

Authors:  Vikram Ramanarayanan; Maarten Van Segbroeck; Shrikanth S Narayanan
Journal:  Comput Speech Lang       Date:  2015-03-21       Impact factor: 1.899

7.  Determining functional units of tongue motion via graph-regularized sparse non-negative matrix factorization.

Authors:  Jonghye Woo; Fangxu Xing; Junghoon Lee; Maureen Stone; Jerry L Prince
Journal:  Med Image Comput Comput Assist Interv       Date:  2014

8.  Magnetic resonance imaging based anatomical assessment of tongue impairment due to amyotrophic lateral sclerosis: A preliminary study.

Authors:  Euna Lee; Fangxu Xing; Sung Ahn; Timothy G Reese; Ruopeng Wang; Jordan R Green; Nazem Atassi; Van J Wedeen; Georges El Fakhri; Jonghye Woo
Journal:  J Acoust Soc Am       Date:  2018-04       Impact factor: 1.840

9.  A deep joint sparse non-negative matrix factorization framework for identifying the common and subject-specific functional units of tongue motion during speech.

Authors:  Jonghye Woo; Fangxu Xing; Jerry L Prince; Maureen Stone; Arnold D Gomez; Timothy G Reese; Van J Wedeen; Georges El Fakhri
Journal:  Med Image Anal       Date:  2021-06-12       Impact factor: 13.828

Review 10.  Computer-Implemented Articulatory Models for Speech Production: A Review.

Authors:  Bernd J Kröger
Journal:  Front Robot AI       Date:  2022-03-08
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.