Literature DB >> 26564030

An Optimal Set of Flesh Points on Tongue and Lips for Speech-Movement Classification.

Jun Wang, Ashok Samal, Panying Rong, Jordan R Green.   

Abstract

PURPOSE: The authors sought to determine an optimal set of flesh points on the tongue and lips for classifying speech movements.
METHOD: The authors used electromagnetic articulographs (Carstens AG500 and NDI Wave) to record tongue and lip movements from 13 healthy talkers who articulated 8 vowels, 11 consonants, a phonetically balanced set of words, and a set of short phrases during the recording. We used a machine-learning classifier (support-vector machine) to classify the speech stimuli on the basis of articulatory movements. We then compared classification accuracies of the flesh-point combinations to determine an optimal set of sensors.
RESULTS: When data from the 4 sensors (T1: the vicinity between the tongue tip and tongue blade; T4: the tongue-body back; UL: the upper lip; and LL: the lower lip) were combined, phoneme and word classifications were most accurate and were comparable with the full set (including T2: the tongue-body front; and T3: the tongue-body front).
CONCLUSION: We identified a 4-sensor set--that is, T1, T4, UL, LL--that yielded a classification accuracy (91%-95%) equivalent to that using all 6 sensors. These findings provide an empirical basis for selecting sensors and their locations for scientific and emerging clinical applications that incorporate articulatory movements.

Mesh:

Year:  2016        PMID: 26564030      PMCID: PMC4867928          DOI: 10.1044/2015_JSLHR-S-14-0112

Source DB:  PubMed          Journal:  J Speech Lang Hear Res        ISSN: 1092-4388            Impact factor:   2.297


  24 in total

1.  Vocal tract representation in the recognition of cerebral palsied speech.

Authors:  Frank Rudzicz; Graeme Hirst; Pascal van Lieshout
Journal:  J Speech Lang Hear Res       Date:  2012-01-23       Impact factor: 2.297

2.  Estimating mandibular motion based on chin surface targets during speech.

Authors:  Jordan R Green; Erin M Wilson; Yu-Tsai Wang; Christopher A Moore
Journal:  J Speech Lang Hear Res       Date:  2007-08       Impact factor: 2.297

3.  The distinctness of speakers' productions of vowel contrasts is related to their discrimination of the contrasts.

Authors:  Joseph S Perkell; Frank H Guenther; Harlan Lane; Melanie L Matthies; Ellen Stockmann; Mark Tiede; Majid Zandipour
Journal:  J Acoust Soc Am       Date:  2004-10       Impact factor: 1.840

4.  Accuracy of the NDI wave speech research system.

Authors:  Jeffrey J Berry
Journal:  J Speech Lang Hear Res       Date:  2011-04-15       Impact factor: 2.297

5.  Development of a (silent) speech recognition system for patients following laryngectomy.

Authors:  M J Fagan; S R Ell; J M Gilbert; E Sarrazin; P M Chapman
Journal:  Med Eng Phys       Date:  2007-06-27       Impact factor: 2.242

Review 6.  Speech production knowledge in automatic speech recognition.

Authors:  Simon King; Joe Frankel; Karen Livescu; Erik McDermott; Korin Richmond; Mirjam Wester
Journal:  J Acoust Soc Am       Date:  2007-02       Impact factor: 1.840

7.  Classifications of vocalic segments from articulatory kinematics: healthy controls and speakers with dysarthria.

Authors:  Yana Yunusova; Gary G Weismer; Mary J Lindstrom
Journal:  J Speech Lang Hear Res       Date:  2011-06-06       Impact factor: 2.297

8.  A protocol for comprehensive assessment of bulbar dysfunction in amyotrophic lateral sclerosis (ALS).

Authors:  Yana Yunusova; Jordan R Green; Jun Wang; Gary Pattee; Lorne Zinman
Journal:  J Vis Exp       Date:  2011-02-21       Impact factor: 1.355

9.  Articulatory movements during vowels in speakers with dysarthria and healthy controls.

Authors:  Yana Yunusova; Gary Weismer; John R Westbury; Mary J Lindstrom
Journal:  J Speech Lang Hear Res       Date:  2008-06       Impact factor: 2.297

10.  Accuracy assessment for AG500, electromagnetic articulograph.

Authors:  Yana Yunusova; Jordan R Green; Antje Mefferd
Journal:  J Speech Lang Hear Res       Date:  2008-08-22       Impact factor: 2.297

View more
  15 in total

1.  Automatic prediction of intelligible speaking rate for individuals with ALS from speech acoustic and articulatory samples.

Authors:  Jun Wang; Prasanna V Kothalkar; Myungjong Kim; Andrea Bandini; Beiming Cao; Yana Yunusova; Thomas F Campbell; Daragh Heitzman; Jordan R Green
Journal:  Int J Speech Lang Pathol       Date:  2018-11-08       Impact factor: 2.484

2.  Differentiating post-cancer from healthy tongue muscle coordination patterns during speech using deep learning.

Authors:  Jonghye Woo; Fangxu Xing; Jerry L Prince; Maureen Stone; Jordan R Green; Tessa Goldsmith; Timothy G Reese; Van J Wedeen; Georges El Fakhri
Journal:  J Acoust Soc Am       Date:  2019-05       Impact factor: 1.840

3.  Kinematic Analysis of Speech Sound Sequencing Errors Induced by Delayed Auditory Feedback.

Authors:  Gabriel J Cler; Jackson C Lee; Talia Mittelman; Cara E Stepp; Jason W Bohland
Journal:  J Speech Lang Hear Res       Date:  2017-06-22       Impact factor: 2.297

4.  Recognizing Whispered Speech Produced by an Individual with Surgically Reconstructed Larynx Using Articulatory Movement Data.

Authors:  Beiming Cao; Myungjong Kim; Ted Mau; Jun Wang
Journal:  Workshop Speech Lang Process Assist Technol       Date:  2016-09

5.  Predicting Intelligible Speaking Rate in Individuals with Amyotrophic Lateral Sclerosis from a Small Number of Speech Acoustic and Articulatory Samples.

Authors:  Jun Wang; Prasanna V Kothalkar; Myungjong Kim; Yana Yunusova; Thomas F Campbell; Daragh Heitzman; Jordan R Green
Journal:  Workshop Speech Lang Process Assist Technol       Date:  2016-09

6.  Tongue- and Jaw-Specific Contributions to Acoustic Vowel Contrast Changes in the Diphthong /ai/ in Response to Slow, Loud, and Clear Speech.

Authors:  Antje S Mefferd
Journal:  J Speech Lang Hear Res       Date:  2017-11-09       Impact factor: 2.297

7.  Multimodal Speech Capture System for Speech Rehabilitation and Learning.

Authors:  Nordine Sebkhi; Dhyey Desai; Mohammad Islam; Jun Lu; Kimberly Wilson; Maysam Ghovanloo
Journal:  IEEE Trans Biomed Eng       Date:  2017-01-18       Impact factor: 4.538

8.  Initial Observations of Lingual Movement Characteristics of Children With Cerebral Palsy.

Authors:  Ignatius S B Nip; Carlos R Arias; Kristen Morita; Hannah Richardson
Journal:  J Speech Lang Hear Res       Date:  2017-06-22       Impact factor: 2.297

9.  Speaker-Independent Silent Speech Recognition from Flesh-Point Articulatory Movements Using an LSTM Neural Network.

Authors:  Myungjong Kim; Beiming Cao; Ted Mau; Jun Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2017-11-23

10.  Evaluation of a Wireless Tongue Tracking System on the Identification of Phoneme Landmarks.

Authors:  Nordine Sebkhi; Nina Santus; Arpan Bhavsar; Shayan Siahpoushan; Omer T Inan
Journal:  IEEE Trans Biomed Eng       Date:  2021-03-22       Impact factor: 4.538

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.