Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 An Optimal Set of Flesh Points on Tongue and Lips for Speech-Movement Classification.

Literature DB >> 26564030

An Optimal Set of Flesh Points on Tongue and Lips for Speech-Movement Classification.

Jun Wang, Ashok Samal, Panying Rong, Jordan R Green.

Abstract

PURPOSE: The authors sought to determine an optimal set of flesh points on the tongue and lips for classifying speech movements.
METHOD: The authors used electromagnetic articulographs (Carstens AG500 and NDI Wave) to record tongue and lip movements from 13 healthy talkers who articulated 8 vowels, 11 consonants, a phonetically balanced set of words, and a set of short phrases during the recording. We used a machine-learning classifier (support-vector machine) to classify the speech stimuli on the basis of articulatory movements. We then compared classification accuracies of the flesh-point combinations to determine an optimal set of sensors.
RESULTS: When data from the 4 sensors (T1: the vicinity between the tongue tip and tongue blade; T4: the tongue-body back; UL: the upper lip; and LL: the lower lip) were combined, phoneme and word classifications were most accurate and were comparable with the full set (including T2: the tongue-body front; and T3: the tongue-body front).
CONCLUSION: We identified a 4-sensor set--that is, T1, T4, UL, LL--that yielded a classification accuracy (91%-95%) equivalent to that using all 6 sensors. These findings provide an empirical basis for selecting sensors and their locations for scientific and emerging clinical applications that incorporate articulatory movements.

Mesh：

Year: 2016 PMID： 26564030 PMCID： PMC4867928 DOI： 10.1044/2015_JSLHR-S-14-0112

Source DB: PubMed Journal: J Speech Lang Hear Res ISSN： 1092-4388 Impact factor: 2.297

24 in total

1. Vocal tract representation in the recognition of cerebral palsied speech.

Authors: Frank Rudzicz; Graeme Hirst; Pascal van Lieshout
Journal: J Speech Lang Hear Res Date: 2012-01-23 Impact factor: 2.297

2. Estimating mandibular motion based on chin surface targets during speech.

Authors: Jordan R Green; Erin M Wilson; Yu-Tsai Wang; Christopher A Moore
Journal: J Speech Lang Hear Res Date: 2007-08 Impact factor: 2.297

3. The distinctness of speakers' productions of vowel contrasts is related to their discrimination of the contrasts.

Authors: Joseph S Perkell; Frank H Guenther; Harlan Lane; Melanie L Matthies; Ellen Stockmann; Mark Tiede; Majid Zandipour
Journal: J Acoust Soc Am Date: 2004-10 Impact factor: 1.840

4. Accuracy of the NDI wave speech research system.

Authors: Jeffrey J Berry
Journal: J Speech Lang Hear Res Date: 2011-04-15 Impact factor: 2.297

5. Development of a (silent) speech recognition system for patients following laryngectomy.

Authors: M J Fagan; S R Ell; J M Gilbert; E Sarrazin; P M Chapman
Journal: Med Eng Phys Date: 2007-06-27 Impact factor: 2.242

Review 6. Speech production knowledge in automatic speech recognition.

Authors: Simon King; Joe Frankel; Karen Livescu; Erik McDermott; Korin Richmond; Mirjam Wester
Journal: J Acoust Soc Am Date: 2007-02 Impact factor: 1.840

7. Classifications of vocalic segments from articulatory kinematics: healthy controls and speakers with dysarthria.

Authors: Yana Yunusova; Gary G Weismer; Mary J Lindstrom
Journal: J Speech Lang Hear Res Date: 2011-06-06 Impact factor: 2.297

8. A protocol for comprehensive assessment of bulbar dysfunction in amyotrophic lateral sclerosis (ALS).

Authors: Yana Yunusova; Jordan R Green; Jun Wang; Gary Pattee; Lorne Zinman
Journal: J Vis Exp Date: 2011-02-21 Impact factor: 1.355

9. Articulatory movements during vowels in speakers with dysarthria and healthy controls.

Authors: Yana Yunusova; Gary Weismer; John R Westbury; Mary J Lindstrom
Journal: J Speech Lang Hear Res Date: 2008-06 Impact factor: 2.297

10. Accuracy assessment for AG500, electromagnetic articulograph.

Authors: Yana Yunusova; Jordan R Green; Antje Mefferd
Journal: J Speech Lang Hear Res Date: 2008-08-22 Impact factor: 2.297

15 in total

1. Automatic prediction of intelligible speaking rate for individuals with ALS from speech acoustic and articulatory samples.

Authors: Jun Wang; Prasanna V Kothalkar; Myungjong Kim; Andrea Bandini; Beiming Cao; Yana Yunusova; Thomas F Campbell; Daragh Heitzman; Jordan R Green
Journal: Int J Speech Lang Pathol Date: 2018-11-08 Impact factor: 2.484

2. Differentiating post-cancer from healthy tongue muscle coordination patterns during speech using deep learning.

Authors: Jonghye Woo; Fangxu Xing; Jerry L Prince; Maureen Stone; Jordan R Green; Tessa Goldsmith; Timothy G Reese; Van J Wedeen; Georges El Fakhri
Journal: J Acoust Soc Am Date: 2019-05 Impact factor: 1.840

3. Kinematic Analysis of Speech Sound Sequencing Errors Induced by Delayed Auditory Feedback.

Authors: Gabriel J Cler; Jackson C Lee; Talia Mittelman; Cara E Stepp; Jason W Bohland
Journal: J Speech Lang Hear Res Date: 2017-06-22 Impact factor: 2.297

4. Recognizing Whispered Speech Produced by an Individual with Surgically Reconstructed Larynx Using Articulatory Movement Data.

Authors: Beiming Cao; Myungjong Kim; Ted Mau; Jun Wang
Journal: Workshop Speech Lang Process Assist Technol Date: 2016-09

5. Predicting Intelligible Speaking Rate in Individuals with Amyotrophic Lateral Sclerosis from a Small Number of Speech Acoustic and Articulatory Samples.

Authors: Jun Wang; Prasanna V Kothalkar; Myungjong Kim; Yana Yunusova; Thomas F Campbell; Daragh Heitzman; Jordan R Green
Journal: Workshop Speech Lang Process Assist Technol Date: 2016-09

6. Tongue- and Jaw-Specific Contributions to Acoustic Vowel Contrast Changes in the Diphthong /ai/ in Response to Slow, Loud, and Clear Speech.

Authors: Antje S Mefferd
Journal: J Speech Lang Hear Res Date: 2017-11-09 Impact factor: 2.297