Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Silent Speech Recognition as an Alternative Communication Device for Persons with Laryngectomy.

Literature DB >> 29552581

Silent Speech Recognition as an Alternative Communication Device for Persons with Laryngectomy.

Geoffrey S Meltzner¹, James T Heaton², Yunbin Deng³, Gianluca De Luca⁴, Serge H Roy⁴, Joshua C Kline⁴.

Abstract

Each year thousands of individuals require surgical removal of their larynx (voice box) due to trauma or disease, and thereby require an alternative voice source or assistive device to verbally communicate. Although natural voice is lost after laryngectomy, most muscles controlling speech articulation remain intact. Surface electromyographic (sEMG) activity of speech musculature can be recorded from the neck and face, and used for automatic speech recognition to provide speech-to-text or synthesized speech as an alternative means of communication. This is true even when speech is mouthed or spoken in a silent (subvocal) manner, making it an appropriate communication platform after laryngectomy. In this study, 8 individuals at least 6 months after total laryngectomy were recorded using 8 sEMG sensors on their face (4) and neck (4) while reading phrases constructed from a 2,500-word vocabulary. A unique set of phrases were used for training phoneme-based recognition models for each of the 39 commonly used phonemes in English, and the remaining phrases were used for testing word recognition of the models based on phoneme identification from running speech. Word error rates were on average 10.3% for the full 8-sensor set (averaging 9.5% for the top 4 participants), and 13.6% when reducing the sensor set to 4 locations per individual (n=7). This study provides a compelling proof-of-concept for sEMG-based alaryngeal speech recognition, with the strong potential to further improve recognition performance.

Entities: Chemical Disease Species

Keywords: Alaryngeal Speech; Assistive technology; Augmentative and Alternative Communication; Automatic Speech Recognition; EMG; Subvocal Speech Recognition; electromyography

Year: 2017 PMID： 29552581 PMCID： PMC5851476 DOI： 10.1109/TASLP.2017.2740000

Source DB: PubMed Journal: IEEE/ACM Trans Audio Speech Lang Process

12 in total

1. Signal acquisition and processing techniques for sEMG based silent speech recognition.

Authors: Geoffrey S Meltzner; Glen Colby; Yunbin Deng; James T Heaton
Journal: Conf Proc IEEE Eng Med Biol Soc Date: 2011

2. Impact of aberrant acoustic properties on the perception of sound quality in electrolarynx speech.

Authors: Geoffrey S Meltzner; Robert E Hillman
Journal: J Speech Lang Hear Res Date: 2005-08 Impact factor: 2.297

3. Experiments with fast Fourier transform, linear predictive and cepstral coefficients in dysarthric speech recognition algorithms using hidden Markov Model.

Authors: Prasad D Polur; Gerald E Miller
Journal: IEEE Trans Neural Syst Rehabil Eng Date: 2005-12 Impact factor: 3.802

4. Multi-stream HMM for EMG-based speech recognition.

Authors: H Manabe; Z Zhang
Journal: Conf Proc IEEE Eng Med Biol Soc Date: 2004

5. Pattern learning with deep neural networks in EMG-based speech recognition.

Authors: Michael Wand; Tanja Schultz
Journal: Conf Proc IEEE Eng Med Biol Soc Date: 2014

6. Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique.

Authors: B S Atal; J J Chang; M V Mathews; J W Tukey
Journal: J Acoust Soc Am Date: 1978-05 Impact factor: 1.840

7. On the use of hidden Markov modelling for recognition of dysarthric speech.

Authors: J R Deller; D Hsu; L J Ferrier
Journal: Comput Methods Programs Biomed Date: 1991-06 Impact factor: 5.428

8. Myo-electric signals to augment speech recognition.

Authors: A D Chan; K Englehart; B Hudgins; D F Lovely
Journal: Med Biol Eng Comput Date: 2001-07 Impact factor: 3.079

9. Tracheostomy cannulas and voice prosthesis.

Authors: Burkhard Kramp; Steffen Dommerich
Journal: GMS Curr Top Otorhinolaryngol Head Neck Surg Date: 2011-03-10

10. Towards Contactless Silent Speech Recognition Based on Detection of Active and Visible Articulators Using IR-UWB Radar.

Authors: Young Hoon Shin; Jiwon Seo
Journal: Sensors (Basel) Date: 2016-10-29 Impact factor: 3.576

7 in total

1. Development of sEMG sensors and algorithms for silent speech recognition.

Authors: Geoffrey S Meltzner; James T Heaton; Yunbin Deng; Gianluca De Luca; Serge H Roy; Joshua C Kline
Journal: J Neural Eng Date: 2018-06-01 Impact factor: 5.379

2. Prediction of larynx function using multichannel surface EMG classification.

Authors: Johnny McNulty; Kylie de Jager; Henry T Lancashire; James Graveston; Martin Birchall; Anne Vanhoestenberghe
Journal: IEEE Trans Med Robot Bionics Date: 2021-10-26

3. 'I love you': the first phrase detected from dreams.

Authors: Michael Raduga
Journal: Sleep Sci Date: 2022 Apr-Jun

4. Surface Electromyography-Based Recognition, Synthesis, and Perception of Prosodic Subvocal Speech.

Authors: Jennifer M Vojtech; Michael D Chan; Bhawna Shiwani; Serge H Roy; James T Heaton; Geoffrey S Meltzner; Paola Contessa; Gianluca De Luca; Rupal Patel; Joshua C Kline
Journal: J Speech Lang Hear Res Date: 2021-05-12 Impact factor: 2.297

5. Sequence-to-Sequence Voice Reconstruction for Silent Speech in a Tonal Language.

Authors: Huiyan Li; Haohong Lin; You Wang; Hengyang Wang; Ming Zhang; Han Gao; Qing Ai; Zhiyuan Luo; Guang Li
Journal: Brain Sci Date: 2022-06-23

6. A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient.

Authors: Jinghan Wu; Yakun Zhang; Liang Xie; Ye Yan; Xu Zhang; Shuang Liu; Xingwei An; Erwei Yin; Dong Ming
Journal: Front Neurorobot Date: 2022-09-02 Impact factor: 3.493

7. Silent speech command word recognition using stepped frequency continuous wave radar.

Authors: Christoph Wagner; Petr Schaffer; Pouriya Amini Digehsara; Michael Bärhold; Dirk Plettemeier; Peter Birkholz
Journal: Sci Rep Date: 2022-03-09 Impact factor: 4.379

7 in total