Literature DB >> 29552581

Silent Speech Recognition as an Alternative Communication Device for Persons with Laryngectomy.

Geoffrey S Meltzner1, James T Heaton2, Yunbin Deng3, Gianluca De Luca4, Serge H Roy4, Joshua C Kline4.   

Abstract

Each year thousands of individuals require surgical removal of their larynx (voice box) due to trauma or disease, and thereby require an alternative voice source or assistive device to verbally communicate. Although natural voice is lost after laryngectomy, most muscles controlling speech articulation remain intact. Surface electromyographic (sEMG) activity of speech musculature can be recorded from the neck and face, and used for automatic speech recognition to provide speech-to-text or synthesized speech as an alternative means of communication. This is true even when speech is mouthed or spoken in a silent (subvocal) manner, making it an appropriate communication platform after laryngectomy. In this study, 8 individuals at least 6 months after total laryngectomy were recorded using 8 sEMG sensors on their face (4) and neck (4) while reading phrases constructed from a 2,500-word vocabulary. A unique set of phrases were used for training phoneme-based recognition models for each of the 39 commonly used phonemes in English, and the remaining phrases were used for testing word recognition of the models based on phoneme identification from running speech. Word error rates were on average 10.3% for the full 8-sensor set (averaging 9.5% for the top 4 participants), and 13.6% when reducing the sensor set to 4 locations per individual (n=7). This study provides a compelling proof-of-concept for sEMG-based alaryngeal speech recognition, with the strong potential to further improve recognition performance.

Entities:  

Keywords:  Alaryngeal Speech; Assistive technology; Augmentative and Alternative Communication; Automatic Speech Recognition; EMG; Subvocal Speech Recognition; electromyography

Year:  2017        PMID: 29552581      PMCID: PMC5851476          DOI: 10.1109/TASLP.2017.2740000

Source DB:  PubMed          Journal:  IEEE/ACM Trans Audio Speech Lang Process


  12 in total

1.  Signal acquisition and processing techniques for sEMG based silent speech recognition.

Authors:  Geoffrey S Meltzner; Glen Colby; Yunbin Deng; James T Heaton
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2011

2.  Impact of aberrant acoustic properties on the perception of sound quality in electrolarynx speech.

Authors:  Geoffrey S Meltzner; Robert E Hillman
Journal:  J Speech Lang Hear Res       Date:  2005-08       Impact factor: 2.297

3.  Experiments with fast Fourier transform, linear predictive and cepstral coefficients in dysarthric speech recognition algorithms using hidden Markov Model.

Authors:  Prasad D Polur; Gerald E Miller
Journal:  IEEE Trans Neural Syst Rehabil Eng       Date:  2005-12       Impact factor: 3.802

4.  Multi-stream HMM for EMG-based speech recognition.

Authors:  H Manabe; Z Zhang
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2004

5.  Pattern learning with deep neural networks in EMG-based speech recognition.

Authors:  Michael Wand; Tanja Schultz
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2014

6.  Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique.

Authors:  B S Atal; J J Chang; M V Mathews; J W Tukey
Journal:  J Acoust Soc Am       Date:  1978-05       Impact factor: 1.840

7.  On the use of hidden Markov modelling for recognition of dysarthric speech.

Authors:  J R Deller; D Hsu; L J Ferrier
Journal:  Comput Methods Programs Biomed       Date:  1991-06       Impact factor: 5.428

8.  Myo-electric signals to augment speech recognition.

Authors:  A D Chan; K Englehart; B Hudgins; D F Lovely
Journal:  Med Biol Eng Comput       Date:  2001-07       Impact factor: 3.079

9.  Tracheostomy cannulas and voice prosthesis.

Authors:  Burkhard Kramp; Steffen Dommerich
Journal:  GMS Curr Top Otorhinolaryngol Head Neck Surg       Date:  2011-03-10

10.  Towards Contactless Silent Speech Recognition Based on Detection of Active and Visible Articulators Using IR-UWB Radar.

Authors:  Young Hoon Shin; Jiwon Seo
Journal:  Sensors (Basel)       Date:  2016-10-29       Impact factor: 3.576

View more
  7 in total

1.  Development of sEMG sensors and algorithms for silent speech recognition.

Authors:  Geoffrey S Meltzner; James T Heaton; Yunbin Deng; Gianluca De Luca; Serge H Roy; Joshua C Kline
Journal:  J Neural Eng       Date:  2018-06-01       Impact factor: 5.379

2.  Prediction of larynx function using multichannel surface EMG classification.

Authors:  Johnny McNulty; Kylie de Jager; Henry T Lancashire; James Graveston; Martin Birchall; Anne Vanhoestenberghe
Journal:  IEEE Trans Med Robot Bionics       Date:  2021-10-26

3.  'I love you': the first phrase detected from dreams.

Authors:  Michael Raduga
Journal:  Sleep Sci       Date:  2022 Apr-Jun

4.  Surface Electromyography-Based Recognition, Synthesis, and Perception of Prosodic Subvocal Speech.

Authors:  Jennifer M Vojtech; Michael D Chan; Bhawna Shiwani; Serge H Roy; James T Heaton; Geoffrey S Meltzner; Paola Contessa; Gianluca De Luca; Rupal Patel; Joshua C Kline
Journal:  J Speech Lang Hear Res       Date:  2021-05-12       Impact factor: 2.297

5.  Sequence-to-Sequence Voice Reconstruction for Silent Speech in a Tonal Language.

Authors:  Huiyan Li; Haohong Lin; You Wang; Hengyang Wang; Ming Zhang; Han Gao; Qing Ai; Zhiyuan Luo; Guang Li
Journal:  Brain Sci       Date:  2022-06-23

6.  A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient.

Authors:  Jinghan Wu; Yakun Zhang; Liang Xie; Ye Yan; Xu Zhang; Shuang Liu; Xingwei An; Erwei Yin; Dong Ming
Journal:  Front Neurorobot       Date:  2022-09-02       Impact factor: 3.493

7.  Silent speech command word recognition using stepped frequency continuous wave radar.

Authors:  Christoph Wagner; Petr Schaffer; Pouriya Amini Digehsara; Michael Bärhold; Dirk Plettemeier; Peter Birkholz
Journal:  Sci Rep       Date:  2022-03-09       Impact factor: 4.379

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.