Literature DB >> 8035358

Automatic speech recognition to aid the hearing impaired: prospects for the automatic generation of cued speech.

R M Uchanski1, L A Delhorne, A K Dix, L D Braida, C M Reed, N I Durlach.   

Abstract

Although great strides have been made in the development of automatic speech recognition (ASR) systems, the communication performance achievable with the output of current real-time speech recognition systems would be extremely poor relative to normal speech reception. An alternate application of ASR technology to aid the hearing impaired would derive cues from the acoustical speech signal that could be used to supplement speechreading. We report a study of highly trained receivers of Manual Cued Speech that indicates that nearly perfect reception of everyday connected speech materials can be achieved at near normal speaking rates. To understand the accuracy that might be achieved with automatically generated cues, we measured how well trained spectrogram readers and an automatic speech recognizer could assign cues for various cue systems. We then applied a recently developed model of audiovisual integration to these recognizer measurements and data on human recognition of consonant and vowel segments via speechreading to evaluate the benefit to speechreading provided by such cues. Our analysis suggests that with cues derived from current recognizers, consonant and vowel segments can be received with accuracies in excess of 80%. This level of performance is roughly equivalent to the segment reception accuracy required to account for observed levels of Manual Cued Speech reception. Current recognizers provide maximal benefit by generating only a relatively small number (three to five) of cue groups, and may not provide substantially greater aid to speechreading than simpler aids that do not incorporate discrete phonetic recognition. To provide guidance for the development of improved automatic cueing systems, we describe techniques for determining optimum cue groups for a given recognizer and speechreader, and estimate the cueing performance that might be achieved if the performance of current recognizers were improved.

Entities:  

Mesh:

Year:  1994        PMID: 8035358

Source DB:  PubMed          Journal:  J Rehabil Res Dev        ISSN: 0748-7711


  5 in total

1.  A Method for Transcribing the Manual Components of Cued Speech.

Authors:  Jean C Krause; Katherine A Pelley-Lopez; Morgan P Tessler
Journal:  Speech Commun       Date:  2011-03-01       Impact factor: 2.017

Review 2.  Cued speech for enhancing speech perception and first language development of children with cochlear implants.

Authors:  Jacqueline Leybaert; Carol J LaSasso
Journal:  Trends Amplif       Date:  2010-06

3.  Processing of speech signals for physical and sensory disabilities.

Authors:  H Levitt
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

4.  Cued Speech Transliteration: Effects of Speaking Rate and Lag Time on Production Accuracy.

Authors:  Jean C Krause; Morgan P Tessler
Journal:  J Deaf Stud Deaf Educ       Date:  2016-05-24

5.  Cued Speech Transliteration: Effects of Accuracy and Lag Time on Message Intelligibility.

Authors:  Jean C Krause; Katherine A Lopez
Journal:  J Deaf Stud Deaf Educ       Date:  2017-10-01
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.