Literature DB >> 34512195

A CAUSAL DEEP LEARNING FRAMEWORK FOR CLASSIFYING PHONEMES IN COCHLEAR IMPLANTS.

Kevin Chu1, Leslie Collins1, Boyla Mainsah1.   

Abstract

Speech intelligibility in cochlear implant (CI) users degrades considerably in listening environments with reverberation and noise. Previous research in automatic speech recognition (ASR) has shown that phoneme-based speech enhancement algorithms improve ASR system performance in reverberant environments as compared to a global model. However, phoneme-specific speech processing has not yet been implemented in CIs. In this paper, we propose a causal deep learning framework for classifying phonemes using features extracted at the time-frequency resolution of a CI processor. We trained and tested long short-term memory networks to classify phonemes and manner of articulation in anechoic and reverberant conditions. The results showed that CI-inspired features provide slightly higher levels of performance than traditional ASR features. To the best of our knowledge, this study is the first to provide a classification framework with the potential to categorize phonetic units in real-time in a CI.

Entities:  

Keywords:  cochlear implants; manner of articulation; phoneme classification; reverberation; speech enhancement

Year:  2021        PMID: 34512195      PMCID: PMC8425961          DOI: 10.1109/icassp39728.2021.9413986

Source DB:  PubMed          Journal:  Proc IEEE Int Conf Acoust Speech Signal Process        ISSN: 1520-6149


  10 in total

1.  Speech perception as a function of electrical stimulation rate: using the Nucleus 24 cochlear implant system.

Authors:  A E Vandali; L A Whitford; K L Plant; G M Clark
Journal:  Ear Hear       Date:  2000-12       Impact factor: 3.570

2.  Framewise phoneme classification with bidirectional LSTM and other neural network architectures.

Authors:  Alex Graves; Jürgen Schmidhuber
Journal:  Neural Netw       Date:  2005 Jun-Jul

3.  A channel-selection criterion for suppressing reverberation in cochlear implants.

Authors:  Kostas Kokkinakis; Oldooz Hazrati; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2011-05       Impact factor: 1.840

4.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

5.  USING AUTOMATIC SPEECH RECOGNITION AND SPEECH SYNTHESIS TO IMPROVE THE INTELLIGIBILITY OF COCHLEAR IMPLANT USERS IN REVERBERANT LISTENING ENVIRONMENTS.

Authors:  Kevin Chu; Leslie Collins; Boyla Mainsah
Journal:  Proc IEEE Int Conf Acoust Speech Signal Process       Date:  2020-05-14

6.  A "rationalized" arcsine transform.

Authors:  G A Studebaker
Journal:  J Speech Hear Res       Date:  1985-09

7.  Tackling the combined effects of reverberation and masking noise using ideal channel selection.

Authors:  Oldooz Hazrati; Philipos C Loizou
Journal:  J Speech Lang Hear Res       Date:  2012-01-09       Impact factor: 2.297

Review 8.  Trends in cochlear implants.

Authors:  Fan-Gang Zeng
Journal:  Trends Amplif       Date:  2004

9.  Audiovisual asynchrony detection and speech perception in hearing-impaired listeners with cochlear implants: a preliminary analysis.

Authors:  Marcia J Hay-McCutcheon; David B Pisoni; Kristopher K Hunt
Journal:  Int J Audiol       Date:  2009       Impact factor: 2.117

10.  Using recurrent neural networks to improve the perception of speech in non-stationary noise by people with cochlear implants.

Authors:  Tobias Goehring; Mahmoud Keshavarzi; Robert P Carlyon; Brian C J Moore
Journal:  J Acoust Soc Am       Date:  2019-07       Impact factor: 1.840

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.