Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A CAUSAL DEEP LEARNING FRAMEWORK FOR CLASSIFYING PHONEMES IN COCHLEAR IMPLANTS.

Literature DB >> 34512195

A CAUSAL DEEP LEARNING FRAMEWORK FOR CLASSIFYING PHONEMES IN COCHLEAR IMPLANTS.

Kevin Chu¹, Leslie Collins¹, Boyla Mainsah¹.

Abstract

Speech intelligibility in cochlear implant (CI) users degrades considerably in listening environments with reverberation and noise. Previous research in automatic speech recognition (ASR) has shown that phoneme-based speech enhancement algorithms improve ASR system performance in reverberant environments as compared to a global model. However, phoneme-specific speech processing has not yet been implemented in CIs. In this paper, we propose a causal deep learning framework for classifying phonemes using features extracted at the time-frequency resolution of a CI processor. We trained and tested long short-term memory networks to classify phonemes and manner of articulation in anechoic and reverberant conditions. The results showed that CI-inspired features provide slightly higher levels of performance than traditional ASR features. To the best of our knowledge, this study is the first to provide a classification framework with the potential to categorize phonetic units in real-time in a CI.

Entities: Chemical

Keywords: cochlear implants; manner of articulation; phoneme classification; reverberation; speech enhancement

Year: 2021 PMID： 34512195 PMCID： PMC8425961 DOI： 10.1109/icassp39728.2021.9413986

Source DB: PubMed Journal: Proc IEEE Int Conf Acoust Speech Signal Process ISSN： 1520-6149

Keyword Cloud
References

10 in total

1. Speech perception as a function of electrical stimulation rate: using the Nucleus 24 cochlear implant system.

Authors: A E Vandali; L A Whitford; K L Plant; G M Clark
Journal: Ear Hear Date: 2000-12 Impact factor: 3.570

2. Framewise phoneme classification with bidirectional LSTM and other neural network architectures.

Authors: Alex Graves; Jürgen Schmidhuber
Journal: Neural Netw Date: 2005 Jun-Jul

3. A channel-selection criterion for suppressing reverberation in cochlear implants.

Authors: Kostas Kokkinakis; Oldooz Hazrati; Philipos C Loizou
Journal: J Acoust Soc Am Date: 2011-05 Impact factor: 1.840

4. Long short-term memory.

Authors: S Hochreiter; J Schmidhuber
Journal: Neural Comput Date: 1997-11-15 Impact factor: 2.026

5. USING AUTOMATIC SPEECH RECOGNITION AND SPEECH SYNTHESIS TO IMPROVE THE INTELLIGIBILITY OF COCHLEAR IMPLANT USERS IN REVERBERANT LISTENING ENVIRONMENTS.

Authors: Kevin Chu; Leslie Collins; Boyla Mainsah
Journal: Proc IEEE Int Conf Acoust Speech Signal Process Date: 2020-05-14

6. A "rationalized" arcsine transform.

Authors: G A Studebaker
Journal: J Speech Hear Res Date: 1985-09

7. Tackling the combined effects of reverberation and masking noise using ideal channel selection.

Authors: Oldooz Hazrati; Philipos C Loizou
Journal: J Speech Lang Hear Res Date: 2012-01-09 Impact factor: 2.297

Review 8. Trends in cochlear implants.

Authors: Fan-Gang Zeng
Journal: Trends Amplif Date: 2004

9. Audiovisual asynchrony detection and speech perception in hearing-impaired listeners with cochlear implants: a preliminary analysis.

Authors: Marcia J Hay-McCutcheon; David B Pisoni; Kristopher K Hunt
Journal: Int J Audiol Date: 2009 Impact factor: 2.117

10. Using recurrent neural networks to improve the perception of speech in non-stationary noise by people with cochlear implants.

Authors: Tobias Goehring; Mahmoud Keshavarzi; Robert P Carlyon; Brian C J Moore
Journal: J Acoust Soc Am Date: 2019-07 Impact factor: 1.840

10 in total