| Literature DB >> 34512195 |
Kevin Chu1, Leslie Collins1, Boyla Mainsah1.
Abstract
Speech intelligibility in cochlear implant (CI) users degrades considerably in listening environments with reverberation and noise. Previous research in automatic speech recognition (ASR) has shown that phoneme-based speech enhancement algorithms improve ASR system performance in reverberant environments as compared to a global model. However, phoneme-specific speech processing has not yet been implemented in CIs. In this paper, we propose a causal deep learning framework for classifying phonemes using features extracted at the time-frequency resolution of a CI processor. We trained and tested long short-term memory networks to classify phonemes and manner of articulation in anechoic and reverberant conditions. The results showed that CI-inspired features provide slightly higher levels of performance than traditional ASR features. To the best of our knowledge, this study is the first to provide a classification framework with the potential to categorize phonetic units in real-time in a CI.Entities:
Keywords: cochlear implants; manner of articulation; phoneme classification; reverberation; speech enhancement
Year: 2021 PMID: 34512195 PMCID: PMC8425961 DOI: 10.1109/icassp39728.2021.9413986
Source DB: PubMed Journal: Proc IEEE Int Conf Acoust Speech Signal Process ISSN: 1520-6149