Masood Delfarah, DeLiang Wang.
Abstract
Speaker separation refers to the problem of separating speech signals from a mixture of simultaneous speakers. Previous studies are limited to addressing the speaker separation problem in anechoic conditions. This paper addresses the problem of talker-dependent speaker separation in reverberant conditions, which are characteristic of real-world environments. We employ recurrent neural networks with bidirectional long short-term memory (BLSTM) to separate and dereverberate the target speech signal. We propose two-stage networks to effectively deal with both speaker separation and speech dereverberation. In the two-stage model, the first stage separates and dereverberates two-talker mixtures and the second stage further enhances the separated target signal. We have extensively evaluated the two-stage architecture, and our empirical results demonstrate large improvements over unprocessed mixtures and clear performance gain over single-stage networks in a wide range of target-to-interferer ratios and reverberation times in simulated as well as recorded rooms. Moreover, we show that time-frequency masking yields better performance than spectral mapping for reverberant speaker separation.
Keywords: Cochannel speech separation; deep neural networks; speech dereverberation; two-stage network
Year: 2019 PMID: 33748321 PMCID: PMC7970708 DOI: 10.1109/taslp.2019.2934319
Source DB: PubMed Journal: IEEE/ACM Trans Audio Speech Lang Process
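The abstract contrasts time-frequency masking with spectral mapping as separation strategies. A minimal NumPy sketch of the two training targets on toy magnitude spectrograms, assuming an ideal-ratio-mask formulation (a common choice, not necessarily the exact target used in the paper):

```python
import numpy as np

rng = np.random.default_rng(0)
# Toy magnitude spectrograms (time x frequency); stand-ins for STFT features.
target = rng.random((100, 161))        # direct-path target speech
interference = rng.random((100, 161))  # interfering talker plus reverberation

# Observed mixture magnitude (toy additivity assumption).
mixture = target + interference

# Time-frequency masking target: an ideal ratio mask in [0, 1].
# (Assumed formulation for illustration only.)
irm = target / (target + interference + 1e-8)

# Applying the estimated mask to the mixture yields the target estimate.
masked_estimate = irm * mixture

# Spectral-mapping target: the network would instead regress the clean
# magnitude spectrogram directly from mixture features.
mapping_target = target
```

With additive toy magnitudes, `masked_estimate` recovers `target` almost exactly; in practice a BLSTM would predict the mask (or the clean spectrum) from mixture features, and the abstract reports that the masking formulation performs better under reverberation.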