Literature DB >> 33003849

A two-stage deep learning algorithm for talker-independent speaker separation in reverberant conditions.

Masood Delfarah1, Yuzhou Liu1, DeLiang Wang1.   

Abstract

Speaker separation is a special case of speech separation, in which the mixture signal comprises two or more speakers. Many talker-independent speaker separation methods have been introduced in recent years to address this problem in anechoic conditions. To consider more realistic environments, this paper investigates talker-independent speaker separation in reverberant conditions. To effectively deal with speaker separation and speech dereverberation, extending the deep computational auditory scene analysis (CASA) approach to a two-stage system is proposed. In this method, reverberant utterances are first separated and separated utterances are then dereverberated. The proposed two-stage deep CASA system significantly outperforms a baseline one-stage deep CASA method in real reverberant conditions. The proposed system has superior separation performance at the frame level and higher accuracy in assigning separated frames to individual speakers. The proposed system successfully generalizes to an unseen speech corpus and exhibits similar performance to a talker-dependent system.

Mesh:

Year:  2020        PMID: 33003849      PMCID: PMC7473777          DOI: 10.1121/10.0001779

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  13 in total

1.  Informational and energetic masking effects in the perception of two simultaneous talkers.

Authors:  D S Brungart
Journal:  J Acoust Soc Am       Date:  2001-03       Impact factor: 1.840

2.  Effects of reverberation on perceptual segregation of competing voices.

Authors:  John F Culling; Kathryn I Hodder; Chaz Yee Toh
Journal:  J Acoust Soc Am       Date:  2003-11       Impact factor: 1.840

3.  Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing.

Authors:  J M Festen; R Plomp
Journal:  J Acoust Soc Am       Date:  1990-10       Impact factor: 1.840

4.  The masking of speech.

Authors:  G A MILLER
Journal:  Psychol Bull       Date:  1947-03       Impact factor: 17.737

5.  An algorithm to increase intelligibility for hearing-impaired listeners in the presence of a competing talker.

Authors:  Eric W Healy; Masood Delfarah; Jordan L Vasko; Brittney L Carter; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2017-06       Impact factor: 1.840

6.  A deep learning algorithm to increase intelligibility for hearing-impaired listeners in the presence of a competing talker and reverberation.

Authors:  Eric W Healy; Masood Delfarah; Eric M Johnson; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2019-03       Impact factor: 1.840

7.  Hearing loss, aging, and speech perception in reverberation and noise.

Authors:  K S Helfer; L A Wilber
Journal:  J Speech Hear Res       Date:  1990-03

8.  On Training Targets for Supervised Speech Separation.

Authors:  Yuxuan Wang; Arun Narayanan; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2014-12

9.  A Deep Ensemble Learning Method for Monaural Speech Separation.

Authors:  Xiao-Lei Zhang; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2016-03-01

10.  Complex Ratio Masking for Monaural Speech Separation.

Authors:  Donald S Williamson; Yuxuan Wang; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2015-12-23
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.