Literature DB >> 33178880

Causal Deep CASA for Monaural Talker-Independent Speaker Separation.

Yuzhou Liu1, DeLiang Wang2.   

Abstract

Talker-independent monaural speaker separation aims to separate concurrent speakers from a single-microphone recording. Inspired by human auditory scene analysis (ASA) mechanisms, a two-stage deep CASA approach has been proposed recently to address this problem, which achieves state-of-the-art results in separating mixtures of two or three speakers. A main limitation of deep CASA is that it is a non-causal system, while many speech processing applications, e.g., telecommunication and hearing prosthesis, require causal processing. In this study, we propose a causal version of deep CASA to address this limitation. First, we modify temporal connections, normalization and clustering algorithms in deep CASA so that no future information is used throughout the deep network. We then train a C-speaker (C ≥ 2) deep CASA system in a speaker-number-independent fashion, generalizable to speech mixtures with up to C speakers without the prior knowledge about the speaker number. Experimental results show that causal deep CASA achieves excellent speaker separation performance with known or unknown speaker numbers.

Entities:  

Keywords:  Monaural speaker separation; causal processing; deep CASA; talker-independent speaker separation

Year:  2020        PMID: 33178880      PMCID: PMC7654633          DOI: 10.1109/taslp.2020.3007779

Source DB:  PubMed          Journal:  IEEE/ACM Trans Audio Speech Lang Process


  1 in total

1.  Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation.

Authors:  Yi Luo; Nima Mesgarani
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2019-05-06
  1 in total
  1 in total

1.  A causal and talker-independent speaker separation/dereverberation deep learning algorithm: Cost associated with conversion to real-time capable operation.

Authors:  Eric W Healy; Hassan Taherian; Eric M Johnson; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2021-11       Impact factor: 1.840

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.