Literature DB >> 17225430

Binaural segregation in multisource reverberant environments.

Nicoleta Roman1, Soundararajan Srinivasan, DeLiang Wang.   

Abstract

In a natural environment, speech signals are degraded by both reverberation and concurrent noise sources. While human listening is robust under these conditions using only two ears, current two-microphone algorithms perform poorly. The psychological process of figure-ground segregation suggests that the target signal is perceived as a foreground while the remaining stimuli are perceived as a background. Accordingly, the goal is to estimate an ideal time-frequency (T-F) binary mask, which selects the target if it is stronger than the interference in a local T-F unit. In this paper, a binaural segregation system that extracts the reverberant target signal from multisource reverberant mixtures by utilizing only the location information of target source is proposed. The proposed system combines target cancellation through adaptive filtering and a binary decision rule to estimate the ideal T-F binary mask. The main observation in this work is that the target attenuation in a T-F unit resulting from adaptive filtering is correlated with the relative strength of target to mixture. A comprehensive evaluation shows that the proposed system results in large SNR gains. In addition, comparisons using SNR as well as automatic speech recognition measures show that this system outperforms standard two-microphone beamforming approaches and a recent binaural processor.

Entities:  

Mesh:

Year:  2006        PMID: 17225430     DOI: 10.1121/1.2355480

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  4 in total

Review 1.  Time-frequency masking for speech separation and its potential for hearing aid design.

Authors: 
Journal:  Trends Amplif       Date:  2008-10-30

2.  Comparison of a target-equalization-cancellation approach and a localization approach to source separation.

Authors:  Jing Mi; Matti Groll; H Steven Colburn
Journal:  J Acoust Soc Am       Date:  2017-11       Impact factor: 1.840

3.  Deep Learning Based Target Cancellation for Speech Dereverberation.

Authors:  Zhong-Qiu Wang; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2020-02-28

4.  A Binaural Grouping Model for Predicting Speech Intelligibility in Multitalker Environments.

Authors:  Jing Mi; H Steven Colburn
Journal:  Trends Hear       Date:  2016-10-03       Impact factor: 3.293

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.