Literature DB >> 29195469

Comparison of a target-equalization-cancellation approach and a localization approach to source separation.

Jing Mi1, Matti Groll1, H Steven Colburn1.   

Abstract

Interaural differences are important for listeners to be able to maintain focus on a sound source of interest in the presence of multiple sources. Because interaural differences are sound localization cues, most binaural-cue-based source separation algorithms attempt separation by localizing each time-frequency (T-F) unit to one of the possible source directions using interaural differences. By assembling T-F units that are assigned to one direction, the sound stream from that direction is enhanced. In this paper, a different type of binaural cue for source-separation purposes is proposed. For each T-F unit, the target-direction signal is cancelled by applying the equalization-cancellation (EC) operation to cancel the signal from the target direction; then, the dominance of the target in each T-F unit is determined by the effectiveness of the cancellation. Specifically, the energy change from cancellation is used as the criterion for target dominance for each T-F unit. Source-separation performance using the target-EC cue is compared with performance using localization cues. With simulated multi-talker and diffuse-babble interferers, the algorithm based on target-EC cues yields better source-separation performance than the algorithm based on localization cues, both in direct comparison with the ideal binary mask and in measured speech intelligibility for the separated target streams.

Mesh:

Year:  2017        PMID: 29195469      PMCID: PMC5685812          DOI: 10.1121/1.5009763

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  21 in total

1.  A speech corpus for multitalker communications research.

Authors:  R S Bolia; W T Nelson; M A Ericson; B D Simpson
Journal:  J Acoust Soc Am       Date:  2000-02       Impact factor: 1.840

2.  Informational and energetic masking effects in the perception of multiple simultaneous talkers.

Authors:  D S Brungart; B D Simpson; M A Ericson; K R Scott
Journal:  J Acoust Soc Am       Date:  2001-11       Impact factor: 1.840

3.  Speech segregation based on sound localization.

Authors:  Nicoleta Roman; DeLiang Wang; Guy J Brown
Journal:  J Acoust Soc Am       Date:  2003-10       Impact factor: 1.840

4.  Application of an extended equalization-cancellation model to speech intelligibility with spatially distributed maskers.

Authors:  Rui Wan; Nathaniel I Durlach; H Steven Colburn
Journal:  J Acoust Soc Am       Date:  2010-12       Impact factor: 1.840

5.  Prediction of speech intelligibility in spatial noise and reverberation for normal-hearing and hearing-impaired listeners.

Authors:  Rainer Beutelmann; Thomas Brand
Journal:  J Acoust Soc Am       Date:  2006-07       Impact factor: 1.840

6.  Speech intelligibility in reverberation with ideal binary masking: effects of early reflections and signal-to-noise ratio threshold.

Authors:  Nicoleta Roman; John Woodruff
Journal:  J Acoust Soc Am       Date:  2013-03       Impact factor: 1.840

7.  Effects of interaural time delays of noise stimuli on low-frequency cells in the cat's inferior colliculus. III. Evidence for cross-correlation.

Authors:  T C Yin; J C Chan; L H Carney
Journal:  J Neurophysiol       Date:  1987-09       Impact factor: 2.714

8.  An information-maximization approach to blind separation and blind deconvolution.

Authors:  A J Bell; T J Sejnowski
Journal:  Neural Comput       Date:  1995-11       Impact factor: 2.026

9.  Theory of binaural interaction based on auditory-nerve data. II. Detection of tones in noise.

Authors:  H S Colburn
Journal:  J Acoust Soc Am       Date:  1977-02       Impact factor: 1.840

10.  A Binaural Grouping Model for Predicting Speech Intelligibility in Multitalker Environments.

Authors:  Jing Mi; H Steven Colburn
Journal:  Trends Hear       Date:  2016-10-03       Impact factor: 3.293

View more
  1 in total

1.  Binaural Recordings in Natural Acoustic Environments: Estimates of Speech-Likeness and Interaural Parameters.

Authors:  S Theo Goverts; H Steven Colburn
Journal:  Trends Hear       Date:  2020 Jan-Dec       Impact factor: 3.293

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.