Literature DB >> 23145627

A classification based approach to speech segregation.

Kun Han1, DeLiang Wang.   

Abstract

A key problem in computational auditory scene analysis (CASA) is monaural speech segregation, which has proven to be very challenging. For monaural mixtures, one can only utilize the intrinsic properties of speech or interference to segregate target speech from background noise. Ideal binary mask (IBM) has been proposed as a main goal of sound segregation in CASA and has led to substantial improvements of human speech intelligibility in noise. This study proposes a classification approach to estimate the IBM and employs support vector machines to classify time-frequency units as either target- or interference-dominant. A re-thresholding method is incorporated to improve classification results and maximize hit minus false alarm rates. An auditory segmentation stage is utilized to further improve estimated masks. Systematic evaluations show that the proposed approach produces high quality estimated IBMs and outperforms a recent system in terms of classification accuracy.

Entities:  

Mesh:

Year:  2012        PMID: 23145627     DOI: 10.1121/1.4754541

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  6 in total

1.  An algorithm to improve speech recognition in noise for hearing-impaired listeners.

Authors:  Eric W Healy; Sarah E Yoho; Yuxuan Wang; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2013-10       Impact factor: 1.840

2.  On Training Targets for Supervised Speech Separation.

Authors:  Yuxuan Wang; Arun Narayanan; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2014-12

3.  A Deep Ensemble Learning Method for Monaural Speech Separation.

Authors:  Xiao-Lei Zhang; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2016-03-01

Review 4.  Creating the feedback loop: closed-loop neurostimulation.

Authors:  Adam O Hebb; Jun Jason Zhang; Mohammad H Mahoor; Christos Tsiokos; Charles Matlack; Howard Jay Chizeck; Nader Pouratian
Journal:  Neurosurg Clin N Am       Date:  2013-10-23       Impact factor: 2.509

5.  A Competing Voices Test for Hearing-Impaired Listeners Applied to Spatial Separation and Ideal Time-Frequency Masks.

Authors:  Lars Bramsløw; Marianna Vatti; Rikke Rossing; Gaurav Naithani; Niels Henrik Pontoppidan
Journal:  Trends Hear       Date:  2019 Jan-Dec       Impact factor: 3.293

6.  The benefit of combining a deep neural network architecture with ideal ratio mask estimation in computational speech segregation to improve speech intelligibility.

Authors:  Thomas Bentsen; Tobias May; Abigail A Kressner; Torsten Dau
Journal:  PLoS One       Date:  2018-05-15       Impact factor: 3.240

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.