Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A classification based approach to speech segregation.

Literature DB >> 23145627

A classification based approach to speech segregation.

Abstract

A key problem in computational auditory scene analysis (CASA) is monaural speech segregation, which has proven to be very challenging. For monaural mixtures, one can only utilize the intrinsic properties of speech or interference to segregate target speech from background noise. Ideal binary mask (IBM) has been proposed as a main goal of sound segregation in CASA and has led to substantial improvements of human speech intelligibility in noise. This study proposes a classification approach to estimate the IBM and employs support vector machines to classify time-frequency units as either target- or interference-dominant. A re-thresholding method is incorporated to improve classification results and maximize hit minus false alarm rates. An auditory segmentation stage is utilized to further improve estimated masks. Systematic evaluations show that the proposed approach produces high quality estimated IBMs and outperforms a recent system in terms of classification accuracy.

Entities: Species

Mesh：

Year: 2012 PMID： 23145627 DOI： 10.1121/1.4754541

Source DB: PubMed Journal: J Acoust Soc Am ISSN： 0001-4966 Impact factor: 1.840

Keyword Cloud
Cited

6 in total

6. The benefit of combining a deep neural network architecture with ideal ratio mask estimation in computational speech segregation to improve speech intelligibility.

Authors: Thomas Bentsen; Tobias May; Abigail A Kressner; Torsten Dau
Journal: PLoS One Date: 2018-05-15 Impact factor: 3.240

6 in total

A classification based approach to speech segregation.

1. An algorithm to improve speech recognition in noise for hearing-impaired listeners.

2. On Training Targets for Supervised Speech Separation.

3. A Deep Ensemble Learning Method for Monaural Speech Separation.

Review 4. Creating the feedback loop: closed-loop neurostimulation.

5. A Competing Voices Test for Hearing-Impaired Listeners Applied to Spatial Separation and Ideal Time-Frequency Masks.

6. The benefit of combining a deep neural network architecture with ideal ratio mask estimation in computational speech segregation to improve speech intelligibility.