Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 An evaluation of objective measures for intelligibility prediction of time-frequency weighted noisy speech.

Literature DB >> 22087929

An evaluation of objective measures for intelligibility prediction of time-frequency weighted noisy speech.

Cees H Taal¹, Richard C Hendriks, Richard Heusdens, Jesper Jensen.

Abstract

Existing objective speech-intelligibility measures are suitable for several types of degradation, however, it turns out that they are less appropriate in cases where noisy speech is processed by a time-frequency weighting. To this end, an extensive evaluation is presented of objective measure for intelligibility prediction of noisy speech processed with a technique called ideal time frequency (TF) segregation. In total 17 measures are evaluated, including four advanced speech-intelligibility measures (CSII, CSTI, NSEC, DAU), the advanced speech-quality measure (PESQ), and several frame-based measures (e.g., SSNR). Furthermore, several additional measures are proposed. The study comprised a total number of 168 different TF-weightings, including unprocessed noisy speech. Out of all measures, the proposed frame-based measure MCC gave the best results (ρ = 0.93). An additional experiment shows that the good performing measures in this study also show high correlation with the intelligibility of single-channel noise reduced speech.

Mesh：

Year: 2011 PMID： 22087929 DOI： 10.1121/1.3641373

Source DB: PubMed Journal: J Acoust Soc Am ISSN： 0001-4966 Impact factor: 1.840

Keyword Cloud
Cited

8 in total

1. Complex Ratio Masking for Monaural Speech Separation.

Authors: Donald S Williamson; Yuxuan Wang; DeLiang Wang
Journal: IEEE/ACM Trans Audio Speech Lang Process Date: 2015-12-23

2. Deep Learning Based Binaural Speech Separation in Reverberant Environments.

Authors: Xueliang Zhang; DeLiang Wang
Journal: IEEE/ACM Trans Audio Speech Lang Process Date: 2017-03-24

3. Comparing Binaural Pre-processing Strategies I: Instrumental Evaluation.

Authors: Regina M Baumgärtel; Martin Krawczyk-Becker; Daniel Marquardt; Christoph Völker; Hongmei Hu; Tobias Herzke; Graham Coleman; Kamil Adiloğlu; Stephan M A Ernst; Timo Gerkmann; Simon Doclo; Birger Kollmeier; Volker Hohmann; Mathias Dietz
Journal: Trends Hear Date: 2015-12-30 Impact factor: 3.293

4. A Binaural Steering Beamformer System for Enhancing a Moving Speech Source.

Authors: Kamil Adiloğlu; Hendrik Kayser; Regina M Baumgärtel; Sanja Rennebeck; Mathias Dietz; Volker Hohmann
Journal: Trends Hear Date: 2015-12-30 Impact factor: 3.293

5. Deep Neural Network Model of Hearing-Impaired Speech-in-Noise Perception.

Authors: Stephanie Haro; Christopher J Smalt; Gregory A Ciccarelli; Thomas F Quatieri
Journal: Front Neurosci Date: 2020-12-15 Impact factor: 4.677

6. VSUGAN unify voice style based on spectrogram and generated adversarial networks.

Authors: Tongjie Ouyang; Zhijun Yang; Huilong Xie; Tianlin Hu; Qingmei Liu
Journal: Sci Rep Date: 2021-12-21 Impact factor: 4.379

7. The effect of internet telephony and a cochlear implant accessory on mobile phone speech comprehension in cochlear implant users.

Authors: Markus E Huth; Regula L Boschung; Marco D Caversaccio; Wilhelm Wimmer; Mantokoudis Georgios
Journal: Eur Arch Otorhinolaryngol Date: 2022-04-24 Impact factor: 3.236

8. Objective Prediction of Hearing Aid Benefit Across Listener Groups Using Machine Learning: Speech Recognition Performance With Binaural Noise-Reduction Algorithms.

Authors: Marc R Schädler; Anna Warzybok; Birger Kollmeier
Journal: Trends Hear Date: 2018 Jan-Dec Impact factor: 3.293

8 in total