Literature DB >> 22087929

An evaluation of objective measures for intelligibility prediction of time-frequency weighted noisy speech.

Cees H Taal1, Richard C Hendriks, Richard Heusdens, Jesper Jensen.   

Abstract

Existing objective speech-intelligibility measures are suitable for several types of degradation, however, it turns out that they are less appropriate in cases where noisy speech is processed by a time-frequency weighting. To this end, an extensive evaluation is presented of objective measure for intelligibility prediction of noisy speech processed with a technique called ideal time frequency (TF) segregation. In total 17 measures are evaluated, including four advanced speech-intelligibility measures (CSII, CSTI, NSEC, DAU), the advanced speech-quality measure (PESQ), and several frame-based measures (e.g., SSNR). Furthermore, several additional measures are proposed. The study comprised a total number of 168 different TF-weightings, including unprocessed noisy speech. Out of all measures, the proposed frame-based measure MCC gave the best results (ρ = 0.93). An additional experiment shows that the good performing measures in this study also show high correlation with the intelligibility of single-channel noise reduced speech.

Mesh:

Year:  2011        PMID: 22087929     DOI: 10.1121/1.3641373

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  8 in total

1.  Complex Ratio Masking for Monaural Speech Separation.

Authors:  Donald S Williamson; Yuxuan Wang; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2015-12-23

2.  Deep Learning Based Binaural Speech Separation in Reverberant Environments.

Authors:  Xueliang Zhang; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2017-03-24

3.  Comparing Binaural Pre-processing Strategies I: Instrumental Evaluation.

Authors:  Regina M Baumgärtel; Martin Krawczyk-Becker; Daniel Marquardt; Christoph Völker; Hongmei Hu; Tobias Herzke; Graham Coleman; Kamil Adiloğlu; Stephan M A Ernst; Timo Gerkmann; Simon Doclo; Birger Kollmeier; Volker Hohmann; Mathias Dietz
Journal:  Trends Hear       Date:  2015-12-30       Impact factor: 3.293

4.  A Binaural Steering Beamformer System for Enhancing a Moving Speech Source.

Authors:  Kamil Adiloğlu; Hendrik Kayser; Regina M Baumgärtel; Sanja Rennebeck; Mathias Dietz; Volker Hohmann
Journal:  Trends Hear       Date:  2015-12-30       Impact factor: 3.293

5.  Deep Neural Network Model of Hearing-Impaired Speech-in-Noise Perception.

Authors:  Stephanie Haro; Christopher J Smalt; Gregory A Ciccarelli; Thomas F Quatieri
Journal:  Front Neurosci       Date:  2020-12-15       Impact factor: 4.677

6.  VSUGAN unify voice style based on spectrogram and generated adversarial networks.

Authors:  Tongjie Ouyang; Zhijun Yang; Huilong Xie; Tianlin Hu; Qingmei Liu
Journal:  Sci Rep       Date:  2021-12-21       Impact factor: 4.379

7.  The effect of internet telephony and a cochlear implant accessory on mobile phone speech comprehension in cochlear implant users.

Authors:  Markus E Huth; Regula L Boschung; Marco D Caversaccio; Wilhelm Wimmer; Mantokoudis Georgios
Journal:  Eur Arch Otorhinolaryngol       Date:  2022-04-24       Impact factor: 3.236

8.  Objective Prediction of Hearing Aid Benefit Across Listener Groups Using Machine Learning: Speech Recognition Performance With Binaural Noise-Reduction Algorithms.

Authors:  Marc R Schädler; Anna Warzybok; Birger Kollmeier
Journal:  Trends Hear       Date:  2018 Jan-Dec       Impact factor: 3.293

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.