Literature DB >> 15658718

Analysis of speech-based Speech Transmission Index methods with implications for nonlinear operations.

Ray L Goldsworthy1, Julie E Greenberg.   

Abstract

The Speech Transmission Index (STI) is a physical metric that is well correlated with the intelligibility of speech degraded by additive noise and reverberation. The traditional STI uses modulated noise as a probe signal and is valid for assessing degradations that result from linear operations on the speech signal. Researchers have attempted to extend the STI to predict the intelligibility of nonlinearly processed speech by proposing variations that use speech as a probe signal. This work considers four previously proposed speech-based STI methods and four novel methods, studied under conditions of additive noise, reverberation, and two nonlinear operations (envelope thresholding and spectral subtraction). Analyzing intermediate metrics in the STI calculation reveals why some methods fail for nonlinear operations. Results indicate that none of the previously proposed methods is adequate for all of the conditions considered, while four proposed methods produce qualitatively reasonable results and warrant further study. The discussion considers the relevance of this work to predicting the intelligibility of cochlear-implant processed speech.

Mesh:

Year:  2004        PMID: 15658718     DOI: 10.1121/1.1804628

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  31 in total

1.  Analysis of a simplified normalized covariance measure based on binary weighting functions for predicting the intelligibility of noise-suppressed speech.

Authors:  Fei Chen; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2010-12       Impact factor: 1.840

2.  Comparing the information conveyed by envelope modulation for speech intelligibility, speech quality, and music quality.

Authors:  James M Kates; Kathryn H Arehart
Journal:  J Acoust Soc Am       Date:  2015-10       Impact factor: 1.840

3.  Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions.

Authors:  Jianfen Ma; Yi Hu; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2009-05       Impact factor: 1.840

4.  Contributions of cochlea-scaled entropy and consonant-vowel boundaries to prediction of speech intelligibility in noise.

Authors:  Fei Chen; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2012-05       Impact factor: 1.840

5.  Objective speech intelligibility measurement for cochlear implant users in complex listening environments.

Authors:  João F Santos; Stefano Cosentino; Oldooz Hazrati; Philipos C Loizou; Tiago H Falk
Journal:  Speech Commun       Date:  2013-09-01       Impact factor: 2.017

6.  Comparison of a short-time speech-based intelligibility metric to the speech transmission index and intelligibility data.

Authors:  Karen L Payton; Mona Shrestha
Journal:  J Acoust Soc Am       Date:  2013-11       Impact factor: 1.840

7.  Effects of source-to-listener distance and masking on perception of cochlear implant processed speech in reverberant rooms.

Authors:  Nathaniel A Whitmal; Sarah F Poissant
Journal:  J Acoust Soc Am       Date:  2009-11       Impact factor: 1.840

8.  Extending the articulation index to account for non-linear distortions introduced by noise-suppression algorithms.

Authors:  Philipos C Loizou; Jianfen Ma
Journal:  J Acoust Soc Am       Date:  2011-08       Impact factor: 1.840

9.  Predicting the intelligibility of vocoded and wideband Mandarin Chinese.

Authors:  Fei Chen; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2011-05       Impact factor: 1.840

10.  Contribution of consonant landmarks to speech recognition in simulated acoustic-electric hearing.

Authors:  Fei Chen; Philipos C Loizou
Journal:  Ear Hear       Date:  2010-04       Impact factor: 3.570

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.