An intrusive method for estimating speech intelligibility from noisy and distorted signals.

Nursadul Mamun, Muhammad S A Zilany, John H L Hansen, Evelyn E Davies-Venn.

Abstract

An objective metric that predicts speech intelligibility under different types of noise and distortion would be desirable in voice communication. To date, most studies of speech intelligibility metrics have focused on predicting the effects of individual noise or distortion mechanisms. This study proposes an objective metric, the spectrogram orthogonal polynomial measure (SOPM), that attempts to predict speech intelligibility for people with normal hearing under adverse conditions. The SOPM metric is developed by extracting features from the spectrogram using Krawtchouk moments. The metric's performance is evaluated for several types of noise (steady-state and fluctuating), distortion (peak clipping, center clipping, and phase jitter), ideal time-frequency segregation, and reverberation, in both quiet and noisy environments. The proposed metric achieves high correlation (0.97-0.996) with subjective scores from normal-hearing subjects under various conditions.
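To make the feature-extraction step concrete, the following is a minimal sketch of computing Krawtchouk moments of a spectrogram patch. It is not the SOPM metric itself: the abstract does not specify how the moments are turned into an intelligibility score. The function names, the 8 x 8 patch size, the parameter p = 0.5, the Hann-windowed spectrogram, and the 440 Hz test tone are all illustrative assumptions. The polynomials follow the standard weighted Krawtchouk definition, evaluated directly from the terminating hypergeometric series, which is adequate only for small orders.

```python
import numpy as np
from math import comb, factorial


def pochhammer(a, k):
    """Rising factorial (a)_k = a (a+1) ... (a+k-1), with (a)_0 = 1."""
    out = 1.0
    for j in range(k):
        out *= a + j
    return out


def weighted_krawtchouk(N, p=0.5):
    """Rows are weighted Krawtchouk polynomials of order n = 0..N on x = 0..N."""
    K = np.zeros((N + 1, N + 1))
    for n in range(N + 1):
        # Squared norm rho(n; p, N) of the unweighted polynomial (always positive).
        rho = (-1) ** n * ((1 - p) / p) ** n * factorial(n) / pochhammer(-N, n)
        for x in range(N + 1):
            # Terminating hypergeometric series 2F1(-n, -x; -N; 1/p).
            series = sum(
                pochhammer(-n, k) * pochhammer(-x, k)
                / (pochhammer(-N, k) * factorial(k)) * (1 / p) ** k
                for k in range(min(n, x) + 1)
            )
            w = comb(N, x) * p ** x * (1 - p) ** (N - x)  # binomial weight
            K[n, x] = series * np.sqrt(w / rho)
    return K


def krawtchouk_moments(image, p_rows=0.5, p_cols=0.5):
    """Project a 2-D array onto the weighted Krawtchouk basis along both axes."""
    rows, cols = image.shape
    K_r = weighted_krawtchouk(rows - 1, p_rows)
    K_c = weighted_krawtchouk(cols - 1, p_cols)
    return K_r @ image @ K_c.T


def magnitude_spectrogram(signal, frame_len, hop):
    """Hann-windowed magnitude spectrogram, shape (frequency bins, frames)."""
    window = np.hanning(frame_len)
    starts = range(0, len(signal) - frame_len + 1, hop)
    frames = np.stack([signal[s:s + frame_len] * window for s in starts])
    return np.abs(np.fft.rfft(frames, axis=1)).T


# Hypothetical input: 0.1 s of a 440 Hz tone sampled at 8 kHz.
fs = 8000
t = np.arange(fs // 10) / fs
tone = np.sin(2 * np.pi * 440 * t)

spec = magnitude_spectrogram(tone, frame_len=64, hop=32)
patch = spec[:8, :8]               # small patch keeps the series numerically safe
Q = krawtchouk_moments(patch)      # 8 x 8 matrix of moments (the features)

# With the full basis, the patch can be recovered from its moments.
K = weighted_krawtchouk(7)
reconstructed = K.T @ Q @ K
```

Because the weighted basis is orthonormal, the moment matrix `Q` carries the same information as the patch; a feature set would typically keep only a subset of low-order moments, and which subset a given metric uses is a design choice not stated here.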

Year:  2021        PMID: 34598625      PMCID: PMC8637725          DOI: 10.1121/10.0005899

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   2.482

