Literature DB >> 27794278

Measuring time-frequency importance functions of speech with bubble noise.

Michael I Mandel1, Sarah E Yoho2, Eric W Healy2.   

Abstract

Listeners can reliably perceive speech in noisy conditions, but it is not well understood what specific features of speech they use to do this. This paper introduces a data-driven framework to identify the time-frequency locations of these features. Using the same speech utterance mixed with many different noise instances, the framework is able to compute the importance of each time-frequency point in the utterance to its intelligibility. The mixtures have approximately the same global signal-to-noise ratio at each frequency, but very different recognition rates. The difference between these intelligible vs unintelligible mixtures is the alignment between the speech and spectro-temporally modulated noise, providing different combinations of "glimpses" of speech in each mixture. The current results reveal the locations of these important noise-robust phonetic features in a restricted set of syllables. Classification models trained to predict whether individual mixtures are intelligible based on the location of these glimpses can generalize to new conditions, successfully predicting the intelligibility of novel mixtures. They are able to generalize to novel noise instances, novel productions of the same word by the same talker, novel utterances of the same word spoken by different talkers, and, to some extent, novel consonants.

Mesh:

Year:  2016        PMID: 27794278      PMCID: PMC6910024          DOI: 10.1121/1.4964102

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  32 in total

1.  Evaluation of the noise reduction system in a commercial digital hearing aid.

Authors:  José L Alcántara; Brian C J Moore; Volker Kühnel; Stefan Launer
Journal:  Int J Audiol       Date:  2003-01       Impact factor: 2.117

2.  Functional magnetic resonance imaging (fMRI) "brain reading": detecting and classifying distributed patterns of fMRI activity in human visual cortex.

Authors:  David D Cox; Robert L Savoy
Journal:  Neuroimage       Date:  2003-06       Impact factor: 6.556

3.  Bubbles: a technique to reveal the use of information in recognition tasks.

Authors:  F Gosselin; P G Schyns
Journal:  Vision Res       Date:  2001-08       Impact factor: 1.886

4.  Perceptual compensation for effects of reverberation in speech identification.

Authors:  Anthony J Watkins
Journal:  J Acoust Soc Am       Date:  2005-07       Impact factor: 1.840

5.  A glimpsing model of speech perception in noise.

Authors:  Martin Cooke
Journal:  J Acoust Soc Am       Date:  2006-03       Impact factor: 1.840

6.  Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing.

Authors:  J M Festen; R Plomp
Journal:  J Acoust Soc Am       Date:  1990-10       Impact factor: 1.840

7.  Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions.

Authors:  Jianfen Ma; Yi Hu; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2009-05       Impact factor: 1.840

8.  Evaluation of the importance of time-frequency contributions to speech intelligibility in noise.

Authors:  Chengzhu Yu; Kamil K Wójcicki; Philipos C Loizou; John H L Hansen; Michael T Johnson
Journal:  J Acoust Soc Am       Date:  2014-05       Impact factor: 1.840

9.  On the number of auditory filter outputs needed to understand speech: further evidence for auditory channel independence.

Authors:  Frédéric Apoux; Eric W Healy
Journal:  Hear Res       Date:  2009-06-16       Impact factor: 3.208

Review 10.  Are individual differences in speech reception related to individual differences in cognitive ability? A survey of twenty experimental studies with normal and hearing-impaired adults.

Authors:  Michael A Akeroyd
Journal:  Int J Audiol       Date:  2008-11       Impact factor: 2.117

View more
  4 in total

1.  Identification of the Spectrotemporal Modulations That Support Speech Intelligibility in Hearing-Impaired and Normal-Hearing Listeners.

Authors:  Jonathan H Venezia; Allison-Graham Martin; Gregory Hickok; Virginia M Richards
Journal:  J Speech Lang Hear Res       Date:  2019-04-15       Impact factor: 2.297

2.  CLEESE: An open-source audio-transformation toolbox for data-driven experiments in speech and music cognition.

Authors:  Juan José Burred; Emmanuel Ponsot; Louise Goupil; Marco Liuni; Jean-Julien Aucouturier
Journal:  PLoS One       Date:  2019-04-04       Impact factor: 3.240

3.  High-Frequency Sensorineural Hearing Loss Alters Cue-Weighting Strategies for Discriminating Stop Consonants in Noise.

Authors:  Léo Varnet; Chloé Langlet; Christian Lorenzi; Diane S Lazard; Christophe Micheyl
Journal:  Trends Hear       Date:  2019 Jan-Dec       Impact factor: 3.293

4.  Neurocognitive dynamics of near-threshold voice signal detection and affective voice evaluation.

Authors:  Huw Swanborough; Matthias Staib; Sascha Frühholz
Journal:  Sci Adv       Date:  2020-12-11       Impact factor: 14.136

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.