Literature DB >> 34764159

Speech Categorization Reveals the Role of Early-Stage Temporal-Coherence Processing in Auditory Scene Analysis.

Vibha Viswanathan1, Barbara G Shinn-Cunningham2, Michael G Heinz3,4.   

Abstract

Temporal coherence of sound fluctuations across spectral channels is thought to aid auditory grouping and scene segregation. Although prior studies on the neural bases of temporal-coherence processing focused mostly on cortical contributions, neurophysiological evidence suggests that temporal-coherence-based scene analysis may start as early as the cochlear nucleus (i.e., the first auditory region supporting cross-channel processing over a wide frequency range). Accordingly, we hypothesized that aspects of temporal-coherence processing that could be realized in early auditory areas may shape speech understanding in noise. We then explored whether physiologically plausible computational models could account for results from a behavioral experiment that measured consonant categorization in different masking conditions. We tested whether within-channel masking of target-speech modulations predicted consonant confusions across the different conditions and whether predictions were improved by adding across-channel temporal-coherence processing mirroring the computations known to exist in the cochlear nucleus. Consonant confusions provide a rich characterization of error patterns in speech categorization, and are thus crucial for rigorously testing models of speech perception; however, to the best of our knowledge, they have not been used in prior studies of scene analysis. We find that within-channel modulation masking can reasonably account for category confusions, but that it fails when temporal fine structure cues are unavailable. However, the addition of across-channel temporal-coherence processing significantly improves confusion predictions across all tested conditions. Our results suggest that temporal-coherence processing strongly shapes speech understanding in noise and that physiological computations that exist early along the auditory pathway may contribute to this process.SIGNIFICANCE STATEMENT Temporal coherence of sound fluctuations across distinct frequency channels is thought to be important for auditory scene analysis. Prior studies on the neural bases of temporal-coherence processing focused mostly on cortical contributions, and it was unknown whether speech understanding in noise may be shaped by across-channel processing that exists in earlier auditory areas. Using physiologically plausible computational modeling to predict consonant confusions across different listening conditions, we find that across-channel temporal coherence contributes significantly to scene analysis and speech perception and that such processing may arise in the auditory pathway as early as the brainstem. By virtue of providing a richer characterization of error patterns not obtainable with just intelligibility scores, consonant confusions yield unique insight into scene analysis mechanisms.
Copyright © 2022 the authors.

Entities:  

Keywords:  cochlear nucleus; comodulation masking release; computational modeling; consonant confusions; cross-channel processing; wideband inhibition

Mesh:

Year:  2021        PMID: 34764159      PMCID: PMC8802934          DOI: 10.1523/JNEUROSCI.1610-21.2021

Source DB:  PubMed          Journal:  J Neurosci        ISSN: 0270-6474            Impact factor:   6.709


  44 in total

1.  Characterizing frequency selectivity for envelope fluctuations.

Authors:  S D Ewert; T Dau
Journal:  J Acoust Soc Am       Date:  2000-09       Impact factor: 1.840

2.  Correct tonotopic representation is necessary for complex pitch perception.

Authors:  Andrew J Oxenham; Joshua G W Bernstein; Hector Penagos
Journal:  Proc Natl Acad Sci U S A       Date:  2004-01-12       Impact factor: 11.205

3.  Predicted effects of sensorineural hearing loss on across-fiber envelope coding in the auditory nerve.

Authors:  Jayaganesh Swaminathan; Michael G Heinz
Journal:  J Acoust Soc Am       Date:  2011-06       Impact factor: 1.840

4.  Modulation masking: effects of modulation frequency, depth, and phase.

Authors:  S P Bacon; D W Grantham
Journal:  J Acoust Soc Am       Date:  1989-06       Impact factor: 1.840

5.  Electrical signs of selective attention in the human brain.

Authors:  S A Hillyard; R F Hink; V L Schwent; T W Picton
Journal:  Science       Date:  1973-10-12       Impact factor: 47.728

6.  Revised estimates of human cochlear tuning from otoacoustic and behavioral measurements.

Authors:  Christopher A Shera; John J Guinan; Andrew J Oxenham
Journal:  Proc Natl Acad Sci U S A       Date:  2002-02-26       Impact factor: 11.205

7.  Masking release for low- and high-pass-filtered speech in the presence of noise and single-talker interference.

Authors:  Andrew J Oxenham; Andrea M Simonson
Journal:  J Acoust Soc Am       Date:  2009-01       Impact factor: 1.840

8.  Temporal coherence in the perceptual organization and cortical representation of auditory scenes.

Authors:  Mounya Elhilali; Ling Ma; Christophe Micheyl; Andrew J Oxenham; Shihab A Shamma
Journal:  Neuron       Date:  2009-01-29       Impact factor: 17.173

9.  How early aging and environment interact in everyday listening: from brainstem to behavior through modeling.

Authors:  Barbara Shinn-Cunningham; Dorea R Ruggles; Hari Bharadwaj
Journal:  Adv Exp Med Biol       Date:  2013       Impact factor: 2.622

10.  Segregation of complex acoustic scenes based on temporal coherence.

Authors:  Sundeep Teki; Maria Chait; Sukhbinder Kumar; Shihab Shamma; Timothy D Griffiths
Journal:  Elife       Date:  2013-07-23       Impact factor: 8.140

View more
  1 in total

1.  FORUM: Remote testing for psychological and physiological acoustics.

Authors:  Z Ellen Peng; Sebastian Waz; Emily Buss; Yi Shen; Virginia Richards; Hari Bharadwaj; G Christopher Stecker; Jordan A Beim; Adam K Bosen; Meredith D Braza; Anna C Diedesch; Claire M Dorey; Andrew R Dykstra; Frederick J Gallun; Raymond L Goldsworthy; Lincoln Gray; Eric C Hoover; Antje Ihlefeld; Thomas Koelewijn; Judy G Kopun; Juraj Mesik; Daniel E Shub; Jonathan H Venezia
Journal:  J Acoust Soc Am       Date:  2022-05       Impact factor: 2.482

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.