Literature DB >> 28362674

Human Frequency Following Responses to Vocoded Speech.

Saradha Ananthakrishnan1, Xin Luo, Ananthanarayan Krishnan.   

Abstract

OBJECTIVES: Vocoders offer an effective platform to simulate the effects of cochlear implant speech processing strategies in normal-hearing listeners. Several behavioral studies have examined the effects of varying spectral and temporal cues on vocoded speech perception; however, little is known about the neural indices of vocoded speech perception. Here, the scalp-recorded frequency following response (FFR) was used to study the effects of varying spectral and temporal cues on brainstem neural representation of specific acoustic cues, the temporal envelope periodicity related to fundamental frequency (F0) and temporal fine structure (TFS) related to formant and formant-related frequencies, as reflected in the phase-locked neural activity in response to vocoded speech.
DESIGN: In experiment 1, FFRs were measured in 12 normal-hearing, adult listeners in response to a steady state English back vowel /u/ presented in an unaltered, unprocessed condition and six sine-vocoder conditions with varying numbers of channels (1, 2, 4, 8, 16, and 32), while the temporal envelope cutoff frequency was fixed at 500 Hz. In experiment 2, FFRs were obtained from 14 normal-hearing, adult listeners in response to the same English vowel /u/, presented in an unprocessed condition and four vocoded conditions where both the temporal envelope cutoff frequency (50 versus 500 Hz) and carrier type (sine wave versus noise band) were varied separately with the number of channels fixed at 8. Fast Fourier Transform was applied to the time waveforms of FFR to analyze the strength of brainstem neural representation of temporal envelope periodicity (F0) and TFS-related peaks (formant structure).
RESULTS: Brainstem neural representation of both temporal envelope and TFS cues improved when the number of channels increased from 1 to 4, followed by a plateau with 8 and 16 channels, and a reduction in phase-locking strength with 32 channels. For the sine vocoders, peaks in the FFRTFS spectra corresponded with the low-frequency sine-wave carriers and side band frequencies in the stimulus spectra. When the temporal envelope cutoff frequency increased from 50 to 500 Hz, an improvement was observed in brainstem F0 representation with no change in brainstem representation of spectral peaks proximal to the first formant frequency (F1). There was no significant effect of carrier type (sine- versus noise-vocoder) on brainstem neural representation of F0 cues when the temporal envelope cutoff frequency was 500 Hz.
CONCLUSIONS: While the improvement in neural representation of temporal envelope and TFS cues with up to 4 vocoder channels is consistent with the behavioral literature, the reduced neural phase-locking strength noted with even more channels may be because of the narrow bandwidth of each channel as the number of channels increases. Stronger neural representation of temporal envelope cues with higher temporal envelope cutoff frequencies is likely a reflection of brainstem neural phase-locking to F0-related periodicity fluctuations preserved in the 500-Hz temporal envelopes, which are unavailable in the 50-Hz temporal envelopes. No effect of temporal envelope cutoff frequency was seen for neural representation of TFS cues, suggesting that spectral side band frequencies created by the 500-Hz temporal envelopes did not improve neural representation of F1 cues over the 50-Hz temporal envelopes. Finally, brainstem F0 representation was not significantly affected by carrier type with a temporal envelope cutoff frequency of 500 Hz, which is inconsistent with previous results of behavioral studies examining pitch perception of vocoded stimuli.

Entities:  

Mesh:

Year:  2017        PMID: 28362674      PMCID: PMC5570627          DOI: 10.1097/AUD.0000000000000432

Source DB:  PubMed          Journal:  Ear Hear        ISSN: 0196-0202            Impact factor:   3.570


  33 in total

1.  Chimaeric sounds reveal dichotomies in auditory perception.

Authors:  Zachary M Smith; Bertrand Delgutte; Andrew J Oxenham
Journal:  Nature       Date:  2002-03-07       Impact factor: 49.962

2.  Features of stimulation affecting tonal-speech perception: implications for cochlear prostheses.

Authors:  Li Xu; Yuhjung Tsai; Bryan E Pfingst
Journal:  J Acoust Soc Am       Date:  2002-07       Impact factor: 1.840

3.  Vocal emotion recognition by normal-hearing listeners and cochlear implant users.

Authors:  Qian-Jie Fu; John J Galvin
Journal:  Trends Amplif       Date:  2007-12

4.  Benefit of high-rate envelope cues in vocoder processing: effect of number of channels and spectral region.

Authors:  Michael A Stone; Christian Füllgrabe; Brian C J Moore
Journal:  J Acoust Soc Am       Date:  2008-10       Impact factor: 1.840

5.  Improved fundamental frequency coding in cochlear implant signal processing.

Authors:  Matthias Milczynski; Jan Wouters; Astrid van Wieringen
Journal:  J Acoust Soc Am       Date:  2009-04       Impact factor: 1.840

6.  Envelope and spectral frequency-following responses to vowel sounds.

Authors:  Steven J Aiken; Terence W Picton
Journal:  Hear Res       Date:  2008-08-19       Impact factor: 3.208

7.  Responses to speech signals in the normal and pathological peripheral auditory system.

Authors:  A R Palmer; P A Moorjani
Journal:  Prog Brain Res       Date:  1993       Impact factor: 2.453

8.  Distortion products and their influence on representation of pitch-relevant information in the human brainstem for unresolved harmonic complex tones.

Authors:  Christopher J Smalt; Ananthanarayan Krishnan; Gavin M Bidelman; Saradha Ananthakrishnan; Jackson T Gandour
Journal:  Hear Res       Date:  2012-08-14       Impact factor: 3.208

9.  Experience-dependent neural representation of dynamic pitch in the brainstem.

Authors:  Ananthanarayan Krishnan; Jackson T Gandour; Gavin M Bidelman; Jayaganesh Swaminathan
Journal:  Neuroreport       Date:  2009-03-04       Impact factor: 1.837

10.  Evoked cortical activity and speech recognition as a function of the number of simulated cochlear implant channels.

Authors:  L M Friesen; K L Tremblay; N Rohila; R A Wright; R V Shannon; D Başkent; J T Rubinstein
Journal:  Clin Neurophysiol       Date:  2009-02-27       Impact factor: 3.708

View more
  1 in total

1.  Age-Related Compensation Mechanism Revealed in the Cortical Representation of Degraded Speech.

Authors:  Samira Anderson; Lindsey Roque; Casey R Gaskins; Sandra Gordon-Salant; Matthew J Goupell
Journal:  J Assoc Res Otolaryngol       Date:  2020-07-08
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.