Literature DB >> 31671991

Weak neural signatures of spatial selective auditory attention in hearing-impaired listeners.

Lia M Bonacci¹, Lengshi Dai¹, Barbara G Shinn-Cunningham¹.

Abstract

Spatial attention may be used to select target speech in one location while suppressing irrelevant speech in another. However, if perceptual resolution of spatial cues is weak, spatially focused attention may work poorly, leading to difficulty communicating in noisy settings. In electroencephalography (EEG), the distribution of alpha (8-14 Hz) power over parietal sensors reflects the spatial focus of attention [Banerjee, Snyder, Molholm, and Foxe (2011). J. Neurosci. 31, 9923-9932; Foxe and Snyder (2011). Front. Psychol. 2, 154.] If spatial attention is degraded, however, alpha may not be modulated across parietal sensors. A previously published behavioral and EEG study found that, compared to normal-hearing (NH) listeners, hearing-impaired (HI) listeners often had higher interaural time difference thresholds, worse performance when asked to report the content of an acoustic stream from a particular location, and weaker attentional modulation of neural responses evoked by sounds in a mixture [Dai, Best, and Shinn-Cunningham (2018). Proc. Natl. Acad. Sci. U. S. A. 115, E3286]. This study explored whether these same HI listeners also showed weaker alpha lateralization during the previously reported task. In NH listeners, hemispheric parietal alpha power was greater when the ipsilateral location was attended; this lateralization was stronger when competing melodies were separated by a larger spatial difference. In HI listeners, however, alpha was not lateralized across parietal sensors, consistent with a degraded ability to use spatial features to selectively attend.

Entities: CellLine Disease Gene Species

Mesh：

Year: 2019 PMID： 31671991 PMCID： PMC7273515 DOI： 10.1121/1.5129055

Source DB: PubMed Journal: J Acoust Soc Am ISSN： 0001-4966 Impact factor: 1.840

INTRODUCTION

Knowing where to attend is often helpful when trying to communicate in noisy environments (Kidd ). However, if an individual has difficulty perceiving spatial differences among competing sound sources, then they may have difficulty deploying spatial attention. Hearing-impaired (HI) individuals often report difficulty holding conversations in noisy environments, even when using hearing aids (Marrone ). These difficulties likely arise from poor encoding of sound features in the auditory periphery (Shinn-Cunningham, 2008; Shinn-Cunningham and Best, 2008). Specifically, if the perceptual representation of the spectro-temporal structure of sound is degraded (Buss ; Strelcyk and Dau, 2009), then higher-order features that arise from these local features, like pitch and location, will also be degraded. Since such features support source segregation and source selection, a degraded peripheral representation can lead to failures on tasks requiring attention to be focused on a source in a complex scene (Shinn-Cunningham and Best, 2008). Neurophysiological correlates of selective attention are often obtained using electroencephalography (EEG). In particular, growing evidence suggests that the distribution of alpha (8–14 Hz) oscillatory power across parietal sensors reflects the spatial focus of attention, with alpha power increasing ipsilateral to the location being attended (Foxe and Snyder, 2011; Banerjee ). It is thought that this increase in alpha reflects suppression of the representation of distractors in the contralateral location (Worden ; Foxe and Snyder, 2011; Banerjee ). If spatial attention is degraded, however, then these neural correlates of attention may also be degraded. In a recent study, we found that HI individuals were less sensitive to interaural time differences (ITDs, a key feature when determining the perceived direction of sound) than normal-hearing (NH) individuals (Colburn, 1982; Dai ). Since perceived spatial differences are crucial for deploying spatial attention, then HI individuals may depend less on spatial cues to segregate and select objects in a complex scene. If this is the case, then EEG correlates of spatial attention, including the distribution of alpha power, may not be strongly modulated with the locus of attention in HI listeners. In healthy young listeners, attention strongly affects the magnitude of onset-evoked responses, including the N1, which arises between 100 and 150 ms after an onset event (e.g., the start of a note in a melody or a plosive sound in speech). In particular, N1 event-related potentials (ERPs) to events in a sound stream are larger when the stream is attended compared to when it is ignored (Hillyard ; Choi ; Choi ). However, we previously found that this attentional modulation of the N1 is weaker in HI listeners than in NH listeners (Dai ). Specifically, when NH subjects were directed to attend one of three simultaneous melodies, N1s to attended notes were larger than N1s to ignored notes; however, this difference was significantly reduced in HI listeners. Consistent with a previous study of NH listeners (Choi ), we also found a direct correlation between the magnitude of the attentional modulation of N1 and the ability of individual HI and NH listeners to perform the spatial selective attention task. These results suggest that a degradation of attentional modulation reflects a degraded ability to selectively attend. These HI listeners also had higher audiometric, ITD, and audibility thresholds than NH listeners, supporting the idea that degraded feature representation contributes to failures of selective attention (Shinn-Cunningham, 2008; Shinn-Cunningham and Best, 2008). Given these results, we wondered whether the degraded spatial selective attention abilities of HI listeners might also manifest in a weaker lateralization of alpha power over parietal EEG sensors. Specifically, based on previous work, we expected to find that the NH listeners in our study would display lateralized parietal alpha during auditory stimulus presentation. Furthermore, in that study, we tested conditions when the competing melodies were separated by either large or small differences in perceived lateral position. Based on other studies in our lab (Deng ), we expected that for these NH listeners—who we thought would be able to deploy spatial attention effectively—the lateralization of alpha would be greater when the perceived spatial separation of competing melodies was large compared to when it was small. However, we hypothesized that HI listeners would show an overall reduction in or even a lack of alpha lateralization, reflecting a weaker deployment of spatial attention. While our previous study (Dai ) showed performance improvements for both NH and HI groups with increased spatial separation, HI listeners performed worse than NH listeners in both conditions. Based on these results, we predicted that the internal noise in spatial representation would be greater for HI listeners, which would result in overall weaker alpha lateralization for both spatial separation conditions. To test these ideas, we reanalyzed the EEG data collected in our previously published study (Dai ) of spatial selective attention in NH and HI listeners.

METHODS

Data were taken from the experiment previously published in Dai . The subjects, stimuli, experimental paradigm, and data collection from this experiment are summarized below.

Experimental task and stimuli

Subjects were presented with three simultaneous melodies: a distractor melody, a leading melody, and a lagging melody [Fig. 1(A)]. For each trial, either the leading or lagging melody was the target and the distractor was always ignored. The three melodies were differentiated by the timing of their notes. The distractor melody started first, consisting of four notes with a duration of 919 ms and inter-stimulus interval (ISI) of 959 ms. A leading melody started 490 ms after distractor onset, and consisted of five notes with a duration of 624 ms and ISI of 664 ms. A lagging melody started 200 ms after the leading melody onset, and consisted of four notes with a duration of 728 ms and ISI of 768 ms.

FIG. 1.

Stimuli and experimental paradigm [figure adapted from (Dai )]. (A) Each trial presented three simultaneous melodies. Each melody consisted of a collection of high and low notes that formed one of three classes of pitch contours: “rising,” “falling,” or “zigzagging.” A distractor melody that was never the designated target always started first, with complex tones of F0 276 or 317 Hz. Next, the leading melody started, with each note having an F0 of either 177 or 194 Hz. Finally, the lagging melody started, with F0s that were either 113 or 124 Hz. (B) For each trial, subjects fixated on a central point until a visual cue was given to either attend left, right, or center. On each trial, the correspondence between streams (distractor, leading, and lagging) and direction (left, right, and center) was selected randomly. Left and right melodies were spatialized using symmetric ITD pairs, and two conditions were tested: one with small ITDs (light blue) and one with large ITDs (dark blue). After stimulus presentation, subjects were asked to report the pitch contour of the cued melody and were given visual feedback on the correctness of their response. Each of the three melodies was composed of high (H) and low (L) notes, differing in their fundamental frequency (F0). The H and L notes within a stream were relatively close in pitch, while the pitch separation between streams was larger. Distractor tones were comprised of a sinusoid of an F0 (276 or 317 Hz) and its first three harmonics, all at equal amplitude. Leading and lagging tones were broader band, consisting of an F0 (leading F0: 113 or 124 Hz, lagging F0: 177 or 194 Hz) and its first 33 harmonics, all at equal amplitude. Notes were gated on and off with cosine-squared ramps of duration 10 and 100 ms for onset and offset, respectively. Notes in each melody were arranged such that they formed a pitch contour that was “rising,” “falling,” or “zigzagging.” For “rising” melodies, the first note was low and transitioned to the high note for all subsequent notes (e.g., L-H-H-H). “Falling” melodies started high and transitioned to the low note for all subsequent notes (e.g., H-L-L-L). “Zigzagging” melodies started high or low, transitioned to the other note, and returned to the original note on the last onset (e.g., L-H-H-L or H-L-L-H). The pitch contour of each melody was chosen independently of the others, with each contour having equal probability (1/3). The three melodies were spatialized such that one came from the left, one from the right, and one from center; the correspondence between melody and position was assigned randomly on each trial. Symmetric ITD pairs (either ±205 μs or ±799 μs) were used to spatialize left and right stimuli, while 0 μs ITD was used for the center melody [Fig. 1(B)]. Trials in which the symmetric ITD pair was small (±205 μs) and those in which it was large (±799 μs) were intermixed. All possible combinations of target location (left, right, or center), spatial separation (small or large ITD), and target stream type (leading or lagging) were tested, for a total of 480 trials (40 in each condition). Here, we only focus on attend-left and attend-right trials in each ITD condition, collapsed across all other conditions, as these should show the greatest alpha lateralization, providing the strongest test of alpha lateralization. The structure of each trial is outlined in Fig. 1(B). At the beginning of each trial, subjects fixated on a central dot for 1 s before a 1-s visual cue was given. The visual cue was an arrow that pointed left, right, or upward, signaling subjects to pay attention to the left, right, or center melody, respectively. After the visual cue, there was a 0.3-s quiet period, followed by 3.8 s of auditory stimulus playback, and another 0.7-s quiet period. Subjects were then prompted to identify the pitch contour of the cued sequence via button press. Subjects were given 1.5 s to respond after which visual feedback was given to indicate if the response was correct or not. Training took place before testing to ensure that subjects could properly identify pitch contours of a single melody in quiet. This training consisted of two 12-trial blocks of a single stream. The first block tested leading streams (lowest F0), and the second block tested lagging streams (middle F0). Subjects were required to performed additional blocks until they achieved eight of 12 correct trials for seven consecutive blocks. Before testing, we also measured ITD thresholds for each subject using an adaptive three-down-one-up tracking procedure with notes similar to those presented during the auditory attention task [see Materials and Methods: ITD Threshold Procedures in Dai ]. Subjects were presented with two notes and had to report whether the first note was to the left or right of the second. In order to qualify for the experiment, subjects had to have an ITD threshold smaller than 205 μs; no subjects were disqualified for failing to meet this criterion.

Subjects

Data were collected from 25 NH listeners (13 male, 12 female, aged 20–52 yr) and 15 HI listeners (eight male, seven female, aged 20–59 yr). All NH listeners had audiometric thresholds ≤20 dB hearing level (HL) at octave frequencies from 250 Hz to 8 kHz. HI listeners had bilateral symmetric sensorineural hearing loss. Audiometric thresholds for all HI listeners were ≥25 dB HL at one or more frequencies from 250 to 8000 Hz, and threshold differences between the two ears were ≤20 dB at each frequency. NH and HI groups did not differ significantly in age (two-sided Wilcoxon Rank Sum test; rank sum = 329, P = 0.5651) (Dai ). The NH and HI groups also had similar abilities on non-auditory attention tasks. Specifically, the groups were statistically indistinguishable on two different visual tasks from the Test of Everyday Attention [Pearson; see Materials and Methods: Participants in Dai ]. Stimuli were presented at 70 dB sound pressure level (SPL) for all NH listeners. For HI listeners, the level was adjusted, starting at 70 dB SPL and increasing in steps of 5 dB until a comfortable level was reached. Of the 15 HI listeners, five settled on 75 dB SPL while the remaining ten settled on 70 dB SPL. These levels were used in training, prior to the testing. Therefore, given that all HI listeners were able to perform the melody contour identification task in quiet, audibility of the melodies was not a limiting factor in their performance. All subjects gave informed consent before participating, and were compensated at an hourly rate and also paid a bonus of $0.02 for each correct response in order to maintain motivation. All procedures were approved by the Boston University Institutional Review Board.

Data collection

EEG data were recorded in 32 electrodes and sampled at 4096 Hz using the BioSemi ActiveTwo system along with its ActiveView acquisition software (BioSemi, Amsterdam, Netherlands). During recording, the BioSemi ActiveTwo system applies a low-pass filter in hardware with cutoff frequency at 800 Hz. Scalp electrode positions were arranged according to the international 10–20 system, and two reference electrodes were placed on the mastoids. Event triggers were generated by matlab interfaced with Tucker-Davis Technologies System 3 (TDT, Alachua, FL) hardware and sent to the computer recording EEG data. Subjects performed the experiment in front of an LCD monitor in a sound-treated booth. Stimuli were generated using matlab (MathWorks, Natick, MA) with the PsychToolbox 3 extension (Brainard, 1997). Sound stimuli were presented diotically via Etymotic ER-1 insert headphones (Etymotic, Elk Grove Village, IL) connected to Tucker-Davis Technologies System 3 (TDT, Alachua, FL) hardware which interfaced with the matlab software running the experiment. During the task, subjects were instructed to keep eyes open and positioned on a central fixation dot.

Data analysis

EEG processing

Raw EEG data were first filtered from 1.5 to 50 Hz using a 6000-point finite impulse response (FIR) band-pass filter. Data were then epoched and downsampled to 256 Hz before band-pass filtering again from 2 to 25 Hz. Scalp voltages were transformed to current source density (CSD) using CSD Toolbox (Kayser and Tenke, 2006). This transform has been shown to reduce spatial noise, which is useful when localizing alpha over parietal sensors (McFarland, 2015; Kayser and Tenke, 2015). No artifact rejection was undertaken; all trials, whether correct and incorrect, were included in analysis. We chose this approach to ensure that we had a sufficient number of trials to analyze for all subjects. Therefore, for each subject, we averaged across 80 trials for each condition (attend-left and attend-right).

Induced alpha power

Figure 2(A) illustrates calculation of the induced alpha response. To obtain the induced response, we first removed phase-locked, or evoked activity. The evoked response was first calculated by averaging epochs across trials in each condition for each subject. This trial-average was then subtracted from each epoch to remove the phase-locked component for each trial. Some argue that one should not remove evoked responses when estimating alpha power. However for the analyses reported here, we perform within-subject subtraction of identically estimated alpha energy for attend-left versus attend-right trials. We then only compare averages of these estimated values across groups. Therefore, any effects of excluding evoked power would impact both attend-left and attend-right conditions identically. Based on this argument, it is not surprising that including the evoked response does not substantially change our results.

FIG. 2.

(A) Calculation of induced alpha power. For each subject and ITD condition, induced alpha power across time for each channel was calculated separately for attend-left and attend-right trials. First, single trial EEG was transformed from scalp voltage to current source density. Then, the induced EEG response was estimated by subtracting the evoked response from each trial. A time-frequency transform was then applied, and the power at the subject's individual alpha frequency (IAF) was extracted to produce a single alpha power time course for each trial in each sensor. The median across these trials was taken before normalizing the time series. (B) Calculation of AMI for each subject. Average alpha power during the stimulus period was collapsed across attend-left and attend-right trials to obtain the average alpha power ipsilateral and contralateral to the focus of attention. AMI was then computed using Eq. (1). After removing the evoked activity, a short-time Fourier transform (STFT) with a 1-s Hanning window was applied to each trial to estimate the power at each frequency in the alpha band (8–14 Hz). For each subject, an individual alpha frequency (IAF) was determined separately from the STFT analysis by finding the frequency in the range of 8–14 Hz whose magnitude was largest on average across cue-left and cue-right conditions and 10 parietal and occipital channels (P4, P8, PO4,O2, P3, P7, PO3, O1, Oz, Pz). This was estimated using a fast Fourier transform (FFT) with a frequency resolution of 0.11 Hz on entire epochs. In examining the distribution of IAFs estimated for each subject, we found that most subjects had an IAF of 10 Hz (17 subjects) or 11 Hz (13 subjects). In comparing the IAF distributions between NH (mean = 10.36, SD = 0.95, median = 10, mode = 10) and HI listeners (mean = 10.60, SD = 1.06, median = 11, mode = 11), we found no significant difference. Once an IAF was selected, power was extracted from the STFT signal at this frequency to produce a single time series for each trial in each EEG channel. For each subject, average alpha power at each time point and sensor was estimated for each condition using the median across all trials in that condition. First, attend leading and lagging trials were combined within each spatial attention condition (i.e., attend-left and attend-right). The median was then taken across the combined trials to estimate the average alpha power time series in each channel. The median was used instead of the mean in order to obtain an estimate that was robust to outliers, since no artifact rejection was performed. These trial-averaged time series were then normalized for each subject by dividing each time point by the average alpha power across time, sensors, and experimental conditions. Grand averages were obtained from these normalized time series. Quantities shown on topoplots represent averages across the stimulus period, which begins at the first distractor note onset and ends at the onset of the final leading melody note (0–3.14 s). An attentional modulation index of alpha power, AMI, was quantified for each subject, and is given by Eq. (1): Note that αipsi is the average alpha power during the stimulus period (0–3.14 s), measured ipsilateral to the cued sequence (e.g., average alpha in left parietal channels during attend-left trials), and αcontra is this average alpha power, measured contralateral to the cued sequence, (e.g., average alpha in right parietal channels during attend-left trials), as illustrated in Fig. 2(B). Positive values of AMI indicate that alpha power was overall larger when subjects attended the ipsilateral stimuli (i.e., the alpha response over a particular set of cortices was greater when ignoring the contralateral stimuli), as expected. Averages were calculated across left and right parietal and occipital channels separately, depending on the attention condition (i.e., left channels P3, P7, PO3, O1 for αipsi in attend-left trials and right channels P4, P8, PO4, O2 for αipsi in attend-right trials).

Significance testing

We asked if there were differences in alpha modulation (AMI) between NH and HI listeners in any of the ITD conditions tested. To determine if there were significant differences, we used a two-way mixed factors analysis of variance (ANOVA), with the between-groups factor being hearing status (two levels: NH and HI) and the within-groups factor being ITD condition (two levels: small and large ITD). Kolmogorov-Smirnov tests of normality were conducted before obtaining ANOVA results. Tukey's HSD was used post hoc to compare AMI between ITD conditions within each group (NH and HI). One-sample t-tests were also used post hoc to determine if AMI was significantly greater than zero. A two-way mixed factors ANOVA was also used to determine if differences in performance measures existed in the subset of trials reported here, which did not include attend center trials analyzed in Dai . A t-test was used to determine if ITD thresholds were significantly different between NH and HI listeners, and Pearson's method was used quantify the correlation between these ITD thresholds and performance measures.

RESULTS

Behavior

HI listeners performed worse on the task, and had higher ITD thresholds than NH listeners

As we previously reported (Dai ), HI listeners performed significantly worse on the spatial attention task than NH listeners. This result is summarized here by comparing the average percent correct scores, collapsed across attend-left and attend-right trials in each ITD condition [Fig. 3(A)]. A two-way mixed ANOVA confirmed significant main effects of hearing status (F(1,38) = 18.8, p = 0.0001, ) and ITD condition (F(1,38) = 15.5, p = 0.0003, ) on percent correct scores, and no significant interaction (F(1,38) = 0.66, p = 0.42, ). Thus, HI listeners performed significantly worse on the task than NH listeners, and increasing the perceived spatial separation significantly improved performance overall.

FIG. 3.

(A) Performance averaged across attend-left and attend-right trials. Asterisks indicate significant main effects of hearing status and ITD condition. (B) ITD thresholds for NH and HI listeners. HI listeners had significantly higher ITD thresholds than NH listeners (p < 0.01, t-test). Error bars represent the standard error of the mean. Figure 3(B) shows ITD thresholds for NH and HI subject groups. ITD thresholds were measured separately for the leading, low pitch stimuli, and the lagging, high pitch stimuli. Since these thresholds were not significantly different within subjects for the two different stimuli, we averaged the two measured ITD thresholds for each subject. The average ITD threshold for NH listeners (23.46 ± 11.79 μs, mean ± std. dev) was significantly lower than that for HI listeners (67.81 ± 60.02 μs, mean ± std. dev) (p < 0.01, t-test). These results confirm that these HI listeners had significantly poorer spatial acuity than the NH listeners in our experiment. Previous results published in Dai found a significant correlation between average ITD threshold and average performance on the task for both NH and HI listeners. This correlation remained for the subset of data analyzed here, which excluded “attend center” trials; only trials for which the target was to the left or right were included (NH: r = −0.48, p = 0.0148, HI: r = −0.59, p = 0.029).

Induced alpha power

In NH listeners, alpha was lateralized over parietal sensors, and this lateralization was stronger in the large ITD condition

Figure 4(A) shows the time course of induced alpha power averaged in left and right parietal-occipital sensors for NH listeners. In left sensors, alpha power was greater throughout the stimulus period (0–3.14 s) when subjects were cued to attend the melody on the left (blue trace), compared to when they were cued to attend the melody on the right (red trace). In right sensors, alpha power was greater throughout most of the stimulus period during attend-right trials. In comparing small and large ITD conditions, the difference in alpha power between red and blue traces appears to be larger in both left and right parietal-occipital sensors. In both conditions, we see that alpha power was modulated over time in NH listeners. After receiving a visual cue for where to attend, alpha decreased briefly and then increased before stimulus playback. Before the response period, alpha decreased again.

FIG. 4.

Grand average alpha power for NH listeners. (A) Grand average alpha power in left and right PO channels for both ITD conditions. Error bars represent the standard error of the mean. Dashed vertical lines specify the onsets of the first and last notes of auditory playback. (B) Grand average alpha power in 32 channels, averaged during the stimulus period (0–3.14 s). Average alpha is displayed separately for attend-left and attend-right trials. (C) Grand average alpha power differences between attend-left and attend-right trials during the stimulus period in each of the 32 channels. Figure 4(B) shows alpha power averaged over the stimulus period (0–3.14 s) in each sensor for NH listeners. Here, we see an overall asymmetry, independent of the direction of attention: alpha power was always larger in right parietal sensors. This general asymmetry is consistent with the parietal spatial representation being asymmetric, and has been observed in other studies of alpha lateralization with spatial attention (Heilman and Van Den Abell, 1980; Pouget and Driver, 2000; Szczepanski ; Ikkai ). Importantly, however, in comparing attend-left and attend-right conditions, alpha was greater in left parietal sensors for attend-left trials compared to attend-right trials; in right parietal sensors, alpha was greater in attend-right trials. These differences are more apparent in Fig. 4(C), where alpha is shown as the average difference between attend-left and attend-right trials during the stimulus period at each sensor on the scalp. Comparing small and large ITD conditions, we see that this alpha modulation with the direction of spatial attention was stronger when the perceived spatial separation was large.

In HI listeners, no alpha lateralization was observed across parietal sensors in either small or large ITD condition

Figure 5(A) shows the time course of alpha power averaged in parietal-occipital sensors for HI listeners. Unlike in NH listeners, alpha power in HI listeners did not appear to be modulated over time in either left or right sensors in either attention condition; alpha power did not even decrease after presentation of the visual cue as it did in NH listeners. There also appears to be no difference, in either set of parietal sensors, between attend-left and attend-right trials during the stimulus period. These results are similar between small and large ITD conditions.

FIG. 5.

Grand average alpha power for HI listeners. (A) Grand average alpha power in left and right PO channels for both ITD conditions. Two subjects' data were excluded from these time traces due to motion artifacts during the cue period. These subjects were only excluded in A since these artifacts occurred outside the time period of interest for subsequent analyses (0–3.14 s). Error bars represent the standard error of the mean. Dashed vertical lines specify the onsets of the first and last notes of auditory playback. (B) Grand average alpha power in 32 channels, averaged during the stimulus period (0–3.14 s). Average alpha is displayed separately for attend-left and attend-right trials. (C) Grand average alpha power differences between attend-left and attend-right trials during the stimulus period in each of the 32 channels. Average alpha power during the stimulus period is shown in Fig. 5(B). Here, we see the same asymmetry observed in NH listeners: greater alpha power in right parietal-occipital sensors than left sensors. However, unlike in NH listeners, there appears to be no substantial difference in any of these sensors between attend-left and attend-right trials [Fig. 5(C)]. Assuming alpha lateralization is an indication that spatial features are being used for selective attention, these results suggest that HI listeners do not use these spatial features. Increasing the perceived spatial separation did not increase the amount of alpha modulation observed across parietal sensors.

There was a significant interaction between hearing status and perceived spatial separation

In order to characterize overall alpha modulation, we collapsed the alpha differences shown in Figs. 4(C) and 5(C) across parietal sensors that were mirrored across hemispheres, as shown in Fig. 6(A). Here, alpha power is represented as the generalized difference between ipsilateral and contralateral attention conditions. In NH listeners, alpha power was greater in a particular set of parietal-occipital sensors when subjects were attending the ipsilateral sequence (i.e., ignoring the contralateral sequence). We found no difference between ipsilateral and contralateral attention conditions in HI listeners, however.

FIG. 6.

(A) Grand average alpha power differences between ipsilateral and contralateral attention conditions, mirrored across hemispheres. (B) Alpha attention modulation index for each subject in small (light blue) and large (dark blue) ITD conditions. Asterisks indicate significant differences between ITD conditions at the α = 0.05 significance level (Tukey's HSD). AMI was only significantly greater than zero for NH listeners in the large ITD condition (p = 0.028, t-test). AMI was quantified for each subject and is shown in Fig. 6(B). We asked whether there were significant differences in AMI between NH and HI listeners performing an auditory spatial attention task. The results of a two-way mixed ANOVA found no significant main effect of either hearing status (F(1,38) = 0.157, p = 0.694) or ITD condition (F(1,38) = 0.172, p = 0.681). However, there was a significant interaction between the two factors (F(1,38) = 5.06, p = 0.0303). Tukey post hoc testing revealed that there was a significant difference in AMI between large and small ITD conditions in NH listeners (p = 0.036), but not in HI listeners (p = 0.253), suggesting that a larger perceived spatial difference contributes to greater alpha lateralization in NH listeners, but not in those with degraded spatial acuity. While it may initially seem surprising that there was no main effect of hearing status on alpha modulation, further analysis revealed that for NH listeners, AMI was significantly greater than zero for the large ITD condition (p = 0.028, t-test), but not for the small ITD condition (p = 0.137, t-test), whereas for the HI listeners, it was not significantly greater than zero in either condition. Thus, there is a floor effect on the results: alpha lateralization was only significant for the “best” listeners (the NH listeners) in the large ITD condition. This suggests that unlike NH listeners, HI listeners may not depend on spatial cues to maintain attention on the target stream even in the large ITD condition.

To perform this task, listeners had to rely on spatial cues, at least initially

In our task, the target on a given trial could be either the leading or lagging stream, and the target could come from left, right, or center with equal likelihood. The leading, lagging, and distractor streams differed from one another consistently in their pitch and timing cues. However, the target was only defined by the visual cue for which direction to attend. Thus, all listeners had to initially use spatial information in order to select the target stream from the sound mixture. All of our NH listeners performed well above chance in all conditions (Dai ). While there were a few HI subjects who performed near chance in some spatial configurations [see Fig. 2(B) in Dai for details], most performed well above chance levels. Thus, our results suggest that even in our HI group, most listeners were effective at using spatial attention to focus on the target melody, at least to some degree. Once a target melody was the focus of attention, listeners could maintain attention on that target without using spatial information: the target always differed from the competing melodies in its pitch range and its note timing. In the current study, we do not have sufficient statistical power to explore the time course of alpha lateralization dynamics over the course of a trial. Instead, the current post hoc analyses considered sustained lateralization of alpha power over parietal EEG sensors, to test the hypothesis that sustained alpha lateralization would be weaker in HI listeners compared to NH listeners. However, if listeners transiently engage spatial attention and then maintain focus on a target using other features, it would not be reflected in our alpha lateralization metrics. Future experiments specifically designed to reveal such dynamics could lend more insight into how spatial attention is used by different listeners, and how this relates to their specific hearing acuity.

Spatial acuity only predicts performance when spatial separations are near perceptual limits

A number of studies have demonstrated that HI listeners benefit less from spatial release from masking in multi-talker settings than do NH listeners (Marrone ; Srinivasan ; Best ), consistent with the current results. However, past studies linking spatial acuity measures with selective attention measures have produced conflicting results (e.g., Strelcyk and Dau, 2009; Lőcsei ). In reconciling these discrepant findings, it is important to consider exactly what tasks are being used in a given study, and what is limiting performance. If the spatial separation between competing sounds is large, even HI listeners with poor spatial acuity may be able to use spatial information effectively. For instance, one study of NH and HI listeners examined speech-in-noise performance when presenting two speech streams played with ITDs of −700 μs and + 700 μs (an ITD difference of 1.4 ms) (Lőcsei ). In this case, the large separation benefited both NH and HI listeners by roughly the same amount; moreover, ITD thresholds did not correlate with performance. Another study [by the same research group in Lőcsei ] found a significant correlation between ITD sensitivity and the ability to understand speech that is spatially separated from a target (Strelcyk and Dau, 2009). Importantly, in this study, the target was diotic and a single masker was played from left or right with an ITD of 740 μs [about half the spatial separation between sources used in Lőcsei ]. When the spatial separation of sources is closer to the perceptual limit, it makes sense that ITD sensitivity is closely related to performance. In our study, there were three competing streams that were separated by 799 μs in the large ITD case (and by only 205 μs in the small ITD case). Given that even our “large” ITD was smaller than that used in many studies, and given that our listeners heard a relatively complex scene with three concurrent streams, it is therefore not surprising that spatial acuity was directly related to the ability to perform the task. Given these results, we did look to see whether the degree of alpha lateralization in an individual subject was related to their task performance. While there are direct relationships between the strength of attentional modulation of ERP data, ITD thresholds, and performance [reported previously in Dai ], estimates of alpha lateralization are noisy and show no such relationship. Importantly, our alpha lateralization metric only quantifies sustained spatial attention, so this is not particularly surprising. We suspect that in the right experiment, the strength of sustained alpha lateralization might be directly related to spatial attention performance; however, to see such effects likely requires an experiment in which the competing stream identities are confusable except in their spatial attributes.

In NH listeners, parietal alpha lateralization likely reflects some combination of what location is the focus of spatial attention and how strongly listeners are sustaining spatial attention

Previous work has suggested that parietal EEG alpha lateralization reflects the locus of spatial attention; specifically, alpha power increases over sensors contralateral to ignored stimuli (Worden ; Kerlin ; Foxe and Snyder, 2011; Banerjee ; Händel ; Payne ; van Diepen ; Wöstmann ). While most of these studies have identified this lateralization during visual spatial attention (Worden ; Foxe and Snyder, 2011; Händel ; Payne ; van Diepen ), considerably fewer have addressed alpha as a correlate of auditory spatial attention (but see (Kerlin ; Banerjee ; Wöstmann ). Here, we provide additional evidence for parietal alpha lateralization as a correlate of auditory spatial attention. In NH listeners, we observed that mean alpha power was greater in a particular set of parietal sensors when subjects attended the ipsilateral melody (i.e., ignoring the contralateral melody) when ITDs were large. NH listeners showed less alpha lateralization for small ITDs than for large ITDs, and the lateralization was only significant for large ITDs. This pattern may be explained by multiple (not mutually exclusive) effects. First, in another study from our lab, we have observed that alpha lateralization increases the farther off midline a listener directs auditory spatial attention (Deng ). If the magnitude of alpha lateralization scales with eccentricity of attention, it would produce greater alpha lateralization in large ITD trials than in small ITD trials. The lack of a significant effect in the small ITD condition thus might be simply a matter of statistical power: there may be a small lateralization of alpha that we do not have the power to observe in this study. Second, we have observed that when other sound features, such as pitch differences, differentiate one sound stream from another more effectively than do spatial features, listeners rely on those non-spatial features to maintain attentional focus (Bonacci and Shinn-Cunningham, 2019). In the current study, small ITDs may have been less reliable than the pitch separations and timing regularities of the streams in maintaining attention, so that alpha lateralization averaged over the 3 s of stimulus presentation was not significant. In contrast, when the ITD separation was large, it may have been more reliable than the pitch cue for our NH listeners, leading to significant sustained alpha lateralization in these trials. Indeed, previous studies have shown that when there are redundant features, their relative strengths determine how much influence each has on attention to an ongoing sound stream: as one feature becomes relatively stronger in differentiating competing streams, that feature is more influential, and vice versa (Maddox and Shinn-Cunningham, 2012). Finally, the relative spatial separation between target and distractor melodies may affect the strength of lateralization as well. In particular, perceptual context has been shown to affect the strength of auditory spatial attention such that attention is more strongly directed to a particular object if auditory stream segregation is promoted (Arnott and Alain, 2002). In the current study, the increase in spatial separation may have increased the perceptual separation of the streams, facilitating stronger suppression of distractors. This would be consistent with the increased strength of alpha lateralization with larger ITD separation observed here. The difference in alpha lateralization for small and large ITD conditions seen for NH listeners likely comes from some combination of these factors. Specifically, alpha lateralization seems to scale with the spatial eccentricity of the target, listeners may rely more heavily on spatial cues for larger spatial separations than for smaller separations, and greater perceptual stream separation may promote stronger alpha lateralization.

In HI listeners, alpha was never significantly lateralized, suggesting that HI listeners do not rely strongly on spatial cues to maintain attentional focus

Our HI listeners showed no significant alpha lateralization, even in large ITD trials. As already reported (Dai ), our HI listeners also had worse spatial sensitivity than did our NH listeners. Indeed, many of our HI listeners had ITD discrimination thresholds similar in magnitude to the spatial separation of adjacent streams in the small ITD condition [see Dai ]. To the extent that listeners focus attention to different features based on their relative perceptual reliability, it makes sense that compared to NH listeners, our HI listeners rely more on pitch differences across the streams to maintain focus on the target melody. This likely explains why HI listeners, as a group, showed no significant alpha lateralization even for large ITD condition when averaging over the duration of the roughly 3-s-long trials, while NH listeners did. Even though the HI listeners in the current task did not appear to maintain focus using spatial attention, as noted above, most performed the task above chance levels and thus used spatial cues at least initially. In fact, the behavioral results suggest that increasing the spatial separation of competing melodies helped subjects select the target stream. However, the failure of our HI listeners to show sustained alpha lateralization suggests that once the HI listeners (or perhaps even the NH listeners attending to sources separated by small ITDs) “latched on” to the target, they did not maintain attention using spatial focus, relying instead upon the pitch differences between sources and the regular timing of the notes within the target melody. While we have argued that the weak alpha lateralization observed in HI listeners reflects spatially specific auditory processing deficits, and that these deficits contributed to poor task performance, it is important to note that other factors may be involved. One possibility is that general cognitive deficits, rather than spatial hearing deficits, contributed to poor task performance. However, NH and HI listeners performed similarly on two different visual tasks from the Test of Everyday Attention (Pearson) (Dai ). Still, given our behavioral results, we cannot fully rule out the case that HI listeners have poorer general auditory processing difficulties, rather than a deficit specific to auditory spatial processing. Further work should be performed to fully explore this possibility. It is interesting to note, however, that while not statistically significant, a small interaction effect on performance between hearing status and ITD condition was observed (). Such an effect would suggest that HI listeners actually fail to benefit from an increase in spatial separation. This result could more definitively point to spatially specific deficits, though larger sample sizes are needed to determine the reliability of this effect.

IMPLICATIONS

Our results provide further evidence for alpha lateralization as a correlate of auditory spatial attention. For NH listeners who have good spatial acuity, sustained alpha lateralization over the duration of the melodies is significant when ITDs are large. While alpha lateralization reflects the use of sustained spatial attention in NH listeners hearing large spatial separations, no sustained alpha lateralization was observed in HI listeners. This lack of lateralization suggests that HI listeners do not (or cannot) rely on spatial cues to sustain attention, consistent with their poor ITD discrimination thresholds. This inability to use spatial cues as effectively as do NH listeners helps explain the overall poorer performance of our HI listeners. HI listeners often report difficulty communicating in noisy environments, contributing to a sense of social isolation in everyday settings. If it were possible to decode where or what an individual is trying to attend, then technology could be designed to assist object selection (e.g., by filtering out sound sources that a listener is trying to ignore). For instance, knowledge of where listeners are focusing spatial attention could be used to amplify sound from one direction while suppressing irrelevant sounds from others. EEG technology is being investigated for this purpose in many labs today (Choi ; Eyndhoven ; O'Sullivan ), as it is relatively low cost, portable, and noninvasive. If EEG correlates of attentional focus could be reliably decoded, they could be used to create smart hearing devices that help a listener switch and maintain attention as needed. Our previous analysis showed that HI listeners are less effective at modulating N1 ERPs than are NH listeners (Dai ), calling into question the potential utility of using these neural correlates of attention in next-generation listening devices for the listeners most in need of such aid. Here, we find that alpha lateralization signatures for where a listener is attending are also weak in HI listeners, suggesting HI listeners do not rely on sustained spatial attention in conditions where NH listeners do. Importantly, however, it may be that HI listeners do not rely on sustaining spatial attention because other features are more reliable. If a next-generation hearing device correctly determined where a listener was trying to focus spatial attention and modified the sound entering their ears effectively, HI listeners might learn to rely on spatial attention. Thus, even though the HI listeners do not show strong, robust neural correlates of spatial attention, there is still the possibility that they could be trained to use such neural signals to control an effective device. Future work thus may require not only building devices that look for typical neural correlates of attentional control, but training listeners to engage listening strategies that their past experience has taught them to avoid.

34 in total

1. Anticipatory biasing of visuospatial attention indexed by retinotopically specific alpha-band electroencephalography increases over occipital cortex.

Authors: M S Worden; J J Foxe; N Wang; G V Simpson
Journal: J Neurosci Date: 2000-03-15 Impact factor: 6.167

1. Poor early cortical differentiation of speech predicts perceptual difficulties of severely hearing-impaired listeners in multi-talker environments.

Authors: Brandon T Paul; Mila Uzelac; Emmanuel Chan; Andrew Dimitrijevic
Journal: Sci Rep Date: 2020-04-09 Impact factor: 4.379

1 in total

Weak neural signatures of spatial selective auditory attention in hearing-impaired listeners.

INTRODUCTION

METHODS

Experimental task and stimuli

Subjects

Data collection

Data analysis

EEG processing

Induced alpha power

Significance testing

RESULTS

Behavior

HI listeners performed worse on the task, and had higher ITD thresholds than NH listeners

Induced alpha power

In NH listeners, alpha was lateralized over parietal sensors, and this lateralization was stronger in the large ITD condition

In HI listeners, no alpha lateralization was observed across parietal sensors in either small or large ITD condition

There was a significant interaction between hearing status and perceived spatial separation

To perform this task, listeners had to rely on spatial cues, at least initially

Spatial acuity only predicts performance when spatial separations are near perceptual limits

In NH listeners, parietal alpha lateralization likely reflects some combination of what location is the focus of spatial attention and how strongly listeners are sustaining spatial attention

In HI listeners, alpha was never significantly lateralized, suggesting that HI listeners do not rely strongly on spatial cues to maintain attentional focus

IMPLICATIONS

1. Anticipatory biasing of visuospatial attention indexed by retinotopically specific alpha-band electroencephalography increases over occipital cortex.

2. Alpha oscillations correlate with the successful inhibition of unattended stimuli.

3. Object-based auditory and visual attention.

4. Right hemisphere dominance for attention: the mechanism underlying hemispheric asymmetries of inattention (neglect).

5. The influence of non-spatial factors on measures of spatial release from masking.

Review 6. Relating unilateral neglect to the neural coding of space.

7. Mechanisms of spatial attention control in frontal and parietal cortex.

8. Evaluating the benefit of hearing aids in solving the cocktail party problem.

9. Temporal Fine-Structure Coding and Lateralized Speech Perception in Normal-Hearing and Hearing-Impaired Listeners.

10. The Role of Alpha Activity in Spatial and Feature-Based Attention.

1. Poor early cortical differentiation of speech predicts perceptual difficulties of severely hearing-impaired listeners in multi-talker environments.