| Literature DB >> 33783913 |
Ira Kurthen1, Jolanda Galbier2, Laura Jagoda2, Pia Neuschwander2, Nathalie Giroud3, Martin Meyer2,4,5.
Abstract
Speech understanding in noisy situations is compromised in old age. This study investigated the energetic and informational masking components of multi-talker babble noise and their influence on neural tracking of the speech envelope in a sample of healthy older adults. Twenty-three older adults (age range 65-80 years) listened to an audiobook embedded in noise while their electroencephalogram (EEG) was recorded. Energetic masking was manipulated by varying the signal-to-noise ratio (SNR) between target speech and background talkers and informational masking was manipulated by varying the number of background talkers. Neural envelope tracking was measured by calculating temporal response functions (TRFs) between speech envelope and EEG. Number of background talkers, but not SNR modulated the amplitude of an earlier (around 50 ms time lag) and a later (around 300 ms time lag) peak in the TRFs. Selective attention, but not working memory or peripheral hearing additionally modulated the amplitude of the later TRF peak. Finally, amplitude of the later TRF peak was positively related to accuracy in the comprehension task. The results suggest that stronger envelope tracking is beneficial for speech-in-noise understanding and that selective attention is an important ability supporting speech-in-noise understanding in multi-talker scenes.Entities:
Keywords: electroencephalography; energetic masking; envelope tracking; informational masking; older adults; speech processing; speech-in-noise
Mesh:
Year: 2021 PMID: 33783913 PMCID: PMC8193518 DOI: 10.1002/hbm.25415
Source DB: PubMed Journal: Hum Brain Mapp ISSN: 1065-9471 Impact factor: 5.038
Correlation matrix of age and envelope tracking predictors
| Age | PTA | Flanker | |
|---|---|---|---|
| Age | |||
| PTA | 0.42* | ||
| Flanker | −0.06 | 0.18 | |
| Sentence span | 0.03 | −0.10 | −0.11 |
Note: *p <.05, **p <.01, ***p <.001. Negative values indicate negative correlations. Note that for PTA, larger values indicate higher hearing thresholds; for Flanker, higher values indicate worse selective attention; for Sentence Span, higher values indicate better working memory.
Summary statistics of hearing and cognitive abilities
| Original |
| |
|---|---|---|
| ( | ( | |
|
| ||
| Mean ( | 27.7 (9.34) | −0.02 (1.02) |
| Median [min, max] | 26.5 [10.4, 50.6] | −0.30 [−1.71, 2.24] |
|
| ||
| Mean ( | 0.60 (0.27) | −0.02 (1.02) |
| Median [min, max] | 0.65 [0, 0.95] | −0.15 [−1.90, 2.48] |
|
| ||
| Mean ( | 35.7 (48.5) | −0.03 (1.01) |
| Median [min, max] | 22.0 [−44.8, 143] | 0.15 [−2.28, 1.29] |
Note: Summary statistics for hearing and cognitive ability scores. The first column shows the scores how they were obtained from the hearing and cognitive tasks, and the second column shows the z‐scored scores. Please note that the z‐scored scores do not have 0 as their mean and 1 as their standard deviation because z‐scoring was performed on data of all 24 initial participants. PTA, pure‐tone average.
Experimental conditions
| Number of background talkers (IM) | ||
|---|---|---|
| SNR (EM) | 2 | 8 |
| 0 | High EM, high IM | High EM, low IM |
| 2 | Low EM, high IM | Low EM, low IM |
Note: This table shows the four experimental conditions. Each condition is defined by high or low EM and IM.
FIGURE 1(a) Sensitivity index d′ by condition. (b) Response bias c by condition. (c) Accuracy in the CT by condition. CT, comprehension task; IT, intelligibility task; 2_0, 2 background talkers; SNR 0, 2_2, 2 background talkers; SNR 2, 8_0, 8 background talkers, SNR 0, 8_2, 2 background talkers, SNR 2
FIGURE 2Grand average TFR traces of the four experimental conditions and the baseline TRF, averaged across postero‐occipital midline electrodes (A21, A22, A23, and A24), which are representative of the posterior electrode cluster. Time lags at which the average of the actual TRFs significantly differed from the baseline TRF (p <.05) are denoted with the gray bar slightly above the x axis. BT, background talkers; SNR, signal‐to‐noise ratio; TRF, temporal response function
FIGURE 3(a) Grand average TFR traces of the 2 BT conditions irrespective of SNR, the 8 BT conditions irrespective of SNR, and the baseline TRF, averaged across anterior cluster electrodes (C19, C25, C26, C27, C32, D3, D4, D5, D11, D12, D19, D20, and D27). Time lags at which the average of the actual TRFs differed from the baseline TRF are denoted with the light gray bar slightly above the x axis. Time lags at which the 2 BT TRFs differed from the 8 BT TRFs are denoted with the darker gray bar. (b) Topographies of t‐values at 31.3 and at 281 ms time lags. (c) Grand average TFR traces of the 2 BT conditions irrespective of SNR, the 8 BT conditions irrespective of SNR, and the baseline TRF, averaged across posterior cluster electrodes (A9, A10, A11, A12, A13, A14, A15, A16, A21, A22, A23, A24, A25, A26, A27, A28, A29, A31, A32, B3, B8, and B9). Time lags at which the average of the actual TRFs differed from the baseline TRF are denoted with the light gray bar slightly above the x axis. Time lags at which the 2 BT TRFs differed from the 8 BT TRFs are denoted with the darker gray bar. BT, background talkers; SNR, signal‐to‐noise ratio; TRF, temporal response function
Parameters from the model predicting TRF300 positive cluster amplitude from SNR, nBT, and selective attention
| Estimate |
|
|
| Pr (>| |
| |
|---|---|---|---|---|---|---|
| (Intercept) | −0.038 | 0.021 | 73 | −1.8 | 0.077 | 0.048 |
| SNR2 | −0.053 | 0.026 | 63 | −2 | 0.05 | |
| talkers8 | −0.14 | 0.026 | 63 | −5.5 | 7.4e−07 | |
| Flanker_z | 0.0074 | 0.021 | 73 | 0.35 | 0.73 | |
| SNR2:talkers8 | 0.079 | 0.037 | 63 | 2.1 | 0.038 | |
| SNR2:Flanker_z | 0.042 | 0.026 | 63 | 1.6 | 0.11 | |
| talkers8:Flanker_z | 0.053 | 0.026 | 63 | 2 | 0.045 | |
| SNR2:talkers8:Flanker_z | −0.076 | 0.037 | 63 | −2.1 | 0.043 |
Note: Model formula: amplitude – SNR * talkers * Flanker_z + (1 ∣ participant).
FIGURE 4Effects plot of the three‐way interaction between SNR, nBT, and selective attention (as measured by the Flanker task). A lower Flanker score indicates better selective attention. Bars indicate 95% confidence intervals. nBT, number of background talkers; SNR, signal‐to‐noise ratio
FIGURE 5Effects plot of the interaction between SNR and amplitude of the TRF300 negative cluster. Bars indicate 95% confidence intervals. SNR, signal‐to‐noise ratio; TRF, temporal response function