| Literature DB >> 29392835 |
Jens Hjortkjaer1,2, Jonatan Märcher-Rørsted1, Søren A Fuglsang1, Torsten Dau1.
Abstract
Neuronal oscillations are thought to play an important role in working memory (WM) and speech processing. Listening to speech in real-life situations is often cognitively demanding but it is unknown whether WM load influences how auditory cortical activity synchronizes to speech features. Here, we developed an auditory n-back paradigm to investigate cortical entrainment to speech envelope fluctuations under different degrees of WM load. We measured the electroencephalogram, pupil dilations and behavioural performance from 22 subjects listening to continuous speech with an embedded n-back task. The speech stimuli consisted of long spoken number sequences created to match natural speech in terms of sentence intonation, syllabic rate and phonetic content. To burden different WM functions during speech processing, listeners performed an n-back task on the speech sequences in different levels of background noise. Increasing WM load at higher n-back levels was associated with a decrease in posterior alpha power as well as increased pupil dilations. Frontal theta power increased at the start of the trial and increased additionally with higher n-back level. The observed alpha-theta power changes are consistent with visual n-back paradigms suggesting general oscillatory correlates of WM processing load. Speech entrainment was measured as a linear mapping between the envelope of the speech signal and low-frequency cortical activity (< 13 Hz). We found that increases in both types of WM load (background noise and n-back level) decreased cortical speech envelope entrainment. Although entrainment persisted under high load, our results suggest a top-down influence of WM processing on cortical speech entrainment.Entities:
Keywords: zzm321990EEGzzm321990; alpha and theta oscillations; n-back task; pupillometry; speech entrainment
Mesh:
Year: 2018 PMID: 29392835 PMCID: PMC7155003 DOI: 10.1111/ejn.13855
Source DB: PubMed Journal: Eur J Neurosci ISSN: 0953-816X Impact factor: 3.386
Figure 1Schematic illustration of the trial structure and task. Electroencephalogram and pupillometry were recorded, while subjects listened to continuous speech stimuli consisting of spoken number sequences. Red lines on the waveform represent the pitch contour of the continuous speech signal. In different trials, listeners identified either 1‐back or 2‐back number targets in different levels of background noise. Please see the Methods section for details.
Figure 2Behavioural performance (above) and pupil responses (below). (A) Percentage of correctly detected 1‐back and 2‐back targets during the speech trial. Larger circles represent the group average % correct at the average position of the targets. Shaded areas represent ± 1 SEM. (B) Behavioural sensitivity (d‐prime) for n‐back target detection measured over the trial duration. (C) The average trace of the pupil dilations relative to a pre‐stimulus baseline. (D) Mean and peak pupil dilation over the trial duration. Error bars represent ± 1 SEM ***P < 0.001.
Figure 3Changes in oscillatory power during the n‐back speech task. (A) Time–frequency representations (TFRs) of the power changes between the 2‐back and 1‐back tasks at frontal electrode AFz (above) and posterior electrode Oz (below). White stippled lines mark the location of the theta (above) and alpha (below) bands. Traces below the TFRs show the normalized theta band power and alpha band power in the two n‐back tasks. Shaded areas in the traces represent ± 1 SEM across subjects for each 5 s time window. (B) Trial‐mean (5–45 s) power in frontal theta (left) and posterior alpha (right). (C) Topographies showing the trial‐mean differences in theta (above) and alpha (below) power between the 2‐back and 1‐back tasks (left) and between high and low noise levels (right). Circles indicate the position of electrodes AFz and Oz. White asterisks indicate electrodes showing significant power differences between the n‐back conditions revealed by the cluster analysis (P < 0.01). Error bars represent ± 1 SEM **P < 0.01, ***P < 0.001.
Figure 4Electroencephalogram (EEG) responses to speech envelopes in the different working memory (WM) load conditions. Above (A–C): Temporal response functions (TRFs) derived from linear regression between EEG data and the speech stimulus. Below (D, E): Speech entrainment measured as the correlation between the cortical response predicted by the speech envelope and the EEG. (A) TRFs at selected electrode locations to illustrate the responses at different scalp positions. (B) TRFs averaged over frontocentral electrodes in the different experimental WM conditions. (C) The amplitude (above) and latency (below) of the late positive peak in the average TRF around 170 ms. (D) Topographical distribution of the EEG prediction accuracies (Pearson's r) averaged across conditions. The dots indicate the positions of the analysed frontocentral electrodes. (E) Average prediction accuracies in different frequency bands. The shaded areas represent chance‐level prediction. Error bars represent ± 1 SEM *P < 0.05, **P < 0.01, ***P < 0.001.