| Literature DB >> 35468160 |
Xin Zhou1, Gabriel S Sobczak2, Colette M McKay3,4, Ruth Y Litovsky1,5,6.
Abstract
The present study aimed to investigate the effects of degraded speech perception and binaural unmasking using functional near-infrared spectroscopy (fNIRS). Normal hearing listeners were tested when attending to unprocessed or vocoded speech, presented to the left ear at two speech-to-noise ratios (SNRs). Additionally, by comparing monaural versus diotic masker noise, we measured binaural unmasking. Our primary research question was whether the prefrontal cortex and temporal cortex responded differently to varying listening configurations. Our a priori regions of interest (ROIs) were located at the left dorsolateral prefrontal cortex (DLPFC) and auditory cortex (AC). The left DLPFC has been reported to be involved in attentional processes when listening to degraded speech and in spatial hearing processing, while the AC has been reported to be sensitive to speech intelligibility. Comparisons of cortical activity between these two ROIs revealed significantly different fNIRS response patterns. Further, we showed a significant and positive correlation between self-reported task difficulty levels and fNIRS responses in the DLPFC, with a negative but non-significant correlation for the left AC, suggesting that the two ROIs played different roles in effortful speech perception. Our secondary question was whether activity within three sub-regions of the lateral PFC (LPFC) including the DLPFC was differentially affected by varying speech-noise configurations. We found significant effects of spectral degradation and SNR, and significant differences in fNIRS response amplitudes between the three regions, but no significant interaction between ROI and speech type, or between ROI and SNR. When attending to speech with monaural and diotic noises, participants reported the latter conditions being easier; however, no significant main effect of masker condition on cortical activity was observed. For cortical responses in the LPFC, a significant interaction between SNR and masker condition was observed. These findings suggest that binaural unmasking affects cortical activity through improving speech reception threshold in noise, rather than by reducing effort exerted.Entities:
Mesh:
Year: 2022 PMID: 35468160 PMCID: PMC9037936 DOI: 10.1371/journal.pone.0267588
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1fNIRS montage and a priori regions of interest (ROIs).
The fNIRS montage was symmetric between hemispheres; panel (A) plots the connection of light sources (red, n = 8) and detectors (blue, n = 8), and channels (yellow lines) on the left hemisphere. The green dots denote detectors that provide 8-mm channels, with 4 on each side. Panel (B) shows channels comprising the three subregions within the left lateral prefrontal cortex (LPFC) and the left auditory cortex (AC). The three regions were the dorsolateral prefrontal cortex (DLPFC), and two adjacent regions of interest, i.e., f-ROI2 and f-ROI3. The colors are the sensitivity profiles, in log10 mm-1 units, generated from AtlasViewer [51].
Fig 2Listening conditions and diagram of fNIRS data collection.
In panel (A), the white loudspeakers are for unprocessed (U) and vocoded (V) speech; the black loudspeakers are for noise (N), monaurally (NmSm) or diotically (NoSm) presented. Panel (B) shows the pseudorandom block design used for data collection, with stimuli in 4 listening conditions (blue boxes) being presented in random order in each session.
Fig 3Diagram of fNIRS signal processing.
Fig 4Subjects’ self-reported difficulty levels and speech intelligibility scores.
Unprocessed (U, blue) or degraded (V, red) speech stimuli were always presented to the left ear alone. Noise stimuli were presented in ipsilateral (NmSm) or bilateral (NoSm, squares) conditions. In panel (A), violin plots show self-reported difficulty levels in individuals under varying listening conditions at -10dB SNR (left) and -15 dB SNR (right) during fNIRS recording. Panel (B) shows the self-reported difficulty levels in a separate group of participants in a behavioral session with no fNIRS data being recorded. Panel (C) shows the correlation between the self-reported difficulty levels (vertical) and speech intelligibility in rationalized arcsine units (RAU, horizontal) in individuals in the behavioral session.
Summary of results from ART tests for behavioral measures.
The behavioral measures are task difficulty level (TDL) recorded during fNIRS session and in the behavioral session with no fNIRS, and speech intelligibility scores (SIC) recorded in the behavioral session.
| SNR | Speech | Masker | SNR * speech | SNR * masker | Speech * masker | SNR* speech * masker | |
|---|---|---|---|---|---|---|---|
|
|
|
|
| F(1,154) = 0.78 | F(1,154) = 0.11 |
| F(1,154) = 0.75 |
|
|
|
|
|
| F(1,154) = 0.36 |
|
|
|
|
|
|
|
|
|
|
|
Fig 5ΔHbC amplitudes in the frontal and temporal cortex.
Group mean ΔHbC amplitudes for the frontal (orange) and temporal (green) cortex on the left and right hemispheres in response to unprocessed (dots) and vocoded speech (triangles) at -10 dB (solid lines) and -15 dB SNR (dash lines) are plotted.
Fig 6ΔHbC amplitudes in the left DLPFC and AC.
Panels A and C show the group mean (bars) and SEM (error bars) of ΔHbC amplitudes in the left DLPFC and AC, respectively, in the unprocessed (blue dots) and vocoded (red triangles) speech conditions with monaural (NmSm) and diotic noise (NoSm) at -10 dB (solid lines) and -15 dB SNR (dash lines). Panels (B) and (D) show the repeated measures correlations (rmcorr) between ΔHbC amplitudes in the left DLPFC and AC, respectively, and self-reported task difficulty level. In each panel, the gray dash lines connecting circles represent measures in individuals in different conditions; the red lines indicate the regression result from the rmcorr method.
Summary of results from ART tests for fNIRS measures.
The left and right sides summarize the results related to our primary and secondary research questions, relatively. We investigated the effect of speech type by comparing unprocessed (U) versus vocoded (V) speech, and the effect of masker condition by comparing diotic (NoSm) and monaural (NmSm) conditions, the effect of SNRs (-10 and -15 dB SNR), and the interactions between them on different cortical regions.
| Frontal versus temporal regions | DLPFC, f-ROI2, and f-ROI3 | ||||
|---|---|---|---|---|---|
| Factor(s) | ART results | Post hoc | Factor(s) | ART results | Post hoc |
| Region |
|
| Hemisphere |
|
|
| Hemisphere |
|
| ROI |
|
|
| Hemisphere*region | F(1, 710) = 1.11 | Hemisphere*ROI | F(1, 1034) = 2.75 |
| |
|
| SNR |
|
| ||
| Factor(s) | ART results | Post hoc | Speech |
|
|
| ROI |
|
| SNR*speech | F(1, 1034) = 3.35 | -15 dB SNR > -10dB SNR |
| ROI*SNR |
|
| Masker*SNR |
|
|
| ROI*speech |
|
| Masker | F(1, 1034) = 0.13 | |
| Masker | F(1,330) = .12 | Masker*hemisphere | F(1, 1034) = 2.11 | ||
| SNR | F(1,330) = .15 | Masker*ROI | F(1, 1034) = 0.039 | ||
| Speech | F(1,330) = .20 | Masker*speech | F(1, 1034) = 0.79 | ||
| Masker*ROI | F(1,330) = .019 | Hemisphere*SNR | F(1, 1034) = 0.062 | ||
| Masker*SNR | F(1,330) = .003 | Hemisphere | F(1, 1034) = 0.077 | ||
| Masker*speech | F(1,330) = 1.56 | ROI*SNR | F(1, 1034) = 0.063 | ||
| SNR*speech | F(1,330) = 1.45 | ROI*speech | F(1, 1034) = 0.069 | ||
Fig 7ΔHbC amplitudes in the three subregions within the LPFC on two hemispheres.