| Literature DB >> 33171753 |
Laura Bell1, Z Ellen Peng2,3, Florian Pausch2, Vanessa Reindl1,4, Christiane Neuschaefer-Rube5, Janina Fels2, Kerstin Konrad1,4.
Abstract
The integration of virtual acoustic environments (VAEs) with functional near-infrared spectroscopy (fNIRS) offers novel avenues to investigate behavioral and neural processes of speech-in-noise (SIN) comprehension in complex auditory scenes. Particularly in children with hearing aids (HAs), the combined application might offer new insights into the neural mechanism of SIN perception in simulated real-life acoustic scenarios. Here, we present first pilot data from six children with normal hearing (NH) and three children with bilateral HAs to explore the potential applicability of this novel approach. Children with NH received a speech recognition benefit from low room reverberation and target-distractors' spatial separation, particularly when the pitch of the target and the distractors was similar. On the neural level, the left inferior frontal gyrus appeared to support SIN comprehension during effortful listening. Children with HAs showed decreased SIN perception across conditions. The VAE-fNIRS approach is critically compared to traditional SIN assessments. Although the current study shows that feasibility still needs to be improved, the combined application potentially offers a promising tool to investigate novel research questions in simulated real-life listening. Future modified VAE-fNIRS applications are warranted to replicate the current findings and to validate its application in research and clinical settings.Entities:
Keywords: functional near-infrared spectroscopy; multimethod approach; speech perception; virtual acoustic environments; virtual acoustics
Year: 2020 PMID: 33171753 PMCID: PMC7695031 DOI: 10.3390/children7110219
Source DB: PubMed Journal: Children (Basel) ISSN: 2227-9067
Examples of SIN tests and investigations.
| Outcome Measure | Population of Interest | Behavioral/Neuroimaging Method | Test/Example Studies | Overview | |
|---|---|---|---|---|---|
|
|
| ||||
| SIN recognition assessments with varying SNR and/or noise location | Adults with NH/HL | Behavioral | Hearing In Noise Test—HINT [ | Headphone-based; recordings of 250 sentences by a male speaker that are intended to be utilized in adaptive SRT measurements in quiet or spectrally matched noise | |
| Adults with NH/HL | Oldenburger Satztest (Oldenburger sentence test)—OlSa [ | Headphone-based; recordings of sentences that consist of a random combination of 50 words that are used to measure the SRT in quiet and in noise | |||
| Adults with NH/HL | Words-In-Noise test—WiN [ | Earphone-based; recordings of 70 words embedded in unique segments of multi-talker distractor noise that are intended to be utilized in adaptive SRT measurements | |||
| Adults with NH/HL | Döring test [ | Loudspeaker-based; recordings are single syllables of the “Freiburger Sprachverständnistest” (Freiburger speech comprehension test) which are repeated three times in background noise (words of the “Freiburger Sprachverständnistest”); | |||
| Children with NH/HL | Listening in Spatialized Noise-Sentences test—LiSN-S [ | Headphone-based; recordings of 120 sentences by a female speaker that are intended to be utilized in adaptive SRT measurements in background speech by two masking talkers (two female speakers that record two distractor stories) in four different conditions: maskers are either spatially co-located with target or at ±90° azimuth and either share the same pitch or different pitch than the target | |||
| Children with NH/HL | “Oldenburger Kinder-Satztest” (Oldenburger sentence test for children)—OlKiSa [ | Headphone-based; simplified version of the Oldenburger sentence test (OlSA); recordings of sentences that consist of a random combination of 21 words that are used to measure the SRT in quiet and in noise | |||
| Children with NH/HL | Children’s Coordinate Response Measure—CCRM [ | Headphone-based; recordings of sentences that are to be utilized in adaptive SRT measurements in either 20-talker babble or speech-shaped noise | |||
|
|
| ||||
| Sound localization | Adults with NH | Behavioral | Bronkhorst [ | Investigations of auditory sound localization, distance perception, and attention switching using ear/headphones, research HAs, or loudspeaker-based reproduction of auditory stimuli with or without manipulation of acoustic variables including but not limited to reverberation, interaural level differences, and sound intensity | |
| Adults with HL | Best et al. [ | ||||
| Children with HL | Johnstone et al. [ | ||||
| Auditory distance perception | Blind and sighted adults with NH | Kolarik et al. [ | |||
| Adults with NH/HL | Courtois et al. [ | ||||
| Auditory attention switching | Adults with NH | Oberem et al. [ | |||
| Auditory simulations of SIN tasks in simulated indoor environments | Adults with NH | Behavioral | MacCutcheon et al. [ | Investigations of speech or word (in noise) recognition, listening effort, and the influence of variables such as language skills, working memory, or stimulus presentation, i.e., auditory-only or in combination with visual stimuli, and room acoustics such as reverberation times simulated VAEs | |
| Children with HAs/NH | McCreery, Walker, Spratford, Lewis, and Brennan [ | ||||
| Children with NH | Rudner, Lyberg-Ahlander, Brannstrom, Nirme, Pichora-Fuller, and Sahlen [ | ||||
| Adults with NH/HAs | Ricketts et al. [ | ||||
|
|
| ||||
| Speech/SIN recognition, the effect of cochlear implantation, age-related or early onset HL, and the underlying neural mechanisms that are identified by invasive (i.e., ECoG) or noninvasive (i.e., EEF, fNIRS, fMRI) neuroimaging | Adults with NH | fNIRS | Defenderfer et al. [ | Investigations of (selective attention to) speech or word in quiet or noise recognition and their underlying neural mechanisms, by means of spatial and/or temporal neural analyses, using head/earphone- or free-field loudspeaker-based auditory reproduction while manipulating auditory (and visual) stimulation or using real-life hyperscanning (i.e., measuring two or more participants at the same time) paradigms | |
| Adults with NH/CI | fNIRS | Olds et al. [ | |||
| Children with NH | fNIRS | Mushtaq et al. [ | |||
| Children with CI | fNIRS | Mushtaq et al. [ | |||
| Adults with age-related HL | EEG | Puschmann et al. [ | |||
| Infants | EEG-fNIRS | Telkemeyer et al. [ | |||
| Adults with NH | fNIRS hyperscanning | Dai et al. [ | |||
| Adults with medically intractable epilepsy | ECoG | Zion Golumbic et al. [ | |||
| Adults with NH | EEG-fMRI | Puschmann et al. [ | |||
| Adults with age-related HL | fMRI | Wong et al. [ | |||
Abbreviations: SIN—speech-in-noise; SRT—speech reception threshold; VAE(s)—virtual acoustic environment(s); SNR—signal-to-noise ratio; NH—normal hearing; HL—hearing loss; CI—cochlear implant; HA—hearing aid; fNIRS—functional near-infrared spectroscopy; EEG—electroencephalography; ERP—event-related potential; ECoG—intracranial electrocorticograph; fMRI—functional magnetic resonance imaging.
Figure 1Air-conduction pure tone audiograms. The air-conduction pure tone audiograms for (left) and (right) ear of each child in the hearing aid (HA) group are depicted.
Figure 2Illustration of the experimental setting. Panorama view of the inside of the sound insulated booth (A), left behind-the-ear receiver-in-canal device used as research hearing aids (HAs) (B), and schematic illustration of the setup (C). The participant was seated centrally within the sound insulated booth. For a subset of participants in the HA group, research HAs were led into the sound insulated booth. The fNIRS fibers and optodes were directed into the booth to be placed on the participant’s head within a cap. On top of the cap, the rigid body base, holding reflective markers, was mounted. The cameras, tracking the motion, were positioned in each corner above the loudspeakers. The computer and ETG4000 (Hitachi Medical Corporation, Tokyo, Japan) were placed outside the booth to minimize equipment noise.
Figure 3The fNIRS setup. (A) The fNIRS configuration is illustrated on a head model and the underlying brain areas are marked as areas surrounded by dashed lines. The round, unnumbered dots represent sending and receiving fNIRS optodes. Numbered circles in between unnumbered dots represent a measurement channel. (B) fNIRS preprocessing pipeline. Abbreviations: P—Probe set; LH—left hemisphere; RH—right hemisphere; DLPFC—dorsolateral prefrontal cortex; IFG—inferior frontal gyrus; pre/SMA—premotor and supplementary motor area; STG—superior temporal gyrus; MTG—medial temporal gyrus; primary somatos.—primary somatosensory area; primary auditory assoc.—primary auditory association area; supram. gyrus—supramarginal gyrus; CoV—coefficient of variation; OD—optical density; lpf—low pass filter; DCT—discrete cosine transformation; hpf—high pass filter; dlpf—differential path length factor.
Figure 4Illustration of the eight testing conditions and the procedure. (A) The 2 pitch cues × 2 spatial cues × 2 reverberation times design. Illustration adapted with permission from Peng, Pausch, and Fels [105]. (B) Example of all eight testing conditions with the last 50 s being of relevance for the neural analyses. (C) Excerpts of the testing procedure showing a 200 ms long beep and subsequent 500 ms silence that preceded each target sentence (see the Supporting Material for exemplary audio files and task instructions) as well as the verbal response. In between conditions, a playback introduced the next condition and a manually initiated (asterisk) break with a total silence duration of at least 30.4 s was presented before each condition. Abbreviations: SRT—speech reception threshold; T—target voice; D—distractor voices; Ssame—target and distractor at the same spatial position in front of participant; Sdiff—target at front and distractors at ± 90°; Psame—same pitch of target and distractor voices (both “D” and “T” in black color); Pdiff—different pitch of target and distractor voices (“T” in black and “D” in red); RTlow—low reverberation time; RThigh—high reverberation time. Footnotes: * The door is freshly painted. † The door is painted. ‡ The plane flies very quietly. § The plane flies. ¶ Last night was a thunderstorm.
Figure 5Effects of reverberation time, speech, and pitch on the speech reception threshold (SRT) and total hemoglobin concentration changes (ΔHbT) in the left inferior frontal gyrus (IFG). (A) SRT plotted as a function of pitch cues as group mean for children with normal hearing (NH). (B) Individual SRT data for the three children in the hearing aid (HA) group. (C) Effect of reverberation time and pitch on ΔHbT in the left IFG of the NH group. (D) Individual ΔHbT for the three children in the HA group. Error bars (A and C) represent the standard error. Line color indicates reverberation time (high in gray, low in black). Line style indicates spatial cue availability (same spatial position (straight line) or different position (dotted line)).
Comparison of SRTs from individual children with bilateral HAs with the control group of children with NH.
| Condition | Statistics | HA 1 | HA 2 | HA 3 |
|---|---|---|---|---|
|
| 1.93 | 1.70 | 0.62 | |
| two-tailed probability | 0.11 | 0.15 | 0.57 | |
| Effect size and 95% CI | 2.08 (0.58–3.55) | 1.84 (0.45–3.17) | 0.66 (−0.26–1.53) | |
| Estimated percentage of HA case falling above NH group (CI) | 94.42 (71.79–99.98) | 92.51 (67.30–99.92) | 71.72 (39.90–93.74) | |
|
| 2.73 | 1.24 | 0.83 | |
| two-tailed probability | 0.04 * | 0.27 | 0.45 | |
| Effect size and 95% CI | 2.95 (1.00–4.87) | 1.34 (0.17–2.45) | 0.89 (−0.10–1.83) | |
| Estimated percentage of HA case falling above NH group (CI) | 97.93 (84.02–100) | 86.50 (56.92–99.28) | 77.73 (46.05–96.64) | |
|
| 2.14 | 1.97 | 1.94 | |
| two-tailed probability | 0.09 | 0.11 | 0.11 | |
| Effect size and 95% CI | 2.31 (0.69–3.89) | 2.12 (0.60–3.61) | 2.09 (0.58–3.56) | |
| Estimated percentage of HA case falling above NH group (CI) | 95.74 (75.53–100) | 94.68 (72.47–99.98) | 94.48 (71.95–99.98) | |
|
| 3.12 | 2.26 | 0.85 | |
| two-tailed probability | 0.03 * | 0.07 | 0.43 | |
| Effect size and 95% CI | 3.37 (1.19–5.54) | 2.44 (0.76–4.10) | 0.92 (−0.08–1.86) | |
| Estimated percentage of HA case falling above NH group (CI) | 98.69 (88.35–100) | 96.35 (77.52–100) | 78.30 (46.67–96.87) | |
|
| 4.72 | 5.74 | 1.12 | |
| two-tailed probability | 0.005 ** | 0.002 ** | 0.31 | |
| Effect size and 95% CI | 5.10 (1.96–8.26) | 6.20 (2.42–10.00) | 1.21 (0.10–2.26) | |
| Estimated percentage of HA case falling above NH group (CI) | 99.74 (97.47–100) | 99.89 (99.23–100) | 84.32 (53.90–98.81) | |
|
| 5.20 | 4.73 | 1.66 | |
| two-tailed probability | 0.003 ** | 0.005 ** | 0.16 | |
| Effect size and 95% CI | 5.61 (2.18–9.07) | 5.11 (1.96–8.28) | 1.79 (0.42–3.11) | |
| Estimated percentage of HA case falling above NH group (CI) | 99.83 (98.52–100) | 99.74 (97.50–100) | 92.09 (66.42–99.90) | |
|
| 2.63 | 4.28 | 2.81 | |
| two-tailed probability | 0.047 * | 0.008 ** | 0.04 * | |
| Effect size and 95% CI | 2.84 (0.94–4.70) | 4.62 (1.75–7.50) | 3.03 (1.04–5.00) | |
| Estimated percentage of HA case falling above NH group (CI) | 97.66 (82.73–100) | 99.61% (95.97–100) | 98.11 (84.98–100) | |
|
| 1.19 | 1.33 | 0.89 | |
| two-tailed probability | 0.29 | 0.24 | 0.41 | |
| Effect size and 95% CI | 1.28 (0.14–2.36) | 1.44 (0.23–2.59) | 0.96 (−0.06–1.92) | |
| Estimated percentage of HA case falling above NH group (CI) | 85.58 (55.61–99.10) | 87.98 (59.13–99.52) | 79.30 (47.79–97.26) |
Modified t-test statistics performed based on methods described by Crawford and Howell [119] are listed for each child in the HA group, by comparing the individual’s speech reception threshold to the NH group in each test condition. Effect sizes with 95% confidence interval are shown between the individual case and control group. Abbreviations: HAs—hearing aids; NH—normal hearing; RTlow—low reverberation time; RThigh—high reverberation time; Ssame—same spatial position of target and distractor speakers; Sdiff—different position of target and distractor speakers; Psame—same pitch of target and distractor voices; Pdiff different pitch of target and distractor voices; CI—confidence interval; M—mean; SD—standard deviation; * p < 0.05; ** p < 0.01.
Challenges and recommendation for future combined fNIRS and VAEs applications.
| Aspect | Challenge(s) | Explanation | Considerations/Recommendations to Address Challenges |
|---|---|---|---|
|
| Long task duration and long-lasting fNIRS cap wearing | Lengthy and strenuous paradigm for younger children (~30 min; especially long duration if the speech recognition in background noise is good due to staircase procedure) | Administer the task in |
|
| Repetition of task conditions | For fNIRS measurements, ideally, repetitions of testing conditions within each subject are warranted; currently, each condition is only presented once | |
| Disentangling behavioral performance and manual presentation of target sentences | Currently, listening and speaking are both included in the mean concentration changes of HbT; | ||
| Perceived vs. (physical) loudness intensity | The amplitude of the fNIRS signal might be affected by sound intensity. | Loudness deviations when investigating SIN comprehension typically do not exceed 10 dB SPL. Activation differences thus hardly reflect overall sound intensity differences. Nevertheless, individual loudness perception (rather than physical intensity) appears to be related to brain activation [ | |
| Noise removal: | Head movements are warranted during VAE simulations; however, an excessive amount might distort the NIRS signal. | For datasets that are acquired from challenging samples, few trials, lengthy paradigms, and when head movements are an important aspect of the task, | |
| Speech-induced motion artifacts | Chin clasps of the cap might transfer speech-induced motion of the jar | Usage of (EEG) caps to ensure | |
| Localization/ROIs and lateralization | Variability in head size and shape might affect the formation of ROIs and a differential lateralization of speech-related activity might add additional variation. | The use of | |
|
| Varying degrees of HL, HA devices, and frequency of HA use | Due to time constraints and elaborative purpose of the study design, an audiometry was performed only for the HA group that served as input for the research HAs | Future studies, assessing larger populations, should aim at |
| Other factors affecting speech comprehension | Due to the small sample size, the current pilot investigation could not control for variability in hearing abilities. Auditory, linguistic as well as other cognitive mechanisms were suggested to affect speech understanding (e.g., see the ease of language understanding model Rönnberg, Lunner, Zekveld, Sörqvist, Danielsson, Lyxell, Dahlström, Signoret, Stenfelt, and Pichora-Fuller [ | Next to audiometry, |
Abbreviations: fNIRS—functional near-infrared spectroscopy; VAE(s)—virtual acoustic environment(s); HbT—total hemoglobin; CHs—channels; EEG—electroencephalography; ROI(s)—region of interest(s); HA—hearing aid; HL—hearing loss.