| Literature DB >> 28091178 |
Jean-Pierre Gagné1, Jana Besser2, Ulrike Lemke2.
Abstract
Published investigations ( n = 29) in which a dual-task experimental paradigm was employed to measure listening effort during speech understanding in younger and older adults were reviewed. A summary of the main findings reported in the articles is provided with respect to the participants' age-group and hearing status. Effects of different signal characteristics, such as the test modality, on dual-task outcomes are evaluated, and associations with cognitive abilities and self-report measures of listening effort are described. Then, several procedural issues associated with the use of dual-task experiment paradigms are discussed. Finally, some issues that warrant future research are addressed. The review revealed large variability in the dual-task experimental paradigms that have been used to measure the listening effort expended during speech understanding. The differences in experimental procedures used across studies make it difficult to draw firm conclusions concerning the optimal choice of dual-task paradigm or the sensitivity of specific paradigms to different types of experimental manipulations. In general, the analysis confirmed that dual-task paradigms have been used successfully to measure differences in effort under different experimental conditions, in both younger and older adults. Several research questions that warrant further investigation in order to better understand and characterize the intricacies of dual-task paradigms were identified.Entities:
Keywords: cognitive resources; dual-task paradigm; listening effort; speech recognition; speech understanding
Mesh:
Year: 2017 PMID: 28091178 PMCID: PMC5308443 DOI: 10.1177/2331216516687287
Source DB: PubMed Journal: Trends Hear ISSN: 2331-2165 Impact factor: 3.293
Figure 1.Illustration of the classic method used to measure listening effort.
Summary of Published Articles in Which a Dual-Task Experimental Procedure Was Used to Measure the Listening Effort Expended to Perform a Speech-Understanding Task.
| (Authors) Research question | Participants | Experimental tasks used | Other relevant measures or information | Findings for dual task | Other relevant findings |
|---|---|---|---|---|---|
| ( | Experiment 1: YNH; | P: closed-set keyword recognition at SNR for 80% correct; Exp1: auditory-only, Exp2: AV; outcomes: pDTC for %-correct and RTs S: closed-set tactile pattern recognition; outcomes: pDTC for %-correct and RTs | Subjective scale (0%–100%) for amount of effort | Exp1 P: pDTC equal in YNH and ONH for %-correct and RTs S: ONH higher pDTC (greater effort) than YNH for %-correct; pDTCs equal in YNH and ONH for RTs Exp2 P: pDTC equal in YNH and ONH for %-correct and RTs S: pDTCs equal in YNH and ONH for %-correct; ONH higher pDTC (greater effort) than YNH for RTs | pDTCs generally higher for secondary (tactile) task than for primary task; pDTCs generally higher for AV conditions compared with auditory-only conditions |
| ( | YNH; | P: closed-set keyword recognition; cond1: equated level (−12 dB SNR), cond2: equated performance (SNR for 80% correct); outcomes: pDTC for %-correct and RTs S: closed-set tactile pattern recognition; outcomes: pDTC for %-correct and RTs | Subjective scale (0%–100%) for amount of effort | Cond1 P: ONH lower %-correct and longer RTs than YNH; pDTC: equal in YNH and ONH for %-correct and RTs S: ONH lower %-correct and longer RTs than YNH; pDTC: ONH higher pDTC (greater effort) for %-correct and RTs Cond2 P: ONH and YNH equal %-correct; ONH longer RTs than YNH; pDTC: equal in YNH and ONH for %-correct and RTs S: ONH lower %-correct and longer RTs than YNH; pDTC: ONH higher pDTC (greater effort) for %-correct; equal in YNH and ONH for RTs | No correlations between subjective ratings of effort and dual-task measures. |
| ( | OHI; | P: 4-word sentences (auditory); closed-set word recognition, choose from visually displayed alternatives after S-task reaction; SNRs: 0, 3, 6, 9, 12 dB in each condition; 3 conds: control, spectrally enhanced (ENH), spectral enhancement with compression (ENHC) S: judge whether P-task sentence silly or sensible; outcome: absolute RTs for correctly judged, correctly identified sentences | None | P: %-correct identification higher for ENH and ENHC than for control; %-correct identification equal for ENH and ENHC S: RTs shorter for ENH and ENHC than for control; RTs shorter for ENHC than for ENH; RTs shorter for higher SNRs, however, also P-task performance increased with increasing SNR, thus effect of intelligibility and effort cannot be separated. | Benefit from ENH processing twice as large for RTs as for %-correct identification |
| ( | OHI; | P-Harvard or IEEE sentences in 4-talker female babble S: visual motor tracking; outcome: %-time on moving target; effort: change in outcome from baseline to DT | Short Portable Mental Health Status Questionnaire(Pfeiffer, 1975). The digit symbol substitution test (DSST; Wechsler Adult Intelligence Scale-III; Wechsler, 1997) The letter number sequencing test (LNS) was used to measure participants’ working memory performance. self-reported estimates of listening effort expended on the speech-in-noise task were measured us-ingaseven-categoryscaling procedure. | Directional microphone reduced listening effort there was a trend for participants to expend less listening effort with the NR algorithm activated in the hearing aids (z4%—but not significant Participants’ listening effort was significantly reduced (by ≈5%) with the combined use of the directional microphones and the NR algorithm (NRDM) than with noise processing activated in the hearing aids (NoP). This is mostly accounted for by the DM effect | No correlation between rating scale and dual task cost However, participants did not perceive a difference in listening effort in background noise with and without the respective noise processing schemes (i.e., NR, DM, and NRDM) activated in the hearing aids. We did not find any statistically significant relationships between listening effort with NR, DM, or NRDM and cognitive function (processing speed and working memory). Experiences hearing aid users had worn hearing aids with directional microphones and NR algorithms, bilaterally, for at least 6 mo fitted bilaterally with Starkey HA. |
| ( | YNH; | P: R-SPIN at SNR for 76% correct; maskers: 6-talker babble, 2-talker babble (TTB), SSN; outcome: %-correct target words S: visual motor tracking; outcome: %-time on moving target; effort: change in outcome from baseline to DT | Subjective scale (0–100) for “ease of listening”; Rspan for WM function; Digit-Symbol Substitution Test (DSST) for perceptual speed of processing; Visual STROOP test for selective attention | P: constant performance across conditions and groups; performance better for high-context SPIN sentences in all groups S: ONH and OHI higher effort than YNH in TTB and SSN maskers; equal effort for low-and high-context R-SPIN sentences in all groups; ONH and OHI higher effort in SSN and TTB than in six-talker babble; YNH higher effort in six-talker babble than in SSN, effort equal for SSN and TTB | SSN: effort–Rspan ( |
| ( | OHI; | P: R-SPIN at SNR for 76% and 50% correct; with or without NR; masker: 2-talker babble (TTB); outcome: %-correct target words S: visual motor tracking; outcome: %-time on moving target; effort: change in outcome from baseline to DT | Subjective scale (0–100) for “ease of listening”; Rspan for WM function; Digit-Symbol Substitution Test (DSST) for perceptual speed of processing | P: equal performance with or without NR; word recognition higher for high-context R-SPIN sentences S: without NR: higher effort in 50%-correct than in 76%-correct condition; with NR: no difference in effort between conditions; at −76% correct: lower effort with NR; at 50%-correct: equal effort with/without NR; equal effort for low- and high-context R-SPIN sentences | Effort at 50%-correct with NR–DSST ( |
| ( | HI; N = 23; 29–68 (51.0) years, experienced HA users, bilaterally fitted with study devices | P: Monosyllabic words; masker: 8-talker babble; targets: 45 dB HL; SNR: 0 dB; outcome: %-correct S: RTs to visual probe (flashing light); effort: change in outcome from baseline to DT | None | P: better word recognition with HA than without (p < .001) S: lower effort with HA than without (p < .05), 16% of variance in decreased RT due to HA use; however, note that word recognition with versus without HA was not at the same performance level | None |
| ( | YNH; N = 48, 18–24 (19) years | P: R-SPIN sentences; −5 dB SNR; dichotic: target at 65° to left or right, masker at 65° to other side; conditions: binaural (BIN), monaural[ | Perceived ease of listening ratings; rating scale from 0 (very, very difficult) to 100 (very, very easy); modified direct magnitude estimation (MME) | P: performance best for BIN, poorest for MF (p < .01); better performance for high-context than for low-context sentences; interaction of context- and listening conditions S: attentional effort higher in MF than in BIN and MN; attentional effort in BIN and MN equal (all p < .01); | BIN rated as easiest, MF rated as most difficult (p < .01); Rated ease–attentional effort: r = −.01, controlling for recognition performance; attentional effort – word recognition: |
| ( | Experiment 1 NH; N = 30, 18–41 (25.0) years Experiment 2 NH; N = 30, 18–45 (25.0) years | P: closed-set keyword recognition; Exp1: equated level (−11 dB SNR) for A-only (80% correct) and AV (96% correct); Exp2:equated performance (80%) at −11 dB SNR for A-only and −19 dB SNR for AV; outcomes: %-correct and RTs S: closed-set tactile pattern recognition; outcomes: %-correct and RTs | Subjective scale (0%–100%) for amount of effort | Exp1 P: S: decrease in %-correct and increase in RTs by dual-task equal for A-only and AV Exp2 P: decrease in %-correct by dual task for AV but not for A-only; increase in RTs by dual-task equal for A-only and AV S: decrease in %-correct and increase in RTs by dual task bigger in AV than in A-only | Exp1 P: overall, %-correct higher in AV than in A-only; RTs equal for AV and A-only S: overall, %-correct and RTs equal for AV and A-only Exp2 P: overall, %-correct equal for AV and A-only; RTs longer for AV than for A-only S: overall, %-correct lower for AV than for A-only; RTs longer for AV than for A-only |
| ( | YNH; N = 10, 20–38 (23) years ONH (some high-frequency mild losses); N = 10, 60–69 (63) years | Setup: TVM sentences; free field; 3 talkers–1 target, 2 distractors; conds: all talkers at front (FF) or target at front, distractors at 60° right front (F-RF) P1: indicate number (0–2) of distractor sentences played backwards; outcome: %-correct P2: repeat sentence of designated target talker P1 and P2 performed separately (single-task) or congruently (dual- task) | Participants instructed to divide their attention equally between both tasks | P1: YNH performed better than ONH; F-RF performance better than FF; single-task better than dual-task; P2: YNH and ONH performed equally; F-RF performance better than FF, but only for single-task; single-task better than dual-task; basically no spatial-separation benefit for ONH in dual-task Trend for higher dual-task cost in YNH than in ONH and for F-RF compared with FF condition; Spatial-separation benefit reduced by dual- task; trend for ONH to have greater spatial- separation benefit than YNH in P1 but smaller in P2 | Age associated with dual-task cost ( |
| ( | HI; N = 16, 47–69 (65) years; fit with Phonak Micro Exelia BTE HAs; | P: word recognition; 8–12 words per list; target speech at 0°, cafeteria babble at 60°, 120°, 180°, 240°, 300°; 3 conds: without HA, with basic HA setting, with advanced HA setting[ | Subjective rating before and after testing; 5 items for concentration, listening effort, distractability, ability to maintain focus, current state of mental draining | P: performance better in aided than unaided conditions; no difference between aided conditions; S1: dual-task cost in RTs was higher in unaided and basic HA conds than with advanced HA settings S2: word recall better with than without HAs; no difference between HA settings | P: performance increased over the course of testing S1: dual-task cost in RTs increased over the course of testing in the unaided but not in the aided conditions S2: word recall remained stable over testing blocks Large variance in S1 effects across participants Increase in subjective fatigue and difficulty to maintain attention post- compared to pre- dual- task testing |
| ( | OHI H + C + ; N = 10, 60–79 (72.1) years OHI H-C+; N = 10, 70–83 (74.7) years OHI H + C-; N = 10, 68–81 (75.8) years OHI H-C-; N = 10, 65 – 81 (75.0) years | P: OLSA matrix test sentence recognition at −4, 0, 4 dB SNR; each SNR with inactive or moderate or strong NR S: visual response time (VRT) task; visually displayed digits; respond differently to even or odd digits; outcome: absolute RTs | Reading span, used for group assignment (cognitive function), not as an outcome measure; Paired-comparisons of preference ratings for NR settings | P: worse performance (p < .0001) with strong NR than with inactive NR; performance equal for moderate NR and inactive NR; performance increased with increasing SNR S: VRTs decreased with increasing SNR, differences between all SNRs (p < .001); VRTs longer for strong NR than for inactive NR (p < .001); VRTs equal for inactive NR and moderate NR | P: H + C+ group performed better than H-C+ (p < .05) and H-C- (p < .001) groups; thus, differences driven by HL rather than cognition; Some NR preferred over no NR for all SNRs and all groups, strength of preferred NR dependent on SNR and group |
| ( | OHI H + C+; N = 10, 60–80 (73.0) years OHI H-C+; N = 10, 71–84 (75.0) years OHI H + C-; N = 10, 69–82 (76.6) years OHI H-C-; N = 10, 66–82 (75.5) years | P: OLSA matrix test sentence recognition at −4 and 0 dB SNR; each SNR in 5 NR conditions with processing in signal and noise S: visual response time (VRT) task; visually displayed digits; respond differently to even or odd digits; outcome: RTs relative to median of several baseline conditions | Subjective effort ratings for sentences at −4, 0, 4 dB; 9-point rating scale ranging from “completely effortless” to “maximally effortful”; Reading span, used for group assignment (cognitive function), not as an outcome measure; Paired-comparisons preference ratings for NR settings | P: performance different for different NR settings, best for NR processing on noise only (no processing on signal); performance better (p < .0001) at 0 dB than at −4 dB SNR; S: VRTs shorter (p < .05) at 0 dB than at -4 dB SNR; VRTs longer (p < .01) with NR processing in signal | P: performance of H + C+ group better than of H−C+ (p < 0.01) and H−C− (p < 0.001) groups. S: VRTs equal for listener groups Subjective effort decreased with increasing SNR; effort ratings different for different NR settings, lowest for NR processing in signal |
| ( | OHI; N = 26, 56–65 (62.4) years; experienced HA users | P: repeat sentence-final word (for half of the sentence lists only); conds: 4 - talker babble (language same as target speech or Cantonese), with or without binary masking NR; at individual SNR for 95% word recognition; outcome: %-correct word recognition S: free recall of all sentence-final words from last set of P-task sentences; outcome: %-correct | Reading span for WM capacity, division into low-span and high-span group; Note: only for half the sentence lists there was an actual dual-task situation. For the other half, there was only the S-task | P: performance with same-language babble without NR processing poorer (ca 93% correct) than for other conds. (close to 100% correct) S: in same-language babble, recall better with than without NR; in Cantonese babble, no difference between with or without NR; high-span group outperformed low-span for primacy list items, but not for asymptote or recency; for low-span (but not for high-span) group performance better with than without NR for recency items, but not for primacy or asymptote; | Different strategies in recall by span group; low-span group tendency to recall words from late list positions, no such effect in high-span group |
| ( | YNH; N = 19, 19–25 (22) years CI simulation (noise vocoding) | P: Recognition of vocoded sentences; conds: 2, 4, 6, 8, 12, 16, or 24 channels or no processing (control); at self-adjusted level 65–75 dB SPL; outcome: %-correct words S1: rhyme judgments for visually presented monosyllabic words; button press yes or no; outcome: RTs for correct responses S2: mental rotation of Japanese characters; judgment whether two displayed characters are the same when rotated; button press yes/no; outcome: RTs for correct responses | Rating of perceived workload; NASA task- load index (TLX) | P: no difference in performance by choice of secondary task; performance improved from 2 to 4 channels and from 4 to 6 channels, for 6 or more channels stable at ceiling S: RTs decreased from 2 to 4, 4 to 6, and 6 to 8 channels, stable for 8 or more channels; thus, RTs decreased where speech recognition stable; reduction of RTs by increased no. of channels bigger for RTs during primary task than between primary-task trials | S: Training effects in RTs for both secondary tasks over the course of the experiment Both dual-task conditions judged as more effortful than the single-task conditions; ratings equal for the two secondary tasks; decrease in effort ratings for conditions up to 6 channels |
| (Pals, Sarampalis, van Rijn, & Baskent, 2015) Effects assessed: performance level, noise type, # of vocoded channels for speech recognition task | YNH; N = 19, M = 19 (18–25) years | P: Sentence understanding; conds: quiet, SSN, 8-talker babble, at individual SNRs for 79% correct and for near ceiling (NC) S: rhyme judgments for pairs of visually presented monosyllabic words; outcome: RTs for correct responses, separate for judgments during vs. between P-task stimuli | WAIS for processing speed; reading span for working memory capacity Note: this study is also listed under the single- task studies for response delays | P: intelligibility as intended S: RTs longer in masker than in quiet; no effect of noise type or intelligibility level on RTs | WAIS predicted RTs |
| ( | NH; N = 20, 19–44 (27.9) years | P: recognition of monosyllabic words in quiet or 4-talker babble; sets of 5 words; individual SNR for 75%- correct; conds: A-only and AV; outcome: %-correct S: Recall of presented words (sets of 5 words); outcome: %-correct | Rating of degree of effort put into hearing what was said; 11-point scale from “no effort” to “lots of effort”; Lipreading skills, Revised Shortened Utley Sentence Lipreading Test (ReSULT), visual- only sentence recognition WM capacity, automated operation span task (AOSPAN), solving of simple equations, memorization of letters | P: performance better in quiet than in babble; performance equal for A-only and AV conditions; no effect of serial position within set of words S: word recall better in quiet than in noise; effect of serial position: better recall for positions 4 and 5 than for positions 1–3; recall equal for A-only and AV conditions | Subjectively rated effort larger in babble than in quiet; no difference in effort ratings for A-only versus AV testing or by serial position; P: In babble: those with better lipreading benefited more from provision of visual cues (AV compared to A-only) in primary task S: devision into low-AOSPAN and high-AOSPAN groups: in babble, high-AOSPAN group had higher recall benefit from provision of visual cues (AV compared to A- only), but in quiet, recall for high-AOSPAN group worse in AV than in A-only |
| ( | HI; N = 27, 49–89 (65.3) years Fitted with Phonak Savia 211 BTEs | P: recognition of monosyllabic words in quiet and 4-talker babble; individual SNR for 50%–70% correct; A-only and AV; aided and unaided testing S: reaction to visual probe; react to red rectangle but not to white; outcome: RTs to probe trials | Lipreading skills, ReSULT, (see previous entry) WM capacity, AOSPAN (see previous entry) Verbal processing speed, lexical decision task (LDT), real word or not? outcome: RTs | P: performance better in quiet than in babble, effect bigger for AV than A-only condition; in quiet, AV performance better than A-only, but not in babble; HA benefit larger in A-only than in AV condition, but small in both; HA-benefit benefit larger in quiet than in babble S: RTs equal for A-only and AV conditions; RTs shorter in quiet than in babble; RTs shorter with than without HAs Overall, RT effects small and with big variance across listeners | S: HA benefit in RTs for A-only and AV correlated with each other; HA benefit in RTs associated with processing speed; for unaided quiet, RTs of listeners with better lipreading and faster verbal processing speed benefited more from visual cues; for unaided quiet and aided babble, RTs of listeners with smaller AOSPAN benefited more from visual cues |
| ( | YNH; N = 17, 21–24 (23.0) years HI; N = 17, 23–73 (60.1) years; sensorineur al HL, unaided testing | P: recognition of monosyllabic words in quiet or 4-talker babble; individual SNR for 80% correct; A-only and AV; outcome: %-correct S1 (simple): reaction to visual probe; react to red rectangle but not to white S2 (complex): reaction to visual stimulus; choice of response button based on even or odd digit S3 (semantic): semantic-category judgment of primary-task word, hit response button if word was a noun Outcome for all S-tasks: RTs | None | YNH P: performance better in quiet than in babble; performance equal for all S-task conditions; performance equal for A-only and AV conditions S: RTs longer for semantic than for complex and simple S-tasks and also longer for complex than for simple; RTs equal in quiet and babble for simple and complex S-tasks, but longer in babble than in noise for semantic S-task; RTs equal for A-only and AV in all S-tasks HI P: performance better in quiet than in babble; in quiet, performance better for AV than A-only, but equal in babble; performance equal for all S-task conditions S: RTs longer in babble than in quiet for all S-tasks; RTs longest for semantic S-task, intermediate for complex S-task, shortest for simple S-task; RTs equal in A-only and AV conds for all S-tasks | None |
| (Picou, Gordon, & Ricketts, 2016) Effect of noise and reverberation | YNH; N = 18, 22–30 (24.8) years | P: recognition of monosyllabic words, outcome: % correct; in quiet, 3 levels of reverberation (low, moderate, high); one SNR with low reverberation, two SNRs with moderate reverberation, two SNRs with high reverberation, see RQs; S: word-class judgment of primary-task word, hit response button if word was a noun; outcome: RTs (all button presses, not only correct) | RQ A: effect of babble (vs. quiet) with low or mode- rate or high reverb at constant intelligibility (ca 84%) RQ B: for constant moderate reverb, effect of babble (84% or 77% correct) vs. quiet; RQ C: at constant SNR, effect of low vs. moderate reverb; RQ D: at constant SNR, effect of moderate vs high reverb | P: not constant 84% as intended across reverb levels (RQ A), poorer performance with high reverb S: RQ A: RTs longer in babble than in quiet for all reverb levels; no difference between RTs for different reverb levels, neither in quiet nor babble RQ B: for constant reverb, RTs longer with babble than in quiet, longer for 77% correct than for 84% correct RQ C and D: for constant SNRs, no difference in RTs between reverb levels | RTs not correlated with word recognition, neither in quiet nor in babble and at no reverb level; RTs correlated with each other, across babble conditions and reverb levels; Word recognition scores not correlated with each other across conditions; Age not correlated with RTs or word recognition scores (note, all YNH!) |
| ( | Exp1 YNH; N = 8, ages not reported YHI; N = 9, ages not reported Exp2 YNH; N = 11, 21–29 (24) years HI; N = 11, 52–73 (62) years, presbycusis, mostly HA users | P: conds: listening 60 sec to (1) steady- state speech noise, (2) speech passages, questions on passages answered after digit recall; stimuli at 65 dB SPL for YNH, at most comfortable loudness for YHI and HI S: digit recall; string of digits presented before P-task, ordered recall after P- task; Exp1: 9-digit strings, Exp2: 9–13 - digit strings; outcome: number correct recalls | Listeners instructed to give equal priority to speech listening and digit recall | Exp1 S: digit recall better after noise than after speech trials; digit recall equal for YNH and YHI after noise trials, poorer for YHI than for YNH after speech trials; difference between noise and speech trials bigger for YHI than for YNH Exp2 S: recall better for YNH than for OHI; digit recall better after noise than after speech trials, effect bigger for OHI than for YNH Expressed as %-change from baseline (recall after noise) to recall after speech, the YHI group from Exp1 had the overall biggest decrease in performance from speech trials | None |
| ( | Participants = 30 normal- hearing young adults | Stimuli were low-pass filtered 22 consonants presented in a a-C-v context 4 conditions: AV, V, A, V with concurrent auditory processing PRIMARY task was an auditory syllable detection task V-AP condition required the performance of the VCV lipreading task with attention divided between auditory and visual modalities. Subjects were provided a second set of written instructions prior to administration of the end detection practice list and V-AP test condition. Subjects were instructed to maintain the same level of accuracy on the end detection task under the V-AP condition as that achieved on the practice list. | none | Results indicate that lipreading performance during divided attention was significantly lower than that measured during focused attention. The performance decrement suggests that simultaneous processing of the lipreading and auditory tasks exceeded capacity; with attention divided between modalities a sufficient amount of capacity was not available for optimal processing of visual stimuli, causing lipreading performance to suffer. | |
| ( | Exp1 YNH; = 25, 18–26 (20.0) years Exp2 YNH; = 25, 19–27 (ca. 21) years | Exp1 P: R-SPIN sentences; conds: in quiet at 65 dB SPL, in 4-talker babble at −2/+2 dB SNR with babble at 65 dB SPL; in babble unprocessed or with NR; outcome: %-correct targets S: remember last word of each sentence; recall after 8 sentences; outcome: %-correct Exp2 P: IEEE sentences; conds: in quiet at 65 dB SPL, in 4-talker babble at −6, −2, or+2 dB SNR with babble at 65 dB SPL; in babble unprocessed or with NR; outcome: %-correct targets S: complex visual RT task for speed of processing; different response for odd/even digit: arrow pointing toward/away from digit; outcome: RTs | None | Exp1 P: performance better at high SNR than at low SNR; performance perfect for low- and high-context sentences in quiet, in babble better for high- than for low- context; negative effect of NR at −2 dB SNR independent of context S: recall better for high- than for low- context words; for low-context: recall better in quiet than in babble, better at 2 dB SNR than at −2 dB SNR, almost no effect of NR; for high-context: recall better in quiet than in babble, with NR recall equal at −2 and +2 dB SNR, without NR, recall better at +2 dB SNR, recall equal with and without NR at 2 dB SNR, but recall poorer with NR at -2 dB SNR Exp2 P: performance perfect in quiet; performance decreased with decreasing SNR; no effect of NR S: RTs increased with decreasing SNR; effect of NR only at lowest SNR (-6 dB SNR): RTs shorter with NR | None |
| (Seeman & Sims, 2015) Comparison of psychophysiologic al and dual-task on listening effort | YNH; N = 46 (divided into 3 groups), 18–38 (21.2) years | Group 3: DT experiment (N = 15 or 16) P: sentences-in-noise (SIN) at +15, +5 dB SNR in steady-state noise; outcome: total number of correct keywords (max 75, 5 per sentence) S: visual letter identification; press button when displayed letter is target letter; outcome: %-correct and RTs, RT outcomes proportional: (RTdual – RTsingle)/RTsingle | Group 1: diotic-dichotic digit listening with psychophys measures Group 2: SIN understanding with psychophys measures Psychophys measures: skin conductance (SC), heart-rate variability (HRV), heart rate (HR) Subjective: NASA Task Load Index (TLX), except physical demand scaleNone | P: 100% correct at +15 dB SNR, 83% correct at +5 dB SNR S: at +15 dB SNR RTs not different from baseline; at +5 dB SNR, RTs longer than at baseline and at +15 dB SNRExp1 P: performance better at high SNR than at low SNR; performance perfect for low- and high-context sentences in quiet, in babble better for high- than for low- context; negative effect of NR at −2 dB SNR independent of context | NASA TLX: higher load for dual than for single tasks, load increased for decreasing SNRs (Groups 2 and 3), load increased with task complexity for dichotic digits (Group 1) HRV was sensitive to dichotic task complexity (Group 1) and to high (+15/+10 dB) vs low (+5/0 dB) SNRs (Group 2); HR and SC had some sensitivity No correlations of psychophys measures or RTs with NASA TLXNone |
| Exp1 P: R-SPIN sentences; conds: in quiet at 65 dB SPL, in 4-talker babble at -2/+2 dB SNR with babble at 65 dB SPL; in babble unprocessed or with NR; outcome: %-correct targets S: remember last word of each sentence; recall after 8 sentences; outcome: %-correct Exp2 P: IEEE sentences; conds: in quiet at 65 dB SPL, in 4-talker babble at −6, −2, or+2 dB SNR with babble at 65 dB SPL; in babble unprocessed or with NR; outcome: %-correct targets S: complex visual RT task for speed of processing; different response for odd/even digit: arrow pointing toward/away from digit; outcome: RTs | S: recall better for high- than for low- context words; for low-context: recall better in quiet than in babble, better at 2 dB SNR than at -2 dB SNR, almost no effect of NR; for high-context: recall better in quiet than in babble, with NR recall equal at −2 and +2 dB SNR, without NR, recall better at +2 dB SNR, recall equal with and without NR at 2 dB SNR, but recall poorer with NR at −2 dB SNR Exp2 P: performance perfect in quiet; performance decreased with decreasing SNR; no effect of NR S: RTs increased with decreasing SNR; effect of NR only at lowest SNR (-6 dB SNR): RTs shorter with NR | ||||
| ( | YNH; N = 18, 18–20 (18.3) years ONH; N = 18, 60–80 (69.6) years | P: listen to expository passages (ca. 230 words), repeat content later on in own words; two levels of proposition density S1: simple RT, press button, when letter “J” on screen S2: choice RT, press one button for letter “J”, press different button for letter “H”Group 3: DT experiment (N = 15 or 16) P: sentences-in-noise (SIN) at +15, +5 dB SNR in steady-state noise; outcome: total number of correct keywords (max 75, 5 per sentence) S: visual letter identification; press button when displayed letter is target letter; outcome: %-correct and RTs, RT outcomes proportional: (RTdual – RTsingle)/RTsingle | Reading and listening span for working memoryGroup 1: diotic- dichotic digit listening with psychophys measures Group 2: SIN understanding with psychophys measures Psychophys measures: skin conductance (SC), heart-rate variability (HRV), heart rate (HR) Subjective: NASA Task Load Index (TLX), except physical demand scale | P: recall better for low-density than for high-density passages, recall equal for both S-tasks; performance in all conditions similar for YNH and ONH but high-level propositions (in hierarchy) better recalled than low-level and this effect larger in ONH than in YNH; S: YNH faster than ONH; responses faster for simple-RT task than for choice-RT task; responses faster for low-density than for high-density passages but only for choice-RT task; dual-task cost was greater for ONH than for YNH for simple- RT task but not for choice-RT taskP: 100% correct at +15 dB SNR, 83% correct at +5 dB SNR S: at +15 dB SNR RTs not different from baseline; at +5 dB SNR, RTs longer than at baseline and at +15 dB SNR | Reading and listening span combined into one WM span measure; YNH had larger WM spans than ONH; WM span associated with proposition recall in single-task ( |
| ( | Y; N = 24, 20–46 (27.9) years (YNH, N = 12; YHI (mild); N = 12) O; N = 24, 67–80 (73.9) years (ONH, N = 12; OHI (mild), N = 12) | P: word list at 70 dB SPL, followed by 30-second counting task, followed by word recall; conds: semantically related and unrelated word lists; outcome: number of correctly recalled words S: visual motor tracking during responses to P-task (after counting task); outcome: %-time on moving target; speed individually set to 50 - 60% tracking accuracy; effort: change in outcome from baseline to DT | NH and HI groups matched on 3 cognitive measures (backward digit span for verbal WM, word-list recall for episodic memory, trail making A and B for executive control) | P: recall better for related than for unrelated word lists; recall better in Y than in O listeners but effect of dual-task similar in both groups; age x list-type interaction: O greater benefit of word relatedness than Y; NH groups performed better than HI, effect bigger in Y than in O S: effort larger for O than for Y, i.e., larger dual-task cost in O; HI poorer performance than NH, especially in O group; | None |
| ( | YNH; N = 21, 19–27 (21.0) years | P: sentence recognition; 4 vocoding conds: clear speech, six-band (NV-hi), six-band compressed (NV-lo), spectrally rotated (rNV); yes/no response for sentence intelligibility S1: auditory distracters: 400-ms noise bursts; non-targets: long onset, sharp offset; targets: sharp onset, long offset; diotic to P-task stimuli; yes/no response whether a target in trial S2: visual distracters: 200-ms presentations of cross-hatched white ellipses on black background; non- targets: solid lines; targets: dashed lines; yes/no response whether a target in trialP: listen to expository passages (ca. 230 words), repeat content later on in own words; two levels of proposition density S1: simple RT, press button, when letter “J” on screen S2: choice RT, press one button for letter “J”, press different button for letter “H” | For each trial, visual prompt cued attention to a single stimulus stream: “Speech” for speech stimuli, “Chirps” for auditory distracters, “Footballs” for visual distracters. Next to behavioral results, fMRI data were gathered. Reading and listening span for working memory | Results reported for attended conditions P: clear speech and NV-hi equally intelligible, better than NV-lo, which was better than rNV S1 and S2: Performance better than chance; performance better for visual distracters than for auditory distracters; performance unaffected by condition in (unattended) P-task P: recall better for low-density than for high-density passages, recall equal for both S-tasks; performance in all conditions similar for YNH and ONH but high-level propositions (in hierarchy) better recalled than low-level and this effect larger in ONH than in YNH; S: YNH faster than ONH; responses faster for simple-RT task than for choice-RT task; responses faster for low-density than for high-density passages but only for choice-RT task; dual-task cost was greater for ONH than for YNH for simple- RT task but not for choice-RT task | None Reading and listening span combined into one WM span measure; YNH had larger WM spans than ONH; WM span associated with proposition recall in single-task ( |
| ( | Exp1 OHI; N = 29, 56–85 (72.7) years; 25 were experience HA users Exp2 OHI; N = 19, 56–85 (71.7) years; subset of Exp1 participants Exp3 YNH; N = 14, 20–37 (23.4) years | Exp1 P: Connected speech test (CST); %- correct keyword repetition; stimuli prerecorded in car with noise at 75 dBA, SNR −1 dB, 3 conds: unaided, aided omnidirectional (OMNI), aided directional to back (DIR); testing with individually amplified speech (NAL- NL1); S: driving performance; mean, SD, interquartile range (IQR) of driving distance to lead vehicle; smaller numbers = better performance Exp2 (as Sarampalis et al., 2009) P: Speech recognition; 3 conds: unaided, OMNI, DIR S: complex visual RT task; different response for odd/even digit; outcome: RTs Exp3 as Exp2 | Participants were asked to pay equal amounts of attention to P and S tasks. | Exp1 P: single-task performance better than dual-task; OMNI better than unaided, DIR better than unaided and OMNI S: baseline better than dual-task; dual-task performance and dual-task cost equal for all HA conditions Exp2 P: performance equal for single- and dual- task; performance better for OMNI than for unaided and better for DIR than for OMNI and unaided S: baseline better than dual-task; dual-task performance and dual-task cost equal for all HA conditions Exp3 P: performance better in DIR than in unaided and OMNI S: baseline better than dual-task; dual-task performance better (and cost lower) in DIR than in unaided and OMNI | RTs in Exp2 were associated with driving performance in Exp1 ( |
| ( | NH; N = 8, 51–61 (Med = 55.0) years HI; N = 8, 59–66 (Med = 64.0) years+ | P: CRM at 65 dB SPL (0 dB SNR), with NAL-R gain for HI listeners; masker: 2 competing talkers; 4 conds for target- masker relationship: (1) sex differences and spatial separation, (2) sex difference only, (3) spatial separation only, (4) no cues; spatial separation 15° in Exp1, 60° in Exp 2; outcome: %-correct target words S: visual motor tracking; movement speed set to individual 60% accuracy; outcome: %-time on moving target; effort: change in outcome from baseline to DT | None | Exp1 (15° spatial separation) P: HI performed worse than NH in all conds but the no-cue cond (p < .001); in both groups, equal performance for both-cues and sex-cues conds; in HI, equal performance also for spatial-cues and no-cues conds; pattern true for both single-task and dual-task conds S: tracking-accuracy lower in dual-task cond; group – cue-type interaction: | Exp1 Spatial-separation benefit in P- task associated with reductions of cost in S-task (r = -.59*) |
Note. AOSPAN = automatic operation span task; AV = audiovisual; BIN = binaural; BTE = behind the ear; C−/+ = cognitive function good (+) or, cognitive function poor (−); CST = continuous speech test; DIR = directional; DM = directional microphone; DPRT = Digital pursuit Rotor Tracking; DT = dual task; DSST = digit symbol substitution test; ENH = spectrally enhanced; ENHC = spectrally enhanced with compression; FF = talker at front; F-RF = talker at right-front; H−/+ = hearing sensitivity good (+) or, hearing sensitivity poor (−); HA = hearing aid; HI = hearing impaired; HRV = heartrate variability; LDT = lexical decision task; LNS = letter number sequencing test; NAL-R = National Acoustics Laboratory-revised version; MF = monaural far; MME = modified direct magnitude estimation; MN monaural near; N = number of participants; NC = near coding; NH = normal hearing; NoP = no processing; NR = noise reduction; NRDM = noise reduction and directional microphone; NV-hi = vocoded, high intelligibility; NV-lo = vocoded, low intelligibility; OHI = old, hearing impaired; ONH = old, normal hearing; OSLA matrix = Oldenburg sentences matrix test; P = primary task; pDTC = proportional -task; R = reverberation; Rspan = reading span; RSPIN = revised speech in noise test; RT = response time; S = secondary task; SIN = sentences in noise; SNR = signal to noise ratio; SPIN = speech in noise; SSN = speech spectrum noise; TLX = text load index; TVM sentences = closed-set sentence recognition test; TTB = 2-talker babble; V-AP = visual-with concurrent auditory processing; VCV = vowel consonant vowel; VRT = visual response time; WAIS = Wechsler Adult Intelligence Scale; WM = working memory; WMC = working memory capacity; WMS = working memory span; YNH = young, normal hearing.
Unilateral hearing loss was simulated by the use of an earplug with inserted into one ear during testing, resulting in a gradually sloping 4 dB per octave conductive hearing loss of approximately 30 dB HL at 500, 1,000, and 2,000 Hz.
In basic mode, the hearing aid microphone was omnidirectional, and all advanced features, except for feedback management, were disabled, including directional processing and DNR. In advanced mode, the manufacturer’s default settings for different listening environments were used. This included multichannel automatic directivity and algorithms designed to reduce reverberation, general background noise, and wind noise. The background noise level in this study (55 dBA) was not high enough to activate the devices’ directional processing or noise-reduction algorithms. Thus, regardless of the aid setting (basic or advanced), the hearing aids were functioning in omnidirectional mode with no DNR active during testing. The interest was whether continuous access to advanced signal processing during daily activities would reduce cognitive processing demands and listening effort such that differences would be apparent when tested on the study’s cognitively demanding dual task completed at the end of the day.
Note on group labels: H+C+ = mild hearing loss, better cognitive function; H−C+ = moderate hearing loss, better cognitive function; H+C− = mild hearing loss, poorer cognitive function; H−C− = moderate hearing loss, poorer cognitive function
Note that the study by Wild et al. describes a dual task and provides behavioral results for it, but the study did not examine listening effort or dual-task cost. Performance was only assessed for attended stimuli.
List of Types of Secondary Tasks That Were Used in the Reviewed Dual-Task Studies on Listening Effort for Speech.
| Visual motor tracking task displayed on a computer screen | ( |
| Response time to visual probe or distractor | ( |
| Recall of words presented in the primary task or before the primary task | ( |
| Judgment concerning a feature of the primary task | ( |
| Tactile pattern recognition | ( |
| Semantic judgment task | ( |
| Driving a car simulator | ( |
| Mental rotation of Japanese characters | ( |
| Response time to predetermined auditory signals | ( |
| Rhyme judgment for words presented visually | ( |