| Literature DB >> 32372720 |
William J Bologna1, Jayne B Ahlstrom1, Judy R Dubno1.
Abstract
Focused attention on expected voice features, such as fundamental frequency (F0) and spectral envelope, may facilitate segregation and selection of a target talker in competing talker backgrounds. Age-related declines in attention may limit these abilities in older adults, resulting in poorer speech understanding in complex environments. To test this hypothesis, younger and older adults with normal hearing listened to sentences with a single competing talker. For most trials, listener attention was directed to the target by a cue phrase that matched the target talker's F0 and spectral envelope. For a small percentage of randomly occurring probe trials, the target's voice unexpectedly differed from the cue phrase in terms of F0 and spectral envelope. Overall, keyword recognition for the target talker was poorer for older adults than younger adults. Keyword recognition was poorer on probe trials than standard trials for both groups, and incorrect responses on probe trials contained keywords from the single-talker masker. No interaction was observed between age-group and the decline in keyword recognition on probe trials. Thus, reduced performance by older adults overall could not be attributed to declines in attention to an expected voice. Rather, other cognitive abilities, such as speed of processing and linguistic closure, were predictive of keyword recognition for younger and older adults. Moreover, the effects of age interacted with the sex of the target talker, such that older adults had greater difficulty understanding target keywords from female talkers than male talkers.Entities:
Keywords: aging; attention; speech perception; talker identification; talker sex
Year: 2020 PMID: 32372720 PMCID: PMC7225833 DOI: 10.1177/2331216520915110
Source DB: PubMed Journal: Trends Hear ISSN: 2331-2165 Impact factor: 3.293
Figure 1.Example spectrograms illustrating a sentence processed for male voice features (left panel) and the same sentence processed for female voice features (right panel). When processing a male talker for female voice features, the fundamental frequency is shifted to a higher voice pitch and spectral envelope is broadened such that formants occur in a higher frequency range.
Figure 2.Schematic diagram of standard trials (top panel) and probe trials (bottom panel). All trials began with the cue phrase, followed by 1.5 s of silence, and then the target and competing talker mixture. On standard trials, F0 and spectral envelope of the cue phrase matched the target talker exactly. On probe trials, F0 and spectral envelope of the cue phrase were parametrically shifted toward the midpoint between the target and competing talker.
Figure 3.Recognition of keywords in target sentences in percent correct (filled symbols with solid lines) and masker-intrusion errors (open symbols with dotted lines) for younger (blue) and older (red) adults are plotted as a function of semitone shift of the cue phrase. Symbols are offset for clarity. Keyword recognition was higher for standard trials (shift of 0 semitones, left most data points) and declined for probe trials with increasing shift of the cue phrase. Masker-intrusion errors were rare on standard trials and are not plotted. Masker-intrusion errors increased for probe trials with increasing shift of the cue phrase. Keyword recognition for target sentences was higher for younger than older adults for standard and probe trials, but both groups demonstrated a similar number of masker-intrusion errors on probe trials.
Target keyword GLMM factors, coding, standard estimates, standard error, and z statistics are displayed for each significant fixed effect and interaction term.
| Factor | Description and Coding | Standard Estimate (β) | Standard Error | |
|---|---|---|---|---|
|
| Age-group of listener (older = 1; younger = –1) | −0.28 | 0.06 | −4.83*** |
|
| Sex of target talker (female = 1; male = −1) | 0.09 | 0.01 | 8.33*** |
|
| Shift in voice characteristics of cue phrase (0 for standard trials; 3.0, 3.5, or 4.0 for probe trials) | −0.25 | 0.01 | −34.22*** |
|
| Sentence length (number of keywords in the sentence, normalized | −0.13 | 0.01 | −12.61*** |
|
| Position of keyword within the sentence (serial order position of keyword scaled by total number of keywords in sentence, normalized | 0.34 | 0.01 | 32.92*** |
|
| Listener’s score on Connections Test (residualized for effects of age, normalized for | 0.20 | 0.06 | 3.44*** |
|
| Listener’s score on Text Reception Threshold Test (residualized for effects of age, normalized for | −0.16 | 0.06 | 2.72** |
|
| Interaction between target sex and age-group of listener | −0.10 | 0.01 | −8.42*** |
| | Effect of target sex for older listeners | <0.01 | 0.01 | <0.01 |
| | Effect of target sex for younger listeners | 0.19 | 0.02 | 11.12*** |
|
| Interaction between voice shift of cue phrase and target sex | 0.09 | 0.01 | 11.98*** |
| | Effect of voice shift on female targets | −0.16 | 0.01 | −15.67*** |
| | Effect of voice shift on male targets | −0.33 | 0.01 | −33.61*** |
|
| Interaction between age-group, target sex, and voice shift | −0.03 | 0.01 | −3.54** |
| | Interaction between voice shift and age-group for female targets | <0.01 | 0.01 | −0.27 |
| | Interaction between voice shift and age-group for male targets | 0.05 | 0.01 | 4.63*** |
| | Effect of voice shift on male targets for older listeners | −0.26 | 0.01 | −19.23*** |
| | Effect of voice shift on male targets for younger listeners | −0.41 | 0.01 | −30.09*** |
Note. Each interaction was explored with a separate post hoc model with split factors, and statistical results are indented below interactions. Asterisks indicated significance levels for z statistics (***p < .001; **p < .01). ns = not significant; TRT = Text Reception Threshold.
Figure 4.Recognition of keywords in target sentences in percent correct (filled symbols with solid lines) and masker-intrusion errors (open symbols with dotted lines) for younger (blue) and older (red) adults are plotted as a function of semitone shift of the cue phrase for male target talkers (left panel) and female target talkers (right panel). Symbols are offset for clarity. Standard trials are presented on the far left in each panel. Probe trials with male target talkers demonstrate poorer performance and more masker-intrusion errors than trials with female target talkers. Keyword recognition for target sentences with female targets was higher for younger than older adults for standard and probe trials, but the two groups performed more similarly for target sentences with male targets.
Masker-intrusion error GLMM factors, coding, standard estimates, standard error, and z statistics are displayed for each significant fixed effect and interaction term.
| Factor | Description and Coding | Standard Estimate (β) | Standard Error | |
|---|---|---|---|---|
|
| Sex of target talker (female = 1; male = −1) | −0.39 | 0.05 | −8.37*** |
|
| Shift in voice characteristics of cue phrase (0 for standard trials; 3.0, 3.5, or 4.0 for probe trials) | 0.64 | 0.02 | 37.45*** |
|
| Position of keyword within the sentence (serial order position of keyword scaled by total number of keywords in sentence, normalized | 0.22 | 0.03 | 7.73*** |
|
| Listener’s score on Connections Test (residualized for effects of age, normalized for | −0.21 | 0.08 | −2.59** |
|
| Interaction between target sex and age group of listener | 0.22 | 0.03 | 7.18*** |
|
| Effect of target sex for older listeners | −0.17 | 0.05 | −3.12** |
|
| Effect of target sex for younger listeners | −0.61 | 0.06 | −10.44*** |
|
| Interaction between voice shift of cue phrase and target sex | −0.10 | 0.02 | −5.77*** |
|
| Effect of voice shift on female targets | 0.55 | 0.03 | 19.05*** |
|
| Effect of voice shift on male targets | 0.74 | 0.02 | 39.08*** |
Note. Each interaction was explored with a separate post hoc model with split factors, and statistical results are indented below interactions. Asterisks indicated significance levels for z statistics (***p < .001; **p < .01).