| Literature DB >> 31104580 |
Lars Bramsløw1, Marianna Vatti1, Rikke Rossing1, Gaurav Naithani2, Niels Henrik Pontoppidan1.
Abstract
People with hearing impairment find competing voices scenarios to be challenging, both with respect to switching attention from one talker to the other, as well as maintaining attention. With the Danish competing voices test (CVT) presented here, the dual-attention skills can be assessed. The CVT provides sentences spoken by three male and three female talkers, played in sentence pairs. The task of the listener is to repeat the target sentence from the sentence pair based on cueing either before or after playback. One potential way of assisting segregation of two talkers is to take advantage of spatial unmasking by presenting one talker per ear after application of time-frequency masks for separating the mixture. Using the CVT, this study evaluated four spatial conditions in 14 moderate-to-severely hearing-impaired listeners to establish benchmark results for this type of algorithm applied to hearing-impaired listeners. The four spatial conditions were as follows: summed (diotic), separate, the ideal ratio mask, and the ideal binary mask. The results show that the test is sensitive to the change in spatial condition. The temporal position of the cue has a large impact, as cueing the target talker before playback focuses the attention toward the target, whereas cueing after playback requires equal attention to the two talkers, which is more difficult. Furthermore, both applied ideal masks show test scores very close to the ideal separate spatial condition, suggesting that this technique is useful for future separation algorithms using estimated rather than ideal masks.Entities:
Keywords: hearing impairment; ideal masks; spatial hearing; speech masker; speech test
Mesh:
Year: 2019 PMID: 31104580 PMCID: PMC6610337 DOI: 10.1177/2331216519848288
Source DB: PubMed Journal: Trends Hear ISSN: 2331-2165 Impact factor: 3.293
Figure 1.Timeline of one competing voices test item with two sentences played simultaneously in a pair and the cue position either before (Pre) or after (Post) playback.
Figure 2.Summary of audiograms for the 14 hearing-impaired test persons. The values are air conduction thresholds across left and right ears.
Summary of Test Conditions in the Factorial Design for the Listening Test.
| Experimental factor | Number of levels | Labels |
|---|---|---|
| Spatial | 4 | Sum Separate Ideal ratio mask Ideal binary mask |
| Cuetype | 3 | Audio Text Gender |
| Cueposition | 2 | Pre Post |
| Gender | 3 | Male–Female Male–Male Female–Female |
| Test persons | 14 | N/A |
Figure 3.Box-whisker plots for the 14 TP, showing all test conditions.
Summary of Analysis of Variance.
| Effect (fixed/ random) | Nominator | Denominator Syn |
|
| |
|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| Spatial × Cueposition | Fixed | 3 | 659.00 | 0.73 | .532 |
| Cuetype × Cueposition | Fixed | 2 | 659.00 | 2.73 | .066 |
|
|
|
|
|
| |
|
|
|
|
|
|
|
| TP × Spatial | Random | 39 | 659.00 | 1.28 | .122 |
| TP × Cuetype | Random | 26 | 659.00 | 1.16 | .267 |
|
|
|
|
|
| |
| Error | 659 |
Note. Significant effects (p < .05) are shown in italics. TP = test persons.
Figure 4.The combined effect of Spatial and Cuetype. Mean values and 95% confidence intervals of the mean are shown.
IBM = ideal binary mask; IRM = ideal ratio mask.
Figure 5.The effect of Cueposition (Pre vs. Post). Mean values and 95% confidence intervals of the mean are shown.
Figure 6.The combined effect of Gender mix and Cuetype. Mean values and 95% confidence intervals of the mean are shown.
Figure 7.The interaction effect, combining TP and Cueposition (Pre vs. Post). Mean values and 95% confidence intervals of the mean are shown.