| Literature DB >> 27641681 |
An-Chieh Chang1, Robert Lutfi2, Jungmee Lee1, Inseok Heo3.
Abstract
Research on hearing has long been challenged with understanding our exceptional ability to hear out individual sounds in a mixture (the so-called cocktail party problem). Two general approaches to the problem have been taken using sequences of tones as stimuli. The first has focused on our tendency to hear sequences, sufficiently separated in frequency, split into separate cohesive streams (auditory streaming). The second has focused on our ability to detect a change in one sequence, ignoring all others (auditory masking). The two phenomena are clearly related, but that relation has never been evaluated analytically. This article offers a detection-theoretic analysis of the relation between multitone streaming and masking that underscores the expected similarities and differences between these phenomena and the predicted outcome of experiments in each case. The key to establishing this relation is the function linking performance to the information divergence of the tone sequences, DKL (a measure of the statistical separation of their parameters). A strong prediction is that streaming and masking of tones will be a common function of DKL provided that the statistical properties of sequences are symmetric. Results of experiments are reported supporting this prediction.Entities:
Keywords: auditory masking; auditory perception; auditory streaming; hearing
Mesh:
Year: 2016 PMID: 27641681 PMCID: PMC5029798 DOI: 10.1177/2331216516664343
Source DB: PubMed Journal: Trends Hear ISSN: 2331-2165 Impact factor: 3.293
Figure 1.Schematic of stimulus configuration. The frequencies of the A (black) and B (gray) tones were drawn at random from equal variance normal distributions (p and q) separated in mean, with the mean of the A tones fixed at 500 Hz. The independent variables were the distribution parameters Δ and σ. The listener’s task in different conditions was to report their confidence of hearing the B tones alternate in level indicated by width of gray rectangles (masking task) or hearing the A and B tones split into separate cohesive streams (streaming task).
Figure 2.Confidence ratings averaged across seven listeners for the streaming (top panel) and masking (bottom panel) tasks plotted as a function of Δ. Different curves correspond to the different values of σ. Error bars not shown for clarity of presentation (see, instead, analysis of variance described in the Results section).
Figure 3.Mean confidence ratings of Figure 2 replotted as a function of Δ/σ for both the streaming (filled symbols) and masking (unfilled symbols) tasks. Confidence ratings for the masking task have been shifted upward by a constant amount to equate overall difficulty for the two tasks (see text for explanation). The curve shown is the nonlinear, least-squares fit of Equation 5 to the data. The fit accounts for 96% of the total variance in the data.