| Literature DB >> 31747743 |
Chanbeom Kwak1,2, Woojae Han1,2,3.
Abstract
Auditory scene analysis is defined as a listener's ability to segregate a meaningful message from meaningless background noise in a listening environment. To gain better understanding of auditory perception in terms of message integration and segregation ability among concurrent signals, we aimed to systematically review the size of auditory scenes among individuals. A total of seven electronic databases were searched from 2000 to the present with related key terms. Using our inclusion criteria, 4,507 articles were classified according to four sequential steps-identification, screening, eligibility, included. Following study selection, the quality of four included articles was evaluated using the CAMARADES checklist. In general, studies concluded that the size of auditory scene increased as the number of sound sources increased; however, when the number of sources was five or higher, the listener's auditory scene analysis reached its maximum capability. Unfortunately, the score of study quality was not determined to be very high, and the number of articles used to calculate mean effect size and statistical significance was insufficient to draw significant conclusions. We suggest that study design and materials that consider realistic listening environments should be used in further studies to deep understand the nature of auditory scene analysis within various groups.Entities:
Keywords: Auditory segregation; Numerosity of auditory scene; Sound localization; Study quality
Year: 2019 PMID: 31747743 PMCID: PMC6949483 DOI: 10.7874/jao.2019.00248
Source DB: PubMed Journal: J Audiol Otol
Inclusion criteria based on the Participants, Intervention, Control, Outcomes, and Study designs (PICOS) strategy
| PICOS | Contents |
|---|---|
| Participants | Adults with normal hearing (≥18 years) |
| Intervention | Size of auditory scene analysis with any objective, subjective, and mixed measurement. |
| Control | Comparison with control group or repeated measures (experiments with additional purposes) |
| Outcomes | Outcome measure(s) related to size and/or number of auditory scenes |
| Study designs | Randomized controlled trials, non-randomized controlled trials, cohort studies (with a control comparison), and repeated measures (experiments with additional purposes) |
Fig. 1.A Preferred Reporting Items for Systematic Reviews and Meta-analysis (PRISMA) flow diagram depicting the inclusion and exclusion process of the current study. A total of 4,507 searched articles were classified according to four sequential steps-identification, screening, eligibility, included. PICOS: Participants, Intervention, Control, Outcomes, and Study design.
Scientific study validity criteria based on CAMARADES checklists (Modified from Macleod, et al. [15].)
| Article | Scientific study validity criteria | Study quality score | |||||
|---|---|---|---|---|---|---|---|
| Randomization | Controls | Sample size calculation | Publication after peer review | Outcome measure | Statement of potential conflict of interest | ||
| Kawashima and Sato [ | 1 | 0 | 1 | 1 | 1 | 0 | 4 |
| Zhong and Yost [ | 1 | 0 | 0 | 1 | 1 | 0 | 3 |
| Roberts, et al. [ | 1 | 1 | 1 | 1 | 1 | 0 | 5 |
| Eramudugolla, et al. [ | 1 | 0 | 0 | 1 | 1 | 0 | 3 |
1 and 0 refer to “Yes” and “No”, respectively
Summary of the extracted and synthesized data from the four included articles
| Study | Participants | Test materials | Study design | Main factor | Outcome measures | Main findings | Size of scene | ||
|---|---|---|---|---|---|---|---|---|---|
| Kawashima and Sato [ | Twelve young adults with normal hearing in Exp. 3. | • Speech stimuli recorded by 20 talkers (ten male and ten female) were used. | Repeated measure | Perceptual limits | Correct percent | Perceptual limits were estimated as in the previous experiments, and they were 3.5, 3.4, and 4.4 in L1N6, L1N13, and L6N6 conditions, respectively. | For perceptual limits, | ||
| • Stimuli presented by a single and six loudspeakers. | • 1 SS: presenting by 6 and 13 talkers, recognized as 3.5 and 3.4 talkers, respectively. | ||||||||
| • 6 SS: presenting by 6 talkers, recognized as 4.4 talkers. | |||||||||
| Zhong and Yost [ | Total fourteen young, normalhearing listeners (i.e., eight in Exp. 1 and 3, and six in Exp. 4) | • A total of 24 words stimuli which consist of one-word countries were recorded by six female and six male speakers. | Repeated measure | Number of perceived size of auditory scene | Correct percent, correct location | For the cases of one, two, and three sound sources, the mean reported total number of sources were 1.1, 2.2, and 3.0, respectively, which were within one standard deviation compared to the actual number of sources in all cases. For the case of four sources, the mean perceived total number of sources was 3.5, which is smaller than the actual total number (four), but still within one standard deviation of ideal performance. | As recognition, When SS was increased 1 to 4, the average number of source recognition was 1.1, 2.2, 3.0, and 3.5 sound sources. | ||
| • Twelve of the 24 loudspeakers were used to present the stimuli. | |||||||||
| Eramudugolla, et al. [ | 28 and 26 young adults with normal hearing in Exp. 1 and 3, respectively. | • A combination of four, six, or eight sounds drawn at random from the library of 11 natural sounds. | Repeated measure | Selective attention in complex auditory scenes | Correct percent, sensitivity, response criterion | Sensitivity decreased with increasing scene size in the nondirected condition [ | Sensitivities: | ||
| With attention, | |||||||||
| • All stimuli presented via headphone. | 4 SS: 3.07 (d´) | ||||||||
| 6 SS: 2.91 (d´) | |||||||||
| • The HRTFs with a timedomain representation was used to generate the auditory objects. | 8 SS: 2.88 (d´) | ||||||||
| Without attention, | |||||||||
| 4 SS: 2.76 (d´) | |||||||||
| 6 SS: 2.03 (d´) | |||||||||
| 8 SS: 1.21 (d´) | |||||||||
| Roberts, et al. [ | 20 young and 20 old normalhearing listeners, and 20 young and 30 old normal-hearing listeners in Exp. 2 and 3, respectively. | The stimuli, which were 10-sec clips of eight distinctive sounds presented via headphones. | Repeated measure | Enumeration of concurrent auditory stimuli and of sequential auditory stimuli | Accuracy, response time, questionnaire | Participants became less accurate as numerosity increased, | Accuracy: | ||
| For young group, | |||||||||
| 1 SS: 2.43 (%) | |||||||||
| 2 SS: 2.90 (%) | |||||||||
| 3 SS: 2.83 (%) | |||||||||
| For old group, | |||||||||
| 1 SS: 2.44 (%) | |||||||||
| 2 SS: 2.69 (%) | |||||||||
| 3 SS: 2.65 (%) | |||||||||
Exp.: experiment, L1N6: six talkers were presented from one location, L1N13: thirteen talkers were presented from one location, L6N6: six talkers were presented from six locations, SS: sound source(s), HRTF: head related transfer function