| Literature DB >> 35774636 |
Ioannis Petros Samiotis1, Sihang Qiu1,2, Christoph Lofi1, Jie Yang1, Ujwal Gadiraju1, Alessandro Bozzon1.
Abstract
Music content annotation campaigns are common on paid crowdsourcing platforms. Crowd workers are expected to annotate complex music artifacts, a task often demanding specialized skills and expertise, thus selecting the right participants is crucial for campaign success. However, there is a general lack of deeper understanding of the distribution of musical skills, and especially auditory perception skills, in the worker population. To address this knowledge gap, we conducted a user study (N = 200) on Prolific and Amazon Mechanical Turk. We asked crowd workers to indicate their musical sophistication through a questionnaire and assessed their music perception skills through an audio-based skill test. The goal of this work is to better understand the extent to which crowd workers possess higher perceptions skills, beyond their own musical education level and self reported abilities. Our study shows that untrained crowd workers can possess high perception skills on the music elements of melody, tuning, accent, and tempo; skills that can be useful in a plethora of annotation tasks in the music domain.Entities:
Keywords: human computation; knowledge crowdsourcing; music annotation; music sophistication; perceptual skills
Year: 2022 PMID: 35774636 PMCID: PMC9237482 DOI: 10.3389/frai.2022.828733
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
Figure 1The four steps in the music perception skills study.
Figure 2Interfaces of the study (A, GMSI questionnaire, B, Mini-PROMS, and C, Participant ID prompt).
Prolific participant demographics.
|
|
| |
|---|---|---|
| Age (years) | Range | 18–65 |
| Majority | 18–25 (70.11%) | |
| Occupation | Full-time | 30 |
| Part-time | 11 | |
| Unemployed | 44 | |
| Voluntary work | 2 | |
| Education | Associate degree | 3 |
| Bachelor's degree | 35 | |
| Doctorate degree | 1 | |
| High school/HED | 16 | |
| Master's degree | 12 | |
| Professional degree | 1 | |
| Some college, no diploma | 13 | |
| Some high school, no diploma | 2 | |
| Technical/trade/vocational training | 4 |
GMSI range, median, mean, and standard deviation.
|
|
|
|
| |
|---|---|---|---|---|
| Active engagement | 19–45 | 31 | 30.91 | 5.45 |
| Perceptual abilities | 16–45 | 34 | 33.62 | 6.65 |
| Musical training | 7–45 | 17 | 18.52 | 9.61 |
| Singing abilities | 9–41 | 28 | 27.41 | 6.03 |
| Emotions | 18–42 | 33 | 33.24 | 4.28 |
| General music sophistication | 40–101 | 69 | 69.76 | 14.20 |
Intercorrelations (Pearson's R) of subscales of GMSI scores.
|
|
|
|
|
| |
|---|---|---|---|---|---|
| Active engagement | 1.000 | ||||
| Perceptual abilities | 0.262 | 1.000 | |||
| Musical training | 0.224 | 0.442 | 1.000 | ||
| Emotions | 0.401 | 0.380 | 0.178 | 1.000 | |
| Singing abilities | 0.142 | 0.463 | 0.465 | 0.125 | 1.000 |
Statistical significance (p < 0.05) is marked using an asterisk (*).
Mini-PROMS range, median, mean, and standard deviation.
|
|
|
|
| |
|---|---|---|---|---|
| Melody | 1.5–9 | 5 | 4.98 | 1.59 |
| Tuning | 1–7.5 | 4 | 4.22 | 1.62 |
| Accent | 0–9.5 | 5 | 5.19 | 1.84 |
| Tempo | 1–8 | 5 | 5.14 | 1.59 |
| Mini-PROMS total | 6–30 | 19.5 | 19.53 | 4.98 |
MTurk participant demographics.
|
|
| |
|---|---|---|
| Age (years) | Range | 18–65+ |
| Majority | 26–35 (52.38%) | |
| Occupation | Full-time | 71 |
| Part-time | 9 | |
| Unemployed | 3 | |
| Retired | 1 | |
| Education | Associate degree | 6 |
| Bachelor's degree | 44 | |
| Doctorate degree | 2 | |
| High school/HED | 11 | |
| Master's degree | 10 | |
| Professional degree | 0 | |
| Some college, no diploma | 8 | |
| Some high school, no diploma | 1 | |
| Technical/trade/vocational training | 2 |
GMSI range, median, mean, and standard deviation.
|
|
|
|
| |
|---|---|---|---|---|
| Active engagement | 12–46 | 32 | 30.57 | 7.92 |
| Perceptual abilities | 18–47 | 32.5 | 32.82 | 5.92 |
| Musical training | 7–43 | 23 | 21.80 | 9.40 |
| Singing abilities | 9–45 | 32.5 | 28.29 | 8.12 |
| Emotions | 7–41 | 30.5 | 30.34 | 5.35 |
| General music sophistication | 29–113 | 75 | 72.19 | 18.15 |
Intercorrelations (Spearman's rank) of subscales of GMSI scores.
|
|
|
|
|
| |
|---|---|---|---|---|---|
| Active engagement | 1.000 | ||||
| Perceptual abilities | 0.232 | 1.000 | |||
| Musical training |
| 0.263 | 1.000 | ||
| Emotions | 0.213 | 0.471 | -0.052 | 1.000 | |
| Singing abilities |
| 0.340 |
| 0.223 | 1.000 |
Statistical significance (p < 0.05) is marked using an asterisk (*). Values in bold indicate intercorrelations higher than 0.5.
Mini-PROMS range, median, mean, and standard deviation.
|
|
|
|
| |
|---|---|---|---|---|
| Melody | 1–8 | 4.25 | 4.22 | 1.56 |
| Tuning | 1–7.5 | 3 | 3.2 | 1.33 |
| Accent | 0–7.5 | 4 | 4.04 | 1.42 |
| Tempo | 1–8 | 3.5 | 3.75 | 1.64 |
| Mini-PROMS | 6.5–29.5 | 14.5 | 15.2 | 4.77 |
Figure 3Musical Training (GMSI) and Performance on Mini-PROMS (acc%).