| Literature DB >> 27798645 |
Brian C Wesolowski1, Alex Hofmann2,3.
Abstract
The purpose of this study was to explore the relationship between audio descriptors for groove-based electronic dance music (EDM) and raters' perceived cognitive, affective, and psychomotor responses. From 198 musical excerpts (length: 15 sec.) representing 11 subgenres of EDM, 19 low-level audio feature descriptors were extracted. A principal component analysis of the feature vectors indicated that the musical excerpts could effectively be classified using five complex measures, describing the rhythmical properties of: (a) the high-frequency band, (b) the mid-frequency band, and (c) the low-frequency band, as well as overall fluctuations in (d) dynamics, and (e) timbres. Using these five complex audio measures, four meaningful clusters of the EDM excerpts emerged with distinct musical attributes comprising music with: (a) isochronous bass and static timbres, (b) isochronous bass with fluctuating dynamics and rhythmical variations in the mid-frequency range, (c) non-isochronous bass and fluctuating timbres, and (d) non-isochronous bass with rhythmical variations in the high frequencies. Raters (N = 99) were each asked to respond to four musical excerpts using a four point Likert-Type scale consisting of items representing cognitive (n = 9), affective (n = 9), and psychomotor (n = 3) domains. Musical excerpts falling under the cluster of "non-isochronous bass with rhythmical variations in the high frequencies" demonstrated the overall highest composite scores as evaluated by the raters. Musical samples falling under the cluster of "isochronous bass with static timbres" demonstrated the overall lowest composite scores as evaluated by the raters. Moreover, music preference was shown to significantly affect the systematic patterning of raters' responses for those with a musical preference for "contemporary" music, "sophisticated" music, and "intense" music.Entities:
Mesh:
Year: 2016 PMID: 27798645 PMCID: PMC5087899 DOI: 10.1371/journal.pone.0163938
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Pattern Factor Loading, Communalities, Variance, and Factor Correlations Based Upon a Principal Components Analysis (PCA) Factoring with Oblim Oblique Rotations for 16 Audio Feature Variables.
Grey areas indicate the low-level audio descriptors that characterize the respective factor.
| Factor | ||||||
|---|---|---|---|---|---|---|
| I | II | III | IV | V | ||
| Low-level Audio Descriptors | High Frequency Band Rhythmical Descriptors | Low Frequency Band Rhythmical Descriptors | Dynamic Descriptors | Mid Frequency Band Rhythmical Descriptors | Timbre Descriptors | |
| IOIMid2SD | 0.97 | 0.02 | 0.05 | 0.14 | -0.04 | 0.79 |
| IOIHiSD | 0.84 | 0.00 | 0.02 | -0.15 | -0.07 | 0.83 |
| evDensityHi | -0.65 | 0.05 | 0.21 | 0.20 | -0.08 | 0.79 |
| evDensityMid2 | -0.56 | 0.02 | 0.18 | 0.34 | -0.05 | 0.78 |
| evDensityLow | 0.07 | 0.90 | -0.12 | 0.08 | 0.09 | 0.81 |
| evDensMetrLevelLow | 0.07 | 0.84 | 0.04 | 0.06 | 0.09 | 0.72 |
| IOILowSD | 0.10 | -0.83 | -0.10 | 0.18 | 0.16 | 0.74 |
| rmsMean | -0.04 | 0.06 | -0.93 | 0.18 | 0.04 | 0.78 |
| rmsSD | -0.13 | 0.12 | 0.78 | 0.10 | -0.18 | 0.85 |
| centSD | 0.03 | 0.01 | 0.59 | 0.30 | 0.47 | 0.71 |
| IOIMid1SD | 0.02 | -0.04 | 0.07 | -0.90 | 0.09 | 0.84 |
| evDensityMid1 | -0.09 | 0.04 | -0.01 | 0.87 | -0.05 | 0.87 |
| fluxMean | -0.14 | 0.05 | -0.12 | -0.12 | 0.92 | 0.87 |
| centMean | 0.39 | -0.16 | 0.10 | -0.13 | 0.65 | 0.80 |
| roughnessMeanFull | 0.21 | -0.05 | -0.32 | -0.35 | 0.49 | 0.81 |
| fluxSD | -0.23 | 0.00 | 0.39 | 0.28 | 0.47 | 0.62 |
| Percentage of total variance accounted for | 0.25 | 0.18 | 0.19 | 0.21 | 0.17 | |
| Cumulative variance accounted for | 0.19 | 0.34 | 0.49 | 0.66 | 0.79 | |
| I | 1.000 | |||||
| II | -0.21 | 1.000 | ||||
| III | -0.27 | 0.05 | 1.000 | |||
| IV | -0.51 | 0.27 | 0.35 | 1.000 | ||
| V | 0.14 | -0.13 | -0.03 | -0.15 | 1.000 | |
Fig 1Results of K-means clustering of the stimuli for musical samples.
(A) Results are derived from the five factor scores gained from the low-level audio descriptors (see Table 1). Stimuli are labeled with their Track IDs and plotted in a 2-dimensional representation (Component/Factor 1 and 2) of the four clusters (1 = black, 2 = grey, 3 = green, 4 = pink) presented in Table 2.
Final Four Cluster Centers by Five Factors comprising 19 low-level audio descriptors.
| I | II | III | IV | V | |
|---|---|---|---|---|---|
| Cluster | High Frequency Band Rhythmical Descriptors | Low Frequency Band Rhythmical Descriptors | Dynamic Descriptors | Mid Frequency Band Rhythmical Descriptors | Timbre Descriptors |
| Cluster 1: Isochronous Bass / Static Timbre | -0.38 | 0.31 | -0.14 | 0.38 | -1.18 |
| Cluster 2: Isochronous Bass / Varying Dynamics / Mid Freq. Variations | -0.49 | 0.52 | 1.07 | 0.69 | 0.20 |
| Cluster 3: Varying Timbres / non-iso. Bass | -0.04 | -0.31 | -0.58 | -0.18 | 0.62 |
| Cluster 4: High Freq. Variations / non-iso. Bass | 1.81 | -0.86 | -0.45 | -1.67 | 0.33 |
Fig 2Distribution of the musical stimuli.
(A) Musical stimuli are drawn from 11 sub-genres of EDM across the 4 clusters derived from the audio descriptors.
Summary Statistics from the MFR-RS Model showing overall significance for all 6 facets (Musical Sample, Rater, Items, Subgenres, Musical Preference and Clusters derived from the Audio Analysis).
| Facets | ||||||
|---|---|---|---|---|---|---|
| Musical Sample | Rater | Item | Subgenre | Music Pref | Cluster | |
| 0.07 | 0.00 | 0.00 | 0.09 | 0.06 | .07 | |
| 0.19 | 0.49 | 0.54 | 0.16 | 0.10 | .05 | |
| 198 | 99 | 19 | 11 | 5 | 4 | |
| 0.99 | 0.98 | 0.99 | 0.99 | 0.97 | 1.01 | |
| 0.33 | 0.35 | 0.29 | 0.11 | 0.09 | 0.08 | |
| -0.20 | -0.30 | -0.50 | -0.30 | -0.70 | -0.10 | |
| 1.70 | 2.40 | 4.70 | 2.20 | 2.60 | 2.40 | |
| 1.00 | 0.99 | 1.00 | 0.99 | 0.98 | 1.02 | |
| 0.34 | 0.34 | 0.30 | 0.11 | 0.09 | 0.07 | |
| -0.10 | -0.30 | -0.40 | -0.10 | -0.40 | 0.20 | |
| 1.70 | 2.30 | 4.70 | 2.20 | 2.60 | 2.10 | |
| 0.89 | 0.91 | 0.99 | 0.91 | 0.87 | 0.67 | |
| 1582.10 | 1036.30 | 1345.80 | 126.10 | 51.80 | 9.4 | |
| 176 | 98 | 18 | 10 | 4 | 3 | |
* p < 0.01
** p = .02
Fig 3Visual depiction of the cluster calibration in relation to item calibration on the logit scale.
(A) Cluster spread ranged from 0.01 logits to 0.16 logits and demonstrated significant distinction according to one fit item: Inartistic/Artistic (C8).
Calibration of the Item Facet.
Items (C = cognitive, A = affective, P = psychomotor) are ordered by their endorsability. Most difficult item to rate (C9) on top, easiest (C1) on bottom. C, A, and P items are mixed in their ratings and spread across the logit scale.
| Item | Observed Average Rating | Measure | SE | Infit | Std. Infit | Outfit | Std. Outfit |
|---|---|---|---|---|---|---|---|
| C9 (Subtle-Obvious) | 2.03 | 0.98 | 0.07 | 1.43 | 5.80 | 1.57 | 7.18 |
| C5 (Delicate-Rugged) | 2.07 | 0.92 | 0.07 | 1.16 | 2.37 | 1.22 | 3.06 |
| A1 (Unemotional-Emotional) | 2.19 | 0.74 | 0.07 | 1.14 | 2.14 | 1.15 | 2.20 |
| A2 (Unforgettable-Forgettable) | 2.23 | 0.68 | 0.07 | 0.91 | -1.34 | 0.95 | -0.79 |
| C4 (Simple-Complex) | 2.30 | 0.57 | 0.06 | 1.41 | 5.71 | 1.40 | 5.55 |
| P3 (Dance) | 2.31 | 0.56 | 0.06 | 1.29 | 4.21 | 1.27 | 3.87 |
| C8 (Inartistic-Artistic) | 2.57 | 0.18 | 0.06 | 0.73 | -4.58 | 0.73 | -4.60 |
| A5 (Distasteful-Delightful) | 2.69 | 0.00 | 0.07 | 0.55 | -8.25 | 0.56 | -7.95 |
| A9 (Cold-Hot) | 2.73 | -0.06 | 0.07 | 0.86 | -2.16 | 0.86 | -2.14 |
| A4 (Boring-Thrilling) | 2.75 | -0.09 | 0.07 | 0.79 | -3.39 | 0.78 | -3.54 |
| A7 (Dejected-Elated) | 2.81 | -0.19 | 0.07 | 0.55 | -8.16 | 0.56 | -7.86 |
| P2 (Nod head) | 2.86 | -0.26 | 0.07 | 1.38 | 5.07 | 1.34 | 4.59 |
| C6 (Indefinite-Clear) | 2.87 | -0.29 | 0.07 | 1.12 | 1.76 | 1.11 | 1.55 |
| C2 (Unbalanced-Balanced) | 2.93 | -0.37 | 0.07 | 1.05 | 0.76 | 1.06 | 0.87 |
| P1 (Tap feet) | 2.93 | -0.38 | 0.07 | 1.12 | 1.69 | 1.08 | 1.15 |
| A8 (Sad-Joyful) | 2.94 | -0.39 | 0.07 | 0.59 | -7.00 | 0.61 | -6.52 |
| C3 (Disorderly-Orderly) | 2.94 | -0.39 | 0.07 | 1.27 | 3.68 | 1.26 | 3.49 |
| C7 (Plain-Ornate) | 2.95 | -0.41 | 0.07 | 0.59 | -6.98 | 0.60 | -6.75 |
| A6 (Deadening-Enlivening) | 2.95 | -0.42 | 0.07 | 0.73 | -4.38 | 0.74 | -4.05 |
| A3 (Depressing-Exciting) | 3.08 | -0.64 | 0.07 | 0.78 | -3.28 | 0.76 | -3.25 |
| C1 (Unstructured-Structured) | 3.14 | -0.74 | 0.07 | 1.33 | 4.19 | 1.33 | 4.11 |
Note. The items are arranged in measure (endorsability) order, from low to high.
Calibration of the Cluster Facet.
Spread of the clusters across the logit scale.
| Cluster | Observed Average Rating | Measure | SE | Infit | Std. Infit | Outfit | Std. Outfit |
|---|---|---|---|---|---|---|---|
| Cluster 4 (High Freq. Variations / non-iso. Bass) | 2.81 | 0.16 | 0.04 | 1.13 | 3.17 | 1.13 | 2.97 |
| Cluster 3 (Varying Timbres / non-iso. Bass) | 2.68 | 0.08 | 0.03 | 0.98 | -0.90 | 0.98 | -0.88 |
| Cluster 2 (Isochronous Bass / Varying Dynamics / Mid Freq. Variations) | 2.68 | 0.04 | 0.03 | 0.91 | -3.40 | 0.93 | -2.66 |
| Cluster 1 (Iso. Bass / Static Timbre) | 2.60 | 0.01 | 0.03 | 1.02 | 0.78 | 1.04 | 1.21 |
Summary of Differential Facet Functioning Statistics (Music Preference Interactions) for Item Exhibiting | Z | > = 2.0.
Showing only selected items, which were significantly overrated by listeners with specific musical preferences.
| Music Pref. | Infit MSQ | Outfit MSQ | Item | Total Observed | Total Expected | Stand. Mean Residual (obs-exp) | Bias Logit | ||
|---|---|---|---|---|---|---|---|---|---|
| Sophisticated | 1.00 | 1.00 | C1 (Unstru-Struct) | 132.00 | 121.81 | 0.26 | 0.53 | 0.22 | 2.18 |
| Sophisticated | 1.30 | 1.30 | C3 (Disord-Ord) | 124.00 | 113.80 | 0.26 | 0.46 | 0.22 | 2.07 |
| Intense | 1.40 | 1.40 | P3 (Dance) | 211.00 | 233.28 | -0.22 | -0.33 | 0.12 | -2.68 |
| Sophisticated | 1.00 | 0.90 | C4 (Simp-Compx) | 76.00 | 89.11 | -0.34 | -0.52 | 0.21 | -2.54 |