| Literature DB >> 28421007 |
Irene Eunyoung Lee1,2, Charles-Francois V Latchoumane3, Jaeseung Jeong1,4.
Abstract
Emotional visual music is a promising tool for the study of aesthetic perception in human psychology; however, the production of such stimuli and the mechanisms of auditory-visual emotion perception remain poorly understood. In Experiment 1, we suggested a literature-based, directive approach to emotional visual music design, and inspected the emotional meanings thereof using the self-rated psychometric and electroencephalographic (EEG) responses of the viewers. A two-dimensional (2D) approach to the assessment of emotion (the valence-arousal plane) with frontal alpha power asymmetry EEG (as a proposed index of valence) validated our visual music as an emotional stimulus. In Experiment 2, we used our synthetic stimuli to investigate possible underlying mechanisms of affective evaluation mechanisms in relation to audio and visual integration conditions between modalities (namely congruent, complementation, or incongruent combinations). In this experiment, we found that, when arousal information between auditory and visual modalities was contradictory [for example, active (+) on the audio channel but passive (-) on the video channel], the perceived emotion of cross-modal perception (visual music) followed the channel conveying the stronger arousal. Moreover, we found that an enhancement effect (heightened and compacted in subjects' emotional responses) in the aesthetic perception of visual music might occur when the two channels contained contradictory arousal information and positive congruency in valence and texture/control. To the best of our knowledge, this work is the first to propose a literature-based directive production of emotional visual music prototypes and the validations thereof for the study of cross-modally evoked aesthetic experiences in human subjects.Entities:
Keywords: aesthetic experience; aesthetic perception; art and emotion; auditory-visual integration; auditory-visual perception; cross-modal integration; music and emotion; visual music and emotion
Year: 2017 PMID: 28421007 PMCID: PMC5379063 DOI: 10.3389/fpsyg.2017.00440
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1A model of continuous auditory-visual modulation integration perception. The diagram begins with the auditory-visual stimuli that are presented to the individual organisms and processed by evaluation functions (V) into their psychological values of (Ov) and (Oa). These psychological stimuli combine with continuous internal integration functions (S) through the modulation values (Mvm) and predictive deviation values (Pvm) into tentative implicit synthesizing responses (Evm). And the multisensory aesthetic evaluation function (R) transports them to elicit the immanent, holistic aesthetic emotion of the visual music (Rvm).
Summary of musical design structural guidelines selected from reviewed studies and artists' decisions.
| Harmony | Consonant | Consonant | Consonant (with harmonic tensions such as b9, 9, 11, and 13, for example) | For comprehensive reviews for studies of correlations between musical expressions and emotions, see (Bunt and Pavlicevic, |
| Tonal Modality | Major (in F#) | minor (in C) | minor (in D) | |
| Melodic intervals | Wide Range, High-Pitched | Narrow Range, Low-Pitched | Complex (narrow to wide) Range, Large Intervals | |
| Pitch level | High | Mid | High+Low | |
| Loudness and loudness variation | Small few/no change | Soft few/no change | Loud rapid changes | |
| Articulation | Staccato | Legato | Complex (staccato + legato) | |
| Tempo and pulse | Firm pulse, fast and lively (112 bpm, Dotted Tempo, 4/4 meter) | Relaxed pulse, gentle and slow pulse (around 60 bpm, 4/4 meter) | Firm pulse, variedly quick and rapid (from 174 to 87 bpm, 4/4 meter) | |
| Adjectives for timbre and texture (depicted by main instruments and musical style) | Bright and playful (female vocal, latin percussive) | Calm, serene, with slow breathing (korean daegum flute, ethnic) | Tension and alleviation (electronic synthesizer lead, drum and bass) | |
| Pitch contour and melodic direction | Slightly Ascending at the Ending | Descending toward Ending | Building Up & Down | |
| Motive phrase melodic motion | Intervallic leaps, descending pitch contour | Stepwise + Leaps, ascending pitch contour | From stepwise to intervallic leaps, high complexity |
Prototypical directive guidelines provided information about basic settings in important musical structural expressions to generate the associated target emotional meaning in the music.
Sub-element of the component chosen by the artist considering the target emotion of the music production.
Summary of visual design structural guidelines selected from reviewed studies and artists' decisions.
| Warm, delight, cheerful, joyous | Calm, soothing, serene, tranquil | Exploding, lively, sprightly, exciting | Lundholm, | |
| Water droplet: circular form | Water waves: S-curve flows | Water Vortex: | Hevner, | |
| Repetition of circles | Descending S-curves then focusing in | Expanding out then repetition of rising bursts | Poffenberger and Barrows, | |
| Red, orange, yellow | Blue, green, purple, white, brown | Blue, green, yellow, white | Hevner, | |
| Uphill Slant (Lower Left to Upper Right), | Horizontal (Left To Right), | Vertical Upward, | Block, | |
| Medium | Slow | Quick | Block, | |
| Strong and fast changes, jump cuts, long exposure of objects | Slow and weak changes, smooth and dissolve, small number of scenes | Contrasting between different images, jump cuts, many scene changes | Block, |
Prototypical directive guidelines provided information about basic settings in important visual and animation structural expressions to generate the associated target emotional meaning in the abstract animation.
Sub-element of the component chosen by the artist considering the target emotion of the music production.
Selective studies vary in methodological approaches to investigating the influence of the related visual component and its expressions regarding emotion.
Figure 2Snapshot information about the visual channel, the auditory channel, and structural variations in the temporal streams of our three positive emotion visual music stimuli.
Mean and standard deviation comparison for evaluation, activity, and potency indices of all three stimuli in three different modalities.
| Audio Only Clip 1 (A1) | 8 | 0.60 | 0.17 | 0.54 | 0.16 | 0.27 | 0.15 |
| Visual Only Clip 1 (V1) | 8 | 0.47 | 0.16 | 0.06 | 0.33 | 0.08 | 0.36 |
| A1V1 (A1V1) | 8 | 0.51 | 0.20 | 0.07 | 0.43 | 0.34 | 0.17 |
| Audio Only Clip 2 (A2) | 8 | 0.32 | 0.18 | −0.06 | 0.28 | 0.32 | 0.29 |
| Visual Only Clip 2 (V2) | 8 | 0.15 | 0.11 | −0.02 | 0.42 | 0.08 | 0.36 |
| A2V2 (A2V2) | 8 | 0.13 | 0.27 | −0.27 | 0.24 | −0.11 | 0.24 |
| Audio Only Clip 3 (A3) | 8 | 0.18 | 0.30 | 0.81 | 0.12 | −0.28 | 0.27 |
| Visual Only Clip 3 (V3) | 8 | 0.28 | 0.25 | 0.59 | 0.27 | 0.26 | 0.22 |
| A3V3 (A3V3) | 8 | 0.34 | 0.37 | 0.51 | 0.27 | 0.13 | 0.35 |
Figure 3Comparison of connotative meanings of three clip stimuli using the evaluation and activity indices into 2D planes. Subjects' responses are shown for the conditions (A) audio only, (B) visual only, and (C) visual music (combined auditory and visual stimuli).
Results of Two-Way Repeated Measure MANOVA using the Huynh-Feldt Correction following a violation in the assumption of sphericity.
| AestheticEmotion | 1.527 | 1.785 | 0.855 | 11.018 | 0.000 | 0.208 |
| AestheticEmotion * Modality | 0.324 | 1.785 | 0.182 | 2.340 | 0.109 | 0.053 |
| AestheticEmotion * Emotion | 4.933 | 3.570 | 1.382 | 17.796 | 0.000 | 0.459 |
| AestheticEmotion * Modality * Emotion | 0.984 | 3.570 | 0.257 | 3.548 | 0.013 | 0.145 |
| Error(Affection) | 5.821 | 74.978 | 0.078 |
The analysis checks for effects of variables in modality and target emotion categories on affection information (evaluation, activity, and potency).
Results of the Kruskal-Wallis Test comparing the effect of our designed emotional stimuli on the three perceived emotion assessment indices (evaluation, activity, and potency).
| Clip 1 Target-Emotion ‘Happy’ | Evaluation | 24 | 0.526 | 0.178 | 0.250 | 0.880 | 0.375 | 0.531 | 0.688 |
| Activity | 24 | 0.224 | 0.387 | −0.940 | 0.750 | 0.000 | 0.281 | 0.438 | |
| Potency | 24 | 0.237 | 0.204 | −0.060 | 0.630 | 0.063 | 0.250 | 0.359 | |
| Total | 24 | 24.330 | 26.811 | 1.000 | 61.000 | 1.000 | 11.000 | 61.000 | |
| Clip 2 Target-Emotion ‘Relaxed’ | Evaluation | 24 | 0.198 | 0.227 | −0.310 | 0.690 | 0.063 | 0.219 | 0.359 |
| Activity | 24 | −0.117 | 0.328 | −0.560 | 0.560 | −0.422 | −0.188 | 0.125 | |
| Potency | 24 | 0.086 | 0.344 | −0.500 | 0.690 | −0.172 | −0.031 | 0.359 | |
| Total | 24 | 28.670 | 25.481 | 2.000 | 62.000 | 2.000 | 22.000 | 62.000 | |
| Clip 3 Target-Emotion ‘Vigorous’ | Evaluation | 16 | 0.268 | 0.306 | −0.310 | 0.880 | 0.063 | 0.281 | 0.469 |
| Activity | 16 | 0.635 | 0.256 | 0.000 | 0.940 | 0.500 | 0.719 | 0.813 | |
| Potency | 16 | 0.000 | 0.339 | −0.750 | 0.690 | −0.250 | −0.031 | 0.250 | |
| Total | 16 | 33.000 | 25.022 | 3.000 | 63.000 | 3.000 | 33.000 | 63.000 | |
| Clip 1 Target-Emotion ‘Happy’ | Evaluation | A1 | 8 | 3.227 | 2 | 0.199 | 0.140 | ||
| A1V1 | 8 | 11.380 | |||||||
| V1 | 8 | 10.060 | |||||||
| Total | 24 | ||||||||
| Activity | A1 | 8 | 11.409 | 2 | 0.003 | 0.496 | |||
| A1V1 | 8 | 10.130 | |||||||
| V1 | 8 | 8.130 | |||||||
| Total | 24 | ||||||||
| Potency | A1 | 8 | 14.060 | 6.244 | 2 | 0.044 | 0.271 | ||
| A1V1 | 8 | ||||||||
| V1 | 8 | 7.560 | |||||||
| Total | 24 | ||||||||
| Clip 2 Target-Emotion ‘Relaxed’ | Evaluation | A2 | 8 | 3.674 | 2 | 0.159 | 0.160 | ||
| A2V2 | 8 | 10.560 | |||||||
| V2 | 8 | 10.560 | |||||||
| Total | 24 | ||||||||
| Activity | A2 | 8 | 2.379 | 2 | 0.304 | 0.103 | |||
| A2V2 | 8 | 9.380 | |||||||
| V2 | 8 | 13.880 | |||||||
| Total | 24 | ||||||||
| Potency | A2 | 8 | 6.687 | 2 | 0.035 | 0.291 | |||
| A2V2 | 8 | 8.250 | |||||||
| V2 | 8 | 11.940 | |||||||
| Total | 24 | ||||||||
| Clip 3 Target-Emotion ‘Vigorous’ | Evaluation | A3 | 8 | 10.060 | 1.441 | 2 | 0.487 | 0.063 | |
| A3V3 | 8 | 13.630 | |||||||
| V3 | 8 | ||||||||
| Total | 24 | ||||||||
| Activity | A3 | 8 | 6.143 | 2 | 0.046 | 0.267 | |||
| A3V3 | 8 | 9.190 | |||||||
| V3 | 8 | 10.880 | |||||||
| Total | 24 | ||||||||
| Potency | A3 | 8 | 6.560 | 8.551 | 2 | 0.014 | 0.372 | ||
| A3V3 | 8 | 15.130 | |||||||
| V3 | 8 | ||||||||
| Total | 24 | ||||||||
The rank-based non-parametric analysis was used to check the rank-order of the three modalities (audio only, video only, visual music) and to determine if there were statistically significant differences between two (or more) modalities. For further investigation, a higher value in the mean rank indicates the upper rank; for example, in the “happy” clip group, A1 (Mean rank = 16.06) is higher in rank than the other two modalities, V1 (Mean rank = 10.06) and A1V1 (Mean rank = 11.38) on the evaluation index.
Indicates a significant difference p < 0.05, while bold type indicates the highest rank per category.
Figure 4Scatter plot and correlation between evaluation rating (valence) and upperAlpha frontal asymmetry. Upper Alpha power was estimated for the early presentation of clips (10 s), and was log transformed (10log10 uV2). Frontal asymmetry was estimated as the difference of F3-F4 of log transform of upperAlpha band power [10log10(P)]. Pearson's correlation was used (N = 8 subjects).
Figure 5EEG responses to each clip presentation. (A) F3 and F4 frontal upper Alpha power (10log10 uV2) for the first 10 s of clip presentation for clip 1 (happy), clip 2 (relaxed), and clip 3 (vigorous). (B) The overall topographic EEG changes from the baseline for the early presentation of each clip (first 10 s). The power is normalized to the baseline (division) and log transformed (10*log10) to provide the baseline change in dB. See Figure S6 the full temporal dynamics of the EEG response to each clip. Data are presented on a scatter plot and mean ± s.e.m; ** indicates a p ≤ 0.001.
Mean and Standard Deviation Comparison between Original and Altered Groups for Evaluation, Activity, and Potency Indices of the Control Clip (A3V3).
| Evaluation | Original group | 42 | 0.226 | 0.306 | 0.047 | 0.131 | 0.322 | −0.438 | 0.938 |
| Altered group | 53 | 0.212 | 0.292 | 0.040 | 0.132 | 0.293 | −0.500 | 0.875 | |
| Total | 95 | 0.218 | 0.297 | 0.030 | 0.158 | 0.279 | −0.500 | 0.938 | |
| Activity | Original group | 42 | 0.238 | 0.216 | 0.033 | 0.171 | 0.305 | −0.313 | 0.750 |
| Altered group | 53 | 0.251 | 0.284 | 0.039 | 0.173 | 0.330 | −0.625 | 1.000 | |
| Total | 95 | 0.245 | 0.255 | 0.026 | 0.193 | 0.297 | −0.625 | 1.000 | |
| Potency | Original group | 42 | 0.109 | 0.373 | 0.058 | −0.008 | 0.225 | −0.938 | 0.938 |
| Altered group | 53 | 0.179 | 0.335 | 0.046 | 0.087 | 0.272 | −0.625 | 0.813 | |
| Total | 95 | 0.148 | 0.352 | 0.036 | 0.076 | 0.220 | −0.938 | 0.938 | |
Means and STDs of three indices describing the emotional meanings of each clip stimulus in statistics: the highest .
| Audio only | Evaluation | 0.59 ± 0.25 (A1) | 0.33 ± 0.21 (A2) | 0.16 ± 0.23 (A3) | −0.30 ± 0.24 (A4) | −0.35 ± 0.26 (A5) |
| Activity | 0.29 ± 0.15 (A1) | 0.18 ± 0.22 (A2) | 0.50 ± 0.20 (A3) | 0.45 ± 0.22 (A4) | 0.29 ± 0.15 (A5) | |
| Potency | 0.52 ± 0.24 (A1) | 0.22 ± 0.27 (A2) | 0.04 ± 0.29 (A3) | −0.49 ± 0.21 (A4) | −0.23 ± 0.28 (A5) | |
| Visual only | Evaluation | 0.40 ± 0.35 (V1) | 0.10 ± 0.37 (V2) | −0.21 ± 0.29 (V4) | −0.20 ± 0.40 (V5) | |
| Activity | −0.05 ± 0.30 (V1) | −0.17 ± 0.35 (V2) | 0.21 ± 0.21 (V3) | 0.24 ± 0.30 (V4) | 0.09 ± 0.35 (V5) | |
| Potency | 0.35 ± 0.34 (V1) | 0.00 ± 0.33 (V2) | 0.31 ± 0.37 (V3) | −0.28 ± 0.27 (V4) | −0.39 ± 0.37 (V5) | |
| Original visual music | Evaluation | 0.23 ± 0.31 (A3V3) | − | −0.20 ± 0.33 (A5V5) | ||
| Activity | 0.26 ± 0.15 (A1V1) | 0.17 ± 0.25 (A2V2) | 0.24 ± 0.22 (A3V3) | 0.03 ± 0.27 (A4V4) | 0.25 ± 0.16 (A5V5) | |
| Potency | 0.51 ± 0.25 (A1V1) | 0.37 ± 0.25 (A2V2) | 0.11 ± 0.37 (A3V3) | −0.48 ± 0.31 (A4V4) | −0.27 ± 0.34 (A5V5) | |
| Altered visual music | Evaluation | 0.41 ± 0.37 (A1V4) | 0.06 ± 0.32 (A2V5) | 0.21 ± 0.29 (A3V3) | −0.36 ± 0.32 (A4V2) | − |
| Activity | 0.25 ± 0.20 (A1V4) | −0.01 ± 0.28 (A2V5) | 0.25 ± 0.28 (A3V3) | −0.11 ± 0.23 (A4V2) | 0.11 ± 0.35 (A5V1) | |
| Potency | 0.28 ± 0.38 (A1V4) | 0.07 ± 0.28 (A2V5) | 0.18 ± 0.34 (A3V3) | −0.37 ± 0.25 (A4V2) | −0.37 ± 0.25 (A5V1) |
Data are described as mean ± SD.
Figure 6Comparisons of immanent emotions in the three clip stimuli using the evaluation and activity indices (circumplex method) for the modalities. (A) audio-only, (B) visual-only, and (C) visual music (cross-modal stimuli), represented by blue square (happy, clip 1), red disk (relaxed, clip 2) green triangle (vigorous, clip 3), pink star (clip 4, no target emotion), and black six-pointed star (clip 5, no target emotion). The symbols show the subjects' evaluations and activity index values for each clip.
Figure 7Bar graph summary of the immanent emotion for the nine visual music stimuli using evaluation, activity, and potency for the combination conditions of (A) conformance, (B) complementation and (C) contest. Data are shown as mean ± s.e.m. The mean increase of evaluation index was observed for clips A1V1 and A2V2 compared to their respective unimodal responses.
Silhouette clustering index values using 2D factors (evaluation and activity) rating indices for the data presented in Figure .
| Audio Only | 0.274048 | − | 0.177347 | −0.101890 | ||
| Visual Only | −0.103724 | −0.172770 | −0.013282 | −0.301105 | −0.076232 | |
| Original Visual Music | −0.134250 (A2V2) | −0.181865 (A3V3) | −0.047418 (A5V5) | 0.045774 | ||
| Altered Visual Music | (A1V4) 0.047593 | (A2V5) −0.192984 | (A3V3) | (A4V2) 0.048313 | (A5V1) −0.111249 | −0.065979 |
Silhouette values are given for individual clips and overall clustering quality. Silhouette values range within [−1, 1], with 1 indicating the best clustering and −1 the worst clustering. Bold values indicate the best clustering values relative to each clip (row-wise comparison).
, control clip.
Visual music stimuli combination conditions information in the current study.
| α 2( | β 3( | |||||
| α 2( | β 3( | |||||
| α 1 | ||||||
| β 2( | α 1( | |||||
| β 2( | α 1 | |||||
Clip name and the emotional contexts (valence and arousal information) of each unimodal clip are indicated in bold types. Cross-binding codes indicate auditory-visual (A–V) combination conditions for all of the visual music stimuli in the current study. For example, β 2(I, C) where A2 combines with V5 indicates that it is an altered combination in contest condition with incongruent relationship in valence and congruent relationship in arousal between auditory and visual channels' emotional information. The emotional meanings were validated using similar evaluation and activity indices from the unimodal stimuli survey results in Experiment 1 and Experiment 2. α, original combination; β, altered combination; 1, Conformance; 2, Complement; 3, Contest;
, control.
Compactness of response.
| Audio only | 0.34628204 | 0.394233616 | ||||
| Visual only | 0.499628207 | 0.555422101 | 0.437046321 | 0.464087266 | 0.576930643 | 0.506622908 |
| Original visual music | 0.468533303 (A3V3) | 0.479346853 (A4V4) | 0.466745095 (A5V5) | 0.420712794 | ||
| Altered visual music | 0.454134911 A1V4 | 0.44759437 (A2V5) | 0.460930243 (A3V3) | 0.419427517 (A4V2) | 0.448666851 (A5V1) | 0.44759437 |
Average distances to centroid using the evaluation, activity, and potency indices. The centroid for each clip was estimated first, and the average distance of each subject's response for the same clip was then estimated. Closer to zero indicates more compactness. Bold values indicate the closest compactness to relative centroid of each clip.
, control clip.
Figure 8Hypothesized conditional enhancement effects as added values in visual music. The congruency in valence and texture/control aspects might stabilize the conjoint gestalt or schema to form good predictability (P) whereas the substantial differences in the arousal aspect between the dual modalities might emphasize focus/attention (C) that alter the continuous coding of predictive errors and recovering to predictable patterns for rewarding (E).
Figure 9A hypothesized functional-information-integration process within our model of continuous auditory-visual modulation integration perception. Visual Music aesthetic experience requires multi-sensory perceptual, cognitive judgment, and processing that involve a number of continuous processing stages such as affective analysis (in valence, activity, and potency), perceptual and cognitive modulation analysis, and predictive deviation analysis.
| Clip 1 Group | Evaluation | 155 | 0.515 | 0.321 | −0.560 | 1.000 | 0.375 | 0.563 | 0.750 |
| Activity | 155 | 0.209 | 0.233 | −0.560 | 0.630 | 0.125 | 0.250 | 0.375 | |
| Potency | 155 | 0.404 | 0.328 | −0.880 | 0.940 | 0.250 | 0.438 | 0.625 | |
| Total | 155 | 18.610 | 20.115 | 1.000 | 61.000 | 11.000 | 14.000 | 14.000 | |
| Clip 2 Group | Evaluation | 155 | 0.224 | 0.335 | −0.560 | 1.000 | 0.000 | 0.250 | 0.438 |
| Activity | 155 | 0.050 | 0.303 | −0.750 | 0.750 | −0.125 | 0.063 | 0.250 | |
| Potency | 155 | 0.172 | 0.312 | −0.630 | 0.940 | −0.063 | 0.188 | 0.438 | |
| Total | 155 | 25.740 | 18.849 | 2.000 | 62.000 | 22.000 | 25.000 | 25.000 | |
| Clip 3 Group | Evaluation | 155 | 0.214 | 0.286 | −0.500 | 1.000 | 0.000 | 0.250 | 0.438 |
| Activity | 155 | 0.293 | 0.259 | −0.630 | 1.000 | 0.125 | 0.250 | 0.500 | |
| Potency | 155 | 0.153 | 0.351 | −0.940 | 0.940 | −0.063 | 0.188 | 0.375 | |
| Total | 155 | 32.180 | 18.717 | 3.000 | 63.000 | 33.000 | 34.000 | 34.000 | |
| Clip 4 Group | Evaluation | 155 | −0.340 | 0.294 | −1.000 | 0.630 | −0.563 | −0.375 | −0.125 |
| Activity | 155 | 0.108 | 0.329 | −0.750 | 0.880 | −0.125 | 0.125 | 0.375 | |
| Potency | 154 | −0.416 | 0.295 | −0.940 | 0.810 | −0.625 | −0.438 | −0.250 | |
| Total | 155 | 38.280 | 19.531 | 4.000 | 64.000 | 42.000 | 42.000 | 44.000 | |
| Clip 5 Group | Evaluation | 154 | −0.305 | 0.322 | −1.000 | 0.560 | −0.516 | −0.313 | −0.063 |
| Activity | 155 | 0.185 | 0.282 | −0.690 | 1.000 | 0.000 | 0.188 | 0.375 | |
| Potency | 154 | −0.316 | 0.307 | −1.000 | 0.560 | −0.500 | −0.344 | −0.125 | |
| Total | 155 | 44.730 | 21.274 | 5.000 | 65.000 | 51.000 | 51.000 | 55.000 | |
| Clip 1 Group | Evaluation | A1 | 33 | 87.110 | 18.206 | 3 | 0.000 | 0.118 |
| A1V1 | 42 | |||||||
| A1V4 | 53 | 64.390 | ||||||
| V1 | 27 | 62.170 | ||||||
| Total | 155 | |||||||
| Activity | A1 | 33 | 90.180 | 27.684 | 3 | 0.000 | 0.180 | |
| A1V1 | 42 | 85.060 | ||||||
| A1V4 | 53 | 85.640 | ||||||
| V1 | 27 | 37.130 | ||||||
| Total | 155 | |||||||
| Potency | A1 | 33 | 92.890 | 16.127 | 3 | 0.001 | 0.105 | |
| A1V1 | 42 | 92.520 | ||||||
| A1V4 | 53 | 61.710 | ||||||
| V1 | 27 | 69.190 | ||||||
| Total | 155 | |||||||
| Clip 2 Group | Evaluation | A2 | 33 | 92.300 | 36.510 | 3 | 0.000 | 0.237 |
| A2V2 | 42 | |||||||
| A2V5 | 53 | 55.020 | ||||||
| V2 | 27 | 62.590 | ||||||
| Total | 155 | |||||||
| Activity | A2 | 33 | 97.790 | 26.277 | 3 | 0.000 | 0.171 | |
| A2V2 | 42 | 93.810 | ||||||
| A2V5 | 53 | 68.490 | ||||||
| V2 | 27 | 47.890 | ||||||
| Total | 155 | |||||||
| Potency | A2 | 33 | 83.800 | 31.111 | 3 | 0.000 | 0.202 | |
| A2V2 | 42 | 107.130 | ||||||
| A2V5 | 53 | 62.240 | ||||||
| V2 | 27 | 56.540 | ||||||
| Total | 155 | |||||||
| Clip 3 Group | Evaluation | A3 | 33 | 66.970 | 3.019 | 3 | 0.389 | 0.020 |
| A3V3 | 42 | 80.830 | ||||||
| A3V3c | 53 | 78.600 | ||||||
| V3 | 27 | |||||||
| Total | 155 | |||||||
| Activity | A3 | 33 | 116.110 | 31.326 | 3 | 0.000 | 0.203 | |
| A3V3 | 42 | 67.930 | ||||||
| A3V3c | 53 | 70.920 | ||||||
| V3 | 27 | 60.980 | ||||||
| Total | 155 | |||||||
| Potency | A3 | 33 | 61.650 | 10.298 | 3 | 0.016 | 0.067 | |
| A3V3 | 42 | 73.940 | ||||||
| A3V3c | 53 | 81.310 | ||||||
| V3 | 27 | 97.800 | ||||||
| Total | 155 | |||||||
| Clip 4 Group | Evaluation | A4 | 33 | 85.830 | 10.020 | 3 | 0.018 | 0.065 |
| A4V2 | 53 | 72.420 | ||||||
| A4V4 | 42 | 66.180 | ||||||
| V4 | 27 | |||||||
| Total | 155 | |||||||
| Activity | A4 | 33 | 124.860 | 66.513 | 3 | 0.000 | 0.432 | |
| A4V2 | 53 | 48.070 | ||||||
| A4V4 | 42 | 67.400 | ||||||
| V4 | 27 | 95.960 | ||||||
| Total | 155 | |||||||
| Potency | A4 | 32 | 68.090 | 12.611 | 3 | 0.006 | 0.082 | |
| A4V2 | 53 | 82.140 | ||||||
| A4V4 | 42 | 64.350 | ||||||
| V4 | 27 | 100.000 | ||||||
| Total | 154 | |||||||
| Clip 5 Group | Evaluation | A5 | 32 | 71.410 | 13.786 | 3 | 0.003 | 0.090 |
| A5V1 | 53 | 62.760 | ||||||
| A5V5 | 42 | |||||||
| V5 | 27 | 87.410 | ||||||
| Total | 154 | |||||||
| Activity | A5 | 33 | 97.380 | 14.845 | 3 | 0.002 | 0.096 | |
| A5V1 | 53 | 67.800 | ||||||
| A5V5 | 42 | 86.980 | ||||||
| V5 | 27 | 60.370 | ||||||
| Total | 155 | |||||||
| Potency | A5 | 33 | 90.940 | 7.101 | 3 | 0.069 | 0.046 | |
| A5V1 | 53 | 70.310 | ||||||
| A5V5 | 42 | 83.520 | ||||||
| V5 | 26 | 65.370 | ||||||
| Total | 154 | |||||||
The rank-based non-parametric analysis was used to examine the rank-order of four clips in a group (audio only, video only, original visual music vs. altered visual music), and to determine whether there were statistically significant differences between two (or more) modalities. A higher value in mean rank indicates that the upper rank (for example, for the clip group 1, A1V1 (Mean rank = 98.200) is higher in rank than are the other three clips A1(Mean rank = 87.110), V1 (Mean rank = 10.06) or A1V4 (Mean rank = 64.390) on the evaluation index.
Indicates a significant difference p < 0.05,
indicates p < 0.001, and bold type indicates the highest rank in valence (evaluation).
| Clip 1 Group | Evaluation | 75 | 0.632 | 0.226 | −0.060 | 1.000 | 0.500 | 0.688 | 0.813 |
| Activity | 75 | 0.275 | 0.148 | −0.060 | 0.630 | 0.188 | 0.250 | 0.375 | |
| Potency | 75 | 0.513 | 0.242 | −0.250 | 0.880 | 0.375 | 0.500 | 0.688 | |
| Total | 75 | 6.600 | 4.997 | 1.000 | 11.000 | 1.000 | 11.000 | 11.000 | |
| Clip 2 Group | Evaluation | 75 | 0.388 | 0.250 | −0.130 | 1.000 | 0.188 | 0.375 | 0.563 |
| Activity | 75 | 0.173 | 0.235 | −0.500 | 0.750 | 0.063 | 0.188 | 0.313 | |
| Potency | 75 | 0.307 | 0.268 | −0.250 | 0.940 | 0.125 | 0.313 | 0.500 | |
| Total | 75 | 13.200 | 9.995 | 2.000 | 22.000 | 2.000 | 22.000 | 22.000 | |
| Clip 3 Group | Evaluation | 75 | 0.197 | 0.275 | −0.440 | 0.940 | 0.000 | 0.188 | 0.375 |
| Activity | 75 | 0.352 | 0.245 | −0.310 | 0.810 | 0.188 | 0.375 | 0.500 | |
| Potency | 75 | 0.079 | 0.340 | −0.940 | 0.940 | −0.125 | 0.063 | 0.313 | |
| Total | 75 | 19.800 | 14.992 | 3.000 | 33.000 | 3.000 | 33.000 | 33.000 | |
| Clip 4 Group | Evaluation | 75 | −0.369 | 0.268 | −1.000 | 0.190 | −0.500 | −0.438 | −0.188 |
| Activity | 75 | 0.212 | 0.324 | −0.630 | 0.810 | 0.000 | 0.250 | 0.500 | |
| Potency | 74 | −0.487 | 0.268 | −0.940 | 0.440 | −0.688 | −0.500 | −0.375 | |
| Total | 75 | 26.400 | 19.989 | 4.000 | 44.000 | 4.000 | 44.000 | 44.000 | |
| Clip 5 Group | Evaluation | 74 | −0.267 | 0.308 | −1.000 | 0.500 | −0.500 | −0.219 | −0.063 |
| Activity | 75 | 0.271 | 0.156 | −0.130 | 0.560 | 0.188 | 0.250 | 0.375 | |
| Potency | 75 | −0.254 | 0.313 | −1.000 | 0.500 | −0.500 | −0.250 | −0.063 | |
| Total | 75 | 33.000 | 24.986 | 5.000 | 55.000 | 5.000 | 55.000 | 55.000 | |
| Clip 1 Group | Evaluation | A1 | 33 | 34.920 | 1.183 | 1 | 0.277 | 0.016 |
| A1V1 | 42 | |||||||
| Total | 75 | |||||||
| Activity | A1 | 33 | 39.650 | 0.344 | 1 | 0.558 | 0.005 | |
| A1V1 | 42 | 36.700 | ||||||
| Total | 75 | |||||||
| Potency | A1 | 33 | 38.060 | 0.000 | 1 | 0.983 | 0.000 | |
| A1V1 | 42 | 37.950 | ||||||
| Total | 75 | |||||||
| Clip 2 Group | Evaluation | A2 | 33 | 33.110 | 2.994 | 1 | 0.084 | 0.040 |
| A2V2 | 42 | |||||||
| Total | 75 | |||||||
| Activity | A2 | 33 | 39.200 | 0.180 | 1 | 0.672 | 0.002 | |
| A2V2 | 42 | 37.060 | ||||||
| Total | 75 | |||||||
| Potency | A2 | 33 | 30.970 | 6.173 | 1 | 0.013 | 0.083 | |
| A2V2 | 42 | 43.520 | ||||||
| Total | 75 | |||||||
| Clip 3 Group | Evaluation | A3 | 33 | 33.920 | 2.074 | 1 | 0.150 | 0.028 |
| A3V3 | 42 | |||||||
| Total | 75 | |||||||
| Activity | A3 | 33 | 51.850 | 23.984 | 1 | 0.000 | 0.324 | |
| A3V3 | 42 | 27.120 | ||||||
| Total | 75 | |||||||
| Potency | A3 | 33 | 34.800 | 1.274 | 1 | 0.259 | 0.017 | |
| A3V3 | 42 | 40.510 | ||||||
| Total | 75 | |||||||
| Clip 4 Group | Evaluation | A4 | 33 | 4.027 | 1 | 0.045 | 0.054 | |
| A4V4 | 42 | 33.550 | ||||||
| Total | 75 | |||||||
| Activity | A4 | 33 | 54.440 | 33.707 | 1 | 0.000 | 0.456 | |
| A4V4 | 42 | 25.080 | ||||||
| Total | 75 | |||||||
| Potency | A4 | 32 | 38.660 | 0.164 | 1 | 0.685 | 0.002 | |
| A4V4 | 42 | 36.620 | ||||||
| Total | 74 | |||||||
| Clip 5 Group | Evaluation | A5 | 32 | 30.630 | 5.791 | 1 | 0.016 | 0.079 |
| A5V5 | 42 | |||||||
| Total | 74 | |||||||
| Activity | A5 | 33 | 42.110 | 2.129 | 1 | 0.145 | 0.029 | |
| A5V5 | 42 | 34.770 | ||||||
| Total | 75 | |||||||
| Potency | A5 | 33 | 39.500 | 0.281 | 1 | 0.596 | 0.004 | |
| A5V5 | 42 | 36.820 | ||||||
| Total | 75 | |||||||
The rank-based non-parametric analysis was used to examine the rank-order of the three modalities (audio only, video only, vs. visual music), and to determine whether there were statistically significant differences between two (or more) modalities. A higher value in mean rank indicates the upper rank (for example, in the clip group 1, A1V1 (Mean rank = 40.420) is higher in rank than is A1 (Mean rank = 34.920) on the evaluation index.
Indicates significant difference p < 0.05,
indicates p < 0.001, and bold type indicates the highest rank in valence (evaluation).
| Clip 1 Group | Evaluation | 86 | 0.480 | 0.336 | −0.560 | 1.000 | 0.375 | 0.563 | 0.688 |
| Activity | 86 | 0.264 | 0.182 | −0.500 | 0.630 | 0.188 | 0.281 | 0.375 | |
| Potency | 86 | 0.371 | 0.352 | −0.880 | 0.940 | 0.188 | 0.438 | 0.625 | |
| Total | 86 | 9.010 | 6.359 | 1.000 | 14.000 | 1.000 | 14.000 | 14.000 | |
| Clip 2 Group | Evaluation | 86 | 0.160 | 0.310 | −0.560 | 0.880 | −0.016 | 0.188 | 0.375 |
| Activity | 86 | 0.064 | 0.275 | −0.750 | 0.500 | −0.125 | 0.125 | 0.313 | |
| Potency | 86 | 0.127 | 0.286 | −0.560 | 0.940 | −0.063 | 0.125 | 0.313 | |
| Total | 86 | 16.170 | 11.250 | 2.000 | 25.000 | 2.000 | 25.000 | 25.000 | |
| Clip 4 Group | Evaluation | 86 | −0.339 | 0.289 | −0.940 | 0.630 | −0.563 | −0.375 | −0.125 |
| Activity | 86 | 0.108 | 0.352 | −0.750 | 0.810 | −0.188 | 0.031 | 0.438 | |
| Potency | 85 | −0.423 | 0.288 | −0.940 | 0.810 | −0.563 | −0.438 | −0.313 | |
| Total | 86 | 27.420 | 18.587 | 4.000 | 42.000 | 4.000 | 42.000 | 42.000 | |
| Clip 5 Group | Evaluation | 85 | −0.389 | 0.262 | −0.880 | 0.250 | −0.594 | −0.438 | −0.188 |
| Activity | 86 | 0.182 | 0.300 | −0.690 | 0.810 | −0.063 | 0.250 | 0.438 | |
| Potency | 86 | −0.315 | 0.266 | −1.000 | 0.440 | −0.500 | −0.313 | −0.188 | |
| Total | 86 | 33.350 | 22.501 | 5.000 | 51.000 | 5.000 | 51.000 | 51.000 | |
| Clip 1 Group | Evaluation | A1 | 33 | 5.267 | 1 | 0.022 | 0.062 | |
| A1V4 | 53 | 38.640 | ||||||
| Total | 86 | |||||||
| Activity | A1 | 33 | 44.620 | 0.110 | 1 | 0.740 | 0.001 | |
| A1V4 | 53 | 42.800 | ||||||
| Total | 86 | |||||||
| Potency | A1 | 33 | 54.210 | 9.897 | 1 | 0.002 | 0.116 | |
| A1V4 | 53 | 36.830 | ||||||
| Total | 86 | |||||||
| Clip 2 Group | Evaluation | A2 | 33 | 16.803 | 1 | 0.000 | 0.198 | |
| A2V5 | 53 | 34.810 | ||||||
| Total | 86 | |||||||
| Activity | A2 | 33 | 54.240 | 9.982 | 1 | 0.002 | 0.117 | |
| A2V5 | 53 | 36.810 | ||||||
| Total | 86 | |||||||
| Potency | A2 | 33 | 51.470 | 5.502 | 1 | 0.019 | 0.065 | |
| A2V5 | 53 | 38.540 | ||||||
| Total | 86 | |||||||
| Clip 4 Group | Evaluation | A4 | 33 | 1.905 | 1 | 0.168 | 0.022 | |
| A4V2 | 53 | 40.580 | ||||||
| Total | 86 | |||||||
| Activity | A4 | 33 | 67.650 | 50.346 | 1 | 0.000 | 0.592 | |
| A4V2 | 53 | 28.460 | ||||||
| Total | 86 | |||||||
| Potency | A4 | 32 | 38.330 | 1.858 | 1 | 0.173 | 0.022 | |
| A4V2 | 53 | 45.820 | ||||||
| Total | 85 | |||||||
| Clip 5 Group | Evaluation | A5 | 32 | 0.984 | 1 | 0.321 | 0.012 | |
| A5V1 | 53 | 40.940 | ||||||
| Total | 85 | |||||||
| Activity | A5 | 33 | 52.480 | 6.994 | 1 | 0.008 | 0.082 | |
| A5V1 | 53 | 37.910 | ||||||
| Total | 86 | |||||||
| Potency | A5 | 33 | 51.360 | 5.357 | 1 | 0.021 | 0.063 | |
| A5V1 | 53 | 38.600 | ||||||
| Total | 86 | |||||||
The rank-based non-parametric analysis was used to examine the rank-order of the three modalities (audio only, video only, vs. visual music), and to determine whether there were statistically significant differences between two (or more) modalities. A higher value in mean rank indicates the upper rank; (for example, in the clip group 1, A1 (Mean rank = 51.300) is higher in rank than is A1V4 (Mean rank = 38.640) on the evaluation index.
Indicates significant difference p < 0.05,
indicates p < 0.001, and bold type indicates the highest rank per category.