| Literature DB >> 23964250 |
Frank A Russo1, Naresh N Vempala, Gillian M Sandstrom.
Abstract
Listening to music often leads to physiological responses. Do these physiological responses contain sufficient information to infer emotion induced in the listener? The current study explores this question by attempting to predict judgments of "felt" emotion from physiological responses alone using linear and neural network models. We measured five channels of peripheral physiology from 20 participants-heart rate (HR), respiration, galvanic skin response, and activity in corrugator supercilii and zygomaticus major facial muscles. Using valence and arousal (VA) dimensions, participants rated their felt emotion after listening to each of 12 classical music excerpts. After extracting features from the five channels, we examined their correlation with VA ratings, and then performed multiple linear regression to see if a linear relationship between the physiological responses could account for the ratings. Although linear models predicted a significant amount of variance in arousal ratings, they were unable to do so with valence ratings. We then used a neural network to provide a non-linear account of the ratings. The network was trained on the mean ratings of eight of the 12 excerpts and tested on the remainder. Performance of the neural network confirms that physiological responses alone can be used to predict musically induced emotion. The non-linear model derived from the neural network was more accurate than linear models derived from multiple linear regression, particularly along the valence dimension. A secondary analysis allowed us to quantify the relative contributions of inputs to the non-linear model. The study represents a novel approach to understanding the complex relationship between physiological responses and musically induced emotion.Entities:
Keywords: computational modeling; emotion; music cognition; neural networks; physiological responses
Year: 2013 PMID: 23964250 PMCID: PMC3737459 DOI: 10.3389/fpsyg.2013.00468
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Twelve music excerpts with composers, emotion quadrants, and mean valence/arousal ratings.
| M1 | Bartok | Sonata for 2 pianos and percussion (Assai lento) | Agitated | 5 | 4.1 | 6.35 | 6.98 |
| M2 | Shostakovich | Symphony No. 8 (Adagio) | Agitated | 3.35 | 7.45 | ||
| M3 | Stravinsky | Dame sacrale (Le Sacre du Printemps) | Agitated | 3.95 | 7.15 | ||
| M4 | Beethoven | Symphony No. 7 (Vivace) | Happy | 6.6 | 6.38 | 6.35 | 6.7 |
| M5 | Liszt | Les Preludes | Happy | 5.75 | 6.25 | ||
| M6 | Strauss | Unter Donner und Blitz | Happy | 6.8 | 7.5 | ||
| M7 | Bizet | Intermezzo (Carmen Suite) | Peaceful | 6.6 | 6.1 | 2.85 | 2.77 |
| M8 | Dvorak | Symphony No. 9 (Largo) | Peaceful | 5.95 | 2.65 | ||
| M9 | Schumann | Traumerei | Peaceful | 5.75 | 2.8 | ||
| M10 | Chopin | Funeral March, Op. 72 No. 2 | Sad | 4.85 | 4.4 | 2.55 | 3.48 |
| M11 | Grieg | Aase's death (Peer Gynt) | Sad | 4.05 | 4.15 | ||
| M12 | Mozart | Requiem (Lacrimosa) | Sad | 4.3 | 3.75 |
Figure 1Mean participant valence/arousal ratings (on a scale of 1–9) for all 12 excerpts (training and test sets combined).
Figure 2Neural network with five input units, five hidden units, and two output units. W indicates connection weights from input units to hidden units. W indicates connection weights from hidden units to output units.
Figure 3Mean participant valence/arousal ratings (on a scale of 1–9) for the test set and corresponding mean neural network outputs. NN indicates neural network output.
Performance of neural network and linear regression performance for each of the 4 test excerpts.
| M3 (Stravinsky; agitated) | 0.025 | 0.868 | 3.795 | 0.59 |
| M6 (Strauss; happy) | 1.656 | 0.04 | 9.363 | 0.407 |
| M9 (Schumann; peaceful) | 0.56 | 0.626 | 8.155 | 5.459 |
| M12 (Mozart; sad) | 1.039 | 2.01 | 8.05 | 3.989 |
| Mean error | 0.82 | 0.886 | 7.341 | 2.611 |
| Mean error % | 10.25 | 11.08 | 91.76 | 32.64 |
| Mean performance accuracy % | 89.75 | 88.92 | 8.24 | 67.36 |
Two measures of performance for neural network and linear regression based on all 12 excerpts: (1) RMSE and (2) correlation with mean valence/arousal ratings.
| RMSE | 0.09 | 0.11 | 0.1 | 0.17 |
| Correlation | 0.79 | 0.91 | 0.73 | 0.77 |
| Explained variance ( | 62.4% | 82.8% | 53.3% | 59.3% |
Figure 4Contribution of each physiological feature toward judgments of (A) valence, and (B) arousal, based on weight proportion. A negative value indicates that the physiological feature is higher for negative valence, or low arousal, respectively.