| Literature DB >> 29367844 |
Andrea Clerico1, Abhishek Tiwari1, Rishabh Gupta1, Srinivasan Jayaraman1, Tiago H Falk1.
Abstract
The quantity of music content is rapidly increasing and automated affective tagging of music video clips can enable the development of intelligent retrieval, music recommendation, automatic playlist generators, and music browsing interfaces tuned to the users' current desires, preferences, or affective states. To achieve this goal, the field of affective computing has emerged, in particular the development of so-called affective brain-computer interfaces, which measure the user's affective state directly from measured brain waves using non-invasive tools, such as electroencephalography (EEG). Typically, conventional features extracted from the EEG signal have been used, such as frequency subband powers and/or inter-hemispheric power asymmetry indices. More recently, the coupling between EEG and peripheral physiological signals, such as the galvanic skin response (GSR), have also been proposed. Here, we show the importance of EEG amplitude modulations and propose several new features that measure the amplitude-amplitude cross-frequency coupling per EEG electrode, as well as linear and non-linear connections between multiple electrode pairs. When tested on a publicly available dataset of music video clips tagged with subjective affective ratings, support vector classifiers trained on the proposed features were shown to outperform those trained on conventional benchmark EEG features by as much as 6, 20, 8, and 7% for arousal, valence, dominance and liking, respectively. Moreover, fusion of the proposed features with EEG-GSR coupling features showed to be particularly useful for arousal (feature-level fusion) and liking (decision-level fusion) prediction. Together, these findings show the importance of the proposed features to characterize human affective states during music clip watching.Entities:
Keywords: affective computing; electroencephalography; emotion classification; multimedia content; pattern classification; physiological signals; signal processing
Year: 2018 PMID: 29367844 PMCID: PMC5767842 DOI: 10.3389/fncom.2017.00115
Source DB: PubMed Journal: Front Comput Neurosci ISSN: 1662-5188 Impact factor: 2.380
Figure 1Signal processing steps used to compute the EEG amplitude modulation feature sets.
Figure 2Amplitude envelope extraction from each EEG subband time series signal (gray) and their respective Hilbert amplitude envelopes (black).
Figure 3Individualized threshold such that approximately 50/50 ratio was achieved for high/low class for valence and arousal dimensions.
Selected top-35 features for the arousal dimension.
| 1 | γ_m-γ_FC5_CP2 | α_m-α_T8_CP6 | ratio_γ_m-γ_Fz | cfc_FC1_7_Hz | AI_β_FC1_FC2 |
| 2 | β_m-α_FC5_CP5 | θ_m-θ_Fp1_Pz | β_m-α_F7 | cfc_CP5_7_Hz | AI_θ_FC1_FC2 |
| 3 | γ_m-γ_FC5_Cz | γ_m-γ_FC5_FC1 | ratio_γ_m-β_Pz | cfc_O1_19_Hz | γ_Fp1 |
| 4 | γ_m-γ_FC5_AF4 | γ_m-θ_CP5_F8 | θ_m-θ_O1 | cfc_FC5_15_Hz | θ_O2 |
| 5 | γ_m-γ_AF4_CP2 | θ_m-θ_C3_O2 | ratio_β_m-θ_CP5 | cfc_FC1_44_Hz | α_O2 |
| 6 | β_m-α_CP5_Pz | α_m-α_P7_C4 | β_m-β_O2 | cfc_O1_20_Hz | α_F7 |
| 7 | γ_m-γ_FC5_PO4 | α_m-θ_F7_T7 | ratio_α_m-θ_O2 | cfc_O1_27_Hz | θ_CP6 |
| 8 | γ_m-α_PO3_F8 | γ_m-γ_P7_F8 | γ_m-β_F7 | cfc_O1_28_Hz | α_Pz |
| 9 | γ_m-γ_FC5_C4 | β_m-θ_C4_P4 | ratio_α_m-α_T8 | cfc_FC5_16_Hz | AI_β_AF3_AF4 |
| 10 | γ_m-β_FC5_PO4 | θ_m-θ_FC6_Cz | ratio_β_m-β_FC2 | cfc_FC1_39_Hz | β_FC5 |
| 11 | β_m-θ_FC2_P8 | α_m-θ_T8_CP6 | θ_m-θ_FC5 | cfc_FC1_43_Hz | θ_AF4 |
| 12 | γ_m-γ_FC5_Fp2 | θ_m-θ_Fp1_P7 | ratio_α_m-θ_Cz | cfc_FC1_42_Hz | θ_P4 |
| 13 | γ_m-γ_FC5_Fz | γ_m-θ_P7_F8 | ratio_β_m-θ_Pz | cfc_O1_18_Hz | AI_β_P7_P8 |
| 14 | γ_m-γ_AF4_Cz | α_m-α_FC2_P8 | α_m-α_Cz | cfc_O1_26_Hz | θ_F8 |
| 15 | β_m-β_AF3_CP5 | α_m-θ_P7_C4 | ratio_α_m-α_O2 | cfc_P8_5_Hz | AI_β_FC5_FC6 |
| 16 | β_m-β_FC5_CP5 | β_m-α_C4_P4 | ratio_α_m-θ_Fz | cfc_FC1_37_Hz | β_Fp2 |
| 17 | α_m-α_FC1_T8 | θ_m-θ_C3_O1 | ratio_α_m-α_Cz | cfc_O1_29_Hz | θ_FC6 |
| 18 | α_m-α_Oz_CP2 | θ_m-θ_P3_P8 | γ_m-α_F7 | cfc_O1_23_Hz | θ_T8 |
| 19 | γ_m-γ_FC5_FC6 | α_m-θ_Fp1_Cz | α_m-θ_O2 | cfc_O1_22_Hz | α_Fz |
| 20 | β_m-β_PO3_P8 | γ_m-α_T7_FC2 | ratio_β_m-θ_P3 | cfc_FC1_8_Hz | α_PO3 |
| 21 | γ_m-β_AF4_PO4 | γ_m-β_FC5_FC1 | α_m-θ_T8 | cfc_FC5_18_Hz | γ_F4 |
| 22 | γ_m-β_FC5_Fz | β_m-θ_T7_T8 | ratio_γ_m-γ_Oz | cfc_FC1_35_Hz | AI_θ_O1_O2 |
| 23 | β_m-α_AF3_Pz | γ_m-α_FC5_FC1 | θ_m-θ_P7 | cfc_Fz_19_Hz | θ_P8 |
| 24 | γ_m-γ_AF4_PO4 | β_m-θ_Cz_PO4 | ratio_γ_m-θ_CP1 | esc_C4 | AI_β_Fp1_Fp2 |
| 25 | γ_m-β_FC5_Fp2 | θ_m-θ_O1_CP6 | α_m-θ_CP6 | cfc_CP1_5_Hz | β_F3 |
| 26 | γ_m-γ_Fp2_AF4 | γ_m-γ_CP5_F8 | α_m-α_T8 | cfc_O1_25_Hz | β_FC1 |
| 27 | α_m-θ_PO3_CP2 | γ_m-γ_T7_FC2 | β_m-α_C3 | cfc_FC1_41_Hz | γ_P3 |
| 28 | γ_m-γ_FC5_P3 | β_m-β_F3_PO3 | ratio_γ_m-γ_Pz | cfc_FC1_40_Hz | β_Fp1 |
| 29 | γ_m-γ_FC5_FC1 | γ_m-β_T7_FC2 | ratio_γ_m-θ_Pz | cfc_FC1_38_Hz | α_PO4 |
| 30 | β_m-β_AF3_O2 | β_m-β_C4_P4 | ratio_α_m-α_Fz | cfc_O1_30_Hz | θ_Fp2 |
| 31 | α_m-θ_FC1_T8 | α_m-θ_FC2_P8 | ratio_γ_m-θ_P3 | cfc_FC5_20_Hz | α_F4 |
| 32 | α_m-θ_F3_Oz | γ_m-β_CP5_F8 | α_m-θ_Cz | cfc_FC5_19_Hz | α_P7 |
| 33 | γ_m-β_FC5_FC6 | γ_m-β_P7_F8 | ratio_θ_m-θ_O2 | cfc_O1_21_Hz | AI_β_F7_F8 |
| 34 | γ_m-γ_F3_Fp2 | α_m-α_F7_T7 | β_m-β_F7 | cfc_FC5_17_Hz | β_AF3 |
| 35 | γ_m-γ_FC1_AF4 | α_m-α_Fp1_Cz | ratio_α_m-θ_AF4 | cfc_O1_24_Hz | θ_CP2 |
Feature names listed should be self explanatory. The “ratio” features correspond to the log-ratio ones between the video and baseline periods; “AI” corresponds to the asymmetry index between the indicated channels.
Selected top-21 features for the liking dimension.
| 1 | β_m-θ_AF4_CP6 | γ_m-α_Pz_AF4 | ratio_β_m-θ_FC6 | cfc_P7_30_Hz | α_P3 |
| 2 | β_m-α_PO3_P8 | β_m-β_O1_T8 | γ_m-β_P7 | cfc_FC1_7_Hz | θ_C3 |
| 3 | α_m-α_O1_Oz | β_m-β_Pz_FC2 | ratio_γ_m-γ_P8 | cfc_P7_29_Hz | β_P3 |
| 4 | γ_m-γ_Fp1_T7 | γ_m-β_Pz_AF4 | γ_m-θ_P3 | cfc_PO4_42_Hz | β_T8 |
| 5 | α_m-α_Oz_FC2 | β_m-β_CP5_AF4 | α_m-α_AF4 | esc_T8 | β_O1 |
| 6 | θ_m-θ_Fp1_AF4 | γ_m-γ_Pz_AF4 | ratio_γ_m-α_P8 | cfc_P7_32_Hz | θ_P4 |
| 7 | θ_m-θ_C3_P8 | β_m-θ_FC1_O1 | ratio_γ_m-θ_F3 | cfc_P7_31_Hz | α_F8 |
| 8 | θ_m-θ_CP5_AF4 | γ_m-θ_Pz_AF4 | α_m-α_CP1 | cfc_PO4_39_Hz | β_PO3 |
| 9 | β_m-θ_P3_AF4 | γ_m-γ_AF3_Oz | ratio_β_m-θ_P8 | cfc_PO4_45_Hz | AI_β_FC5_FC6 |
| 10 | β_m-θ_F7_AF4 | γ_m-β_CP1_AF4 | γ_m-θ_F3 | esc_F3 | β_AF3 |
| 11 | θ_m-θ_P7_AF4 | β_m-α_O1_T8 | ratio_β_m-β_C3 | cfc_PO4_44_Hz | θ_CP1 |
| 12 | β_m-β_PO3_P8 | γ_m-α_Fp1_T7 | ratio_β_m-α_C3 | cfc_Fp1_8_Hz | α_CP5 |
| 13 | θ_m-θ_PO3_Cz | β_m-α_CP5_P4 | ratio_β_m-α_F3 | cfc_P7_26_Hz | β_F3 |
| 14 | β_m-θ_F3_AF4 | γ_m-θ_CP1_AF4 | ratio_α_m-θ_Fp1 | esc_CP1 | AI_α_P7_P8 |
| 15 | θ_m-θ_CP1_AF4 | γ_m-β_AF3_Oz | ratio_γ_m-γ_FC6 | cfc_PO4_41_Hz | AI_β_F7_F8 |
| 16 | α_m-θ_Oz_FC2 | β_m-θ_FC5_PO3 | ratio_β_m-α_Fp2 | cfc_PO4_43_Hz | AI_β_PO3_PO4 |
| 17 | α_m-θ_P7_P8 | γ_m-γ_CP1_AF4 | ratio_γ_m-β_P8 | cfc_P7_27_Hz | θ_FC6 |
| 18 | β_m-β_PO3_P4 | γ_m-α_AF3_Oz | β_m-θ_P7 | esc_P8 | β_FC5 |
| 19 | θ_m-θ_Pz_CP6 | β_m-θ_F3_P8 | ratio_θ_m-θ_P7 | cfc_FC1_8_Hz | AI_θ_F7_F8 |
| 20 | θ_m-θ_Pz_AF4 | β_m-β_F3_P8 | ratio_β_m-β_F3 | cfc_FC6_10_Hz | θ_F4 |
| 21 | β_m-θ_AF4_T8 | γ_m-α_CP1_AF4 | β_m-β_T7 | cfc_P7_28_Hz | θ_Fz |
Feature names listed should be self explanatory. The “ratio” features correspond to the log-ratio ones between the video and baseline periods; “AI” corresponds to the asymmetry index between the indicated channels.
Performance comparison of SVM classifiers for different feature sets and fusion strategies.
| AMI | – | 0.604 | 4.4 | 0.583 | 5.6 | 0.564 | 4.1 | 0.626 | 1.1 |
| AMC | – | 0.594 | 2.7 | 0.563 | 1.9 | 0.569 | 5.0 | 0.630 | 1.9 |
| AME | – | 0.600 | 3.6 | 0.563 | 1.9 | 0.573 | 5.6 | 0.627 | 1.3 |
| AMF | Feature-level | 0.594 | 2.7 | 0.583 | 5.6 | 0.566 | 4.4 | 0.624 | 0.9 |
| PAC | – | 0.634 | 9.7 | 0.568 | 3.0 | 0.559 | 3.2 | 0.629 | 1.7 |
| SF | – | 0.578 | – | 0.552 | – | 0.542 | – | 0.619 | – |
| AMF + SF + PAC | Feature-level | 0.594 | 2.7 | 0.598 | 8.4 | 0.567 | 4.6 | 0.624 | 0.9 |
| AMI + AMC + AME | Decision-level | 0.594 | 2.8 | 0.563 | 1.9 | 0.567 | 4.6 | 0.625 | 1.0 |
| AMF + PAC + SF | Decision-level | 0.594 | 2.7 | 0.563 | 1.9 | 0.563 | 3.7 | 0.633 | 2.2 |
All reported results were significantly higher than chance achieved with a random voting classifier (p < 0.05). Column labeled “%” indicates relative improvement, in percentage, over the SF baseline set.
Selected top-23 features for the valence dimension.
| 1 | α_m-α_O1_CP2 | θ_m-θ_T7_F8 | β_m-θ_PO4 | cfc_T8_5_Hz | AI_α_PO3_PO4 |
| 2 | α_m-α_O1_Oz | β_m-β_AF3_F4 | ratio_γ_m-α_PO3 | cfc_C3_26_Hz | α_P7 |
| 3 | α_m-θ_F7_Pz | γ_m-γ_CP1_P7 | ratio_β_m-β_Fp1 | cfc_CP1_25_Hz | γ_Fz |
| 4 | α_m-α_F3_O1 | γ_m-θ_AF3_Oz | ratio_α_m-θ_Oz | cfc_CP1_28_Hz | α_P3 |
| 5 | α_m-α_O1_Fp2 | β_m-α_F7_P8 | ratio_γ_m-β_PO3 | cfc_O2_15_Hz | AI_α_P3_P4 |
| 6 | α_m-α_O1_O2 | γ_m-β_F3_Oz | β_m-θ_Pz | cfc_C3_25_Hz | AI_γ_O1_O2 |
| 7 | α_m-α_T7_O1 | γ_m-γ_AF3_P7 | ratio_γ_m-β_Fp1 | cfc_C3_24_Hz | θ_Fz |
| 8 | β_m-β_CP6_CP2 | γ_m-θ_AF3_P7 | ratio_β_m-α_Fp1 | esc_F3 | α_PO3 |
| 9 | α_m-θ_O1_CP2 | γ_m-α_F3_Oz | γ_m-β_PO4 | cfc_O2_14_Hz | AI_α_P7_P8 |
| 10 | β_m-β_F4_CP2 | θ_m-θ_Pz_PO4 | ratio_β_m-θ_P8 | cfc_FC1_42_Hz | θ_O1 |
| 11 | β_m-θ_AF3_Oz | α_m-α_Fp1_Pz | β_m-β_P3 | cfc_FC1_43_Hz | β_PO3 |
| 12 | α_m-α_O1_PO4 | θ_m-θ_F4_FC2 | ratio_α_m-α_CP2 | cfc_C3_27_Hz | AI_α_F7_F8 |
| 13 | β_m-β_F4_F8 | γ_m-γ_F3_Oz | γ_m-γ_PO4 | cfc_CP1_23_Hz | AI_γ_C3_C4 |
| 14 | γ_m-β_F7_Cz | γ_m-θ_F3_Oz | β_m-θ_T8 | cfc_C3_23_Hz | AI_γ_FC1_FC2 |
| 15 | α_m-θ_O1_O2 | β_m-α_AF3_F4 | β_m-α_P3 | cfc_CP1_30_Hz | AI_β_PO3_PO4 |
| 16 | α_m-θ_O1_Cz | γ_m-θ_Oz_O2 | β_m-α_PO4 | cfc_CP1_24_Hz | AI_β_FC5_FC6 |
| 17 | α_m-α_CP1_PO4 | β_m-α_F4_P8 | β_m-β_T7 | esc_F4 | AI_α_O1_O2 |
| 18 | γ_m-θ_F3_O1 | γ_m-α_CP1_P7 | β_m-θ_T7 | cfc_FC1_45_Hz | AI_β_F7_F8 |
| 19 | γ_m-β_P8_O2 | γ_m-β_CP1_P7 | ratio_β_m-α_PO3 | cfc_CP1_26_Hz | AI_θ_AF3_AF4 |
| 20 | α_m-θ_O1_Fz | β_m-β_CP5_T8 | γ_m-θ_PO4 | cfc_CP1_29_Hz | α_Fz |
| 21 | α_m-θ_F7_AF4 | β_m-β_F7_P8 | ratio_β_m-β_PO3 | cfc_CP1_27_Hz | AI_β_P3_P4 |
| 22 | α_m-α_O1_Cz | γ_m-β_AF3_P7 | ratio_θ_m-θ_CP2 | cfc_FC1_44_Hz | β_P3 |
| 23 | α_m-θ_O1_Oz | θ_m-θ_O1_Cz | ratio_γ_m-α_Fp1 | esc_AF3 | θ_AF3 |
Feature names listed should be self explanatory. The “ratio” features correspond to the log-ratio ones between the video and baseline periods; “AI” corresponds to the asymmetry index between the indicated channels.
Selected top-19 features for the dominance dimension.
| 1 | θ_m-θ_CP1_T8 | β_m-θ_P7_F8 | γ_m-β_P7 | esc_AF3 | θ_FC2 |
| 2 | α_m-α_P3_Oz | β_m-α_CP1_F8 | β_m-β_P3 | cfc_FC2_11_Hz | γ_F3 |
| 3 | α_m-θ_AF3_T7 | β_m-θ_T7_F8 | α_m-α_Pz | cfc_FC2_8_Hz | α_PO3 |
| 4 | γ_m-α_F7_CP6 | β_m-α_CP5_AF4 | γ_m-α_P7 | cfc_CP6_7_Hz | θ_C3 |
| 5 | θ_m-θ_P3_P8 | β_m-β_CP1_Fz | γ_m-θ_PO4 | cfc_CP6_8_Hz | θ_Pz |
| 6 | θ_m-θ_FC2_P8 | β_m-α_P7_FC6 | α_m-θ_Pz | cfc_F3_6_Hz | γ_P7 |
| 7 | β_m-α_CP1_Pz | β_m-θ_PO3_P4 | ratio_γ_m-β_P8 | cfc_FC2_7_Hz | θ_FC6 |
| 8 | α_m-α_F3_Fz | β_m-α_CP1_Fz | ratio_β_m-θ_P8 | cfc_FC5_5_Hz | θ_P4 |
| 9 | β_m-θ_P3_F4 | β_m-θ_F8_P4 | ratio_γ_m-γ_P8 | cfc_F4_12_Hz | β_F3 |
| 10 | β_m-α_CP1_P3 | β_m-β_CP1_F8 | θ_m-θ_Pz | cfc_FC2_9_Hz | α_P7 |
| 11 | β_m-α_P3_Pz | β_m-α_PO3_PO4 | γ_m-γ_PO4 | cfc_FC5_11_Hz | β_C4 |
| 12 | β_m-θ_P3_PO4 | β_m-α_P7_F8 | ratio_γ_m-γ_PO4 | cfc_CP1_42_Hz | AI_β_CP5_CP6 |
| 13 | α_m-α_AF3_Fz | β_m-β_CP5_Pz | β_m-α_F7 | cfc_CP6_9_Hz | θ_PO3 |
| 14 | β_m-β_FC5_Pz | β_m-α_CP5_Pz | γ_m-β_F7 | cfc_P4_6_Hz | α_Pz |
| 15 | α_m-α_AF3_T7 | β_m-β_CP5_AF4 | γ_m-β_P3 | cfc_F4_14_Hz | θ_P3 |
| 16 | γ_m-α_F7_CP2 | β_m-β_P7_F8 | γ_m-γ_F7 | cfc_AF4_5_Hz | θ_Fp2 |
| 17 | θ_m-θ_CP1_CP2 | β_m-β_PO3_PO4 | γ_m-θ_Cz | cfc_F4_13_Hz | AI_β_PO3_PO4 |
| 18 | θ_m-θ_T8_P8 | β_m-θ_CP1_F8 | β_m-β_Cz | cfc_FC2_12_Hz | θ_O1 |
| 19 | β_m-β_FC5_P3 | β_m-θ_CP1_Fz | β_m-β_F7 | cfc_FC2_10_Hz | AI_γ_F7_F8 |
Feature names listed should be self explanatory. The “ratio” features correspond to the log-ratio ones between the video and baseline periods; “AI” corresponds to the asymmetry index between the indicated channels.