| Literature DB >> 34068895 |
Filipe Galvão1, Soraia M Alarcão1, Manuel J Fonseca1.
Abstract
Recognition of emotions from physiological signals, and in particular from electroencephalography (EEG), is a field within affective computing gaining increasing relevance. Although researchers have used these signals to recognize emotions, most of them only identify a limited set of emotional states (e.g., happiness, sadness, anger, etc.) and have not attempted to predict exact values for valence and arousal, which would provide a wider range of emotional states. This paper describes our proposed model for predicting the exact values of valence and arousal in a subject-independent scenario. To create it, we studied the best features, brain waves, and machine learning models that are currently in use for emotion classification. This systematic analysis revealed that the best prediction model uses a KNN regressor (K = 1) with Manhattan distance, features from the alpha, beta and gamma bands, and the differential asymmetry from the alpha band. Results, using the DEAP, AMIGOS and DREAMER datasets, show that our model can predict valence and arousal values with a low error (MAE < 0.06, RMSE < 0.16) and a strong correlation between predicted and expected values (PCC > 0.80), and can identify four emotional classes with an accuracy of 84.4%. The findings of this work show that the features, brain waves and machine learning models, typically used in emotion classification tasks, can be used in more challenging situations, such as the prediction of exact values for valence and arousal.Entities:
Keywords: EEG; arousal and valence prediction; comparative study; emotion recognition
Mesh:
Year: 2021 PMID: 34068895 PMCID: PMC8155937 DOI: 10.3390/s21103414
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1The typical set of emotions recognized in the literature (a–d) and what we want to achieve with our work (e). A: arousal; V: valence. (best seen in color).
A brief summary of the analyzed works and their main characteristics.
| Work | Database | Features (Brain Waves) | Classifier | Emotions (#Classes) |
|---|---|---|---|---|
| [ | MANHOB-HCI | PSD, DPSA ( | SVM | arousal (3); valence (3) |
| [ | DEAP | PSD, APSD ( | NB | arousal (2); valence (2) |
| [ | Video (Own) | SampEn, Spectral Centroid ( | KNN, PNN | disgust, happy, surprise, fear and neutral (5) |
| [ | DEAP | PSD ( | DLN, SVM, NB | arousal (3); valence (3) |
| [ | Video (Own) | DPSA, WT, WE, AE, FD, HE ( | SVM | positive and negative (2) |
| [ | DEAP | Pearson correlation, Phase coherence, MI ( | SVM | arousal (2); valence (2) |
| [ | SEED | PSD, DE, Differential/Rational asymmetry ( | KNN, LR, SVM, DBN | positive, negative and neutral (3) |
| [ | DEAP | PSD, STFT, HHS, HOC ( | RF, SVM | anger, surprise, other (3) |
| [ | DEAP | Statistical, PSD, HP, FD ( | SVM | arousal (2); valence (2) |
| [ | DEAP | WP, WE ( | SVM, KNN | arousal (2); valence (2) |
| [ | DEAP, Music | PSD, FD, differential asymmetry ( | SVM, MLP, C4.5 | arousal (2); valence (2) |
| [ | MANHOB-HCI | EMD, SampEn ( | SVM | high/low valence/arousal (4) |
| [ | DEAP | EMD, AR ( | SVM | high/low valence/arousal (4) |
| [ | DREAMER | Logarithmic PSD ( | SVM | arousal (2); valence (2) |
| [ | AMIGOS | Logarithmic PSD, APSD ( | NB | arousal (2); valence (2) |
| [ | DEAP | WP ( | ANN, SVM | high/low valence/arousal (4) |
| [ | DEAP | Quadratic time-frequency distributions (Custom) | SVM | high/low valence/arousal (4) |
| [ | DEAP, SEED | Flexible Analytic WT, Rényi’s Quadratic Entropy (Custom) | SVM, RF | high/low valence/arousal (4); positive, negative and neutral (3) |
| [ | DEAP | PSD, APSD, Shannon Entropy, SE, ZCR, Statistical ( | LSSVM (Least Square SVM) | joy, peace, anger and depression (4) |
| [ | DEAP | WP, WE ( | KELM | high/low valence/arousal (4) |
| [ | DEAP, SEED | WT, High Order Statistics ( | DLN | high/low valence/arousal (4); positive, negative and neutral (3) |
| [ | Video (Own) | LZC, WT, Cointegration Degree, EMD, AE (Custom) | SVM | arousal (2); valence (2) |
Feature Extraction: AE—Approximate Entropy, APSD—Asymmetric Power Spectrum Density, AR—Auto Regressive models, DE— Differential Entropy, DPSA—Differential Power Spectral Asymmetry, EMD—Empirical Mode Decomposition, FD—Fractal Dimensions, HE—Hurst Exponent, HHS—Hilbert–Huang Spectrum, HOC—Higher Order Crossings, HP—Hjorth Parameters, LZC—Lempel-Ziv Complexity, MI—Mutual Information, PSD—Power Spectrum Density, SampEn—Sample Entropy, SE—Spectral Entropy, STFT—Short- Time Fourier Transform, WE—Wavelet Entropy, WP—Wavelet Energy, WT—Wavelet Transform and ZCR—Zero-Crossing Rate. Classifier: ANN—Artificial Neural Networks, DBN—Deep Belief Networks, DLN—Deep Learning Networks, KELM—Extreme Learning Machine with kernel, KNN—K-Nearest Neighbors, LR—Logistic Regression, MLP—Multi-Layer Perceptron, NB—Naive Bayes, RF—Random Forest and SVM—Support Vector Machines.
Main characteristics of the AMIGOS, DEAP and DREAMER datasets.
| AMIGOS | DEAP | DREAMER | |
|---|---|---|---|
| #Videos | 16 | 40 | 18 |
| Type | movie extracts | music videos | film clips |
| Duration | <250 s | 60 s | 65–393 s |
| Physiological Signals | EEG, GSR, ECG | EEG, GSR, BVP, RESP, SKT, EOG, EMG | EEG, ECG |
| Participants | 40 | 32 | 23 |
Physiological signals: electroencephalography (EEG), galvanic skin response (GSR), electrocardiography (ECG), blood volume pulse (BVP), respiration (RESP), skin temperature (SKT), electrooculography (EOG) and electromyography (EMG).
Figure 2Steps to compute the feature vector from the raw EEG signal.
Figure 3Steps of the analysis to identify the best configuration for our valence and arousal prediction models. (Lin: Linear; RBF: Radial Basis Function).
Results for each regressor per wave. Values in bold represent the two best results for each column in each wave. Grey rows represent the selected regressors.
| Arousal | Valence | ||||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| Alpha | AR | 0.248 | 0.205 | 0.245 | 0.172 | 0.224 | 0.264 |
| DT | 0.339 | 0.194 | 0.241 | 0.263 | 0.216 | 0.261 | |
| KNN (K = 1) |
|
| 0.263 |
|
| 0.290 | |
| LR | 0.273 | 0.202 | 0.244 | 0.203 | 0.222 | 0.262 | |
| RF |
|
|
|
|
|
| |
| SVR (linear) | 0.251 | 0.200 | 0.248 | 0.232 | 0.237 | 0.242 | |
| SVR (RBF) | 0.244 | 0.212 |
| 0.214 | 0.211 |
| |
| Beta | AR | 0.297 | 0.201 | 0.242 | 0.229 | 0.221 | 0.261 |
| DT | 0.431 | 0.180 | 0.232 | 0.404 | 0.196 | 0.249 | |
| KNN (K = 1) |
|
|
|
|
| 0.244 | |
| LR | 0.301 | 0.200 | 0.242 | 0.280 | 0.216 | 0.257 | |
| RF |
|
|
|
|
|
| |
| SVR (linear) | 0.273 | 0.198 | 0.247 | 0.259 | 0.225 | 0.243 | |
| SVR (RBF) | 0.260 | 0.215 | 0.229 | 0.241 | 0.226 |
| |
| Gamma | AR | 0.297 | 0.201 | 0.242 | 0.256 | 0.219 | 0.259 |
| DT |
|
|
|
|
|
| |
| KNN (K = 1) | 0.409 | 0.174 | 0.275 | 0.387 | 0.191 | 0.297 | |
| LR | 0.247 | 0.203 | 0.246 | 0.255 | 0.218 | 0.259 | |
| RF |
|
|
|
|
|
| |
| SVR (linear) | 0.218 | 0.202 | 0.251 | 0.211 | 0.218 | 0.265 | |
| SVR (RBF) | 0.213 | 0.203 | 0.249 | 0.176 | 0.221 | 0.264 | |
Results for two types of asymmetry, and their combination, per wave. Bold values represent the best results for KNN and RF on each column.
| Arousal | Valence | |||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| KNN | Rational | Alpha | 0.410 | 0.175 | 0.275 | 0.366 | 0.196 | 0.302 |
| Beta |
|
|
| 0.550 | 0.144 | 0.255 | ||
| Gamma | 0.447 | 0.165 | 0.266 | 0.448 | 0.177 | 0.283 | ||
| Differential | Alpha | 0.411 | 0.182 | 0.275 |
|
|
| |
| Beta | 0.338 | 0.195 | 0.290 | 0.337 | 0.209 | 0.308 | ||
| Gamma | 0.271 | 0.212 | 0.305 | 0.279 | 0.227 | 0.322 | ||
| Both | Alpha | 0.472 | 0.162 | 0.260 | 0.672 | 0.134 | 0.215 | |
| Beta | 0.530 | 0.141 | 0.245 | 0.525 | 0.152 | 0.262 | ||
| Gamma | 0.422 | 0.172 | 0.271 | 0.423 | 0.185 | 0.289 | ||
| RF | Rational | Alpha | 0.516 | 0.176 | 0.218 | 0.481 | 0.196 | 0.237 |
| Beta | 0.681 | 0.151 | 0.191 | 0.679 | 0.162 | 0.202 | ||
| Gamma | 0.694 | 0.147 | 0.188 | 0.694 | 0.157 | 0.194 | ||
| Differential | Alpha | 0.610 | 0.166 | 0.205 |
|
|
| |
| Beta | 0.674 | 0.151 | 0.192 | 0.672 | 0.162 | 0.203 | ||
| Gamma | 0.689 | 0.148 | 0.189 | 0.685 | 0.158 | 0.200 | ||
| Both | Alpha | 0.586 | 0.169 | 0.209 | 0.812 | 0.141 | 0.176 | |
| Beta | 0.680 | 0.150 | 0.191 | 0.682 | 0.161 | 0.202 | ||
| Gamma |
|
|
| 0.694 | 0.156 | 0.198 | ||
Figure 4PCC values when each of the features per wave were used individually for predicting arousal and valence values. (best seen in color).
Results for the different features by waves, and their combinations. Values in bold represent the best results for KNN and RF, for valence and arousal. Grey rows represent the selected features.
| Arousal | Valence | ||||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| KNN |
| 0.505 | 0.147 | 0.252 | 0.451 | 0.167 | 0.281 |
|
| 0.621 | 0.114 | 0.221 | 0.597 | 0.126 | 0.241 | |
|
| 0.535 | 0.139 | 0.244 | 0.528 | 0.149 | 0.260 | |
|
| 0.689 | 0.093 | 0.199 | 0.658 | 0.105 | 0.221 | |
|
| 0.640 | 0.109 | 0.215 | 0.603 | 0.122 | 0.239 | |
|
| 0.672 | 0.099 | 0.205 | 0.652 | 0.110 | 0.224 | |
|
| 0.722 | 0.084 | 0.189 | 0.691 | 0.095 | 0.211 | |
|
| 0.430 | 0.178 | 0.270 |
| 0.115 |
| |
|
|
|
| 0.750 |
| 0.189 | ||
| RF |
| 0.543 | 0.173 | 0.215 | 0.505 | 0.193 | 0.234 |
|
| 0.669 | 0.150 | 0.192 | 0.661 | 0.161 | 0.204 | |
|
| 0.719 | 0.138 | 0.180 | 0.717 | 0.145 | 0.190 | |
|
| 0.722 | 0.138 | 0.180 | 0.719 | 0.147 | 0.190 | |
|
| 0.642 | 0.159 | 0.198 |
|
|
| |
|
|
|
| 0.845 | 0.119 | 0.155 | ||
Results for the optimization of the KNN and RF models, for different values of K, Manhattan distance and number of trees (T). Values in bold represent the best results for each model. Grey rows represent the selected parameters.
| Arousal | Valence | ||||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| KNN | K = 1 |
|
|
|
|
|
|
| K = 3 | 0.725 | 0.120 | 0.175 | 0.725 | 0.128 | 0.185 | |
| K = 5 | 0.684 | 0.137 | 0.185 | 0.689 | 0.146 | 0.194 | |
| K = 7 | 0.655 | 0.147 | 0.192 | 0.663 | 0.156 | 0.201 | |
| K = 11 | 0.622 | 0.156 | 0.199 | 0.633 | 0.166 | 0.208 | |
| K = 21 | 0.579 | 0.166 | 0.208 | 0.595 | 0.176 | 0.217 | |
| RF | T = 50 | 0.740 | 0.137 | 0.176 | 0.839 | 0.119 | 0.156 |
| T = 100 | 0.748 | 0.136 | 0.175 | 0.845 | 0.119 | 0.155 | |
| T = 500 | 0.755 |
|
| 0.852 |
|
| |
| T = 750 | 0.755 |
|
| 0.852 |
|
| |
| T = 1000 |
|
|
|
|
|
| |
Prediction results using the datasets DEAP, AMIGOS and DREAMER.
| Arousal | Valence | ||||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| KNN | DEAP | 0.794 | 0.062 | 0.163 | 0.795 | 0.066 | 0.172 |
| AMIGOS | 0.830 | 0.045 | 0.129 | 0.808 | 0.063 | 0.175 | |
| DREAMER | 0.806 | 0.058 | 0.165 | 0.812 | 0.076 | 0.213 | |
| RF | DEAP | 0.755 | 0.135 | 0.174 | 0.852 | 0.118 | 0.153 |
| AMIGOS | 0.789 | 0.115 | 0.148 | 0.769 | 0.158 | 0.195 | |
| DREAMER | 0.864 | 0.099 | 0.142 | 0.870 | 0.128 | 0.181 | |
Accuracy values (%) for arousal and valence binary classification. Values in bold represent the best results for each column.
| DEAP | AMIGOS | DREAMER | ||||
|---|---|---|---|---|---|---|
| Arousal | Valence | Arousal | Valence | Arousal | Valence | |
| KNN |
|
|
|
| 93.72 | 92.16 |
| RF | 80.62 | 85.91 | 85.98 | 83.00 |
|
|
Comparison of the accuracy (%) of the proposed model with previous works, for arousal and valence binary classification (low/high arousal, low/high valence). Values are from the original papers and using the DEAP dataset.
| Year | Method | Arousal | Valence |
|---|---|---|---|
| 2020 | Deep Physiological Affect Network (Convolutional LSTM with a temporal loss function) [ | 79.03 | 78.72 |
| 2020 | Attention-based LSTM with Domain Discriminator [ | 72.97 | 69.06 |
| 2019 | Spectrum centroid and Lempel–Ziv complexity from EMD; KNN [ | 86.46 | 84.90 |
| 2019 | Ensemble of CNNs with LSTM model [ | —– | 84.92 |
| 2019 | Phase-locking value-based graph CNN [ | 77.03 | 73.31 |
| 2018 | Time, frequency and connectivity features combined with mRMR and PCA for features reduction; Random Forest [ | 74.30 | 77.20 |
| 2017 | Transfer recursive feature elimination; least square SVM [ | 78.67 | 78.75 |
| 2012 | EEG Power spectral features + Asymmetry, from four bands; naive Bayes classifier [ | 62.00 | 57.60 |
| 2021 | Proposed model | 89.84 | 89.83 |
Figure 5Confusion matrices of four quadrants classification for the KNN (top), RF (middle) and a combination of both regressors (bottom), using the datasets DEAP (left), AMIGOS (center) and DREAMER (right). (best seen in color).
Comparison of the accuracy (%) of the proposed model with previous works, for the four quadrants classification. Values are from the original papers and using the DEAP dataset.
| Year | Method | Accuracy |
|---|---|---|
| 2020 | Nonlinear higher order statistics and deep learning algorithm [ | 82.01 |
| 2019 | Wavelet energy and entropy; Extreme Learning Machine with kernel [ | 80.83 |
| 2019 | Time-frequency analysis using multivariate synchrosqueezing transform; Gaussian SVM [ | 76.30 |
| 2018 | Wavelet energy; SVM classifier [ | 81.97 |
| 2018 | Flexible analytic wavelet transform + information potential to extract features; Random Forest [ | 71.43 |
| 2017 | Hybrid deep learning neural network (CNN + LSTM) [ | 75.21 |
| 2016 | Discriminative Graph regularized Extreme Learning Machine with differential entropy features [ | 69.67 |
| 2021 | Proposed model | 84.40 |