| Literature DB >> 35685688 |
Sharifah Noor Masidayu Sayed Ismail1, Nor Azlina Ab Aziz2, Siti Zainab Ibrahim1, Sophan Wahyudi Nawawi3, Salem Alelyani4,5, Mohamed Mohana4, Lee Chia Chun6.
Abstract
Background: The electrocardiogram (ECG) is a physiological signal used to diagnose and monitor cardiovascular disease, usually using 2- D ECG. Numerous studies have proven that ECG can be used to detect human emotions using 1-D ECG; however, ECG is typically captured as 2-D images rather than as 1-D data. There is still no consensus on the effect of the ECG input format on the accuracy of the emotion recognition system (ERS). The ERS using 2-D ECG is still inadequately studied. Therefore, this study compared ERS performance using 1-D and 2-D ECG data to investigate the effect of the ECG input format on the ERS.Entities:
Keywords: DREAMER; Emotion recognition; electrocardiogram; image ECG; numerical ECG
Mesh:
Year: 2021 PMID: 35685688 PMCID: PMC9171287 DOI: 10.12688/f1000research.73255.2
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. The P wave, QRS complex, and T wave in the standard electrocardiogram (ECG). This figure has been reproduced with permission from Ref. 7.
Figure 2. The example of 2-D ECG in a PDF file.
The summary of existing works using 1-D and 2-D ECG input.
| Ref | Dataset | Signal used | ECG input | Purposes | Feature extracted | Classifier | Result (%) |
|---|---|---|---|---|---|---|---|
|
| Own dataset
| ECG | 1-D | ERS | Statistical features from the time and frequency domains | SVM, NB, KNN, Gaussian | SVM – 69.23
|
|
| AUBT | ECG | 1-D | ERS | Local pattern description using Local Binary Pattern (LBP) and Local Ternary Pattern (LTP) | KNN | LBP – 84.17
|
|
| (MPED)
| ECG | 1-D | ERS | Statistical features from the time and frequency domains | SVM, KNN, LSTM, A-LSTM | SVM – 42.66
|
|
| (DREAMER)
| ECG, EEG | 1-D | ERS | Statistical features from the time and frequency domains | SVM, KNN, LDA | Valence – 62.37
|
|
| DREAMER | ECG | 1-D | ERS | Statistical features from the time, frequency, time-frequency domains, and nonlinear analysis-related | SVM | Valence – 86.09
|
|
| DREAMER | ECG | 1-D | ERS | Deep-learning | Convolutional Neural Network (CNN) | Valence – 74.90
|
|
| DREAMER | ECG | 1-D | ERS | Statistical features from the time and frequency domains | SVM | Valence – 65.80
|
|
| (MWM-HIT)
| ECG | 2-D | Authentication System | PQRST peaks | CNN | 99.99* |
|
| PhysioNet dataset (Fantasia and ECG-ID) | ECG | 2-D spectral | Authentication system | Spectrogram | CNN | 99.42* |
|
| Own dataset generated by FLUKE “ProSim 4 Vital Sign
| ECG | 2-D spectral | ECG classification | Instantaneous frequency and spectral entropy | LSTM | 100* |
|
| Zhejiang dataset | ECG | 2-D | Myocardial infarction screening | Object detection | DenseNet, KNN, SVM | DenseNet – 94.73*
|
|
| MIT-BIH arrhythmia dataset | ECG | 2-D spectral | Arrhythmia classification | Local features from 2-D images using deep learning | CNN | 99.11* |
|
| Physiobank dataset | ECG | 1-D and 2-D | Ventricular Arrhythmia detection | ECG beat images | SVM, Probabilistic Neural Network (PNN), KNN, Random Forest (RF) | 99.99* (both are useful) |
|
| Own dataset
| ECG and EEG | 2-D spectral | ERS | Statistical features from the time and frequency domains (R-R interval spectrogram) | CNN | ECG – 91.67
|
|
| AMIGOS, DEAP | ECG, PPG, EDA | 2-D spectral | ERS | Features extracted from spectrogram by ResNet-50 | Logistic Regression | AMIGOS – 78.30
|
The summary of the DREAMER dataset.
| No of subject | 23 |
|---|---|
|
| 18 audio-visual stimuli |
|
| Audio-video |
|
| ECG (256) |
|
| Valence, Arousal |
|
| 1–5 |
Figure 3. The overall structure of 1-D ECG-based ERS.
Features extracted from Augsburg Bio-signal Toolbox (AUBT).
| Features | Description |
|---|---|
| P, Q, R, S, T | P-, Q-, R-, S-, T-peaks (ECG) |
| HRV | Heart rate variability |
| Ampl | Amplitude signal |
| Mean | Mean value |
| Median | Median value |
| Std | Standard deviation |
| Min | Minimum value |
| Max | Maximum value |
| SpecRange | Mean of the frequency spectrum in a given range |
Features extracted from Toolbox for Emotional feature extraction from Physiological signals (TEAP).
| Features | Description |
|---|---|
| meanIBI | Mean inter-beat interval |
| HRV | Heart Rate Variability |
| MSE | Multiscale entropy at 5 levels |
| sp0001/0102/0203/0304 | Spectral power 0-0.1 Hz, 0.1-0.2 Hz, 0.2-0.3 Hz, 0.3-0.4 Hz |
| energyRatio | Spectral energy ratio between f<0.08 Hz/f>0.15 Hz and f<5.0 Hz |
| tachogram_LF/MF/HF | Spectral power in tachogram (HRV) for low, medium, and high frequencies. |
| tachogram_energy_ratio | Energy ratio for tachogram spectral content (MF/(LF+HF)) |
Figure 4. The 2-D ECG converted from 1-D ECG.
Figure 5. The overall structure of 1-D ECG-based ERS.
The experimental setting values.
| Setting | Value | |
|---|---|---|
| SVM hyperparameter | Kernel | {linear, rbf} |
| C | [0.1,1,10] | |
| Gamma | [0.1,1,10] | |
| Train-test split | Stratified 70:30 | |
| Kfold cross-validation | 10 | |
Testing emotion classification accuracy and F1-score for 1-D and 2-D electrocardiogram (ECG).
| Type of ECG input | Feature extractor | Valence | Arousal | ||
|---|---|---|---|---|---|
| Accuracy | F1-Score | Accuracy | F1-Score | ||
| 1-D | *AUBT and BioSig | 62.37 | 53.05 | 62.37 | 57.98 |
| AUBT | 63.86 | 72.73 |
|
| |
| TEAP |
|
| 54.22 | 42.42 | |
| 2-D | ORB | 61.75 | 47.76 | 56.33 | 40.59 |
| KAZE |
|
| 56.33 | 40.59 | |
| SIFT | 61.14 | 46.4 | 56.33 | 40.59 | |
| AKAZE | 61.75 | 47.76 |
|
| |
| BRISK | 61.75 | 47.76 | 56.02 | 40.23 | |
| HOG | 61.14 | 46.4 | 56.33 | 40.59 | |
Computation Time for Each Feature Extractor using Support Vector Machine (SVM).
| Type of ECG input | Feature extractor | Computational time (sec) | |
|---|---|---|---|
| Valence | Arousal | ||
|
| AUBT | 1.65 | 1.63 |
| TEAP | 1.51 | 1.55 | |
|
| ORB | 1473.07 | 1461.77 |
| KAZE | 4486.27 | 6034.26 | |
| SIFT | 239.31 | 238.55 | |
| AKAZE | 2950.23 | 3308.23 | |
| BRISK | 4926.46 | 3610.64 | |
| HOG | 6516.30 | 6431.28 | |