| Literature DB >> 31847238 |
Md Junayed Hasan1, Jong-Myon Kim1.
Abstract
Human stress analysis using electroencephalogram (EEG) signals requires a detailed and domain-specific information pool to develop an effective machine learning model. In this study, a multi-domain hybrid feature pool is designed to identify most of the important information from the signal. The hybrid feature pool contains features from two types of analysis: (a) statistical parametric analysis from the time domain, and (b) wavelet-based bandwidth specific feature analysis from the time-frequency domain. Then, a wrapper-based feature selector, Boruta, is applied for ranking all the relevant features from that feature pool instead of considering only the non-redundant features. Finally, the k-nearest neighbor (k-NN) algorithm is used for final classification. The proposed model yields an overall accuracy of 73.38% for the total considered dataset. To validate the performance of the proposed model and highlight the necessity of designing a hybrid feature pool, the model was compared to non-linear dimensionality reduction techniques, as well as those without feature ranking.Entities:
Keywords: EEG signals; feature selector; k-NN; stress analysis
Year: 2019 PMID: 31847238 PMCID: PMC6956373 DOI: 10.3390/brainsci9120376
Source DB: PubMed Journal: Brain Sci ISSN: 2076-3425
Figure 1Block diagram of the proposed method [15].
Mathematical description of the considered time-domain features.
| Feature | Equation | Feature | Equation | Feature | Equation |
|---|---|---|---|---|---|
| F1 |
| F2 |
| F3 |
|
| F4 |
| F5 |
| F6 |
|
| F7 |
| F8 |
| F9 |
|
| F10 |
| F11 |
|
X is the original EEG signal in the time domain. N denotes the total number of samples of the signal.
Figure 2Decomposition tree of level 5 discrete wavelet transform (DWPT).
Details of the correlated DWPT packets.
| Frequency Bands | DWPT Packets | Presence in the Considered DEAP EEG Filtered Signals (4–45 Hz) | Usage of DWPT Packet |
|---|---|---|---|
| Delta | 0–4 Hz | Not Present | No |
| Theta | 4–8 Hz | Present | Yes |
| Alpha | 8–12 Hz | Present | Yes |
| Beta | 14–16 Hz | Present | Yes |
| 16–32 Hz | Present | Yes | |
| Gamma | 32–36 Hz | Present | No |
| 32–40 Hz | Present | Yes | |
| 32–48 Hz | Not Present | No |
Figure 3Descriptive illustration of k-NN algorithm.
Figure 4K-fold cross-validation process in k-NN for selecting the k-value.
Details of the considered dataset.
| Dataset | Participant ID | Considered Number of Channels | Experimental ID That Reflects the Distinctive State | |
|---|---|---|---|---|
| Calm | Stress | |||
|
|
| 32 EEG Channels | 9,14 | 17,32,34,35,36,37 |
|
|
| 5,7,10,22,24,36 | 29,30,32,37,38 | |
|
|
| 2,6,18 | 24,28,32 | |
|
|
| 13,29 | 23,30,37 | |
|
|
| 10,37,39 | 31,36 | |
|
|
| 15,17,20,22,26,27,28 | 21,30,35,36,37,38,39 | |
|
|
| 2,12,16,19,25,26,28,40 | 27,35,37,38,39 | |
|
|
| 16,17,28 | 25,29,32,33,35,36,37,38 | |
|
|
| 12,15,16 | 7,21,23,31,34,35,36,37,38,39 | |
|
|
| 22,27 | 10,21,23,24,29,30,32,34,35,36,38 | |
|
|
| 7,16,22,26 | 24,25,30,38 | |
|
|
| 6,12,16,21,36 | 1,15,17,24,26,27,34 | |
|
|
| 22,26,34 | 30 | |
|
|
| 15,26,27 | 29,38 | |
|
|
| 16,26,27,28,40 | 23,25,29 | |
|
|
| 3,21,26,34,35 | 20,22,24 | |
|
|
| 1,6,12,15,16,28 | 23,24,29,30,32,33,35,36,37,38,39 | |
|
|
| 33,40 | 21,23,24,30,31,38,39 | |
|
|
| 4,5,26,27,28,34 | 2,10,23,29,31,32,33,37,38,39 | |
|
|
| 30 | 34 | |
|
|
| 5,15,19,26,27,28,33,40 | 27 | |
|
|
| 15,22,24,25 | 35,38 | |
|
|
| 15,17 | 30,31,33,35 | |
|
|
| 17,22,24,27,28,29 | 23,32,34,37,38,39 | |
|
|
| 2,6,15,26,33 | 24,30,37 | |
Figure 5Boruta feature space for Dataset 2.
Details of the class-wise and average classification accuracy.
| Dataset | Participant ID | Class-Wise Accuracy (%) | Avg. Accuracy (%) | |
|---|---|---|---|---|
| Calm | Stress | |||
|
| 1 | 79.45 | 78.68 | 79.07 |
|
| 2 | 93.59 | 93.65 | 93.62 |
|
| 4 | 97.14 | 95.24 | 96.13 |
|
| 5 | 75 | 83.33 | 79.82 |
|
| 8 | 77.78 | 78.95 | 78.23 |
|
| 10 | 62.5 | 65.48 | 64.00 |
|
| 11 | 85.48 | 82.97 | 84.23 |
|
| 12 | 59.35 | 73.14 | 65.26 |
|
| 13 | 62.61 | 69.22 | 65.92 |
|
| 14 | 82.20 | 88.96 | 85.58 |
|
| 15 | 79.59 | 72.22 | 76.09 |
|
| 16 | 62.71 | 68.52 | 65.62 |
|
| 18 | 95.29 | 88.13 | 91.71 |
|
| 19 | 76 | 92.86 | 82.58 |
|
| 20 | 82.73 | 79.15 | 80.94 |
|
| 21 | 88.51 | 82.49 | 85.5 |
|
| 22 | 92.13 | 93.44 | 92.79 |
|
| 24 | 73.46 | 93.98 | 83.72 |
|
| 25 | 82.22 | 87.49 | 84.85 |
|
| 26 | 86.67 | 100 | 93.33 |
|
| 27 | 84.43 | 78.92 | 81.66 |
|
| 28 | 88.49 | 86.53 | 87.51 |
|
| 29 | 85.73 | 84.92 | 85.33 |
|
| 31 | 82.43 | 78.75 | 80.62 |
|
| 32 | 83.78 | 75.86 | 81.17 |
Comparative analysis of different approaches for individual datasets.
| Dataset | Participant ID | PCA + k-NN | All Features + k-NN | Proposed |
|---|---|---|---|---|
|
| 1 | 70.52 | 77.22 | 79.07 |
|
| 2 | 70.83 | 91.65 | 93.62 |
|
| 4 | 67.56 | 94.81 | 96.13 |
|
| 5 | 77.14 | 75.0 | 79.82 |
|
| 8 | 65.62 | 74.67 | 78.23 |
|
| 10 | 53.89 | 62.95 | 64.00 |
|
| 11 | 78.32 | 82.22 | 84.23 |
|
| 12 | 56.22 | 64.21 | 65.26 |
|
| 13 | 54.31 | 62.28 | 65.92 |
|
| 14 | 71.29 | 86.23 | 85.58 |
|
| 15 | 58.68 | 75.71 | 76.09 |
|
| 16 | 49.23 | 65.91 | 65.62 |
|
| 18 | 68.55 | 80.24 | 91.71 |
|
| 19 | 54.36 | 77.44 | 82.58 |
|
| 20 | 55.81 | 71.92 | 80.94 |
|
| 21 | 58.94 | 76.22 | 85.5 |
|
| 22 | 64.87 | 89.44 | 92.79 |
|
| 24 | 62.91 | 76.36 | 83.72 |
|
| 25 | 63.30 | 81.49 | 84.85 |
|
| 26 | 54.89 | 91.45 | 93.33 |
|
| 27 | 75.43 | 78.25 | 81.66 |
|
| 28 | 49.43 | 76 | 87.51 |
|
| 29 | 52.82 | 75.12 | 85.33 |
|
| 31 | 65.85 | 78.42 | 80.62 |
|
| 32 | 61.09 | 80.03 | 81.17 |
Figure 6(a) Classification performance accuracy comparisons by boxplot, (b) performance chart based on accuracy comparisons.
Comparative analysis of different approaches for the merged dataset.
| Methods | Dataset | Avg. Accuracy (%) | Decrement from the Proposed Method (%) |
|---|---|---|---|
| PCA + k-NN | Merged dataset | 65.62 | 7.76 |
| All Features + k-NN | 69.26 | 4.12 | |
| Proposed | 73.38 | - |
The merged dataset is the finally formed dataset in which data from Dataset 1 to 25 (from Table 3) are considered together.