| Literature DB >> 30400575 |
Dongkoo Shon1, Kichang Im2, Jeong-Ho Park3, Dong-Sun Lim4, Byungtae Jang5, Jong-Myon Kim6.
Abstract
In recent years, stress analysis by using electro-encephalography (EEG) signals incorporating machine learning techniques has emerged as an important area of research. EEG signals are one of the most important means of indirectly measuring the state of the brain. The existing stress algorithms lack efficient feature selection techniques to improve the performance of a subsequent classifier. In this paper, genetic algorithm (GA)-based feature selection and k-nearest neighbor (k-NN) classifier are used to identify stress in human beings by analyzing electro-encephalography (EEG) signals. GA is incorporated in the stress analysis pipeline to effectively select subset of features that are suitable to enhance the performance of the k-NN classifier. The performance of the proposed method is evaluated using the Database for Emotion Analysis using Physiological Signals (DEAP), which is a public EEG dataset. A feature set is extracted in 32 EEG channels, which consists of statistical features, Hjorth parameters, band power, and frontal alpha asymmetry. The selected features through GA are used as input to the k-NN classifier to distinguish whether each EEG datapoint represents a stress state. To further consolidate, the effectiveness of the proposed method is compared with that of a state-of-the-art principle component analysis (PCA) method. Experimental results show that the proposed GA-based method outperforms PCA, with GA demonstrating 71.76% classification accuracy compared with 65.3% for PCA. Thus, it can be concluded that the proposed method can be effectively used for stress analysis with high classification accuracy.Entities:
Keywords: genetic algorithm; k-nearest neighbors; machine learning; stress detection
Mesh:
Year: 2018 PMID: 30400575 PMCID: PMC6265975 DOI: 10.3390/ijerph15112461
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1An overall process of stress classification.
Figure 2The result of data annotation.
Frequency band ranges of recorded EEG signals.
| No. of Frequency Band | Frequency Range (Hz) |
|---|---|
| 1 | 4–7.2 Hz |
| 2 | 7.2–10.4 Hz |
| 3 | 10.4–13.8 Hz |
| 4 | 13.8–17 Hz |
| 5 | 17–20 Hz |
| 6 | 20–23 Hz |
| 7 | 23–60 Hz |
Number of features from each type using the EEG signals.
| Feature Types | Number of Features |
|---|---|
| Statistical features | 192 |
| Frequency domain (Power spectral density) | 224 |
| HOC | 160 |
| Hjorth Parameter | 96 |
| Frontal Asymmetry Alpha | 1 |
Classification performance of the three different methods for feature selection.
| Participant No. | All Features + KNN Precision | PCA + KNN Precision | GA Based + KNN Precision |
|---|---|---|---|
| 1 | 87.91% | 80.10% | 87.13% |
| 2 | 63.91% | 60.22% | 61.64% |
| 4 | 54.69% | 51.04% | 52.60% |
| 5 | 61.24% | 59.36% | 68.15% |
| 8 | 72.54% | 66.93% | 72.51% |
| 10 | 60.05% | 55.14% | 61.83% |
| 11 | 68.03% | 66.59% | 75.72% |
| 12 | 73.58% | 73.29% | 72.45% |
| 13 | 71.63% | 72.83% | 78.84% |
| 14 | 82.21% | 80.29% | 83.65% |
| 15 | 79.31% | 75.80% | 91.42% |
| 16 | 62.24% | 60.16% | 64.84% |
| 18 | 73.48% | 73.44% | 79.68% |
| 19 | 51.84% | 49.98% | 61.22% |
| 20 | 58.98% | 54.28% | 62.49% |
| 21 | 66.78% | 64.45% | 72.64% |
| 22 | 63.97% | 62.14% | 62.14% |
| 24 | 73.26% | 75.00% | 77.08% |
| 25 | 60.94% | 61.14% | 60.56% |
| 26 | 54.69% | 54.62% | 89.11% |
| 27 | 88.89% | 88.54% | 88.19% |
| 28 | 63.54% | 65.63% | 68.23% |
| 29 | 67.19% | 64.06% | 68.23% |
| 31 | 54.69% | 51.82% | 57.81% |
| 32 | 61.35% | 58.99% | 75.78% |
Figure 3Comparison between (a) PCA and (b) GA-based feature selection.
Comparison of average precision of related stress classification from the EEG recording.
| Features | Classifier | Precision |
|---|---|---|
| all features | 67.08% | |
| PCA [ | 65.03% | |
| proposed GA method | 71.76% |