| Literature DB >> 36015730 |
Maximilian Ehrhart1, Bernd Resch1,2, Clemens Havas1, David Niederseer3.
Abstract
Human-centered applications using wearable sensors in combination with machine learning have received a great deal of attention in the last couple of years. At the same time, wearable sensors have also evolved and are now able to accurately measure physiological signals and are, therefore, suitable for detecting body reactions to stress. The field of machine learning, or more precisely, deep learning, has been able to produce outstanding results. However, in order to produce these good results, large amounts of labeled data are needed, which, in the context of physiological data related to stress detection, are a great challenge to collect, as they usually require costly experiments or expert knowledge. This usually results in an imbalanced and small dataset, which makes it difficult to train a deep learning algorithm. In recent studies, this problem is tackled with data augmentation via a Generative Adversarial Network (GAN). Conditional GANs (cGAN) are particularly suitable for this as they provide the opportunity to feed auxiliary information such as a class label into the training process to generate labeled data. However, it has been found that during the training process of GANs, different problems usually occur, such as mode collapse or vanishing gradients. To tackle the problems mentioned above, we propose a Long Short-Term Memory (LSTM) network, combined with a Fully Convolutional Network (FCN) cGAN architecture, with an additional diversity term to generate synthetic physiological data, which are used to augment the training dataset to improve the performance of a binary classifier for stress detection. We evaluated the methodology on our collected physiological measurement dataset, and we were able to show that using the method, the performance of an LSTM and an FCN classifier could be improved. Further, we showed that the generated data could not be distinguished from the real data any longer.Entities:
Keywords: expert evaluation; generating measurement data; machine learning; physiological sensor data; stress classification; time series GAN
Mesh:
Year: 2022 PMID: 36015730 PMCID: PMC9412645 DOI: 10.3390/s22165969
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1The label distribution of our physiological measurement dataset. The left bar is the Moment Of Stress (MOS) class, and the right one is the non-MOS class.
Figure 2Prepare and preprocess raw signals for the cGAN and the stress classifier. The red line indicates ST and the blue line indicates GSR. The dotted line in the raw signals and in the filtered signals indicates induced MOS. In the window plot the dotted line indicates split index.
Figure 3cGAN workflow.
Figure 4LSTM cell.
Figure 5The architecture of our conditional GAN. In the input and output figure in (a,b), the blue line indicates GSR and the red line indicates ST, which shows a prototypical MOS.
Figure 6Visual comparison of real and generated samples. The red line shows a standardized and filtered 16 s ST signal. The blue line shows a standardized and filtered 16 s GSR signal. There are always two generated and two real signal samples arranged in a 2 × 2 grid.
Figure 7The two figures show the results from the t-sne. The red points are the generated points, and the blue points are the real data points.
The results from the classifier experiments are shown. The different scores indicate the best possible results we reached during training.
|
|
|
|
|
|
| baseline | 0.4881 | 0.8542 | 0.6212 | 0.84 |
| RCGAN TGTR | 0.5357 | 0.7377 | 0.6207 | 0.8382 |
| RCGAN DAug | 0.5833 | 0.7903 | 0.6712 | 0.8588 |
| TimeGAN TGTR | 0.5833 | 0.6203 | 0.6012 | 0.8088 |
| TimeGAN DAug | 0.6429 | 0.71 | 0.6750 | 0.8471 |
| Ours TGTR | 0.5238 | 0.7719 | 0.6241 | 0.84 |
| Ours DAug | 0.7262 | 0.7439 | 0.7349 | 0.8676 |
|
|
|
|
|
|
| baseline | 0.5357 | 0.8654 | 0.6618 | 0.8647 |
| RCGAN TGTR | 0.4762 | 0.6250 | 0.5405 | 0.8000 |
| RCGAN DAug | 0.6190 | 0.7324 | 0.6709 | 0.8500 |
| TimeGAN TGTR | 0.5833 | 0.6533 | 0.6163 | 0.8206 |
| TimeGAN DAug | 0.5952 | 0.8065 | 0.6849 | 0.8647 |
| Ours TGTR | 0.6786 | 0.7600 | 0.7170 | 0.8618 |
| Ours DAug | 0.7262 | 0.8243 | 0.7721 | 0.88 |
The results of the classification between real and generated performed by experts. The accuracy score is the mean of the participants’ performance.
| Accuracy | |
|---|---|
| Real/Generated | 0.4575 |
The binary classification of physiological measurement data according to stress moments performed by experts.
| Recall | Precision | F1 | Accuracy | |
|---|---|---|---|---|
| All Sequences | 0.7567 | 0.7814 | 0.7487 | 0.8175 |
| Real | 0.74 | 0.7019 | 0.6973 | 0.765 |
| Generated | 0.7733 | 0.8816 | 0.8065 | 0.870 |
Results of the classifier two-sample test. The closer to the chance level, the better are the results.
| Neural Net | LSTM | |
|---|---|---|
| CTST LSTM-FCN | 0.6221 | 0.5903 |