| Literature DB >> 35359827 |
Talha Iqbal1, Adnan Elahi2, William Wijns1, Atif Shahzad1,3.
Abstract
Over the past decade, there has been a significant development in wearable health technologies for diagnosis and monitoring, including application to stress monitoring. Most of the wearable stress monitoring systems are built on a supervised learning classification algorithm. These systems rely on the collection of sensor and reference data during the development phase. One of the most challenging tasks in physiological or pathological stress monitoring is the labeling of the physiological signals collected during an experiment. Commonly, different types of self-reporting questionnaires are used to label the perceived stress instances. These questionnaires only capture stress levels at a specific point in time. Moreover, self-reporting is subjective and prone to inaccuracies. This paper explores the potential feasibility of unsupervised learning clustering classifiers such as Affinity Propagation, Balanced Iterative Reducing and Clustering using Hierarchies (BIRCH), K-mean, Mini-Batch K-mean, Mean Shift, Density-Based Spatial Clustering of Applications with Noise (DBSCAN) and Ordering Points To Identify the Clustering Structure (OPTICS) for implementation in stress monitoring wearable devices. Traditional supervised machine learning (linear, ensembles, trees, and neighboring models) classifiers require hand-crafted features and labels while on the other hand, the unsupervised classifier does not require any labels of perceived stress levels and performs classification based on clustering algorithms. The classification results of unsupervised machine learning classifiers are found comparable to supervised machine learning classifiers on two publicly available datasets. The analysis and results of this comparative study demonstrate the potential of unsupervised learning for the development of non-invasive, continuous, and robust detection and monitoring of physiological and pathological stress.Entities:
Keywords: heart rate; machine learning; physiological signals; respiratory rate; stress monitoring; unsupervised and supervised learning
Year: 2022 PMID: 35359827 PMCID: PMC8962952 DOI: 10.3389/fmedt.2022.782756
Source DB: PubMed Journal: Front Med Technol ISSN: 2673-3129
Hyper-parameters settings and python library used for implementation.
|
|
|
|
|
|
|---|---|---|---|---|
| Supervised machine learning algorithm | Logistics regression | • Solver = “lbfgs” | sklearn.linear_model | |
| Gaussian Naïve Bayes | • Variance smooting = 1e-09 | sklearn.naive_bayes | ||
| Decision tree | • Quality of split criterion = “gini” | sklearn.tree | ||
| Random forest | • Quality of split criterion = “gini” | sklearn.ensemble | ||
| AdaBoost | • Learning rate was varied between range (0.01-1.1 with increment of 0.01) | sklearn.ensemble | ||
| K-nearest neighbors | • Number of neighbors required was set to 2 | sklearn.neighbors | ||
| K-nearest neighbors | 70-30% | • Number of neighbors required set at 5 | sklearn.neighbors | |
| Unsupervised machine learning algorithm | Affinity propagation | • Damping factor was set at 0.8 to maintain current value relative to incoming value (weight 1-damping) | sklearn.cluster | |
| BIRCH | • Threshold from which the radius of subcluster should be lesser = 0.5 | sklearn.cluster | ||
| DBSCAN | • Maximum distance between two samples for consideration as neighbors (eps) = 0.50 | sklearn.cluster | ||
| K-mean | • Number of neighbors required was set to 2 | sklearn.cluster | ||
| Mini-batch K-mean | • Number of neighbors required was set to 2 | sklearn.cluster | ||
| Mean shift | • Number of clusters = length of unique ids in training set (default = 2) | sklearn.cluster | ||
| OPTICS | • Maximum distance between two samples for consideration as neighbors (eps) = 0.80 | sklearn.cluster |
Figure 1Block diagram of the implemented classification methods illustrating pre-processing, classification, and post-processing stages.
Results of supervised learning algorithms on stress recognition in automobile drivers dataset.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Stress recognition in automobile drivers dataset | Logistic regression | Heart rate and respiratory rate | 59.3% | 0.59 | 0.59 | 0.59 | |
| Gaussian Naive Bayes | 56.5% | 0.60 | 0.59 | 0.59 | |||
| Decision tree | 63.4% | 0.64 | 0.64 | 0.63 | |||
| Random forest | 65.0% | 0.65 | 0.66 | 0.65 | |||
| AdaBoost | 66.8% | 0.67 | 0.66 | 0.65 | |||
| KNN = 5 | 63.7% | 0.63 | 0.63 | 0.63 | |||
| KNN = 2 | 58.1% | 0.60 | 0.57 | 0.56 | |||
| Stress recognition in automobile drivers dataset | Logistic regression | Heart rate | 58.4% | 0.59 | 0.58 | 0.58 | |
| Gaussian Naive Bayes | 56.0% | 0.59 | 0.56 | 0.55 | |||
| Decision tree | 61.9% | 0.66 | 0.062 | 0.57 | |||
| Random forest | 70-30 % | 56.2% | 0.56 | 0.56 | 0.56 | ||
| AdaBoost | 61.5% | 0.61 | 0.61 | 0.60 | |||
| KNN = 5 | 54.4% | 0.54 | 0.54 | 0.54 | |||
| KNN = 2 | 51.7% | 0.55 | 0.52 | 0.50 | |||
| Stress recognition in automobile drivers dataset | Logistic regression | Respiratory rate | 63.2% | 0.70 | 0.63 | 0.55 | |
| Gaussian Naive Bayes | 63.4% | 0.72 | 0.63 | 0.55 | |||
| Decision tree | 62.4% | 0.64 | 0.62 | 0.63 | |||
| Random forest | 56.9% | 0.57 | 0.57 | 0.57 | |||
| AdaBoost | 66.8% | 0.66 | 0.67 | 0.67 | |||
| KNN = 5 | 59.5% | 0.59 | 0.60 | 0.59 | |||
| KNN = 2 | 54.0% | 0.58 | 0.54 | 0.53 |
Results of supervised learning algorithms on Stress recognition in automobile drivers dataset (K-fold cross validation).
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|
|
|
| ||||||
| Stress recognition in automobile drivers dataset | Logistic regression | Heart rate and respiratory rate | 61.5% | 0.038 | 58.8% | 64.2% | |
| Gaussian Naive Bayes | 61.6% | 0.022 | 58.9% | 64.3% | |||
| Decision tree | 64.1% | 0.047 | 61.5% | 66.8% | |||
| Random forest | 64.0% | 0.029 | 61.3% | 66.6% | |||
| AdaBoost | 65.6% | 0.036 | 62.9% | 68.2% | |||
| KNN = 2 | 54.9% | 0.051 | 52.2% | 57.6% | |||
| KNN = 5 | 58.6% | 0.034 | 55.9% | 61.3% | |||
| Stress recognition in automobile drivers dataset | Logistic regression | Heart rate | 58.7% | 0.20 | 57.2% | 60.2% | |
| Gaussian Naive Bayes | 56.4% | 0.024 | 54.9% | 57.9% | |||
| Decision tree | 59.9% | 0.019 | 58.4% | 61.4% | |||
| Random forest | 10-fold cross validation | 57.5% | 0.027 | 56.0% | 59.0% | ||
| AdaBoost | 59.9% | 0.016 | 58.4% | 61.4% | |||
| KNN = 5 | 52.0% | 0.023 | 50.4% | 53.5% | |||
| KNN = 5 | 56.1% | 0.024 | 54.6% | 57.6% | |||
| Stress recognition in automobile drivers dataset | Logistic regression | Respiratory rate | 58.3% | 0.037 | 55.6% | 61.0% | |
| Gaussian Naive Bayes | 58.7% | 0.038 | 56.0% | 61.4% | |||
| Decision tree | 61.4% | 0.053 | 58.7% | 64.0% | |||
| Random forest | 59.4% | 0.50 | 56.7% | 62.1% | |||
| AdaBoost | 63.9% | 0.036 | 61.2% | 66.5% | |||
| KNN = 2 | 54.6% | 0.039 | 51.9% | 57.4% | |||
| KNN = 5 | 59.0% | 0.052 | 56.3% | 61.7% | |||
Results of unsupervised learning algorithms on stress recognition in automobile drivers dataset.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Stress recognition in automobile drivers dataset | Affinity propagation | Heart rate and respiratory rate | 63.8% | 0.65 | 0.64 | 0.62 | |
| BIRCH | 54.9% | 0.62 | 0.57 | 0.50 | |||
| DBSCAN | 53.8% | 0.56 | 0.54 | 0.41 | |||
| K-mean | 55.7% | 0.62 | 0.56 | 0.52 | |||
| Mini-batch K-mean | 53.0% | 0.28 | 0.53 | 0.37 | |||
| Mean shift | 53.0% | 0.28 | 0.53 | 0.37 | |||
| OPTICS | 54.1% | 0.54 | 0.54 | 0.53 | |||
| Stress recognition in automobile drivers dataset | Affinity propagation | Heart rate | 59.7% | 0.60 | 0.82 | 0.69 | |
| BIRCH | 49.1% | 0.66 | 0.49 | 0.38 | |||
| DBSCAN | 54.7% | 0.30 | 0.55 | 0.39 | |||
| K-mean | 70-30 % | 55.5% | 0.61 | 0.55 | 0.53 | ||
| Mini-batch K-mean | 54.8% | 0.61 | 0.55 | 0.52 | |||
| Mean shift | 54.7% | 0.30 | 0.55 | 0.39 | |||
| OPTICS | 51.6% | 0.51 | 0.52 | 0.51 | |||
| Stress recognition in automobile drivers dataset | Affinity propagation | Respiratory rate | 65.0% | 0.77 | 0.65 | 0.57 | |
| BIRCH | 57.4% | 0.33 | 0.57 | 0.42 | |||
| DBSCAN | 60.6% | 0.62 | 0.61 | 0.53 | |||
| K-mean | 59.8% | 0.63 | 0.60 | 0.60 | |||
| Mini-batch K-mean | 60.3% | 0.6 | 0.60 | 0.60 | |||
| Mean shift | 57.4% | 0.33 | 0.57 | 0.42 | |||
| OPTICS | 54.6% | 0.49 | 0.55 | 0.46 |
Results of supervised learning algorithms on SWELL-KW dataset.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| SWELL-KW dataset | Logistic regression | Heart rate | 70-30 % | 70.2% | 0.70 | 0.70 | 0.64 |
| Gaussian naive bayes | 70.3% | 0.70 | 0.70 | 0.64 | |||
| Decision tree | 74.8% | 0.74 | 0.75 | 0.73 | |||
| Random forest | 74.8% | 0.74 | 0.75 | 0.73 | |||
| AdaBoost | 74.6% | 0.75 | 0.75 | 0.71 | |||
| KNN = 5 | 71.8% | 0.71 | 0.72 | 0.71 | |||
| KNN = 2 | 62.7% | 0.68 | 0.63 | 0.64 |
Results of supervised learning algorithms on SWELL-KW dataset (K-fold cross validation).
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|
|
|
| ||||||
| SWELL-KW dataset | Logistic regression | Heart rate | 10-fold cross validation | 70.2% | 0.002 | 70.0% | 70.4% |
| Gaussian Naive Bayes | 70.3% | 0.002 | 70.4% | 70.5% | |||
| Decision tree | 74.8% | 0.002 | 74.6% | 75.0% | |||
| Random forest | 75.0% | 0.003 | 74.8% | 75.2% | |||
| AdaBoost | 74.6% | 0.003 | 74.4% | 74.8% | |||
| KNN = 2 | 62.8% | 0.002 | 62.6% | 63.0% | |||
| KNN = 5 | 72.0% | 0.003 | 71.8% | 72.2% | |||
Results of unsupervised learning algorithms on SWELL-KW dataset.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| SWELL-KW dataset | Affinity propagation | Heart rate | 70-30 % | 66.5% | 0.44 | 0.67 | 0.53 |
| BIRCH | 68.1% | 0.66 | 0.68 | 0.60 | |||
| K-mean | 66.7% | 0.45 | 0.67 | 0.53 | |||
| Mini-batch K-mean | 66.7% | 0.45 | 0.67 | 0.53 | |||
| Mean shift | 68.3% | 0.69 | 0.68 | 0.60 | |||
| DBSCAN | 66.7% | 0.45 | 0.67 | 0.53 | |||
| OPTICS | 66.7% | 0.45 | 0.67 | 0.53 |
Results comparison of supervised learning algorithms on datasets with previously reported work.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| Stress recognition in automobile drivers dataset | Table 5.8 of ( | Respiratory rate | 62.2% | 66.8% | 63.9% | |
| Supervised learning algorithms | Heart rate | 52.6% | 61.9% | 59.9% | ||
| SWELL-KW dataset | Table 4 of ( | Heart rate | 64.1% | 74.8% | 75.0% |
Figure 2Bar-plot of classification accuracies of supervised and unsupervised classification algorithms using (A) Stress Recognition in Automobile Drivers Dataset and (B) SWELL-KW Dataset.