| Literature DB >> 30813418 |
Xile Gao1, Haiyong Luo2, Qu Wang3, Fang Zhao4, Langlang Ye5, Yuexia Zhang6.
Abstract
Recently, the demand for human activity recognition has become more and more urgent. It is widely used in indoor positioning, medical monitoring, safe driving, etc. Existing activity recognition approaches require either the location information of the sensors or the specific domain knowledge, which are expensive, intrusive, and inconvenient for pervasive implementation. In this paper, a human activity recognition algorithm based on SDAE (Stacking Denoising Autoencoder) and LightGBM (LGB) is proposed. The SDAE is adopted to sanitize the noise in raw sensor data and extract the most effective characteristic expression with unsupervised learning. The LGB reveals the inherent feature dependencies among categories for accurate human activity recognition. Extensive experiments are conducted on four datasets of distinct sensor combinations collected by different devices in three typical application scenarios, which are human moving modes, current static, and dynamic behaviors of users. The experimental results demonstrate that our proposed algorithm achieves an average accuracy of 95.99%, outperforming other comparative algorithms using XGBoost, CNN (Convolutional Neural Network), CNN + Statistical features, or single SDAE.Entities:
Keywords: LightGBM; Stacking Denoising Autoencoder; deep learning; human activity recognition; indoor positioning
Year: 2019 PMID: 30813418 PMCID: PMC6412893 DOI: 10.3390/s19040947
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
The further experimental results of the related work.
| Reference | Dataset | Algorithm | Accuracy |
|---|---|---|---|
| [ | WISDM | Artificial features + Dropout | 85.36% |
Figure 1The architecture of the classification network. The solid black line in the data classification module represents the process of data classification while the solid blue line represents the encoding and decoding process of SDAE.
The total inner and outer class divergence of the Original data and the Extracted feature.
| Evaluation Indicator | Original Data | Extracted Feature |
|---|---|---|
| inner-class divergence | 8.0264 | 0.2064 |
| outer-class divergence | 6.9440 | 0.8646 |
The specific inner and outer class divergence of the Original data.
| Category | WALK | WALKUP | WALKDOWN | SIT | STAND | LAY |
|---|---|---|---|---|---|---|
| WALK | 10.50 | 0.48 | 0.06 | 3.95 | 0.05 | 45.21 |
| WALKUP | 13.25 | 0.77 | 6.79 | 0.77 | 50.39 | 0.05 |
| WALKDOWN | 17.73 | 3.30 | 0.05 | 43.29 | 0.77 | 3.95 |
| SIT | 2.46 | 3.29 | 26.81 | 0.05 | 6.79 | 0.06 |
| STAND | 0.98 | 43.56 | 3.29 | 3.30 | 0.77 | 0.48 |
| LAY | 7.03 | 0.98 | 2.46 | 17.73 | 13.25 | 10.50 |
The specific inner and outer class divergence of the Extracted feature.
| Category | WALK | WALKUP | WALKDOWN | SIT | STAND | LAY |
|---|---|---|---|---|---|---|
| WALK | 0.09 | 0.23 | 0.32 | 0.49 | 0.12 | 5.69 |
| WALKUP | 0.23 | 0.21 | 0.4 | 0.67 | 0.64 | 6.03 |
| WALKDOWN | 0.32 | 0.4 | 0.23 | 0.49 | 0.39 | 5.63 |
| SIT | 0.49 | 0.67 | 0.49 | 0.37 | 0.41 | 3.38 |
| STAND | 0.12 | 0.64 | 0.39 | 0.41 | 0.04 | 5.48 |
| LAY | 5.69 | 6.03 | 5.63 | 3.38 | 5.48 | 0.27 |
Figure 2The feature values between different categories. (a) Shows the different distributions of a certain feature value between static classes and dynamic classes. (b) Displays the data distribution and cumulative probability of a certain feature on the dynamic class and (c) shows that on the static class.
Figure 3The schematic diagram of boosting K-Fold LGB. The training sets of each fold are determined by the previous fold’s predicted results.
Figure 4The single SDAE network for classification. After the training of stacking denoising encoders, an output layer is added on the top of the encoding network. By performing the gradient descent on the supervised loss, the classification result of single SDAE can be obtained.
The feature engineering of XGB algorithm.
| Sensors | Cluster | Features |
|---|---|---|
| Acceleration | Vertical component | mean, variance, standard deviation, median, minimum, maximum, range, quartile |
| Horizontal component | mean, variance, standard deviation, median, minimum, maximum, range, quartile | |
| Acceleration &Gyroscope & Magnetic | Modulus value | mean, variance, standard deviation, median, minimum, maximum, range, quartile, kurtosis, skewness, root mean, square, integral, double integral, autocorrelation, 7 FFT features |
| Three axes value | Pearson correlation coefficient between three axes | |
| Pressure | 6s’s windows | Change value, standard deviation |
| 10s’s windows | Change value, standard deviation |
Figure 5The convolutional neural network (CNN) network of human activity recognition. Each convolutional network is for a set of data in a sensor. Then the outputs of the convolutional network are spliced together to be the input of the fully connected layer.
The sample number of each category on each dataset. The HMM contains both HMMwithPre and HMMwithoutPre.
| Dataset | Category | Abbreviation | Number of Samples | Sample Percent of Each Human Activity |
|---|---|---|---|---|
| HMM | Stilling | STI | 1325 | 12.42% |
| HSBD | Walking | WAL | 1772 | 17.12% |
| HDBD | Stand-to-sit | S2SI | 697 | 12.48% |
The network structure for SDAE.
| Dataset | n_Layer | n_Hidden | Dropout | Batchsize | Epoch |
|---|---|---|---|---|---|
| HMMwithPre | 3 | [150,70,20] for Pre | 0.4 | 32 | 20 |
| HMMwithoutPre | 3 | [400,200,30] | 0.4 | 32 | 4 |
| HSBD | 2 | [100,30] | 0.4 | 32 | 2 |
| HDBD | 1 | [30] | 0.4 | 32 | 20 |
Figure 6The confusion matrix of SDAE+LGB on each dataset: (a) HMMwithPre, (b) HMMwithoutPre, (c) HSBD, (d) HDBD. The Number in each cell is the probability of the current predicted result, while blank represents the probability is below 0.01.
Average recognition accuracy of our proposed method on the different datasets.
| Dataset | Accuracy |
|---|---|
| HMMwithPre | 95.73% |
The evaluation score of single SDAE on four datasets.
| Dataset | Model | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|---|
| HMMwithPre | SDAE | 86.04% | 87.22% | 86.04% | 86.01% |
| HMMwithoutPre | SDAE | 84.42% | 84.77% | 84.42% | 84.37% |
| HSBD | SDAE | 84.63% | 86.58% | 84.62% | 84.79% |
| HDBD | SDAE | 79.14% | 80.65% | 79.14% | 78.38% |
The evaluation score of XGB on four datasets.
| Dataset | Model | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|---|
| HMMwithPre | XGB | 95.06% | 95.06% | 95.06% | 95.03% |
| HMMwithoutPre | XGB | 85.47% | 85.41% | 85.47% | 85.12% |
| HSBD | XGB | 94.05% | 94.07% | 94.05% | 94.05% |
| HDBD | XGB | 80.12% | 80.21% | 80.12% | 79.99% |
The evaluation score of CNN on four datasets.
| Dataset | Model | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|---|
| HMMwithPre | CNN | 90.77% | 90.77% | 90.57% | 90.77% |
| HMMwithoutPre | CNN | 88.64% | 88.31% | 88.42% | 88.31% |
| HSBD | CNN | 85.83% | 85.84% | 85.83% | 85.83% |
| HDBD | CNN | 92.84% | 93.84% | 92.97% | 92.97% |
The comparison of evaluation score of [13] on four datasets.
| Dataset | Model | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|---|
| HMMwithPre | [ | 93.24% | 93.15% | 93.24% | 93.19% |
| HMMwithoutPre | [ | 86.31% | 86.15% | 86.31% | 86.05% |
| HSBD | [ | 97.63% | 97.68% | 97.63% | 97.62% |
| HDBD | [ | 78.11% | 78.11% | 78.11% | 78.10% |