| Literature DB >> 31109126 |
Shoujiang Xu1,2, Qingfeng Tang3, Linpeng Jin4, Zhigeng Pan5.
Abstract
Human activity recognition (HAR) has gained lots of attention in recent years due to its high demand in different domains. In this paper, a novel HAR system based on a cascade ensemble learning (CELearning) model is proposed. Each layer of the proposed model is comprised of Extremely Gradient Boosting Trees (XGBoost), Random Forest, Extremely Randomized Trees (ExtraTrees) and Softmax Regression, and the model goes deeper layer by layer. The initial input vectors sampled from smartphone accelerometer and gyroscope sensor are trained separately by four different classifiers in the first layer, and the probability vectors representing different classes to which each sample belongs are obtained. Both the initial input data and the probability vectors are concatenated together and considered as input to the next layer's classifiers, and eventually the final prediction is obtained according to the classifiers of the last layer. This system achieved satisfying classification accuracy on two public datasets of HAR based on smartphone accelerometer and gyroscope sensor. The experimental results show that the proposed approach has gained better classification accuracy for HAR compared to existing state-of-the-art methods, and the training process of the model is simple and efficient.Entities:
Keywords: Random Forest; Softmax Regression; cascade ensemble learning model; extremely gradient boosting trees; extremely randomized trees; human activity recognition; sensor; smartphone
Mesh:
Year: 2019 PMID: 31109126 PMCID: PMC6566970 DOI: 10.3390/s19102307
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Overview of HAR system. Handcrafted feature extraction based HAR contains data collection, signal processing, feature extraction and CELearning model. Automatic feature extraction based HAR contains data collection, FFT and CELearning model.
Figure 2CELearning model. Each layer is composed of four basic classifiers which generate the probability vectors as augmented features for next layer’s learning.
Figure 3Class vector generation of the randomized decision trees. A probability vector is obtained from each decision tree and the final probability vector of randomized decision trees is jointly generated by all the decision trees.
Figure 4Augmented features generation of each classifier. K-fold cross validation is used for each classifier to generate K-1 estimated class vectors, which are averaged to obtain a final vector as augmented features.
Figure 5Three-axial linear acceleration and three-axial angular velocity: (a–c) the three-axis data of the accelerometer, respectively; and (d–f) the three-axis data of the gyroscope, respectively.
Confusion matrix of HAR based on handcrafted feature extraction.
| Target Class | ||||||||
|---|---|---|---|---|---|---|---|---|
| Walking | Upstairs | Downstairs | Sitting | Standing | Lying | Precision | ||
|
| Walking | 492 | 22 | 4 | 0 | 0 | 0 | 94.98% |
| Upstairs | 1 | 448 | 17 | 0 | 0 | 0 | 96.14% | |
| Downstairs | 3 | 1 | 399 | 0 | 0 | 0 | 99.01% | |
| Sitting | 0 | 0 | 0 | 464 | 17 | 0 | 96.47 | |
| Standing | 0 | 0 | 0 | 27 | 515 | 0 | 95.02% | |
| Lying | 0 | 0 | 0 | 0 | 0 | 537 | 100.00% | |
| Recall | 99.19% | 95.12% | 95.00% | 94.50% | 96.80% | 100.00% | 96.88% | |
Comparison of different methods based on handcrafted feature extraction.
| Approach | Accuracy |
|---|---|
| ANN (as reported in [ | 91.08% |
| SVM [ | 96.00% |
| DBN (as reported in [ | 95.80% |
| SAE [ | 96.50% |
| CELearning (proposed) | 96.88% |
Confusion matrix of HAR based on automatic feature extraction.
| Target Class | ||||||||
|---|---|---|---|---|---|---|---|---|
| Walking | Upstairs | Downstairs | Sitting | Standing | Lying | Precision | ||
|
| Walking | 493 | 3 | 12 | 0 | 0 | 0 | 97.05% |
| Upstairs | 1 | 464 | 27 | 1 | 0 | 0 | 94.12% | |
| Downstairs | 2 | 4 | 381 | 0 | 0 | 0 | 98.45% | |
| Sitting | 0 | 0 | 0 | 432 | 12 | 0 | 97.30% | |
| Standing | 0 | 0 | 0 | 58 | 520 | 0 | 89.97% | |
| Lying | 0 | 0 | 0 | 0 | 0 | 537 | 100.00% | |
| Recall | 99.40% | 98.51% | 90.71% | 87.98% | 97.74% | 100.00% | 95.93% | |
Comparison of different methods based on automatic feature extraction.
| Approach | Accuracy |
|---|---|
| CNN [ | 95.75% |
| DBN (as reported in [ | 95.50% |
| SAE [ | 95.59% |
| CELearning (proposed) | 95.93% |
Figure 6Convergence curves of the proposed model for HAR: (a) the convergence curve of handcrafted feature extraction based HAR; and (b) the convergence curve of automatic feature extraction based HAR.
Comparison of different combinations of four classifiers based on handcrafted feature extraction.
| Approach | Mean Value | Standard Deviation |
|---|---|---|
| XGBoost | 90.87 | 0.00 |
| ExtraTrees | 94.12 | 0.15 |
| Random Forest | 92.77 | 0.19 |
| Softmax Regression | 95.89 | 0.00 |
| CELearning (Softmax Regression + ExtraTrees) | 96.64 | 0.08 |
| CELearning (Softmax Regression + ExtraTrees + Random Forest) | 96.65 | 0.11 |
| CELearning (proposed) | 96.67 | 0.11 |
Comparison of different combinations of four classifiers based on automatic feature extraction.
| Approach | Mean Value | Standard Deviation |
|---|---|---|
| XGBoost | 94.33 | 0.00 |
| ExtraTrees | 91.40 | 0.12 |
| Random Forest | 92.12 | 0.20 |
| Softmax Regression | 90.77 | 0.00 |
| CELearning (XGBoost + Softmax Regression) | 95.45 | 0.15 |
| CELearning (XGBoost + Softmax Regression + Random Forest) | 95.56 | 0.15 |
| CELearning (proposed) | 95.82 | 0.06 |
Comparison of different methods for 12 categories of HAR.
| Approach | Total Rightly | Overall Accuracy | Total Wrongly |
|---|---|---|---|
| ANN | 2816 | 89.06% | 346 |
| SVM | 2976 | 94.12% | 186 |
| CELearning | 3007 | 95.10% | 155 |