| Literature DB >> 27413392 |
Robertas Damaševičius1, Mindaugas Vasiljevas1, Justas Šalkevičius1, Marcin Woźniak2.
Abstract
Automatic human activity recognition systems aim to capture the state of the user and its environment by exploiting heterogeneous sensors attached to the subject's body and permit continuous monitoring of numerous physiological signals reflecting the state of human actions. Successful identification of human activities can be immensely useful in healthcare applications for Ambient Assisted Living (AAL), for automatic and intelligent activity monitoring systems developed for elderly and disabled people. In this paper, we propose the method for activity recognition and subject identification based on random projections from high-dimensional feature space to low-dimensional projection space, where the classes are separated using the Jaccard distance between probability density functions of projected data. Two HAR domain tasks are considered: activity identification and subject identification. The experimental results using the proposed method with Human Activity Dataset (HAD) data are presented.Entities:
Mesh:
Year: 2016 PMID: 27413392 PMCID: PMC4931102 DOI: 10.1155/2016/4073584
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Catalogue of features.
| Feature number | Description | Equation (notation) |
|---|---|---|
| 4–6 | Acceleration ( |
|
|
| ||
| 7–9 | Gyroscope ( |
|
|
| ||
| 10–15 | Moving variance of 100 samples of acceleration and gyroscope data |
|
|
| ||
| 16-17 | Movement intensity of acceleration and gyroscope data |
|
|
| ||
| 18 | Movement intensity of difference between acceleration and gyroscope data |
|
|
| ||
| 19–21 | Moving variance of 100 samples of movement intensity data |
|
|
| ||
| 22–24 | Polar coordinates of acceleration data |
|
|
| ||
| 25–27 | Polar coordinates of gyroscope data |
|
|
| ||
| 28–30 | Polar coordinates of difference between acceleration and gyroscope data |
|
|
| ||
| 31 | Simple moving average of acceleration data |
|
|
| ||
| 32 | Simple moving average of gyroscope data |
|
|
| ||
| 33 | Simple moving average of difference between acceleration and gyroscope data |
|
|
| ||
| 34 | First eigenvalue of moving covariance between acceleration data |
|
|
| ||
| 35 | First eigenvalue of moving covariance between gyroscope data |
|
|
| ||
| 36 | First eigenvalue of moving covariance of difference between acceleration and gyroscope data |
|
|
| ||
| 37–42 | Moving energy of acceleration and gyroscope data |
|
|
| ||
| 43–48 | Difference between moving maximum and moving minimum of acceleration and gyroscope data |
|
|
| ||
| 49 | Moving correlation between | MC |
|
| ||
| 50 | Moving correlation between | MC |
|
| ||
| 51 | Moving correlation between | MC |
|
| ||
| 52 | Moving correlation between | MC |
|
| ||
| 53 | Moving correlation between | MC |
|
| ||
| 54 | Moving correlation between | MC |
|
| ||
| 55–57 | Projection of gyroscope data onto acceleration data |
|
|
| ||
| 58 | Moving mean of orientation vector of acceleration data |
|
|
| ||
| 59 | Moving variance of orientation vector of acceleration data |
|
|
| ||
| 60 | Moving energy of orientation vector of acceleration data |
|
|
| ||
| 61–63 | Moving energy of difference between acceleration and gyroscope data |
|
|
| ||
| 64 | Moving energy of difference between |
|
|
| ||
| 65 | Moving energy of difference between |
|
|
| ||
| 66 | Moving energy of difference between |
|
|
| ||
| 67 | Moving mean of orientation vector of difference between acceleration and gyroscope data |
|
|
| ||
| 68 | Moving variance of orientation vector of difference between acceleration and gyroscope data |
|
|
| ||
| 69 | Moving energy of orientation vector of difference between acceleration and gyroscope data |
|
|
| ||
| 70 | Moving mean of orientation vector of gravity data |
|
|
| ||
| 71 | Moving variance of orientation vector of gravity data |
|
|
| ||
| 72 | Moving energy of orientation vector of gravity data |
|
|
| ||
| 73 | Moving mean of orientation vector of difference between acceleration and gravity data |
|
|
| ||
| 74 | Moving variance of orientation vector of difference between acceleration and gravity data |
|
|
| ||
| 75 | Moving energy of orientation vector of difference between acceleration and gravity data |
|
|
| ||
| 76–81 | Moving cumulative sum of acceleration and gyroscope data |
|
|
| ||
| 82 | Simple moving average of cumulative sums of acceleration data |
|
|
| ||
| 83 | Simple moving average of cumulative sums of gyroscope data |
|
|
| ||
| 84 | Simple moving average of cumulative sums of difference between accelerometer and gyroscope data |
|
|
| ||
| 85–90 | Moving 2nd-order cumulative sum of acceleration and gyroscope data |
|
|
| ||
| 91–93 | Moving 2nd-order cumulative sum of differences between cumulative sums of acceleration and gyroscope data |
|
|
| ||
| 94–96 | Polar coordinates of moving cumulative sum of acceleration data |
|
|
| ||
| 97–99 | Polar coordinates of moving cumulative sum of gyroscope data |
|
|
| ||
| 100–102 | Polar coordinates of moving cumulative sum of differences between acceleration and gyroscope data |
|
Summary of feature selection/dimensionality reduction methods in HAR.
| Method | Advantages | Disadvantages | Complexity |
|---|---|---|---|
| PCA | High dimensionality reduction; reduction of noise; lack of redundancy of data due to orthogonality of components | The covariance matrix is difficult to be evaluated accurately; even the simplest invariance could not be captured by the PCA unless the training data explicitly provides for it |
|
|
| |||
|
| Low computational complexity | Unstable due to random selection of instances |
|
|
| |||
| Rankfeatures | Features highly correlated with already selected features are less likely to be included | It assumes that data classes are normally distributed | It depends upon class separability criterion |
|
| |||
| CFS | It evaluates a subset of features rather than individual features | It fails to select locally predictive features when they are overshadowed by strong, globally predictive features |
|
Summary of related works in the HAR domain.
| Author | Activities | Sensor data | Features | Feature selection | Classification method | Accuracy |
|---|---|---|---|---|---|---|
| Atallah et al. [ | Lying down, preparing food, eating and drinking, socialising, reading, dressing, walking, treadmill walking, vacuuming, wiping tables, running, treadmill, running, cycling, sitting down/getting up, and lying down/getting up | Acceleration sensors | Averaged entropy over 3 axes, main FFT frequency (averaged) over 3 axes, energy of the 0.2 Hz window centred around main frequency over total FFT energy (3-axis average), and averaged mean of cross covariance between every 2 axes |
| kNN, Bayesian classifier | 90% |
|
| ||||||
| Bayat et al. [ | Running, slow walk, fast walk, aerobic dancing, stairs up, and stairs down | Triaxial accelerometer | Mean along | Feature clustering | Multilayer perceptron, SVM, Random Forest, and Logit Boost | 81%–91% |
|
| ||||||
| Berchtold et al. [ | Standing, sitting, lying, walking, climbing stairs, cycling, and being stationary | Accelerometer | Variance, mean | None | Fuzzy inference | 97.3% |
|
| ||||||
| Capela et al. [ | Sitting, standing, and lying; ramp up and ramp down; stairs up and stairs down; transition between activities | Linear acceleration, gravity, and velocity sensors | Range, mean, standard deviation, kurtosis, moving average, covariance matrix, skewness, zero cross rate, and mean cross rate | None | Naïve-Bayes, Support Vector Machine, and j48 decision tree | 97% |
|
| ||||||
| Gupta and Dallas [ | Jumping, running, walking, sitting, sitting-to-standing, and standing-to-kneeling | Triaxial accelerometer | Energy, entropy, mean, variance, mean trend, windowed mean difference, variance trend, windowed variance difference, detrended fluctuation analysis coefficients, |
| kNN, Naive Bayes | 98% |
|
| ||||||
| Henpraserttae et al. [ | Sitting, lying, standing, and walking | Accelerometer | Mean and standard deviation | None | Rules and threshold based classification | 90% |
|
| ||||||
| Hoque and Stankovic [ | Leaving house, using toilet, taking shower, sleeping, preparing breakfast, preparing dinner, getting snack, getting drink, using washing machine, and using dishwasher | Location sensors (open/closed) | Magnitude | None | Custom clustering method | 64.5%–89.9% |
|
| ||||||
| Iso and Yamazaki [ | Walking, running, stairs up/down, and fast walking | Accelerometer | Wavelet components, periodograms, and information entropy | None | Bayesian probabilities | 80% |
|
| ||||||
| Kose et al. [ | Walking, running, biking, sitting, and standing | Accelerometer | Min., max., average, variance, FFT coefficients, and autocorrelation | None | Clustered kNN | 95.2%–97.5% |
|
| ||||||
| Kwapisz et al. [ | Walking, jogging, stairs up/down, sitting, and standing | Accelerometer | Mean, std. dev., average absolute difference, average resultant acceleration, time between peaks, and binned distribution | None | Decision tree, logistic regression, and MNN | 91.7% |
|
| ||||||
| Lane et al. [ | Driving, being stationary, running, and walking | GPS, accelerometer, and microphone | Mean, variance | None | Naïve-Bayes | 85–98% |
|
| ||||||
| Lee and Cho [ | Standing, walking, running, stairs up/down, shopping, and taking bus | Accelerometer |
| None | Hierarchical HMM | 70%–90% |
|
| ||||||
| Mannini and Sabatini [ | Walking, walking carrying items, sitting & relaxing, working on computer, standing still, eating or drinking, watching TV, reading, running, bicycling, stretching, strength training, scrubbing, vacuuming, folding laundry, lying down and relaxing, brushing teeth, climbing stairs, riding elevator, and riding escalator | Acceleration sensors | DC component, energy, frequency-domain entropy, and correlation coefficients | SFFS (Pudil algorithm) | Continuous emissions, Hidden Markov Model | 99.1% |
|
| ||||||
| Mathie et al. [ | Various human movements, including resting, walking, and falling | Triaxial acceleration sensor | Integrated area under curve | None | Binary decision tree | 97.7% (sensitivity) 98.7% (specificity) |
|
| ||||||
| Maurer et al. [ | Walking, standing, sitting, running, and ascending and descending the stairs | Multiple sensors | Mean, root mean square, standard deviation, variance, mean absolute deviation, cumulative histogram, | Correlation-based Feature Selection (CFS) | Decision trees (C4.5 algorithm), | 80%–92% |
|
| ||||||
| Miluzzo et al. [ | Sitting, standing, walking, and running | Accelerometer, GPS, and audio | DFT, FFT features, mean, std. dev. and number of peaks per unit, and time deviation of DFT power | None | Decision tree | 79% |
|
| ||||||
| Pärkkä et al. [ | Lying down, rowing, ex-biking, sitting/standing, running, and Nordic walking | GPS, audio, altitude, EKG, accelerometer, compass, humidity, light, temperature, heart rate, pulse, respiratory effort, and skin resistance | Peak frequency of up-down chest acceleration, median of up-down chest acceleration, peak power of up-down chest acceleration, variance of back-forth chest acceleration, sum of variances of 3D wrist acceleration, and power ratio of frequency bands 1–1.5 Hz and 0.2–5 Hz measured from chest magnetometer | Heuristic | Decision tree | 86% |
|
| ||||||
| Saponas et al. [ | Walking, jogging | Accelerometer | 124 features: Nike + iPod Packet Payload, magnitude (mean, std. dev., min., max., and min. minus max.), frequency (energy in each of the first 10 frequency components of DFT, energy in each band of 10 frequency components, largest frequency component, and index of the largest frequency component) | None | Naïve-Bayesian Network | 97.4% (within-person), 99.48% (cross-person) |
|
| ||||||
| Siirtola and Röning [ | Walking, running, cycling, driving, sitting, and standing | Accelerometer | Magnitude, std., mean, min., max., percentiles (10, 25, 50, 75, and 90), and sum and square sum of observations above/below percentile (5, 10, 25, 75, 90, and 95) of magnitude acceleration and square sum of | None | Decision tree + kNN/QDA | 95% |
|
| ||||||
| Sohn et al. [ | Walking, driving, and dwelling | GPS | Spearman rank correlation, variance, and mean Euclidean distance over a window of measurements | None | Logistic regression | 85% |
|
| ||||||
| Yang [ | Sitting, Standing, walking, running, driving, and bicycling | Accelerometer | Mean, std., zero crossing rate, 75th percentile, interquartile, spectrum centroid, entropy, and cross-correlation | None | Decision tree, Naïve-Bayes, kNN, and SVM | 90% |
|
| ||||||
| Zhu and Sheng [ | Sitting, standing, lying, walking, sitting-to-standing, standing-to-sitting, lying-to-sitting, and sitting-to-lying | 3D acceleration | Mean, variance | None | Neural network ensemble | 67%–98% |
Figure 1General scheme of the proposed method.
Figure 2Graphical illustration of good separation versus bad separation of kernel density estimation functions (Subject 1, Trial 1, Walking Forward versus Walking Upstairs; 2nd dimension).
Figure 3Example of classification: walking versus running (Subject 1, Trial 1) classes randomly projected in a bidimensional feature subspace.
Pseudocode 1Pseudocode of FindBestProjection.
Pseudocode 2Pseudocode of binary classification.
Top features for binary classification of human activities.
| Activity | WL | WR | WU | WD | RF | JU | Si | St | Sl | EU | ED |
|---|---|---|---|---|---|---|---|---|---|---|---|
| WF | 97, 88, 91 | 85, 100, 88 | 15, 63, 42 | 14, 39, 34 | 10, 15, 60 | 10, 19, 37 | 17, 18, 30 | 19, 36, 35 | 34, 19, 36 | 15, 36, 35 | 15, 42, 63 |
| WL | 97, 91, 88 | 97, 63, 42 | 87, 97, 86 | 10, 37, 19 | 10, 37, 19 | 17, 18, 30 | 19, 36, 34 | 34, 19, 36 | 15, 36, 35 | 15, 42, 63 | |
| WR | 63, 42, 15 | 34, 87, 39 | 10, 59, 37 | 10, 37, 19 | 17, 18, 30 | 19, 34, 14 | 34, 19, 85 | 15, 36, 35 | 15, 42, 63 | ||
| WU | 87, 39, 78 | 63, 42, 15 | 10, 19, 63 | 17, 18, 26 | 19, 14, 62 | 34, 19, 35 | 15, 36, 35 | 15, 42, 63 | |||
| WD | 65, 38, 34 | 10, 19, 60 | 17, 18, 7 | 19, 10, 20 | 19, 34, 35 | 15, 36, 35 | 15, 42, 63 | ||||
| RF | 35, 36, 62 | 38, 36, 17 | 19, 10, 15 | 34, 19, 35 | 15, 36, 35 | 15, 42, 63 | |||||
| JU | 4, 22, 16 | 19, 10, 15 | 19, 34, 36 | 15, 42, 63 | 15, 42, 63 | ||||||
| Si | 22, 5, 38 | 22, 4, 85 | 15, 42, 63 | 59, 60, 15 | |||||||
| St | 76, 85, 39 | 10, 15, 36 | 59, 11, 60 | ||||||||
| Sl | 15, 22, 42 | 59, 60, 94 | |||||||||
| EU | 59, 60, 10 |
WF: Walking Forward; WL: Walking Left; WR: Walking Right; WU: Walking Upstairs; WD: Walking Downstairs; RF: Running Forward; JU: Jumping Up; Si: Sitting; St: Standing; Sl: Sleeping; EU: Elevator Up; ED: Elevator Down.
The confusion matrix of within-subject activity classification using Rankfeatures.
| Activity | WF | WL | WR | WU | WD | RF | JU | Si | St | Sl | EU | ED |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| WF | 1 | 0.774 | 0.980 | 0.874 | 0.985 | 0.996 | 0.980 | 0.997 | 0.999 | 1 | 0.999 | 1 |
| WL | 0.774 | 1 | 0.989 | 0.968 | 0.958 | 0.998 | 0.996 | 0.951 | 0.999 | 1 | 0.999 | 1 |
| WR | 0.980 | 0.989 | 1 | 0.798 | 0.998 | 0.997 | 0.988 | 0.988 | 0.971 | 1 | 0.981 | 1 |
| WU | 0.874 | 0.968 | 0.798 | 1 | 0.708 | 0.985 | 0.979 | 0.962 | 0.998 | 1 | 0.971 | 1 |
| WD | 0.985 | 0.958 | 0.998 | 0.708 | 1 | 0.992 | 0.878 | 0.967 | 0.850 | 1 | 0.986 | 1 |
| RF | 0.996 | 0.998 | 0.997 | 0.985 | 0.992 | 1 | 0.978 | 0.991 | 0.996 | 0.957 | 0.994 | 1 |
| JU | 0.980 | 0.996 | 0.988 | 0.979 | 0.878 | 0.978 | 1 | 0.973 | 1 | 0.929 | 1 | 1 |
| Si | 0.997 | 0.951 | 0.988 | 0.962 | 0.967 | 0.991 | 0.973 | 1 | 0.987 | 1 | 0.126 | 0.992 |
| St | 0.999 | 0.999 | 0.971 | 0.999 | 0.850 | 0.995 | 1 | 0.987 | 1 | 1 | 0.326 | 0.887 |
| Sl | 1 | 1 | 1 | 1 | 1 | 0.957 | 0.930 | 1 | 1 | 1 | 1 | 0.992 |
| EU | 0.999 | 0.999 | 0.981 | 0.971 | 0.986 | 0.994 | 1 | 0.126 | 0.326 | 1 | 1 | 0.697 |
| ED | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.992 | 0.887 | 0.992 | 0.697 | 1 |
| Mean |
|
|
|
|
|
|
|
|
|
|
|
|
| Baseline | 0.650 | 0.616 | 0.616 | 0.712 | 0.713 | 0.621 | 0.641 | 0.627 | 0.642 | 0.651 | 0.640 | 0.628 |
WF: Walking Forward; WL: Walking Left; WR: Walking Right; WU: Walking Upstairs; WD: Walking Downstairs; RF: Running Forward; JU: Jumping Up; Si: Sitting; St: Standing; Sl: Sleeping; EU: Elevator Up; ED: Elevator Down.
The confusion matrix of within-subject activity classification using ReliefF.
| Activity | WF | WL | WR | WU | WD | RF | JU | Si | St | Sl | EU | ED |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| WF | 1.000 | 0.998 | 0.892 | 0.931 | 0.993 | 1.000 | 0.999 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| WL | 0.998 | 1.000 | 0.853 | 0.989 | 0.997 | 1.000 | 1.000 | 0.999 | 1.000 | 0.896 | 0.971 | 1.000 |
| WR | 0.892 | 0.853 | 1.000 | 0.789 | 0.964 | 1.000 | 1.000 | 0.999 | 1.000 | 0.853 | 1.000 | 1.000 |
| WU | 0.931 | 0.989 | 0.789 | 1.000 | 0.702 | 1.000 | 1.000 | 0.996 | 0.956 | 0.992 | 0.999 | 1.000 |
| WD | 0.993 | 0.997 | 0.964 | 0.702 | 1.000 | 0.965 | 0.998 | 0.655 | 0.997 | 0.982 | 1.000 | 0.975 |
| RF | 1.000 | 1.000 | 1.000 | 1.000 | 0.965 | 1.000 | 0.688 | 0.999 | 1.000 | 1.000 | 1.000 | 1.000 |
| JU | 0.999 | 1.000 | 1.000 | 1.000 | 0.998 | 0.688 | 1.000 | 1.000 | 1.000 | 0.993 | 1.000 | 1.000 |
| Si | 1.000 | 0.999 | 0.999 | 0.996 | 0.655 | 0.999 | 1.000 | 1.000 | 0.491 | 0.967 | 0.328 | 0.313 |
| St | 1.000 | 1.000 | 1.000 | 0.956 | 0.997 | 1.000 | 1.000 | 0.491 | 1.000 | 0.766 | 0.528 | 0.901 |
| Sl | 1.000 | 0.896 | 0.853 | 0.992 | 0.982 | 1.000 | 0.993 | 0.967 | 0.766 | 1.000 | 1.000 | 1.000 |
| EU | 1.000 | 0.971 | 1.000 | 0.999 | 1.000 | 1.000 | 1.000 | 0.328 | 0.528 | 1.000 | 1.000 | 0.765 |
| ED | 1.000 | 1.000 | 1.000 | 1.000 | 0.975 | 1.000 | 1.000 | 0.313 | 0.901 | 1.000 | 0.765 | 1.000 |
| Mean |
|
|
|
|
|
|
|
|
|
|
|
|
| Baseline | 0.621 | 0.637 | 0.600 | 0.695 | 0.703 | 0.644 | 0.618 | 0.628 | 0.640 | 0.644 | 0.635 | 0.642 |
WF: Walking Forward; WL: Walking Left; WR: Walking Right; WU: Walking Upstairs; WD: Walking Downstairs; RF: Running Forward; JU: Jumping Up; Si: Sitting; St: Standing; Sl: Sleeping; EU: Elevator Up; ED: Elevator Down.
Results of one-versus-all subject identification (all activities).
| Subjects | Accuracy | Precision | Recall |
|
|---|---|---|---|---|
| S1 | 0.657 | 0.142 | 0.744 | 0.239 |
| S2 | 0.273 | 0.075 | 0.917 | 0.139 |
| S3 | 0.549 | 0.078 | 0.716 | 0.140 |
| S4 | 0.496 | 0.073 | 0.697 | 0.132 |
| S5 | 0.323 | 0.068 | 0.931 | 0.127 |
| S6 | 0.863 | 0.220 | 0.637 | 0.328 |
| S7 | 0.265 | 0.055 | 0.920 | 0.103 |
| S8 | 0.107 | 0.071 | 0.985 | 0.132 |
| S9 | 0.683 | 0.229 | 0.967 | 0.370 |
| S10 | 0.156 | 0.091 | 0.943 | 0.166 |
| S11 | 0.755 | 0.263 | 0.905 | 0.407 |
| S12 | 0.689 | 0.167 | 0.533 | 0.254 |
| S13 | 0.373 | 0.115 | 0.881 | 0.203 |
| S14 | 0.493 | 0.112 | 0.866 | 0.198 |
| Mean |
|
|
|
|
| Random baseline | 0.071 | 0.071 | 0.929 | 0.133 |
Results of one-versus-all subject identification for specific activities.
| Activity | Accuracy |
|
|---|---|---|
| Walking Forward | 0.947 | 0.727 |
| Walking Left | 0.955 | 0.769 |
| Walking Right | 0.931 | 0.722 |
| Walking Upstairs | 0.857 | 0.551 |
| Walking Downstairs | 0.833 | 0.497 |
| Running Forward | 0.832 | 0.496 |
| Jumping Up | 0.814 | 0.453 |
| Sitting | 0.506 | 0.391 |
| Standing | 0.722 | 0.432 |
| Sleeping | 0.589 | 0.292 |
| Elevator Up | 0.337 | 0.235 |
| Elevator Down | 0.318 | 0.232 |
| Mean |
|
|
| Random baseline | 0.071 | 0.133 |
Accuracy of binary subject identification using separate activities.
| Activity | Accuracy |
|
|---|---|---|
| Walking Forward | 0.992 | 0.987 |
| Walking Left | 0.989 | 0.987 |
| Walking Right | 0.993 | 0.993 |
| Walking Upstairs | 0.977 | 0.970 |
| Walking Downstairs | 0.974 | 0.971 |
| Running Forward | 0.980 | 0.974 |
| Jumping Up | 0.983 | 0.980 |
| Sitting | 0.883 | 0.859 |
| Standing | 0.940 | 0.932 |
| Sleeping | 0.956 | 0.953 |
| Elevator Up | 0.856 | 0.847 |
| Elevator Down | 0.846 | 0.822 |
| Mean |
|
|
| Random baseline | 0.5 | 0.5 |
Summary of HAR results using USC-HAD dataset.
| Reference | Features | Classification method | Accuracy |
|---|---|---|---|
| Zheng [ | Means and variances of magnitude and angles of acceleration along | Least Squares Support Vector Machine (LS-SVM), Naïve-Bayes (NB) | 95.6% |
| Sivakumar [ | Accelerometer and gyroscope data | Symbolic approximation | 84.3% |
| Vaka [ | Mean, std. dev., correlation between | Random Forest | 90.7% |
| This paper | 99 times, frequency and physical features | Heuristic (random projections + PDFs + Jaccard distance) | 95.52% |