| Literature DB >> 32295028 |
Daniel Garcia-Gonzalez1, Daniel Rivero1, Enrique Fernandez-Blanco1, Miguel R Luaces1.
Abstract
In recent years, human activity recognition has become a hot topic inside the scientific community. The reason to be under the spotlight is its direct application in multiple domains, like healthcare or fitness. Additionally, the current worldwide use of smartphones makes it particularly easy to get this kind of data from people in a non-intrusive and cheaper way, without the need for other wearables. In this paper, we introduce our orientation-independent, placement-independent and subject-independent human activity recognition dataset. The information in this dataset is the measurements from the accelerometer, gyroscope, magnetometer, and GPS of the smartphone. Additionally, each measure is associated with one of the four possible registered activities: inactive, active, walking and driving. This work also proposes asupport vector machine (SVM) model to perform some preliminary experiments on the dataset. Considering that this dataset was taken from smartphones in their actual use, unlike other datasets, the development of a good model on such data is an open problem and a challenge for researchers. By doing so, we would be able to close the gap between the model and a real-life application.Entities:
Keywords: HAR; SVM; dataset; human activity recognition; sensors; smartphones
Mesh:
Year: 2020 PMID: 32295028 PMCID: PMC7218897 DOI: 10.3390/s20082200
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Comparison between datasets: UCI HAR, WISDM and the proposed one.
| UCI HAR | WISDM | Proposed | |
|---|---|---|---|
|
| Short-themed | Short-themed |
|
|
| Fixed | Fixed |
|
|
| Yes | Yes |
|
|
| Yes | Yes |
|
|
| Acc. and gyro. | Acc. and gyro. |
|
Average number of recordings per second for each sensor and each activity measured.
| Activity | Accelerometer Hz. | Gyroscope Hz. | Magnetometer Hz. | GPS Hz. |
|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Dataset distribution for each activity measured.
| Activity | Time Recorded (s) | Number of Recordings | Number of Samples | Percentage of Data |
|---|---|---|---|---|
|
| 292,213 | 147 | 7,064,757 | 24.25% |
|
| 178,806 | 99 | 8,918,021 | 30.62% |
|
| 98,071 | 200 | 4,541,130 | 15.59% |
|
| 112,226 | 128 | 8,602,902 | 29.54% |
|
| 681,316 | 574 | 29,126,810 | 100% |
Dataset distribution for each activity measured without gyroscope.
| Activity | Time Recorded (s) | Number of Recordings | Number of Samples | Percentage of Data |
|---|---|---|---|---|
|
| 11,523 | 8 | 668,536 | 2.29% |
|
| 13,866 | 7 | 619,913 | 2.13% |
|
| 4169 | 15 | 584,262 | 2.01% |
|
| 25,718 | 23 | 3,776,468 | 12.97% |
|
| 55,276 | 53 | 5,649,179 | 19.40% |
Dataset distribution for each activity measured without gyroscope and magnetometer.
| Activity | Time Recorded (s) | Number of Recordings | Number of Samples | Percentage of Data |
|---|---|---|---|---|
|
| 5409 | 2 | 269,710 | 0.93% |
|
| 10,286 | 2 | 90,487 | 0.31% |
|
| 0 | 0 | 0 | 0% |
|
| 0 | 0 | 0 | 0% |
|
| 25,695 | 4 | 360,197 | 1.24% |
Sensor’s mean and standard deviation values for each activity measured.
| Activity | |||||
|---|---|---|---|---|---|
| Inactive | Active | Walking | Driving | ||
|
| 0.11761 | −0.01338 | 0.09425 | −0.04747 | |
|
|
| 0.06136 | 0.07598 | −0.37604 | −0.12936 |
|
| 0.84318 | 0.13008 | 0.07353 | 0.18127 | |
|
| −0.00004 | −0.00001 | 0.00760 | 0.00080 | |
|
|
| 0.00004 | −0.00102 | −0.00020 | 0.00277 |
|
| 0.00001 | 0.00055 | −0.00560 | −0.00243 | |
|
| 25.93805 | 6.03153 | −0.28182 | −5.96356 | |
|
|
| −19.62683 | −0.02890 | 18.73800 | 10.73609 |
|
| −56.60425 | 9.56310 | 0.64541 | −2.93043 | |
|
| 0.00075 | 0.00112 | 0.00047 | 0.00175 | |
|
| 0.00125 | 0.00118 | 0.00056 | 0.00204 | |
|
|
| 32.59169 | 30.77538 | 34.06931 | 41.59391 |
|
| 0.37222 | 0.12109 | 0.79924 | 10.82191 | |
|
| 57.25005 | 14.69719 | 124.85103 | 118.88108 | |
|
| 265.44485 | 214.57640 | 75.54539 | 192.90736 | |
Number of patterns for the samples containing all the sensors with a sliding window of 20 s and 19 s overlap.
| Activity | ||||
|---|---|---|---|---|
|
|
|
|
|
|
| 201,501 | 137,407 | 86,383 | 77,852 | 503,143 |
| (40%) | (27%) | (17%) | (16%) | |
Mean f1-scores achieved for each combination of kernel, C, and degree hyperparameters in the grid search. The best result found is highlighted in bold.
| C = 1 | C = 10 | C = 100 | C = 1000 | C = 10,000 | ||
|---|---|---|---|---|---|---|
|
| 36.15% | 31.41% | 31.41% | 31.41% | 31.41% | |
| 10.56% | 4.57% | 17.04% | 40.72% | 34.70% | ||
| 20.67% | 21.30% | 39.71% | 38.70% | 46.70% | ||
|
| 60.37% |
| 56.47% | 57.20% | 56.49% | |
| 51.76% | 54.10% | 57.09% | 51.62% | 51.36% | ||
|
| 41.16% | 41.28% | 41.28% | 41.28% | ||
| 18.09% | 21.04% | 41.00% | 32.67% | 37.12% | ||
| 16.09% | 37.86% | 37.82% | 37.26% | 32.01% | ||
|
| 37.73% | 41.49% | 36.16% | 36.67% | 36.67% | |
| 33.36% | 32.58% | 34.11% | 34.11% | 34.11% | ||
| 36.15% | 31.41% | 31.41% | 31.41% | 31.41% | ||
| 10.96% | 6.27% | 7.03% | 9.34% | 9.60% | ||
| 7.03% | 9.10% | 8.39% | 10.62% | 22.55% | ||
|
| 9.60% | 10.55% | 23.08% | 24.34% | 27.69% | |
| 22.73% | 23.46% | 25.84% | 25.82% | 25.82% | ||
| 25.58% | 25.59% | 25.59% | 25.59% | 25.59% | ||
| 6.11% | 6.86% | 10.61% | 9.15% | 11.16% | ||
| 9.15% | 11.16% | 6.04% | 8.56% | 19.86% | ||
|
| 8.32% | 23.63% | 23.18% | 20.63% | 30.29% | |
| 21.79% | 25.40% | 27.70% | 27.70% | 27.70% | ||
| 23.11% | 23.11% | 23.11% | 23.11% | 23.11% | ||
| 7.33% | 8.20% | 6.96% | 4.78% | 10.36% | ||
| 10.36% | 7.63% | 7.84% | 13.20% | 9.68% | ||
|
| 9.68% | 9.54% | 8.04% | 7.11% | 11.79% | |
| 8.39% | 12.34% | 12.34% | 12.34% | 12.34% | ||
| 9.02% | 9.02% | 9.02% | 9.02% | 9.02% | ||
Average confusion matrix for the experiments conducted.
| Ground Truth | |||||||
|---|---|---|---|---|---|---|---|
| Inactive | Active | Walking | Driving | Precision | |||
| Inactive | 15,887 | 1904 | 1165 | 1195 | 78.84% | ||
| Active | 3226 | 6159 | 3134 | 1222 | 44.82% | ||
| Walking | 259 | 1540 | 5863 | 976 | 67.88% | ||
| Driving | 149 | 653 | 1073 | 5910 | 75.92% | ||
| Recall | 81.38% | 60.05% | 52.19% | 63.53% | 67.22% | ||
Mean accuracies achieved for each set of data, with the best group result highlighted in bold.
| Acc. + GPS. | Acc. + Magn. + GPS | Acc. + Gyro. + Magn.+ GPS |
|---|---|---|
| 60.10% | 62.66% |