| Literature DB >> 30428600 |
Taeho Hur1, Jaehun Bang2, Thien Huynh-The3, Jongwon Lee4, Jee-In Kim5, Sungyoung Lee6.
Abstract
The most significant barrier to success in human activity recognition is extracting and selecting the right features. In traditional methods, the features are chosen by humans, which requires the user to have expert knowledge or to do a large amount of empirical study. Newly developed deep learning technology can automatically extract and select features. Among the various deep learning methods, convolutional neural networks (CNNs) have the advantages of local dependency and scale invariance and are suitable for temporal data such as accelerometer (ACC) signals. In this paper, we propose an efficient human activity recognition method, namely Iss2Image (Inertial sensor signal to Image), a novel encoding technique for transforming an inertial sensor signal into an image with minimum distortion and a CNN model for image-based activity classification. Iss2Image converts real number values from the X, Y, and Z axes into three color channels to precisely infer correlations among successive sensor signal values in three different dimensions. We experimentally evaluated our method using several well-known datasets and our own dataset collected from a smartphone and smartwatch. The proposed method shows higher accuracy than other state-of-the-art approaches on the tested datasets.Entities:
Keywords: accelerometer; convolutional neural network; encoder; human activity recognition; signal transformation; smartphone; smartwatch
Mesh:
Year: 2018 PMID: 30428600 PMCID: PMC6263516 DOI: 10.3390/s18113910
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1The workflow of our proposed method for human activity recognition; the main contributions are the data encoder and CNN model.
Figure 2An example of encoding an ACC signal into an image.
Figure 3The architecture of UCNet6 for image-based activity recognition.
The detailed CNN architecture of UCNet6.
| Layer | No. of Filters | Size of Filer/Pooling | Size of Stride | Size of Padding |
|---|---|---|---|---|
| conv1 | 64 | 3 | 1 | 1 |
| conv2 | 128 | 3 | 1 | 1 |
| maxpool1 | - | 2 | 2 | 0 |
| conv3 | 64 | 3 | 1 | 1 |
| conv4 | 128 | 3 | 1 | 1 |
| maxpool2 | - | 2 | 2 | 0 |
| conv5 | 64 | 3 | 1 | 1 |
| conv6 | 128 | 3 | 1 | 1 |
Activity list for all the datasets and their abbreviations.
| MobiAct | DaLiAc | UCI-HAR | UC-HAR |
|---|---|---|---|
| Standing (STD) | Sitting (SI) | Walking | Eating (Eat) |
| Walking (WAK) | Lying (LY) | Upstairs | Lying (Lie) |
| Jogging (JOG) | Standing (ST) | Downstairs | Running (Run) |
| Jumping (JUM) | Washing dishes (WD) | Standing | Sitting (Sit) |
| Stairs up (STU) | Vacuuming (VC) | Sitting | Standing (Std) |
| Stairs down (STN) | Sweeping (SW) | Lying | Stretching (Str) |
| Stand to sit (SCH) | Walking (WK) | Sweeping (Swp) | |
| Car step in (CSI) | Ascending stairs (AS) | Walking (Wlk) | |
| Car step out (CSO) | Descending stairs (DS) | ||
| Running (RU) | |||
| Bicycling on ergometer 50 W (BC50) | |||
| Bicycling on ergometer 100 W (BC100) | |||
| Rope jumping (RJ) |
Figure 4The confusion matrices of the recognition results of our proposed Iss2Image and UCNet6 method on the different datasets: (a) MobiAct, (b) DaLiAc, (c) UCI-HAR, and (d) UC-HAR.
Average recognition accuracy of our proposed method on the different datasets.
| Dataset | Accuracy (%) |
|---|---|
| MobiAct | 100.00 |
| DaLiAc | 98.90 |
| UCI-HAR | 97.11 |
| UC-HAR | 98.16 |
Figure 5An illustration of activity images generated by (a) raw signal plot, (b) spectrogram, (c) recurrence plot, (d) multichannel, and (e) Iss2Image.
Average recognition accuracy (%) comparison of the different transformation methods with the different datasets.
| Method | MobiAct | DaLiAc | UCI-HAR | UC-HAR |
|---|---|---|---|---|
| Raw signal plot [ | 98.22 | 92.06 | 92.86 | 93.08 |
| Spectrogram [ | 98.02 | 94.54 | 91.02 | 93.40 |
| Recurrence plot [ | 100.00 | 84.75 | 88.47 | 93.15 |
| Multichannel [ | 99.88 | 98.12 | 96.60 | 98.14 |
| Iss2Image | 100.00 | 98.90 | 97.11 | 98.16 |
Recognition accuracy (%) comparison of different transformation methods on different networks.
| Method | ResNet18 | GoogleNet | AlexNet | UCNet6 | UCNet6 |
|---|---|---|---|---|---|
| Raw signal plot | 93.82 | 94.55 | 94.15 | 93.08 | 94.24 |
| Spectrogram | 95.19 | 96.87 | 96.09 | 93.40 | 95.46 |
| Recurrence plot | 98.10 | 96.38 | 96.15 | 91.15 | 94.37 |
| Multichannel | 98.55 | 99.02 | 98.07 | 98.14 | 98.71 |
| Iss2Image | 99.70 | 99.46 | 98.97 | 98.16 | 99.27 |
| Average | 97.07 | 97.25 | 96.68 | 95.78 | 96.41 |
Average accuracy comparison between Iss2Image-UCNet6, and existing methods on the MobiAct dataset.
| Method | Accuracy (%) |
|---|---|
| IBk [ | 99.88 |
| J48 [ | 99.30 |
| SVM [ | 97.45 |
| Iss2Image-UCNet6 | 100.00 |
Average accuracy comparison between Iss2Image-UCNet6 and existing methods on the DaLiAc dataset.
| Method | Accuracy (%) |
|---|---|
| Hierarchical classifier [ | 89.60 |
| Decision tree [ | 80.00 |
| kNN [ | 68.70 |
| SVM [ | 93.00 |
| Iss2Image-UCNet6 | 96.40 |
Average accuracy comparison between Iss2Image-UCNet6 and existing methods on the UCI-HAR dataset.
| Method | Accuracy (%) |
|---|---|
| MC-SVM [ | 89.30 |
| Convnet + MLP [ | 94.79 |
| tFFT + Convnet [ | 95.75 |
| GCHAR [ | 94.16 |
| Iss2Image-UCNet6 | 96.84 |
Comparison of image creation time on different signal transformation methods.
| Method | Created Images for 10 s |
|---|---|
| Raw signal plot | 7 images |
| Spectrogram | 4 images |
| Recurrence plot | 699 images |
| Multichannel | 2838 images |
| Iss2Image | 2772 images |
Comparison of training times (minutes) on different networks.
| Method | ResNet18 | GoogleNet | AlexNet | UCNet6 | UCNet6 |
|---|---|---|---|---|---|
| Raw signal plot | 60 | 57 | 73 | 309 | 9 |
| Spectrogram | 63 | 58 | 71 | 330 | 8 |
| Recurrence plot | 64 | 60 | 77 | 318 | 14 |
| Multichannel | 61 | 58 | 78 | 7 | 6 |
| Iss2Image | 62 | 54 | 72 | 9 | 7 |
| Average | 62 | 57 | 74 | 195 | 9 |
Comparison of inference time (seconds) on different networks with 1000 samples.
| Method | ResNet18 | GoogleNet | AlexNet | UCNet6 |
|---|---|---|---|---|
| Raw signal plot | 1.89 | 1.77 | 1.18 | 0.61 |
| Spectrogram | 2.06 | 1.90 | 1.33 | 0.87 |
| Recurrence plot | 2.12 | 1.93 | 1.33 | 0.88 |
| Multichannel | 1.96 | 1.86 | 1.30 | 0.17 |
| Iss2Image | 1.99 | 1.87 | 1.27 | 0.29 |
| Average | 2.01 | 1.87 | 1.29 | 0.56 |