| Literature DB >> 27271623 |
Zae Myung Kim1, Young-Seob Jeong2, Hyung Rai Oh3, Kyo-Joong Oh4, Chae-Gyun Lim5, Youssef Iraqi6, Ho-Jin Choi7.
Abstract
For the past few decades, action recognition has been attracting many researchers due to its wide use in a variety of applications. Especially with the increasing number of smartphone users, many studies have been conducted using sensors within a smartphone. However, a lot of these studies assume that the users carry the device in specific ways such as by hand, in a pocket, in a bag, etc. This paper investigates the impact of providing an action recognition system with the information of the possession-way of a smartphone, and vice versa. The experimental dataset consists of five possession-ways (hand, backpack, upper-pocket, lower-pocket, and shoulder-bag) and two actions (walking and running) gathered by seven users separately. Various machine learning models including recurrent neural network architectures are employed to explore the relationship between the action recognition and the possession-way recognition. The experimental results show that the assumption of possession-ways of smartphones do affect the performance of action recognition, and vice versa. The results also reveal that a good performance is achieved when both actions and possession-ways are recognized simultaneously.Entities:
Keywords: action recognition; artificial neural networks; possession-way recognition
Year: 2016 PMID: 27271623 PMCID: PMC4934238 DOI: 10.3390/s16060812
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Three proposed approaches.
Figure 2The proposed structures of (a) DNN and (b) RNN.
Figure 3The five possession-ways of a smartphone targeted in the experiments.
Statistics of the raw dataset.
| Hand | Backpack | Upper-Pocket | Lower-Pocket | Shoulder-Bag | |
|---|---|---|---|---|---|
| 147,283 | 143,511 | 162,167 | 178,689 | 167,862 | |
| 184,256 | 185,572 | 157,273 | 177,308 | 172,180 |
Statistics of the generated feature samples.
| Hand | Backpack | Upper-Pocket | Lower-Pocket | Shoulder-Bag | ||
|---|---|---|---|---|---|---|
| 14144 | 14546 | 13554 | 15070 | 14291 | ||
| 17769 | 15637 | 15880 | 15126 | 15380 | ||
| 8484 | 8728 | 8130 | 9039 | 8573 | ||
| 10661 | 9381 | 9526 | 9074 | 9227 | ||
| 4240 | 4361 | 4062 | 4518 | 4283 | ||
| 5328 | 4689 | 4761 | 4534 | 4612 | ||
| 1409 | 1451 | 1349 | 1502 | 1424 | ||
| 1774 | 1561 | 1583 | 1508 | 1534 | ||
| 845 | 868 | 808 | 900 | 852 | ||
| 1061 | 933 | 950 | 903 | 920 | ||
| 601 | 619 | 577 | 641 | 607 | ||
| 756 | 666 | 676 | 644 | 655 |
Figure 4Linear accelerometer values of a user while (a) running and (b) walking with the phone placed in the lower-pocket.
Accuracies of possession-action recognition for different lengths of windows.
| SVM | RF | NB | DNN | RNN | SVM | RF | NB | DNN | RNN | SVM | RF | NB | DNN | RNN | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 61.06 | 46.12 | 76.64 | 76.02 | 62.75 | 47.61 | 78.14 | 77.28 | 63.76 | 49.20 | 78.39 | 78.33 | ||||
| 75.70 | 73.73 | 80.24 | 80.74 | 76.70 | 77.45 | 80.20 | 80.81 | 78.43 | 79.59 | 81.65 | 80.68 | ||||
| 78.17 | 69.41 | 81.40 | 81.26 | 80.11 | 74.17 | 81.85 | 82.12 | 81.60 | 78.05 | 81.88 | 82.16 | ||||
| 77.02 | 69.53 | 78.83 | 79.58 | 78.76 | 72.88 | 80.55 | 80.10 | 79.42 | 75.77 | 80.85 | 79.78 | ||||
| 74.17 | 71.91 | 82.19 | 82.31 | 76.59 | 76.58 | 82.26 | 81.71 | 79.55 | 80.72 | 81.97 | 82.04 | ||||
| 81.28 | 77.12 | 81.90 | 82.29 | 81.74 | 79.71 | 81.88 | 82.29 | 82.38 | 81.16 | 82.66 | 82.54 | ||||
| 47.18 | 33.36 | 62.01 | 61.76 | 49.43 | 36.26 | 63.57 | 62.91 | 51.18 | 38.90 | 61.12 | 63.79 | ||||
| 64.51 | 51.40 | 79.24 | 77.93 | 64.98 | 50.57 | 78.50 | 76.79 | 65.68 | 51.19 | 77.65 | 76.49 | ||||
| 80.76 | 80.26 | 81.48 | 81.06 | 81.16 | 80.56 | 81.02 | 81.30 | 81.19 | 80.80 | 81.58 | 81.45 | ||||
| 82.59 | 80.60 | 82.42 | 82.34 | 82.57 | 80.39 | 82.39 | 82.43 | 82.40 | 80.47 | 82.07 | 82.60 | ||||
| 80.90 | 77.63 | 81.17 | 80.51 | 81.21 | 78.39 | 81.62 | 81.90 | 81.06 | 78.92 | 81.91 | 81.91 | ||||
| 82.04 | 82.01 | 81.87 | 81.33 | 82.25 | 81.88 | 82.02 | 81.55 | 82.34 | 81.95 | 82.21 | 82.08 | ||||
| 82.78 | 81.45 | 82.73 | 82.23 | 82.72 | 82.85 | 81.32 | 82.78 | 82.90 | 81.43 | 82.90 | |||||
| 52.78 | 41.32 | 64.92 | 63.51 | 53.27 | 40.71 | 64.37 | 62.99 | 53.83 | 41.32 | 63.78 | 62.89 | ||||
Accuracies of possession-action recognition for different number of windows.
| n = 1 | n = 3 | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SVM | RF | NB | DNN | RNN | SVM | RF | NB | DNN | RNN | |
| 63.76 | 49.20 | 78.39 | 78.33 | 59.44 | 50.57 | 79.38 | 78.32 | |||
| 78.43 | 79.59 | 81.65 | 80.68 | 80.35 | 79.96 | 81.26 | 80.71 | |||
| 81.60 | 78.05 | 81.88 | 82.16 | 82.06 | 80.18 | 82.37 | 82.25 | |||
| 79.42 | 75.77 | 80.85 | 79.78 | 80.46 | 77.03 | 81.45 | 81.65 | |||
| 79.55 | 80.72 | 81.97 | 82.04 | 79.50 | 81.78 | 82.15 | 81.42 | |||
| 82.38 | 81.16 | 82.66 | 82.54 | 82.68 | 81.55 | 82.78 | 82.71 | |||
| 51.18 | 38.90 | 64.12 | 63.79 | 48.15 | 40.51 | 65.09 | 64.03 | |||
| 57.53 | 51.22 | 75.78 | 76.72 | 56.46 | 51.58 | 74.65 | 75.70 | |||
| 80.28 | 79.87 | 78.71 | 81.02 | 80.35 | 79.77 | 79.44 | 81.13 | |||
| 82.53 | 80.21 | 82.67 | 82.40 | 80.74 | 82.80 | 82.60 | ||||
| 80.47 | 81.81 | 77.60 | 79.78 | 80.48 | 78.02 | 81.65 | 81.65 | |||
| 79.29 | 81.78 | 79.62 | 81.31 | 78.46 | 81.62 | 78.19 | 81.42 | |||
| 82.72 | 81.32 | 82.77 | 82.59 | 82.78 | 81.13 | 82.66 | 83.03 | |||
| 46.64 | 41.06 | 61.51 | 62.51 | 45.69 | 41.40 | 60.43 | 62.05 | |||
Accuracies of possession-action recognition under one-user-out cross validation.
| SVM | RF | NB | DNN | RNN | |
|---|---|---|---|---|---|
| 49.58 | 51.48 | 51.00 | 48.66 | ||
| 73.37 | 77.47 | 82.68 | 78.27 | ||
| 87.50 | 81.43 | 75.33 | 89.55 | ||
| 84.41 | 80.81 | 85.22 | 83.70 | ||
| 91.83 | 81.43 | 73.22 | 75.12 | ||
| 94.75 | 92.49 | 94.14 | 92.38 | ||
| 42.54 | 44.07 | 43.82 | 40.78 |
Accuracies of action-possession recognition for different lengths of windows.
| SVM | RF | NB | DNN | RNN | SVM | RF | NB | DNN | RNN | SVM | RF | NB | DNN | RNN | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 72.42 | 66.72 | 79.19 | 79.12 | 74.70 | 70.54 | 80.10 | 79.48 | 77.46 | 73.71 | 80.16 | 80.02 | ||||
| 67.46 | 53.91 | 77.55 | 77.46 | 69.04 | 58.74 | 78.77 | 78.15 | 72.33 | 63.98 | 79.45 | 79.16 | ||||
| 62.87 | 49.45 | 78.91 | 77.69 | 62.59 | 50.90 | 79.64 | 78.09 | 63.19 | 53.28 | 79.85 | 79.09 | ||||
| 47.19 | 34.48 | 61.95 | 61.38 | 49.16 | 38.67 | 63.44 | 62.09 | 52.49 | 43.22 | 63.85 | 63.32 | ||||
| 80.71 | 76.51 | 80.65 | 80.94 | 81.42 | 76.68 | 80.20 | 81.35 | 81.77 | 76.90 | 81.47 | 81.28 | ||||
| 75.71 | 67.90 | 79.99 | 79.58 | 76.37 | 69.02 | 79.82 | 79.08 | 76.82 | 68.58 | 79.45 | 79.15 | ||||
| 63.09 | 54.88 | 79.71 | 78.00 | 61.90 | 55.26 | 78.14 | 77.07 | 61.18 | 55.52 | 77.27 | 77.57 | ||||
| 56.01 | 46.97 | 64.40 | 63.77 | 56.29 | 47.65 | 63.34 | 63.51 | 56.42 | 47.72 | 63.84 | 63.69 | ||||
Accuracies of action-possession recognition for different number of windows.
| n = 1 | n = 3 | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SVM | RF | NB | DNN | RNN | SVM | RF | NB | DNN | RNN | |
| 77.46 | 73.71 | 80.16 | 80.02 | 78.94 | 76.04 | 79.91 | 79.92 | |||
| 72.33 | 63.98 | 79.45 | 79.16 | 70.62 | 66.90 | 79.32 | 78.50 | |||
| 63.19 | 53.28 | 79.85 | 79.09 | 56.19 | 56.43 | 79.27 | 77.79 | |||
| 52.49 | 43.22 | 63.85 | 63.32 | 50.05 | 46.89 | 63.36 | 62.45 | |||
| 79.09 | 76.30 | 80.43 | 80.74 | 79.10 | 76.68 | 79.42 | 80.80 | |||
| 67.77 | 65.61 | 79.22 | 79.46 | 65.85 | 64.52 | 77.68 | 79.92 | |||
| 51.88 | 57.45 | 77.58 | 76.13 | 49.26 | 57.70 | 75.91 | 75.10 | |||
| 47.32 | 46.95 | 63.06 | 62.81 | 45.53 | 46.86 | 60.99 | 62.63 | |||
Accuracies of action-possession recognition under one-user-out cross validation.
| SVM | RF | NB | DNN | RNN | |
|---|---|---|---|---|---|
| 88.05 | 89.67 | 86.31 | 88.60 | ||
| 50.81 | 51.49 | 56.11 | 56.02 | ||
| 43.41 | 44.76 | 40.80 | 45.46 | ||
| 42.62 | 44.41 | 41.82 | 44.96 |
Accuracies of concurrent, possession-action, and action-possession recognition for different lengths of windows.
| SVM | RF | NB | DNN | RNN | SVM | RF | NB | DNN | RNN | SVM | RF | NB | DNN | RNN | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 60.07 | 43.32 | 75.38 | 74.46 | 61.65 | 48.37 | 76.55 | 76.59 | 64.64 | 54.14 | 78.00 | 78.08 | ||||
| 47.18 | 33.36 | 62.01 | 61.76 | 49.43 | 36.26 | 63.57 | 62.91 | 51.18 | 38.90 | 64.12 | 63.79 | ||||
| 47.19 | 34.48 | 61.95 | 61.38 | 49.16 | 38.67 | 63.44 | 62.09 | 52.49 | 43.22 | 63.85 | 63.32 | ||||
| 67.85 | 58.52 | 78.25 | 77.79 | 67.92 | 59.37 | 78.16 | 77.42 | 67.90 | 58.55 | 77.84 | 76.53 | ||||
| 52.78 | 41.32 | 64.92 | 63.51 | 53.27 | 40.71 | 64.37 | 62.99 | 53.83 | 41.32 | 63.78 | 62.89 | ||||
| 56.01 | 46.97 | 64.40 | 63.77 | 56.29 | 47.65 | 63.34 | 63.51 | 56.42 | 47.72 | 63.84 | 63.69 | ||||
Accuracies of concurrent, possession-action, and action-possession recognition for different number of windows.
| n = 1 | n = 3 | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SVM | RF | NB | DNN | RNN | SVM | RF | NB | DNN | RNN | |
| 64.64 | 54.14 | 78.00 | 78.08 | 61.70 | 58.58 | 78.80 | 78.04 | |||
| 51.18 | 38.90 | 64.12 | 63.79 | 48.15 | 40.51 | 65.09 | 64.03 | |||
| 52.49 | 43.22 | 63.85 | 63.32 | 50.05 | 46.89 | 63.36 | 62.45 | |||
| 58.59 | 59.31 | 77.88 | 76.80 | 55.55 | 58.94 | 76.90 | 76.05 | |||
| 46.64 | 41.06 | 61.51 | 62.51 | 45.69 | 41.40 | 60.43 | 62.05 | |||
| 47.32 | 46.95 | 63.06 | 62.81 | 45.53 | 46.86 | 60.99 | 62.63 | |||
Accuracies of concurrent, possession-action, and action-possession recognition under one-user-out cross validation.
| SVM | RF | NB | DNN | RNN | |
|---|---|---|---|---|---|
| 44.73 | 44.82 | 44.88 | 47.48 | ||
| 42.54 | 44.07 | 43.82 | 40.78 | ||
| 42.62 | 44.41 | 41.82 | 44.96 |