| Literature DB >> 25436224 |
Abstract
This paper presents a novel method of human action recognition, which is based on the reconstructed phase space. Firstly, the human body is divided into 15 key points, whose trajectory represents the human body behavior, and the modified particle filter is used to track these key points for self-occlusion. Secondly, we reconstruct the phase spaces for extracting more useful information from human action trajectories. Finally, we apply the semisupervised probability model and Bayes classified method for classification. Experiments are performed on the Weizmann, KTH, UCF sports, and our action dataset to test and evaluate the proposed method. The compare experiment results showed that the proposed method can achieve was more effective than compare methods.Entities:
Mesh:
Year: 2014 PMID: 25436224 PMCID: PMC4241687 DOI: 10.1155/2014/495071
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1The human joints model. The original photo stems from Weizmann dataset [21].
Figure 2The target tracking results. The original photo stems from Weizmann datasets [21].
Figure 3Examples of the reconstructed phase space of the missing data. Hip rotations for walking, jogging, and running actions in the KTH dataset [33]. (a) shows original images. (b) shows the result of reconstructing the reconstructed phase space of the missing data. (b1) shows the phase space reconstruction of right foot motion. (b2) shows the phase space reconstruction of right elbow motion. (b3) shows the phase space reconstruction of right elbow motion. (c) shows the reconstructed phase space of the total occlude joint point. (c1) shows the phase space reconstruction of walking. (c2) shows the phase space reconstruction of jogging. (c3) shows the phase space reconstruction of running.
Algorithm 1Comparison with other approaches on our action dataset.
| Method | Average recognition rate (%) |
|---|---|
| The proposed method | 90.30 |
| Martínez-Contreras et al. [ | 88.80 |
| Chaaraoui et al. [ | 89.60 |
| Zhang and Gong [ | 87.10 |
Figure 4Sample frames from our datasets. The action labels in each dataset are as follows: (a) KTH dataset [33]: walking (a1), jogging (a2), running (a3), boxing (a4), and hand clapping (a5); (b) Weizmann dataset [21]: bending (a1), jumping jack (a2), jumping forward on two legs (a3), jumping in place on two legs (a4), running (a5), galloping sideways (a6), walking (a7), waving one hand (a8), and waving two hands (a9); (c)UCF sports action dataset [37]: diving(a1), golf swinging (a2), kicking (a3), lifting (a4), horseback riding (a5), running (a6), skating (a7), swinging (a8), and walking (a9); (d) our action dataset: walking (a1), jogging (a2), running (a3), boxing (a4), and handclapping (a5).
Confusion matrix for KTH dataset.
| a1 | a2 | a3 | a4 | a5 | |
|---|---|---|---|---|---|
| a1 |
| 0.01 | 0.02 | 0.00 | 0.01 |
| a2 | 0.01 |
| 0.02 | 0.10 | 0.00 |
| a3 | 0.00 | 0.02 |
| 0.00 | 0.01 |
| a4 | 0.01 | 0.00 | 0.00 |
| 0.30 |
| a5 | 0.03 | 0.00 | 0.02 | 0.00 |
|
Confusion matrix for the Weizmann dataset.
| a1 | a2 | a3 | a4 | a5 | a6 | a7 | a8 | a9 | |
|---|---|---|---|---|---|---|---|---|---|
| a1 |
| 0.01 | 0.02 | 0.00 | 0.20 | 0.00 | 0.10 | 0.05 | 0.02 |
| a2 | 0.01 |
| 0.02 | 0.03 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 |
| a3 | 0.00 | 0.00 |
| 0.10 | 0.13 | 0.00 | 0.02 | 0.01 | 0.00 |
| a4 | 0.00 | 0.01 | 0.00 |
| 0.00 | 0.20 | 0.04 | 0.00 | 0.00 |
| a5 | 0.00 | 0.01 | 0.00 | 0.00 |
| 0.00 | 0.00 | 0.30 | 0.02 |
| a6 | 0.01 | 0.00 | 0.03 | 0.00 | 0.05 |
| 0.02 | 0.00 | 0.01 |
| a7 | 0.00 | 0.03 | 0.00 | 0.00 | 0.01 | 0.00 |
| 0.00 | 0.02 |
| a8 | 0.00 | 0.03 | 0.04 | 0.10 | 0.00 | 0.00 | 0.00 |
| 0.00 |
| a9 | 0.00 | 0.00 | 0.20 | 0.00 | 0.10 | 0.00 | 0.00 | 0.03 |
|
Confusion matrix for the UCF sports dataset.
| a1 | a2 | a3 | a4 | a5 | a6 | a7 | a8 | a9 | |
|---|---|---|---|---|---|---|---|---|---|
| a1 |
| 0.02 | 0.01 | 0.00 | 0.15 | 0.00 | 0.10 | 0.05 | 0.02 |
| a2 | 0.00 |
| 0.01 | 0.00 | 0.00 | 0.02 | 0.00 | 0.03 | 0.00 |
| a3 | 0.01 | 0.00 |
| 0.15 | 0.10 | 0.00 | 0.02 | 0.02 | 0.00 |
| a4 | 0.00 | 0.00 | 0.00 |
| 0.10 | 0.10 | 0.00 | 0.00 | 0.00 |
| a5 | 0.00 | 0.01 | 0.20 | 0.00 |
| 0.00 | 0.00 | 0.10 | 0.02 |
| a6 | 0.01 | 0.00 | 0.02 | 0.00 | 0.05 |
| 0.05 | 0.01 | 0.02 |
| a7 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
| 0.00 | 0.02 |
| a8 | 0.00 | 0.02 | 0.03 | 0.10 | 0.00 | 0.00 | 0.00 |
| 0.00 |
| a9 | 0.00 | 0.10 | 0.30 | 0.04 | 0.10 | 0.00 | 0.00 | 0.00 |
|
Confusion matrix for our dataset.
| a1 | a2 | a3 | a4 | a5 | |
|---|---|---|---|---|---|
| a1 |
| 0.00 | 0.00 | 0.01 | 0.02 |
| a2 | 0.00 |
| 0.01 | 0.00 | 0.00 |
| a3 | 0.00 | 0.02 |
| 0.01 | 0.00 |
| a4 | 0.00 | 0.20 | 0.00 |
| 0.02 |
| a5 | 0.02 | 0.10 | 0.00 | 0.00 |
|
Comparison with other approaches on KTH action dataset.
| Method | Average recognition rate (%) |
|---|---|
| The proposed method | 92.30 |
|
Martínez-Contreras et al. [ | 89.20 |
|
Chaaraoui et al. [ | 91.20 |
|
Zhang and Gong [ | 90.60 |
Comparison with other approaches on the Weizmann action dataset.
| Method | Average recognition rate (%) |
|---|---|
| The proposed method | 89.10 |
| Martínez-Contreras et al. [ | 85.10 |
| Chaaraoui et al. [ | 87.20 |
| Zhang and Gong [ | 85.40 |
Comparison with other approaches on UCF sportsaction dataset.
| Method | Average recognition rate (%) |
|---|---|
| The proposed method | 91.10 |
| Martínez-Contreras et al. [ | 85.20 |
| Chaaraoui et al. [ | 87.30 |
| Zhang and Gong [ | 88.60 |