| Literature DB >> 31554229 |
Qing Lei1,2, Ji-Xiang Du3,4, Hong-Bo Zhang5,6, Shuang Ye7,8, Duan-Sheng Chen9,10.
Abstract
The fields of human activity analysis have recently begun to diversify. Many researchers have taken much interest in developing action recognition or action prediction methods. The research on human action evaluation differs by aiming to design computation models and evaluation approaches for automatically assessing the quality of human actions. This line of study has become popular because of its explosively emerging real-world applications, such as physical rehabilitation, assistive living for elderly people, skill training on self-learning platforms, and sports activity scoring. This paper presents a comprehensive survey of approaches and techniques in action evaluation research, including motion detection and preprocessing using skeleton data, handcrafted feature representation methods, and deep learning-based feature representation methods. The benchmark datasets from this research field and some evaluation criteria employed to validate the algorithms' performance are introduced. Finally, the authors present several promising future directions for further studies.Entities:
Keywords: action evaluation dataset; action quality assessment; deep learning features; feature learning; hand-crafted features; human action evaluation
Mesh:
Year: 2019 PMID: 31554229 PMCID: PMC6806217 DOI: 10.3390/s19194129
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Several application fields of human action evaluation. (a) healthcare and rehabilitation; (b) surgical skill training; (c) sports skill training; (d) sports activity scoring.
Figure 2The framework of reviewed action evaluation research works.
Figure 3The task of human action evaluation (not only identifying action labels but also assessing the quality score and providing semantic feedback).
The reviewed research works of human action evaluation.
| Applications Methods | Physical Therapy | Sports Activity Scoring | Skill Training |
|---|---|---|---|
| Skeleton or kinematic data-based methods | [ | [ | [ |
| Handcrafted feature learning methods | [ | [ | [ |
| Deep feature learning methods | [ | [ | [ |
Figure 4Skeleton model and detection examples of OpenPose [60]. (a) Eighteen-joint skeleton model of OpenPose; (b) example of diving; (c) example of figure skating.
Physical rehabilitation dataset.
| Dataset Name | #Action Categories | #Persons | #Samples | Data Modality |
|---|---|---|---|---|
| SPHERE-Staircase2014 dataset [ | 1 | 12 | 48 | Depth sequences, skeletons |
| SPHERE-Walking2015 dataset [ | 1 | 10 | 40 | Depth sequences, skeletons |
| SPHERE-SitStand 2015 dataset [ | 2 | 10 | 109 | Depth sequences, skeletons |
| UI-PRMD dataset [ | 10 | 10 | 100 | Positions and angles of body joints in the skeletal models |
| AHA-3D dataset [ | 4 | 21 | 79 | 3D skeletal sequences, RGB images |
Sports activity scoring dataset.
| Dataset Name | #Action Categories | #Samples | View of Samples | Background of Samples |
|---|---|---|---|---|
| FINA09 diving dataset [ | 1 | 68 | Front, side | Same |
| MIT Olympic Scoring dataset [ | 2 | 309 | Variations | Same |
| UNLV AQA-7 dataset [ | 7 | 1189 | Severe changes | Different |
| MTL-AQA dataset [ | 1 | 1412 | Severe changes | Different |
| Fis-V dataset [ | 1 | 500 | Severe changes | Different |
| GolfDB dataset [ | 1 | 1400 | Multiple views | Different |
| YogaVidCollected dataset [ | 6 | 88 | Small changes | Same |
Skill training-related dataset.
| Dataset Name | #Action Categories | #Samples | Data Modality | View of Samples | Background of Samples |
|---|---|---|---|---|---|
| JIGSAWS dataset [ | 3 | 103 | Kinematics data, video data | two left and right cameras | Same |
| EPIC-Skills 2018 dataset [ | 6 | 216 | Video data | single view | Different |
| BEST 2019 dataset [ | 5 | 500 | Video data | Severe changes | Different |
| Breakfast Actions database [ | 10 | 1989 | Video data | 3–5 cameras | Different |
| ADL dataset [ | 18 | 440 | Video data | 170-degree first-person view angle | Different |
Action evaluation performance of methods on sports scoring dataset. The superscript D indicates a deep learning method.
| Methods | Year | MIT Olympic Scoring Diving | MIT Olympic Scoring Skating | UNLV AQA-7 Diving | UNLV AQA-7 Vault |
|---|---|---|---|---|---|
| [ | 2014 | 0.41 | 0.35 | ||
| [ | 2015 | 0.45 | |||
| [ | 2017 | 0.74 | 0.53 | 0.79 | 0.68 |
| [ | 2018 | 0.61 | 0.67 | ||
| [ | 2018 | 0.57 | 0.80 | 0.70 | |
| [ | 2018 | 0.78 | 0.84 | 0.70 | |
| [ | 2018 | 0.86 | |||
| [ | 2019 | 0.59 |
Action evaluation performance of methods on the JIGSAWS dataset. The superscript D indicates a deep learning method.
| Method | Year | Evaluation Criteria | Action Categories | ||
|---|---|---|---|---|---|
| Suturing | Knot Tying | Needle Passing | |||
| [ | 2012 | Classification accuracy | 97.4% | 96.2% | 94.4% |
| [ | 2017 | Classification accuracy | 89.7% | 96.3% | 61.1% |
| [ | 2018 | Classification accuracy | 100% | 99.9% | 100% |
| Score prediction (Correlation Coefficient) | 0.75 | 0.63 | 0.46 | ||
| [ | 2018 | Classification accuracy | 89.9% | 95.8% | 82.3% |
| [ | 2018 | Classification accuracy | 93.4% | 89.8% | 84.9% |
| [ | 2019 | Classification accuracy | 100% | - | 96.4% |
| [ | 2018 | Rank accuracy | 70.2% | ||
| [ | 2019 | Rank accuracy | 73.1% | ||