| Literature DB >> 35592723 |
Abstract
In order to solve the problem that the traditional feature extraction methods rely on manual design, the research method is changed from the traditional method to the deep learning method based on convolutional neural networks. The experimental results show that the larger average DTW occurs near the 55th calculation, that is, about the 275th frame of the video. In the 55th calculation, the joint angle with the largest DTW distance is the right knee joint. A multiscene action similarity analysis algorithm based on human joint points has been realized. In the fitness scene, by analyzing the joint angle through cosine similarity, the time of fitness key posture in the action sequence can be recognized. In the sports scene, through the similarity analysis of joint angle sequences by the DTW algorithm, we can get the similarity between people's actions in the sports video and the joint positions with large differences in some time intervals, and the real validity of the experiment is verified. The accuracy of motion recognition before and after the improvement is 95.2% and 97.1%, which is 0.19% higher than that before the improvement. The methods and results are widely used in the fields of sports recognition, movement specification, sports training, health management, and so on.Entities:
Mesh:
Year: 2022 PMID: 35592723 PMCID: PMC9113890 DOI: 10.1155/2022/1826951
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Motion recognition of human joint points.
Figure 2Schematic diagram of faster-R-CNN frame.
Figure 3Schematic diagram of ROI pool layer process.
Size of improved anchor candidate box.
| Candidate box number | Candidate box size (width ∗ height unit: pixel) |
|---|---|
| Candidate box 1 | 20 × 50 |
| Candidate box 2 | 24 × 60 |
| Candidate box 3 | 32 × 80 |
| Candidate box 4 | 44 × 11 |
| Candidate box 5 | 60 × 150 |
| Candidate box 6 | 80 × 200 |
| Candidate box 7 | 120 × 300 |
| Candidate box 8 | 160 × 400 |
| Candidate box 9 | 200 × 500 |
Anchor candidate box size based on the K-means clustering algorithm.
| Candidate box number | Candidate box size (width ∗ height unit: pixel) |
|---|---|
| Candidate box 1 | 18 × 44 |
| Candidate box 2 | 22 × 55 |
| Candidate box 3 | 31 × 76 |
| Candidate box 4 | 43 × 105 |
| Candidate box 5 | 59 × 145 |
| Candidate box 6 | 82 × 200 |
| Candidate box 7 | 113 × 276 |
| Candidate box 8 | 156 × 381 |
| Candidate box 9 | 215 × 526 |
| Candidate box 10 | 45 × 20 |
| Candidate box 11 | 110 × 42 |
| Candidate box 12 | 250 × 100 |
| Candidate box 13 | 500 × 204 |
Several bit bits and their data range.
| Digit (bit) | Minimum value | Maximum |
|---|---|---|
| 8 | −120 | 120 |
| 16 | −32766 | 32766 |
| 32 | −2147483640 | 2147483640 |
Figure 4Quantitative model inference process.
Several quantization operation modes are supported by TFLite.
| Quantitative method | Advantage | Applicable hardware |
|---|---|---|
| Weight quantization | The model is reduced by 4 times and accelerated by 2-3 times, with good accuracy | CPU |
| All integer quantization | The model is reduced by 4 times and accelerated by more than 3 times | CPU, edge TPU, etc. |
| Float 16 quantization | The model is reduced by 2 times, which has the potential of GPU acceleration | CPU/GPU |
Comparison of quantitative effects of various models.
| Model | Precision: raw | Precision quantization | Delay: initial | Delay quantization (ms) | Size: initial (MB) | Size quantization (MB) |
|---|---|---|---|---|---|---|
| MobileNetV1 | 0.708 | 0.654 | 124 ms | 112 | 16.6 | 4.3 |
| MobileNetV2 | 0.709 | 0.632 | 89 ms3 | 98 | 13 | 3.6 |
| InceptionV3 | 0.77 | 0.762 | 1130 ms | 845 | 95.9 | 23.9 |
| ResnetV2 | 0.76 | 0.758 | 973 ms | 2868 | 178.5 | 44.9 |
Comparison of test accuracy of three models on coco2017 dataset.
| Model | PoseLite | Direct conversion model | Weight quantization model |
|---|---|---|---|
| Test data set | 2017 val | 2017 val | 2017 val |
| AP@0.5 : 0.95 | 36.9 | 35.9 | 27.6 |
| AP@0.5 | 64.0 | 62.8 | 53.8 |
| AP@0.75 | 35.0 | 33.4 | 26.1 |
| AP medium | 30.1 | 28.1 | 28.2 |
| AP large | 46.0 | 44.0 | 30.8 |
| AR@0.5 : 0.95 | 41.5 | 40.9 | 32.6 |
| AR@0.5 | 66.1 | 64.7 | 56.9 |
| AR@0.75 | 40.7 | 38.7 | 29.0 |
| AR medium | 31.5 | 29.9 | 28.9 |
| AR large | 55.6 | 53.4 | 35.7 |
Figure 5Inspection effect.
Figure 6Improved.
Figure 7Before improvement.
Figure 8Improved.