| Literature DB >> 35957257 |
Kuan-Yu Chen1,2, Jungpil Shin1, Md Al Mehedi Hasan1, Jiun-Jian Liaw2, Okuyama Yuichi1, Yoichi Tomioka1.
Abstract
Fitness is important in people's lives. Good fitness habits can improve cardiopulmonary capacity, increase concentration, prevent obesity, and effectively reduce the risk of death. Home fitness does not require large equipment but uses dumbbells, yoga mats, and horizontal bars to complete fitness exercises and can effectively avoid contact with people, so it is deeply loved by people. People who work out at home use social media to obtain fitness knowledge, but learning ability is limited. Incomplete fitness is likely to lead to injury, and a cheap, timely, and accurate fitness detection system can reduce the risk of fitness injuries and can effectively improve people's fitness awareness. In the past, many studies have engaged in the detection of fitness movements, among which the detection of fitness movements based on wearable devices, body nodes, and image deep learning has achieved better performance. However, a wearable device cannot detect a variety of fitness movements, may hinder the exercise of the fitness user, and has a high cost. Both body-node-based and image-deep-learning-based methods have lower costs, but each has some drawbacks. Therefore, this paper used a method based on deep transfer learning to establish a fitness database. After that, a deep neural network was trained to detect the type and completeness of fitness movements. We used Yolov4 and Mediapipe to instantly detect fitness movements and stored the 1D fitness signal of movement to build a database. Finally, MLP was used to classify the 1D signal waveform of fitness. In the performance of the classification of fitness movement types, the mAP was 99.71%, accuracy was 98.56%, precision was 97.9%, recall was 98.56%, and the F1-score was 98.23%, which is quite a high performance. In the performance of fitness movement completeness classification, accuracy was 92.84%, precision was 92.85, recall was 92.84%, and the F1-score was 92.83%. The average FPS in detection was 17.5. Experimental results show that our method achieves higher accuracy compared to other methods.Entities:
Keywords: Mediapipe; Yolov4; deep neural network; deep transfer learning; fitness detection; image processing; machine learning; pose detection
Mesh:
Year: 2022 PMID: 35957257 PMCID: PMC9371130 DOI: 10.3390/s22155700
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Method flowchart.
Twelve types of fitness movements and names.
| Squat | Push-Up | Pull-Up | Sit-Up |
|---|---|---|---|
|
|
|
|
|
| Standing | Biceps-curl | Bulgarian-squat | Bench-press |
|
|
|
|
|
| Lateral-raise | Overhead-press | Dumbbell-rowing | Triceps-extension |
|
|
|
|
|
The fitness time photographed by each user.
| User (No.) | Time (s) | User (No.) | Time (s) |
|---|---|---|---|
| 1 | 228 | 11 | 169 |
| 2 | 246 | 12 | 136 |
| 3 | 246 | 13 | 168 |
| 4 | 172 | 14 | 193 |
| 5 | 190 | 15 | 157 |
| 6 | 177 | 16 | 146 |
| 7 | 191 | 17 | 170 |
| 8 | 300 | 18 | 191 |
| 9 | 170 | 19 | 192 |
| 10 | 168 | 20 | 157 |
The motion track recorded after converting the video database to images.
| 0% | 25% | 50% | 75% | 100% |
|---|---|---|---|---|
|
|
|
|
|
|
In the image database, the shooting angle included in each fitness exercise.
| 0° | 45° | 90° | 135° | 180° |
|---|---|---|---|---|
|
|
|
|
|
|
Figure 2Labeling process using LabelImg.
Category and number of images.
| Fitness |
|
|
|
|---|---|---|---|
| Squat | 198 | 644 | 121 |
| Pull-up | 239 | 1442 | 427 |
| Push-up | 317 | 913 | 264 |
| Sit-up | 373 | 977 | 328 |
| Standing | 132 | 1065 | 454 |
| Biceps-curl | 273 | 405 | 154 |
| Bulgarian-split-squat | 311 | 577 | 365 |
| Bench-press | 304 | 924 | 471 |
| Lateral-raise | 162 | 299 | 152 |
| Overhead-press | 202 | 724 | 365 |
| Dumbbell-rowing | 305 | 598 | 347 |
| Triceps-extension | 148 | 769 | 384 |
| Total | 2964 | 9337 | 3823 |
Figure 3Mediapipe detects 33 nodes of the human pose.
Figure 4Node missing on Mediapipe detection, (a) squat missing node, (b) push-up missing node, and (c) pull-up missing node.
Result of Mediapipe and Yolov4 detecting fitness.
| Squat | Push-Up | Pull-Up | Sit-Up |
|---|---|---|---|
|
|
|
|
|
| Standing | Biceps-curl | Bulgarian-squat | Bench-press |
|
|
|
|
|
| Lateral-raise | Overhead-press | Dumbbell-rowing | Triceps-extension |
|
|
|
|
|
The position of , , and corresponding to the body node in Figure 3.
| Fitness |
|
|
|
|
|
|---|---|---|---|---|---|
| Squat | 24 | 26 | 28 | 100 | 170 |
| Pull-up | 12 | 14 | 16 | 80 | 170 |
| Push-up | 12 | 14 | 16 | 80 | 170 |
| Sit-up | 12 | 24 | 26 | 100 | 120 |
| Standing |
|
|
|
|
|
| Biceps-curl | 12 | 14 | 16 | 80 | 160 |
| Bulgarian-split-squat | 24 | 26 | 28 | 110 | 160 |
| Bench-press | 12 | 14 | 16 | 80 | 140 |
| Lateral-raise | 14 | 12 | 24 | 20 | 80 |
| Overhead-press | 12 | 14 | 16 | 80 | 150 |
| Dumbbell-rowing | 12 | 14 | 16 | 110 | 150 |
| Triceps-extension | 12 | 14 | 16 | 80 | 140 |
Figure 5The waveforms of the fitness movements were used for classification.
The distribution of training and testing data in database .
|
|
| |
|---|---|---|
| Complete | 223 | 203 |
| No-complete | 372 | 261 |
| No-movement | 62 | 123 |
| Total | 657 | 587 |
Figure 6Three categories of waveforms: (a) complete, (b) no-complete, and (c) no-movement.
Result of Yolov4 detecting fitness type.
| Squat | Push-Up | Pull-Up | Sit-Up |
|---|---|---|---|
|
|
|
|
|
| Standing | Biceps-curl | Bulgarian-squat | Bench-press |
|
|
|
|
|
| Lateral-raise | Overhead-press | Dumbbell-rowing | Triceps-extension |
|
|
|
|
|
Experimental setup for training Yolov4.
| Parameters | Value |
|---|---|
| Class | 12 |
| Batch size | 64 |
| Subdivisions | 40 |
| Image width | 416 |
| Image height | 416 |
| Channels | 3 |
| Max batches | 100,000 |
| Filters | 51 |
| Learning rate | 0.001 |
| Decay | 0.0005 |
Experimental setup for training the MLP.
| Parameters | Value |
|---|---|
| Class | 3 |
| Batch size | 64 |
| Data size | 100 |
| Max batches | 100 |
| Learning rate | 0.001 |
| Decay | 0.0001 |
Figure 7When the IoU threshold was 0.5, the performance comparison of different iterations.
Figure 8When the IoU threshold was 0.75, the performance comparison of different iterations.
Performance of Yolov4.
| Evaluation Index | IoU Threshold = 0.5 | IoU Threshold = 0.75 |
|---|---|---|
| mAP | 99.71% | 99.08% |
| Accuracy | 98.56% | 96.97% |
| Precision | 97.90% | 98.67% |
| Recall | 98.56% | 96.97% |
| F1-score | 98.23% | 97.82% |
Performance of the MLP.
| Evaluation Index | MLP |
|---|---|
| Accuracy | 92.84% |
| Precision | 92.85% |
| Recall | 92.84% |
| F1-score | 92.83% |
Figure 9Confusion matrix for MLP classification performance.
Comparison of the mAP for fitness movement classification.
| Evaluation Index | mAP |
|---|---|
| Ours | 99.71% |
| Hobeom Jeon et al. [ | 90.5% |
Comparison of accuracy of fitness movement classification.
| Evaluation Index | Accuracy |
|---|---|
| Ours | 98.56% |
| Yongpan Zou et al. [ | 96.07% |
| Crema et al. [ | 94.36% |
| Ali Bidaran et al. [ | 92.9% |
Comparison of accuracy of fitness movement analysis.
| Evaluation Index | Accuracy |
|---|---|
| Ours | 92.84% |
| Yongpan Zou et al. [ | 90.7% |
| Jiangkun Zhou et al. [ | 59.7% |