| Literature DB >> 32825134 |
Tim Steels1, Ben Van Herbruggen1, Jaron Fontaine1, Toon De Pessemier2, David Plets2, Eli De Poorter1.
Abstract
A thorough analysis of sports is becoming increasingly important during the training process of badminton players at both the recreational and professional level. Nowadays, game situations are usually filmed and reviewed afterwards in order to analyze the game situation, but these video set-ups tend to be difficult to analyze, expensive, and intrusive to set up. In contrast, we classified badminton movements using off-the-shelf accelerometer and gyroscope data. To this end, we organized a data capturing campaign and designed a novel neural network using different frame sizes as input. This paper shows that with only accelerometer data, our novel convolutional neural network is able to distinguish nine activities with 86% precision when using a sampling frequency of 50 Hz. Adding the gyroscope data causes an increase of up to 99% precision, as compared to, respectively, 79% and 88% when using a traditional convolutional neural network. In addition, our paper analyses the impact of different sensor placement options and discusses the impact of different sampling frequenciess of the sensors. As such, our approach provides a low cost solution that is easy to use and can collect useful information for the analysis of a badminton game.Entities:
Keywords: CNN; DNN; accelerometer; activity recognition; badminton; gyroscope; machine learning; neural network
Mesh:
Year: 2020 PMID: 32825134 PMCID: PMC7506561 DOI: 10.3390/s20174685
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Comparing recognized activities, number of distinguished classes, sensors, and methods proposed in this paper with related work. What makes our research unique is the optimization of every step within activity recognition.
| Paper | Activity | Number of Classes | Sensors | Method/Positions | Machine Learning Approach |
|---|---|---|---|---|---|
| [ | Daily activities | 8 | Accelerometer + gyroscope | Wrist, ankle | SVC, DNN |
| [ | Coarse-grained sport categories | 9 | Accelerometer + gyroscope | Wrist, ankle | SVC, DNN |
| [ | Tennis | 5 | Accelerometer + gyroscope | Wrist, waist | CNN |
| [ | Badminton | 5 | Accelerometer + gyroscope | On the strings of the racket | KNN, SVM |
| [ | Badminton | 2 | Cameras | Computer vision | SOFW |
| [ | Badminton | 2 | Cameras | Computer vision | CNN |
|
|
|
|
|
|
|
Figure 1Classified strokes.
Figure 2Positions sensors.
Layers CNN (frame size = 80).
| Layer | Output Dimension |
|---|---|
| Input | 80 × 6 |
| Conv2D (16, 3 × 3), Relu | 78 × 4 × 16 |
| Batch normalization | 78 × 4 × 16 |
| Dropout 10% | 78 × 4 × 16 |
| Conv2D (32, 3 × 3), Relu | 76 × 2 × 32 |
| Dropout 20% | 76 × 2 × 32 |
| Flatten | 4864 |
| Dense (64), Relu, kernel_regularizer, bias_regularizer | 64 |
| Dense (9), Softmax | 9 |
Figure 3Overview preprocessing, feature extraction and classification.
Number of strokes in training and test data.
| Stroke | # Training | # Test |
|---|---|---|
| Overhead Defensive Clear | 150 | 31 |
| Dab/Block | 280 | 30 |
| Drive | 120 | 30 |
| Short serve | 65 | 31 |
| Lob | 168 | 30 |
| Net drop | 300 | 34 |
| Smash | 170 | 30 |
Figure 4Confusion matrix: only accelerometer data and a sampling frequency of 12.5 Hz.
Best frame size by stroke type.
| Stroke | Frame Size (s) |
|---|---|
| Overhead Defensive Clear | 1.60 |
| Dab/Block | 1.08 |
| Drive | 1.08 |
| Short serve | 0.68 |
| Lob | 1.28 |
| Net drop | 1.20 |
| Smash | 0.72 |
Best accuracy by frame size (100 Hz, accelerometer + gyroscope).
| Frame Size (s) | Accuracy (%) |
|---|---|
| 0.40 | 80 |
| 0.60 | 81 |
| 0.80 | 88 |
| 1.00 | 88 |
| 1.20 | 92 |
| 1.40 | 88 |
| 1.60 | 93 |
|
|
|
Figure 5Precision, recall and f1 score measured with (a) accelerometer data only or in (b) combination with gyroscope data at sample frequencies of 100, 50, 25, and 12.5 Hz.
Most optimal f1 score after varying score weights in ensemble learning.
| Position | Dab-F1 Score | Netdrop-F1 Score |
|---|---|---|
| Racket | 0.99 | 0.98 |
| Wrist | 0.50 | 0.73 |
Figure 6Impact shuttle when placing the sensor on the racket’s grip. The blue, green and orange line represent the respectively X, Y, and Z axis of the accelerometer. Left: stroke when not hitting the shuttle. Right: same stroke when the shuttle is hit. This shows a peak in the Y axis of the accelerometer when hitting the shuttle.
Figure 7Confusion matrix for distinguishing overhand and underhand strokes. We notice that the overhand and underhand strokes are correctly classified with an average accuracy of 99%.
Figure 8Novel weights-based neural network architecture. Confusion matrix: accelerometer and gyroscope data. A sampling frequency of 100 Hz and ensemble learning with extra weights are used. The average accuracy is 98%.
Layers DNN (frame size = 80).
| Layer | Output Dimension |
|---|---|
| Input | 80 × 6 |
| Dense (1024), Relu | 80 × 1024 |
| Dropout 20% | 80 × 1024 |
| Dense (512), Relu | 80 × 512 |
| Dropout 20% | 80 × 512 |
| Dense (256), Relu | 80 × 256 |
| Dropout 20% | 80 × 256 |
| Dense (128), Relu | 80 × 128 |
| Dropout 20% | 80 × 128 |
| Dense (32), Relu | 80 × 32 |
| Dropout 20% | 80 × 32 |
| Flatten | 2560 |
| Dense (9), Softmax | 9 |