| Literature DB >> 35115647 |
Tevin Moodley1, Dustin van der Haar2, Habib Noorbhai3.
Abstract
There have been limited studies demonstrating the validation of batting techniques in cricket using machine learning. This study demonstrates how the batting backlift technique in cricket can be automatically recognised in video footage and compares the performance of popular deep learning architectures, namely, AlexNet, Inception V3, Inception Resnet V2, and Xception. A dataset is created containing the lateral and straight backlift classes and assessed according to standard machine learning metrics. The architectures had similar performance with one false positive in the lateral class and a precision score of 100%, along with a recall score of 95%, and an f1-score of 98% for each architecture, respectively. The AlexNet architecture performed the worst out of the four architectures as it incorrectly classified four images that were supposed to be in the straight class. The architecture that is best suited for the problem domain is the Xception architecture with a loss of 0.03 and 98.2.5% accuracy, thus demonstrating its capability in differentiating between lateral and straight backlifts. This study provides a way forward in the automatic recognition of player patterns and motion capture, making it less challenging for sports scientists, biomechanists and video analysts working in the field.Entities:
Year: 2022 PMID: 35115647 PMCID: PMC8814181 DOI: 10.1038/s41598-022-05966-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The AlexNet architecture depicting the different layers used for classification[22].
Figure 2The left side represents the overall schema for the pure Inception Resnet V2, where the right side illustrates the detailed composition of the stem[30].
Figure 3A figure representing the Xception architecture[26].
The Inception V3, Inception Resnet V2, and Xception architectures pre-trained and bench-marked on the ImageNet dataset to illustrate each architectures performance on a generalised dataset and to justify the selection of architectures in the study.
| Network | Top-1 Accuracy | Top-5 Accuracy |
|---|---|---|
| Alexnet[ | 0.6330 | 0.8460 |
| Inception V3[ | 0.790 | 0.945 |
| Inception Resnet V2[ | 0.779 | 0.937 |
| Xception[ | 0.803 | 0.953 |
The confusion matrix representing each of the architectures across ten runs, where the false and true positive predictions for each class are represented.
| Architecture | Class | Lateral | Straight | Accuracy (%) | Loss (%) |
|---|---|---|---|---|---|
| AlexNet | Lateral | 19 | 1 | 82.45 | 34.17 |
| Straight | 4 | 16 | |||
| Inception V3 | Lateral | 19 | 1 | 93.75 | 0.13 |
| Straight | 0 | 20 | |||
| Inception Resnet V2 | Lateral | 19 | 1 | 96.1 | 0.12 |
| Straight | 0 | 20 | |||
| Xception | Lateral | 19 | 1 | 98.2 | 0.03 |
| Straight | 0 | 20 |
The average precision, recall, and f1-scores across ten runs for the lateral and straight backlifts for each architecture.
| Architecture | Class | Precision (%) | Recall (%) | F1-Score (%) |
|---|---|---|---|---|
| AlexNet | Lateral | 83 | 95 | 88 |
| Straight | 94 | 80 | 86 | |
| Inception V3 | Lateral | 100 | 95 | 97 |
| Straight | 95 | 100 | 98 | |
| Inception Resnet V2 | Lateral | 100 | 95 | 97 |
| Straight | 95 | 100 | 98 | |
| Xception | Lateral | 100 | 95 | 97 |
| Straight | 95 | 100 | 98 |
Figure 4The image that is mispredicted by the Inception V3, Inception Resnet V2, and Xception architectures as a straight backlift, where the image represents a lateral backlift[35].