| Literature DB >> 35670925 |
Aritra Majumdar1, Rashid Bakirov2, Dan Hodges3,4, Suzanne Scott3, Tim Rees5.
Abstract
Attempts to better understand the relationship between training and competition load and injury in football are essential for helping to understand adaptation to training programmes, assessing fatigue and recovery, and minimising the risk of injury and illness. To this end, technological advancements have enabled the collection of multiple points of data for use in analysis and injury prediction. The full breadth of available data has, however, only recently begun to be explored using suitable statistical methods. Advances in automatic and interactive data analysis with the help of machine learning are now being used to better establish the intricacies of the player load and injury relationship. In this article, we examine this recent research, describing the analyses and algorithms used, reporting the key findings, and comparing model fit. To date, the vast array of variables used in analysis as proxy indicators of player load, alongside differences in approach to key aspects of data treatment-such as response to data imbalance, model fitting, and a lack of multi-season data-limit a systematic evaluation of findings and the drawing of a unified conclusion. If, however, the limitations of current studies can be addressed, machine learning has much to offer the field and could in future provide solutions to the training load and injury paradox through enhanced and systematic analysis of athlete data.Entities:
Keywords: Football injuries; Injury prediction; Machine learning; Training load
Year: 2022 PMID: 35670925 PMCID: PMC9174408 DOI: 10.1186/s40798-022-00465-4
Source DB: PubMed Journal: Sports Med Open ISSN: 2198-9761
Descriptive data for the highlighted papers
| No. of players | No. of injuries | Age group (years) | Injury type | Dataset time span | |
|---|---|---|---|---|---|
| Rossi et al. [ | 26 | 21 | 20–30 | Every non-contact | 23 weeks |
| Naglah et al. [ | 21 | 36 | Unreported | Every non-contact | 16 months |
| López-Valenciano et al. [ | 132 | 32 | Unreported | Lower leg muscle | Pre-season + 1 Season |
| Ayala et al. [ | 96 | 18 | Unreported | Hamstring strain | Pre-season + 1 season |
| Rommers et al. [ | 734 | 368 | 10–15 | Acute and overuse | Pre-season + 1 season |
| Oliver et al. [ | 400 | 99 | 10–18 | Non-contact lower leg | Pre-season + 1 season |
| Vallance et al. [ | 40 | 142 | 23.6–35.2 | Every non-contact | Pre-season + 1 season |
| Venturelli et al. [ | 84 | 27 | 14–18 | Thigh muscle strain | Pre-season + 1 season |
| Kampakis [ | Unreported | Unreported | Unreported | Not specified | Unreported |
Only Oliver et al. [41] and Vallance et al. [42] specifically reported using “male” players. The other papers noted the following: young football players, elite football players, youth players, and/or professional football players
Model fit for the best-fitting models from each paper
| Machine learning algorithms | Pre-processing techniques | Accuracy (%) | Precision (%) | AUC | Recall (%) | F1-score (%) | Specificity (%) | |
|---|---|---|---|---|---|---|---|---|
| [ | Decision tree | – | 50 | 0.76 | 80 | 64 | – | |
| [ | SVM | Data normalisation | 83.50 | – | – | – | – | – |
| [ | SmoteBoost | – | – | 0.75 | 65.90 | – | 79.10 | |
| [ | SmoteBoost | – | – | 0.84 | 77.80 | – | 83.80 | |
| [ | XGBoost | Unmentioned | – | 85 | – | 85 | 85 | – |
| [ | – | 78 | – | 78 | 78 | – | ||
| [ | Decision tree | Various balancing techniques | – | – | 0.66 | 55.60 | – | 74.20 |
| [ | Random forest | 95.5 | 92.2 | 0.92 | 94.5 | – | – | |
| [ | XGBoost | 97 | 97 | 0.97 | 97 | – | – | |
| [ | Cox regression | Unmentioned | – | – | – | – | – | – |
| [ | Supervised principal components analysis | Unmentioned | 88.80 | 55 | – | 33 | – | – |
| [ | 97.07 | 19 | – | 20 | – | – |
Each paper used a different overall set of model fit metrics. In papers [39, 42] and [44], two key differential approaches (denoted a and b) were used
*This article did not explicitly mention evaluation metrics—we approximated these values from the article’s presented boxplots
Training load features in the highlighted papers
| [ | [ | [ | [ | [ | [ | [ | [ | |
|---|---|---|---|---|---|---|---|---|
| Exposure | X | |||||||
| Jumps | X | X | ||||||
| Distance | X | X | X | X | ||||
| Accelerations and decelerations | X | X | X | X | ||||
| DSL (Total weighted impacts above 2 g) | X | |||||||
| Duration | X | X | ||||||
| Player load | X | X | X | |||||
| Speed and velocity | X | X | X | |||||
| Meterage per minute | X | |||||||
| Total efforts | X | |||||||
| High inertial movement analysis | X | |||||||
| Average metabolic power | X | |||||||
| Dynamic stress load | X | |||||||
| Impacts | X | |||||||
| Energy expenditure | X | |||||||
| Step Balance | X | |||||||
| Dribbling | X | |||||||
| Sprint | X | |||||||
| Jumping, moving and balancing | X | |||||||
| Body mass index | X | X | X | X | X | X | ||
| Fat percentage | X | X | ||||||
| Step yo-yo test | X | X | ||||||
| Heart rate | X | |||||||
| Ratings of perceived exertion (RPE) | X | |||||||
| Sleep quality | X | X | ||||||
| Physical exhaustion | X | |||||||
| Reduced sense of exhaustion | X | |||||||
| Sport devaluation | X | |||||||
| Fatigue, shape, pain, pleasure, worry, satisfaction | X | |||||||
| Height and weight | X | X | X | X | X | |||
| Age | X | X | X | X | X | X | X | |
| Role of the player (Position)/field position | X | X | X | X | X | |||
| Previous injury | X | X | X | X | ||||
| Minutes played in previous games | X | |||||||
| Number of games played | ||||||||
| before each training session | X | |||||||
| Dominant leg | X | X | ||||||
| Current level of play | X | |||||||
| Injury details | X | |||||||
| Season stage | X | |||||||
| Activity | X | |||||||
| Phase of play | X | |||||||
| Footwear | X | |||||||
| Surface condition | X | |||||||
| Sitting height, curl-ups, leg length | X | |||||||
| 75% Hop, SLCMJ, SLHD, Y-balance, TJ Knee | X | |||||||
| ACWR and MSWR of training loads | X | |||||||
| Neuromuscular training loads | X | X | ||||||
| Total training load features | 55 | 65 | 151, 229 | 29 | 20 | 27 | 18 | 18 |
Neuromuscular training loads is an over-arching “feature” which includes multiple variables not explicitly mentioned here
*These two papers included 151 and 229 training load variables, under eight over-arching topics (with the most important ones noted here)