| Literature DB >> 35784892 |
Alessio Rossi1, Enrico Perri2, Luca Pappalardo3, Paolo Cintia1, Giampietro Alberti2, Darcy Norman4,5, F Marcello Iaia2.
Abstract
Training for success has increasingly become a balance between maintaining high performance standards and avoiding the negative consequences of accumulated fatigue. The aim of this study is to develop a big data analytics framework to predict players' wellness according to the external and internal workloads performed in previous days. Such a framework is useful for coaches and staff to simulate the players' response to scheduled training in order to adapt the training stimulus to the players' fatigue response. 17 players competing in the Italian championship (Serie A) were recruited for this study. Players' Global Position System (GPS) data was recorded during each training and match. Moreover, every morning each player has filled in a questionnaire about their perceived wellness (WI) that consists of a 7-point Likert scale for 4 items (fatigue, sleep, stress, and muscle soreness). Finally, the rate of perceived exertion (RPE) was used to assess the effort performed by the players after each training or match. The main findings of this study are that it is possible to accurately estimate players' WI considering their workload history as input. The machine learning framework proposed in this study is useful for sports scientists, athletic trainers, and coaches to maximise the periodization of the training based on the physiological requests of a specific period of the season.Entities:
Keywords: external workload; prediction; recovery; training load; wellness
Year: 2022 PMID: 35784892 PMCID: PMC9240643 DOI: 10.3389/fphys.2022.896928
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.755
Features group. Summary of all the features extracted from the GPS devices.
| Feature group | List of features |
|---|---|
| Cinematic | Session time and distance; Total loading; Sprint; High speed running distance; Explosive distance; Max speed; High metabolic load; Distance covered at different velocity; Bursts duration and number, Time and distance covered above 20W; Average Estimated Metabolic Power; Equivalent Estimated Metabolic Distance |
| Metabolic | Time in heart zone (from 1 to 6); Max heart rate; Average Heart rate; Energy Expenditure expressed in KCal; Heart Rate Exertion |
| Mechanical | Impacts, Deceleration and acceleration at different intensity |
Data pre-processing approaches description.
| Data pre-processing | Description |
|---|---|
| Daily workload | Raw data of the current training/match session (67 GPS features + RPE) |
| Acute workload | EWMA of the previous 7 days (67 GPS features + RPE) |
| Chronic workload | EWMA of the previous 28 days (67 GPS features + RPE) |
| ACWR workload | Ration between (67 GPS features + RPE) |
FIGURE 1Model validation approaches. f and w refer to fold and week, respectively. (A) Cross-Validation approach. (B) Real scenario approach.
WI group descriptive statistics.
| WI Group | Count | Mean | SD | min | 25% | 50% | 75% | max |
|---|---|---|---|---|---|---|---|---|
| High | 933 | 7.10 | 1.28 | 4 | 7 | 8 | 8 | 8 |
| Moderate | 1245 | 11.21 | 1.16 | 9 | 10 | 12 | 12 | 12 |
| Low | 552 | 15.07 | 1.18 | 13 | 14 | 16 | 16 | 17 |
SD, min and max refer to standard deviations, minimum values and maximal values, respectively.
FIGURE 2(A) Distribution of WI day by day as the season went by. The black dots refer to the Game Day (GD). Moreover, W refers to winter stop. (B) Distribution of WI in accordance with the Match Day (MD). The values refer to the day when the WI are recorded. For example, the WIs reported in Game Day (GD) refer to the WI recorded before the start of the match. (C) Distribution of WI in accordance with the periods of the season.
Models goodness of cross-validation.
| Model | WI classes | Precision | Recall | F1-score | Accuracy |
|---|---|---|---|---|---|
| DTC | High | 0.66 ± 0.01 | 0.67 ± 0.01 | 0.67 ± 0.01 | 0.67 ± 0.01 |
| Moderate | 0.67 ± 0.02 | 0.68 ± 0.02 | 0.68 ± 0.02 | ||
| Low | 0.67 ± 0.02 | 0.68 ± 0.03 | 0.67 ± 0.02 | ||
| XGB | High | 0.75 ± 0.01 | 0.75 ± 0.02 | 0.75 ± 0.02 | 0.74 ± 0.01 |
| Moderate | 0.74 ± 0.02 | 0.75 ± 0.02 | 0.75 ± 0.01 | ||
| Low | 0.75 ± 0.03 | 0.73 ± 0.04 | 0.74 ± 0.02 | ||
| Bs | High | 0.35 ± 0.02 | 0.35 ± 0.03 | 0.35 ± 0.02 | 0.37 ± 0.01 |
| Moderate | 0.38 ± 0.05 | 0.38 ± 0.05 | 0.38 ± 0.05 | ||
| Low | 0.34 ± 0.08 | 0.34 ± 0.09 | 0.34 ± 0.08 |
Feature importance of cross-validation.
| Features | Folds (n) | Mean (%) | SD (%) |
|---|---|---|---|
| HML Distance Per Minute (Chronic) | 10 | 3.73 | 2.15 |
| Time In Heart Rate Zone6 (Daily) | 5 | 3.68 | 3.93 |
| Distance Total (Chronic) | 4 | 2.66 | 2.86 |
| Accelerations Zone5 (Chronic) | 10 | 2.64 | 1.71 |
| Decelerations Zone6 (Chronic) | 10 | 2.42 | 1.87 |
| Accelerations Z5 to Z6 (Chronic) | 9 | 2.18 | 1.70 |
| Impacts Zone2 (Acute) | 9 | 2.18 | 0.63 |
| Distance 16-21 (Chronic) | 10 | 2.10 | 1.58 |
| Impacts Z5 to Z6 (Chronic) | 10 | 1.92 | 1.35 |
| Energy Expenditure (KCal) (Chronic) | 9 | 1.54 | 0.84 |
| High Speed Running >21 km/h (Chronic) | 10 | 1.53 | 1.33 |
| Time In Heart Rate Zone5 (Acute) | 10 | 1.52 | 1.59 |
| Impacts Zone3 (Acute) | 8 | 1.49 | 1.27 |
| Time In Heart Rate Zone6 (Acute) | 10 | 1.46 | 1.82 |
| Distance 0-10 (Chronic) | 9 | 1.42 | 1.74 |
This table reports only the 15 most important features. The values for mean and standard deviation (SD) are expressed in percentage. The folds number refers to how many folds a feature is used to WI prediction.
FIGURE 3Influence of the 15 most important features of each WI class on defining classes’ membership. This plot shows the correlation coefficient between SHAP values and features’ values. Coloured bars refer to a positive correlation, while the grey ones show a negative relationship.
FIGURE 4Cumulative goodness accuracy. This plot is split into four different soccer season periods: Pre-Season, 1st part of the competition season, Winter stop (W-stop) and 2nd part of the competition season.
Model performance goodness of the last week.
| Model | WI classes | Precision | Recall | F1-score | Accuracy |
|---|---|---|---|---|---|
| DTC | High | 0.72 | 0.72 | 0.72 | 0.78 |
| Moderate | 0.83 | 0.75 | 0.79 | ||
| Low | 0.78 | 1.00 | 0.88 | ||
| XGB | High | 0.75 | 1.00 | 0.86 | 0.87 |
| Moderate | 1.00 | 0.75 | 0.86 | ||
| Low | 1.00 | 0.86 | 0.92 | ||
| Bs | High | 0.46 | 0.33 | 0.39 | 0.40 |
| Moderate | 0.50 | 0.55 | 0.52 | ||
| Low | 0.10 | 0.14 | 0.12 |
FIGURE 5Influences of a single variable on belonging to a specific WI class. The values provided show an importance higher than 3%. Coloured bars show a positive influence, i.e., the higher the feature value is the higher is the probability to be in a specific WI class, while vice versa for grey bars.
Feature importance of real scenario.
| Features | Week (n) | Mean (%) | SD (%) |
|---|---|---|---|
| HML Distance Per Minute (Chronic) | 23 | 7.66 | 9.64 |
| RPE (Chronic) | 8 | 6.65 | 3.97 |
| Accelerations Zone6 (Chronic) | 8 | 4.55 | 1.97 |
| Time In Heart Rate Zone4 (Chronic) | 8 | 3.73 | 1.25 |
| Time In Heart Rate Zone3 (Chronic) | 7 | 10.00 | 8.86 |
| Decelerations Zone6 (Chronic) | 6 | 8.08 | 4.72 |
| Distance 16-21 (Chronic) | 6 | 5.88 | 4.26 |
| Impacts Zone2 (Acute) | 6 | 3.13 | 0.29 |
| Time In Heart Rate Zone5 (Acute) | 5 | 8.47 | 9.16 |
| Sprints (Daily) | 5 | 7.59 | 4.92 |
| Time In Heart Rate Zone6 (ACWR) | 5 | 5.26 | 3.56 |
| Accelerations Z5 to Z6 (Chronic) | 4 | 5.61 | 2.00 |
| Accelerations Zone5 (Chronic) | 4 | 5.28 | 4.18 |
| Accelerations Zone4 (Chronic) | 4 | 3.49 | 0.58 |
| Time In Heart Rate Zone4 (Daily) | 3 | 28.21 | 24.04 |
This table reports only the 15 most important features. The values for mean and standard deviation (SD) are expressed in percentage. The weeks’ number refers to how many weeks a feature is used for WI prediction. The values are sorted by week and mean.