| Literature DB >> 33091064 |
Laura M S de Jong1, Paul B Gastin2, Maia Angelova3, Lyndell Bruce1, Dan B Dwyer1.
Abstract
Knowledge of optimal technical performance is used to determine match strategy and the design of training programs. Previous studies in men's soccer have identified certain technical characteristics that are related to success. These studies however, have relative limited sample sizes or limited ranges of performance indicators, which may have limited the analytical approaches that were used. Research in women's soccer and our understanding of optimal technical performance, is even more limited (n = 3). Therefore, the aim of this study was to identify technical determinants of match outcome in the women's game and to compare analytical approaches using a large sample size (n = 1390 team performances) and range of variables (n = 450). Three different analytical approaches (i.e. combinations of technical performance variables) were used, a data-driven approach, a rational approach and an approach based on the literature in men's soccer. Match outcome was modelled using variables from each analytical approach, using generalised linear modelling and decision trees. It was found that the rational and data-driven approaches outperformed the literature-driven approach in predicting match outcome. The strongest determinants of match outcome were; scoring first, intentional assists relative to the opponent, the percentage of shots on goal saved by the goalkeeper relative to the opponent, shots on goal relative to the opponent and the percentage of duels that are successful. Moreover the rational and data-driven approach achieved higher prediction accuracies than comparable studies about men's soccer.Entities:
Mesh:
Year: 2020 PMID: 33091064 PMCID: PMC7580913 DOI: 10.1371/journal.pone.0240992
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Distribution of matches per league or tournament.
| League/Tournament | Team performances |
|---|---|
| EC 2013 | 50 |
| EC 2017 | 62 |
| FAWSL 2015 | 112 |
| FAWSL 2016 | 144 |
| FAWSL 2017–18 | 180 |
| NWSL 2016 | 206 |
| NWSL 2017 | 246 |
| NWSL 2018 | 222 |
| WC 2011 | 64 |
| WC 2015 | 104 |
Fig 1Flowchart of the analysis process.
Details of the techniques are explained in the text.
The most valuable 20 technical variables identified in feature selection for the three analytical approaches.
| Rank | Data-Driven Approach | Rational Approach | Literature-Driven Approach |
|---|---|---|---|
| 1 | |||
| 2 | Relative Intentional Assist | Relative Intentional Assist | |
| 3 | Assists | ||
| 4 | Relative Aerial Duels Won | ||
| 5 | |||
| 6 | Intentional Assist | Intentional Assist | |
| 7 | |||
| 8 | Relative Total Passes | ||
| 9 | |||
| 10 | Relative Offsides | ||
| 11 | Relative Recoveries | ||
| 12 | |||
| 13 | |||
| 14 | Relative Second Assists | ||
| 15 | Relative Touches Open Play Opponent Box | Percentage Successful Duels | |
| 16 | Relative Successful Dribbles | Total Passes | |
| 17 | Relative Recoveries | ||
| 18 | Relative Defensive Aerial Duels Won | Relative Percentage Total Successful Passes | |
| 19 | Aerial Duels Won | ||
| 20 | Relative Second Assists | Relative Aerial Duels Lost | Offsides |
All variables were used in the first phase of analysis, which included variables that have a pseudo-relationship with scoring. The pseudo-score related variables (underlined) were removed in the second phase of the analysis.
The structure and performance of the generalised linear (GLM) and Decision Tree (DT) models in both phases of the analysis.
| First Goal | -1.47 | First Goal | -1.24 | Relative Shots On Goal | -0.20 | |
| Relative Intentional Assist | -0.58 | Relative Percentage Shots On Goal Saved | -0.95 | |||
| Relative Big Chances On Target | -0.07 | Relative Intentional Assist | -0.49 | |||
| Relative Right Foot Shots on Target | -0.01 | Relative Shots On From Inside Box | -0.06 | |||
| Relative Shots On Conceded Inside Box | 0.01 | |||||
| 0.96 (0.93, 0.98) | 0.99 (0.97, 1.00) | 0.78 (0.72, 0.84) | ||||
| 0.88 (0.83, 0.92) | 0.89 (0.84, 0.92) | 0.75 (0.68, 0.80) | ||||
| Relative Intentional Assist | 0.75 | Percentage Successful Duels | -2.01 | Relative Aerial Duels Won | -0.05 | |
| Assists | 0.17 | Relative Intentional Assist | -0.69 | Relative Offsides | -0.04 | |
| Relative Touches Open Play Opponent Box | -0.002 | Intentional Assist | -0.24 | Relative Total Passes | -0.001 | |
| Relative Successful Dribbles | -0.02 | |||||
| Relative Recoveries | -0.01 | |||||
| Relative Aerial Duels Lost | 0.003 | |||||
| 0.87 (0.82, 0.91) | 0.78 (0.72, 0.83) | 0.65 (0.58, 0.71) | ||||
| 0.76 (0.70, 0.82) | 0.77 (0.70, 0.82) | 0.66 (0.59, 0.72) | ||||
Phase 1 of the analysis included pseudo-score related variables, whereas in phase 2 they were excluded. There were also three analytical approaches that reflect different sets of variables that were used for modelling. Accuracies presented on scale 0–1 with 95% Confidence Interval (CI).
Fig 2Decision tree for the data-driven approach in the second phase.
The tree indicates the association between the values of the key performance indicators and the likely match outcome. This model predicted match outcome with a classification accuracy of 76%. Percentages mentioned are the proportion of the total dataset. Probability shown is the likelihood of a match with those conditions falling into the final category. The number on the branches of the tree indicates the cut off value for the technical action.