Donald Barron, Graham Ball, Matthew Robins, Caroline Sunderland.
Abstract
The aim was to objectively identify key performance indicators in professional soccer that influence outfield players' league status using an artificial neural network. Mean technical performance data were collected from 966 outfield players' (mean ± SD; age: 25 ± 4 yr, 1.81 ±) 90-minute performances in the English Football League. ProZone's MatchViewer system and online databases were used to collect data on 347 indicators assessing the total number, accuracy and consistency of passes, tackles, possessions regained, clearances and shots. Players were assigned to one of three categories based on where they went on to complete most of their match time in the following season: group 0 (n = 209 players) went on to play in a lower soccer league, group 1 (n = 637 players) remained in the Football League Championship, and group 2 (n = 120 players) moved up to the English Premier League. The models created correctly predicted between 61.5% and 78.8% of the players' league status. The model with the highest average test performance was for group 0 v group 2 (U21 international caps, international caps, median tackles, percentage of first time passes unsuccessful upper quartile, maximum dribbles and possessions gained minimum), which correctly predicted 78.8% of the players' league status with a test error of 8.3%. To date, there has not been a published example of an objective method of predicting career trajectory in soccer. This is a significant development as it highlights the potential for machine learning to be used in the scouting and recruitment process in a professional soccer environment.
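The grouping rule described in the abstract can be illustrated with a minimal sketch. The tier numbering, the `assign_group` name and the input layout (minutes played per league tier in the following season) are assumptions made for illustration only; the source does not describe how the authors encoded this step.

```python
from typing import Mapping

# Hypothetical tier numbering (an assumption for this sketch):
# 1 = English Premier League, 2 = Football League Championship,
# 3 and above = lower leagues.
PREMIER_LEAGUE_TIER = 1
CHAMPIONSHIP_TIER = 2


def assign_group(next_season_minutes: Mapping[int, float]) -> int:
    """Return group 0, 1 or 2 from the tier where most next-season minutes were played."""
    if not next_season_minutes:
        raise ValueError("no next-season minutes recorded for this player")
    # Tier that accounts for the largest share of match time in the following season.
    main_tier = max(next_season_minutes, key=next_season_minutes.get)
    if main_tier == PREMIER_LEAGUE_TIER:
        return 2  # moved up to the English Premier League
    if main_tier == CHAMPIONSHIP_TIER:
        return 1  # remained in the Football League Championship
    return 0  # went on to play in a lower league


print(assign_group({2: 1800, 1: 300}))  # 1
```

Ties between tiers are not handled specially here; `max` simply returns the first tier encountered with the largest total.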
Year: 2018 PMID: 30379858 PMCID: PMC6209225 DOI: 10.1371/journal.pone.0205818
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Biographical data represented as means and standard deviations for player groupings.
| Variables | Group 0 | Group 1 | Group 2 |
|---|---|---|---|
| N | 209 | 637 | 120 |
| Age (yr) | 25.5 ± | 25.4 ± | 25.6 ± |
| Height (cm) | 181.6 ± | 181.0 ± | 181.4 ± |
| 90-Minute Appearances | 10 ± | 18 ± | 19 ± |
| Total Minutes | 1262.9 ± | 2048.4 ± | 2223.7 ± |
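The group sizes in this table are uneven, and the results tables refer to balanced data sets. The sketch below shows the class shares and one possible way to balance a pairwise comparison. Random undersampling is an assumption chosen for illustration; the source does not state which balancing method was used, and the player identifiers are placeholders.

```python
import random

# Group sizes reported in the biographical table.
GROUP_SIZES = {0: 209, 1: 637, 2: 120}


def class_shares(sizes):
    """Share of all players that falls in each group."""
    total = sum(sizes.values())
    return {group: n / total for group, n in sizes.items()}


def undersample_pair(ids_a, ids_b, seed=0):
    """Randomly trim the larger group so both groups have equal size.

    Assumption: random undersampling is only one possible balancing
    strategy; the source does not say which one was used.
    """
    rng = random.Random(seed)
    n = min(len(ids_a), len(ids_b))
    return rng.sample(list(ids_a), n), rng.sample(list(ids_b), n)


shares = class_shares(GROUP_SIZES)
print({group: round(share, 3) for group, share in shares.items()})

# Placeholder player identifiers for a group 0 v group 1 comparison.
group0 = [f"g0_{i}" for i in range(GROUP_SIZES[0])]
group1 = [f"g1_{i}" for i in range(GROUP_SIZES[1])]
balanced0, balanced1 = undersample_pair(group0, group1)
print(len(balanced0), len(balanced1))  # 209 209
```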
Results for group 0 v group 1 balanced data set (Best Average Test Performance = 67.9% and Best Average Test Error = 10.8% with a combination of nine variables) and group 0 v group 1 model variables as means and standard deviations for player groupings.
| Rank | Variable | Average Test Performance (%) | Average Test Error (%) | Group 0 Means and Standard Deviations | Group 1 Means and Standard Deviations |
|---|---|---|---|---|---|
| 1 | Playing % | 65.5 | 11.2 | 30.5 ± | 49.5 ± |
| 2 | % of Backwards Passes Successful (Minimum) | 65.5 | 11.0 | 66.3 ± | 52.9 ± |
| 3 | Total Assists | 66.7 | 10.9 | 0.9 ± | 1.7 ± |
| 4 | % of Forwards Passes Successful (Median) | 66.7 | 10.9 | 56.3 ± | 56.9 ± |
| 5 | Total Shots on Target (Excluding Blocked) (Mean) | 66.7 | 10.9 | 0.3 ± | 0.4 ± |
| 6 | Offsides (Mean) | 66.7 | 10.9 | 0.3 ± | 0.3 ± |
| 7 | Shots On Target Outside the Box (Maximum) | 66.7 | 10.8 | 0.8 | 1.3 ± |
| 8 | Long Passes (Maximum) | 67.9 | 10.9 | 9.0 ± | 10.9 ± |
| 9 | First Time Passes Unsuccessful (Upper Quartile) | 67.9 | 10.8 | 3.1 ± | 3.2 ± |
| 10 | Passes Successful Own Half (Lower Quartile) | 66.7 | 10.8 | 6.6 ± | 6.2 ± |
Results for group 1 v group 2 balanced data set (Best Average Test Performance = 61.5% and Best Average Test Error = 11.6% with a combination of seven variables) and group 1 v group 2 model variables as means and standard deviations for player groupings.
| Rank | Variable | Average Test Performance (%) | Average Test Error (%) | Group 1 Means and Standard Deviations | Group 2 Means and Standard Deviations |
|---|---|---|---|---|---|
| 1 | % Unsuccessful Headers (Lower Quartile) | 54.2 | 12.3 | 44.2 ± | 40.7 ± |
| 2 | Number of Possessions (Median) | 56.3 | 12.2 | 44.3 ± | 46.4 ± |
| 3 | Interceptions (Mean) | 56.3 | 12.2 | 14.3 ± | 14.0 ± |
| 4 | Total Blocked Shots (Maximum) | 55.2 | 12.2 | 1.5 ± | 1.5 ± |
| 5 | Total Goals | 55.2 | 12.0 | 2.6 ± | 4.6 ± |
| 6 | Crosses (Upper Quartile) | 59.4 | 11.6 | 2.1 ± | 2.2 ± |
| 7 | Total Blocked Shots (Mean) | 61.5 | 11.6 | 0.4 ± | 0.3 ± |
| 8 | First Time Passes Successful (Upper Quartile) | 60.4 | 11.6 | 7.6 ± | 8.2 ± |
| 9 | % Successful Headers (Lower Quartile) | 59.4 | 11.6 | 30.7 ± | 30.9 ± |
| 10 | Average Touches (Maximum) | 60.4 | 11.6 | 2.4 ± | 2.4 ± |
Results for the group 0 v group 2 balanced data set (Best Average Test Performance = 78.8% and Best Average Test Error = 8.3% with a combination of ten variables) and group 0 v group 2 model variables as means and standard deviations for player groupings.
| Rank | Variable | Average Test Performance (%) | Average Test Error (%) | Group 0 Means and Standard Deviations | Group 2 Means and Standard Deviations |
|---|---|---|---|---|---|
| 1 | Under 21 International Caps | 69.7 | 10.2 | 0.9 ± | 3.0 ± |
| 2 | Full International Caps | 71.2 | 9.5 | 3.1 ± | 7.6 ± |
| 3 | Tackles (Median) | 73.5 | 9.1 | 3.1 ± | 3.0 ± |
| 4 | % First Time Passes Unsuccessful (Upper Quartile) | 75.8 | 8.9 | 38.3 ± | 36.1 ± |
| 5 | Fouls | 75.8 | 8.8 | 16.8 ± | 29.1 ± |
| 6 | Dribbles (Maximum) | 77.3 | 8.5 | 1.2 ± | 2.3 ± |
| 7 | Possession Gained (Minimum) | 78.8 | 8.4 | 13.4 ± | 10.8 ± |
| 8 | Number of Possessions (Mean) | 78.8 | 8.5 | 44.0 ± | 46.6 ± |
| 9 | Penalty Area Entries (Median) | 78.8 | 8.6 | 3.4 ± | 3.7 ± |
| 10 | Average Time in Possession (Maximum) | 78.8 | 8.3 | 2.9 ± | 3.1 ± |
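Each results table reports test performance and test error as variables are added in rank order, and each caption names a best combination size. The sketch below recovers those sizes from the tabulated values. The selection rule (highest average test performance, ties broken by lowest average test error) is an inference from the tables, not a procedure stated in the source, and `best_step` is a hypothetical helper name.

```python
# (rank, average test performance %, average test error %) for each added
# variable, copied from the three results tables.
TABLES = {
    "0 v 1": [(1, 65.5, 11.2), (2, 65.5, 11.0), (3, 66.7, 10.9), (4, 66.7, 10.9),
              (5, 66.7, 10.9), (6, 66.7, 10.9), (7, 66.7, 10.8), (8, 67.9, 10.9),
              (9, 67.9, 10.8), (10, 66.7, 10.8)],
    "1 v 2": [(1, 54.2, 12.3), (2, 56.3, 12.2), (3, 56.3, 12.2), (4, 55.2, 12.2),
              (5, 55.2, 12.0), (6, 59.4, 11.6), (7, 61.5, 11.6), (8, 60.4, 11.6),
              (9, 59.4, 11.6), (10, 60.4, 11.6)],
    "0 v 2": [(1, 69.7, 10.2), (2, 71.2, 9.5), (3, 73.5, 9.1), (4, 75.8, 8.9),
              (5, 75.8, 8.8), (6, 77.3, 8.5), (7, 78.8, 8.4), (8, 78.8, 8.5),
              (9, 78.8, 8.6), (10, 78.8, 8.3)],
}


def best_step(rows):
    """Pick the row with the highest performance, then lowest error, then lowest rank.

    Assumption: this tie-breaking order is inferred from the captions.
    """
    return min(rows, key=lambda row: (-row[1], row[2], row[0]))


for comparison, rows in TABLES.items():
    rank, performance, error = best_step(rows)
    print(f"group {comparison}: {rank} variables, {performance}% performance, {error}% error")
```

Under this rule the selected sizes are nine, seven and ten variables, which agrees with the three captions.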