| Literature DB >> 34921155 |
Maximilian Klemp1, Fabian Wunderlich2, Daniel Memmert2.
Abstract
Two highly relevant aspects of football, namely forecasting of results and performance analysis by means of performance indicators, are combined in the present study by analysing the value of in-play information in terms of event and positional data in forecasting the further course of football matches. Event and positional data from 50 matches, including more than 300 million datapoints were used to extract a total of 18 performance indicators. Moreover, goals from more than 30,000 additional matches have been analysed. Results suggest that surprisingly goals do not possess any relevant informative value on the further course of a match, if controlling for pre-game market expectation by means of betting odds. Performance indicators based on event and positional data have been shown to possess more informative value than goals, but still are not sufficient to reveal significant predictive value in-play. The present results are relevant to match analysts and bookmakers who should not overestimate the value of in-play information when explaining match performance or compiling in-play betting odds. Moreover, the framework presented in the present study has methodological implications for performance analysis in football, as it suggests that researchers should increasingly segment matches by scoreline and control carefully for general team strength.Entities:
Year: 2021 PMID: 34921155 PMCID: PMC8683419 DOI: 10.1038/s41598-021-03157-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Performance indicators used in this study with descriptions, abbreviations and example references.
| Performance indicator [abbreviation] | Description | Example Reference |
|---|---|---|
| Shots [SHOT] | Number of shots | [ |
| Passes [PASS] | Number of passes | [ |
| Short passes [SPASS] | Number of short passes | [ |
| Long passes [LPASS] | Number of long passes | [ |
| Crosses [CROSS] | Number of crosses | [ |
| Throw-Ins [THROW] | Number of throw-ins | [ |
| Clearances [CLEAR] | Number of clearances | [ |
| Fouls [FOUL] | Number of fouls committed | [ |
| Running distance [RD] | Total distance covered by all players | [ |
| In-possession running distance [RD_IP] | Total distance covered by players while team in possession | [ |
| Out-of-possession running distance [RD_OOP] | Total distance covered by players while team not in possession | [ |
| High-speed running distance [RD_HS] | Total distance covered by players faster than 14.4 km•h−1 | [ |
| Ball possession [BP] | Relative ball possession rate | [ |
| Ball distance [BD] | Total distance covered by the ball while the respective team in possession | Analysed for the first time |
| Space control [SC] | Proportion of the pitch controlled by the respective teams by means of Voronoi diagrams | [ |
| Space control attacking third [SC_ATT] | Proportion of the attacking third controlled by the respective teams by means of Voronoi diagrams | [ |
| Space control midfield third [SC_MID] | Proportion of the midfield third controlled by the respective teams by means of Voronoi diagrams | [ |
| Space control defensive third [SC_DEF] | Proportion of the defensive third controlled by the respective teams by means of Voronoi diagrams | [ |
Descriptive Statistics for all Performance Indicators.
| Performance indicator | Home | Away | ||||||
|---|---|---|---|---|---|---|---|---|
| Mean | SD | Min | Max | Mean | SD | Min | Max | |
| SHOT [#] | 7.3 | 3.5 | 2 | 16 | 5.3 | 2.5 | 0 | 11 |
| PASS [#] | 233.8 | 65.1 | 105 | 419 | 210.5 | 57.4 | 99 | 357 |
| SPASS [#] | 210.6 | 65.3 | 85 | 396 | 187.9 | 57.1 | 80 | 332 |
| LPASS [#] | 23.2 | 5 | 10 | 34 | 22.6 | 5.8 | 10 | 41 |
| CROSS [#] | 8.3 | 3.9 | 1 | 18 | 6.7 | 3.5 | 2 | 18 |
| THROW [#] | 13.1 | 4.5 | 5 | 27 | 12.2 | 3.6 | 5 | 20 |
| CLEAR [#] | 29.8 | 7 | 17 | 43 | 33 | 9.4 | 13 | 52 |
| FOUL [#] | 6.6 | 2.7 | 2 | 13 | 7.5 | 2.9 | 2 | 15 |
| RD [m] | 57,772.7 | 2,397.5 | 51,199.7 | 63,578.8 | 57,407.6 | 2,559.4 | 51,166.6 | 64,215.2 |
| RD_IP [m/min] | 1,242.5 | 73.6 | 1,068.6 | 1,440.3 | 1,227 | 73.2 | 1,069.4 | 1,408 |
| RD_OOP [m/min] | 1,286.9 | 88.3 | 1,098.9 | 1,547.5 | 1,284.4 | 96.6 | 1,088.9 | 1,554.7 |
| RD_HS [m] | 13,627.4 | 1,355.8 | 10,713.1 | 16,909.8 | 13,522.1 | 1,335.7 | 11,100.8 | 16,466.5 |
| BP [%] | 50.9 | 5.7 | 38.1 | 62 | 49.1 | 5.7 | 38 | 61.9 |
| BD [m/min] | 163 | 16.1 | 127.5 | 203.7 | 157.7 | 14.3 | 132.3 | 190.7 |
| SC [%] | 51.7 | 4.5 | 41.6 | 62.8 | 48.3 | 4.5 | 37.2 | 58.4 |
| SC_ATT [%] | 16.3 | 4.1 | 9 | 25.2 | 13.3 | 3.7 | 4.9 | 23 |
| SC_MID [%] | 51.9 | 6.3 | 34.6 | 68.2 | 48.1 | 6.3 | 31.8 | 65.4 |
| SC_DEF [%] | 86.7 | 3.7 | 77 | 95.1 | 83.7 | 4.1 | 74.8 | 91 |
SD standard deviation, Min minimum, Max maximum, units are presented in brackets where # refers to count variables.
Results for various models forecasting the outcome of the second half in terms of home win, draw or away win.
| Model | RPS | UNI | FRQ | GOAL | PROB |
|---|---|---|---|---|---|
| UNI | – | – | – | – | |
| FRQ | < 0.0001* | – | – | – | |
| GOAL | < 0.0001* | < 0.0001* | – | – | |
| PROB | < 0.0001* | < 0.0001* | < 0.0001* | – | |
| BOTH | < 0.0001* | < 0.0001* | < 0.0001* | 0.2862 |
*Significant at a 5% level.
Correlations for various performance indicators.
| Technical performance indicators | ||||||||
|---|---|---|---|---|---|---|---|---|
| SHOT | PASS | SPASS | LPASS | CROSS | THROW | CLEAR | FOUL | |
| Strength dependence | 0.10 | − 0.13 | − | − 0.08 | ||||
| Explanatory power | − 0.05 | − 0.08 | − | − | − 0.04 | |||
| Predictive power | 0.12 | − 0.02 | 0.14 | − 0.04 | − 0.09 | − 0.03 | ||
| Predictive overperformance | 0.09 | − 0.03 | 0.12 | − 0.04 | − 0.09 | − 0.02 | ||
**Significant at a 5% level, * significant at a 10% level, correlation coefficients with |r|> = 0.15 are highlighted in bold.
Figure 1Correlation coefficients in the four measures of association for all performance indicators examined. Top panel shows technical performance indicators, middle panel shows tactical performance indicators and bottom panel shows physical performance indicators. Dashed horizontal line indicates a correlation coefficient of 0. Please refer to Method for a detailed description of the measures and performance indicators.