| Literature DB >> 35719541 |
Lingfeng Ruan1, Huanmin Ge1, Yanfei Shen1, Zhiqiang Pu2, Shouxin Zong1, Yixiong Cui1,3.
Abstract
Establishing and illustrating a predictive and prescriptive model of playing styles that football teams adopt during matches is a key step toward describing and measuring the effectiveness of styles of play. The current study aimed to identify and measure the effectiveness of different defensive playing styles for professional football teams considering the opponent's expected goal. Event data of all 1,120 matches played in the Chinese Football Super League (CSL) from the 2016 to 2020 seasons were collected, with fifteen defense-related performance variables being extracted. The PCA model (KMO = 0.76) output eight factors that represented 7 different styles of play (factor 6 and 8 represent one style of play) and explained 85.17% of the total variance. An expected goal (xG) model was built using data related to 27,852 shots. Finally, the xG of the opponent was calculated in the multivariate regression model, outputting five factors that (p < 0.05) explained 41.6% of the total variance in the xG of the opponent and receiving a dangerous situation (factor 7) was the most apparent style (31.3%). Finally, the predicted model with defensive styles correlated with actual xG of the opponent at r = 0.62 using the 2020 season as testing data which showed that the predicted xG was correlated moderately with the actual. The result indicated that if the team strengthened the defense closed to the own goal, high intensity confrontation, and defense of goalkeeper, meanwhile making less errors and receiving less dangerous situations, the xG of the opponent would be greatly reduced.Entities:
Keywords: PCA; defense; match analysis; multivariate regression; xG
Year: 2022 PMID: 35719541 PMCID: PMC9202555 DOI: 10.3389/fpsyg.2022.899199
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Description and definition of the defensive performance indicators.
| Variable | Definition |
|---|---|
| Related to own team | |
| 1. Interception | A player reads an opponent’s pass and intercepts the ball by moving into the line of the intended pass. The interception could be finished with or without the ball recovery |
| 2. Clearance | A player kicks the ball away from his own goal with no intended recipient, and the clearance included that the ball kicked from the offensive filed or kicked to the sideline |
| 3. Ball gain zone 1 | The analyzed team gained the ball in zone 1 (see |
| 4. Error total | The number that a defensive player makes total errors, which leads to goals or shots conceded |
| 5. Error in own half (%) | The number that a defensive player makes errors in own half divided by total errors |
| 6. Keeper claim | The number of times that the goalkeeper get possession of the ball positively |
| 7. Keeper smother | The number of times that the goalkeeper who comes out and claims balls at the feet of a forward gets smothers, similar to tackles |
| 8. Foul total | The number of times that a player commits a foul for defense |
| 9. Creating danger | After gaining the ball, the analyzed team made a shot or entry into the opposing penalty area |
| Related to opponent team | |
| 10. Deep completion | The number of pass (excluding crosses) that was received in a 20-meter radius from the opponent goal line |
| 11. Cross unsuccess | The number of teams crossed unsuccessfully |
| 12. Dribble success | The number of teams dribbled successfully |
| 13. Shot accuracy (%) | The number that a player shot accuracy |
| 14. Pass in the zone 3 | Any kind of pass made by the team in zone 3 (see |
| 15. Pass in the zone 4 | Any kind of pass made by the team in zone 4 (see |
Figure 1Pitch divisions in six zones parallel to the goal lines. ① represents zone 1; ② represents zone 2; ③ represents zone 3; ④ represents zone 4; ⑤ represents zone 5; ⑥ represents zone 6.
Figure 2Expected conversion probabilities of shots on the pitch. The darker the red, the higher the expected goal.
Rotated component matrix for the performance indicators showing a strong positive or negative correlation.
| Component | ||||||||
|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | |
| Interception | 0.625 | |||||||
| Clearance | 0.888 | |||||||
| Ball gain zone 1 | 0.875 | |||||||
| Error total | 0.970 | |||||||
| Error in own half | 0.974 | |||||||
| Keeper claim | 0.986 | |||||||
| Keeper smother | 0.948 | |||||||
| Deep completion | 0.746 | |||||||
| Foul total | 0.622 | |||||||
| Cross unsuccess | 0.833 | |||||||
| Dribble success | −0.687 | |||||||
| Shot accuracy | 0.943 | |||||||
| Pass in the zone 3 | −0.810 | |||||||
| Pass in the zone 4 | −0.876 | |||||||
| Creating danger | 0.799 | |||||||
Figure 3The ROC curve of different logistic regression models in xG. The diagonal dotted line represents a line of zero discrimination, also known as pure chance line.
Relative contribution of defensive styles of play to the variance of excepted goals (xG) of the opponent frequency.
| Predictors | Non-standardized | Standardized | Adjusted |
|
| Tolerance | VIF |
|---|---|---|---|---|---|---|---|
| Constant | 0.219 | 2.561 | 0.01 | ||||
| Receiving a dangerous situation | 2.405 | 0.917 | 0.312 | 31.872 | <0.01 | 0.68 | 1.46 |
| Defense closed to the own goal | −1.053 | −0.441 | 0.389 | −15.375 | <0.01 | 0.61 | 1.64 |
| Error | 0.155 | 0.109 | 0.403 | 6.216 | <0.01 | 0.60 | 1.66 |
| Keeper claim | −0.040 | −0.101 | 0.413 | −5.688 | <0.01 | 0.58 | 1.71 |
| High intensity confrontation | −0.175 | −0.051 | 0.416 | −2.923 | <0.01 | 0.58 | 1.72 |
| Mid-positioning defense with pressure | −0.021 | −1.042 | 0.30 | 0.79 | 1.27 | ||
| Defense in advanced zones | −0.042 | −1.946 | 0.05 | 0.65 | 1.54 | ||
| Keeper smother | 0.009 | 0.497 | 0.62 | 0.99 | 1.02 |
VIF, Variance inflation factor. Adjusted R2 is cumulative, with each incremental step adding to the variance explained.
p < 0.05;
p < 0.01.
Figure 4Correlation between the actual and predicted xG of the opponent of 2020 season in CSL. Each dot represents one team in a match, for a total of 320 real-predicted comparisons. The dotted line represents the trend of correlation between the actual and predicted xG of the opponent.