| Literature DB >> 29870554 |
Fabian Wunderlich1, Daniel Memmert1.
Abstract
Betting odds are frequently found to outperform mathematical models in sports related forecasting tasks, however the factors contributing to betting odds are not fully traceable and in contrast to rating-based forecasts no straightforward measure of team-specific quality is deducible from the betting odds. The present study investigates the approach of combining the methods of mathematical models and the information included in betting odds. A soccer forecasting model based on the well-known ELO rating system and taking advantage of betting odds as a source of information is presented. Data from almost 15.000 soccer matches (seasons 2007/2008 until 2016/2017) are used, including both domestic matches (English Premier League, German Bundesliga, Spanish Primera Division and Italian Serie A) and international matches (UEFA Champions League, UEFA Europe League). The novel betting odds based ELO model is shown to outperform classic ELO models, thus demonstrating that betting odds prior to a match contain more relevant information than the result of the match itself. It is shown how the novel model can help to gain valuable insights into the quality of soccer teams and its development over time, thus having a practical benefit in performance analysis. Moreover, it is argued that network based approaches might help in further improving rating and forecasting methods.Entities:
Mesh:
Year: 2018 PMID: 29870554 PMCID: PMC5988281 DOI: 10.1371/journal.pone.0198668
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Information on the data set used within this study.
| Competition | Seasons | Number of matches | Average overround | Average theoretical bookmaker payout |
|---|---|---|---|---|
| English Premier League | 07/08–16/17 | 3,800 | 1.065 | 0.939 |
| German Bundesliga | 07/08–16/17 | 3,060 | 1.060 | 0.944 |
| Spanish Primera Division | 07/08–16/17 | 3,800 | 1.065 | 0.939 |
| Italian Serie A | 07/08–16/17 | 3,798 | 1.067 | 0.937 |
| UEFA Champions League | 07/08–16/17 | 316 | 1.047 | 0.956 |
| UEFA Europe League | 07/08–16/17 | 157 | 1.054 | 0.949 |
| Total | 07/08–16/17 | 14,931 | 1.064 | 0.940 |
Fig 1The forecasting methods and statistical framework as used within this study and largely obtained from Hvattum and Arntzen.
Comparison of informational loss for different models and various parameters.
| Forecasting model | Parameters | Average Li |
|---|---|---|
| Betting Odds | - | 1.3795 |
| ELO-Odds | k = 175 | 1.3913 |
| ELO-Odds | k = 200 | 1.3913 |
| ELO-Odds | k = 150 | 1.3914 |
| ELO-Odds | k = 250 | 1.3915 |
| ELO-Odds | k = 100 | 1.3919 |
| ELO-Odds | k = 300 | 1.3920 |
| ELO-Odds | k = 400 | 1.3937 |
| ELO-Odds | k = 50 | 1.3937 |
| ELO-Goals | k0 = 4, λ = 1.6 | 1.4008 |
| ELO-Goals | k0 = 4, λ = 1.4 | 1.4009 |
| ELO-Goals | k0 = 6, λ = 1.2 | 1.4009 |
| ELO-Goals | k0 = 6, λ = 1.0 | 1.4011 |
| ELO-Goals | k0 = 2, λ = 2.0 | 1.4012 |
| ELO-Goals | k0 = 6, λ = 1.4 | 1.4013 |
| ELO-Goals | k0 = 8, λ = 1.0 | 1.4013 |
| ELO-Goals | k0 = 8, λ = 0.8 | 1.4013 |
| ELO-Goals | k0 = 4, λ = 1.8 | 1.4014 |
| ELO-Goals | k0 = 4, λ = 1.2 | 1.4016 |
| ELO-Result | k = 14 | 1.4032 |
| ELO-Result | k = 12 | 1.4033 |
| ELO-Result | k = 16 | 1.4034 |
| ELO-Result | k = 18 | 1.4038 |
| ELO-Result | k = 10 | 1.4038 |
| ELO-Result | k = 20 | 1.4043 |
| ELO-Result | k = 25 | 1.4060 |
| ELO-Result | k = 5 | 1.4105 |
Fig 2Average informational loss for various choices of the parameter k in model ELO-Result.
Fig 4Average informational loss for various choices of the parameter k in model ELO-Odds.
Statistical tests comparing the predictive qualities of different forecasting methods.
The p-value compares each model to the model in the next row.
| Forecasting Model | Average Li | Standard deviation Li | p-value (paired t-test) |
|---|---|---|---|
| Betting Odds | 1.380 | 0.674 | < 0.0001 |
| ELO-Odds (k = 175) | 1.391 | 0.706 | < 0.0001 |
| ELO-Goals (k0 = 4, λ = 1.6) | 1.401 | 0.714 | 0.0202 |
| ELO-Result (k = 14) | 1.403 | 0.715 | - |
Statistical tests comparing the predictive qualities of ELO-Odds (various extreme parameters) to ELO-Goals.
The p-value compares each model to ELO-Goals.
| Forecasting Model | Average Li | Standard deviation Li | p-value (paired t-test) |
|---|---|---|---|
| ELO-Odds (k = 175) | 1.391 | 0.706 | <0.0001 |
| ELO-Odds (k = 400) | 1.394 | 0.709 | 0.0044 |
| ELO-Odds (k = 30) | 1.396 | 0.707 | 0.0026 |
| ELO-Odds (k = 500) | 1.397 | 0.707 | 0.2118 |
| ELO-Odds (k = 20) | 1.398 | 0.714 | 0.0857 |
| ELO-Goals (k0 = 4, λ = 1.6) | 1.401 | 0.714 | - |
Fig 5ELO-Odds and ELO-Result of Borussia Dortmund within the seasons 2013/14 and 2014/15.
Fig 6ELO-Odds and ELO-Result of Leicester City within the seasons 2014/15 and 2015/16.
Comparison between league table and average ELO-Odds rating (Primera Division 2013/14).
| Pos. | Team | Goal Diff. | Points | Pos. | Team | ELO-Odds |
|---|---|---|---|---|---|---|
| 1. | Atlético Madrid | 51 | 90 | 1. | FC Barcelona | 1294.59 |
| 2. | FC Barcelona | 67 | 87 | 2. | Real Madrid | 1229.22 |
| 3. | Real Madrid | 66 | 87 | 3. | Atlético Madrid | 1174.11 |
| 4. | Athletic Bilbao | 27 | 70 | 4. | FC Valencia | 1057.04 |
| 5. | FC Sevilla | 17 | 63 | 5. | Athletic Bilbao | 1045.58 |
| 6. | FC Villarreal | 16 | 59 | 6. | Real Sociedad | 1033.51 |
| 7. | Real Sociedad | 7 | 59 | 7. | FC Villareal | 1022.52 |
| 8. | FC Valencia | −2 | 49 | 8. | FC Sevilla | 989.47 |
| 9. | Celta Vigo | −5 | 49 | 9. | Espanyol Barcelona | 983.53 |
| 10. | Levante UD | −8 | 48 | 10. | FC Málaga | 973.85 |
| 11. | FC Málaga | −7 | 45 | 11. | Betis Sevilla | 968.70 |
| 12. | Rayo Vallecano | −34 | 43 | 12. | Celta Vigo | 962.58 |
| 13. | FC Getafe | −19 | 42 | 13. | FC Getafe | 946.62 |
| 14. | Espanyol Barcelona | −10 | 42 | 14. | FC Granada | 941.67 |
| 15. | FC Granada | −24 | 41 | 15. | FC Elche | 941.33 |
| 16. | FC Elche | −20 | 40 | 16. | Rayo Vallecano | 940.17 |
| 17. | UD Almería | −28 | 40 | 17. | CA Osasuna | 938.78 |
| 18. | CA Osasuna | −30 | 39 | 18. | Real Valladolid | 933.25 |
| 19. | Real Valladolid | −22 | 36 | 19. | UD Almería | 928.16 |
| 20. | Betis Sevilla | −42 | 25 | 20. | Levante UD | 915.48 |
Fig 7Simplified illustration of the database as a network of teams (nodes) and matches (edges).