| Literature DB >> 29420576 |
John R Doyle1, Paul A Bottomley1.
Abstract
The paper analyses two datasets of elite soccer players (top 1000 professionals and UEFA Under-19 Youth League). In both, we find a Relative Age Effect (RAE) for frequency, but not for value. That is, while there are more players born at the start of the competition year, their transfer values are no higher, nor are they given more game time. We use Poisson regression to derive a transparent index of the discrimination present in RAE. Also, because Poisson is valid for small frequency counts, it supports analysis at the disaggregated levels of country and club. From this, we conclude there are no paragon clubs or countries immune to RAE; that is clubs and countries do not differ systematically in the RAE they experience; also, that Poisson regression is a powerful and flexible method of analysing RAE data.Entities:
Mesh:
Year: 2018 PMID: 29420576 PMCID: PMC5805271 DOI: 10.1371/journal.pone.0192209
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
RAE by frequency: Overall, and country-by-country analysis.
| t-test of mean | Poisson regression | ||||||
|---|---|---|---|---|---|---|---|
| Country | N | p | b0 | b1 | R2 | ID | |
| Argentina | 62 | 0.351 | < .001 | 0.9592 | -1.8513 | 0.21 | 6.37 |
| Netherlands | 41 | 0.386 | < .05 | 0.3701 | -1.3781 | 0.09 | 3.97 |
| Turkey | 23 | 0.390 | < .05 | -0.2168 | -1.3557 | 0.07 | 3.88 |
| Belgium | 31 | 0.405 | < .05 | -0.0143 | -1.1154 | 0.06 | 3.05 |
| Spain | 102 | 0.408 | < .01 | 1.1654 | -1.0880 | 0.13 | 2.97 |
| Italy | 72 | 0.420 | < .05 | 0.7775 | -1.0200 | 0.09 | 2.77 |
| Germany | 67 | 0.428 | < .05 | 0.6341 | -0.8246 | 0.07 | 2.28 |
| Portugal | 31 | 0.431 | -0.1486 | -0.7969 | 0.03 | 2.22 | |
| Brazil | 100 | 0.440 | < .05 | 0.9750 | -0.6884 | 0.07 | 1.99 |
| Russia | 24 | 0.445 | -0.4788 | -0.6285 | 0.02 | 1.87 | |
| France | 81 | 0.461 | 0.6718 | -0.5057 | 0.03 | 1.66 | |
| All countries | 1000 | 0.445 | < .001 | 3.2566 | -0.6466 | 0.37 | 1.91 |
| Above 11 countries | 634 | 0.418 | < .001 | 2.9455 | -0.9810 | 0.45 | 2.67 |
N is the number of players appearing in the top 1000 by transfer value; is the mean time of birth (0 at beginning of year, 1 at end, 0.5 mid-year); Poisson estimating equation is y = exp(b0 + b1.tB); R2 is McFadden’s pseudo-R2; ID is the Index of Discrimination = exp(-b1); p is the statistical significance for rejecting the null hypothesis ≥ 0.50. Same significance levels also found when testing null hypothesis b1 ≥ 0.
Fig 1Scatterplots of RAE by frequency and value: 1000 top professional footballers.
Left panel: Birth frequency by week of year (Poisson regression, best fit). Right panel: Mean log transfer value (€m.) by week (OLS regression, best fit).
RAE by frequency: Overall, and club-by-club analysis.
| t-test of mean | Poisson Regression | |||||||
|---|---|---|---|---|---|---|---|---|
| Club | N | p | b0 | b1 | p | R2 | ID | |
| Galatasaray AS | 27 | 0.275 | < .001 | 0.5269 | -3.1170 | < .001 | 0.287 | 22.58 |
| Liverpool* | 30 | 0.283 | < .001 | 0.5882 | -2.9590 | < .001 | 0.266 | 19.28 |
| Barcelona | 32 | 0.284 | < .001 | 0.6511 | -2.9534 | < .001 | 0.293 | 19.17 |
| Sporting Lisbon | 31 | 0.294 | < .001 | 0.5659 | -2.7681 | < .001 | 0.269 | 15.93 |
| Chelsea* | 30 | 0.296 | < .001 | 0.5273 | -2.7486 | < .001 | 0.278 | 15.62 |
| Juventus | 27 | 0.305 | < .001 | 0.3737 | -2.5880 | < .01 | 0.227 | 13.30 |
| Malmo FF | 40 | 0.305 | < .001 | 0.7663 | -2.5865 | < .001 | 0.252 | 13.28 |
| Shakhtar Donestsk | 37 | 0.308 | < .001 | 0.6751 | -2.5434 | < .001 | 0.253 | 12.72 |
| Bayer Leverkusen | 35 | 0.313 | < .001 | 0.5949 | -2.4640 | < .001 | 0.237 | 11.75 |
| Zenit St Petersburg | 27 | 0.315 | < .001 | 0.3242 | -2.4283 | < .01 | 0.207 | 11.34 |
| Basel 1893 | 25 | 0.323 | < .001 | 0.2099 | -2.3112 | < .01 | 0.169 | 10.09 |
| Borussia Dortmund | 31 | 0.328 | < .01 | 0.3994 | -2.2324 | < .01 | 0.190 | 9.32 |
| Real Madrid | 35 | 0.329 | < .001 | 0.5158 | -2.2174 | < .001 | 0.256 | 9.18 |
| Athletico Madrid | 31 | 0.332 | < .001 | 0.3809 | -2.1764 | < .01 | 0.167 | 8.81 |
| CSKA Moskva | 31 | 0.335 | < .001 | 0.3624 | -2.1209 | < .01 | 0.177 | 8.34 |
| Olympiacos | 39 | 0.340 | < .001 | 0.5686 | -2.0518 | < .001 | 0.193 | 7.78 |
| Bate Borisov | 27 | 0.340 | < .01 | 0.2004 | -2.0502 | < .01 | 0.153 | 7.77 |
| Ajax | 37 | 0.341 | < .001 | 0.5099 | -2.0339 | < .01 | 0.208 | 7.64 |
| Porto | 35 | 0.344 | < .01 | 0.4393 | -1.9900 | < .01 | 0.169 | 7.32 |
| RSC Anderlecht | 36 | 0.348 | < .001 | 0.4499 | -1.9392 | < .01 | 0.193 | 6.95 |
| Schalke 04 | 35 | 0.351 | < .01 | 0.4064 | -1.8954 | < .01 | 0.162 | 6.66 |
| Manchester City* | 19 | 0.365 | < .05 | -0.2749 | -1.6985 | < .05 | 0.089 | 5.47 |
| Paris Saint Germain | 36 | 0.367 | < .01 | 0.3535 | -1.6695 | < .01 | 0.141 | 5.31 |
| AS Roma | 31 | 0.369 | < .01 | 0.1949 | -1.6447 | < .01 | 0.106 | 5.18 |
| AS Monaco | 29 | 0.371 | < .05 | 0.1170 | -1.6144 | < .05 | 0.117 | 5.02 |
| Atheletic Bilboa | 40 | 0.372 | < .01 | 0.4330 | -1.5995 | < .01 | 0.142 | 4.95 |
| SL Benfica | 29 | 0.380 | < .05 | 0.0699 | -1.4892 | < .05 | 0.098 | 4.43 |
| Bayern Muenchen | 26 | 0.384 | < .05 | -0.0575 | -1.4416 | < .05 | 0.086 | 4.23 |
| Ludogorets Razgrad | 40 | 0.419 | < .05 | 0.1927 | -0.9912 | < .05 | 0.045 | 2.69 |
| Apoel | 39 | 0.421 | < .05 | 0.1536 | -0.9585 | < .05 | 0.052 | 2.61 |
| NK Maribor | 38 | 0.423 | 0.1212 | -0.9432 | 0.047 | 2.57 | ||
| Arsenal* | 33 | 0.436 | -0.0927 | -0.7736 | 0.027 | 2.17 | ||
| All 32 clubs | 1038 | 0.345 | < .001 | 3.8234 | -1.9738 | < .001 | 0.894 | 7.20 |
N is the number of “domestic” players appearing in the squad; is the mean time of birth (0 at beginning of year, 1 at end, 0.5 mid-year); Poisson estimating equation is y = exp(b0 + b1.tB); R2 is McFadden’s pseudo-R2; ID is the Index of Discrimination = exp(-b1); First p is the statistical significance for rejecting the null hypothesis ≥ 0.50. Second p is the statistical significance for rejecting the null hypothesis b1 ≥ 0.
Asterisk (*) is for clubs using 1st Sep. to 31st Aug. competition year
Fig 2Scatterplots of RAE by frequency and value: 32 clubs’ U19 academy footballers.
Left panel: Birth frequency by week of year (Poisson regression best fit). Right panel: Mean games played per player by birthweek (OLS regression best fit).