| Literature DB >> 32976547 |
Michael Barth1,2, Arne Güllich3, Christian Raschner2, Eike Emrich1.
Abstract
Research investigating the nature and scope of developmental participation patterns leading to international senior-level success is mainly explorative up to date. One of the criticisms of earlier research was its typical multiple testing for many individual participation variables using bivariate, linear analyses. Here, we applied state-of-the-art supervised machine learning to investigate potential non-linear and multivariate effects of coach-led practice in the athlete's respective main sport and in other sports on the achievement of international medals. Participants were matched pairs (sport, sex, age) of adult international medallists and non-medallists (n = 166). Comparison of several non-ensemble and tree-based ensemble binary classification algorithms identified "eXtreme gradient boosting" as the best-performing algorithm for our classification problem. The model showed fair discrimination power between the international medallists and non-medallists. The results indicate that coach-led other-sports practice until age 14 years was the most important feature. Furthermore, both main-sport and other-sports practice were non-linearly related to international success. The amount of main-sport practice displayed a parabolic pattern while the amount of other-sports practice displayed a saturation pattern. The findings question excess involvement in specialised coach-led main-sport practice at an early age and call for childhood/adolescent engagement in coach-led practice in various sports. In data analyses, combining traditional statistics with advanced supervised machine learning may improve both testing of the robustness of findings and new discovery of patterns among multivariate relationships of variables, and thereby of new hypotheses.Entities:
Mesh:
Year: 2020 PMID: 32976547 PMCID: PMC7518846 DOI: 10.1371/journal.pone.0239378
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Relevance of the volume of coach-led practice in an athlete’s main sport and in other sports for the differentiation between senior international and national-level success (adapted and updated from [11]).
| Amount of main-sport practice | Amount of other-sports practice | ||||
|---|---|---|---|---|---|
| No | Source | Childhood and adolescence | Adulthood | Childhood and adolescence | Adulthood |
| 1 | [ | – | n.a. | + | n.a. |
| 2 | [ | +/o | n.a. | n.a. | n.a. |
| 3 | [ | o | n.a. | +/o | n.a. |
| 4 | [ | o/– | o | o | o |
| 5 | [ | o/– | o | + | + |
| 6 | [ | o | o//– | o/+ | o//+ |
| 7 | [ | – | o/n.a. | + | o/n.a. |
| 8 | [ | o | o | + | n.a./o |
| 9 | [ | o//– | o | o//+ | o |
Note: n.a.: no information available
Relevance for success:
+: sig. positive correlation (athletes achieving international senior-level success practiced more compared to ‘only’ nationally successful athletes)
–: sig. negative correlation (internationally successful athletes practiced less compared to ‘only’ nationally successful athletes)
o: no correlation between success and amount of practice
x/y: the majority of the results in this category correspond to x, but y was also found
x//y: x and y were found the same number of times
Note that study 2 [13] compared more successful Greek with less successful Canadian gymnasts.
Senior international medallists’ and non-medallists’ main-sport and other-sports coach-led practice hours through three age categories: Until 14, 15 to 18 and 19 to 21 years.
| Medallists | Non-Medallists | |||||
|---|---|---|---|---|---|---|
| M | (SD) | n | M | (SD) | n | |
| Age (years) | 25.0 | 4.7 | 83 | 24.2 | 4.4 | 83 |
| Age of peak performance (years) | 23.5 | 3.9 | 83 | 23.3 | 4.0 | 83 |
| Main-sport practice (hours) | ||||||
| until 14 years | 1005.7 | 1057.0 | 83 | 1482.4 | 1246.0 | 83 |
| 15 to 18 years | 2300.3 | 1451.2 | 83 | 2783.0 | 1648.8 | 83 |
| 19 to 21 years | 2783.7 | 1503.2 | 81 | 3007.0 | 1422.5 | 81 |
| Other-sports practice (hours) | ||||||
| until 14 years | 728.6 | 1021.8 | 83 | 179.7 | 336.0 | 83 |
| 15 to 18 years | 274.7 | 505.3 | 83 | 60.5 | 133.1 | 83 |
| 19 to 21 years | 90.0 | 245.9 | 81 | 63.5 | 191.5 | 81 |
Assessed classification accuracy of different methods.
| Indicator | Support Vector Machines | Random Forests | eXtreme Gradient Boosting |
|---|---|---|---|
| AUC | 0.71 | 0.78 | 0.79 |
| Precision medallists | 0.72 | 0.70 | 0.71 |
| non-medallists | 0.67 | 0.71 | 0.73 |
| Recall medallists | 0.63 | 0.72 | 0.74 |
| non-medallists | 0.75 | 0.69 | 0.69 |
| f1 medallists | 0.67 | 0.71 | 0.72 |
| non-medallists | 0.71 | 0.70 | 0.71 |
Baseline values are in all cases 0.50.
Fig 1Importance of the model’s features.
Note: A: Each variable’s relative contribution is scaled with the sum of all adding to 100. B: Ratio between “permutation error” and “original error” was calculated, which means features with an importance of (around) 1 are not relevant for the model.
Fig 2Individual Conditional Expectation (ICE) curves for the features volume of coach-led practice in the athlete’s main sport and coach-led practice in other sports.
Bold curves illustrate the mean values. A likelihood value of 0.5 corresponds to chance. Note the different abscissae scale orientations.
Fig 3Interactive effect of the features coach-led other-sports practice up to 14 years and coach-led main-sport practice at 19 to 21 years on the probability to be classified as an international medallist.
A partial dependence value of 0.5 corresponds to chance. Note the different scale orientations of the abscissae.