| Literature DB >> 30763328 |
Dan Weaving1,2, Ben Jones1,3,4, Matt Ireton1,5, Sarah Whitehead1,2, Kevin Till1,2,3, Clive B Beggs1.
Abstract
OBJECTIVES: Professional sporting organisations invest considerable resources collecting and analysing data in order to better understand the factors that influence performance. Recent advances in non-invasive technologies, such as global positioning systems (GPS), mean that large volumes of data are now readily available to coaches and sport scientists. However analysing such data can be challenging, particularly when sample sizes are small and data sets contain multiple highly correlated variables, as is often the case in a sporting context. Multicollinearity in particular, if not treated appropriately, can be problematic and might lead to erroneous conclusions. In this paper we present a novel 'leave one variable out' (LOVO) partial least squares correlation analysis (PLSCA) methodology, designed to overcome the problem of multicollinearity, and show how this can be used to identify the training load (TL) variables that influence most 'end fitness' in young rugby league players.Entities:
Mesh:
Year: 2019 PMID: 30763328 PMCID: PMC6375576 DOI: 10.1371/journal.pone.0211776
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
League outcome and match performance data for the teams in the European Super League (season 2017).
| Team | League Points | Score difference | Number | Number | Number |
|---|---|---|---|---|---|
| Castleford Tigers | 40 | 391 | 1523 | 918 | 208 |
| Leeds Rhinos | 30 | 76 | 1588 | 796 | 153 |
| Wigan Warriors | 23 | 21 | 1574 | 697 | 157 |
| Warrington Wolves | 20 | -145 | 1654 | 830 | 165 |
| Wakefield Trinity Wildcats | 26 | 63 | 1519 | 698 | 174 |
| Salford Red Devils | 26 | 76 | 1525 | 784 | 178 |
| Huddersfield Giants | 21 | 35 | 1656 | 757 | 143 |
| Hull FC | 27 | 27 | 1522 | 872 | 180 |
| St Helens | 25 | 25 | 1746 | 744 | 166 |
| Leigh Centurions | 12 | 12 | 1639 | 606 | 159 |
| Widnes Vikings | 11 | -269 | 1652 | 626 | 136 |
| Catalans Dragons | 15 | -220 | 1451 | 701 | 129 |
Fig 1Singular value inertial value (indicated by dotted line) computed from the observed data and the null-distribution of the inertia computed using a permutation test with 10,000 permutations.
Training load (TL) data and termination speed during 30–15 intermittent fitness test for 16 professional youth rugby league players.
| Player ID | TD | SZ1 | SZ2 | SZ3 | SZ4 | PL | sRPE | IndSZ | PLZ1 | PLZ2 | PLZ3 | PLZ4 | Start Fitness | End Fitness |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 53844 | 36323 | 14962 | 2196 | 358 | 7115 | 16027 | 4674 | 1985 | 3504 | 1269 | 358 | 16.5 | 18.0 |
| 2 | 70550 | 43477 | 22926 | 3682 | 464 | 8241 | 18848 | 7072 | 2471 | 4569 | 1051 | 151 | 16.5 | 17.5 |
| 3 | 55967 | 35915 | 16397 | 3376 | 278 | 5665 | 11952 | 4521 | 2335 | 2516 | 544 | 270 | 17.0 | 17.5 |
| 4 | 57847 | 35950 | 18612 | 2713 | 535 | 5798 | 17276 | 3960 | 2168 | 2806 | 522 | 270 | 17.0 | 19.0 |
| 5 | 42585 | 28529 | 11444 | 2260 | 352 | 4453 | 12475 | 3950 | 2001 | 1932 | 360 | 161 | 17.0 | 18.0 |
| 6 | 63157 | 41285 | 17447 | 3876 | 520 | 6492 | 15594 | 4712 | 2699 | 3126 | 530 | 138 | 17.5 | 19.5 |
| 7 | 63540 | 40009 | 18602 | 4227 | 699 | 6394 | 14806 | 5453 | 2665 | 2819 | 673 | 238 | 17.5 | 20.0 |
| 8 | 63833 | 41708 | 18462 | 3090 | 567 | 6955 | 13744 | 4180 | 2692 | 3484 | 630 | 150 | 17.5 | 19.0 |
| 9 | 47832 | 29853 | 13184 | 3885 | 897 | 4988 | 12059 | 4379 | 2145 | 2353 | 381 | 109 | 17.5 | 19.0 |
| 10 | 67531 | 42024 | 20176 | 4491 | 840 | 7221 | 15251 | 5152 | 2805 | 3215 | 845 | 356 | 18.5 | 19.5 |
| 11 | 54425 | 34689 | 16342 | 2911 | 483 | 5655 | 15039 | 2870 | 2270 | 2766 | 460 | 160 | 18.5 | 19.0 |
| 12 | 62172 | 39659 | 17703 | 4094 | 705 | 6072 | 16563 | 4574 | 2766 | 2670 | 514 | 123 | 18.5 | 20.0 |
| 13 | 76006 | 45183 | 23589 | 6133 | 1100 | 8041 | 13024 | 8178 | 2688 | 3776 | 1157 | 421 | 19.0 | 20.5 |
| 14 | 35828 | 21557 | 10290 | 2989 | 992 | 3524 | 14590 | 2982 | 1362 | 1601 | 318 | 242 | 19.0 | 20.5 |
| 15 | 52281 | 33999 | 15164 | 2272 | 633 | 5928 | 14316 | 3279 | 2204 | 2956 | 568 | 200 | 19.5 | 19.5 |
| 16 | 47583 | 28391 | 15613 | 3154 | 425 | 4546 | 11090 | 2919 | 2102 | 1967 | 378 | 100 | 19.5 | 20.0 |
| 57186 | 36159 | 16932 | 3459 | 616 | 6068 | 14541 | 4553 | 2335 | 2879 | 638 | 215 | 17.9 | 19.2 | |
| 10579 | 6491 | 3626 | 1012 | 240 | 1294 | 2102 | 1442 | 386 | 750 | 292 | 98 | 1.0 | 1.0 |
Abbreviations: TD = total distance (m); SZ1 = speed zone 1 (0 to 3 m·s-1; [m]); SZ2 = speed zone 2 (3.1 to 5 m·s-1; [m]); SZ3 = speed zone 3 (5.1 to 7 m·s-1; [m]); SZ4 = speed zone 1 (> 7.1 m·s-1; [m]); PL = PlayerLoad (AU); PLZ1 = PlayerLoad Zone 1 (0 to 1 AU); PLZ2 = PlayerLoad Zone 2 (1.1 to 2 AU); PLZ3 = PlayerLoad Zone 3 (2.1 to 3 AU); PLZ4 = PlayerLoad Zone 4 (> 3.1 AU); sRPE = session-rating-of-perceived-exertion; IndSZ = Individualised speed zone (> 30–15 intermittent fitness test termination speed).
Results of the Pearson correlation analysis between training load variables, starting fitness and end fitness.
| TD | SZ1 | SZ2 | SZ3 | SZ4 | PL | sRPE | IndSZ | PLZ1 | PLZ2 | PLZ3 | PLZ4 | Starting | End | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| TD [p value] | NA | [0.000] | [0.000] | [0.004] | [0.560] | [0.000] | [0.122] | [0.000] | [0.000] | [0.000] | [0.005] | [0.195] | [0.605] | [0.937] |
| SZ1 [p value] | [0.000] | NA | [0.000] | [0.021] | [0.889] | [0.000] | [0.094] | [0.001] | [0.000] | [0.000] | [0.005] | [0.248] | [0.372] | [0.771] |
| SZ2 [p value] | [0.000] | [0.000] | NA | [0.006] | [0.597] | [0.000] | [0.111] | [0.000] | [0.000] | [0.000] | [0.007] | [0.205] | [0.761] | [0.922] |
| SZ3 [p value] | [0.004] | [0.021] | [0.006] | NA | [0.006] | [0.058] | [0.872] | [0.001] | [0.011] | [0.197] | [0.191] | [0.205] | [0.401] | [0.075] |
| SZ4 [p value] | [0.560] | [0.889] | [0.597] | [0.006] | NA | [0.860] | [0.860] | [0.232] | [0.807] | [0.951] | [0.810] | [0.244] | [0.043] | [0.001] |
| PL [p value] | [0.000] | [0.000] | [0.000] | [0.058] | [0.860] | NA | [0.056] | [0.000] | [0.002] | [0.000] | [0.000] | [0.108] | [0.296] | [0.505] |
| sRPE [p value] | [0.122] | [0.094] | [0.111] | [0.872] | [0.860] | [0.056] | NA | [0.329] | [0.468] | [0.022] | [0.126] | [0.745] | [0.188] | [0.582] |
| IndSZ [p value] | [0.000] | [0.001] | [0.000] | [0.001] | [0.232] | [0.000] | [0.329] | NA | [0.027] | [0.001] | [0.001] | [0.067] | [0.323] | [0.843] |
| PLZ1 [p value] | [0.000] | [0.000] | [0.000] | [0.011] | [0.807] | [0.002] | [0.468] | [0.027] | NA | [0.016] | [0.190] | [0.835] | [0.824] | [0.780] |
| PLZ2 [p value] | [0.000] | [0.000] | [0.000] | [0.197] | [0.951] | [0.000] | [0.022] | [0.001] | [0.016] | NA | [0.000] | [0.257] | [0.204] | [0.292] |
| PLZ3 [p value] | [0.005] | [0.005] | [0.007] | [0.191] | [0.810] | [0.000] | [0.126] | [0.001] | [0.190] | [0.000] | NA | [0.005] | [0.293] | [0.453] |
| PLZ4 [p value] | [0.195] | [0.248] | [0.205] | [0.205] | [0.244] | [0.108] | [0.745] | [0.067] | [0.835] | [0.257] | [0.005] | NA | [0.948] | [0.771] |
| Starting Fitness [p value] | [0.605] | [0.372] | [0.761] | [0.401] | [0.043] | [0.296] | [0.188] | [0.323] | [0.824] | [0.204] | [0.293] | [0.948] | NA | [0.000] |
| End Fitness [p value] | [0.937] | [0.771] | [0.922] | [0.075] | [0.001] | [0.505] | [0.582] | [0.843] | [0.780] | [0.292] | [0.453] | [0.771] | [0.000] | NA |
Abbreviations: TD = total distance (m); SZ1 = speed zone 1 (0 to 3 m·s-1; [m]); SZ2 = speed zone 2 (3.1 to 5 m·s-1; [m]); SZ3 = speed zone 3 (5.1 to 7 m·s-1; [m]); SZ4 = speed zone 1 (> 7.1 m·s-1; [m]); PL = PlayerLoad (AU); PLZ1 = PlayerLoad Zone 1 (0 to 1 AU); PLZ2 = PlayerLoad Zone 2 (1.1 to 2 AU); PLZ3 = PlayerLoad Zone 3 (2.1 to 3 AU); PLZ4 = PlayerLoad Zone 4 (> 3.1 AU); sRPE = session-rating-of-perceived-exertion; IndSZ = Individualised speed zone (> 30–15 intermittent fitness test termination speed).
Baseline multiple linear regression model with end fitness as the response variable, showing the calculated variable inflation factors (VIFs).
| Response | Predictor Variables | Coefficient | Significance | VIF | Model Metrics |
|---|---|---|---|---|---|
| End Fitness | Intercept | 1.289e+01 | 0.426 | NA | 0.562 (0.324) |
| TD | 1.322e-03 | 0.876 | 224288.3 | ||
| SZ1 | -8.480e-04 | 0.920 | 83232.3 | ||
| SZ2 | -1.172e-03 | 0.886 | 24699.2 | ||
| SZ3 | -1.529e-03 | 0.853 | 1930.9 | ||
| SZ4 | 2.432e-03 | 0.765 | 104.0 | ||
| PL | -1.206e-03 | 0.985 | 203338.7 | ||
| sRPE | -3.599e-05 | 0.823 | 3.2 | ||
| IndSZ | -2.923e-04 | 0.670 | 26.1 | ||
| PLZ1 | -3.033e-03 | 0.962 | 17170.2 | ||
| PLZ2 | -2.006e-03 | 0.975 | 65129.4 | ||
| PLZ3 | 4.597e-03 | 0.945 | 10600.5 | ||
| PLZ4 | -7.791e-03 | 0.905 | 1131.4 | ||
| Starting Fitness | 3.200e-01 | 0.691 | 18.0 |
Results of the LOVO PLSCA showing the effect on singular value inertia of omitting variables one at a time.
| PLSCA | Response variables | Predictor | No. of subjects included | No. of simulations | Measured inertia | Change in inertia compared to baseline | Significance |
|---|---|---|---|---|---|---|---|
| PLSCA Baseline | End Fitness, | TD, SZ1, SZ2, SZ3, SZ4, PL, sRPE, IndSZ, PLZ1, PLZ2, PLZ3, PLZ4 | 16 | 10000 | 25.096 | NA | 0.271 |
| Omit TD | End Fitness, | SZ1, SZ2, SZ3, SZ4, PL, sRPE, IndSZ, PLZ1, PLZ2, PLZ3, PLZ4 | 16 | 10000 | 24.785 | 0.311 | 0.240 |
| Omit SZ1 | End Fitness, | TD, SZ2, SZ3, SZ4, PL, sRPE, IndSZ, PLZ1, PLZ2, PLZ3, PLZ4 | 16 | 10000 | 24.512 | 0.584 | 0.253 |
| Omit SZ2 | End Fitness, | TD, SZ1, SZ3, SZ4, PL, sRPE, IndSZ, PLZ1, PLZ2, PLZ3, PLZ4 | 16 | 10000 | 24.967 | 0.129 | 0.232 |
| Omit SZ3 | End Fitness, | TD, SZ1, SZ2, SZ4, PL, sRPE, IndSZ, PLZ1, PLZ2, PLZ3, PLZ4 | 16 | 10000 | 23.151 | 1.945 | 0.774 |
| Omit SZ4 | End Fitness, | TD, SZ1, SZ2, SZ3, PL, sRPE, IndSZ, PLZ1, PLZ2, PLZ3, PLZ4 | 16 | 10000 | 19.170 | 5.926 | 0.547 |
| Omit PL | End Fitness, | TD, SZ1, SZ2, SZ3, SZ4, sRPE, IndSZ, PLZ1, PLZ2, PLZ3, PLZ4 | 16 | 10000 | 24.352 | 0.744 | 0.261 |
| Omit sRPE | End Fitness, | TD, SZ1, SZ2, SZ3, SZ4, PL, IndSZ, PLZ1, PLZ2, PLZ3, PLZ4 | 16 | 10000 | 23.897 | 1.200 | 0.273 |
| Omit IndSZ | End Fitness, | TD, SZ1, SZ2, SZ3, SZ4, PL, sRPE, PLZ1, PLZ2, PLZ3, PLZ4 | 16 | 10000 | 24.308 | 0.788 | 0.260 |
| Omit PLZ1 | End Fitness, | TD, SZ1, SZ2, SZ3, SZ4, PL, sRPE, IndSZ, PLZ2, PLZ3, PLZ4, Starting Fitness | 16 | 10000 | 24.913 | 0.183 | 0.233 |
| Omit PLZ2 | End Fitness, | TD, SZ1, SZ2, SZ3, SZ4, PL, sRPE, IndSZ, PLZ1, PLZ3, PLZ4 | 16 | 10000 | 23.906 | 1.190 | 0.278 |
| Omit PLZ3 | End Fitness, | TD, SZ1, SZ2, SZ3, SZ4, PL, sRPE, IndSZ, PLZ1, PLZ2, PLZ4 | 16 | 10000 | 24.325 | 0.771 | 0.258 |
| Omit PLZ4 | End Fitness, | TD, SZ1, SZ2, SZ3, SZ4, PL, sRPE, IndSZ, PLZ1, PLZ2, PLZ3 | 16 | 10000 | 24.996 | 0.100 | 0.224 |
Abbreviations: TD = total distance (m); SZ1 = speed zone 1 (0 to 3 m·s-1; [m]); SZ2 = speed zone 2 (3.1 to 5 m·s-1; [m]); SZ3 = speed zone 3 (5.1 to 7 m·s-1; [m]); SZ4 = speed zone 1 (> 7.1 m·s-1; [m]); PL = PlayerLoad (AU); PLZ1 = PlayerLoad Zone 1 (0 to 1 AU); PLZ2 = PlayerLoad Zone 2 (1.1 to 2 AU); PLZ3 = PlayerLoad Zone 3 (2.1 to 3 AU); PLZ4 = PlayerLoad Zone 4 (> 3.1 AU); sRPE = session-rating-of-perceived-exertion; IndSZ = Individualised speed zone (> 30–15 intermittent fitness test termination speed).
Fig 2Variable importance plot showing the decrease in singular value inertia attributable to each predictor variable.
Results of the PLSCA using refined models.
| PLSCA | Response variables | Predictor | No. of subjects included | No. of simulations | Measure inertia | Significance | Chi-square |
|---|---|---|---|---|---|---|---|
| PLSCA Baseline | End Fitness, | TD, SZ1, SZ2, SZ3, SZ4, PL, sRPE, IndSZ, PLZ1, PLZ2, PLZ3, PLZ4 | 16 | 10000 | 25.096 | 0.271 | NA |
| PLSCA Model 1 | End Fitness, | SZ4 | 16 | 10000 | 13.459 | 0.007 | 709.5 |
| PLSCA Model 2 | End Fitness, | SZ3, SZ4 | 16 | 10000 | 16.419 | 0.015 | 2030.8 |
Abbreviations: TD = total distance (m); SZ1 = speed zone 1 (0 to 3 m·s-1; [m]); SZ2 = speed zone 2 (3.1 to 5 m·s-1; [m]); SZ3 = speed zone 3 (5.1 to 7 m·s-1; [m]); SZ4 = speed zone 1 (> 7.1 m·s-1; [m]); PL = PlayerLoad (AU); PLZ1 = PlayerLoad Zone 1 (0 to 1 AU); PLZ2 = PlayerLoad Zone 2 (1.1 to 2 AU); PLZ3 = PlayerLoad Zone 3 (2.1 to 3 AU); PLZ4 = PlayerLoad Zone 4 (> 3.1 AU); sRPE = session-rating-of-perceived-exertion; IndSZ = Individualised speed zone (> 30–15 intermittent fitness test termination speed).
* p values less than 0.05 considered significant for one-tailed test
Results of refined MLR models with respective variable inflation factors.
| MLR model | Response | Predictor Variables | Coefficient | Standard | Significance | VIF | Model | Model p value | AIC |
|---|---|---|---|---|---|---|---|---|---|
| MLR Model 1 | End Fitness | Intercept | 8.555 (3.172–13.939) | NA | 0.004 | NA | 0.733 | <0.001 | 28.3 |
| SZ4 | 0.002 (0.000–0.003) | 2.477e-07 | 0.011 | 1.35 | |||||
| Starting Fitness | 0.528 (0.196–0.850) | 1.606e-03 | 0.004 | 1.35 | |||||
| MLR Model 2 | End Fitness | Intercept | 8.303 (2.453–14.153) | NA | 0.009 | NA | 0.714 | <0.001 | 30.1 |
| SZ3 | 6.082e-05 (0.000–4.495e-04) | 5.879e-08 | 0.739 | 1.78 | |||||
| SZ4 | 1.675e-03 (0.000–3.530e-03) | 6.820e-06 | 0.073 | 2.29 | |||||
| Starting Fitness | 0.537 (0.207–0.877) | 0.515 | 0.005 | 1.39 |