David L Carey1,2, Kay M Crossley1, Rod Whiteley3, Andrea Mosler1,3, Kok-Leong Ong4, Justin Crow2, Meg E Morris1,5. 1. La Trobe Sport and Exercise Medicine Research Centre, College of Science, Health and Engineering, La Trobe University, Melbourne, AUSTRALIA. 2. Essendon Football Club, Melbourne, AUSTRALIA. 3. Rehabilitation Department, Aspetar Orthopedic and Sports Medicine Hospital, Doha, QATAR. 4. Research Centre for Data Analytics and Cognition, La Trobe University, Melbourne, AUSTRALIA. 5. Healthscope, Northpark Private Hospital, Melbourne, AUSTRALIA.
Abstract
PURPOSE: To evaluate common modeling strategies in training load and injury risk research when modeling continuous variables and interpreting continuous risk estimates; and present improved modeling strategies. METHOD: Workload data were pooled from Australian football (n = 2550) and soccer (n = 23,742) populations to create a representative sample of acute:chronic workload ratio observations for team sports. Injuries were simulated in the data using three predefined risk profiles (U-shaped, flat and S-shaped). One-hundred data sets were simulated with sample sizes of 1000 and 5000 observations. Discrete modeling methods were compared with continuous methods (spline regression and fractional polynomials) for their ability to fit the defined risk profiles. Models were evaluated using measures of discrimination (area under receiver operator characteristic [ROC] curve) and calibration (Brier score, logarithmic scoring). RESULTS: Discrete models were inferior to continuous methods for fitting the true injury risk profiles in the data. Discrete methods had higher false discovery rates (16%-21%) than continuous methods (3%-7%). Evaluating models using the area under the ROC curve incorrectly identified discrete models as superior in over 30% of simulations. Brier and logarithmic scoring was more suited to assessing model performance with less than 6% discrete model selection rate. CONCLUSIONS: Many studies on the relationship between training loads and injury that have used regression modeling have significant limitations due to improper discretization of continuous variables and risk estimates. Continuous methods are more suited to modeling the relationship between training load and injury. Comparing injury risk models using ROC curves can lead to inferior model selection. Measures of calibration are more informative judging the utility of injury risk models.
PURPOSE: To evaluate common modeling strategies in training load and injury risk research when modeling continuous variables and interpreting continuous risk estimates; and present improved modeling strategies. METHOD: Workload data were pooled from Australian football (n = 2550) and soccer (n = 23,742) populations to create a representative sample of acute:chronic workload ratio observations for team sports. Injuries were simulated in the data using three predefined risk profiles (U-shaped, flat and S-shaped). One-hundred data sets were simulated with sample sizes of 1000 and 5000 observations. Discrete modeling methods were compared with continuous methods (spline regression and fractional polynomials) for their ability to fit the defined risk profiles. Models were evaluated using measures of discrimination (area under receiver operator characteristic [ROC] curve) and calibration (Brier score, logarithmic scoring). RESULTS: Discrete models were inferior to continuous methods for fitting the true injury risk profiles in the data. Discrete methods had higher false discovery rates (16%-21%) than continuous methods (3%-7%). Evaluating models using the area under the ROC curve incorrectly identified discrete models as superior in over 30% of simulations. Brier and logarithmic scoring was more suited to assessing model performance with less than 6% discrete model selection rate. CONCLUSIONS: Many studies on the relationship between training loads and injury that have used regression modeling have significant limitations due to improper discretization of continuous variables and risk estimates. Continuous methods are more suited to modeling the relationship between training load and injury. Comparing injury risk models using ROC curves can lead to inferior model selection. Measures of calibration are more informative judging the utility of injury risk models.
Authors: Lena Kristin Bache-Mathiesen; Thor Einar Andersen; Torstein Dalen-Lorentsen; Benjamin Clarsen; Morten Wang Fagerland Journal: BMJ Open Sport Exerc Med Date: 2021-08-06
Authors: Harry Fisher; Marianne Jr Gittoes; Lynne Evans; C Leah Bitchell; Richard J Mullen; Marco Scutari Journal: Front Sports Act Living Date: 2020-12-14
Authors: Johann Windt; Clare L Ardern; Tim J Gabbett; Karim M Khan; Chad E Cook; Ben C Sporer; Bruno D Zumbo Journal: BMJ Open Date: 2018-10-02 Impact factor: 2.692
Authors: Issa Moussa; Arthur Leroy; Guillaume Sauliere; Julien Schipman; Jean-François Toussaint; Adrien Sedeaud Journal: BMJ Open Sport Exerc Med Date: 2019-10-30