| Literature DB >> 33345141 |
Maria Henriquez1, Jacob Sumner2, Mallory Faherty3, Timothy Sell3, Brinnae Bent4.
Abstract
Injury rates in student athletes are high and often unpredictable. Injury risk factors are not agreed upon and often not validated. Here, we present a random-forest machine learning methodology for identifying the most significant injury risk factors and develop a model of lower extremity musculoskeletal injury risk in student athletes with physical performance metrics spanning joint strength measured with force transducers, postural stability measured using a force plate, and flexibility, measured with a goniometer, combined with previous injury metrics and athlete demographics. We tested our model in a population of 122 student athletes with performance metrics for the lower extremity musculoskeletal system and achieved an injury risk accuracy of 79% and identified significant injury risk factors, that could be used to increase accuracy of injury risk assessments, implement timely interventions, and decrease the number of career-ending or chronic injuries among student athletes.Entities:
Keywords: injury risk; machine learning; random forest; sports science; student athlete
Year: 2020 PMID: 33345141 PMCID: PMC7739722 DOI: 10.3389/fspor.2020.576655
Source DB: PubMed Journal: Front Sports Act Living ISSN: 2624-9367
Demographics.
| Women's Basketball | 12 | 9.38% |
| Men's Basketball | 11 | 9.02% |
| Women's Soccer | 23 | 18.85% |
| Men's Soccer | 23 | 18.85% |
| Women's Volleyball | 16 | 13.11% |
| Men's Football | 37 | 30.33% |
| Male | 71 | 58.20% |
| Female | 51 | 41.80% |
| 18–20 | 66 | 54.10% |
| 20–22 | 42 | 34.40% |
| 22+ | 14 | 11.50% |
Student athlete primary collegiate sport, self-identified gender, and age range of participants in the dataset.
Figure 1
Figure 2Random Forest ROC. ROC of test/train validated model. Black line shows results of the model presented in this study. Random chance is represented with the blue dashed line.
Random Forest Variable Importance.
| Hip adductor strength | Strength | 5.3465738 |
| Hip external rotation strength | Strength | 4.3168850 |
| Straight leg raise | Flexibility | 4.1723271 |
| Height | Demographic | 3.6593198 |
| Hip abductor strength | Strength | 3.6520573 |
| Hip internal rotation strength | Strength | 3.4898969 |
| Eyes open balance test composite score | Balance | 3.2547060 |
| Ankle dorsiflexion strength | Strength | 2.9879077 |
| Ankle plantarflexion strength | Strength | 2.8693613 |
| Primary sport type | Demographic | 2.7218000 |
| Knee flexion strength | Strength | 2.3598642 |
| Eyes closed balance test composite score | Balance | 2.1693963 |
| Active knee extension | Flexibility | 1.7146625 |
| Ankle eversion strength | Strength | 1.6240054 |
| Ankle inversion strength | Strength | 0.8680607 |
Variable Importance represented by Mean Decrease Accuracy for 15 variables used in tuned model.
Figure 3Relative Variable Importance by Feature Type. Importance of feature types in RF model using relative Mean Decrease Accuracy as the metric.