| Literature DB >> 31354507 |
Joshua D Ruddy1, Stuart J Cormack1, Rod Whiteley2, Morgan D Williams3, Ryan G Timmins1, David A Opar1.
Abstract
Injuries are a common occurrence in team sports and can have significant financial, physical and psychological consequences for athletes and their sporting organizations. As such, an abundance of research has attempted to identify factors associated with the risk of injury, which is important when developing injury prevention and risk mitigation strategies. There are a number of methods that can be used to identify injury risk factors. However, difficulty in understanding the nuances between different statistical approaches can lead to incorrect inferences and decisions being made from data. Accordingly, this narrative review aims to (1) outline commonly implemented methods for determining injury risk, (2) highlight the differences between association and prediction as it relates to injury and (3) describe advances in statistical modeling and the current evidence relating to predicting injuries in sport. Based on the points that are discussed throughout this narrative review, both researchers and practitioners alike need to carefully consider the different types of variables that are examined in relation to injury risk and how the analyses pertaining to these different variables are interpreted. There are a number of other important considerations when modeling the risk of injury, such as the method of data transformation, model validation and performance assessment. With these technical considerations in mind, researchers and practitioners should consider shifting their perspective of injury etiology from one of reductionism to one of complexity. Concurrently, research implementing reductionist approaches should be used to inform and implement complex approaches to identifying injury risk. However, the ability to capture large injury numbers is a current limitation of sports injury research and there has been a call to make data available to researchers, so that analyses and results can be replicated and verified. Collaborative efforts such as this will help prevent incorrect inferences being made from spurious data and will assist in developing interventions that are underpinned by sound scientific rationale. Such efforts will be a step in the right direction of improving the ability to identify injury risk, which in turn will help improve risk mitigation and ultimately the prevention of injuries.Entities:
Keywords: association; injury; prediction; prevention; sport
Year: 2019 PMID: 31354507 PMCID: PMC6629941 DOI: 10.3389/fphys.2019.00829
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.566
FIGURE 1A contingency table which can be used to express the outcomes of binary classification.
FIGURE 2A contingency table expressing the outcomes of a mock dataset. The frequency distribution of athletes that have or have not sustained a previous injury is displayed against the frequency distribution of athletes that did or did not sustain a prospective injury.
A summary of the steps involved in calculating the post-test probability of an injury occurring given a history of injury.
| 1. Pre-test | Odds (as a decimal) | 0.25 | The decimal odds of sustaining a future injury for all athletes, prior to accounting for previous injury. This can also be calculated using the pre-test probability (see section “Pre-test and Post-test Probabilities”). | |
| Odds (as a ratio) | 1:4 | As above, calculated as a fraction | The likelihood of a future injury occurring (1) compared to the likelihood of a future injury not occurring (4) for all athletes. | |
| Probability | 20% | The percentage of athletes likely to sustain a future injury (prior to accounting for previous injury). | ||
| Explanation | 2 in 10 chance | − | This can simplified to a 1 in 5 chance. | |
| 2. Likelihood ratio | Positive likelihood ratio | 6 | The magnitude by which having a previous injury increases the odds of sustaining a future injury. This is calculated using sensitivity and specificity (see section “Sensitivity and Specificity”). | |
| 3. Post-test | Odds (as a decimal) | 1.5 | The decimal odds of athletes with a previous injury sustaining a future injury. | |
| Odds (as a ratio) | 6:4 | As above, calculated as a fraction | The likelihood of a future injury occurring (6) compared to the likelihood of a future injury not occurring (4) for athletes with a previous injury. | |
| Probability | 60% | The percentage of previously injured athletes likely to sustain a future injury. This is calculated using the post-test odds. | ||
| Explanation | 6 in 10 chance | − | This can be simplified to a 3 in 5 chance. |
FIGURE 3An example of a receiver operating characteristic curve, which can be used to illustrate how well a continuous variable performs as a binary classifier. The true positive rate (sensitivity) is plotted against the false positive rate (1 – specificity) at every conceivable cut point for a continuous variable. The gray shaded area indicates the area under the curve.
FIGURE 4A complex systems approach for modeling the risk of injury, adapted from Bittencourt et al. (2016). The web of determinants represents the individual risk factors as a collective, with the size of the circle indicating that variable’s level of influence. These variables interact with each other to differing degrees to result in a risk profile. An athlete is then exposed to the risk of injury during training/competition. During an athlete’s exposure to the risk of injury, an inciting event may occur and this can result in an injury. The other outcome is no injury. How much an athlete trains or competes and what they do during these sessions (i.e., their level of exposure) will feed back into the web and revise their subsequent and resulting risk profile. The outcome (injury or no injury), will also influence their future risk profiles.
FIGURE 5A typical supervised learning modeling approach. A dataset with a known outcome variable (i.e., injured or uninjured), referred to as training data, is used to identify patterns and predict the withheld outcome variable of an independent dataset, referred to as testing data. The performance of the model can then be assessed by comparing the predicted outcomes against the withheld outcome variable of the testing data.
FIGURE 6A contingency table expressing the outcomes of a mock dataset. The frequency distribution of athletes predicted as sustaining an injury and athletes predicted as remaining uninjured is displayed against the frequency distribution of athletes that were actually injured and uninjured.