| Literature DB >> 34671691 |
Kyle N Kunze1, Evan M Polce2, Anil S Ranawat1, Per-Henrik Randsborg1, Riley J Williams1, Answorth A Allen1, Benedict U Nwachukwu1, Andrew Pearle1,2,3,4, Beth S Stein1,2,3,4, David Dines1,2,3,4, Anne Kelly1,2,3,4, Bryan Kelly1,2,3,4, Howard Rose1,2,3,4, Michael Maynard1,2,3,4, Sabrina Strickland1,2,3,4, Struan Coleman1,2,3,4, Jo Hannafin1,2,3,4, John MacGillivray1,2,3,4, Robert Marx1,2,3,4, Russell Warren1,2,3,4, Scott Rodeo1,2,3,4, Stephen Fealy1,2,3,4, Stephen O'Brien1,2,3,4, Thomas Wickiewicz1,2,3,4, Joshua S Dines1,2,3,4, Frank Cordasco1,2,3,4, David Altcheck1,2,3,4.
Abstract
BACKGROUND: Understanding specific risk profiles for each patient and their propensity to experience clinically meaningful improvement after anterior cruciate ligament reconstruction (ACLR) is important for preoperative patient counseling and management of expectations.Entities:
Keywords: IKDC; MCID; anterior cruciate ligament; artificial intelligence; clinically meaningful; reconstruction; machine learning
Year: 2021 PMID: 34671691 PMCID: PMC8521431 DOI: 10.1177/23259671211046575
Source DB: PubMed Journal: Orthop J Sports Med ISSN: 2325-9671
Baseline Characteristic and Injury Information for Patients Included in the Final Analysis (N = 442)
| Characteristic | Median (IQR) or No. (%) | Missing Data, % | Characteristic | Median (IQR) or No. (%) | Missing Data, % |
|---|---|---|---|---|---|
| Age, y | 29.0 (21.0-40.3) | 0 | Anterior drawer endpoint | 5.4 | |
| Body mass index | 24.2 (21.9-26.6) | 11.8 | A | 19 (4.6) | |
| Male sex | 231 (52.3) | 0 | B | 397 (95.4) | |
| White race | 360 (81.4) | 0 | Posterior sag | 6.6 | |
| Smoking status | 10.2 | Normal | 407 (98.5) | ||
| Never | 320 (80.6) | Flush | 3 (0.73) | ||
| Quit >6 mo preoperatively | 39 (9.8) | Back | 3 (0.73) | ||
| Quit <6 mo preoperatively | 14 (3.5) | Posterior drawer endpoint | 13.8 | ||
| Current | 24 (6.0) | A | 351 (92.6) | ||
| Diabetes mellitus | 1 (0.25) | 9.7 | B | 28 (7.4) | |
| Sports participation | 343 (83.7) | 7.2 | MCL examination | 6.6 | |
| Contact injury mechanism | 112 (28.6) | 11.5 | Stable | 388 (93.0) | |
| Previous ipsilateral knee surgery | 25 (6.3) | 0 | Loose | 29 (7.0) | |
| Previous contralateral knee surgery | 52 (13.2) | 6.1 | MCL examination: extension to 30° | 6.3 | |
| Graft source | 0 | Grade 0 | 374 (90.3) | ||
| Autograft | 316 (71.5) | Grade 1 | 29 (7.0) | ||
| Allograft | 126 (28.5) | Grade 2 | 6 (1.4) | ||
| Graft configuration | 0 | Grade 3 | 5 (1.2) | ||
| Single bundle | 426 (96.4) | LCL examination | 6.1 | ||
| Double bundle | 16 (3.6) | Stable | 410 (98.8) | ||
| Graft type | 0 | Loose | 5 (1.2) | ||
| Bone-patellar tendon-bone | 214 (48.4) | LCL examination: extension to 30° | 5.9 | ||
| Hamstring: semitendinosus | 38 (8.6) | Grade 0 | 403 (96.9) | ||
| Hamstring: S+T | 72 (16.3) | Grade 1 | 8 (1.9) | ||
| Quadriceps-bone | 5 (1.1) | Grade 2 | 3 (0.72) | ||
| Iliotibial band | 0 (0.0) | Grade 3 | 2 (0.48) | ||
| Achilles tendon | 99 (22.4) | Pivot shift | 9.0 | ||
| Tibialis anterior | 13 (2.9) | 0 | 10 (2.5) | ||
| Tibialis posterior | 1 (0.23) | 1+ | 100 (24.9) | ||
| Femoral tunnel drilling | 4.5 | 2+ | 284 (70.6) | ||
| Transtibial | 85 (20.2) | 3+ | 8 (2.0) | ||
| Anteromedial | 327 (77.9) | Reverse pivot shift | 9.0 | ||
| Outside-in | 3 (0.71) | 0 | 392 (97.5) | ||
| Retro-drill | 5 (1.2) | 1+ | 5 (1.2) | ||
| Tibial tunnel drilling | 6.3 | 2+ | 4 (1.0) | ||
| Outside-in | 402 (97.1) | 3+ | 1 (0.25) | ||
| Retro-drill | 12 (2.9) | SSD: external rotation at 30° | 5.7 | ||
| Femoral/tibial fixation | 5.7 | Grade 0 (<5°) | 409 (98.1) | ||
| Intratunnel | 310 (74.0) | Grade 1 (5°-10°) | 6 (1.4) | ||
| Suspensory | 109 (26.0) | Grade 2 (>10°) | 2 (0.48) | ||
| Effusion on examination | 89 (21.9) | 7.9 | SSD: External rotation at 90° | 5.7 | |
| Preoperative ROM: extension | 5.7 | Grade 0 (<5°) | 406 (97.4) | ||
| Recurvatum | 17 (4.1) | Grade 1 (5°-10°) | 7 (1.7) | ||
| Neutral | 379 (90.9) | Grade 2 (>10°) | 4 (0.96) | ||
| Extension loss | 21 (5.0) | SSD: Internal rotation at 30° | 5.7 | ||
| Preoperative ROM: flexion | 5.9 | Grade 0 (<5°) | 408 (97.8) | ||
| Symmetric to contralateral side | 387 (93.0) | Grade 1 (5°-10°) | 6 (1.4) | ||
| Flexion loss | 29 (7.0) | Grade 2 (>10°) | 3 (0.72) | ||
| Lachman grade | 5.7 | SSD: Internal rotation at 90° | 5.7 | ||
| 0 | 1 (0.24) | Grade 0 (<5°) | 406 (97.4) | ||
| 1 | 12 (2.9) | Grade 1 (5°-10°) | 6 (1.4) | ||
| 2 | 400 (96.2) | Grade 2 (>10°) | 5 (1.2) | ||
| 3 | 3 (0.72) | Preoperative Lysholm score | 64.0 (51.0-76.0) | 2.5 | |
| Preoperative IKDC score | 50.6 (39.4-61.8) | 0 | |||
| Preoperative Tegner score | 2.0 (1.0-3.0) | 0.45 |
IKDC, International Knee Documentation Committee; IQR, interquartile range; LCL, lateral collateral ligament; MCL, medial collateral ligament; ROM, range of motion; SSD, side-to-side difference; S+T, semitendinosus + gracilis.
At 2-year follow-up, 39 (8.8%) patients did not achieve the MCID for the IKDC score.
Figure 1.Machine learning algorithm development methodology. ACL, anterior cruciate ligament; MCID, minimal clinically important difference.
Performance Metric Interpretation Guide
| Metric | Description |
|---|---|
| Discrimination | Assessed through performing ROC analyses and quantifying the AUC (also referred to as the concordance statistic [C-statistic]). The C-statistic is described as the probability that the machine learning model will assign a greater predicted probability to a randomly selected positive case (patient who achieved the MCID) relative to a randomly selected negative case (false-positive case, ie, a patient who did not achieve the MCID). |
| Calibration | Assesses the agreement between predictions made by the machine learning models and the true observed outcomes. A calibration slope of 1 and calibration intercept of 0 are indicative of perfect prediction by the model. Performance is assessed through quantifying the calibration slope (precision of predictions) and calibration intercept (tendency for model to overestimate or underestimate the observed outcome). |
| Brier score | A proper scoring function that assesses overall performance and is an extension of calibration and discrimination. The Brier score for each model is equal to the mean squared difference between the true observed outcomes and the model prediction probabilities as a benchmark to quantitatively ensure that the machine learning models are providing valuable predictions and not demonstrating class imbalance; the null model Brier score (Brier score where the predicted probabilities of the null model are equal to the outcome prevalence of the entire study cohort) is calculated. The Brier score of each machine learning model is subsequently compared with this value. In general, lower Brier scores indicate that predictions are better calibrated (with zero being perfect performance and calibration), and Brier scores lower than the null model score indicate model usefulness. |
| Decision-curve analysis | An analysis that provides insight into potential clinical utility of making changes in patient management based off of the machine learning model and alternative scenarios by comparing the predicted net benefit of using the model at varying risk thresholds. Decision-curve analysis specifically compares changes in management based off of the model, the best-performing predictive variable alone, changes for all patients, and changes for no patients. As the risk threshold probability increases, the cost to benefit ratio (and consequently the weight attributed to false-positive classifications made by the model) increases. |
| Local interpretable model-agnostic explanations | LIME samples local input variable distributions using a predefined number of permutations and assesses the effect of specific ranges of values for each predictor feature on the primary outcome. The importance of each feature is computed and carried forward based on similarities between the features and the model predictions. LIME then explains model fit (here, how well this local example represents both the global model behavior and its plausibility) and provides a visual explanation of how each feature contributes to the overall predictions, demonstrating how each variable on a case-by-case basis either supports (increases the probability of achieving the MCID) or contradicts (decreases the probability of achieving the MCID) the prediction. A ridge regression model with the Gower distance function and a kernel width of 1.25 was used to optimize LIME in the current study. |
AUC, area under the curve; LIME, local interpretable model-agnostic explanations; MCID, minimal clinically important difference; ROC, receiver operating characteristic.
Algorithm Performance in Independent Testing Set (n = 131)
| Metric | Stochastic Gradient Boosting | Random Forest | Support Vector Machine | Adaptive Gradient Boosting | Neural Network | Elastic-Net Penalized Logistic Regression |
|---|---|---|---|---|---|---|
| C-statistic | 0.70 | 0.78 | 0.79 | 0.79 | 0.81 | 0.82 |
| Calibration intercept | 0.02 | 0.21 | 0.19 | 0.17 | 0.18 | 0.10 |
| Calibration slope | 0.49 | 0.63 | 5.05 | 0.49 | 1.74 | 1.15 |
| Brier score | 0.080 | 0.083 | 0.075 | 0.073 | 0.069 | 0.068 |
Data in parentheses are 95% CIs.
Null model Brier score = 0.077.
Figure 2.(A) Global variable importance plot and (B) discrimination performance from the elastic-net penalized logistic regression model on the independent testing set. Each predictive weight of each variable is compared among the other 7 variables chosen from recursive feature elimination. The global variable importance plot represents the predictive value of each variable in descending order, with variables having lower predictive value as one moves down the y-axis. This plot indicates that a history of contralateral knee surgery is the most important predictor of achieving the minimal clinically important difference, whereas the importance of the preoperative Lysholm score is negligible. bmi, body mass index; contknee, history of contralateral knee surgery; ext, preoperative knee extension; femfix, femoral tunnel fixation method; FPR, false-positive rate; IKDC, International Knee Documentation Committee; mclexext, medial collateral ligament examination from extension to 30°; ROC, receiver operating characteristic; TPR, true-positive rate.
Figure 3.Calibration plot for the elastic net penalized logistic regression (ENPLR) model on the independent testing set of patients. The y-axis displays the true observed proportion of those who achieved the minimal clinically important difference, while the x-axis displays the corresponding predictions made by the ENPLR model. The shaded area indicates the 95% CI of the predicted probabilities. The red line represents perfect prediction.
Figure 4 .Decision-curve analysis for the elastic-net penalized logistic regression (ENPLR) model on the independent testing set of patients. The y-axis shows the standardized net benefit of changing management based off of the model (ENPLR), the best-performing variable (BPV; history of contralateral knee surgery), for all patients, and for no patients. The x-axis demonstrates risk thresholds for not achieving the minimal clinically important difference (MCID) as a percentage, as well as the cost to benefit ratio (ratio of false-positive outcomes to true-positive outcomes). (A) View of decision-curve for wide range of risk thresholds. (B) View of decision curves for higher-risk thresholds. When risk is very high (80% likelihood of not achieving MCID), management changes based off of the ENPLR model give greater net benefit (higher likelihood of achieving the MCID) than changing management based on the other decisions.
Figure 5.Demonstration of the clinical effect that application of the clinical decision-making tool derived from the elastic-net penalized logistic regression model can have if applied during the preoperative period. The red bars indicate features that support the probability of achieving the minimal clinically important difference (MCID), and the blue bars indicate features that put the patient at risk of not achieving the MCID. (A) Case 1: A 30-year-old patient with an anterior cruciate ligament tear and body mass index (BMI) of 31 is evaluated at the clinic. The patient has a relatively high level of function (International Knee Documentation Committee [IKDC] score, 75; Lysholm, 80). The patient has never had a contralateral knee surgery. On examination, the patient demonstrates a grade 0 medial collateral ligament examination and has an extension loss; the decision is made to operate using an intratunnel femoral fixation technique. Given this decision, at 2 years postoperatively, there is a 25% chance the patient will not achieve a clinically meaningful improvement in symptoms and function. (B) Case 2: Instead of pursuing surgery, the patient is recommended to first optimize his current health state. The patient is able to decrease BMI into the normal category (BMI, 27) and obtain neutral extension on examination via physical therapy. By using the current algorithm to optimize his health state based off of their specific risk factors, this patient improved the probability of achieving a clinically meaningful improvement in symptoms and function to 95% at 2 years postoperatively. bmi, body mass index; contknee, history of contralateral knee surgery; ext, preoperative knee extension; femfix, femoral tunnel fixation method; mclexext, medial collateral ligament examination from extension to 30°.