| Literature DB >> 35246270 |
Lok Sze Lee1, Ping Keung Chan2, Chunyi Wen3, Wing Chiu Fung1, Amy Cheung4, Vincent Wai Kwan Chan4, Man Hong Cheung1, Henry Fu1, Chun Hoi Yan5, Kwong Yuen Chiu1.
Abstract
BACKGROUND: Artificial intelligence is an emerging technology with rapid growth and increasing applications in orthopaedics. This study aimed to summarize the existing evidence and recent developments of artificial intelligence in diagnosing knee osteoarthritis and predicting outcomes of total knee arthroplasty.Entities:
Keywords: Arthroplasty; Artificial intelligence; Machine learning; Osteoarthritis; Replacement; Total knee arthroplasty
Year: 2022 PMID: 35246270 PMCID: PMC8897859 DOI: 10.1186/s42836-022-00118-7
Source DB: PubMed Journal: Arthroplasty ISSN: 2524-7948
Studies included in the scoping review
| Area | First author | Year | Journal | Reference no. |
|---|---|---|---|---|
| OA diagnosis and TKA need | El-Galaly, A. | 2020 | Clinical Orthopaedics and Related Research | [ |
| Heisinger, S. | 2020 | Journal of Clinical Medicine | [ | |
| Jafarzadeh, S. | 2020 | Osteoarthritis Cartilage | [ | |
| Leung, K. | 2020 | Radiology | [ | |
| Tolpadi, A.A. | 2020 | Scientific Reports | [ | |
| Yi, P. H. | 2020 | Knee | [ | |
| Norman, B. | 2019 | Journal of Digital Imaging | [ | |
| Tiulpin, A. | 2018 | Scientific reports | [ | |
| Postoperative outcomes | Harris, A. | 2021 | The Journal of Arthroplasty | [ |
| Bonakdari, H. | 2020 | Computer Methods and Programs in Biomedicine | [ | |
| Farooq, H. | 2020 | The Journal of Arthroplasty | [ | |
| Hyer, J.M. | 2020 | Journal of the American College of Surgeons | [ | |
| Ko, S. | 2020 | Knee Surgery, Sports Traumatology, Arthroscopy | [ | |
| Kunze, K. | 2020 | The Journal of Arthroplasty | [ | |
| Fontana, M. A. | 2019 | Clinical Orthopaedics and Related Research | [ | |
| Harris, A. | 2019 | Clinical Orthopaedics and Related Research | [ | |
| Huber, M. | 2019 | BMC Medical Informatics and Decision Making | [ | |
| Lee, H. K. | 2019 | IEEE Journal of Biomedical and Health Informatics | [ | |
| Aram, P. | 2018 | American Journal of Epidemiology | [ | |
| Huang, Z. | 2018 | Transfusion | [ | |
| Kluge, F. | 2018 | Gait Posture | [ | |
| Van Onsem, S. | 2016 | The Journal of Arthroplasty | [ |
A summary of reviewed studies on knee osteoarthritis diagnosis and knee arthroplasty prediction
| Author (Year) | Journal | Prediction outcome | AI/ML algorithm(s) | Statistical performance | Strengths | Weaknesses | Clinical significance of study |
|---|---|---|---|---|---|---|---|
| Norman (2019) [ | Journal of Digital Imaging | OA severity (KL grade) | DenseNet neural network architectures | Sensitivity & specificity: 84% & 86% (KL grades 0–1), 70% & 84% (KL grade 2), 69% & 97% (KL grade 3), 86% & 99% (KL grade 4). | Comparable sensitivity and specificity to manual KL grading and previous automatic systems employing different AI/ML algorithms | Training, validation and testing sets were selected from the same dataset. Misclassifications of KL grading typically occurred when there was hardware in the knee. | Provides additional data supporting the potential of AI in automatic assessment of OA radiological severity. |
| Tiulpin (2018) [ | Scientific reports | OA severity (KL grade) | Deep Siamese CNN architecture | Average multi-class accuracy: 66.71%. AUC: 0.93. Kappa coefficient (agreement with expert annotations on test dataset): 0.83 (excellent). MSE value: 0.48. | Different datasets used for initial training and testing | Validation and testing sets were selected from the same dataset. | The provision of probability distributions for each KL grade prediction may assist clinicians in choosing KL grade in ambiguous cases. |
| Heisinger (2020) [ | Journal of Clinical Medicine | Need for TKA | Artificial neural networks (ANNs) with linear, radial basis function and three-layer perceptron neural networks architectures | Total percentage of correctly predicted knees: 80%. Positive predictive value: 84%. Negative predictive value: 73%. Sensitivity: 41%. Specificity 30%. | First study to consider longitudinal change in symptomology (pain, function, quality of life) and radiographic structural change in a 4-year period prior to TKA | Training and testing sets were selected from the same dataset. | Future externally validated algorithms that can predict TKA need in advance using routinely available patient data could be highly useful for decisions for referral and triage in a primary care setting. |
| Leung (2020) [ | Radiology | Need for TKA | Multitask deep learning model (ResNet34) trained with transfer learning | AUC: 0.87. Sensitivity: 83%. Specificity: 77%. | First study to directly predict TKA from knee radiographs using deep learning model | Limited data size (radiographs from 728 individuals in total) / Training and testing sets were selected from the same dataset. | TKA prediction models solely based on radiological data have limited clinical utility, although they may serve as a reference for future ML studies. |
| El-Galaly (2020) [ | Clinical Orthopaedics and Related Research | Need for early revision TKA | LASSO regression, random forest classifier, gradient boosting model, neural network | AUCs: 0.57–0.60. | First study to predict early revision TKA (≤ 2 years of primary TKA) using preoperative patient data from arthroplasty registries / Temporal external validation was conducted (testing set selected from a separate hold-out year not included in training set). | Training and testing sets were selected from the same dataset. | Results from this study suggest that future models predicting early revision TKA may benefit from including more pre-operative information or predicting revision over a longer follow-up duration. |
A summary of reviewed studies on predicting postoperative outcomes of total knee arthroplasty
| Author (Year) | Journal | Prediction outcome | AI/ML algorithm(s) | Statistical performance | Strengths | Weaknesses | Clinical significance of study |
|---|---|---|---|---|---|---|---|
| Huber (2019) [ | BMC Medical Informatics and Decision Making | Postoperative improvement in PROMs | Extreme gradient boosting, multi-step adaptive elastic-net, random forest, neural net, Naïve Bayes, k-Nearest Neighbors | AUCs: 0.86 (VAS) & 0.70 (Q score). | Comparison of a wide variety of ML approaches in addition to regression methods | Training and testing sets were selected from the same dataset. | Identified important predictors for postoperative PROMs (e.g., preoperative VAS). |
| Harris (2021) [ | The Journal of Arthroplasty | Postoperative 1-year achievement of MCID | LASSO regression, GBM, quadratic discriminant analysis | AUC: 0.76 (ADL), 0.72 (pain), 0.72 (symptoms), 0.71 (quality of life). | Provided sensitivity and specificity of various thresholds of predicted probability of failure to achieve MCID 1 year post-TKA. | Training and testing sets were selected from the same dataset. | Demonstrated potential for AI to predict patients most likely to benefit from TKA. |
| Kunze (2020) [ | The Journal of Arthroplasty | Postoperative patient dissatisfaction | Stochastic gradient boosting, random forest, support vector machine, neural network, elastic-net penalized logistic regression | AUC: 0.66–0.79. | All five machine learning algorithms demonstrated superior predictive performance than the standard logistic regression model. Algorithm-identified predictors of postoperative patient dissatisfaction are consistent with previous systematic reviews. | Training and testing sets were selected from the same dataset. | Demonstrated potential for AI to predict patients most likely to experience postoperative dissatisfaction. |
| Farooq (2020) [ | The Journal of Arthroplasty | Postoperative patient satisfaction | Stochastic gradient boosting | AUC: 0.81. Sensitivity: 73.0%. Specificity: 74.6%. | Demonstrated superior predictive performance than the binary logistic regression model. | Limited sample size (data from 897 cases) / Training and testing sets were selected from the same dataset. | Identified important predictors for postoperative patient satisfaction (e.g., age). |
| Harris (2019) [ | Clinical Orthopaedics and Related Research | Postoperative 30-day complications and mortality | LASSO regression | AUC: 0.72 (cardiac complications), 0.69 (mortality), 0.60 (renal complications). | Different datasets were used for initial training and testing. | Training dataset (ACS-NSQIP) does not contain complete patient medical data (e.g., comorbidities), and includes patients from a limited number of hospitals. | Developed an externally validated model using routine clinical data as predictors and could therefore potentially be used to identify high-risk patients preoperatively. |