Literature DB >> 33997202

Predictive Models for Clinical Outcomes in Total Knee Arthroplasty: A Systematic Analysis.

Cécile Batailler1,2, Timothy Lording3, Daniele De Massari4, Sietske Witvoet-Braam4, Stefano Bini5, Sébastien Lustig1,2.   

Abstract

BACKGROUND: Predictive modeling promises to improve our understanding of what variables influence patient satisfaction after total knee arthroplasty (TKA). The purpose of this article was to systematically review the relevant literature using predictive models of clinical outcomes after TKA. The aim was to identify the predictor strategies used for systematic data collection with the highest likelihood of success in predicting clinical outcomes.
METHODS: A Preferred Reporting Items for Systematic Reviews and Meta-Analyses protocol systematic review was conducted using 3 databases (MEDLINE, EMBASE, and PubMed) to identify all clinical studies that had used predictive models or that assessed predictive features for outcomes after TKA between 1996 and 2020. The ROBINS-I tool was used to evaluate the quality of the studies and the risk of bias.
RESULTS: A total of 75 studies were identified of which 48 met our inclusion criteria. Preoperative predictive factors strongly associated with postoperative clinical outcomes were knee pain, knee-specific Patient-Reported Outcome Measure (PROM) scores, and mental health scores. Demographic characteristics, pre-existing comorbidities, and knee alignment had an inconsistent association with outcomes. The outcome measures that correlated best with the predictive models were improvement of PROM scores, pain scores, and patient satisfaction.
CONCLUSIONS: Several algorithms, based on PROM improvement, patient satisfaction, or pain after TKA, have been developed to improve decision-making regarding both indications for surgery and surgical strategy. Functional features such as preoperative pain and PROM scores were highly predictive for clinical outcomes after TKA. Some variables such as demographics data or knee alignment were less strongly correlated with TKA outcomes. LEVEL OF EVIDENCE: Systematic review - Level III.
© 2021 The Authors.

Entities:  

Keywords:  Functional outcomes; Predictive factor; Predictive model; Satisfaction; Total knee arthroplasty

Year:  2021        PMID: 33997202      PMCID: PMC8099715          DOI: 10.1016/j.artd.2021.03.013

Source DB:  PubMed          Journal:  Arthroplast Today        ISSN: 2352-3441


Introduction

Total knee arthroplasty (TKA) is an efficient surgical treatment for knee osteoarthritis. However, patient dissatisfaction and suboptimal patient reported outcomes are reported to be as high as to 20% [1,2]. With the rise of robotic surgery, a time may come when the procedure itself will no longer be considered a feature that significantly determines outcomes. In such a scenario, understanding how other groups of variables such as patient-specific attributes, functional measures, socio-economic indicators, or perioperative recovery location influence clinical outcomes will become increasingly important. Conceivably, using relevant data points incorporated into an algorithm, the insights derived for any given patient could impact surgical indications, procedure type, venue of surgery, and even recovery site. Predictive models are usually deployed in contexts where the measurement of the output is difficult, time-demanding, and expensive [3]. The increasing availability of large digital health-care data sets has facilitated the application of predictive models. Several studies have published predictive models for TKA outcomes that have taken into account several features such as functional scores, preoperative pain [4], comorbidities [5], demographic characteristics, and psychological features [[6], [7], [8]]. The goal was to use these probabilistic models to estimate and predict the likelihood of improvements in function and satisfaction after TKA with the goal of supporting surgeon decision-making [9]. Ever more complex algorithmic approaches have been developed; however, none of these have so far been able to replicate standard surgeon intuition [10,11]. Currently, to our knowledge, no study summarizes which features have been identified as the most predictive of clinical outcomes or which algorithms have been most successfully used in predictive analytics after TKA. The purpose of this article is therefore to systematically review the relevant literature on predictive factors and predictive models for outcome after TKA. This article will describe the preoperative predictive features which have been identified as having the strongest correlation with outcomes and patient satisfaction after TKA. Second, it will review the machine learning models of TKA results.

Material and methods

Article identification and selection process

A query in December 2020 was performed to identify all available literature that described or used predictive models for outcomes after TKA. The search was performed through PubMed, EMBASE, and MEDLINE data bases from 1996 to 2020 inclusive using the 2009 Preferred Reporting Items for Systematic Reviews and Meta-Analyses protocol (PRISMA). Inclusion criteria for the search strategy included all English language studies reporting information regarding the use of predictive models or the identification of preoperative predictive factors for outcomes after TKA. The following terms were used: “total knee arthroplasty” or “total knee replacement”; “predictive factor” or “predictive model” or “predictive modeling” or “predictive feature” or “predict”; and “outcomes”, “satisfaction”, “pain” or “PROMs” or “dissatisfaction”. Exclusion criteria consisted of (1) editorial articles, (2) systematic reviews or meta-analyses, (3) articles on revision TKA, and (4) articles evaluating joints other than the knee. The abstracts from all identified articles were independently reviewed by 2 investigators.

Quality assessment

The Risk Of Bias In Non-Randomised Studies of Interventions (ROBINS-I) tool [12] was used to evaluate the quality of the included studies and their relative risk of bias. The categories for risk of bias judgements are “Low risk”, “Moderate risk”, “Serious risk” and “Critical risk”. To increase the reliability of this classification, the same observer evaluated all articles with the ROBINS-I tool 2 times separated by an interval of 4 weeks. If the assessment of the study quality was not the same during these 2 evaluations, a second observer evaluated the concerned article with the ROBINS-I tool.

Results

Included articles and study characteristics

The PRISMA flow diagram for study selection is shown in Figure 1. Of the 75 potential articles, 19 were excluded as not relevant, and 8 were excluded because of their scoring a “critical risk of bias” score leaving 48 studies for inclusion. The risk of bias for these studies is reported in Table 1.
Figure 1

Flow chart from initial literature search through to data extraction from final list of included studies.

Table 1

Summary of quality assessment of the studies included in our analysis, according to ROBINS-I tool (Risk Of Bias In Non-Randomised Studies of Interventions) [12].

StudyConfoundingSelection of patientsClassification of interventionsDeviations from intended interventionsMissing dataMeasurement of outcomesSelection of reported resultsStudy risk
Brander et al. (2003) [7]ModerateLowLowLowLowModerateModerateModerate
Lingard et al. (2004) [56]ModerateLowModerateLowModerateModerateModerateModerate
Bourne et al. (2007) [28]ModerateModerateLowLowLowModerateLowModerate
Escobar et al. (2007) [10]ModerateLowLowLowLowModerateLowModerate
Davis et al. (2008) [44]ModerateLowLowLowModerateModerateModerateModerate
Franklin et al. (2008) [32]LowLowLowLowLowLowLowLow
Nilsdotter et al. (2009) [45]SeriousModerateModerateLowModerateSeriousModerateSerious
Rajgopal et al. (2008) [38]ModerateLowLowLowModerateLowModerateModerate
Dowsey et al. (2010) [35]ModerateLowLowLowLowModerateLowModerate
Bourne et al. (2010) [1]ModerateModerateModerateLowLowSeriousModerateSerious
Blackburn et al. (2012) [27]ModerateModerateModerateLowModerateModerateLowModerate
Judge et al. (2012) [6]ModerateLowModerateLowSeriousSeriousModerateSerious
Baker et al. (2012) [5]ModerateLowLowLowModerateModerateLowModerate
Schnurr et al. (2013) [36]ModerateModerateLowLowModerateModerateModerateModerate
Barlow et al. (2014) [57]SeriousModerateModerateLowSeriousSeriousModerateSerious
Lungu et al. (2014) [46]ModerateModerateLowLowSeriousSeriousModerateSerious
Sueyoshi et al. (2015) [37]ModerateLowModerateLowModerateModerateLowModerate
Huijbregts et al. (2016) [17]ModerateLowLowLowModerateLowModerateModerate
Maratt et al. (2015) [22]ModerateModerateModerateLowModerateModerateModerateModerate
Feldmann et al. (2015) [58]SeriousModerateLowLowModerateSeriousLowSerious
Maempel et al. (2016) [30]LowLowLowLowLowLowLowLow
Van Onsem et al. (2016) [15]ModerateModerateLowLowModerateModerateLowModerate
Giurea et al. (2016) [24]ModerateLowModerateLowModerateModerateModerateModerate
Hinarejos et al. (2016) [40]ModerateLowModerateLowLowModerateModerateModerate
Kremers et al. (2017) [59]LowModerateLowModerateSeriousModerateSeriousSerious
Jain et al. (2017) [60]SeriousSeriousModerateLowModerateSeriousModerateSerious
Sanchez Santos et al. (2018) [33]LowLowLowLowLowLowLowLow
Clement et al. (2018) [20]ModerateLowLowLowModerateModerateLowModerate
Van Onsem et al. (2018) [43]SeriousModerateModerateLowSeriousSeriousModerateSerious
Clement et al. (2018) [23]ModerateLowModerateLowModerateModerateModerateModerate
Abrecht et al. (2019) [13]ModerateModerateModerateLowLowModerateLowModerate
Calkins et al. (2019) [31]LowSeriousModerateLowLowSeriousLowSerious
Twiggs et al. (2019) [42]LowLowLowLowLowLowLowLow
Tolk et al. (2020) [18]ModerateLowModerateLowModerateModerateModerateModerate
Zabawa et al. (2019) [14]ModerateModerateModerateLowLowModerateLowModerate
Kunze et al. (2019) [39]LowLowModerateLowLowModerateLowModerate
Clement et al. (2019) [19]ModerateLowLowLowModerateModerateModerateModerate
Ramkumar et al. (2019) [61]LowLowLowLowLowLowLowLow
Pua et al. (2019) [25]ModerateLowModerateLowLowModerateModerateModerate
Xu et al. (2020) [62]ModerateModerateLowLowModerateModerateLowModerate
Vissers et al. (2020) [63]ModerateLowModerateLowLowModerateModerateModerate
Kunze et al. (2020) [41]ModerateModerateLowLowModerateLowLowModerate
Belford et al. (2020) [64]ModerateModerateModerateLowModerateModerateModerateModerate
Pua et al. (2020) [26]ModerateLowLowLowLowModerateModerateModerate
Farooq et al. (2020) [29]ModerateLowLowLowModerateModerateLowModerate
Harris et al. (2021) [21]ModerateLowLowLowModerateLowModerateModerate
Itou et al. (2020) [65]SeriousSeriousModerateLowModerateSeriousModerateSerious
Anis et al. (2020) [34]LowLowLowLowLowLowLowLow

The categories for risk of bias judgements are “Low risk”, “Moderate risk”, “Serious risk”, and “Critical risk”. The worst judgment bias assigned within any one domain gives the judgment score of the complete study.

Flow chart from initial literature search through to data extraction from final list of included studies. Summary of quality assessment of the studies included in our analysis, according to ROBINS-I tool (Risk Of Bias In Non-Randomised Studies of Interventions) [12]. The categories for risk of bias judgements are “Low risk”, “Moderate risk”, “Serious risk”, and “Critical risk”. The worst judgment bias assigned within any one domain gives the judgment score of the complete study.

Predictive factors

Several parameters were consistently identified in different studies as impacting outcomes after TKA. These parameters have been classified into 3 groups according to the strength of their association with outcome (Supplementary Table 1): (1) strong and consistent association; (2) strong but inconsistent association; (3) weak and inconsistent association (Table 2). The predictive factors classified in group I (strong association) were significantly correlated with outcomes after TKA (P < .05) in all studies with low or moderate risk, which assessed these factors. The predictive factors classified in group II (strong but inconsistent association) were significantly correlated with TKA outcomes (P < .05) in low- or moderate-risk studies, but not in all. For this group, relevant studies found a significant correlation, but other relevant studies did not find the same strong association. The predictive factors classified in group III (weak association) were not significantly correlated with TKA outcomes in the low- or moderate-risk studies.
Supplementary Table 1

Table reporting the 3 different types of predictive factors according to the strength of their association with TKA outcomes.

Joint specific PROMsFunctionGeneral PROMsSatisfactionPain
OKS (Q-score)Improvement in OKSWOMACWOMAC FuncWOMAC StiffnessKOOSChange in KOOSSF-36post-op ROMEQ5 D VASSelf assessment of outcomes improvementKSS satisfaction subscaleVAS PainWOMAC PainNo pain reliefOpioid consumption
Clear association with (poor)outcomesPainPre-op VAS PainHuijbregts (2016)Van Onsem (2016) Zab awa (2019)Abrecht (2019)Abrecht (2019)
Neurological disease / Back painEscobar (2007)Escobar (2007), Clement (2019)Twiggs (2019)Clement (2018)Escobar (2007)
Joint specific PRE-op PROMsPre-op knee function scores (KOOS pain/function)Sanchez-Santos (2018)Lungu (2014)Lingard (2018) Lungu (2014)Lungu (2014)Twiggs (2019)Van Onsem (2016)Lungu (2014)
Pre-op WOMAC FunctionAllyson Jones (2003) Lingard (2004) Rajgopal (2008) Nunez (2009)Escobar (2007) Lingard (2018) Nunez (2007)Clement (2019)Allyson Jones (2003) Lingard (2004)Lopez-Olivio (2011) Clement (2019)
Pre-op WOMAC PainClement (2018)Van Onsem (2016)Nunez (2007) Clement (2019)
Worse Pre-op WOMAC StiffnessLungu (2014)Lungu (2014)Clement (2019) Nunez (2007)Van Onsem (2016)Lungu (2014) Clement (2019)
Pre-op SF-12 PCS/SF-36Huijbregts (2016)Lingard (2004)Escobar (2007) Clement (2019)Escobar (2007)Lingard (2004)Clement (2019)Escobar (2007) Clement (2019)
KneeAbsent or damaged ACL pre-opSanchez-Santos (2018)
Pre-op Range of Motion (ROM)Sanchez-Santos (2018)Van Onsem (2018)
gen. PROMPre-op EQ5D VASHuber (2019)Maratt (2015)Huber (2019)
Mental healthDepression/AnxietyXu (2019)Lopez-Olivio (2011)Judge (2012)Clement (2018) Giurea (2016)Van Onsem (2016) Zabawa (2019)Abrecht (2019)Clement (2019)Abrecht (2019)
Ability to copeSanchez-Santos (2018)Lopez-Olivio (2011)Giurea (2016)Van Onsem (2016)
Hospital Anxiety and Depression Scale (HAD)Blackburn (2012)Xu (2019)
Pre-op SF-12 MCSRajgopal (2008) Xu (2019)Escobar (2007) Clement (2019)Escobar (2007) Clement (2019)Lingard (2004) Franklin (2008)Clement (2018)Escobar (2007) Clement (2019)
otherGeography (UK vs US/AUS)Lingard (2018)
Joint co-morbidityRajgopal (2008)
Occurance of falls in preceding yearTwiggs (2019)
Allergy (>1 self-reported)Hinarejos (2016)Kunze (2019)Hinarejos (2016)
Wide-spread body painNunez (2007)Nunez (2007)Nunez (2007)Dave (2017) Nunez (2007)
Severity osteoarthritis (Kellgren-Lawrence)Vissers (2020) Judge (2012)Schnurr (2013) Kunze (2019)
Inconsistent associationDemographicsMedical co-morbidities / ASA scoreSanchez-Santos (2018)Allyson Jones (2003) Lingard (2004) Nunez (2009) Nunez (2011)Lingard (2018) Allyson Jones (2003) Escobar (2007) Lopez-Olivio (2011)Escobar (2007)Lingard (2004) Hilton (2016)Allyson Jones (2003) Escobar (2007) Lopez-Olivio (2011) Hilton (2016)
BMISanchez-Santos (2018) Judge (2012)Allyson Jones (2003) Lingard (2004) Bourne (2007) Rajgopal (2008)Nunez (2007) Nunez (2009) Lopez-Olivio (2011)Twiggs (2019)Allyson Jones (2003) Lingard (2004) Cushnaghan (2008) Franklin (2008) Dowsey (2010)Maempel (2016)Clement (2018) Kunze (2019)Zabawa (2019) Calkins (2019)Abrecht (2019)Nunez (2007-2009) Lopez-Olivio (2011) Clement (2019)Sueyoshi (2015)Abrecht (2019)
GenderJudge (2012)Allyson Jones (2003) Nunez (2007) Bourne (2007) Rajgopal (2008)Lingard (2004) Escobar (2007) Nunez (2009)Escobar (2007)Twiggs (2019)Kiebzak (2002) Allyson Jones (2003) Cushnaghan (2008) Franklin (2008)Maempel (2016)Van Onsem (2016) Zabawa (2019) Van Onsem (2018) Calkins (2019)Abrecht (2019)Lingard (2004) Escobar (2007) Clement(2019)Sueyoshi (2015)Abrecht (2019)
AgeClement (2012)Nunez (2007) Bourne (2007) Rajgopal (2008)Allyson Jones (2003) Escobar (2007) Cushnaghan (2008)Escobar (2007)Twiggs (2019)Allyson Jones (2003) Lingard(2004) Cushnaghan (2008) Franklin (2008) Clement (2012)Maempel (2016)Abrecht (2019)Huijbregts (2016) Schnurr (2013)Van Onsem (2016) Calkins (2019)Abrecht (2019)Clement (2019) Escobar (2007)Sueyoshi (2015)Abrecht (2019)
Knee(No) previous knee surgeryHuber (2019) Sanchez-Santos (2018)Rajgopal (2008)Twiggs (2019)Kunze (2019)Abrecht (2019)Abrecht (2019)
Pre-op knee alignmentTwiggs (2019)Sueyoshi (2015)
SocialIncomeSanchez-Santos (2018) Judge (2012)Davis (2008)Davis (2008)
Decreased social supportEscobar (2007) Lopez-Olivio (2011)Escobar (2007)Twiggs (2019)Escobar (2007)
Low or no significant correlationEducation / Socioeconomic status (SES)Davis (2008)Feldman (2015)Pua (2019)(2015)Pua (2019)Lopez-Olivio (2011)
Smoking / DrinkingTwiggs (2019)Cushnaghan (2008)
Employment statusTwiggs (2019)
ExpectationNilsdotter (2009)
EthnicityLopez-Olivio (2011)
Quadriceps st rengthVan Onsem (2018)
Pre-op pain medicationTwiggs (2019)
Table 2

Table reporting the different predictive factors and the strength of their correlation with TKA outcomes, for each included study.

Parameters
Brander (2003) [7]
Lingard (2004) [56]
Bourne (2007) [28]
Escobar (2007) [10]
Davis (2008) [44]
Franklin (2008) [32]
Nilsdotter (2008) [45]
Rajgopal (2008) [38]
Dowsey (2010) [35]
Bourne (2010) [1]
Blackbum (2012) [27]
Judge (2012) [6]
Baker (2012) [5]
Schnurr (2013) [36]
Barlow (2014) [57]
Lungu (2014) [46]
Sueyoshi (2015) [37]
Huijbregts (2015) [17]
Maratt (2015) [22]
Feldmann (2015) [58]
Maempel (2016) [30]
Van Onsem (2016) [15]
Quality assessmentMod.Mod.Mod.Mod.Mod.LowSeriousMod.Mod.SeriousMod.SeriousMod.Mod.SeriousSeriousMod.Mod.Mod.SeriousLowMod.
Preoperative predictive factors
 Preop VAS PainSASASA
 Preop pain medication
 Neurological disease/backpainSA
 Preop KOOS scoreSASA
 Preop KSS score
 Preop WOMACSASASAIASASASA
 Preop SF-12 PCS/SF-36SAIASANASAIA
 Preop SF-12 MCSSASASASA
 Preop OKS ScoreSASANASASA
 Preop EQ5D VASSASA
 Preop ROMIASA
 Joint comorbidity/previous knee surgerySANASA
 Severity osteoarthritis (Kellgren)SA
 Preop knee alignmentIASAIA
 Quadriceps strengthSA
 Depression/AnxietySASASASANASA
 Ability to copeSA
 Allergy (>1 self-reported)
 Medical comorbidities/ASA scoreIASANANA
 BMINAIAIASAIAIAIANANANANANANASA
 GenderNAIANAIANAIANANAIANANASAIA
 AgeNASASAIASAIANASAIASANAIASANAIASAIA
 Geography (UK vs US/AUS)SA
 IncomeIA
 Decreased social supportNA
 Education/Socioeconomic status (SES)IANASA
 Smoking/Drinking
 Employment statusNA
 ExpectationNASANA
 EthnicityNA
Patient-reported outcome measures
 Pain Catastrophizing Scale (PCS)XX
 VAS painX
 KSCR improvementXX
 WOMAC ScoreXXXXXXXXX
 WOMAC improvementXXXXX
 SF12 PCS scoreXX
 SF12 MCS scoreX
 SF12 PCS improvementXXXX
 SF-36 scoreXXX
 KSS scoreXXX
 KOOS scoreXXX
 KOOS Improvement
 IKS scoreXX
 IKS improvementX
 OKS scoreXXXXXX
 OKS improvementX
 EQ-5D scoreXX
 EQ-5D improvementX
 SatisfactionXXX
 Post op ROMX
 Revision risk

BMI, body mass index; EQ-5D, Euro QOL score; KOOS, Knee injury and Osteoarthritis Outcome Score; KSS, Knee Society Score; IA, inconsistent association; Mod., moderate; NA, no association; OKS, Oxford Knee Score; PROM, Patient-Reported Outcome Measures; ROM, range of motion; SA, strong association; WOMAC, Western Ontario and McMaster Universities Osteoarthritis Index; VAS, Visual Analog Scale.

Table reporting the different predictive factors and the strength of their correlation with TKA outcomes, for each included study. BMI, body mass index; EQ-5D, Euro QOL score; KOOS, Knee injury and Osteoarthritis Outcome Score; KSS, Knee Society Score; IA, inconsistent association; Mod., moderate; NA, no association; OKS, Oxford Knee Score; PROM, Patient-Reported Outcome Measures; ROM, range of motion; SA, strong association; WOMAC, Western Ontario and McMaster Universities Osteoarthritis Index; VAS, Visual Analog Scale.

Strong and clear association

Preoperative pain

Patients with higher levels of pain before TKA surgery had lower postoperative functional scores. However, improvements in pain scores, functional knee scores, and patient satisfaction were greater in this group [[13], [14], [15], [16]]. Huijbregts et al. found that a one-point increase in preoperative NRS-pain (Numerical Rating Scale) resulted in a 1.73-point decrease in 1-year Oxford Knee Score (OKS) [17].

Preoperative PROM score

Preoperative PROM scores, particularly knee scores, were strongly correlated with TKA outcome [5,14,15,[18], [19], [20], [21]]. Maratt et al. has defined the minimally clinically important difference (MCIDs) for the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) scores at 2 years after TKA [22]. The preoperative WOMAC scores were the strongest predictive factors for improvement in postoperative WOMAC scores in a cohort of 2350 TKAs.

Mental health

Anxiety and depression before surgery are frequently identified as risk factors for lower patient-reported outcomes after TKA [16], in particular with regard to patient dissatisfaction [6,14,15,20,23,24], knee pain [7], and walking limitations [5,[25], [26], [27]].

Inconsistent association

Demographic characteristics

Huijbregts et al. reported an inverse correlation between age and satisfaction with knee surgery [17,28,29]. With respect to sex, several studies have suggested that residual pain and stiffness [30], and consequently dissatisfaction, were more prevalent in female patients [15,25]. Body mass index (BMI) is a statistically significant predictor of satisfaction (Knee Society Score [KSS] satisfaction subscale) [14,31], postoperative PROM scores [[32], [33], [34]], and postoperative range of motion [30]. Dowsey et al. described poorer functional outcomes (Knee and function KSS) in morbidly obese patients (BMI > 40 kg/m2) [35]. Nevertheless, this correlation is not consistently demonstrated by all authors [29,[36], [37], [38]]. Some studies had grouped several demographic parameters (age, female gender, and BMI) to create a single demographic criterion [13,34]. A predictive model for postoperative PROM scores found a significant correlation between age and gender and OKS [33]. For example, younger women (age < 60 years) had better OKS outcomes than men, but in the oldest age group (age 80 years or older), women had worse outcomes than men [33].

Clinical comorbidities

Several studies identified clinical comorbidities as significant predictive factors of poor outcomes after TKA [16]. Diabetes [20,39] or allergies [40,41] were singled out.

Osteoarthritis severity

On 996 TKAs, Schnurr et al. reported that the severity of preoperative osteoarthritis was the only feature which inversely correlated with patient satisfaction [36]. In comparison to severe arthritis grade IV, the risk for dissatisfaction was 2.6-fold higher for arthritis grade III (P < .001) and 3.0-fold higher for grade II (P = .001).

Surgical history

In some studies, the number of previous knee surgeries was correlated to postoperative outcomes [39] or to pain during the hospitalization (P < .002) [13].

Preoperative knee alignment

Sueyoshi et al. described a significant association between preoperative varus greater than 5° and postoperative pain (P = .0096) [37]. However, Twiggs et al. found no correlation between preoperative knee alignment and pain scores at 12 months [42].

Preoperative range of motion

Similarly, preoperative range of motion (ROM) has an uncertain impact on postoperative clinical outcomes after TKA despite having a direct correlation with postoperative ROM [25,30,43].

Low association

Some features were not identified as significant predictors of clinical outcomes, such as education, socioeconomic status, smoking, and alcohol habits, and patient expectations were not found to be predictive of outcomes or pain after TKA [13,14,42,44,45]. Preoperative pain medication use was not a significant predictive factor of postoperative satisfaction after TKA [36]. Surgical time and tourniquet time were not clearly identified as independent predictors of postoperative pain or dissatisfaction after TKA [13,29]. These features were not found to be predictive of postoperative outcomes in the literature reviewed and are unlikely to contribute to a predictive model.

Patient-reported outcome measures

Twenty-three different outcome measurement parameters were found in the included studies. These parameters could be grouped as follows: (1) postoperative validated measures, (2) patient satisfaction measures, (3) pain measures, and (4) improvement in PROMs. Several predictive models have been developed to estimate various measures of postoperative outcome after TKA (Table 3).
Table 3

Studies reporting a predictive model for TKA outcomes.

StudyYearSample sizeLocationPredictive factorsOutcome measurement parametersDelayPredictive model
Judge et al. [6]20121991UKAge, gender, BMI, primary diagnosis, ASA score, Index of Multiple Deprivation, OKS, EQ5DSatisfaction, OKS6 moN/A
Lungu et al. [46]2014141Canada5 Preoperative WOMAC questions: difficulty of taking off socks, getting on/off toilet, performing light domestic duties, and rising from bed as well as degree of morning stiffness after the first wakeningWOMAC6 moPredictive rule, based on 5 preop WOMAC questions
Van Onsem et al. [15]2016113BelgiumQuestion selections based on KOOS, OKS, PCS, EQ-5D, KSS, age, and genderKSS satisfaction subscore3 moAlgorithm:Satisfaction at M3 = 26.10 + 2.3∗gender + 0.13∗age + 1.58∗Q3 − 1.40∗Q4 − 1.08∗Q5 − 0.75∗Q6 − 1∗Q7 − 1.12∗Q8 − 0.88∗Q9 − 1.10∗Q10
Sanchez et al. [33]20181649 (External validation on 595)UKAge, gender, marital status, Index of Multiple Deprivation, BMI, anxiety/depression, OKS, ASA score, etiology, previous knee arthroscopy, flexion contracture, ACL status.OKS12 moN/A
Van Onsem et al. [43]201857BelgiumPreop ROM, quadriceps and hamstring force, sit-to-stand test, 6-min walk testKOOS, KSS, OKS6 moN/A
Twiggs et al. [42]2019330 (2 external validations)US/AustraliaAge, gender, KOOS items, back pain, occurrence of hip pain, occurrence of falls in past yearKnee painMCID = 10 points of KOOS pain score12 moPredictive model with a web application
Tolk et al. [18]20207071NLAge, gender, ASA score, BMI, smoking, previous knee surgery, Charnley score, KOOS-PS, OKS, EuroQoL 5D-3L, NRSResidual symptoms (pain at rest and activity, sit-to-stand movement, stair negotiation, walking, performance of activities of daily living, kneeling and squatting).6 and 12 moPredictive model for residual symptoms
Kunze et al. [39]2019484USBMI, drug allergies, osteophytes, soft tissue thickness, flexion contracture, diabetes, opioid use, comorbidity, previous knee surgery, surgical indication, smokingPatient-reported health state, KSS, ROM, satisfaction=> Knee survey score12 moKnee survey score on 110 pts. 4 risks of experiencing postoperative dissatisfaction:Score 96.5-110 = low riskScore 75-96.4 = mild riskScore 60-74.9 = medium riskScore <60 = high risk
Ramkumar et al. [61]2019171,025USAge, gender, ethnicity, emergency department, risk of mortality, severity of illness, comorbidity weekend admission, hospital type, incomeLOS charges/cost, discharge dispositionModel code (https://github.com/JaretK/ NeuralNetArthroplasty)
Pua et al. [25]20194026SingaporeAge, gender, race, educational level, diabetes, preop gait aids, contralateral knee pain, psychological distressKnee extension, knee flexion, knee pain, walking limitation6 moPrediction model with a web application (https://sgh-physio. shinyapps.io/predicTKR/)
Anis et al. [34]20205958 and 2391USAge, gender, BMI, race, educational level, smoking, comorbidity, KOOS items, 12PCS, 12MCSLOS, 90-days readmission, PROMS12 moN/A

EQ-5D, Euro QOL score; KOOS, Knee injury and Osteoarthritis Outcome Score; KSS, Knee Society Score; LOS, length of stay; OKS, Oxford Knee Score; PCS, Pain Catastrophizing Scale; ROM, range of motion; WOMAC, Western Ontario and McMaster Universities Osteoarthritis Index.

Studies reporting a predictive model for TKA outcomes. EQ-5D, Euro QOL score; KOOS, Knee injury and Osteoarthritis Outcome Score; KSS, Knee Society Score; LOS, length of stay; OKS, Oxford Knee Score; PCS, Pain Catastrophizing Scale; ROM, range of motion; WOMAC, Western Ontario and McMaster Universities Osteoarthritis Index.

Postoperative PROMs (specific of the knee or general)

Some predictive models have focused on postoperative knee PROMs. Sanchez-Santos et al. described a predictive model for the postoperative OKS questionnaire at 12 months after TKA using data from a cohort of 1649 patients [33]. Tolk et al. estimated the probability of residual symptoms after TKA based on individual PROM questions and PROM total scores [18,33]. However, the use of these models in clinical practice is neither intuitive nor practical. Furthermore, these models are built on relatively small data sets and not have wide applicability.

Patient satisfaction

In a cohort of 484 patients, Kunze et al. have developed a preoperative knee survey score to predict patient outcome and satisfaction at 1 year after TKA [39]. They concluded that a knee survey score of 96.5 would confer a 97.5% sensitivity and 95.7% negative predictive value for satisfaction, and a knee survey score of less than or equal to 96.5 increased the probability of experiencing postoperative dissatisfaction.

Patient pain

From a cohort of 4026 TKAs, Pua et al. developed a predictive model designed to determine the expected knee ROM, knee pain, and walking limitations of a patient 6 months postoperatively [25]. They created a web application to facilitate its use in clinical practice (https://sgh-physio.shinyapps.io/predicTKR/). However, no study has assessed the predictive value of this model. Twiggs et al. created an algorithm designed to predict a patient’s knee pain score 12 months after TKA [42]. The algorithm is based on a preoperative self-administered questionnaire and predicts the likelihood that a patient’s change in the pain score will be equal to or greater than the MCID in the PROM score. The use of the MCID allows the patient and the surgeon to know during the preoperative consultation if the patient will likely experience a clinically significant improvement in pain after TKA.

PROM improvement

The most commonly used measurement to assess outcome after TKA is the improvement seen between results collected before surgery and postoperative data collected using validated instruments or objective measures such as ROM [19,21,34].

Discussion

The aim of this article was to identify and group the preoperative predictive factors and outcome measurement parameters which have been found to be predictive of clinical outcomes after TKA in the current literature. This compendium identifies those variables that are the most likely to be useful features in the context of predictive algorithms for clinical outcomes after TKA. These features are summarized in Table 4.
Table 4

Summary of the main preoperative predictive factors and outcome measurement parameters.

Strength of associationPredictive factorsOutcome measurement parametersDelay
Strong correlation

Pain

VAS pain

Back pain

•Knee-specific PROMs

KOOS

WOMAC

•Knee characteristics

ROM

•General PROMs

EQ-5D

Mental health

Anxiety/Depression

SF-12

Improvement of knee-specific PROMs

OKS

KOOS

WOMAC

Satisfaction

Self-assessment of improvement

KSS satisfaction subscale

Pain

VAS pain

WOMAC pain

Persistent pain

6 and 12 mo
Inconsistent correlation

Comorbidities/ASA score

BMI

Gender

Age

Previous knee surgery

Severity of osteoarthritis

Preop knee alignment

Knee-specific PROMs

OKS

KOOS

WOMAC

SF-36

General PROMs

BMI, body mass index; EQ-5D, Euro QOL score; KOOS, Knee injury and Osteoarthritis Outcome Score; KSS, Knee Society Score; OKS, Oxford Knee Score; PROM, Patient-Reported Outcome Measures; ROM, range of motion; WOMAC, Western Ontario and McMaster Universities Osteoarthritis Index; VAS, Visual Analog Scale.

Summary of the main preoperative predictive factors and outcome measurement parameters. Pain VAS pain Back pain •Knee-specific PROMs KOOS WOMAC •Knee characteristics ROM •General PROMs EQ-5D Mental health Anxiety/Depression SF-12 Improvement of knee-specific PROMs OKS KOOS WOMAC Satisfaction Self-assessment of improvement KSS satisfaction subscale Pain VAS pain WOMAC pain Persistent pain Comorbidities/ASA score BMI Gender Age Previous knee surgery Severity of osteoarthritis Preop knee alignment Knee-specific PROMs OKS KOOS WOMAC SF-36 General PROMs BMI, body mass index; EQ-5D, Euro QOL score; KOOS, Knee injury and Osteoarthritis Outcome Score; KSS, Knee Society Score; OKS, Oxford Knee Score; PROM, Patient-Reported Outcome Measures; ROM, range of motion; WOMAC, Western Ontario and McMaster Universities Osteoarthritis Index; VAS, Visual Analog Scale. The challenge of developing useful predictive algorithms is twofold: (1) choosing the right predictive factors and (2) selecting those outcomes that are both “predictable” and useful measures of clinical satisfaction [18,46]. Indeed, in several studies, predictive factors and the outcomes are correlated independently of patient variables because the same questions, the same scores, and parameter are assessed before and after TKA. For this reason, multiple variable analysis is essential to assess the independent contribution of each feature to the prediction of postoperative outcomes. For example, a patient with a preoperative fixed flexion deformity has a higher risk of having a postoperative fixed flexion deformity [25]. In this context, the improvement in PROM scores can be very useful to assess TKA outcomes. The clinical relevance of the improvements can be quantified by the MCID. Unfortunately, MCIDs are not universally valid across populations and cultures and vary by instrument. Another challenge of using predictive models built and validated with data from one population is extrapolating the results of the algorithm using data from other populations. For example, Zabawa et al. [14] and Calkins et al. [31] did not support the validity of the Van Onsem prediction tool [15]. These differences can have several explanations. First, the populations under study can be very different from one country to another. For example, the mean BMI may vary considerably between 2 populations, and if this parameter is identified as a predictive factor in the algorithm, the model may simply be inaccurate in the second population. Second, the indications for TKA can change between countries or between centers in the same country, particularly relative to age, BMI, and osteoarthritis stage. Thus, the result from the model for a given patient may meet threshold criteria in one country/center but not in another. For this reason, predictive models using data from very large populations numbering in the hundreds of thousands, including several centers or countries, are more relevant and reliable [18]. It is also worth noting that while some correlations were identified between preoperative variables and postoperative clinical outcomes, the strength of even the best correlations was underwhelming. While it is possible that larger and more accurate data sets may increase the validity and predictive value of the algorithms, it is also possible that entirely different end points will be required. The existing “gold standard” PROMs are now several decades old and measured outcomes that were tied to problems faced by implant technology and surgical techniques that are now antiquated. While the primary concern in the latter 20th century was with implant survivorship of TKA, in the first 2 decades of the 21st century, attention has turned to functional outcomes after that procedure. However, the functional aspect of most scoring systems sets a reasonably low bar for success such as standing up from a chair rather than playing a round of golf. This is one reason so many PROMs have well-defined ceiling effects [[47], [48], [49], [50]]. Some more demanding functional scores are sometimes used, such as the forgotten joint score, the WOMAC score, and the UCLA score. But they remain rarely used to assess the TKA outcomes. Maybe other assessment methods would be necessary, such as the gait analysis or digital care management platforms with tools to have connected patients. Nevertheless, these devices are currently lightly used and not described or assessed in the studies on the predictive models. In terms of modeling technique, a substantial number of publications relied on traditional regression models which are robust and provide a quantitative assessment of the predictor’s relationship with the output through the investigation of the model’s coefficients [10,18,20,36,37]. However, machine learning techniques have been shown to outperform linear regression model in specific tasks, such as prediction of post-TKA EQ-5D-3L visual analog scale [51], estimating risk of total joint replacement [52], and more recently length-of-stay prediction after TKA [34,53]. More and more studies try to use the machine learning to predict the TKA outcomes and to adapt the practices according to the established predictive features [21,26,29,41]. The development of clinical decision-making tools generated from machine learning, which can be used in consultation to help discuss risk stratification with patients, could provide a means of better understanding which patients are at a greater risk to experience dissatisfaction after primary TKA [34,54,55]. Therefore, we expect the level of adoption of machine learning models to increase in light of the promising results reported in some of the publications reviewed herein [44]. We hope that future research in this field will adopt the best practice of benchmarking different algorithms to a given prediction problem as we clearly have seen that there is no “one-size-fits-all” best solution in predictive modeling for TKA clinical outcomes. Our findings should be considered in the light of the key limitations of the data set. First, the inclusion criteria, such as English language or the requirement of full text access, may have excluded relevant research. Second, the methodology score has known limitations with regard to the type of studies included (predictive cohort-based studies) and the difficulties in assessing the validity of the analyses conducted without having access to the raw data. Third, there was an important variability between the studies with respect to the type of predictive features or outcome measurements, the follow-up period, the patient population, and the analyses performed. This heterogeneity limits the possibility of performing a true meta-analysis. Finally, the accuracy of predictive algorithms is derived from 2 critical aspects of the data set on which they are constructed: their size and the accuracy and completeness of the data sets within them. The larger the number of variables that can influence an outcome and the more complex the interaction between these variables, the larger the data set needs to be to discern and predict these complex interactions. The indications for TKA are complex and multifactorial. Selecting patients for surgery based strictly on the prediction of clinical outcomes alone is probably not reasonable. However, predictive models can communicate information and insight to both patients and surgeons that can be included in a shared decision-making process. The output of these algorithms might 1 day be expanded from simply predicting outcomes to providing a stratification in the variation of possible outcomes based on the pursuit of different surgical strategies. In this scenario, the surgeon and patient would essentially customize the procedure to optimize the likelihood of meeting the patient’s needs. Examples include the decision between partial and total knee replacement or the choice of a cruciate retaining or cruciate sacrificing TKA. By feeding results back to the data set, the models evolve and improve over time and become increasingly accurate (Fig. 2). We expect that such predictive models, when trained with appropriate and accurate data sets, could become an important adjunct to daily clinical practice in the near future.
Figure 2

Diagram explaining the correlation between predictive factors and outcomes in a predictive model.

Diagram explaining the correlation between predictive factors and outcomes in a predictive model.

Conclusion

The existing literature on predictive modeling of clinical outcomes after TKA has identified preoperative variables that have at least some correlation with clinical results. Functional features such as pain, PROMs scores, or mental health were highly predictive for clinical outcomes after TKA. Some variables such as demographics data, surgical history, or knee alignment were less strongly correlated with TKA outcomes. The challenge of developing useful predictive algorithms is further complicated by the need to select the most appropriate measurement parameters of TKA outcomes such as improvement in PROMs, patient satisfaction, or postoperative pain. Creating accurate and reproducible predictive algorithms may 1 day provide advanced tools for shared decision-making relative to surgical indications and expected outcomes. However, the data gathered also suggested that work is still required to define outcomes measures that more accurately correlate with preoperative variables and better reflect patient satisfaction.

Conflicts of interest

The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: C.B.: no conflict of interest. T.L.: in the speakers bureau of Smith and Nephew and Arthrex; and consultant for Amplitude. D.D.M.: paid employee for Stryker. S.W.-B.: Paid employee for Stryker. S.B.: consultant for Stryker and Zimmer Biomet; has stock or stock options from Cloudmedx.com and Insilicotrials.com; in orthopedic publications editorial board of the Journal of Arthroplasty and Arthroplasty Today; board member/committee appointments for American Academy of Orthopedic surgery and American Association of Hip and Knee Surgeons. S.L.: consultant for Stryker, Smith and Nephew, Heraeus, and Depuy Synthes; received institutional research support from Lepine and Amplitude; in editorial board of the Journal of Bone and Joint Surgery (Am).
  64 in total

1.  Acute postoperative pain at rest after hip and knee arthroplasty: severity, sensory qualities and impact on sleep.

Authors:  V Wylde; J Rooker; L Halliday; A Blom
Journal:  Orthop Traumatol Surg Res       Date:  2011-03-08       Impact factor: 2.256

2.  External Validity of a New Prediction Model for Patient Satisfaction After Total Knee Arthroplasty.

Authors:  Tyler E Calkins; Chris Culvern; Cindy R Nahhas; Craig J Della Valle; Tad L Gerlinger; Brett R Levine; Denis Nam
Journal:  J Arthroplasty       Date:  2019-04-13       Impact factor: 4.757

3.  The Chitranjan Ranawat Award: functional outcome after total knee replacement varies with patient attributes.

Authors:  Patricia D Franklin; Wenjun Li; David C Ayers
Journal:  Clin Orthop Relat Res       Date:  2008-11       Impact factor: 4.176

4.  A New Prediction Model for Patient Satisfaction After Total Knee Arthroplasty.

Authors:  Stefaan Van Onsem; Catherine Van Der Straeten; Nele Arnout; Patrick Deprez; Geert Van Damme; Jan Victor
Journal:  J Arthroplasty       Date:  2016-07-14       Impact factor: 4.757

5.  Patient-reported allergies cause inferior outcomes after total knee arthroplasty.

Authors:  Pedro Hinarejos; Tulia Ferrer; Joan Leal; Raul Torres-Claramunt; Juan Sánchez-Soler; Joan Carles Monllau
Journal:  Knee Surg Sports Traumatol Arthrosc       Date:  2015-11-03       Impact factor: 4.342

6.  Prosthetic alignment after total knee replacement is not associated with dissatisfaction or change in Oxford Knee Score: A multivariable regression analysis.

Authors:  Henricus J T A M Huijbregts; Riaz J K Khan; Daniel P Fick; Olivia M Jarrett; Samantha Haebich
Journal:  Knee       Date:  2016-01-27       Impact factor: 2.199

7.  Predicting total knee replacement pain: a prospective, observational study.

Authors:  Victoria A Brander; S David Stulberg; Angela D Adams; R Norman Harden; Stephen Bruehl; Steven P Stanos; Timothy Houle
Journal:  Clin Orthop Relat Res       Date:  2003-11       Impact factor: 4.176

8.  What proportion of patients report long-term pain after total hip or knee replacement for osteoarthritis? A systematic review of prospective studies in unselected patients.

Authors:  Andrew David Beswick; Vikki Wylde; Rachael Gooberman-Hill; Ashley Blom; Paul Dieppe
Journal:  BMJ Open       Date:  2012-02-22       Impact factor: 2.692

9.  Development of an outcome prediction tool for patients considering a total knee replacement--the Knee Outcome Prediction Study (KOPS).

Authors:  Tim Barlow; Mark Dunbar; Andrew Sprowson; Nick Parsons; Damian Griffin
Journal:  BMC Musculoskelet Disord       Date:  2014-12-23       Impact factor: 2.362

10.  Construction and Comparison of Predictive Models for Length of Stay after Total Knee Arthroplasty: Regression Model and Machine Learning Analysis Based on 1,826 Cases in a Single Singapore Center.

Authors:  Hui Li; Juyang Jiao; Shutao Zhang; Haozheng Tang; Xinhua Qu; Bing Yue
Journal:  J Knee Surg       Date:  2020-06-08       Impact factor: 2.757

View more
  2 in total

1.  Predictive capacity of four machine learning models for in-hospital postoperative outcomes following total knee arthroplasty.

Authors:  Abdul K Zalikha; Mouhanad M El-Othmani; Roshan P Shah
Journal:  J Orthop       Date:  2022-03-21

Review 2.  Artificial intelligence in knee arthroplasty: current concept of the available clinical applications.

Authors:  Cécile Batailler; Jobe Shatrov; Elliot Sappey-Marinier; Elvire Servien; Sébastien Parratte; Sébastien Lustig
Journal:  Arthroplasty       Date:  2022-05-02
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.