Alex H S Harris1, Alfred C Kuo, Yingjie Weng, Amber W Trickey, Thomas Bowe, Nicholas J Giori. 1. A. H. S. Harris, T. Bowe, N. J. Giori Center for Innovation to Implementation, VA Palo Alto Healthcare System, Palo Alto, CA, USA A. C. Kuo San Francisco Veterans Affairs Medical Center, University of California, San Francisco, CA, USA A H. S. Harris, Y. Weng, A. W. Trickey Stanford-Surgical Policy Improvement Research and Education Center, Stanford, CA, USA N. J. Giori Department of Orthopedic Surgery, Stanford University School of Medicine, Stanford, CA, USA.
Abstract
BACKGROUND: Existing universal and procedure-specific surgical risk prediction models of death and major complications after elective total joint arthroplasty (TJA) have limitations including poor transparency, poor to modest accuracy, and insufficient validation to establish performance across diverse settings. Thus, the need remains for accurate and validated prediction models for use in preoperative management, informed consent, shared decision-making, and risk adjustment for reimbursement. QUESTIONS/PURPOSES: The purpose of this study was to use machine learning methods and large national databases to develop and validate (both internally and externally) parsimonious risk-prediction models for mortality and complications after TJA. METHODS: Preoperative demographic and clinical variables from all 107,792 nonemergent primary THAs and TKAs in the 2013 to 2014 American College of Surgeons-National Surgical Quality Improvement Program (ACS-NSQIP) were evaluated as predictors of 30-day death and major complications. The NSQIP database was chosen for its high-quality data on important outcomes and rich characterization of preoperative demographic and clinical predictors for demographically and geographically diverse patients. Least absolute shrinkage and selection operator (LASSO) regression, a type of machine learning that optimizes accuracy and parsimony, was used for model development. Tenfold validation was used to produce C-statistics, a measure of how well models discriminate patients who experience an outcome from those who do not. External validation, which evaluates the generalizability of the models to new data sources and patient groups, was accomplished using data from the Veterans Affairs Surgical Quality Improvement Program (VASQIP). Models previously developed from VASQIP data were also externally validated using NSQIP data to examine the generalizability of their performance with a different group of patients outside the VASQIP context. RESULTS: The models, developed using LASSO regression with diverse clinical (for example, American Society of Anesthesiologists classification, comorbidities) and demographic (for example, age, gender) inputs, had good accuracy in terms of discriminating the likelihood a patient would experience, within 30 days of arthroplasty, a renal complication (C-statistic, 0.78; 95% confidence interval [CI], 0.76-0.80), death (0.73; 95% CI, 0.70-0.76), or a cardiac complication (0.73; 95% CI, 0.71-0.75) from one who would not. By contrast, the models demonstrated poor accuracy for venous thromboembolism (C-statistic, 0.61; 95% CI, 0.60-0.62) and any complication (C-statistic, 0.64; 95% CI, 0.63-0.65). External validation of the NSQIP- derived models using VASQIP data found them to be robust in terms of predictions about mortality and cardiac complications, but not for predicting renal complications. Models previously developed with VASQIP data had poor accuracy when externally validated with NSQIP data, suggesting they should not be used outside the context of the Veterans Health Administration. CONCLUSIONS: Moderately accurate predictive models of 30-day mortality and cardiac complications after elective primary TJA were developed as well as internally and externally validated. To our knowledge, these are the most accurate and rigorously validated TJA-specific prediction models currently available (http://med.stanford.edu/s-spire/Resources/clinical-tools-.html). Methods to improve these models, including the addition of nonstandard inputs such as natural language processing of preoperative clinical progress notes or radiographs, should be pursued as should the development and validation of models to predict longer term improvements in pain and function. LEVEL OF EVIDENCE: Level III, diagnostic study.
BACKGROUND: Existing universal and procedure-specific surgical risk prediction models of death and major complications after elective total joint arthroplasty (TJA) have limitations including poor transparency, poor to modest accuracy, and insufficient validation to establish performance across diverse settings. Thus, the need remains for accurate and validated prediction models for use in preoperative management, informed consent, shared decision-making, and risk adjustment for reimbursement. QUESTIONS/PURPOSES: The purpose of this study was to use machine learning methods and large national databases to develop and validate (both internally and externally) parsimonious risk-prediction models for mortality and complications after TJA. METHODS: Preoperative demographic and clinical variables from all 107,792 nonemergent primary THAs and TKAs in the 2013 to 2014 American College of Surgeons-National Surgical Quality Improvement Program (ACS-NSQIP) were evaluated as predictors of 30-day death and major complications. The NSQIP database was chosen for its high-quality data on important outcomes and rich characterization of preoperative demographic and clinical predictors for demographically and geographically diverse patients. Least absolute shrinkage and selection operator (LASSO) regression, a type of machine learning that optimizes accuracy and parsimony, was used for model development. Tenfold validation was used to produce C-statistics, a measure of how well models discriminate patients who experience an outcome from those who do not. External validation, which evaluates the generalizability of the models to new data sources and patient groups, was accomplished using data from the Veterans Affairs Surgical Quality Improvement Program (VASQIP). Models previously developed from VASQIP data were also externally validated using NSQIP data to examine the generalizability of their performance with a different group of patients outside the VASQIP context. RESULTS: The models, developed using LASSO regression with diverse clinical (for example, American Society of Anesthesiologists classification, comorbidities) and demographic (for example, age, gender) inputs, had good accuracy in terms of discriminating the likelihood a patient would experience, within 30 days of arthroplasty, a renal complication (C-statistic, 0.78; 95% confidence interval [CI], 0.76-0.80), death (0.73; 95% CI, 0.70-0.76), or a cardiac complication (0.73; 95% CI, 0.71-0.75) from one who would not. By contrast, the models demonstrated poor accuracy for venous thromboembolism (C-statistic, 0.61; 95% CI, 0.60-0.62) and any complication (C-statistic, 0.64; 95% CI, 0.63-0.65). External validation of the NSQIP- derived models using VASQIP data found them to be robust in terms of predictions about mortality and cardiac complications, but not for predicting renal complications. Models previously developed with VASQIP data had poor accuracy when externally validated with NSQIP data, suggesting they should not be used outside the context of the Veterans Health Administration. CONCLUSIONS: Moderately accurate predictive models of 30-day mortality and cardiac complications after elective primary TJA were developed as well as internally and externally validated. To our knowledge, these are the most accurate and rigorously validated TJA-specific prediction models currently available (http://med.stanford.edu/s-spire/Resources/clinical-tools-.html). Methods to improve these models, including the addition of nonstandard inputs such as natural language processing of preoperative clinical progress notes or radiographs, should be pursued as should the development and validation of models to predict longer term improvements in pain and function. LEVEL OF EVIDENCE: Level III, diagnostic study.
Authors: E W Steyerberg; F E Harrell; G J Borsboom; M J Eijkemans; Y Vergouwe; J D Habbema Journal: J Clin Epidemiol Date: 2001-08 Impact factor: 6.437
Authors: S F Khuri; J Daley; W Henderson; K Hur; J Demakis; J B Aust; V Chong; P J Fabri; J O Gibbs; F Grover; K Hammermeister; G Irvin; G McDonald; E Passaro; L Phillips; F Scamman; J Spencer; J F Stremple Journal: Ann Surg Date: 1998-10 Impact factor: 12.969
Authors: Yi Mu; Jonathan R Edwards; Teresa C Horan; Sandra I Berrios-Torres; Scott K Fridkin Journal: Infect Control Hosp Epidemiol Date: 2011-09-01 Impact factor: 3.254
Authors: Karl Y Bilimoria; Yaoming Liu; Jennifer L Paruch; Lynn Zhou; Thomas E Kmiecik; Clifford Y Ko; Mark E Cohen Journal: J Am Coll Surg Date: 2013-09-18 Impact factor: 6.113
Authors: Kevin J Bozic; Edmund Lau; Steven Kurtz; Kevin Ong; Harry Rubash; Thomas P Vail; Daniel J Berry Journal: J Bone Joint Surg Am Date: 2012-05-02 Impact factor: 5.284
Authors: Elie F Berbari; Douglas R Osmon; Brian Lahr; Jeanette E Eckel-Passow; Geoffrey Tsaras; Arlen D Hanssen; Tad Mabry; James Steckelberg; Rodney Thompson Journal: Infect Control Hosp Epidemiol Date: 2012-06-20 Impact factor: 3.254
Authors: Shukri F Khuri; William G Henderson; Jennifer Daley; Olga Jonasson; R Scott Jones; Darrell A Campbell; Aaron S Fink; Robert M Mentzer; Leigh Neumayer; Karl Hammermeister; Cecilia Mosca; Nancy Healey Journal: Ann Surg Date: 2008-08 Impact factor: 12.969
Authors: Thomas G Myers; Prem N Ramkumar; Benjamin F Ricciardi; Kenneth L Urish; Jens Kipper; Constantinos Ketonis Journal: J Bone Joint Surg Am Date: 2020-05-06 Impact factor: 5.284
Authors: Mark Alan Fontana; Stephen Lyman; Gourab K Sarker; Douglas E Padgett; Catherine H MacLean Journal: Clin Orthop Relat Res Date: 2019-06 Impact factor: 4.176
Authors: Dmytro Onishchenko; Daniel S Rubin; James R van Horne; R Parker Ward; Ishanu Chattopadhyay Journal: J Am Heart Assoc Date: 2022-07-29 Impact factor: 6.106