| Literature DB >> 35227128 |
Cesar D Lopez1, Venkat Boddapati1, Joseph M Lombardi1, Nathan J Lee1, Justin Mathew1, Nicholas C Danford1, Rajiv R Iyer1, Marc D Dyrszka1, Zeeshan M Sardar1, Lawrence G Lenke1, Ronald A Lehman1.
Abstract
OBJECTIVES: This current systematic review sought to identify and evaluate all current research-based spine surgery applications of AI/ML in optimizing preoperative patient selection, as well as predicting and managing postoperative outcomes and complications.Entities:
Keywords: artificial intelligence; deep learning; machine learning; orthopedic surgery; predictive modeling; spine surgery
Year: 2022 PMID: 35227128 PMCID: PMC9393994 DOI: 10.1177/21925682211049164
Source DB: PubMed Journal: Global Spine J ISSN: 2192-5682
Figure 1.PRISMA flowchart showing systematic review search strategy.
Figure 2.Trends in the annual number of AI/ML publications in spine surgery (2011 to 2020).
Reviewed Studies of Preoperative Patient Selection and Planning in Spine Surgery.
| Author, year | Pathology or procedure | ML algorithms | Prediction outputs | Number of patients | Avg. age | % female | Data source |
|---|---|---|---|---|---|---|---|
| Kalagara, 2018 | Lumbar laminectomy | Boosting/ensemble learning | Readmissions/reoperations | 4030 | 63 | -- | ACS-NSQIP database |
| Stopa, 2019 | Spine fusion | Deep Learning/ANN | Discharge/LOS | 144 | 50 | 45.10 | Single center |
| Ames, 2019 | ASD | Cluster analysis | Pre-op selection/planning | 570 | 56.8 | 78.80 | Multicenter ASD databases |
| Goyal, 2019 | Spine fusion | Regression analysis, boosting/ensemble learning, deep Learning/ANN, decision tree, and Bayesian networks | Discharge/LOS and readmissions/reoperations | 8872 | 57 | 48.50 | ACS-NSQIP database |
| Ogink, 2019 | Spondylolisthesis | Deep learning/ANN, SVM, decision tree, and Bayesian networks | Discharge/LOS | 1868 | 63 | 63.00 | ACS-NSQIP database |
| Kuo, 2018 | Spinal fusion | Regression analysis, SVM, decision tree, and Bayesian networks | Cost prediction | 532 | 62.4 | 58.60 | Single center |
| Lerner, 2019 | Posterior lumbar spinal fusion | Cluster analysis | Pre-op selection/planning | 18770 | 51.3 | 56.10 | IBM MarketScan® commercial database |
| Siccoli, 2019 | Lumbar decompression | Boosting/ensemble learning and decision tree | Discharge/LOS and readmissions/reoperations | 635 | 62 | 48.00 | Single center |
| Chia, 2017 | Cerebral palsy | Deep learning/ANN | Pre-op selection/planning | 242 | -- | -- | Single center |
| Huang, 2019 | ACDF | Bayesian networks, SVM, and regression analysis | Pre-op selection/planning | 321 | -- | -- | Single center |
| Varghese, 2018 | Spinal fusion | Decision tree and regression analysis | Pre-op selection/planning | -- | -- | -- | Single center |
| Karhade, 2018 | Lumbar degeneration | Deep learning/ANN, decision tree, SVM, and Bayesian networks | Discharge/LOS | 5273 | 53 | 46.90 | ACS-NSQIP database |
| Hopkins, 2019 | Posterior lumbar spinal fusion | Deep learning/ANN | Readmissions/reoperations | 5816 | -- | -- | ACS-NSQIP database |
| Ogink, 2019 | Lumbar spinal stenosis | Deep Learning/ANN, decision tree, SVM, and Bayesian networks | Discharge/LOS | 9338 | 67 | 47.30 | ACS-NSQIP database |
| Karnuta, 2019 | Spinal fusion | Bayesian networks | Discharge/LOS and cost prediction | 3807 | -- | 57.80 | New York state SPARCS database |
| Khatri, 2019 | Spinal fusion | Decision tree | Pre-op selection/planning | -- | -- | -- | Single center |
| Bekelis, 2014 | ACDF | Regression analysis | Discharge/LOS and readmissions/reoperations | 2732 | 55.7 | 46.30 | ACS-NSQIP database |
| Assi, 2014 | Scoliosis | Regression analysis | Pre-op selection/planning | 141 | -- | -- | Single center |
Abbreviations: ASD, adult spinal deformity; ACDF, anterior cervical discectomy and fusion; ANN, artificial neural network; SVM, support vector machine; LOS, length of stay; ACS-NSQIP, American College of Surgery-National Surgical Quality Improvement Program; SPARCS, Statewide Planning and Research Cooperative System.
Statistical Comparisons of Reported Model Performance Metrics, by Administrative/Clinical Decision Support Application.
| Administrative or clinical decision support applications | Performance metrics: Mean (SD, N) | |||
|---|---|---|---|---|
| AUC | Accuracy | Sensitivity | Specificity | |
| Preoperative planning and cost prediction | .89 (.08, 11) | 82.2 (4.8, 7) | 70.5 (10.9, 6) | 87.7 (5.1, 6) |
| Discharge, LOS | .80 (.08, 23) | 78.0 (7.7, 9) | 69.1 (19.8, 7) | 76.6 (7.8, 7) |
| Readmissions and reoperations | .67 (.08, 11) | 70.2 (11.8, 8) | 56.0 (16.5, 7) | 59.0 (16.5, 7) |
| ANOVA | ||||
| Tukey post hoc tests | 1 vs 2 ( | 1 vs 3 ( | -- | 1 vs 3 ( |
| 1 vs 3 ( | 2 vs 3 ( | -- | 2 vs 3 ( | |
| 2 vs 3 ( | -- | -- | -- | |
Abbreviations: AUC, area under the curve; SD, standard deviation; N, number of models; LOS, length of stay.
Reviewed Studies of Postoperative Outcome Prediction in Spine Surgery.
| Author/year | Pathology or procedure | ML algorithms | Prediction outputs | Number of patients | Avg age | % Female | Data source |
|---|---|---|---|---|---|---|---|
| Arvind, 2018 | ACDF | Deep learning/ANN, regression analysis, and SVM | Cardiac, VTE, wound infection, and 30-day mortality | 6264 | 53 | 52 | Multi-center database |
| Kim, 2018 | Lumbar decompression | Deep learning/ANN and regression analysis | Cardiac, VTE, wound infection, and 30-day mortality | 6789 | 60 | 55 | ACS-NSQIP database |
| Kim, 2018 | ASD | Deep learning/ANN and regression analysis | Cardiac, VTE, wound infection, and 30-day mortality | 1746 | 60 | 59 | ACS-NSQIP database |
| Karhade, 2019 | ACDF | Deep learning/ANN, SVM, decision tree, regression analysis, and boosting/ensemble learning | Sustained opioid use | 2737 | 51 | 53 | Multi-center database |
| Han, 2019 | General | Regression analysis | Adverse events, cardiac/CHF, neurologic, pulmonary/pneumonia, and overall medical/surgical complication | 331870 | 63 | 54 | IBM MarketScan, CMS Medicaid, Medicare databases |
| Durand, 2018 | ASD | Decision tree | Postoperative blood transfusion | 205 | 54 | 66 | ACS-NSQIP database |
| Karhade, 2019 | Spinal metastatic disease | Deep learning/ANN, regression analysis, SVM, decision tree, and boosting/ensemble learning | 90-day mortality, and 1-year mortality | 732 | 61 | 42 | Single center |
| Scheer, 2017 | ASD | Decision tree | Adverse events and major complications | 557 | 58 | 79 | Multi-center ASD databases |
| Janssen, 2018 | Thoracolumbar spine surgery | Regression analysis | Wound infection | 898 | 52 | 51 | NCI SEER registry |
| Karhade, 2019 | Spinal epidural abscess | Deep Learning/ANN, regression analysis, SVM, decision tree, and boosting/ensemble learning | Mortality: 90-day | 1053 | 59 | 39 | Multi-center database |
| Karhade, 2019 | Lumbar spine surgery | Regression analysis | Sustained opioid use | 8435 | 60 | 46 | Multi-center database |
| Ryu, 2018 | Spinal ependymoma | Decision tree and regression analysis | 5-year mortality and 10-year mortality | 2822 | -- | 47 | NCI SEER registry |
| Khan, 2020 | Degenerative cervical myelopathy (DCM) | Decision tree, regression analysis, SVM, and boosting/ensemble learning | PRO/functional outcomes (SF-36 MCS, PCS) | 193 | 52 | 35 | Multi-center AOSpine CSM clinical trials |
| Staartjes, 2019 | Single-level tubular microdiscectomy for lumbar disc herniation | Deep learning/ANN and regression analysis | Clinical improvement (leg pain, back pain, and functional disability) | 422 | 49 | 49 | Single center |
| Hoffman, 2015 | Cervical spondylotic myelopathy | SVM and regression analysis | PRO/functional outcomes (post-op ODI) | 20 | 60 | 45 | Single center |
| Shamim, 2009 | Lumbar disc surgery | Cluster analysis | Post-op poor outcomes | 501 | 41 | 31 | Single center |
| Azimi, 2014 | Lumbar spinal stenosis | Deep learning/ANN and regression analysis | Post-op patient satisfaction | 168 | 60 | 65 | Single center |
| Azimi, 2015 | Lumbar disk herniation | Deep learning/ANN and regression analysis | Post-op recurrent lumbar disc herniation | 402 | 50 | 54 | Single center |
| Azimi, 2017 | Lumbar disk herniation | Deep learning/ANN | Post-op successful outcomes | 203 | 48 | 53 | Single center |
| Buchlak, 2017 | ASD | Regression analysis | Post-op complications | 136 | 63 | 74 | Single center |
| Karhade, 2018 | Spinopelvic chordoma surgery | Deep Learning/ANN, decision tree, SVM, and Bayesian networks | 5-year mortality | 265 | 64 | 39 | NCI SEER registry |
| Khor, 2018 | Lumbar fusion | Regression analysis | Clinical improvement | 1965 | 61 | 60 | Multi-center database |
| Bekelis, 2014 | ACDF | Regression analysis | Cardiac, VTE, wound infection, and 30-day mortality | 2732 | 56 | 46 | ACS-NSQIP database |
| Siccoli, 2019 | Lumbar decompression | Deep learning/ANN, decision tree, and Bayesian networks | Clinical improvement (6wk, 12wk) | 635 | 62 | 48 | Single center |
| Ames, 2019 | Adult spinal deformity | Regression analysis, decision tree, and boosting/ensemble learning | PRO/functional outcomes (SRS-22R) | 561 | 54.4 | 75.9 | Single center |
Abbreviations: ANN, artificial neural network; SVM, support vector machine; LOS, length of stay; ASD, adult spinal deformity; ACDF, anterior cervical discectomy and fusion; CHF, congestive heart failure; VTE, venous thromboembolism; UTI, urinary tract infection; PRO, patient-reported outcomes; SF-36, short-form 36 questionnaire; MCS, mental health composite score; PCS, physical health composite score; ODI, Oswestry disability index; ACS-NSQIP, American College of Surgery-National Surgical Quality Improvement Program; CMS, centers for Medicare and Medicaid services; NCI SEER, National Cancer Institute Surveillance, Epidemiology, and End Results database; AOSpine CSM, AOSpine North America cervical spondylotic myelopathy study.
Statistical Comparisons of Reported Model Performance Metrics, by Postoperative Prediction/Management Application.
| Postoperative prediction/management applications | Performance metrics: Mean (SD, N) | |||
|---|---|---|---|---|
| AUC | Accuracy | Sensitivity | Specificity | |
| Postoperative cardiovascular complications | .69 (.12, 21) | -- | 81.0 (4.2, 2) | 52.0 (1.4, 2) |
| Other postoperative complications | .68 (.12, 31) | 85.8 (7.9, 4) | 77.6 (4.4, 5) | 51.6 (.5, 5) |
| Postoperative mortality | .82 (.08, 30) | -- | -- | -- |
| Postoperative functional or clinical outcomes | .75 (.09, 30) | 72.2 (11.2, 28) | 73.8 (15.5, 24) | 60.9 (17.5, 24) |
| ANOVA | ||||
| Tukey post hoc tests | 1 vs 3 ( | -- | -- | -- |
| 2 vs 3 ( | -- | -- | -- | |
| 3 vs 4 ( | -- | -- | -- | |
Abbreviations: AUC, area under the curve; SD, standard deviation; N, number of models; LOS, length of stay.
Statistical Comparisons of Reported Model Performance Metrics, by AI/ML Algorithm.
| AI/ML algorithm | Performance metrics: Mean (SD, N) | |||
|---|---|---|---|---|
| AUC | Accuracy | Sensitivity | Specificity | |
| Bayesian network (BN) | .80 (.09, 13) | 76.9 (11.9, 8) | 63.7 (11.0, 4) | 67.4 (17.7, 4) |
| Boosting/ensemble learning (BEL) | .76 (.10, 13) | 74.1 (9.6, 8) | 55.7 (21.7, 7) | 71.7 (11.4, 7) |
| Decision tree (DT) | .77 (.11, 29) | 74.0 (8.7, 13) | 75.4 (13.7, 12) | 62.5 (21.7, 12) |
| Deep learning/artificial neural network (ANN) | .77 (.11, 34) | 83.0 (10.7, 10) | 81.5 (12.1, 8) | 71.8 (10.1, 8) |
| Logistic regression (LR) | .74 (.11, 56) | 70.4 (10.6, 13) | 70.6 (12.4, 19) | 61.0 (12.4, 19) |
| Support vector machines (SVM) | .63 (.18, 17) | 67.5 (12.9, 3) | 72.3 (18.3, 3) | 56.0 (42.9, 3) |
| ANOVA | ||||
| Tukey post hoc tests | BN vs SVM ( | -- | BEL vs ANN ( | -- |
| DT vs SVM ( | -- | -- | -- | |
| ANN vs SVM ( | -- | -- | -- | |
Abbreviations: AUC, area under the curve; SD, standard deviation; N, number of models; AI/ML, artificial intelligence and machine learning.