| Literature DB >> 36050996 |
Chuang Xiong1, Runhan Zhao1, Jingtao Xu1, Hao Liang1, Chao Zhang1, Zenghui Zhao1, Tianji Huang1, Xiaoji Luo1,2.
Abstract
Purpose: Surgical site infection is one of the serious complications after lumbar fusion. Early prediction and timely intervention can reduce the harm to patients. The aims of this study were to construct and validate a machine learning model for predicting surgical site infection after posterior lumbar interbody fusion, to screen out the most important risk factors for surgical site infection, and to explore whether synthetic minority oversampling technique could improve the model performance. Method: This study reviewed 584 patients who underwent posterior lumbar interbody fusion for degenerative lumbar disease at our center from January 2019 to August 2021. Clinical information and laboratory test data were collected from the electronic medical records. The original dataset was divided into training set and validation set in a 1 : 1 ratio. Seven machine learning algorithms were used to develop predictive models; the training set of each model was resampled using synthetic minority oversampling technique. Finally, the model performance was assessed in the validation set.Entities:
Mesh:
Year: 2022 PMID: 36050996 PMCID: PMC9427297 DOI: 10.1155/2022/2697841
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.809
Baseline characteristics of all patients included in the study.
| Total | Non-SSI | SSI |
| |
|---|---|---|---|---|
| Number of patients | 584 | 551 (94.35%) | 33 (5.65%) | |
| Age (years) | 58.36 ± 13.76 | 57.86 ± 13.86 | 66.70 ± 8.60 | <0.001 |
| Operation time | 145.56 ± 40.48 | 144.81 ± 40.06 | 158.06 ± 45.77 | 0.095 |
| EBL | 151.23 ± 81.04 | 150.22 ± 80.95 | 168.18 ± 81.79 | 0.125 |
| Pre WBC | 5.94 ± 1.71 | 5.94 ± 1.72 | 5.86 ± 1.50 | 0.937 |
| Pre RBC | 4.44 ± 0.55 | 4.45 ± 0.54 | 4.25 ± 0.59 | 0.041 |
| Pre Hb | 134.76 ± 16.33 | 134.92 ± 16.30 | 132.15 ± 16.96 | 0.229 |
| Pre erythrocyte volume | 40.94 ± 4.38 | 41.00 ± 4.36 | 39.94 ± 4.67 | 0.179 |
| Pre PLT | 205.67 ± 55.58 | 205.21 ± 54.98 | 213.33 ± 65.29 | 0.619 |
| Pre neutrophil percentage | 58.03 ± 9.47 | 58.10 ± 9.56 | 56.84 ± 7.98 | 0.490 |
| Pre lymphocyte percentage | 31.09 ± 8.42 | 31.10 ± 8.50 | 30.89 ± 6.96 | 0.795 |
| Pre Alb | 42.56 ± 3.47 | 42.80 ± 3.34 | 38.64 ± 3.24 | <0.001 |
| Pre globulin | 24.34 ± 3.65 | 24.29 ± 3.66 | 25.15 ± 3.29 | 0.137 |
| Pre ALT | 22.47 ± 17.56 | 22.66 ± 17.83 | 19.39 ± 12.02 | 0.658 |
| Pre AST | 20.40 ± 9.82 | 20.39 ± 9.96 | 20.55 ± 7.34 | 0.342 |
| Pre creatinine | 68.66 ± 17.69 | 68.47 ± 17.65 | 71.82 ± 18.19 | 0.208 |
| BMI | 24.22 ± 3.33 | 24.20 ± 3.34 | 24.59 ± 3.16 | 0.456 |
| Sex (%) | 0.342 | |||
| Female | 321 (54.97) | 306 (55.54) | 15 (45.45) | |
| Male | 263 (45.03) | 245 (44.46) | 18 (54.55) | |
| Diagnosis (%) | 0.625 | |||
| Lumbar disc herniation | 284 (48.63) | 269 (48.82) | 15 (45.45) | |
| Lumbar spinal stenosis | 137 (23.46) | 127 (23.05) | 10 (30.3) | |
| Lumbar instability/spondylolisthesis | 163 (27.91) | 155 (28.13) | 8 (24.24) | |
| Number of fusion segments | 0.040 | |||
| 1-2 | 549 (94) | 521 (94.56) | 28 (84.8) | |
| ≥3 | 35 (6) | 30 (5.44) | 5 (15.2) | |
| Allograft bone | 0.081 | |||
| No | 554 (94.86) | 525 (95.28) | 29 (87.9) | |
| Yes | 30 (5.14) | 26 (4.72) | 4 (12.1) | |
| Weather | 0.408 | |||
| Cold | 315 (53.94) | 300 (54.45) | 15 (45.45) | |
| Warm | 269 (46.06) | 251 (45.55) | 18 (54.55) | |
| Dural tear | <0.001 | |||
| No | 557 (95.38) | 532 (96.55) | 25 (75.76) | |
| Yes | 27 (4.62) | 19 (3.45) | 8 (24.24) | |
| Hypertension | 0.107 | |||
| No | 417 (71.4) | 398 (72.23) | 19 (57.58) | |
| Yes | 167 (28.6) | 153 (27.77) | 14 (42.42) | |
| Diabetes | <0.001 | |||
| No | 483 (82.71) | 467 (84.75) | 16 (48.48) | |
| Yes | 101 (17.29) | 84 (15.25) | 17 (51.52) | |
| CHD | 0.073 | |||
| No | 555 (95.03) | 526 (95.46) | 29 (87.88) | |
| Yes | 29 (4.97) | 25 (4.54) | 4 (12.12) | |
| RD | 0.002 | |||
| No | 573 (98.12) | 544 (98.73) | 29 (87.88) | |
| Yes | 11 (1.9) | 7 (1.27) | 4 (12.12) | |
| Smoking | 0.174 | |||
| No | 454 (77.74) | 432 (78.40) | 22 (66.67) | |
| Yes | 130 (22.26) | 119 (21.60) | 11 (33.33) | |
| Alcohol | 0.613 | |||
| No | 471 (80.65) | 446 (80.94) | 25 (75.76) | |
| Yes | 113 (19.35) | 105 (19.06) | 8 (24.24) | |
| Osteoporosis | 0.519 | |||
| No | 476 (81.51) | 451 (81.85) | 25 (75.76) | |
| Yes | 108 (18.49) | 100 (18.15) | 8 (24.24) | |
| ASA | 0.010 | |||
| 1-2 | 394 (67.47) | 379 (68.78) | 15 (45.45) | |
| 3 | 190 (32.53) | 172 (31.22) | 18 (54.55) | |
| NYHA | 0.639 | |||
| ≤2 | 560 (95.89) | 529 (96.01) | 31 (93.94) | |
| 3 | 24 (4.11) | 22 (3.99) | 2 (6.06) |
EBL: estimated intraoperative blood loss; Pre WBC: preoperative white blood cell count; Pre RBC: preoperative red blood cell count; Pre Hb: preoperative hemoglobin; Pre erythrocyte volume: preoperative erythrocyte volume; Pre PLT: preoperative platelets; Pre neutrophil percentage: preoperative neutrophil percentage; Pre lymphocyte percentage: preoperative lymphocyte percentage; Pre Alb: preoperative albumin; Pre globulin: preoperative globulin; Pre ALT: preoperative alanine aminotransferase; Pre AST: preoperative aspartate aminotransferase; Pre creatinine: preoperative creatinine; BMI: body mass index; CHD: coronary heart disease; ASA: American Society of Anesthesiologists; NYHA: New York Heart Association Class; SSI: surgical site infection; RD: rheumatic disease.
Figure 1Univariate logistic regression. LSS: lumbar spinal stenosis; LSO: lumbar spondylolisthesis; EBL: estimated blood loss; DT: dural tear; RD: rheumatic disease; CHD: coronary heart disease; Pre WBC: preoperative white blood cell count; Pre RBC: preoperative red blood cell count; Pre Hb: preoperative hemoglobin; Pre erythrocyte volume: preoperative erythrocyte volume; Pre PLT: preoperative platelets; Pre neutrophil percentage: preoperative neutrophil percentage; Pre lymphocyte percentage: preoperative lymphocyte percentage; Pre Alb: preoperative albumin; Pre globulin: preoperative globulin; Pre ALT: preoperative alanine aminotransferase; Pre AST: preoperative aspartate aminotransferase; Pre creatinine: preoperative creatinine; ASA: American Society of Anesthesiologists physical status; NYHA: New York Heart Association Class; BMI: body mass index. ∗P value < 0.05; ∗∗P value < 0.01; ∗∗∗P value < 0.001.
Figure 2Multivariate logistic regression and stepwise logistic regression. DT: dural tear; RD: rheumatic disease; Pre RBC: preoperative red blood cell count; Pre Alb: preoperative albumin; ASA: American Society of Anesthesiologists physical status. ∗P value < 0.05; ∗∗P value < 0.01; ∗∗∗P value < 0.001.
Figure 3Area under the curve of receiver-operating characteristic curve by machine learning models in the validation cohort. (a) Machine learning; (b) SMOTE + machine learning. AUC: area under the receiver-operating characteristic curve; SMOTE: synthetic minority oversampling technique; ada: Boosted Classification Trees; LogitBoost: Boosted Logistic Regression; xgbLinear: Extreme Gradient Boosting; gbm: Stochastic Gradient Boosting; glm: Generalized Linear Model; adaboost: AdaBoost Classification Trees; rf: random forest.
Figure 4Confusion matrix of prediction model for validation set. SSI: surgical site infection; no SSI: no surgical site infection; (a) Boosted Classification Trees; (b) SMOTE+Boosted Classification Trees; (c) Boosted Logistic Regression; (d) SMOTE+Boosted Logistic Regression; (e) Extreme Gradient Boosting; (f) SMOTE+Extreme Gradient Boosting; (g) Stochastic Gradient Boosting; (h) SMOTE+Stochastic Gradient Boosting; (i) Generalized Linear Model, (j) SMOTE+Generalized Linear Model; (k) AdaBoost Classification Trees; (l) SMOTE+AdaBoost Classification Trees; (m) random forest; (n) SMOTE+Random Forest.
Figure 5Model performance. AUC: area under the receiver-operating characteristic curve; SMOTE: Synthetic Minority Oversampling Technique; ada: Boosted Classification Trees; LogitBoost: Boosted Logistic Regression; xgbLinear: Extreme Gradient Boosting; gbm: Stochastic Gradient Boosting; glm: Generalized Linear Model; adaboost: AdaBoost Classification Trees; rf: random forest.
Figure 6Variable importance. Pre Alb: preoperative albumin; DT: dural tear; RD: rheumatic disease.