| Literature DB >> 31083643 |
Yang Cao1, Xin Fang2, Johan Ottosson3, Erik Näslund4, Erik Stenberg5.
Abstract
BACKGROUND: Severe obesity is a global public health threat of growing proportions. Accurate models to predict severe postoperative complications could be of value in the preoperative assessment of potential candidates for bariatric surgery. So far, traditional statistical methods have failed to produce high accuracy. We aimed to find a useful machine learning (ML) algorithm to predict the risk for severe complication after bariatric surgery.Entities:
Keywords: bariatric surgery; comparative study; machine learning; prediction; severe complication
Year: 2019 PMID: 31083643 PMCID: PMC6571760 DOI: 10.3390/jcm8050668
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.241
Base line characteristics of the training patients.
| Characteristics | All | No Serious Complication | Having Serious Complication | |
|---|---|---|---|---|
| Age in years, mean ± SD | 41.2 ± 11.2 | 41.1 ± 11.2 | 42.9 ± 10.7 | <0.001 * |
| Sex, | ||||
| Female | 28,682 (75.9%) | 27,766 (75.9%) | 916 (75.1%) | 0.521 † |
| Male | 9129 (24.1%) | 8825 (24.1%) | 304 (24.9%) | |
| BMI in kg/m2, mean ± SD | 42.12 ± 5.66 | 42.13 ± 5.66 | 41.79 ± 5.58 | 0.0355 * |
| WC in cm, mean ± SD | 126.0 ± 14.0 | 126.0 ± 14.0 | 126.2 ± 13.8 | 0.6018 * |
| HbA1c, median (P25, P75) | 38 (35, 42) | 38 (38, 32) | 38 (35, 43) | 0.0090 ‡ |
| Comorbidity, | ||||
| Sleep apnoea | 3792 (10.0%) | 3656 (10.0%) | 136 (11.2%) | 0.186 † |
| Hypertension | 9760 (25.8%) | 9404 (25.7%) | 356 (29.2%) | 0.006 † |
| Diabetes | 5407 (14.3%) | 5204 (14.2%) | 203 (16.6%) | 0.018 † |
| Dyslipidaemia | 3802 (10.1%) | 3667 (10.0%) | 135(11.1%) | 0.233 † |
| Dyspepsia | 3970 (10.5%) | 3803 (10.4%) | 167 (13.7%) | <0.001 † |
| Depression | 5609 (14.8%) | 5409 (14.8%) | 200 (16.4%) | 0.119 † |
| Musculoskeletal pain | 4905 (13.0%) | 4754 (13.0%) | 151 (12.4%) | 0.529 † |
| Previous venous thromboembolism | 918 (2.4%) | 875 (2.39%) | 43 (3.52%) | 0.011 † |
| Revisional surgery | 1367 (3.6%) | 1261 (3.5%) | 106 (8.7%) | <0.001 † |
SD—standard deviation; BMI—body mass index; WC—waist circumference; P25—the 25th percentile; P75—the 75th percentile. * t-test was used; † χ2 test was used; ‡ Mann-Whitney U test was used.
Base line characteristics of the test patients.
| Characteristics | All | No Serious Complication | Having Serious Complication | |
|---|---|---|---|---|
| Age in years, mean ± SD | 41.2 ± 11.5 | 41.2 ± 11.5 | 42.9 ± 11.8 | 0.0423 * |
| Sex, | ||||
| Female | 4832 (77.3%) | 4682 (77.2%) | 150 (79.8%) | 0.411 † |
| Male | 1418 (22.7%) | 1380 (22.8%) | 38 (20.2%) | |
| BMI in kg/m2, mean ± SD | 41.22 ± 5.87 | 41.20 ± 5.89 | 41.95 ± 5.40 | 0.0848 * |
| WC in cm, mean ± SD | 123.3 ± 14.1 | 123.2 ± 14.0 | 126.2 ± 14.7 | 0.0086 * |
| HbA1c, median (P25, P75) | 37 (34, 41) | 37 (34, 41) | 38 (35, 44) | 0.0017 ‡ |
| Comorbidity, | ||||
| Sleep apnoea | 622 (10.0%) | 607 (10.0%) | 15 (8.0%) | 0.359 † |
| Hypertension | 1563 (25.0%) | 1506 (24.8%) | 57 (30.3%) | 0.088 † |
| Diabetes | 761 (12.2%) | 734 (12.1%) | 27 (14.4%) | 0.352 † |
| Dyslipidaemia | 518 (8.3%) | 493 (8.13%) | 25 (13.3%) | 0.011 † |
| Dyspepsia | 645 (10.3%) | 620 (10.2%) | 25 (13.3%) | 0.173 † |
| Depression | 1096 (17.5%) | 1053 (17.4%) | 43 (22.9%) | 0.051 † |
| Musculoskeletal pain | 1315 (21.0%) | 1268 (20.9%) | 47 (25.0%) | 0.176 † |
| Previous venous thromboembolism | 182 (2.9%) | 177 (2.99%) | 5 (2.7%) | 0.834 † |
| Revisional surgery | 61 (1.0%) | 54 (0.9%) | 7 (3.7%) | <0.001 † |
SD—standard deviation; BMI—body mass index; WC—waist circumference; P25—the 25th percentile; P75—the 75th percentile. * t-test was used; † χ2 test was used; ‡ Mann-Whitney U test was used.
Figure 1Performance of the algorithms. AdaBoost—adaptive boosting; LDA—linear discriminant analysis; QDA—quadratic discriminant analysis; ExtRa—extremely randomized; AdaExtra—adaptive boosting extremely randomized; AdaGradient—adaptive boosting gradient; KNN—k-nearest neighbor; SVM—support vector machine; MLP—multilayer perceptron; NN—neural network.
Figure 2Receiver operating characteristic (ROC) curves of logistic regression, LDA, and QDA.
Figure 3ROC curves of tree-based algorithms.
Figure 4ROC curves of KNN and SVM.
Figure 5ROC curves of neural network algorithms.