| Literature DB >> 34290326 |
Jin Youp Kim1,2, Hyoun-Joong Kong3,4, Su Hwan Kim5, Sangjun Lee6,7, Seung Heon Kang8, Seung Cheol Han8, Do Won Kim8, Jeong-Yeon Ji8, Hyun Jik Kim9.
Abstract
Increasing recognition of anatomical obstruction has resulted in a large variety of sleep surgeries to improve anatomic collapse of obstructive sleep apnea (OSA) and the prediction of whether sleep surgery will have successful outcome is very important. The aim of this study is to assess a machine learning-based clinical model that predict the success rate of sleep surgery in OSA subjects. The predicted success rate from machine learning and the predicted subjective surgical outcome from the physician were compared with the actual success rate in 163 male dominated-OSA subjects. Predicted success rate of sleep surgery from machine learning models based on sleep parameters and endoscopic findings of upper airway demonstrated higher accuracy than subjective predicted value of sleep surgeon. The gradient boosting model showed the best performance to predict the surgical success that is evaluated by pre- and post-operative polysomnography or home sleep apnea testing among the logistic regression and three machine learning models, and the accuracy of gradient boosting model (0.708) was significantly higher than logistic regression model (0.542). Our data demonstrate that the data mining-driven prediction such as gradient boosting exhibited higher accuracy for prediction of surgical outcome and we can provide accurate information on surgical outcomes before surgery to OSA subjects using machine learning models.Entities:
Year: 2021 PMID: 34290326 PMCID: PMC8295249 DOI: 10.1038/s41598-021-94454-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Prediction modeling and evaluation process. The total subjects (n = 163) were randomly divided into training and test sets by 7:3 ratio. The training set (n = 115) was used to derive the four prediction models: logistic regression (generalized linear model), random forest, gradient boosting machine, and support vector machine. Then, each model was applied to predict the surgical success for the subjects in test set (n = 48); surgical success was defined as a postoperative AHI < 20 and a ≥ 50% reduction in preoperative AHI. The performance of each model was evaluated by the true reference (surgical outcomes in their postoperative PSG or HSAT). AUC area under curve, PPV positive predictive value, NPV negative predictive value.
Clinical characteristics and management of 163 subjects.
| Variables | Value, No |
|---|---|
| Age, years | 43.5 ± 11.8 |
| Sex | |
| Male | 148 (90.8%) |
| Female | 15 (9.2%) |
| Body mass index, kg/m2 | 26.2 ± 3.2 |
| Presence of septal deviation | 147 (90.2%) |
| Tonsil size grade | |
| Grade 0 | 3 (1.8%) |
| Grade I | 84 (51.5%) |
| Grade II | 43 (26.4%) |
| Grade III | 29 (17.85%) |
| Grade IV | 4 (2.5%) |
| Palate position grade | |
| Grade I | 9 (5.5%) |
| Grade II | 56 (34.3%) |
| Grade III | 78 (47.9%) |
| Grade IV | 19 (11.7%) |
| Friedman stage | |
| Stage I | 12 (7.4%) |
| Stage II | 73 (44.8%) |
| Stage III | 78 (47.9%) |
| Presence of uvula elongation, % | 85 (52.1%) |
| Preoperative AHI, events/h | 36.5 ± 19.1 |
| Ratio of non-supine versus supine AHI | 0.39 ± 0.47 |
| Sleep efficiency, % | 85.8 ± 9.4 |
| Percent of stage REM, % | 20.9 ± 7.3 |
| Ratio of REM versus NREM AHI | 1.8 ± 2.8 |
| Lowest O2 percent, % | 78.1 ± 9.8 |
Sleep time with oxygen saturation ≤ 90%, % | 7.8 ± 13.3 |
| Nasal surgery | 150 (92.0%) |
| Oropharynx surgery | 159 (97.5%) |
| Hypopharynx surgery | 41 (33.6%) |
| Single-level surgery | 10 (6.1%) |
| Multi-level surgery | 153 (93.9%) |
AHI apnea hypopnea index, REM rapid eye movement, NREM non-rapid eye movement.
Values are presented as mean ± SD or numbers (percentages).
Comparison of clinical parameters between responder and non-responder groups.
| Variables | Responder (n = 80) | Non-responder | |
|---|---|---|---|
| Age, years | 42.1 ± 11.7 | 44.8 ± 11.8 | 0.147 |
| Sex, male: female | 72:8 | 76:7 | 0.940 |
| Body mass index, kg/m2 | 25.7 ± 3.1 | 26.8 ± 3.3 | |
| Presence of Septal deviation, % | 67/80 (83.8%) | 80/83 (96.4%) | |
| Tonsil size grade | 1.9 ± 0.9 | 1.5 ± 0.8 | |
| Palate position grade | 2.6 ± 0.8 | 2.7 ± 0.7 | 0.137 |
| Friedman stage | 2.3 ± 0.6 | 2.5 ± 0.6 | 0.110 |
| Presence of uvula elongation, % | 41/80 (51.3%) | 44/83(53.0%) | 0.746 |
| Preoperative AHI, events/h | 33.0 ± 17.3 | 39.9 ± 20.2 | |
| Ratio of non-supine versus supine AHI | 0.41 ± 0.58 | 0.37 ± 0.34 | 0.671 |
| Sleep efficiency, % | 86.6 ± 8.7 | 85.1 ± 10.1 | 0.318 |
| Percent of stage REM, % | 20.9 ± 7.8 | 20.9 ± 6.9 | 0.998 |
| Ratio of REM versus NREM AHI | 2.0 ± 3.3 | 1.7 ± 2.0 | 0.530 |
| Lowest O2 percent, % | 80.9 ± 9.0 | 75.5 ± 9.8 | |
Sleep time with oxygen saturation ≤ 90%, % | 4.2 ± 7.4 | 11.2 ± 16.4 | |
Bold values indicates statistical significance (P < 0.05).
AHI, apnea hypopnea index; REM, rapid eye movement; NREM, non-rapid eye movement.
Values are presented as mean ± SD or numbers (percentages).
Figure 2Importance of the variables in each model. (a) Logistic regression, (b) random forest, (c) gradient boosting, and (d) support vector machine. AHI apnea–hypopnea index, BMI Body mass index, REM rapid eye movement, NREM non-rapid eye movement.
Figure 3Receiver operating characteristic (ROC) curves for machine learning models. Area under the curve (AUC) are shown for each model. The color of lines in the ROC curve represents each algorithm; logistic regression (black); random forest (green); gradient boosting (red); support vector machine (blue). GBM, gradient boosting machine; SVM, support vector machine.
Performance metrics of machine learning models and physician’s prediction.
| Accuracy | Sensitivity | Specificity | PPV | NPV | |
|---|---|---|---|---|---|
| Logistic regression | 0.542 (0.400–0.683) | 0.417 (0.277–0.556) | 0.667 (0.533–0.800) | 0.556 (0.415–0.696) | 0.533 (0.392–0.674) |
| Random forest | 0.667 (0.533–0.800) | 0.542 (0.400–0.683) | 0.792 (0.677–0.907) | 0.722 (0.595–0.849) | 0.633 (0.497–0.770) |
| Gradient boosting | 0.708* (0.580–0.837) | 0.708 (0.580–0.837) | 0.708 (0.580–0.837) | 0.708 (0.580–0.837) | 0.708 (0.580–0.837) |
| Support vector machine | 0.667 (0.533–0.800) | 0.708 (0.580–0.837) | 0.625 (0.488–0.762) | 0.654 (0.519–0.788) | 0.682 (0.550–0.814) |
| Physician’s prediction | 0.522 (0.380–0.663) | 0.238 (0.117–0.358) | 0.795 (0.681–0.909) | 0.528 (0.387–0.669) | 0.520 (0.378–0.661) |
PPV positive predictive value, NPV negative prediction value.
*p < 0.05 compared with logistic regression or physician’s prediction.
The values were presented as mean (95% confidence interval).