| Literature DB >> 35054218 |
Cheng-Yu Tsai1, Yi-Chun Kuan2,3,4,5, Wei-Han Hsu6, Yin-Tzu Lin7, Chia-Rung Hsu2, Kang Lo5, Wen-Hua Hsu8, Arnab Majumdar1, Yi-Shin Liu9, Shin-Mei Hsu5, Shu-Chuan Ho9, Wun-Hao Cheng10, Shang-Yang Lin9, Kang-Yun Lee11, Dean Wu2,3,4,5, Hsin-Chien Lee12, Cheng-Jung Wu5,13, Wen-Te Liu5,9,11,14.
Abstract
Insomnia disorder (ID) and obstructive sleep apnea (OSA) with respiratory arousal threshold (ArTH) phenotypes often coexist in patients, presenting similar symptoms. However, the typical diagnosis examinations (in-laboratory polysomnography (lab-PSG) and other alternatives methods may therefore have limited differentiation capacities. Hence, this study established novel models to assist in the classification of ID and low- and high-ArTH OSA. Participants reporting insomnia as their chief complaint were enrolled. Their sleep parameters and body profile were accessed from the lab-PSG database. Based on the definition of low-ArTH OSA and ID, patients were divided into three groups, namely, the ID, low- and high-ArTH OSA groups. Various machine learning approaches, including logistic regression, k-nearest neighbors, naive Bayes, random forest (RF), and support vector machine, were trained using two types of features (Oximetry model, trained with oximetry parameters only; Combined model, trained with oximetry and anthropometric parameters). In the training stage, RF presented the highest cross-validation accuracy in both models compared with the other approaches. In the testing stage, the RF accuracy was 77.53% and 80.06% for the oximetry and combined models, respectively. The established models can be used to differentiate ID, low- and high-ArTH OSA in the population of Taiwan and those with similar craniofacial features.Entities:
Keywords: in-laboratory polysomnography; insomnia disorder; obstructive sleep apnea; random forest; respiratory arousal threshold
Year: 2021 PMID: 35054218 PMCID: PMC8774350 DOI: 10.3390/diagnostics12010050
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Flowchart of the data collection in the study. The data collection procedure involved included and excluded criteria and eligible data amounts. The data of 1602 patients reporting insomnia as their chief complaint were collected, and 1579 of them were eligible for further analysis. Abbreviation: ID: insomnia disorder; ArTH: respiratory arousal threshold; OSA: obstructive sleep apnea.
Figure 2Schematic workflow. All derived data were first subjected to mean comparison for the three disease groups. Next, all data were divided into training and testing datasets. Five matching learning approaches were employed in this study, namely, a logistic regression (LR) model, k-nearest neighbors (kNN), naive Bayes (NB), random forest (RF), and support vector machine (SVM). Two types of models were established by using these five approaches, including the oximetry model, which was only trained using oximetry parameters, and combined models, which were trained by both oximetry and anthropometric parameters. The trained model with the highest accuracy in the training outcomes for both model types was used to predict the testing dataset. Abbreviations: ID: insomnia disorder; ArTH: respiratory arousal threshold; OSA: obstructive sleep apnea.
Figure 3Model establishment process. All data were separated at a ratio of 80:20 to act as the training–validation dataset and testing dataset. The k-fold cross-validation technique (k = 10) was employed in the training–validation stage to separately develop two types of models through five machine learning approaches. Subsequently, the models with the highest accuracy in oximetry typing and combined parameter typing were respectively used to predict the testing dataset for evaluating the overall accuracy and input feature importance. Abbreviations: ID: insomnia disorder; ArTH: respiratory arousal threshold; OSA: obstructive sleep apnea; LR: logistic regression; kNN: k-nearest neighbors; NB: naive Bayes; RF: random forest; SVM: support vector machine; SpO2-mean: mean level of peripheral arterial oxygen saturation measured using pulse oximetry; SpO2-min: minimum level of peripheral arterial oxygen saturation measured using pulse oximetry; I-SpO2-<90%: index for the ratio of the amount of time during which the peripheral arterial oxygen saturation measured using pulse oximetry is lower than 90% to the sleep period time; ODI-3%-TRT: total number of oxygen desaturation events (>3%) divided by the total recording time; BMI: body mass index.
Demographic characteristics of the participants grouped by arousal threshold criteria.
| Categorical Variable | ID | Low-ArTH OSA | High-ArTH OSA |
|---|---|---|---|
| Sex (male/female) b | 137/267 | 361/263 | 422/129 |
| Age (years) a | 45.91 ± 13.81 #,△ | 52.42 ± 13.88 # | 53.47 ± 13.02 △ |
| BMI (kg/m2) a | 23.22 ± 4.03 #,△ | 26.06 ± 4.52 #,* | 28.08 ± 4.87 *,△ |
| Neck (cm) a | 34.05 ± 3.37 #,△ | 36.74 ± 3.70 #,* | 38.81 ± 3.90 *,△ |
| Waist (cm) a | 80.51 ± 10.25 #,△ | 89.69 ± 10.73 #,* | 96.13 ± 12.30 *,△ |
| Low-ArTH criteria a | |||
| AHI (events/h) | 2.21 ± 1.48 #,△ | 15.52 ± 7.01 #,* | 49.57 ± 21.71 *,△ |
| SpO2-min (%) | 92.44 ± 2.96 #,△ | 86.74 ± 5.26 #,* | 78.63 ± 8.30 *,△ |
| F-hypopnea (%) | 45.25 ± 10.94 #,△ | 91.21 ± 11.18 #,* | 67.28 ± 29.79 *,△ |
| OSA severity b | |||
| Mild, | - | 334 (53.53%) | 30 (5.44%) |
| Moderate, | - | 290 (46.47%) | 38 (6.90%) |
| Severe, | - | - | 483 (87.66%) |
Abbreviations: ID: insomnia disorder; ArTH: respiratory arousal threshold; OSA: obstructive sleep apnea; AHI: apnea–hypopnea index; SpO2-min: minimum level of peripheral arterial oxygen saturation measured using pulse oximetry; F-hypopnea: percentage of respiratory events that were hypopneas. Data are expressed as mean ± standard deviation. a Differences among groups were assessed using the Kruskal–Wallis H test. b Differences among groups were assessed using the chi-square test. # p-value was less than 0.05 between the ID and low-ArTH OSA group. p-value was less than 0.05 between the ID and high-ArTH OSA group. * p-value was less than 0.05 between the low-ArTH OSA and high-ArTH OSA group.
Comparison of the PSG parameter results among the three groups.
| Categorical Variable | ID | Low-ArTH OSA | High-ArTH OSA |
|---|---|---|---|
| Sleep architecture | |||
| Sleep onset (min) | 30.42 ± 36.79 #,△ | 26.72 ± 31.6 # | 24.93 ± 34.11 △ |
| WASO (min) | 59.67 ± 55.54 #,△ | 62.81 ± 49.12 #,*, | 72.11 ± 55.46 *,△ |
| Wake (% of SPT) | 17.75 ± 16.35 △ | 18.63 ± 14.58 * | 21.65 ± 17.21 *,△ |
| NREM (% of SPT) | 71.87 ± 14.37 | 70.97 ± 12.85 | 69.84 ± 15.39 |
| REM (% of SPT) | 10.38 ± 6.51 △ | 10.41 ± 6.43 * | 8.52 ± 5.93 *,△ |
| Oximetry parameters | |||
| SpO2-mean (%) | 97.12 ± 1.12 #,△ | 96.01 ± 1.35 #,* | 94.61 ± 2.13 *,△ |
| ODI-3% (events/h) | 1.82 ± 1.54 #,△ | 12.97 ± 7.61 #,* | 46.99 ± 22.28 *,△ |
| Arousal parameters (events/h) | |||
| ArI | 13.67 ± 9.07 #,△ | 17.34 ± 9.12 #,* | 30.89 ± 18.36 *,△ |
| SpArI | 11.34 ± 7.68 #,△ | 9.49 ± 6.41 #,* | 6.80 ± 6.50 *,△ |
| RArI | 0.62 ± 0.97 #,△ | 5.99 ± 4.35 #,* | 22.22 ± 16.62 *,△ |
| SnArI | 0.13 ± 0.65 #,△ | 0.32 ± 1.02 # | 0.40 ± 1.40 △ |
| LMArI | 1.40 ± 3.11 | 1.38 ± 2.06 | 1.35 ± 2.13 |
Abbreviations: ID: insomnia disorder; ArTH: respiratory arousal threshold; OSA: obstructive sleep apnea; WASO: wake after sleep onset; SPT: sleep period of time; NREM: nonrapid eye movement; REM: rapid eye movement; SpO2-mean: mean level of peripheral arterial oxygen saturation measured using pulse oximetry; ODI-3% (events/h): greater than or equal to the 3% oxygen desaturation index; ArI: arousal index; SpArI: spontaneous arousal index; RArI: respiratory arousal index; SnArI: snoring arousal index; LMArI: limb movement-related arousal index. Data are expressed as mean ± standard deviation. Differences among groups were assessed using the Kruskal–Wallis H test. # p-value was less than 0.05 between the ID and low-ArTH OSA group. p-value was less than 0.05 between the ID and high-ArTH OSA group. * p-value was less than 0.05 between the low-ArTH OSA and high-ArTH OSA group.
Comparison of the cross-validation results of the two model types established using various machine learning approaches.
| Categorical Variable | LR | kNN | NB | RF | SVM |
|---|---|---|---|---|---|
| Training set ( | ID: 332; Low-ArTH OSA: 416; High-ArTH OSA:515 | ||||
| Oximetry model | |||||
| Accuracy (%) | 77.28 | 77.67 | 77.20 | 79.57 | 77.04 |
| Precision (%) | 77.25 | 78.17 | 79.03 | 80.38 | 77.88 |
| Recall (%) | 78.79 | 78.60 | 78.71 | 80.65 | 77.07 |
| F1-score (%) | 77.59 | 78.16 | 77.79 | 80.11 | 77.31 |
| AUC (%) | 91.82 | 89.60 | 91.79 | 92.52 | 90.99 |
| Combined model | |||||
| Accuracy (%) | 79.65 | 71.65 | 77.75 | 80.60 | 78.70 |
| Precision (%) | 79.57 | 72.89 | 79.35 | 81.88 | 79.73 |
| Recall (%) | 81.05 | 71.73 | 78.94 | 81.48 | 78.66 |
| F1-score (%) | 79.97 | 72.05 | 78.34 | 81.19 | 78.98 |
| AUC (%) | 93.17 | 86.96 | 91.16 | 93.40 | 92.01 |
Abbreviations: LR: logistic regression; kNN, k-nearest neighbors; NB: naive Bayes; RF: random forest; SVM: support vector machine; ID: insomnia disorder; ArTH: respiratory arousal threshold; OSA: obstructive sleep apnea; AUC: area under the curve.
Classification performance and feature importance of the selected models (random forest) in the prediction of the testing set.
| Categorical Variable | Oximetry Model | Combined Model |
|---|---|---|
| Testing set ( | ID: 72; Low-ArTH OSA:109; High-ArTH OSA:135 | |
| Accuracy (%) | 77.53 | 80.06 |
| Precision (%) | 78.72 | 80.17 |
| Recall (%) | 78.17 | 81.24 |
| F1-score (%) | 78.14 | 80.41 |
| AUC (%) | 92.24 | 93.61 |
| Feature importance (%) | ||
| SpO2-mean | 5.43 | 6.46 |
| SpO2-min | 14.89 | 14.69 |
| I-SpO2-<90% | 15.10 | 13.40 |
| ODI-3%-TRT | 64.57 | 49.50 |
| Age | - | 3.25 |
| BMI | - | 4.06 |
| Neck | - | 3.29 |
| Waist | - | 4.51 |
| Sex | - | 0.84 |
Abbreviations: ID: insomnia disorder; ArTH: respiratory arousal threshold; OSA: obstructive sleep apnea; AUC: area under the curve; SpO2-mean: mean level of peripheral arterial oxygen saturation measured using pulse oximetry; SpO2-min: minimum level of peripheral arterial oxygen saturation measured using pulse oximetry; I-SpO2-<90%: the ratio of the amount of time during which the peripheral arterial oxygen saturation measured using pulse oximetry is lower than 90% to the sleep period time; ODI-3%-TRT: total number of oxygen desaturation events (>3%) divided by the total recording time; BMI: body mass index.
Figure 4Classification results of the testing set using the selected models (random forest, RF). Confusion matrices indicating the accuracy of the multiclass classification using the selected model (RF). (A) The prediction results based on the participants’ oximetry-related parameters (oximetry model); (B) The outcomes of the classification model with the anthropometric and oximetry-related parameters (combined model). Abbreviation: ID: insomnia disorder; ArTH: respiratory arousal threshold; OSA: obstructive sleep apnea.