| Literature DB >> 32782313 |
Gabriele B Papini1,2,3, Pedro Fonseca4,5, Merel M van Gilst4,6, Jan W M Bergmans4,5, Rik Vullings4, Sebastiaan Overeem4,6.
Abstract
A large part of the worldwide population suffers from obstructive sleep apnea (OSA), a disorder impairing the restorative function of sleep and constituting a risk factor for several cardiovascular pathologies. The standard diagnostic metric to define OSA is the apnea-hypopnea index (AHI), typically obtained by manually annotating polysomnographic recordings. However, this clinical procedure cannot be employed for screening and for long-term monitoring of OSA due to its obtrusiveness and cost. Here, we propose an automatic unobtrusive AHI estimation method fully based on wrist-worn reflective photoplethysmography (rPPG), employing a deep learning model exploiting cardiorespiratory and sleep information extracted from the rPPG signal trained with 250 recordings. We tested our method with an independent set of 188 heterogeneously disordered clinical recordings and we found it estimates the AHI with a good agreement to the gold standard polysomnography reference (correlation = 0.61, estimation error = 3±10 events/h). The estimated AHI was shown to reliably assess OSA severity (weighted Cohen's kappa = 0.51) and screen for OSA (ROC-AUC = 0.84/0.86/0.85 for mild/moderate/severe OSA). These findings suggest that wrist-worn rPPG measurements that can be implemented in wearables such as smartwatches, have the potential to complement standard OSA diagnostic techniques by allowing unobtrusive sleep and respiratory monitoring.Entities:
Mesh:
Year: 2020 PMID: 32782313 PMCID: PMC7421543 DOI: 10.1038/s41598-020-69935-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographics of the participants (and for each set).
| All participants | Training set | Validation set | Test set | |
|---|---|---|---|---|
| Participants (#) (male) | 502 (316) | 175 (107) | 75 (48) | 252 (161) |
| Age (years) | 48±16 [18–82] | 49±16 [18–78] | 48±15 [18–79] | 48±16 [18–82] |
| BMI (kg/m2) | 27±5 [15–49] | 28±5 [15–42] | 27±4 [17–40] | 27±5 [15–49] |
| Total sleep time (min) | 411±68 [102–580] | 414±63 [199–580] | 416±57 [243–561] | 406±75 [102–540] |
| REM (% of sleep) | 17±6 [0–35] | 17±6 [0–32] | 17±6 [5–31] | 16±6 [0—35] |
| N1 (% of sleep) | 14±7 [1–56] | 13±7 [2–34] | 13±7 [1–39] | 15±8 [2–56] |
| N2 (% of sleep) | 52±8 [30–93] | 52±9 [32–93] | 53±8 [33–73] | 51±8 [30–78] |
| N3 (% of sleep) | 17±9 [0–49] | 17±9 [0–49] | 16±8 [1–36] | 17±9 [0–40] |
| Sleep efficiency (%) | 82±13 [19–99] | 83±11 [37–98] | 83±9 [57–98] | 81±14 [19–99] |
| AHI (events/h) | 14±16 [0–91] | 12±13 [0–70] | 16±18 [0–71] | 16±17 [0–91] |
| AI (events/h) | 1±3 [0–31] | 1±3 [0–24] | 1±3 [0–20] | 1±3 [0–31] |
| HI (events/h) | 12±12 [0–86] | 10±9 [0–47] | 13±14 [0–64] | 13±13 [0–86] |
| None/mild/moderate/severe OSA cases (#) | 183/150/100/69 | 77/45/38/15 | 27/20/13/15 | 79/85/49/39 |
| Participants with > 1 central apnea per hour (#) | 68 | 24 | 16 | 28 |
| Participants with > 1 mixed apnea per hour (#) | 38 | 9 | 6 | 23 |
| PLMI (events/h) | 3±2 [0–19] | 3±2 [0–9] | 3±2 [0–9] | 3±2 [0–19] |
| Disordered breathing/insomnia/movement disorder/parasomnia/no disorder cases (#) | 241/121/57/35/70 | 93/45/17/13/20 | 44/13/8/4/6 | 104/63/27/18/44 |
Data area shown as mean ± standard deviation [range]. The OSA severity classes are none with AHI < 5, mild with 5 AHI < 15, moderate with 15 AHI < 30, and severe with AHI 30. Sleep efficiency is ratio of total sleep time to time in bed. AI and HI are respectively the number of obstructive apneas and hypopneas per hour of sleep. PLMI is periodic leg movement index calculated as number of periodic leg movements events per hour of sleep. The last row reports the five most common sleep disorder categories in the datasets (according to the primary diagnosis).
Overview of the extracted features.
| Features type | Number of features (number of those with window shorter than literature) | Described in | Features computation window sizes (s) (number of epochs for template comparison) |
|---|---|---|---|
| Arousal probability | 10 (5) | [ | 120#, 30& |
| Frequency analysis | 23 (5) | [ | 300#, 120& |
| Adapted frequency analysis | 11 (5) | [ | 300#, 120& |
| Detrended fluctuation analysis | 5 | [ | 360 |
| Progressive detrended fluctuation analysis | 1 | [ | 60 |
| Windowed detrended fluctuation analysis | 1 | [ | 360 |
| High frequency pole analysis | 4 (2) | [ | 300#, 120& |
| Multi-scale entropy | 20 | [ | 540 |
| Local phase coordination | 7 (5) | [ | 150#, 90& |
| Time analysis and statistics | 74 (37) | [ | 300#, 30& |
| Sample entropy | 2 (1) | [ | 300#, 30& |
| Visibility graph analysis | 13 | [ | 210 |
| Hilbert transformation analysis | 12 (6) | [ | 300#, 30& |
| Amplitude analysis | 5 | [ | 150 |
| Frequency analysis | 3 | [ | 30 |
| Similarity respiratory patterns | 2 | [ | 30 (60) |
| Frequency peak analysis | 3 | [ | 30 |
| Template distance | 1 | [ | 30 (50) |
| Time analysis | 3 | [ | 30 |
| Visibility graph analysis | 4 | [ | 150 |
| Variance | 1 | [ | 30 |
| Sample entropy | 1 | [ | 180 |
| Activity counts | 1 | [ | 30 |
| Sleep stage probabilities | 4 | [ | 30 |
| Features coverage | 1 | – | 30 |
We calculated each feature using the methods proposed in the respective original methodological paper(s). Some of the features were calculated for different window sizes, and their number comprises both calculations (superscript symbols define the association between reference and the window size used).
rPPG quality recording exclusion criteria.
| rPPG quality metrics | Minimum values |
|---|---|
| IBI coverage (%) | 83 |
| Average pulse quality index | 0.85 |
| Median pulse quality index | 0.90 |
| Percentage of pulses with quality index > 0.6 (%) | 89 |
| 25th percentile pulse quality index | 0.80 |
| 75th percentile pulse quality index | 0.96 |
These metrics were calculated for each 30-s epoch and averaged for each recording. We considered the recordings not satisfying one or more of the minimum values characterized by low rPPG quality.
Figure 1The selected model architecture for the RE-epoch detection. The numbers below boxes indicates the dimensions (with 1150 being the maximum number of epochs). The rate indicates the drop out rate, N indicates the number of stacked convolution, F the number of filter, K the kernel size, K* the kernel size with dilation rate of 2, C the number of units of the dense layers and std the standard deviation of the Gaussian noise. The block types are further described in the Supplementary section Deep learning model.
RE-epoch detection performance on the hold-out set with and without the low rPPG quality recordings exclusion.
| Recordings | Number of epochs (RE-epochs) (#) | Prevalence /Bias index[ | Cohen’s kappa (kappa max[ | Accuracy (%) | Sensitivity (%) | Specificity (%) | PPV (%) | PR AUC | ROC AUC |
|---|---|---|---|---|---|---|---|---|---|
| All | 222039 (32456) | 0.73/0.04 | 0.36 (0.80) | 85 | 38 | 94 | 53 | 0.48 | 0.80 |
| Low rPPG quality recordings excluded | 171237 (22305) | 0.75/0.03 | 0.37 (0.84) | 86 | 39 | 94 | 51 | 0.47 | 0.82 |
The performance is for the overall amount of epochs contributing to the AHI (i.e. not Wake and epochs with less than 80% undefined features).
RE-epoch detection performance in the hold-out set with and without the low rPPG quality recordings.
| Recordings | Respiratory event | Number of RE-epochs (% with respect to total RE-epochs) | Sensitivity (%) |
|---|---|---|---|
| All | Hypopnea | 23371 (78) | 34 |
| Obstructive apnea | 2315 (8) | 61 | |
| Mixed apnea | 1251 (4) | 84 | |
| Central apnea | 1222 (4) | 40 | |
| Low rPPG quality recordings excluded | Hypopnea | 16909 (82) | 37 |
| Obstructive apnea | 1544 (7) | 60 | |
| Mixed apnea | 382 (2) | 78 | |
| Central apnea | 572 (3) | 60 |
The performance is for the overall number of epochs contributing to the AHI (i.e. not Wake and epochs with less than 80% undefined features).
RE-epoch performance for epochs characterized or not by limb movement events in the hold-out set with and without the low rPPG quality recordings.
| Recordings | Sleep event | Number of epochs (% of RE-epochs) | Sensitivity (%) | Specificity (%) | PPV (%) |
|---|---|---|---|---|---|
| All | Not limb movements | 126219 (10) | 38 | 96 | 67 |
| Limb movements | 39667 (2) | 38 | 88 | 32 | |
| Low rPPG quality recordings excluded | Not limb movements | 100021 (9) | 39 | 96 | 65 |
| Limb movements | 28428 (2) | 43 | 88 | 33 |
Only the epochs contributing to the AHI calculation are taken into consideration (i.e. not Wake and epochs with less than 80% undefined features).
Figure 2Analysis of the estimated AHI performance after removal of low-quality rPPG recordings. (a) Correlation between reference AHI versus estimated AHI; dashed lines delimit the canonical OSA severity classes and the dash-dotted line is the identity line. (b) Bland–Altman plot of the reference AHI and estimated AHI. The bias and the limits of agreement (i.e. 1.96 times the standard deviation of the difference) are shown as events/h. The red and the green dashed lines represent, respectively, the boundaries to define considerable under- and overestimations.
Screening performance for the estimated AHI with respect to the reference AHI for the canonical screening thresholds for the hold-out set with and without the low rPPG quality recordings.
| Recordings | Screening threshold | Participants above the threshold (#) | Sensitivity (%) | Specificity (%) | PPV (%) | Prevalence/Bias index[ | Cohen’s kappa (kappa max)[ | ROC AUC |
|---|---|---|---|---|---|---|---|---|
| All | AHI | 173 | 72 | 71 | 84 | 0.27/0.10 | 0.39 (0.78) | 0.80 |
| AHI | 88 | 59 | 90 | 75 | 0.38/0.07 | 0.51 (0.82) | 0.82 | |
| AHI | 39 | 41 | 98 | 80 | 0.77/0.07 | 0.49 (0.64) | 0.84 | |
| Low rPPG quality recordings excluded | AHI | 121 | 77 | 72 | 83 | 0.24/0.05 | 0.47 (0.90) | 0.84 |
| AHI | 60 | 62 | 91 | 75 | 0.42/0.06 | 0.55 (0.86) | 0.86 | |
| AHI | 24 | 46 | 98 | 79 | 0.80/0.05 | 0.53 (0.71) | 0.85 |
Figure 3Receiver operating characteristics and confusion matrix of the estimated for the three canonical AHI thresholds after removal of low-quality rPPG recordings. (a) AUC area under each curve; square markers indicate the points in the curve where the estimated AHI threshold for severity classification is equal to the canonical 5, 15 and 30 events/h. (b) OSA severity classes obtained from the AHI (reference severity) and estimated AHI (predicted severity) using the canonical thresholds. In each cell, the percentage per severity is shown (also visually indicated by the color scale) as well as the number of participants.
Figure 4Characteristics of the considerable underestimated participants that might have influenced the underestimation (for all the participants and for those with at least two class difference between reference and estimated OSA severity). Cardiac comorbidities include bundle-branch-block, premature ventricular/atrial contraction and paroxysmal atrial fibrillation. Cardiovascular medications include anti-arrhythmic compound, ACE-inhibiters, beta-blockers and thyroid hormones.