| Literature DB >> 36096868 |
Samuel H Waters1, Gari D Clifford2.
Abstract
BACKGROUND: Obtaining medical data using wearable sensors is a potential replacement for in-hospital monitoring, but the lack of data for such sensors poses a challenge for development. One solution is using in-hospital recordings to boost performance via transfer learning. While there are many possible transfer learning algorithms, few have been tested in the domain of EEG-based sleep staging. Furthermore, there are few ways for determining which transfer learning method will work best besides exhaustive testing. Measures of transferability do exist, but are typically used for selection of pre-trained models rather than algorithms and few have been tested on medical signals. We tested several supervised transfer learning algorithms on a sleep staging task using a single channel of EEG (AF7-Fpz) captured from an in-home commercial system.Entities:
Keywords: Domain adaptation; EEG; Machine learning; Sleep staging; Transfer learning; Wearable medical devices
Mesh:
Year: 2022 PMID: 36096868 PMCID: PMC9465946 DOI: 10.1186/s12938-022-01033-3
Source DB: PubMed Journal: Biomed Eng Online ISSN: 1475-925X Impact factor: 3.903
Fig. 1Architecture of base model
Fig. 2Architecture of open-source model, DeepSleepNet [43]. Image courtesy of Supratak et al. [43]
Cross-validation % accuracy (average ± standard deviation) obtained using each algorithm and source dataset to re-train the CNN
| CiCC | ISRUC | MASS | SHHS | WSC | MrOS | |
|---|---|---|---|---|---|---|
| Head Re-train | 78.1 ± 6.3 | 75.9 ± 6.8 | 75.5 ± 6.9 | 76.0 ± 6.5 | 76.5 ± 6.4 | 76.2 ± 8.9 |
| Subspace alignment | 53.7 ± 14.1 | 48.0 ± 12.6 | 43.3 ± 11.8 | 45.9 ± 11.3 | 45.0 ± 12.0 | 36.7 ± 12.9 |
| CORAL | 78.0 ± 6.3 | 76.1 ± 6.3 | 76.8 ± 6.1 | 75.5 ± 6.3 | 76.6 ± 6.0 | 77.1 ± 6.4 |
| Per-Class CORAL | 78.0 ± 5.7 | 75.7 ± 6.8 | 76.4 ± 5.9 | 75.6 ± 6.2 | 76.0 ± 6.1 | 77.2 ± 5.9 |
| DDC | 75.7 ± 8.3 | 77.7 ± 6.7 | 77.4 ± 8.5 | 78.8 ± 7.5 | 75.9 ± 9.1 | 79.0 ± 6.3 |
Cross-validation Cohen’s (average ± standard deviation) obtained using each algorithm and source dataset to re-train CNN
| CiCC | ISRUC | MASS | SHHS | WSC | MrOS | |
|---|---|---|---|---|---|---|
| Head Re-train | 0.689 ± 0.086 | 0.659 ± 0.093 | 0.652 ± 0.092 | 0.661 ± 0.087 | 0.669 ± 0.085 | 0.667 ± 0.115 |
| Subspace alignment | 0.332 ± 0.183 | 0.279 ± 0.151 | 0.146 ± 0.139 | 0.230 ± 0.140 | 0.183 ± 0.137 | 0.153 ± 0.136 |
| Per-Class CORAL | 0.690 ± 0.077 | 0.660 ± 0.090 | 0.666 ± 0.079 | 0.658 ± 0.083 | 0.663 ± 0.081 | 0.681 ± 0.080 |
| CORAL | 0.689 ± 0.085 | 0.663 ± 0.084 | 0.672 ± 0.081 | 0.655 ± 0.084 | 0.669 ± 0.080 | 0.679 ± 0.086 |
| DDC | 0.660 ± 0.109 | 0.686 ± 0.088 | 0.682 ± 0.111 | 0.703 ± 0.098 | 0.663 ± 0.118 | 0.704 ± 0.085 |
Accuracy, , and % of cases where each algorithm outperformed all other algorithms for bespoke CNN
| Algorithm | Average ± standard deviation % accuracy | Average ± standard deviation Cohen’s | % of cases where algorithm was best |
|---|---|---|---|
| Head Re-train | 76.4 ± 7.1 | 0.666 ± 0.095 | 22.2 |
| Subspace alignment | 45.4 ± 13.5 | 0.220 ± 0.163 | 0.7 |
| Per-Class CORAL | 76.5 ± 6.2 | 0.670 ± 0.083 | 9.0 |
| CORAL | 76.7 ± 6.2 | 0.671 ± 0.084 | 14.6 |
|
|
|
Correlations of each transferability measure with CNN accuracy for individual algorithms as well as overall
| Measure | Head Re-train, | CORAL, | Per-Class CORAL, | SA, | DDC, | Overall, |
|---|---|---|---|---|---|---|
| LEEP | − 0.03 | 0.24** | 0.28*** | 0.30*** | − 0.11 | − 0.07 |
| H-score | 0.36*** | 0.47*** | 0.47*** | − 0.05 | − 0.10 | 0.23*** |
| Hypothesis margin | − 0.07 | 0.05 | 0.07 | − 0.11 | 0.15 | 0.08 |
| Silhouette score | − 0.31 | − 0.21* | − 0.21* | 0.13 | 0.07 | − 0.11* |
| MMD, | − 0.25** | − 0.14 | − 0.11 | − 0.36*** | 0.33*** | − 0.11** |
| MMD, | 0.07 | 0.08 | 0.03 | − 0.44*** | 0.05 | − 0.04 |
| MMD, | 0.16 | 0.29*** | 0.25** | − 0.53*** | 0.07 | 0.05 |
| TDAS, | 0.17* | 0.24** | 0.28*** | − 0.17* | 0.01 | 0.11*** |
| TDAS, | − 0.36*** | − 0.42*** | − 0.42*** | 0.19*** | 0.08 | − 0.18*** |
| TDAS, | − 0.35*** | − 0.41*** | − 0.42* | 0.19* | 0.07 | − 0.18*** |
*p < 0.05
**p < 0.01
***p < 0.001
% Accuracy (average ± standard deviation) obtained for each algorithm and source when re-training additional layers of CNN
| CiCC | ISRUC | MASS | SHHS | WSC | MrOS | |
|---|---|---|---|---|---|---|
| Head Re-train | 79.0 ± 7.3 | 78.2 ± 7.2 | 79.0 ± 7.3 | 78.3 ± 7.3 | 79.1 ± 7.3 | 78.9 ± 7.4 |
| Subspace alignment | 56.1 ± 6.7 | 56.3 ± 6.8 | 54.2 ± 6.8 | 57.5 ± 6.8 | 52.5 ± 6.8 | 48.4 ± 6.9 |
| CORAL | 80.0 ± 8.4 | 78.4 ± 8.4 | 79.0 ± 8.4 | 78.2 ± 8.5 | 78.9 ± 8.5 | 78.7 ± 8.7 |
| Per-Class CORAL | 78.2 ± 7.3 | 75.8 ± 7.7 | 78.0 ± 7.7 | 77.6 ± 7.8 | 77.5 ± 7.8 | 77.3 ± 7.8 |
| DDC | 64.3 ± 9.9 | 65.3 ± 9.8 | 59.4 ± 9.8 | 64.2 ± 9.9 | 63.6 ± 9.8 | 62.6 ± 10.0 |
Cohen’s (average ± standard deviation) obtained for each algorithm and source when re-training additional layers of CNN
| CiCC | ISRUC | MASS | SHHS | WSC | MrOS | |
|---|---|---|---|---|---|---|
| Head Re-train | 0.704 ± 0.099 | 0.692 ± 0.098 | 0.703 ± 0.098 | 0.694 ± 0.099 | 0.704 ± 0.098 | 0.702 ± 0.100 |
| Subspace alignment | 0.364 ± 0.003 | 0.389 ± 0.003 | 0.326 ± 0.003 | 0.404 ± 0.003 | 0.298 ± 0.004 | 0.305 ± 0.004 |
| Per-Class CORAL | 0.691 ± 0.058 | 0.657 ± 0.118 | 0.691 ± 0.142 | 0.684 ± 0.162 | 0.681 ± 0.180 | 0.681 ± 0.200 |
| CORAL | 0.718 ± 0.062 | 0.696 ± 0.222 | 0.704 ± 0.303 | 0.693 ± 0.362 | 0.702 ± 0.402 | 0.700 ± 0.438 |
| DDC | 0.522 ± 0.000 | 0.535 ± 0.001 | 0.469 ± 0.002 | 0.525 ± 0.002 | 0.516 ± 0.002 | 0.498 ± 0.003 |
Accuracy, , and % of cases where each algorithm outperformed others when re-training additional layers of CNN
| Algorithm | Average ± standard deviation % accuracy | Average ± standard deviation Cohen’s | % of cases where algorithm was best |
|---|---|---|---|
| Subspace alignment | 54.2 ± 17.0 | 0.348 ± 0.216 | 2.1 |
| Per-Class CORAL | 77.4 ± 6.9 | 0.681 ± 0.096 | 12.5 |
| CORAL | 78.9 ± 6.4 | 0.702 ± 0.086 | 31.3 |
| DDC | 63.2 ± 20.5 | 0.511 ± 0.24 | 13.2 |
Correlations of each transferability measure with CNN accuracy when re-training additional layers of CNN
| Measure | Head Re-train, | CORAL, | Per-Class CORAL, | SA, | DDC, | Overall, |
|---|---|---|---|---|---|---|
| LEEP | − 0.14 | − 0.19* | − 0.12 | 0.08 | 0.03 | − 0.07 |
| H-score | 0.17* | 0.35*** | 0.31*** | − 0.03 | − 0.07 | 0.14*** |
| Hypothesis margin | 0.05 | 0.01 | 0.10 | − 0.15 | − 0.05 | − 0.01 |
| Silhouette score | − 0.05 | − 0.19* | − 0.09 | − 0.09 | 0.00 | − 0.08 |
| MMD, | 0.14 | 0.04 | − 0.03 | − 0.19* | − 0.01 | − 0.01 |
| MMD, | 0.14 | 0.12 | 0.03 | − 0.14 | − 0.02 | 0.03 |
| MMD, | 0.06 | 0.08 | 0.01 | − 0.16 | 0.04 | 0.00 |
| TDAS, | − 0.07 | − 0.05 | − 0.02 | − 0.05 | 0.01 | − 0.03 |
| TDAS, | 0.24** | 0.42*** | 0.28*** | 0.15 | 0.06 | 0.15*** |
| TDAS, | − 0.10 | 0.24** | − 0.09 | 0.10 | 0.01 | − 0.06 |
*p < 0.05
**p < 0.01
***p < 0.001
% Accuracy (average ± standard deviation) obtained for each algorithm and source using DeepSleepNet
| CiCC | ISRUC | MASS | SHHS | WSC | MrOS | |
|---|---|---|---|---|---|---|
| Head Re-train | 72.4 ± 9.6 | 59.9 ± 9.0 | 63.2 ± 9.4 | 62.3 ± 8.9 | 59.9 ± 8.4 | 64.2 ± 10.9 |
| Subspace alignment | 62.0 ± 6.8 | 55.3 ± 9.6 | 53.1 ± 11.4 | 52.4 ± 8.2 | 50.8 ± 9.2 | 53.5 ± 11.4 |
| CORAL | 70.6 ± 9.3 | 60.5 ± 8.8 | 61.8 ± 9.6 | 60.3 ± 8.1 | 58.8 ± 7.4 | 62.9 ± 8.8 |
| Per-Class CORAL | 70.8 ± 8.5 | 57.0 ± 8.8 | 62.0 ± 8.8 | 59.8 ± 7.4 | 57.7 ± 9.2 | 60.6 ± 10.8 |
| DDC | N/A | N/A | N/A | N/A | N/A | N/A |
Cohen’s (average ± standard deviation) obtained for each algorithm and source using DeepSleepNet
| CiCC | ISRUC | MASS | SHHS | WSC | MrOS | |
|---|---|---|---|---|---|---|
| Head Re-train | 0.611 ± 0.127 | 0.432 ± 0.133 | 0.477 ± 0.133 | 0.466 ± 0.123 | 0.432 ± 0.121 | 0.493 ± 0.140 |
| Subspace alignment | 0.446 ± 0.131 | 0.345 ± 0.135 | 0.301 ± 0.155 | 0.293 ± 0.113 | 0.264 ± 0.119 | 0.342 ± 0.150 |
| Per-Class CORAL | 0.584 ± 0.117 | 0.393 ± 0.118 | 0.449 ± 0.127 | 0.424 ± 0.106 | 0.391 ± 0.129 | 0.450 ± 0.137 |
| CORAL | 0.586 ± 0.119 | 0.437 ± 0.127 | 0.452 ± 0.134 | 0.431 ± 0.113 | 0.412 ± 0.107 | 0.478 ± 0.112 |
| DDC | N/A | N/A | N/A | N/A | N/A | N/A |
Accuracy, , and % of cases where each algorithm outperformed others when using DeepSleepNet
| Algorithm | Average ± standard deviation % accuracy | Average ± standard deviation Cohen’s | % of cases where algorithm was best |
|---|---|---|---|
| Head Re-train | 63.7 ± 4.2 | 0.637 ± 0.144 | 63.9 |
| Subspace alignment | 54.5 ± 3.6 | 0.545 ± 0.147 | 1.4 |
| Per-Class CORAL | 61.3 ± 4.6 | 0.613 ± 0.139 | 13.2 |
| CORAL | 62.5 ± 3.9 | 0.625 ± 0.132 | 19.4 |
| DDC | N/A | N/A | N/A |
Correlations of each transferability measure with DeepSleepNet accuracy for individual algorithms as well as overall
| Measure | Head Re-train, | CORAL, | Per-Class CORAL, | SA, | DDC, | Overall, |
|---|---|---|---|---|---|---|
| LEEP | 0.01 | 0.13 | 0.11 | 0.36*** | N/A | 0.15**** |
| H-score | 0.12 | 0.15 | 0.23* | 0.26*** | N/A | 0.19*** |
| Hypothesis margin | − 0.32*** | − 0.16* | − 0.27 | 0.10 | N/A | − 0.16*** |
| Silhouette score | − 0.31*** | − 0.21* | − 0.21* | 0.13 | 0.07 | − 0.11* |
| MMD, | − 0.40*** | − 0.31*** | − 0.46*** | − 0.18* | N/A | − 0.34*** |
| MMD, | − 0.28*** | − 0.29*** | − 0.40*** | − 0.29*** | N/A | − 0.31*** |
| MMD, | − 0.12 | -0.13 | − 0.28*** | - 0.13 | N/A | − 0.17*** |
| TDAS, | 0.48*** | 0.47*** | 0.40*** | 0.30*** | N/A | 0.40*** |
| TDAS, | 0.38*** | 0.37*** | 0.31*** | 0.25** | N/A | 0.33*** |
| TDAS, | − 0.11 | − 0.09 | − 0.10 | 0.07 | N/A | − 0.06 |
*p < 0.05
**p < 0.01
***p < 0.001
Fivefold cross-validation of neural network trained and tested on source datasets without transfer learning
| Dataset | Accuracy (%) | Cohen’s |
|---|---|---|
| SHHS | 80.8 | 0.725 |
| WSC | 83.6 | 0.731 |
| CiCC | 72.7 | 0.634 |
| MrOS | 85.5 | 0.774 |
| ISRUC | 73.7 | 0.66 |
| MASS | 79.8 | 0.713 |