| Literature DB >> 30247731 |
Emma E Laing1, Carla S Möller-Levet2, Derk-Jan Dijk3, Simon N Archer3.
Abstract
Acute and chronic insufficient sleep are associated with adverse health outcomes and risk of accidents. There is therefore a need for biomarkers to monitor sleep debt status. None are currently available. We applied elastic net and ridge regression to transcriptome samples collected in 36 healthy young adults during acute total sleep deprivation and following 1 week of either chronic insufficient (<6 hr) or sufficient sleep (~8.6 hr) to identify panels of mRNA biomarkers of sleep debt status. The size of identified panels ranged from 9 to 74 biomarkers. Panel performance, assessed by leave-one-subject-out cross-validation and independent validation, varied between sleep debt conditions. Using between-subject assessments based on one blood sample, the accuracy of classifying "acute sleep loss" was 92%, but only 57% for classifying "chronic sleep insufficiency." A reasonable accuracy for classifying "chronic sleep insufficiency" could only be achieved by a within-subject comparison of blood samples. Biomarkers for sleep debt status showed little overlap with previously identified biomarkers for circadian phase. Biomarkers for acute and chronic sleep loss also showed little overlap but were associated with common functions related to the cellular stress response, such as heat shock protein activity, the unfolded protein response, protein ubiquitination and endoplasmic reticulum-associated protein degradation, and apoptosis. This characteristic response of whole blood to sleep loss can further aid our understanding of how sleep insufficiencies negatively affect health. Further development of these novel biomarkers for research and clinical practice requires validation in other protocols and age groups.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30247731 PMCID: PMC6335875 DOI: 10.1093/sleep/zsy186
Source DB: PubMed Journal: Sleep ISSN: 0161-8105 Impact factor: 5.849
Figure 1.Protocol. In a crossover design participants entered a constant routine following either 1 week of sufficient (mean ± SEM of PSG assessed TST of 8.56 ± 0.06 hr) sleep or 1 week of insufficient (mean ± SEM of PSG assessed TST of 5.75 ± 0.06 hr) sleep. During the constant routine of 39–41 hr of wakefulness blood samples were collected 3 hourly for transcriptome samples (samples labeled #1 to #10) and hourly for melatonin assessments. At the end of each constant routine participants were given a 12 hr recovery period. Melatonin curves (gray area) represent the average melatonin curve (across all participants) during the constant routine for sufficient and insufficient sleep, respectively.
Figure 2.Overview of our approach to identifying panels of biomarkers and assessing their performance.
Size and performance of biomarker panels for the prediction/classification of different sleep debt variables
| Biomarkers for | Number of samples within the training and independent validation sets | Number of features (unique genes) in the final model | LOSO-CV performance | IV performance | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| All features and elastic net |
|
| All features and elastic net |
|
| All features and elastic net | |||
| Acute sleep loss variables | Prediction of “time awake, between-subject”; samples #1 to #10 | Training = 239 Validation = 233 | 26 (26) | 17 (17) | 59 (58) |
|
|
|
|
|
|
| Prediction of “time awake, within-subject”; sample #2 - sample #1, #3 - #1, #4 - #1, #5 - #1, #6 - #1, #7 - #1, #8 - #1, #9 - #1, #10 - #1 | Training = 185 Validation = 189 | 26 (26) | 9 (9) | 74 (73) |
|
|
|
|
|
| |
| Classification of “wakefulness of more than 24 hr, between-subject”; all samples, based on the predicted “time awake” value | Training = 239 Validation = 233 | 26 (26) | 17 (17) | 59 (58) | ACC = 75% Sn = 61% Sp = 84% MCC = 0.47 | ACC = 74% Sn = 61% Sp = 84% MCC = 0.46 | ACC = 79% Sn = 66% Sp = 89% MCC = 0.57 | ACC = 74% Sn = 50% Sp = 91% MCC = 0.46 | ACC = 73% Sn = 49% Sp = 88% MCC = 0.42 | ACC = 80% Sn = 70% Sp = 87% MCC = 0.59 | |
| Classification of “acute sleep loss, between-subject”; samples #1 vs. sample #9 | Training = 50 Validation = 49 | 26 (26) | 20 (20) | 68 (68) | ACC = 76% Sn = 71% Sp = 79% MCC = 0.51 | ACC = 72% Sn = 76% Sp = 69% MCC = 0.45 | ACC = 76% Sn = 76% Sp = 76% MCC = 0.52 | ACC = 76% Sn = 52% Sp = 93% MCC = 0.51 | ACC = 76% Sn = 57% Sp = 89% MCC = 0.5 | ACC = 92% Sn = 90% Sp = 93% MCC = 0.83 | |
| Classification of “acute sleep loss, within- subject”; difference between sample #2 and #1 vs. difference between sample #10 and #1 | Training = 41 Validation = 43 | 26 (26) | 12 (12) | 32 (31) | ACC = 88% Sn = 87% Sp = 89% MCC = 75% | ACC = 93% Sn = 87% Sp = 100% MCC = 0.86 | ACC = 90% Sn = 83% Sp = 100% MCC = 0.82 | ACC = 77% Sn = 72% Sp = 83% MCC = 0.55 | ACC = 74% Sn = 68% Sp = 83% MCC = 0.51 | ACC = 74% Sn = 72% Sp = 78% MCC = 0.49 | |
| Chronic sleep loss variables | Classification of “chronic sleep insufficiency”; samples #1 or #2 for insufficient sleep vs. samples #1 or #2 for sufficient sleep | Training = 30 Validation = 30 | 407 (407) | 14 (14) | 9(9) | ACC = 60% Sn = 67% Sp = 53% MCC = 0.2 | ACC = 53% Sn = 53% Sp = 53% MCC = 0.07 | ACC = 53% Sn = 60% Sp = 47% MCC = 0.07 | ACC = 57% Sn = 62% Sp = 53% MCC = 0.14 | ACC = 47% Sn = 54% Sp = 41% MCC = -0.05 | ACC = 50% Sn = 77% Sp = 29% MCC = 0.07 |
| Classification of “chronic sleep insufficiency”; samples #9 or #10 for insufficient sleep vs. samples #9 or #10 for sufficient sleep | Training = 34 Validation = 30 | 420 (420) | 7 (7) | 21 (21) | ACC = 59% Sn = 65% Sp = 53% MCC = 0.18 | ACC = 59% Sn = 71% Sp = 47% MCC = 0.18 | ACC = 47% Sn = 47% Sp = 47% MCC = -0.06 | ACC = 53% Sn = 38% Sp = 71% MCC = 0.09 | ACC = 57% Sn = 50% Sp = 64% MCC = 0.14 | ACC = 47% Sn = 44% Sp = 50% MCC = -0.06 | |
| Classification of “sleep decrease/increase”; difference in sample #2 or sample #3 from visit 1 and sample #2 or sample #3 from visit 2 | Training = 10 Validation = 9 | 413 (413) | 13 (13) | 36 (34) | ACC = 70% Sn = 80% Sp = 60% MCC = 0.41 | ACC = 50% Sn = 60% Sp = 40% MCC = 0 | ACC = 50% Sn = 60% Sp = 40% MCC = 0 | ACC = 89% Sn = 100% Sp = 80% MCC = 0.8 | ACC = 78% Sn = 100% Sp = 60% MCC = 0.63 | ACC = 78% Sn = 100% Sp = 60% MCC = 0.63 | |
| Classification of “sleep decrease/increase”; difference in sample #9 or sample #10 from visit 1 and sample #9 or sample #10 from visit 2 | Training = 14 Validation = 15 | 413 (413) | 18 (18) | 62 (62) | ACC = 64% Sn = 57% Sp = 71% MCC = 0.29 | ACC = 71 % Sn = 71% Sp = 71% MCC = 0.43 | ACC = 71% Sn = 71% Sp = 71% MCC = 0.43 | ACC = 67% Sn = 100% Sp = 44% MCC = 0.49 | ACC = 53% Sn =83 % Sp = 33% MCC = 0.18 | ACC = 40% Sn = 83% Sp = 11% MCC = -0.08 | |
Data shown for models based on “UPUS” training and validation sets only.
ACC = classification accuracy, Sn = sensitivity, Sp = specificity, MCC = Matthew’s correlation coefficient.
Figure 3.Prediction performance for “time awake” between- and within-subject. Prediction of “time awake” from one (between-subject) or two (within-subject) transcriptome samples. Blue: samples taken following 1 week of sufficient sleep; gold: samples taken following 1 wk of insufficient sleep. Dashed line indicates the line of unity. Red line indicates a linear regression of the relationship of predicted and observed. Data shown is based on the final “all features” elastic net model trained on all samples within the “UPUS” training set and applied to all samples within the corresponding independent validation set.
Figure 4.Predicted “time awake” value for classifying “wakefulness of >24 hr.” (a) Difference between the mean predicted value of “time awake” for samples collected at x hours awake and samples collected at y hours awake. Difference expressed as Cohen’s d effect size. Data based on predictions made for samples within the “UPUS” training data set when using the “all features” elastic net model for “time awake” trained on all samples within the “UPUS” training data set. No baseline correction applied. (b) Percentage of samples predicted to have a “time awake” value of greater than 24 hr for all observed values of “time awake.” Data based on predictions made for samples within the “UPUS” validation data set when using the “all features” elastic net model for “time awake” trained on all samples within the “UPUS” training data set. (c) Classification performance when classifying a sample within the “UPUS” validation set to one of two classes, “awake > 24 hr,” “awake < 24 hr” using the predicted “time awake” value of a sample using the “all features” elastic net model for “time awake” trained on all samples within the “UPUS” training data set. Black horizontal line represents the decision boundary at 24 hr. ACC = accuracy, SS= samples from sufficient sleep condition, IS = samples from insufficient sleep condition.
Figure 5.Classification performance for “acute sleep loss” between- and within-subject. (a) Classification of a sample to two classes: “acute sleep loss” and “no acute sleep loss.” “Between-subject” refers to the classification of “acute sleep loss” from a single transcriptome sample, collected at 7–8 hr or 31–32 hr of time awake. “Within-subject” refers to the classification of “acute sleep loss” from a baseline-corrected transcriptome sample collected at 10–11 hr or 34–35 hr of time awake. Black horizontal line represents the decision boundary for classifying a sample at a probability of 0.5. Blue: samples taken following 1 wk of sufficient sleep; gold: samples taken following 1 week of insufficient sleep, gray: all samples. Data shown based on the final “all features” elastic net model trained on all samples within the “UPUS” training set and applied to all samples within the corresponding independent validation set. ACC = accuracy. (b) Distribution of mRNA abundance values for features selected as classifiers of the sleep debt variable “acute sleep loss” when applying elastic net to “all features” within the “UPUS” training set (without baseline correction). mRNA abundance data is shown for all samples within the “UPUS” training and validation sets with no baseline correction.
Figure 6.Classification performance for “chronic sleep insufficiency” between- and within-subject. “Between-subject” refers to the classification of a single transcriptome sample to two classes: following sufficient sleep (SS) or following insufficient sleep (IS). Data shown based on the final “all features” elastic net model trained on all samples #1 (or #2) within the “UPUS” training set and applied to all samples #1 (or #2) within the corresponding independent validation set. “Within-subject” refers to the classification of a differential transcriptome sample to two classes: sleep decrease or sleep increase. Data shown based on the final “all features” elastic net model trained on all “visit 1”–“visit 2” samples, using samples #2 (or #3), within the “UPUS” training set and applied to all “visit 1”–“visit 2” samples, using samples #2 (or #3) within the corresponding independent validation set. Black horizontal line represents the decision boundary for classifying a sample at a probability of 0.5. ACC = accuracy.
Figure 7.Comparison of biomarker panels for different sleep debt related variables. Comparisons made between panels of biomarkers identified when using “all features” and “UPUS” as training data to elastic net. (a) Comparison by genes associated with biomarkers that can predict/classify between individuals (i.e. one sample based). Chronic 1 refers to biomarkers for “chronic sleep insufficiency” identified when using sample #1 (or #2). Chronic 9 refers to biomarkers for “chronic sleep insufficiency” identified when using sample #9 (or #10). Circadian phase refers to biomarkers for Circadian phase using one sample as defined in Laing et al. [17]. b) Comparison by genes associated with biomarkers that can predict/classify within individuals (i.e. two sample based). Sleep increase 2 refers to biomarkers for “sleep increase/decrease” using sample #2 (or #3), sleep increase 9 refers to biomarkers for “sleep increase/decrease” using sample #9 (or #10). Circadian phase refers to biomarkers for Circadian phase using two samples as defined in Laing et al. [17]. (c) Comparison by associated “top 10 enriched” GO terms, based on the percentage of genes associated with the panel that are associated with a given GO term. Highlighted columns are biomarker panels with >70% classification accuracy when applied to an independent validation set.