Literature DB >> 32860500

Dexmedetomidine-induced deep sedation mimics non-rapid eye movement stage 3 sleep: large-scale validation using machine learning.

Sowmya M Ramaswamy¹, Maud A S Weerink¹, Michel M R F Struys^1,2, Sunil B Nagaraj³.

Abstract

STUDY
OBJECTIVES: Dexmedetomidine-induced electroencephalogram (EEG) patterns during deep sedation are comparable with natural sleep patterns. Using large-scale EEG recordings and machine learning techniques, we investigated whether dexmedetomidine-induced deep sedation indeed mimics natural sleep patterns.
METHODS: We used EEG recordings from three sources in this study: 8,707 overnight sleep EEG and 30 dexmedetomidine clinical trial EEG. Dexmedetomidine-induced sedation levels were assessed using the Modified Observer's Assessment of Alertness/Sedation (MOAA/S) score. We extracted 22 spectral features from each EEG recording using a multitaper spectral estimation method. Elastic-net regularization method was used for feature selection. We compared the performance of several machine learning algorithms (logistic regression, support vector machine, and random forest), trained on individual sleep stages, to predict different levels of the MOAA/S sedation state.
RESULTS: The random forest algorithm trained on non-rapid eye movement stage 3 (N3) predicted dexmedetomidine-induced deep sedation (MOAA/S = 0) with area under the receiver operator characteristics curve >0.8 outperforming other machine learning models. Power in the delta band (0-4 Hz) was selected as an important feature for prediction in addition to power in theta (4-8 Hz) and beta (16-30 Hz) bands.
CONCLUSIONS: Using a large-scale EEG data-driven approach and machine learning framework, we show that dexmedetomidine-induced deep sedation state mimics N3 sleep EEG patterns. CLINICAL TRIALS: Name-Pharmacodynamic Interaction of REMI and DMED (PIRAD), URL-https://clinicaltrials.gov/ct2/show/NCT03143972, and registration-NCT03143972. © Sleep Research Society 2020. Published by Oxford University Press on behalf of the Sleep Research Society.

Entities: Chemical Disease Gene Species

Keywords: dexmedetomidine; electroencephalogram; machine learning; sedation monitoring; sleep

Year: 2021 PMID： 32860500 PMCID： PMC7879420 DOI： 10.1093/sleep/zsaa167

Source DB: PubMed Journal: Sleep ISSN： 0161-8105 Impact factor: 5.849

Dexmedetomidine is commonly used in intensive care units for sedating critically ill patients. Neurophysiologically, sedation using dexmedetomidine is known to promote sleep like electroencephalogram (EEG) patterns. Studies evaluating this relationship are performed through visual assessments of the spectrogram of the EEG from a limited set of healthy volunteers and does not capture heterogeneity. In this study, we use machine learning techniques and publicly available sleep EEG datasets to identify the sleep stage that is highly correlated with dexmedetomidine-induced sedation. We also show that the presence of spindles is not sufficient to distinguish between awake state and dexmedetomidine-induced sedation levels.

Introduction

Dexmedetomidine is an intravenous α2a adrenergic receptor agonist that is widely used in intensive care units (ICUs) for sedation and in clinical anesthesia during surgical procedures [1]. Recent findings that dexmedetomidine neurophysiologically approximates non-rapid eye movement (NREM) sleep [2-5], and reduces the incidence of delirium [6-8], have led to a renewed interest in administering dexmedetomidine to critically ill patients in ICUs. Investigations looking at the association between dexmedetomidine-induced sedation state and sleep stages relied mainly on the univariate (or visual) analysis of the electroencephalogram (EEG) spectrogram on a limited sample size and lacked external validation [2-5]. In addition, these studies do not account for inter- (and intra-) participant variability commonly seen in sleep and sedation EEG patterns. Dexmedetomidine sedation is characterized by slow oscillations (0–4 Hz) and spindle oscillations (12–16 Hz) similar to sleep spindles [2, 4]. At clinically recommended doses, dexmedetomidine induces a light state of sedation during which the patient is arousable and has hemodynamic and respiratory stability [9]. When combined with propofol, it provides cardiovascular stability, improves patient safety by decreasing the incidence of airway obstruction and hypoxia [10] and demonstrates propofol sparing effect [11]. These properties have led to growing interest in the use of dexmedetomidine in ICUs and during surgical procedures, and understanding the exact mechanism of dexmedetomidine sedation is important. Though studies suggest that dexmedetomidine-induced deep sedation mimics NREM stage 22 and stage 33 sleep, a large-scale validation at a population level is necessary to explore these findings. In our previous study [12], we developed a novel data-repurposing framework to predict anesthetic drug-induced deep sedation state from sleep EEG using deep learning algorithms. This framework was designed to eliminate the necessity for new clinical trials for developing automatic sedation level monitors. We observed that the deep learning algorithm provided the highest accuracy to predict dexmedetomidine-induced deep sedated state (Modified Observer’s Assessment of Alertness/Sedation [MOAA/S] score = 0) using NREM stage 3 sleep EEG. However, the exact reason and features learnt by neural networks in the deep learning model could not be revealed due to its “black-box” nature. To further explore this finding, in this study, we developed a framework using large-scale sleep EEG data and machine learning techniques to validate if dexmedetomidine-induced sedation indeed mimics NREM sleep stages. We trained machine learning algorithms on two large online sleep EEG datasets (N = 8,707) to predict dexmedetomidine-induced sedation levels on an external clinical trial dataset (N = 30). We investigated the robustness of several machine learning models, trained on individual sleep stages, to predict different levels of dexmedetomidine-induced sedation. In this way, we identified the sleep stage that is highly correlated with the dexmedetomidine-induced deep sedation.

Methods

Dataset

Polysomnography (PSG) dataset from three different sources was used in this study: the online Sleep Heart Health Study (SHHS) public dataset (N = 5,804) [13-16], the osteoporotic fractures in men study (MrOS; N = 2,097) [13, 14, 17–19], and the University Medical Center Groningen (UMCG) dexmedetomidine clinical trial dataset (N = 30) [20]. The SHHS study (mean age: 63.1 ± 11.2 years, M = 2,728, F = 2,993) had two rounds of PSG recordings and we only used the first round (SHHS-1) due to the availability of a large number of patients and consistent sampling frequency (125 Hz) in all recordings. Similarly, the MrOS study (mean age 73.5 ± 5.8 years, M = 2,907, F = 0) had two visits and we used EEG recordings from the first visit (visit-1) with successful overnight PSG recordings. Prior permission was obtained from the online portal (www.sleepdata.org) for the analysis. We used EEG recordings from two bipolar central EEG channels: C4/A1 and C3/A2 from the SHHS and the MrOS datasets. In both datasets, each 30 s EEG segment was manually scored into one of the six sleep states using Rechtschaffen and Kales (R&K) guidelines by sleep technicians: Wake (W), NREM stage 1(N1), stage 2 (N2), stage 3 (N3), stage 4 (N4), and rapid eye movement (R). This was converted into five stages according to the current American Academy of Sleep Medicine (AASM) guidelines [21]: W, N1, N2, N3 (N3 + N4), and R. We did not perform sleep stage scoring in the UMCG data, since the goal of this study was not to develop a sleep staging prediction algorithm. A detailed description of UMCG dexmedetomidine dataset (mean age 40.7± 15.8 years, M = 15, F = 15) and experimental protocol can be found elsewhere [20]. In short, the EEG signals contain 17 channels and were recorded at a sampling frequency of 5 kHz from 30 healthy volunteers. Each volunteer received dexmedetomidine administered through target-controlled infusion with increasing targeted effect site concentrations: 1 ng/mL (0–40 minutes), 2 ng/mL (40–90 minutes), 3 ng/mL (90–130 minutes), 5 ng/mL (130–170 minutes), and 8 ng/mL (170–220 minutes), and was ceased afterwards. During this session, the six-level MOAA/S score [22] was used to estimate the sedation level ranging from 5 (awake) to 0 (nonresponsive). An example from the SHHS and the UMCG dataset is shown in Figure 1.

Figure 1.

Example comparison of the multitaper EEG power spectrogram and their corresponding annotations from the SHHS and UMCG datasets. The spectrogram was obtained using multitaper spectral estimation via the chronux toolbox with the following parameters: time–bandwidth product TW = 3, window length T = 4 s (with 3.9 s overlap), number of tapers K = 5, and spectral resolution 2W of 1.5 Hz.

EEG preprocessing

EEG signals from two bipolar channels: C4/A1 and C3/A2 were used in this study as these specific channels were present in all three datasets. First, we used a bandpass filter to filter EEG between 0.1 and 30 Hz and then down sampled to 125 Hz (to match the SHHS dataset sampling frequency). We restricted the upper frequency limit to 30 Hz to minimize the influence of muscular and movement artifacts. We then segmented the EEG into 30 s nonoverlapping epochs corresponding to the duration of the sleep scoring for further analysis. Epochs satisfying any one of the following conditions were considered as artifacts and were not analyzed: (1) amplifier saturation/movement artifacts − abnormally high signal amplitude (>500 µV); (2) loose electrode artifacts − mean amplitude of the first channel = 2 × mean amplitude of the sum of signals. After artifact rejection, in total 5,767,772 and 4,101,684 epochs were obtained in the SHHS and MroS datasets, respectively. Similarly, for the UMCG data, we obtained 10,528 epochs corresponding to different MOAA/S scores. The distribution of the 30 s epochs in the different groups is shown in Figure 2.

Figure 2.

Distribution of (A) 30 s epochs in different sleep stages in SHHS (outer ring), MrOS (inner ring), (B) 30 s epochs in different levels of sedation (MOAA/S scores) in the UMCG datasets used in this study.

Spectral feature extraction

Each 30 s EEG epochs was further divided into 4 s small subepochs (with 0.1 s shift) and the spectrogram was estimated from each subepoch using Thompson’s multitaper spectral estimation method [23] via Chronux toolbox [24] from each channel using the following parameters: length of the window T = 4 s with 3.9 s overlap, time–bandwidth product TW = 3, number of tapers K = 5, and spectral resolution 2W of 1.5 Hz. We estimated 22 spectral features from the spectrogram of each epoch: power in different subbands: delta (0.1–4 Hz; p), theta (4–8 Hz; p), alpha (8–12 Hz; p), spindle (12–16 Hz; p), lower beta (16–20 Hz; p), and upper beta (20–30 Hz; p) bands, total power (0.1–30 Hz; p), normalized by total power— , , , , , , normalized by delta power—, , , , , and normalized by theta power—, , , . We obtained median across channels and then averaged features from 4 s subepochs. Each 30 s EEG epoch (30 × 125 = 3,750 samples), was, therefore, represented by 22 spectral features. We decided to use only spectral features for a straightforward interpretation of the results since spectral analysis is commonly used in the clinical environment for EEG interpretation.

Training and testing

Since there are four sleep stages (N1, N2, N3, and R) and a wake stage (W), we trained four separate models for binary classification: WN1 = trained on W and N1, WN2 = trained on W and N2, WN3 = trained on W and N3, and WR = trained on W and R. Each model was then used to differentiate between awake (MOAA/S = 5) and individual dexmedetomidine-induced hypnotic levels. For example, WN1 was used to differentiate between MOAA/S = 5 and MOAA/S = 4 (AM4), MOAA/S = 5 and MOAA/S = 3 (AM3), and so on for all training and testing combinations. Wake (W) EEG segments from SHHS dataset (and MOAA/S =5 from UMCG dataset) were labeled as 0, EEG segments corresponding to sleep stages (N1, N2, N3, and R) and hypnotic states (MOAA/S = 4, 3, 2, 1, and 0) were labeled as 1. Since the training data were highly imbalanced with unequal distribution of EEG segments in two classes, it can severely bias the performance of machine learning models [25]. To address the issue of class imbalance, we randomly selected an equal number of EEG segments corresponding to the length of the smallest class during training. For example, if there are K1 segments from class 0, K2 segments from class 1, and K1 > K2, then we randomly selected K2 class 1 segments. In this way, we (1) identified which sleep stage is homologous with the dexmedetomidine-induced sedation, and (2) performed external validation by using completely independent datasets for the training and testing. This process is illustrated in Figure 3.

Figure 3.

Illustration of the (A) proposed validation model used in this study and (B) different training and testing combinations performed in this study. Here, the machine learning model is trained in individual sleep states to predict different levels of sedation induced during dexmedetomidine infusion. All features were z-score standardized before training and the testing set features were standardized with respect to the training set before using them for classification.

Internal cross validation

To evaluate the robustness of the prediction performances of machine learning models, we also evaluated the performance of the models using a 10-fold internal cross-validation technique within each dataset, i.e. we divided the data into 10 approximately equal folds. In each fold, we used 90% of the data for training the random forest classifier and the remaining unseen 10% data for testing. Since the training dataset consisted unequal number of epochs from two classes, we created a balanced training set as described in the previous section. In each fold, features in the training set were normalized (by subtracting the mean and dividing by the standard deviation) and the testing set features were normalized with respect to the mean and standard deviation of the training set before using them for classification. This process was repeated until each 10% unseen data was used for testing.

Algorithm selection

Since many features were highly correlated, we used elastic-net (EN) regularization technique [26] for feature selection before using them for classification with the glmnet package [27]. EN technique involves the selection of optimal hyperparameters: ridge coefficient (α) and coefficient shrinkage (λ). Setting α = 0 is equivalent to ridge regression and can result in nonzero weights assigned to nondiscriminatory/irrelevant features due to L2 loss function. Setting α = 1 is equivalent to LASSO regression which used L1 loss function and is not an ideal choice in dataset with highly correlated features. We set α = 0.5 for equal contribution from L1/L2 loss function (EN regression), and selected λ that provided the lowest mean squared error in prediction during training (using 10-fold cross validation). We used three traditional machine learning algorithms: logistic regression, support vector machine with the Gaussian kernel ( = 1, σ = 1) and random forest (500 trees) for classification. We did not perform any model hyperparameter selection since the goal of this study was not to develop an accurate prediction system but to validate a clinical hypothesis. The area under the receiver operator characteristics curve (AUC) was used to evaluate the performance of machine learning models. AUC is commonly used in binary classification problems which provides the performance ability of models to distinguish between two classes. All simulations were performed using MATLAB 2018a scripting language (Natick, MA). All results are reported as mean (95% confidence interval) unless explicitly stated otherwise. To estimate the 95% confidence interval, we used bootstrapping with 1,000 samplings. To test the statistical significance, we performed Wilcoxon rank sum test on the probability output of the machine learning models and p-values <0.05 were considered significant.

Results

Performance of machine learning models

The prediction performance of different models using SHHS and MrOS is summarized in Tables 1 and 2, respectively. The models trained on the WN3 sleep stage provided the best performance to predict dexmedetomidine-induced deep sedation (AM0). The highest prediction was obtained using random forest with an overall AUC = 0.83 (0.78–0.88) and 0.86 (0.85–0.87) using SHHS and MrOS datasets for training, respectively, outperforming other machine learning models.

Table 1.

Mean AUC of the machine learning models to predict different levels of sedation using different sleep stages in SHHS data

		Testing
Model	Training	AM4	AM3	AM2	AM1	AM0
LR	WN1	0.57	0.53	0.58	0.55	0.54
		(0.55–0.60)	(0.51–0.55)	(0.56–0.61)	(0.53–0.57)	(0.52–0.56)
	WN2	0.51	0.56	0.54	0.54	0.55
		(0.50–0.54)	(0.54–0.58)	(0.51–0.56)	(0.52–0.57)	(0.53–0.57)
	WN3	0.56	0.58	0.59	0.62	0.76
		(0.53–0.59)	(0.56–0.60)	(0.54–0.63)	(0.57–0.66)	(0.71–0.78)
	WR	0.51	0.55	0.53	0.52	0.51
		(0.50–0.55)	(0.53–0.57)	(0.50–0.55)	(0.50–0.55)	(0.50–0.54)
SVM	WN1	0.56	0.54	0.52	0.54	0.57
		(0.54–0.58)	(0.52–0.56)	(0.50–0.54)	(0.52–0.56)	(0.55–0.59)
	WN2	0.53	0.52	0.53	0.59	0.57
		(0.51–0.54)	(0.50–0.55)	(0.51–0.54)	(0.58–0.61)	(0.56–0.59)
	WN3	0.55	0.55	0.54	0.60	0.80
		(0.54–0.56)	(0.54–0.57)	(0.52–0.55)	(0.59–0.62)	(0.75–0.84)
	WR	0.51	0.58	0.51	0.55	0.58
		(0.50–0.53)	(0.57–0.60)	(0.50–0.53)	(0.53–0.57)	(0.56–0.60)
RF	WN1	0.58	0.53	0.59	0.51	0.53
		(0.55–0.60)	(0.51–0.56)	(0.57–0.61)	(0.50–0.54)	(0.51–0.55)
	WN2	0.51	0.59	0.59	0.53	0.58
		(0.50–0.54)	(0.57–0.61)	(0.56–0.61)	(0.51–0.55)	(0.56–0.60)
	WN3	0.59	0.56	0.59	0.66	0.83
		(0.56–0.61)	(0.54–0.59)	(0.56–0.62)	(0.64–0.68)	(0.78–0.88)
	WR	0.54	0.57	0.58	0.52	0.61
		(0.52–0.57)	(0.55–0.59)	(0.55–0.60)	(0.50–0.54)	(0.59–0.63)

In all three algorithms, the model trained on WN3 had the highest accuracy to predict deep sedation (MOAA/S = 0). To estimate the 95% confidence interval, we used bootstrapping with 1,000 samplings. LR = logistic regression; SVM = support vector machine; RF = Random forest; WN1 = model trained on wake (W) and N1 sleep state; WN2 = model trained on W and N2 sleep state; WN3 = model trained on W and N3 sleep state; WR = model trained on W and rapid eye movement sleep state; AM4 = model tested to predict MOAA/S score 4; AM3 = model tested to predict MOAA/S score 3; AM2 = model tested to predict MOAA/S score 2; AM1 = model tested to predict MOAA/S score 1; AM0 = model tested to predict MOAA/S score 0.

Table 2.

Mean AUC of the machine learning models to predict different levels of sedation using different sleep stages from MrOS data

		Testing
Model	Training	AM4	AM3	AM2	AM1	AM0
LR	WN1	0.52	0.52	0.52	0.60	0.54
		(0.51–0.54)	(0.50–0.53)	(0.51–0.54)	(0.58–0.61)	(0.52–0.55)
	WN2	0.54	0.55	0.52	0.55	0.60
		(0.53–0.55)	(0.54–0.56)	(0.51–0.54)	(0.53–0.56)	(0.58–0.61)
	WN3	0.55	0.54	0.54	0.61	0.71
		(0.54–0.56)	(0.53–0.55)	(0.52–0.55)	(0.59–0.62)	(0.69–0.72)
	WR	0.53	0.52	0.53	0.51	0.55
		(0.52–0.54)	(0.51–0.53)	(0.52–0.54)	(0.50–0.52)	(0.54–0.57)
SVM	WN1	0.54	0.55	0.52	0.58	0.60
		(0.51–0.57)	(0.53–0.57)	(0.50–0.54)	(0.55–0.60)	(0.58–0.63)
	WN2	0.54	0.51	0.51	0.52	0.58
		(0.51–0.56)	(0.50–0.54)	(0.50–0.53)	(0.50–0.55)	(0.56–0.60)
	WN3	0.58	0.56	0.60	0.67	0.79
		(0.55–0.61)	(0.54–0.58)	(0.58–0.63)	(0.65–0.70)	(0.77–0.80)
	WR	0.55	0.51	0.57	0.54	0.61
		(0.53–0.58)	(0.50–0.53)	(0.54–0.60)	(0.52–0.57)	(0.59–0.63)
RF	WN1	0.53	0.59	0.59	0.60	0.54
		(0.50–0.55)	(0.57–0.61)	(0.57–0.62)	(0.57–0.62)	(0.52–0.56)
	WN2	0.56	0.51	0.51	0.60	0.74
		(0.53–0.59)	(0.50–0.53)	(0.450–0.53)	(0.57–0.62)	(0.72–0.76)
	WN3	0.55	0.59	0.67	0.74	0.86
		(0.53–0.58)	(0.57–0.61)	(0.65–0.69)	(0.72–0.76)	(0.83–0.87)
	WR	0.53	0.57	0.54	0.57	0.57
		(0.50–0.55)	(0.55–0.59)	(0.52–0.57)	(0.55–0.59)	(0.55–0.59)

Similar to SHHS, the model trained on WN3 had the highest accuracy to predict deep sedation (MOAA/S = 0). To estimate the 95% confidence interval, we used bootstrapping with 1,000 samplings. For abbreviations please refer to Table 1.

Mean AUC of the machine learning models to predict different levels of sedation using different sleep stages in SHHS data In all three algorithms, the model trained on WN3 had the highest accuracy to predict deep sedation (MOAA/S = 0). To estimate the 95% confidence interval, we used bootstrapping with 1,000 samplings. LR = logistic regression; SVM = support vector machine; RF = Random forest; WN1 = model trained on wake (W) and N1 sleep state; WN2 = model trained on W and N2 sleep state; WN3 = model trained on W and N3 sleep state; WR = model trained on W and rapid eye movement sleep state; AM4 = model tested to predict MOAA/S score 4; AM3 = model tested to predict MOAA/S score 3; AM2 = model tested to predict MOAA/S score 2; AM1 = model tested to predict MOAA/S score 1; AM0 = model tested to predict MOAA/S score 0. Mean AUC of the machine learning models to predict different levels of sedation using different sleep stages from MrOS data Similar to SHHS, the model trained on WN3 had the highest accuracy to predict deep sedation (MOAA/S = 0). To estimate the 95% confidence interval, we used bootstrapping with 1,000 samplings. For abbreviations please refer to Table 1.

Discriminatory features

Figure 4 is the heatmap of a list of features selected by the EN technique for all training testing combinations. Features that are more discriminative/informative selected by the algorithm are represented by the shaded region. Different features were selected by the EN technique in different sleep stages. The model trained on W and N3 mainly used powers in the delta, theta, and beta bands to predict the dexmedetomidine deep sedation state (AM0). It should be noted that power in the spindle band in both SHHS and MrOS was not discriminatory.

Figure 4.

Heatmap of features selected by the EN regularization algorithm during the different training process. Here, the features selected by the EN algorithm are represented by the dark blue color. Different features were selected in (A) SHHS and (B) MrOS datasets. The power in the spindle band (normalized and un-normalized) was not selected as an important feature in both datasets.

Internal cross-validation performance

Next, we performed 10-fold internal cross validation to differentiate between Wake and N3 sleep stage in the SHHS and MrOS datasets; awake (MOAA/S = 5) and sedated (MOAA/S = 0) state in UMCG data using random forest algorithm. The algorithm classified two groups with an AUC of 0.94 (0.85–0.97), 0.96 (0.84–0.98), and 0.84 (0.79–0.90) in the SHHS, MrOS, and UMCG datasets, respectively.

Performance of individual features

To evaluate the performance of individual spectral EEG features, we performed classification within each dataset to discriminate between two groups: SHHS, MrOS (W vs N3), and UMCG (MOAA/S = 5 vs MOAA/S = 0). Figure 5 summarizes the prediction performance. The following top five features provided the highest discriminatory performance (mean prediction AUC): (0.88), (0.86), (0.83), (0.80), and (0.80) in SHHS; (0.77), (0.75), (0.75), (0.74), and (0.73) in MrOS; (0.75), (0.75), (0.74), and (0.74) in UMCG datasets.

Figure 5.

Prediction performance of individual features to differentiate between wake (W) and individual sleep stages in SHHS (red) and MrOS (blue) datasets; between awake (MOAA/S = 5) and individual sedated states in UMCG (blue) datasets. The prediction performance of power in the spindle band (normalized and un-normalized) to differentiate between awake and different sedation levels was poor (AUCs ≤0.6) in dexmedetomidine when compared with other spectral features. N1 = features tested to differentiate between wake and NREM stage 1 sleep state; N2 = features tested to differentiate between wake and NREM stage 2 sleep state; N3 = features tested to differentiate between wake and NREM stage 3 sleep state; R = features tested to differentiate between wake and rapid eye movement sleep state; M4 = features tested to differentiate between awake (MOAA/S score 5) and MOAA/S score 4; M3 = features tested to differentiate between awake (MOAA/S score 5) and MOAA/S score 3; M2 = features tested to differentiate between awake (MOAA/S score 5) and MOAA/S score 2; M1 = features tested to differentiate between awake (MOAA/S score 5) and MOAA/S score 1; M0 = features tested to differentiate between awake (MOAA/S score 5) and MOAA/S score 0.

Effect of age

To evaluate the effect of age on the prediction, we performed case controlled analysis matched by age. First, we grouped the EEG recordings into three groups in the UMCG data as group 1 = 18–35 years; group 2 = 35–50 years; and group 3 = 50–70 years. We then performed a binary classification between AM0 and WN3 using the random forest model for each group. The results are summarized in Table 3. In all three groups, the prediction performance was similar suggesting that the findings in this study can be generalized to all age groups (except below 18 years due to unavailability of data).

Table 3.

Prediction performance (AUCs) obtained using the random forest model

	Training
Testing	SHHS	MrOS
Group 1	0.81	0.84
	(0.75–0.89)	(0.74–0.90)
Group 2	0.84	0.87
	(0.76–0.91)	(0.72–0.90)
Group 3	0.82	0.86
	(0.76–0.87)	(0.71–0.91)

The binary classification was performed between WN3 and AM0 for individual groups (in UMCG data). Here, group 1 = 18–35 years; group 2 = 35–50 years; and group 3 = 50–70 years. There was no significant difference in the prediction performance between individual groups (p = 0.81). To estimate the 95% confidence interval, we used bootstrapping with 1,000 samplings. To test the statistical significance, we performed Wilcoxon rank sum test on the probability output of the random forest model and p-values <0.05 were considered significant. For abbreviations please refer to Table 1.

Prediction performance (AUCs) obtained using the random forest model The binary classification was performed between WN3 and AM0 for individual groups (in UMCG data). Here, group 1 = 18–35 years; group 2 = 35–50 years; and group 3 = 50–70 years. There was no significant difference in the prediction performance between individual groups (p = 0.81). To estimate the 95% confidence interval, we used bootstrapping with 1,000 samplings. To test the statistical significance, we performed Wilcoxon rank sum test on the probability output of the random forest model and p-values <0.05 were considered significant. For abbreviations please refer to Table 1.

Effect of sex

To evaluate the effect of sex, we used the random forest model trained on the WN3 sleep stage to predict deep sedation separately for males and females. We only used SHHS data for training since MrOS study was conducted only in males. The classification results are summarized in Table 4. While the performance was better when trained and tested within the same sex compared with cross-training and testing, it was not significant (p = 0.76).

Table 4.

Prediction performance (AUCs) obtained using the random forest model across different groups. Though the performance reduced during cross training and testing, it was not significant.

	Training
Testing	Male	Female
Male	0.85	0.81
	(0.74–0.91)	(0.70–0.89)
Female	0.80	0.84
	(0.71–0.90)	(0.72–0.90)

The binary classification was performed between WN3 and AM0 for males and females separately. The model was trained using only SHHS data. Though the performance dropped approximately by 4% during cross-training and testing, it was not significant (p = 0.76). To estimate the 95% confidence interval, we used bootstrapping with 1,000 samplings. To test the statistical significance, we performed Wilcoxon rank sum test on the probability output of the random forest model and p-values <0.05 were considered significant. For abbreviations please refer to Table 1.

Prediction performance (AUCs) obtained using the random forest model across different groups. Though the performance reduced during cross training and testing, it was not significant. The binary classification was performed between WN3 and AM0 for males and females separately. The model was trained using only SHHS data. Though the performance dropped approximately by 4% during cross-training and testing, it was not significant (p = 0.76). To estimate the 95% confidence interval, we used bootstrapping with 1,000 samplings. To test the statistical significance, we performed Wilcoxon rank sum test on the probability output of the random forest model and p-values <0.05 were considered significant. For abbreviations please refer to Table 1.

Discussion

In this study, we developed a framework using machine learning and large-scale EEG datasets to validate a clinical hypothesis: are EEG patterns during dexmedetomidine-induced deep sedation similar to NREM sleep EEG patterns? Results obtained in this study using both linear and nonlinear machine learning algorithms demonstrate that dexmedetomidine-induced deep sedation mimics the N3 sleep stage. Particularly, our best performing model using random forest algorithm predicted deep sedation with AUCs = 0.83 (0.78–0.88) and 0.86 (0.85–0.87) using features estimated from N3 sleep stage EEG in the SHHS and MrOS datasets, respectively. Two strengths of this study are (1) the use of a large number of heterogenous EEG recordings from two sleep studies, and (2) validation on an external dataset using machine learning algorithms. This type of validation is necessary to capture the inter- (and intra-) participant variability seen in the EEG recordings as shown in Figure 6. We believe that the framework and findings of this study using large heterogenous datasets can be generalized to external populations across different age groups and sex.

Figure 6.

Comparison of EEG power spectrogram from three participants during N3 sleep state in SHHS, MrOS and deep sedated state (MOAA/S = 0) in UMCG datasets. Here, the multitaper spectrogram was obtained using all 30 s EEG epochs from N3 and deep sedated state. Large variability can be seen in delta (0–4 Hz) and spindle (12–16 Hz) spectral regions in all three datasets. This suggests that the morphology of spindles seen during the biological NREM stage 3 sleep is different from spindles seen during dexmedetomidine-induced deep sedation. N3 = NREM stage 3 sleep state. Several previous studies have already looked at the association between natural sleep stages and dexmedetomidine-induced sedation states using EEG. In Huupponen et al. [4], EEG from 11 healthy participants undergoing dexmedetomidine sedation, and during physiological sleep in 10 healthy control participants were used to visually compare the sleep spindles between datasets. In this study, authors observe that dexmedetomidine produces a state closely resembling physiological N2 sleep state. However, since self-reported sedation assessment was not performed in these volunteers it is not clear from which sedation state the EEG spindles were used for comparison (these EEG segments were selected at times with high spindle density). In Purdon et al. [28], dexmedetomidine was administered as a low-dose infusion inducing a level of sedation in which the patient responds to minimal auditory or tactile stimulation (which translates to MOAA/S score = 4). It was shown that the dexmedetomidine spindles are consistent with a light state of sedation (MOAA/S score = 4 according to their definition of patient response). In Akeju et al. [5], authors show that the EEG dynamics induced by dexmedetomidine more closely approximates N2 sleep stage but in another study by the same authors [3], it is shown that dexmedetomidine promotes biomimetic N3 sleep stage in dose dependent manner. All these studies were performed through visual analysis (univariate analysis) of EEG spectrogram from limited set of healthy volunteers in a controlled environment which do not capture heterogeneity commonly seen in sleep physiology. Due to this, there are discrepancies between different studies and a large-scale validation of these findings was necessary. We took a data-driven approach to validate these findings and explored how dexmedetomidine-induced sedation states correspond with different sleep stages. Our results using two large external sleep EEG datasets suggest that at population level, dexmedetomidine-induced deep sedation (MOAA/S = 0) is synonymous to N3 sleep stage. Findings of our study are in line with several previous studies that have already demonstrated the relationship between dexmedetomidine-induced and natural sleep EEG patterns. This is due to the close proximity between the mechanisms of NREM sleep induction and how dexmedetomidine acts on brain circuits to induce sedation [29-32]. Using large sleep EEG datasets and machine learning algorithms, we further strengthened this relationship by capturing large heterogeneity seen in sleep EEG patterns. Different anesthetics induce distinct anesthetic states by targeting related neural circuitries [31, 32]. The anesthetic states can be identified and monitored by drug-specific signatures in unprocessed EEG as well as spectrogram [28]. Although the spindles are brief and episodic, dexmedetomidine sedation is characterized by slow oscillations and spindle-like activity with frequency range and spatial distribution similar to propofol-induced frontal alpha oscillations [4, 33, 34]. The EN feature selection algorithm selected different features across different sleep stages to predict deep sedation. Power in the delta, theta, alpha, and beta bands from the N3 sleep stage were selected by the EN algorithm to predict deep sedation state, which supports these findings (see Figure 4). It should be noted that power in the spindle band was not selected by the EN algorithm for prediction. When compared with other spectral features, power in the spindle band (normalized and un-normalized) was also not a good predictor to differentiate between awake and sedated states during dexmedetomidine infusion as a standalone feature as shown in Figure 5. For clinicians, this finding suggests that dexmedetomidine drug tritration should not be performed only by visualizing the presence of spindles in the spectrogram. For manufacturers, it is important that sedation level monitors should not be developed solely based on the spectrogram/spectral features for tracking dexmedetomidine sedation levels. There are several limitations to this study. First, we did not tune the model hyperparameters since the goal of this study was to validate a clinical hypothesis using linear and nonlinear machine learning algorithms instead of developing an accurate prediction model. Though different models performed differently in terms of prediction, all models provided highest performance to predict deep sedation state using N3 sleep stage. Second, we only used spectral features in this study for straightforward analysis and interpretation of results. The inclusion of additional features from time and entropy domain can improve the overall performance of the system for dexmedetomidine-induced sedation level prediction. Third, the results are yet to be validated in pediatric patients/group (<18 years). We could not perform this validation due to the unavailability of the data. Fourth, the SHHS and MroS datasets consisted of some patients with heterogenous pathophysiology and we did not exclude them from the analysis. However, we believe this is not an issue since the sleep stage scoring is performed using standard guidelines without any information about underlying pathophysiology. To summarize, we demonstrate using a data-driven approach that dexmedetomidine-induced EEG patterns mimics NREM stage 3 EEG sleep patterns using linear and nonlinear machine learning algorithms. Our approach also provides a data-driven framework to validate clinical hypotheses at population level using big data and machine learning algorithms.

27 in total

Review 1. Dexmedetomidine: new insights.

Authors: Jean Mantz; Julien Josserand; Sophie Hamada
Journal: Eur J Anaesthesiol Date: 2011-01 Impact factor: 4.330

2. Overview of recruitment for the osteoporotic fractures in men study (MrOS).

Authors: Janet Babich Blank; Peggy Mannen Cawthon; Mary Lou Carrion-Petersen; Loretta Harper; J Phillip Johnson; Eileen Mitson; Romelia Ramírez Delay
Journal: Contemp Clin Trials Date: 2005-10 Impact factor: 2.226

Review 3. Scaling Up Scientific Discovery in Sleep Medicine: The National Sleep Research Resource.

Authors: Dennis A Dean; Ary L Goldberger; Remo Mueller; Matthew Kim; Michael Rueschman; Daniel Mobley; Satya S Sahoo; Catherine P Jayapandian; Licong Cui; Michael G Morrical; Susan Surovec; Guo-Qiang Zhang; Susan Redline
Journal: Sleep Date: 2016-05-01 Impact factor: 5.849

4. Associations between sleep architecture and sleep-disordered breathing and cognition in older community-dwelling men: the Osteoporotic Fractures in Men Sleep Study.

Authors: Terri Blackwell; Kristine Yaffe; Sonia Ancoli-Israel; Susan Redline; Kristine E Ensrud; Marcia L Stefanick; Alison Laffan; Katie L Stone
Journal: J Am Geriatr Soc Date: 2011-11-07 Impact factor: 5.562

5. The Sleep Heart Health Study: design, rationale, and methods.

Authors: S F Quan; B V Howard; C Iber; J P Kiley; F J Nieto; G T O'Connor; D M Rapoport; S Redline; J Robbins; J M Samet; P W Wahl
Journal: Sleep Date: 1997-12 Impact factor: 5.849

6. Effect of Dexmedetomidine Added to Standard Care on Ventilator-Free Time in Patients With Agitated Delirium: A Randomized Clinical Trial.

Authors: Michael C Reade; Glenn M Eastwood; Rinaldo Bellomo; Michael Bailey; Andrew Bersten; Benjamin Cheung; Andrew Davies; Anthony Delaney; Angaj Ghosh; Frank van Haren; Nerina Harley; David Knight; Shay McGuiness; John Mulder; Steve O'Donoghue; Nicholas Simpson; Paul Young
Journal: JAMA Date: 2016-04-12 Impact factor: 56.272

7. Dexmedetomidine promotes biomimetic non-rapid eye movement stage 3 sleep in humans: A pilot study.

Authors: Oluwaseun Akeju; Lauren E Hobbs; Lei Gao; Sara M Burns; Kara J Pavone; George S Plummer; Elisa C Walsh; Tim T Houle; Seong-Eun Kim; Matt T Bianchi; Jeffrey M Ellenbogen; Emery N Brown
Journal: Clin Neurophysiol Date: 2017-10-20 Impact factor: 3.708

8. Spatiotemporal Dynamics of Dexmedetomidine-Induced Electroencephalogram Oscillations.

Authors: Oluwaseun Akeju; Seong-Eun Kim; Rafael Vazquez; James Rhee; Kara J Pavone; Lauren E Hobbs; Patrick L Purdon; Emery N Brown
Journal: PLoS One Date: 2016-10-06 Impact factor: 3.240

9. Combined use of dexmedetomidine and propofol in monitored anesthesia care: a randomized controlled study.

Authors: Kyu Nam Kim; Hee Jong Lee; Soo Yeon Kim; Ji Yoon Kim
Journal: BMC Anesthesiol Date: 2017-03-01 Impact factor: 2.217

10. Predicting Deep Hypnotic State From Sleep Brain Rhythms Using Deep Learning: A Data-Repurposing Approach.

Authors: Sunil Belur Nagaraj; Sowmya M Ramaswamy; Maud A S Weerink; Michel M R F Struys
Journal: Anesth Analg Date: 2020-05 Impact factor: 5.108

5 in total

1. Intranasal Dexmedetomidine for the Treatment of Pre-operative Anxiety and Insomnia: A Prospective, Randomized, Controlled, and Clinical Trial.

Authors: Wen Zeng; Li Chen; Xin Liu; Xujiang Deng; Kuan Huang; Maolin Zhong; Shubao Zhou; Lifang Zhan; Yulu Jiang; Weidong Liang
Journal: Front Psychiatry Date: 2022-05-30 Impact factor: 5.435

2. A randomized placebo-controlled double-blind study of dexmedetomidine on postoperative sleep quality in patients with endoscopic sinus surgery.

Authors: Yu Wu; Yuhua Miao; Xuzhen Chen; Xiaojian Wan
Journal: BMC Anesthesiol Date: 2022-06-01 Impact factor: 2.376

3. Median Effective Dose of Dexmedetomidine Inducing Bradycardia in Elderly Patients Determined by Up-and-Down Sequential Allocation Method.

Authors: Hua Yang; Yu Fu; Fang Deng; Yun Shao; Yu-Gang Lu; Jin-Chao Song
Journal: Int J Med Sci Date: 2022-06-13 Impact factor: 3.642

4. The glymphatic system and subarachnoid hemorrhage: disruption and recovery.

Authors: Stephan Quintin; Arnav Barpujari; Yusuf Mehkri; Jairo Hernandez; Brandon Lucke-Wold
Journal: Explor Neuroprotective Ther Date: 2022-06-21

5. Toward asleep DBS: cortico-basal ganglia spectral and coherence activity during interleaved propofol/ketamine sedation mimics NREM/REM sleep activity.

Authors: Jing Guang; Halen Baker; Orilia Ben-Yishay Nizri; Shimon Firman; Uri Werner-Reiss; Vadim Kapuller; Zvi Israel; Hagai Bergman
Journal: NPJ Parkinsons Dis Date: 2021-08-02

5 in total