Literature DB >> 35793672

Effects of auditory sleep modulation approaches on brain oscillatory and cardiovascular dynamics.

Stephanie Huwiler1, Manuel Carro Dominguez1, Silja Huwyler1, Luca Kiener1, Fabia M Stich1, Rossella Sala1, Florent Aziri1, Anna Trippel1, Christian Schmied2, Reto Huber3,4,5,6, Nicole Wenderoth1,4,7, Caroline Lustenberger1,3,4.   

Abstract

Slow waves, the hallmark feature of deep nonrapid eye movement sleep, do potentially drive restorative effects of sleep on brain and body functions. Sleep modulation techniques to elucidate the functional role of slow waves thus have gained large interest. Auditory slow wave stimulation is a promising tool; however, directly comparing auditory stimulation approaches within a night and analyzing induced dynamic brain and cardiovascular effects are yet missing. Here, we tested various auditory stimulation approaches in a windowed, 10 s ON (stimulations) followed by 10 s OFF (no stimulations), within-night stimulation design and compared them to a SHAM control condition. We report the results of three studies and a total of 51 included nights and found a large and global increase in slow-wave activity (SWA) in the stimulation window compared to SHAM. Furthermore, slow-wave dynamics were most pronouncedly increased at the start of the stimulation and declined across the stimulation window. Beyond the changes in brain oscillations, we observed, for some conditions, a significant increase in the mean interval between two heartbeats within a stimulation window, indicating a slowing of the heart rate, and increased heart rate variability derived parasympathetic activity. Those cardiovascular changes were positively correlated with the change in SWA, and thus, our findings provide insight into the potential of auditory slow wave enhancement to modulate cardiovascular restorative conditions during sleep. However, future studies need to investigate whether the potentially increased restorative capacity through slow-wave enhancements translates into a more rested cardiovascular system on a subsequent day. © Sleep Research Society 2022. Published by Oxford University Press on behalf of the Sleep Research Society.

Entities:  

Keywords:  auditory stimulation; cardiovascular recovery; heart rate variability; slow waves

Mesh:

Year:  2022        PMID: 35793672      PMCID: PMC9453626          DOI: 10.1093/sleep/zsac155

Source DB:  PubMed          Journal:  Sleep        ISSN: 0161-8105            Impact factor:   6.313


Auditory stimulation is a promising approach to modulate the hallmark brain oscillation of deep nonrapid eye movement sleep, the slow waves. Although many different stimulation approaches exist, the lack of a direct comparison of the effects within a single sleep period limits its best use in clinical and nonclinical settings. We demonstrate that auditory stimulation consistently enhances slow-wave activity and that the dynamic slow-wave response can be shaped by volume modulations. Additionally, besides only considering direct brain responses, we show immediate stimulation effects on cardiovascular dynamics and that the amount of slow-wave enhancement is related to a slowing of the heart rate. Thus, we provide a target for potential therapy options to increase cardiovascular recovery.

Introduction

Sleep represents a powerful system for promoting brain and body health. It is suggested to play a role in a plethora of functions such as cleaning away toxic by-products [1-3], synaptic homeostasis [4], memory consolidation [5-11], metabolic [12], and cardiovascular functions [13-16], and body core tissue turnover [17]. Particularly, nonrapid eye movement (NREM) sleep with its large-amplitude, low-frequency slow waves has been proposed to guide those beneficial effects (e.g. reviewed in ref. 18). Periods of neuronal activity are reflected in the slow-wave up-phase and periods of neuronal silence are reflected by the down-phase of slow waves [19], thereby coordinating the temporal interplay between thalamocortical sleep spindles and hippocampal sharp-wave ripples, which has for instance been shown to support long-term memory retention [20, 21]. Nevertheless, whether slow waves are an indispensable driver for the maintenance of a healthy brain and body remains still largely unexplored. To elucidate the functional role of slow waves for brain and body functions, modulation of these oscillations is needed. Over the last few years, especially auditory stimulation has emerged as a promising, noninvasive, and feasible approach to selectively modulate slow waves during deep sleep [9, 22–24]. However, various stimulation protocols exist, leading to inconsistent findings on behavioral outcomes (e.g. reviewed in ref. 25) and comparisons of those approaches on efficacy to selectively enhance or decrease slow waves are missing. Ngo and colleagues [9] were the first to report that targeting the ascending up-phase of ongoing slow waves seems to be important to elicit improvements in overnight memory consolidation. Down-phase stimulation on the other hand was shown to rather interfere with slow waves and the consolidation of declarative and motor memory [9, 26]. However, in addition to selecting an appropriate target phase of the auditory stimuli, the number of stimulations in a sequence is variable, as, e.g. a two tones stimulation protocol with a subsequent stimulation break afterwards [9, 23], or a windowed approach where auditory stimulation was only presented during an ON window of a predefined length [7, 8, 22]. Besides the mentioned procedures that rely all to a certain degree on the phase and/or presence of slow waves (closed-loop stimulation), completely open-loop auditory stimulation has been shown to enhance slow waves as well [11, 27]. An additional parameter to consider is the volume of stimulation and whether the stimuli are played through headphones or with loudspeakers. Additionally, some studies used fixed volumes between 50 and 60 dB [9, 23, 28], or individual and/or adaptive volumes ranging between 30 and 60 dB [10, 11, 22]. Although many stimulation approaches have been applied, auditory stimulation is still in its infancy. Thus, the full potential of auditory stimulation has not been exploited and more profound understanding of its effects is needed for this purpose. Furthermore, it is currently unknown whether auditory stimulation efficacy remains stable across a sleep cycle and whether the stimulation efficacy is even maintained across several seconds of stimulation. To advance the understanding of auditory slow-wave modulation, we propose here a novel approach to compare different auditory stimulation conditions within a single sleep period using a windowed 10 s stimulation ON (auditory stimulation played) followed by 10 s OFF (no auditory stimulation played) approach. This within-night design eliminated any between-night variability, which is observed as pronounced fluctuations of for instance sleep architecture or depth of NREM sleep across multiple nights within the same subject. Therefore, our setup allows for controlled comparisons between conditions. Because the ON window only starts if a set of stimulation prerequisites such as stable NREM sleep are met and a within-night SHAM control can be implemented, our design allows for a direct comparison of stimulation effects using a larger parameter space. Furthermore, this windowed approach entails many advantages compared to continuous approaches as underlying temporal dynamical brain and body responses can be analyzed. Thus, averaging across a longer time period can be avoided. Furthermore, such a windowed approach had recently been shown to be more efficient [29] in enhancing SWA compared to a continuous approach; however, this has been shown between nights only. Beyond the elicited responses of the auditory stimulation on brain activity, cardiovascular parameters such as 5 min heart rate variability (HRV) [22, 30] were shown to be altered upon slow wave enhancement. During sleep, the heart is mainly under the control of the autonomic nervous system (ANS) dominated by a cyclic pattern with parasympathetic predominance during NREM sleep [31] and sympathetic predominance during rapid eye movement (REM) sleep [32]. The strong positive correlation between SWA (power spectral density in the slow-wave frequency band) and parasympathetic activity [33] indicates a link between slow waves and the resting function of the parasympathetic nervous system. However, whether slow waves play a functional role in the restorative parasympathetically dominated control of the cardiovascular system remains to be elucidated. Human ANS activity, and especially parasympathetic activity, can be indirectly derived by HRV parameters [34] that are calculated based on the intervals between successive normal heartbeats (RR intervals). However, the ECG signal from which the HRV parameters are derived is continuous. Thus, temporarily more precise measures such as instantaneous heart rate (IHR, inverse of RR intervals) may suit better to reflect immediate cardiovascular dynamics as an outcome of altered cardiac autonomic regulation upon auditory slow-wave modulation. Here, we (1) compared widely used auditory stimulation approaches on their potential to modulate slow waves and (2) investigated whether auditory stimulation modulates not only slow waves but also cardiovascular dynamics during the time when auditory stimuli were played. Additionally, we explored the effects of auditory stimulations on sleep stage shifts and arousals and tested whether the timing of the stimulations within a night influences the effects on slow-wave modulation by splitting the night into the first 4 hr of stimulation and the remaining hours of stimulation. Altogether, we aimed to provide a framework about efficient stimulation conditions for future studies investigating the functional role of slow waves for the brain and body.

Methods

Participants

Overall, 63 healthy male participants were enrolled in three studies (study 1: n = 23, age = 40.25 ± 13.69 years; study 2: n = 9, age = 50.36 ± 6.74 years; study 3: n = 31, age = 45.48 ± 9.78 years) that served as preparation studies to optimize the auditory stimulation settings of the main trial (registered at ClinicalTrials.gov (NCT04166916)). All participants were nonsmokers, reported a regular sleep-wake rhythm, a body mass index between 17 and 30, and none of the participants had a presence of psychiatric/neurological diseases, presence or suspicion of sleep disorders, or presence of clinically significant concomitant diseases. Participants taking on-label sleep medication or medication affecting the cardiovascular system (e.g. beta-blockers) were further excluded. We recruited the participants from the community using advertisements on different platforms and were supported by the ETH Alumni Association. The study was approved by the cantonal Ethics Committee Zurich. All participants provided written informed consent before participation and received monetary compensation for their participation. For this analysis, we excluded seven participants of study 3 because of either severe sleep apnea (n = 2), high blood pressure (n = 1) meaning direct exclusion for the main trial, or very poor visually detected sleep quality in the nightly spectrogram (n = 4, e.g. almost complete night covered with alpha activity), and thus, no sleep scoring was performed. The last four participants had trouble falling asleep in the sleep lab and thus, almost no stimulations occurred. Because study 3 was a proof of principle about a dose-dependency effect of volume of stimulation and to directly compare those volumes (see Auditory Stimulation Conditions section), we chose a subset of participants from this study in order to maximize the effect while using identical volumes for all participants. Thus, we decided to exclude five participants that did not show a Hilbert Amplitude response to any of the sound volumes to isolate the effect in responders. Why those participants did not show a response to the tones remains to be explored and personalized sound settings may help to overcome this issue. Therefore, the included sample size of study 3 is n = 19 (age: 45.29 ± 11.00 years). Together with the participants of study 1 (n = 23) and study 2 (n = 9), we included in total of 51 nights in this analysis.

Experimental procedure

All participants underwent a screening period of a first phone screening where we verified the eligibility of participants. Thereafter, participants were invited to our sleep laboratory. All participants were asked to adhere to a regular sleep rhythm within two nights before the screening night, abstain from caffeine after 15:00 hr, naps, alcohol, and sports on the day of the screening night. Compliance was controlled by sleep logs and questionnaires. After arrival at our sleep lab, participants underwent a basic physiological assessment by measuring height, weight, waist circumference, and hip circumference. Thereafter, participants answered several questionnaires about demographics, health status, handedness, chronotype, sleep habits and sleep quality, and daytime sleepiness. However, we do not report the outcomes of the questionnaires as it is beyond the scope of this paper. We attached the high-density EEG system (Geodesic Sensor Net, Magstim EGI, Eugene, USA) and the ECG electrodes with a modified lead II Einthoven configuration using gold recording electrodes (Genuine Grass electrodes, Natus Medical Inc., Pleasanton, USA). Afterwards, we performed a simple audiometry to ensure participants can perceive a sound volume of 45 dB. All participants were able to hear 40 dB tones in both ears, except one only correctly perceived 40 dB in one ear. However, this participant was part of study 1 where only 45 dB sounds were played during the night. All impedances were kept below 40 kΩ. Finally, we attached a breathing belt and a canula for detecting sleep apnea (BrainProducts GmbH, Gilching, Germany and SOMNOmedics GmbH, Randersacker, Germany). Participants with clear sleep apnea were excluded from subsequent analyses. Participants were allowed to sleep for 8 hr according to their normal bedtime while we recorded polysomnography (BrainProducts GmbH, Gilching, Germany) and ECG at a sampling rate of 500 Hz. Furthermore, we recorded continuous blood pressure (Finapres Novascope, Finapres Medical Systems BV, Enschede, the Netherlands and SOMNOmedics GmbH, Randersacker, Germany); however, blood pressure data are not reported here.

Auditory stimulation protocol

Automatic stimulation design

To reliably compare auditory stimulation approaches, we designed a novel within-night stimulation protocol to compare different auditory stimulation conditions. The system is based on the algorithm of a previously established mobile stimulation device [35] that was adapted for a high-density in-lab solution using OpenViBE [36]. The stimulation conditions were presented in a pseudorandomized order in a windowed ONOFF approach during detected periods of stable NREM sleep by the algorithm, as illustrated in Figure 1. During the 10 s ON window, auditory stimuli were presented. We chose this stimulation duration because any change in cardiac autonomic modulation may need more than 5 s to occur depending on parasympathetic or sympathetic modulation [37]. After the ON window, the 10 s OFF window started where only triggers were saved but no auditory stimulation was applied. Note that for the SHAM control condition, auditory stimulation was applied in neither ON nor OFF windows; however, triggers were saved for both. All conditions were randomly shuffled and presented in that sequence, whenever the stimulation criteria described in the next paragraph were met. Only after every condition within a sequence was presented, the order of the conditions was reshuffled again for the complete sleep period of 8 hours. Note that the ONOFF window together will be referred to as the stimulation window. Auditory stimulations were presented through Etymotic insert earphones (Etymotic Research Inc., ER 3C).
Figure 1.

Experimental design. (A) Stimulation design of within-night presentation of different auditory stimulation conditions. Whenever the automatic sleep detection algorithm during the complete sleep period of 8 hours detected stable non-rapid eye movement sleep and the PLL reached the threshold target phase, a stimulation window was triggered. A single stimulation window consists of 10 s stimulation ON, where auditory stimulations were applied according to the presented condition, directly followed by a stimulation OFF window, where no auditory stimulation was presented. All conditions were presented in pseudorandomized order, meaning that they got shuffled in the beginning and only after all different conditions have been presented, the order got reshuffled again. Note that during the SHAM condition, no auditory stimulations were presented in neither the ON nor the OFF window. (B) Exemplary stimulation (rhythmic 1 Hz stimulation, ISI1) for a single stimulation window. (C) Circular distribution of the real target phase of the auditory stimulation for the conditions UP (ascending phase of all detected slow waves was targeted), DOWN (descending phase of all detected slow waves was targeted), and ISI1 stimulation for n = 23 participants of Study1 of the stimulation channel FpZ.

Experimental design. (A) Stimulation design of within-night presentation of different auditory stimulation conditions. Whenever the automatic sleep detection algorithm during the complete sleep period of 8 hours detected stable non-rapid eye movement sleep and the PLL reached the threshold target phase, a stimulation window was triggered. A single stimulation window consists of 10 s stimulation ON, where auditory stimulations were applied according to the presented condition, directly followed by a stimulation OFF window, where no auditory stimulation was presented. All conditions were presented in pseudorandomized order, meaning that they got shuffled in the beginning and only after all different conditions have been presented, the order got reshuffled again. Note that during the SHAM condition, no auditory stimulations were presented in neither the ON nor the OFF window. (B) Exemplary stimulation (rhythmic 1 Hz stimulation, ISI1) for a single stimulation window. (C) Circular distribution of the real target phase of the auditory stimulation for the conditions UP (ascending phase of all detected slow waves was targeted), DOWN (descending phase of all detected slow waves was targeted), and ISI1 stimulation for n = 23 participants of Study1 of the stimulation channel FpZ.

Stimulation logic

The first stable NREM sleep was defined as 10 min of uninterrupted detected NREM sleep and was reduced to 3 min of uninterrupted detected NREM sleep for every episode afterwards. The complete procedure on the stimulation logic is described in detail here [29]. Briefly, the EEG signal was notch filtered at 50 Hz, and then the EEG spectral power was calculated in the three frequency bands: low delta (2–4 Hz), high delta (3–5 Hz), and high beta (20–30 Hz) to determine whether the participant is in NREM sleep or non-NREM sleep (wake or REM). Furthermore, the first 10 min of uninterrupted NREM sleep served to calculate a beta power threshold for each night to account for a shift to wake, light sleep, or artifacts. Thus, if after the first 10 min beta power exceeded this threshold, stimulation was halted. To improve the accuracy of stimulations during NREM sleep, we implemented a correlation index for two electrooculography deviations that interrupted sleep detection if a high anti-correlation was measured. Additionally, we implemented a movement detection based on the absolute amplitude of the EEG signal to delay stimulations for 10 s after a movement has been detected. Finally, we implemented a slow-wave sleep detection algorithm based on the power in the delta frequency range to only stimulate when delta power exceeded a predefined threshold. To deliver stimulations at a specific phase of ongoing slow waves, a first-order phase-locked loop (PLL) was implemented according to Ferster et al. [38], which was initially adapted from Santostasi et al. [8]. We set the target phase of the PLL to 50° for up-phase stimulations and 230° for down-phase stimulations. Every stimulation ON window started after the target phase of the PLL has been reached (up-phase for all conditions except down-phase stimulations): stable NREM sleep, beta power below threshold, movement detection, underlying delta power to increase stimulation during deep NREM sleep, and the EOG correlation index. Those criteria are described in more detail elsewhere [29].

Auditory stimulation conditions

We compared various auditory stimulation approaches that are all described in Supplementary Table S1 as an overview. Here, we report the results of the most comparable approaches that fall into the following categories: phase-specific, rhythmic, and sound modulated. Previous research has shown that down-phase auditory stimulation suppresses slow waves [9, 26], whereas up-phase stimulation has repeatedly been shown to increase slow-wave activity (SWA) [9, 23, 39]. We employed a condition to target the ascending phase of slow waves (UP) and a condition to target the descending phase (DOWN) of slow waves and only detected slow waves by the algorithm were stimulated during the stimulation ON window. All auditory stimuli consisted of a 50 ms burst of pink noise. To elaborate on the stimulation protocol used by Ngo and colleagues [9] where they targeted the first ascending phase of a slow-wave and presented the second tone one cycle of a slow wave later, we extended this protocol to a continuous rhythmic stimulation to target the first ascending phase of a slow-wave and continuing in a 1 Hz rhythm throughout the stimulation ON window (ISI1). Finally, we tested two more sophisticated conditions: 1 Hz amplitude-modulated pink noise (ENVELOPE) and BINAURAL BEATS with a difference of 1 Hz between the two ears and carrier frequency of 400 Hz. Both methods should elicit entrainment in the target frequency range [11, 40, 41] of 1 Hz and therefore increase power spectral density in the slow-wave frequencies. During study 1, we kept the sound volume constant at 45 dB, afterwards we specifically investigated the effects of various sound volumes and sound volume modulation on changes in EEG and ECG. In study 2, we tested the effects of slowly increasing and decreasing sound volume during ISI1 stimulation. More precisely, the sound volumes of the ISI1Mod condition (reported in dB) were as follows: 40, 40, 42.5, 42.5, 45, 45, 45, 45, 42.5, 42.5. To further investigate the volume effects, we conducted the final study 3 employing the ISI1 condition with different sound volumes. (1) ISI1High: constant 45 dB and (2) ISI1Low: constant 42.5 dB. In addition to the above-reported conditions, we tested four more conditions within the three studies. We do not report the results of those conditions here because they either provide no additional information for our main research question or are highly explorative paradigms (e.g. not used in previous studies).

EEG analysis and sleep scoring

EEG analyses were performed using the EEGLAB toolbox [42] in MATLAB (R2019a, MathWorks Inc., Natick, MA), and preprocessing consisted of resampling to 200 Hz and using the PREP pipeline [43] to remove line-noise, robust average referencing, and bad channel interpolation. Afterwards, data were band-passed filtered between 0.5 and 40 Hz using FIR filters. Artifacts were rejected based on a semiautomatic artifact removal procedure that was described previously [44]. Additionally, we applied a previously published automatic arousal detection to the Fz, Pz, and Oz channels [45, 46]. This algorithm uses one EMG derivation and one EEG derivation to detect alpha and beta arousals [45, 46]. Sleep was scored visually for sleep stages in 20 s epoch according to standard criteria [47] by one expert scorer. Only stimulation windows in artifact and arousal-free N2 or N3 windows were included in further analyses. A fast Fourier transform was applied using the MATLAB pwelch function with a Hanning window of 4 s with 50% overlap to calculate the power spectral densities (PSD) in the frequency bands of interest. To compare the PSD distribution for the stimulation windows, we either normalized the complete power spectra by its cumulative power up to 30 Hz (e.g. applied in refs. 5 and 9) and for comparison the PSD without normalization because of the large increase in the SWA bands covering all changes in other frequency bands as well. Thereafter, the relative change was calculated by applying the following formula: . To analyze the phase of the stimulation and the temporal EEG response, we first band-pass filtered the EEG data using a Hamming windowed sinc FIR filter with the EEGLAB function pop_eegfiltnew in the respective frequency bands, and thereafter, Hilbert transformed the data to extract the Hilbert phase and amplitude. We will refer to the Hilbert amplitude as slow-wave dynamics. To investigate whether baseline SWA could influence SWA enhancement, we calculated a 5 s baseline low SWA before the start of the ISI1 stimulation ON window and compared it to the relative increase in low SWA during the first 5 s of the stimulation ON window between the ISI1 condition and the closest SHAM window. The percentage change was only calculated if the SHAM window occurred within 5 min of the ISI1 stimulation window. To elucidate whether power changes in the spindle frequency band are driven by sleep spindles or are rather accompanied by arousals because of the overlapping frequency bands, we detected all sleep spindles across all electrodes using an established algorithm [48, 49]. Briefly, we first down-sampled the signal to 200 Hz, and band-pass filtered the signal between 10 and 16 Hz using a Chebychev filter and the MATLAB function filtfilt. Afterwards, sleep spindles were detected with a threshold of two times the mean to five times the mean of the filtered signal. The spindle frequency and timepoint were derived for all detected spindles. Thereafter, we assigned for bins of 2 s of all stim windows for each participant a value of either 1 if a detected spindle started or 0 if no spindle start was detected in the respective bins. Finally, we calculated a probability distribution by dividing the sum of all detected events by the total amount of valid stimulation windows for each bin.

ECG analysis

ECG R peaks were automatically detected and visually corrected if necessary using PhysioZoo [50]. Segments of nondetectable R peaks or poor data quality were marked and excluded for further analyses. The generated RR intervals were processed with MATLAB (R2019a, MathWorks Inc.) and only RR intervals with all R peaks occurring completely in a stimulation window were extracted and the root mean square of successive RR differences (RMSSD) and the standard deviation of normal RR intervals (SDNN) was calculated. Furthermore, IHR at a specific time point was calculated by dividing 60 through the RR interval. Afterwards, continuous IHR was derived by interpolating missing datapoints using the function fillmissing with the method makima. Because we observed that the interpolation of the last beats in the stimulation window performed poorly, we excluded the last 2 s for the IHR plots and therefore only show the results of the complete ON and 8 s OFF window. To calculate the relative IHR, we calculated the mean HR of the closest SHAM window if both windows occurred within 5 min and used the following formula for every timepoint: . If no SHAM window occurred within 5 min, the window was discarded. Additionally, we extracted the following features for every stimulation window: longest RR interval, shortest RR interval, mean RR interval, and the difference between the longest and shortest RR interval. We will refer to the IHR and HRV measurements as cardiovascular dynamics.

Statistics

Statistical analyses were conducted in RStudio [51] version 1.2.5033 and MATLAB (R2019a). To compare the number of stimulation windows for the first 4 hr of stimulations to the remaining hours of stimulation, the number of stimulations within a stimulation window for the UP and DOWN condition and their differences for the first 4 hr of stimulations to the remaining hours of stimulation, we employed paired two-sided t-tests. Furthermore, comparisons of two conditions for instance for volume effects were based on paired t-tests as well. To compare the topographical distribution of the EEG power spectral density within specific frequency bands, we ran linear mixed-effects models for each electrode with the fixed factor condition and random factor participant using the MATLAB function fitmod. Post hoc p-values for the fixed effects between the auditory stimulation conditions and the SHAM condition were obtained using the function fixedEffects with the Satterwhaite approximation for the degrees of freedom and were corrected for multiple comparisons by false discovery rate [52] for each condition and all electrodes. For the Hilbert amplitude analysis, we ran linear mixed-effects models for every timepoint using fitmod and factor condition as fixed effects and participant as random effect. Post hoc p-values for each condition compared to SHAM were obtained using fixedEffects with the Satterwhaite approximation for the degrees of freedom. All time-course plots show the mean ± standard error of the mean (sem) and were generated in MATLAB using the boundedline function [53]. To correlate baseline low SWA with the relative increase of low SWA during stimulation, we employed repeated measures correlations using the R package rmcorr [54]. The HRV and HR values were compared using a linear mixed-effects model with fixed factors Condition and random factor Subject using the software package lme [55] in R. If the linear mixed-effects model was significant for the dependent variable (condition), we derived post hoc p-values using the Satterwhaite method of the package lmerTest [56]. Note that we did not expect all conditions to evoke similar effects on the brain and the body and therefore, we also calculated post hoc p-values if the p-value of the model showed trend level (p < .1). To correct for multiple comparisons, we used the R package emmeans [57], which uses the population marginal means, and the Hochberg method to adjust post hoc p-values. To directly compare the different volume conditions with each other, we employed two-sided paired t-tests. To further analyze the relationship between the change in SWA with the change in HR(V) features, we employed repeated measures correlations using the R package rmcorr [54]. p-values < .05 were considered significant, p-values < .1 were considered as trend level. Plots were generated using the ggplot2 package of R [58] or MATLAB.

Results

We report the results of three studies where we investigated the effects of different auditory stimulation approaches on brain oscillations and cardiovascular dynamics. We separated the different approaches in effects of phase, effects of sound modulation, effects of volume, and effects of time of the night. We first report how these settings affect slow-wave dynamics, spectral density, arousals, and spindles, thereafter the effects on cardiovascular dynamics, and finally how slow-wave and cardiovascular effects are correlated. Basic sleep architecture parameters for all reported studies are shown in Supplementary Table S1.

Effects of phase-targeted auditory stimulation on EEG power spectral density and slow-wave dynamics

First, we analyzed the performance of our algorithm to hit a specific phase of a slow wave in comparison with the rhythmic ISI1 protocol. The algorithm showed a high precision in hitting the up-phase or down-phase of all detected slow waves in the STIM ON window as illustrated in Figure 1C. The ISI1 approach led to a distribution of the stimulations across the complete cycle of slow waves. As a next step, we counted the number of stimulations within the stimulation windows. For the UP condition, a mean number of 6.28 ± 0.21 of stimulations and for the DOWN condition, a mean number of 6.33 ± 0.21 stimulations occurred within each window. First, we compared the nonnormalized change in power spectral density distribution as illustrated in Supplementary Figure S1A. We observed particularly strong enhancements in the slow-wave frequency bands, however, also increases in other frequency bands were present. Comparing the change in power spectral density distribution that was normalized to the cumulative power from 0.5 to 30 Hz (see Supplementary Figure S1B), only the slow-wave frequency band was increased compared to SHAM, whereas the higher frequencies were reduced. This clearly shows the large power increase in the slow-wave frequency bands induced by the auditory stimulations. Because of the strong effects in the slow-wave range, we next focused on slow-wave dynamics. We analyzed the Hilbert amplitude in the low slow-wave frequency band (0.5–2 Hz) during the stimulation windows. Figure 2A illustrates the percentage change in slow-wave Hilbert amplitude of the Fz electrode to SHAM and shows a clear increase for the UP, DOWN, and ISI1High conditions at the start of the stimulation that is, however, gradually attenuated during the stimulation period. The initial peak represents a mean increase of 81.96 ± 14.99 % (UP), 91.59 ± 13.70 % (DOWN), and 102.05 ± 15.90 % (ISI1High) compared to SHAM. As a next step, we investigated whether there was a difference in these maximal Hilbert amplitudes between all non-SHAM conditions. Thus, we ran a linear mixed-effects model of all conditions of study 1 (UP, DOWN, ISI1High, ENVELOPE, BINAURAL BEATS) that revealed a significant effect of condition [FCond (4,88) = 5.79, pcondition < .001] on the maximal relative Hilbert amplitude compared to SHAM. However, post hoc comparisons of the conditions UP, DOWN, and ISI1High showed no significant difference between any of these conditions (all p > 0.1).
Figure 2.

Results of auditory stimulation conditions on slow-wave dynamics and SWA. A + B: percentage change in Hilbert amplitude in the slow wave frequency band (0.5–2 Hz) for the conditions UP, DOWN, ISI1High relative to SHAM (A), or ENVELOPE and BINAURAL BEATS relative to SHAM (B) respectively, during the complete stimulation windows. Hilbert amplitude change is presented as mean ± standard error of the mean of the electrode Fz. The horizontal line at zero represents no change compared to the SHAM condition. Below the plot, the resulting p-values of post hoc comparisons of linear mixed-effects model with condition as an independent factor and subject as a random factor for every time point, is shown for each condition relative to SHAM in (A): UP, DOWN, ISI1High and (B): ENVELOPE and BINAURAL BEATS. Note that effect is most pronounced during beginning of stimulation ON window and diminishes with time. (C) Topographical change of SWA (0.5–2 Hz) for the conditions UP, DOWN, ISI1High, ENVELOPE, and BINAURAL BEATS compared to SHAM. Because of the difference of the SWA response seen in the Hilbert response, we divided the stimulation ON and OFF window in two 5 s windows each. Black dots indicate significant electrodes (p < .05) for the post hoc p-values resulting from linear mixed-effects effects model models with condition as an independent factor and subject as a random factor. p-values for each topoplot have been corrected for multiple comparisons by applying the false discovery rate. All plots are shown for n = 23 participants.

Results of auditory stimulation conditions on slow-wave dynamics and SWA. A + B: percentage change in Hilbert amplitude in the slow wave frequency band (0.5–2 Hz) for the conditions UP, DOWN, ISI1High relative to SHAM (A), or ENVELOPE and BINAURAL BEATS relative to SHAM (B) respectively, during the complete stimulation windows. Hilbert amplitude change is presented as mean ± standard error of the mean of the electrode Fz. The horizontal line at zero represents no change compared to the SHAM condition. Below the plot, the resulting p-values of post hoc comparisons of linear mixed-effects model with condition as an independent factor and subject as a random factor for every time point, is shown for each condition relative to SHAM in (A): UP, DOWN, ISI1High and (B): ENVELOPE and BINAURAL BEATS. Note that effect is most pronounced during beginning of stimulation ON window and diminishes with time. (C) Topographical change of SWA (0.5–2 Hz) for the conditions UP, DOWN, ISI1High, ENVELOPE, and BINAURAL BEATS compared to SHAM. Because of the difference of the SWA response seen in the Hilbert response, we divided the stimulation ON and OFF window in two 5 s windows each. Black dots indicate significant electrodes (p < .05) for the post hoc p-values resulting from linear mixed-effects effects model models with condition as an independent factor and subject as a random factor. p-values for each topoplot have been corrected for multiple comparisons by applying the false discovery rate. All plots are shown for n = 23 participants. The maximal relative increase was 81.96 ± 14.99% (UP), 91.59 ± 13.70% (DOWN), and 102.05 ± 15.90% (ISI1High) compared to SHAM, and the respective p-values of the post hoc comparisons are illustrated below in Figure 2A. Given the nonstationary effect of the auditory stimulation, we next analyzed the topographical effect in the low SWA (0.5–2 Hz) for every 5 s of the stimulation windows, which is illustrated in Figure 2C. We found a significant increase in SWA across all cortical regions. The change in SWA compared to SHAM is illustrated in Supplementary Figure S2C. Furthermore, as already expected based on the Hilbert amplitude results, the SWA enhancing effect was most pronounced during the first 5 s of the stimulation ON window. In contrast to previous studies [9, 26], we did not observe a decrease in SWA upon down-phase stimulation. However, as seen in Supplementary Figure S2B, there was a significant decrease in high SWA (2.25–4.5 Hz) in a central cluster of electrodes in the OFF window of all auditory stimulation conditions (UP, DOWN, ISI1High).

Effects of sound modulation on EEG power spectral density and slow-wave dynamics

Different from the phase targeting conditions where only bursts of pink noise were played, we applied two sound modulation conditions based on 10 s continuous sounds (either amplitude modulated pink noise [ENVELOPE] or continuous single-frequency sound with a 1 Hz discrepancy between the ears [BINAURAL BEATS]). Supplementary Figure S1, C and D shows the power spectral density distribution change compared to the SHAM condition. We observed a strong increase in the slow-wave frequency bands of the nonnormalized distribution for the BINAURAL BEATS, whereas the increase in the low frequencies was not as pronounced for the ENVELOPE condition. Furthermore, within the BINAURAL BEATS, the percentage power change of the lower alpha bands was comparable to the percentage power increase of the slow-wave range. Similar to the phase targeting conditions, we also investigated the slow-wave dynamics (Hilbert amplitude of Fz, 0.5–2 Hz) which is shown in Figure 2B as the percentage change compared to SHAM. The initial peak represents a mean increase of 131. 30 ± 19.14% for the BINAURAL BEATS, whereas the ENVELOPE stimulation resulted in a mean increase of 70.09 ± 14.71% compared to the SHAM condition. After the linear mixed-effects model showed a significant effect of condition (see section above), post hoc comparisons revealed a significant difference between the maximal Hilbert amplitude of the BINAURAL BEATS and ENVELOPE (p < .001). Furthermore, the increase in slow-wave dynamics compared to SHAM was only significant within the first 2 s for the ENVELOPE stimulation and around 5 s for the BINAURAL BEATS condition (see Figure 2B). The topographical SWA effect of the auditory stimulation for every 5 s of the stimulation windows is illustrated in Figure 2C. We found no significant difference between ENVELOPE and SHAM conditions for all 5 s windows after correction for multiple comparisons. On the other hand, BINAURAL BEATS stimulation significantly and globally enhanced SWA within the first 5 s. The relative change in SWA of both conditions compared to SHAM is shown in Supplementary Figure S2A.

Effects of auditory stimulation on sleep stage shifts and arousal

Effects on wake probability

Considering the increase in power spectral density in almost the complete frequency spectrum, we wondered whether auditory stimulation increased the probability of wake epochs to occur and whether some conditions more likely woke up participants than others. We found a significant effect of condition [FCond (5,110) = 5.79, pcondition < .001] on the probability of wake epochs to occur. Post hoc comparisons of the conditions revealed a significantly increased wake probability for BINAURAL BEATS condition [tBinauralBeats(110) = 4.66, p < .001]. All other conditions resulted in nonsignificant changes of the probability compared to the SHAM condition (all pCond > 0.1). In addition to sleep stage shifts towards wake epochs, we also compared the probability of automatically detected arousals occurring in a frontal (Fz), parietal (Pz), or occipital (Oz) electrode. We found no significant changes in either frontal, parietal or occipital detected arousals depending on the condition [FCond,Frontal(5,110) = 1.07, p = .38, FCond,Parietal(5,110) = 1.10, p = .37, FCond,Occipital(5,110) = 1.27, p = .28].

Effects on low alpha power and slow and fast spindle probability

To further ensure that the observed brain oscillatory response is indeed not accompanied by any arousal, we analyzed the 5 s topographical response in the low alpha frequency band (8–10 Hz) which is shown in Supplementary Figure S3. For the DOWN and ISI1 stimulation, we observed significant increases in alpha during the first 5 s of stimulation; however, the effects were observed across all cortical regions and not only occipital which might point towards arousal [59]. Additionally, we found a significant increase in low alpha across all 5 s windows of the stimulation ON and stimulation OFF window for the BINAURAL BEATS, and this increase in low alpha became more prominent in occipital regions with time, indicating an arousal-induced reaction. Because of a frequency overlap of alpha activity with low spindle activity, the increased alpha activity might potentially be caused by low spindle activity. Thus, we ran a spindle detection algorithm for all electrodes and calculated a spindle probability distribution our stimulation windows binned in 2 s. The probability distribution for slow (10–12 Hz) and fast (13–16 Hz) spindles for a frontal electrode (Fz), a centro-parietal electrode (CPz), and occipital electrode (Oz) is shown in Supplementary Figure S4. We found a significant increase in fast spindle probability especially in the frontal electrode for all conditions except for the ENVELOPE condition. Furthermore, we observed increased slow spindle probability in the frontal electrodes of all conditions. However, only for the BINAURAL BEATS, we observed significant increases in slow spindle activity also in the occipital electrode.

Effects of sound volume on slow-wave dynamics

Because we detected a large initial peak in the Hilbert slow-wave amplitude in all stimulation conditions of study 1, we wondered whether we could reshape this initial peak by using sound volume modulation. Therefore, we employed again 1 Hz rhythmic stimulation at 45 dB (ISI1High) and a sound volume modulated condition where we increased the sound volume from 40 to 45 dB within five tones (ISI1Mod). As illustrated by the relative Hilbert amplitude shown in Figure 3A, the increasing sound volume of the ISI1Mod significantly postponed the occurrence of the highest amplitude compared to the ISI1High stimulation on an individual level by 4.39 s [tCond(8) = 2.81, p = .02]. Although the slow-wave dynamics revealed significant differences between the two conditions in the beginning of the stimulation (Figure 3B), comparing the highest amplitude during the complete stimulation window of 39.37 ± 13.29% for the ISI1High and 33.42 ± 7.42% for the ISI1Mod showed no more significant difference between the two conditions [tCond(8) = 0.64, p = .54].
Figure 3.

Results of 1 Hz rhythmic stimulation (ISI1) with different sound volumes on slow-wave dynamics. A + C: percentage change of Hilbert amplitude in the slow-wave frequency band (0.5–2 Hz) for the conditions ISI1High (45 dB), ISI1Mod (40–45 dB) compared to SHAM (A) and ISI1High (45 dB), ISI1Low (42.5 dB) compared to SHAM (B), respectively. The horizontal line at zero represents no change compared to the SHAM condition. Data are presented as mean ± standard error of the mean for electrode Fz. Below the plot, the resulting p-values of the post hoc comparison of linear mixed-effects models with the condition as fixed factor and volunteer as random factor for each time point are shown. B + D: Percentage change of Hilbert amplitude in the slow-wave frequency band (0.5–2 Hz) for the condition ISI1High compared to ISI1Mod (C), and ISI1High compared to ISI1Low (B), respectively. The horizontal line at zero represents no change. Data are presented as mean ± standard error of the mean for electrode Fz. Below the plot, the resulting p-values of the post hoc comparison of linear mixed–effects models with condition as a fixed factor and subject as a random factor for each time point are shown. Plots A + B: show n = 9 participants, and plot C + D: n = 19 participants.

Results of 1 Hz rhythmic stimulation (ISI1) with different sound volumes on slow-wave dynamics. A + C: percentage change of Hilbert amplitude in the slow-wave frequency band (0.5–2 Hz) for the conditions ISI1High (45 dB), ISI1Mod (40–45 dB) compared to SHAM (A) and ISI1High (45 dB), ISI1Low (42.5 dB) compared to SHAM (B), respectively. The horizontal line at zero represents no change compared to the SHAM condition. Data are presented as mean ± standard error of the mean for electrode Fz. Below the plot, the resulting p-values of the post hoc comparison of linear mixed-effects models with the condition as fixed factor and volunteer as random factor for each time point are shown. B + D: Percentage change of Hilbert amplitude in the slow-wave frequency band (0.5–2 Hz) for the condition ISI1High compared to ISI1Mod (C), and ISI1High compared to ISI1Low (B), respectively. The horizontal line at zero represents no change. Data are presented as mean ± standard error of the mean for electrode Fz. Below the plot, the resulting p-values of the post hoc comparison of linear mixed–effects models with condition as a fixed factor and subject as a random factor for each time point are shown. Plots A + B: show n = 9 participants, and plot C + D: n = 19 participants. Within study 3, we investigated whether 45 dB ISI1 (ISI1High) stimulation differed from 42.5 dB ISI1 stimulation (ISI1Low) to further elaborate on the effects of sound volume on auditory slow wave enhancement and whether there might be an accumulated dose-dependency effect. Figure 3C illustrates the relative Hilbert response of the ISI1High, the ISI1Low compared to the SHAM showing significant differences in the Hilbert amplitude across the stimulation ON window between the conditions and SHAM. The highest amplitude at the location of the maximum of the Hilbert amplitude of the ISI1High was 56.72 ± 8.05% and 39.75% ± 7.45% for the ISI1Low. A paired t-test revealed a significant difference [tCond(18) = 3.42, p = .003] between the maximal amplitude of the ISI1High and ISI1Low. As illustrated in Figure 3D which shows the relative difference in the Hilbert amplitude between the ISI1High and ISI1Low, there was an increase for the ISI1High across the complete stimulation ON window.

Effects of time of the night on the efficacy of auditory stimulations

Effects on number of stimulations

Because previous research using auditory stimulation to modulate sleep slow waves often only stimulated for a certain time [9, 23], we wanted to investigate whether we observe a different effect on the brain depending on the time of stimulation. Thus, we divided the night into first 4 hr of stimulation, starting from the first stimulation ON window, and the remaining hours of stimulation. The number of stimulation windows during early and late nights is shown in Supplementary Table S2. Paired two-sided t-tests between the number of stimulation windows revealed a significant larger number of stimulation windows during early nights for all studies separately (study 1: tTime(22) = 4.13, p < .001; study 2: tTime(8) = 7.30, p < .001; study 3: tTime(18) = 6.76, p < .001). Next, we wanted to check whether the number of stimulations within a stimulation ON window differed for the early or late night. A paired two-sided t-test revealed a significantly higher number of stimulations occurring during the stimulation windows in the early nights of the UP condition (tTime(20) = 2.93, p = .008). On the other hand, we did not detect a significant difference in number of stimulation for the DOWN condition (tTime(20) = 1.58, p = .13).

Effects on SWA and slow-wave dynamics

As a next step, we also calculated the SWA for 5 s windows of the stimulation window. As illustrated in Figure 4, the SWA enhancement during the first 5 s is present during early and late nights. However, we detected only a significant SWA enhancement for the second 5 s during early nights. Furthermore, as shown in Supplementary Figure S5, if calculating the SWA across the complete stimulation window, we observed a significant effect of condition only for the early nights [FCond(5,110) = 9.51, p < .001], and post hoc comparisons showed a significant SWA enhancement for all conditions except the ENVELOPE condition [p < .05, except pENVELOPE = .89). However, comparing the SWA of the late-night, the linear mixed-effects model was not significant for condition [FCond(5,101.12) = 1.78, p = .12]. Furthermore, we also calculated the Hilbert amplitude response in the low slow-wave frequency bands (0.5–2 Hz) for early and late nights for the ISI1High condition and SHAM condition only. These results are illustrated in Supplementary Figure S5B. The plot matches with our SWA enhancement by that the Hilbert amplitude of the ISI1High seems to be enhanced for a longer time compared to the SHAM for the early night, and this enhancement is less pronounced during late nights. To investigate whether baseline SWA levels could influence these effects, we calculated 5 s baseline SWA levels before the ISI1 stimulation windows and ran a repeated-measures correlation analysis with the percentage increase in SWA compared to the closest SHAM window, if this window was occurring within 5 min. We found strong and global positive correlations, as for instance for electrode Fz (rrm = 0.41, prm < .001), all rrm values on a topographical level are shown in Supplementary Figure S6.
Figure 4.

Effects of time of the night on SWA (0.5–2 Hz). Early night shows SWA data of the Fz electrode for all stimulation conditions (UP, DOWN, ISI1High, ENVELOPE, BINAURAL BEATS, SHAM) for the first 4 hr after the first stimulation window started. Late-night represents the remaining hours of stimulations. We employed linear mixed effect models for all 5 s of the stimulation windows entering the stimulation condition as a fixed effect and subject as a random factor. All post hoc p-values represent the significance compared to the SHAM condition and have been adjusted for multiple comparisons using the Hochberg correction. ***p < .001, **p < .01, *p < .05, #0.05 < p < 0.1. All data are shown for n = 23 participants.

Effects of time of the night on SWA (0.5–2 Hz). Early night shows SWA data of the Fz electrode for all stimulation conditions (UP, DOWN, ISI1High, ENVELOPE, BINAURAL BEATS, SHAM) for the first 4 hr after the first stimulation window started. Late-night represents the remaining hours of stimulations. We employed linear mixed effect models for all 5 s of the stimulation windows entering the stimulation condition as a fixed effect and subject as a random factor. All post hoc p-values represent the significance compared to the SHAM condition and have been adjusted for multiple comparisons using the Hochberg correction. ***p < .001, **p < .01, *p < .05, #0.05 < p < 0.1. All data are shown for n = 23 participants. As a last step, we investigated whether time of the night influences the probability of wake epochs to occur upon stimulation. Because we only observed significant effects of the BINAURAL BEATS condition on wake probability, we employed a linear mixed-effects model with the interaction of time of the night and condition as fixed factors with only the SHAM and BINAURAL BEATS conditions, which was not significant [FCond × Time(1,64.061)= 0.028, p = .87]. Running a linear mixed-effects model for the early and late-night separately showed a significant effect of condition [FCond,Early(1,22) = 10.902, p = .003] and an increase in the probability by 8.96% for the BINAURAL BEATS. The same model showed trend-level [FCond,Late(1,20.942) = 3.57, p = .07] for the late-night, with an increase of the wake probability by 8.95%.

Effects of auditory stimulation on cardiovascular dynamics

Effects on IHR

Because the brain and the body are not uncoupled during sleep and the ANS represents a key pathway between the brain and body, we first wanted to investigate whether there are immediate, dynamical changes in instantaneous heart rate (IHR) during the stimulation windows. To overcome the general decline in heart rate occurring with the course of the night [60], we calculated a relative IHR compared to the mean heart rate of the closest SHAM stimulation window if this SHAM window occurred within 5 min. As illustrated in Figure 5, A and B, we observe a slight acceleration in the IHR during the first 4 s of all auditory stimulation conditions followed by a longer deceleration of IHR compared to the SHAM condition. To quantify these dynamic effects, we calculated the mean RR interval (note: inverse of heart rate, the longer the RR interval, the slower the heart rate), the longest and shortest RR interval of each stimulation window, the difference between the shortest and longest RR interval of each stimulation ONOFF together window, and the short time HRV indices RMSSD and SDNN. As shown in Figure 3C, we found a trend-level effect of the stimulation conditions on the prolongation of the mean RR interval [FCond (5,110) = 2.05, p = .078], with post hoc comparisons indicating significance (p < .05) only for the DOWN and ISI1High condition. Furthermore, we detected a significant influence of conditions for the longest RR interval [FCond (5,110) = 6.10, p < .001], and post hoc comparisons showed a significant prolongation for all conditions compared to SHAM (all p < .01). Analyzing the shortest RR interval in the stimulation window, we detected a significant influence of condition [FCond (5,110) = 5.69, p < .001], and post hoc comparisons between conditions revealed that only BINAURAL BEATS significantly shortened the shortest RR interval in the stimulation window compared to SHAM (p < .001). To quantify the ANS influence on the heart, we compared two HRV indices: first, we found a significant effect of condition on RMSSD [FCond (5,110) = 5.63, p < .001], and post hoc comparisons between the conditions showed a significant increase in RMSSD compared to SHAM for the UP, DOWN, and BINAURAL BEATS (all p < .05), and a trend for ISI1High (p = .0504). Moreover, we detected a significant influence of condition on SDNN [FCond (5,110) = 7.72, p < .001]. Post hoc comparisons revealed a significant increase for UP, DOWN, ISI1High, and BINAURAL BEATS (p < .05), whereas the effect of ENVELOPE remained on trend level (p = .0521).
Figure 5.

Cardiovascular response to auditory stimulation. A + B: percentage change in IHR during stimulation window for UP, DOWN, IS1High, and SHAM stimulation (A), or ENVELOPE, BINAURAL BEATS, and SHAM (B) compared to the mean of the closest SHAM window, if the closest window was within 5 min. IHR is presented as mean ± standard error of the mean. (C) Heart rate (variability) features for stimulation windows for the conditions UP, DOWN, ISI1High, ENVELOPE, BINAURAL BEATS, and SHAM stimulation. All data are presented as mean ± standard error of the mean. RRI: interval between two normal heartbeats, RMSSD: root mean square of successive differences between normal heartbeats. SDNN: standard deviation of normal heartbeats. Longest RRI: duration of the longest interval between two consecutive heartbeats in a stimulation window. Shortest RRI: duration of the shortest interval between two consecutive heartbeats in a stimulation window: RRI Diff: difference between the longest and shortest interval within a single stimulation window. Note that the RRI is inversely correlated with heart rate, thus, the longer the RRI, the lower the heart rate. Significance levels of the linear mixed-effects model with the condition as an independent factor and subject as a random factor are shown for each cardiovascular feature in the plot and corrected for multiple comparisons using the Hochberg method. ***p < .001, **p < .01, *p < .05, #0.05 < p < 0.1. Data is shown for n = 23 participants.

Cardiovascular response to auditory stimulation. A + B: percentage change in IHR during stimulation window for UP, DOWN, IS1High, and SHAM stimulation (A), or ENVELOPE, BINAURAL BEATS, and SHAM (B) compared to the mean of the closest SHAM window, if the closest window was within 5 min. IHR is presented as mean ± standard error of the mean. (C) Heart rate (variability) features for stimulation windows for the conditions UP, DOWN, ISI1High, ENVELOPE, BINAURAL BEATS, and SHAM stimulation. All data are presented as mean ± standard error of the mean. RRI: interval between two normal heartbeats, RMSSD: root mean square of successive differences between normal heartbeats. SDNN: standard deviation of normal heartbeats. Longest RRI: duration of the longest interval between two consecutive heartbeats in a stimulation window. Shortest RRI: duration of the shortest interval between two consecutive heartbeats in a stimulation window: RRI Diff: difference between the longest and shortest interval within a single stimulation window. Note that the RRI is inversely correlated with heart rate, thus, the longer the RRI, the lower the heart rate. Significance levels of the linear mixed-effects model with the condition as an independent factor and subject as a random factor are shown for each cardiovascular feature in the plot and corrected for multiple comparisons using the Hochberg method. ***p < .001, **p < .01, *p < .05, #0.05 < p < 0.1. Data is shown for n = 23 participants.

Correlation between SWA enhancement and cardiovascular features

As a next step, we investigated whether the auditory-induced SWA enhancement correlated with the abovementioned cardiovascular changes. Thus, we conducted a repeated-measures correlation between the percentage change of every stimulation condition compared to SHAM. For the cardiovascular changes, we used the complete stimulation window indices, whereas we took the SWA of the first 5 s of the stimulation ON window as we suspected those effects to be the driver of the cardiovascular changes considering the time-course of the IHR. As illustrated in Figure 6, we found a positive correlation between the difference in SWA and the difference in mean RR interval, and the difference in the longest RR intervals, particularly for a central and parietal electrode cluster. Furthermore, we found some frontal and central clusters showing a significant positive correlation of the change in SWA with SDNN. Some central and occipital electrode clusters showed significant negative correlations with the shortest RR interval, indicating that the higher the change in SWA the shorter the shortest RR interval. Additionally, in Supplementary Figure S7, we correlated the cardiovascular parameters with high SWA (2.25– 4.5 Hz) showing stronger correlations for especially the RMSSD difference and the mean RRI difference. Furthermore, we also correlated the cardiovascular parameters with changes in the low alpha (8–10 Hz) frequency band, which is illustrated in Supplementary Figure S8. We found SDNN to be positively correlated with an increase in low alpha across a cluster of right frontal, central, and parietal electrodes.
Figure 6.

Repeated measures correlations between percentage change for heart rate (variability; HR(V)) features and percentage change of SWA (0.5–2 Hz) in the first 5 s of the stimulation ON window of electrode Fz. For every participant (n = 22) and every condition a mean value for each variable was calculated. We excluded one participant because not all conditions had ECG measurements of high enough qualities. Topoplots show r values and significant electrodes (p < .05) are marked with black dots. We corrected the p-values for multiple comparisons using false discovery rate. The repeated measures correlation is plotted for the example electrode F3 (marked with a gray point in the topoplots), and the respective p-values and rrm values are shown in the plots. (A) Mean difference of consecutive heart beats (RRI). (B) Difference in the root mean square of successive difference of normal heart beats (RMSSD). (C) Differences of the longest RRI. (D) Difference in the standard deviation of differences between normal heartbeats (SDNN). (E) Percentage of difference of the shortest RRI.

Repeated measures correlations between percentage change for heart rate (variability; HR(V)) features and percentage change of SWA (0.5–2 Hz) in the first 5 s of the stimulation ON window of electrode Fz. For every participant (n = 22) and every condition a mean value for each variable was calculated. We excluded one participant because not all conditions had ECG measurements of high enough qualities. Topoplots show r values and significant electrodes (p < .05) are marked with black dots. We corrected the p-values for multiple comparisons using false discovery rate. The repeated measures correlation is plotted for the example electrode F3 (marked with a gray point in the topoplots), and the respective p-values and rrm values are shown in the plots. (A) Mean difference of consecutive heart beats (RRI). (B) Difference in the root mean square of successive difference of normal heart beats (RMSSD). (C) Differences of the longest RRI. (D) Difference in the standard deviation of differences between normal heartbeats (SDNN). (E) Percentage of difference of the shortest RRI.

Discussion

In this paper, we compared various widely used auditory stimulation conditions with a SHAM condition within a single sleep period and found strong and global enhancements of SWA of all stimulation conditions except the ENVELOPE condition. Furthermore, the increase in slow-wave dynamics within stimulation windows was most pronounced during the beginning of stimulation and decreased with the duration of stimulation. Additionally, the time-course of the increase in SWA can be altered by modulating the sound volume of the stimulation. We also showed that auditory stimulation applied during the second half of the night had a decreased efficiency of enhancing SWA. Moreover, we provide first evidence on the effects of auditory slow-wave modulation on the time-course of cardiovascular dynamics. We showed that auditory stimulation modulates the course of the IHR during the stimulation windows and changed cardiovascular parameters towards more resting cardiovascular conditions. Importantly, the extent of the cardiovascular change was correlated with the change of SWA, presumably indicating slow waves to be the driver of these changes. Altogether, we presented a novel stimulation protocol allowing resource-efficient comparisons between different auditory stimulation conditions on dynamic changes on slow-wave and cardiovascular levels.

Auditory stimulation globally enhances SWA

Down-phase stimulation increases SWA

We found a strong and global enhancement of SWA for all tested stimulation conditions except for the ENVELOPE condition. This enhancement was independent of the target phase of the auditory stimulation. Contrary to previous studies [9, 26] we did not observe a decrease in SWA and slow-wave dynamics of the down-phase stimulation compared to the SHAM condition, however, we even observed significant increases of SWA on a global cortical level. Comparing our results to the stimulation design of Fattinger and colleagues, we did not stimulate every down-phase of a slow wave, but only slow waves in the 10 s stimulation ON window of the DOWN condition. The slow-wave decreasing effects may accumulate over time of stimulation and our usage of various auditory stimulation conditions might have interfered with the slow-wave diminishing effects. Interestingly, we observed significant decreases in high SWA (2.25–4 Hz) over central areas during the stimulation OFF window, overlapping the targeted premotor cortex of Fattinger’s study [26]. However, these decreases have been observed for all reported stimulation conditions of study 1 and are thus not dependent on the target phase of the stimulation. We, therefore, conclude that global SWA decreases are not possible using auditory slow-wave modulation. Another possible explanation why we found contrary results to Fattinger and colleagues could be that they used a stimulation electrode located above the sensory-motor cortex and the stimulation only served to perturb slow waves on a local cortical level. We on the other hand targeted slow waves at the frequent prefrontal origin of slow waves from where the slow waves can propagate across the complete cortex [61]. Thus, we cannot exclude that the mechanism behind local and global slow wave down-modulation might be different. Finally, high-volume sounds have been previously applied to strongly perturb slow waves, but this stimulation also elicited arousals [62, 63], and increased wake [64]. Thus, sound volume may be an important factor to consider for perturbing slow waves and further studies are needed to investigate the underlying mechanisms on global (e.g. frontal) and local slow wave modulation.

Sudden sound volume change might be necessary to elicit SWA enhancement

Contrary to Simor et al. [11] who applied ENVELOPE stimulation mono-aurally, we were not able to find global SWA enhancement. However, FDR correction for multiple comparison is rather conservative, and we did observe significant SWA enhancement on single electrodes levels. Nevertheless, compared to the other conditions of study 1, ENVELOPE stimulation only minimally increased slow-wave dynamics. We hypothesize that the lack of a sudden change in sound volume that is given by the sinusoidal rise in sound volume happening for 0.5 s until the maximal volume is reached is less surprising than a quicker increase in volume and thus may lead to noise adaption effects as known during wakefulness [65]. Nevertheless, recordings in different (sub)-cortical brain areas could provide new insights on why a slower increase in sound volume might be less efficient than a quicker increase to enhance SWA.

Rhythmic and BINAURAL BEATS stimulations are as efficient as phase-targeted auditory stimulation in enhancing SWA

We found BINAURAL BEATS stimulation and ISI1 stimulation to perform similar in enhancing SWA and slow-wave dynamics as a closed-loop phase-targeted auditory stimulation (UP and DOWN). This might raise the question of whether using a closed-loop system and precise phase targeting is necessary for effective auditory slow-wave modulation or a completely open-loop stimulation protocol performs comparable. Although BINARUAL BEATS and ISI1 are less phase-controlled than UP or DOWN, we need to emphasize that the first tone occurred with a detected slow-wave up-phase. Therefore, we cannot draw any conclusions whether completely open-loop stimulation, as, for instance, Weigenand and colleagues [27] used, yields to a comparable SWA increase to the here reported conditions. In addition, the closed-loop and open-loop approaches might yield to different overnight behavioral outcomes. Weigenand et al. [27] reported no beneficial effects on overnight memory consolidation for their open-loop protocol, although SWA was enhanced during times of stimulation. However, because we used several stimulation conditions within a single night, we cannot assess any overnight behavioral outcomes.

Effect on SWA decreases with duration of auditory stimulation

We observed the strongest increases in SWA and slow-wave dynamics during the beginning of stimulation and this enhancement decreased with time of stimulation, although for all conditions except the UP and DOWN phase targeting condition, auditory stimulation was present during the complete stimulation ON window. In line with this finding is another study of Ngo and colleagues [66] where they investigated the efficiency of a two stimulation protocol to an ongoing stimulation protocol in enhancing slow waves and subsequent memory consolidation. They found no difference in the enhancement of slow waves or induction of trains of slow waves between the two conditions, concluding that there is a mechanism at play preventing the brain from hypersynchrony caused by slow-wave enhancement. Furthermore, Lustenberger and colleagues [29] showed that the overall effect of continuous auditory stimulation on SWA compared to windowed ONOFF stimulation did not differ between the conditions, although around double of the stimulations were applied in the continuous approach. Both studies indicate that breaks between stimulations favor subsequent auditory stimulation-induced slow-wave enhancements. As already proposed by Bellesi et al. [18], auditory stimulation during sleep may activate the non-lemniscal auditory processing pathway, and the target dorsal and caudo-medial medial geniculate body of the thalamus (part of the auditory thalamus) that activates the cortex to changing acoustic environment [67]. This global activation of the cortex is reflected by large amplitude and steep slow waves reflecting an efficient neuronal synchronization process that could be termed as K-complex or Type 1 slow wave [68]. K-complexes commonly arise as a reaction to sensory stimulation and serve to protect sleep continuity (e.g. reviewed in refs. 68 and 69). However, K-complexes can be associated with arousal reactions, indicating activation of the activating reticular ascending system (ARAS). The locus coeruleus (LC) as the main source of norepinephrine in the brain is a key player to modulate arousal levels [70]. Beyond modulating arousal levels, the LC may also have inhibitory effects on a sensory level (e.g. reviewed in ref. 71), and this inhibitory effect caused by the LC has also been reported in the auditory system [72]. Therefore, the decrease of the response to auditory stimulation might not only be caused by cortical or cortico-thalamic processes but already start with an adaptation of the response on the auditory sensor level or at the level of the LC, which may provide further evidence of the importance of breaks being important for efficient auditory stimulation. Therefore, we strongly propose to include breaks (e.g. using a windowed stimulation approach) for efficient auditory slow-wave enhancement. Nevertheless, future research is needed to further elaborate on the reasons for the decline in efficacy of auditory stimulation with the duration of stimulation.

Auditory slow-wave enhancement is not accompanied by arousals

Given the previous indications that the effect of the auditory slow-wave modulation might be arousal driven as the shape of the first increase in the Hilbert amplitude strongly resembles a K-complex, which was proposed to evoke sleep spindles [73], we wanted to investigate whether the observed SWA increase might be accompanied by arousals and therefore, a nonspecific increase across the complete EEG power spectrum. Particularly, an increase in occipital low alpha activity could indicate an auditory-induced arousal. Interestingly, we only found binaural beats to increase the alpha activity of the stimulation ON and OFF window, and this increase became more occipitally focused with duration of the stimulation OFF window. However, because spindles and alpha activity show overlapping EEG frequency bands, it is difficult to distinguish those just based on EEG power. Analyzing a spindle probability distribution for slow and fast spindles to account for the fact that there is a spindle gradient showing slower frontal and faster centro-parietal spindles [74], we found increases in slow and fast frontal spindle probability compared to SHAM. Furthermore, there were only slight differences in fast occipital spindles for some of the conditions. However, only binaural beats show significant increases in slow occipital spindles. Taken together with the significant increase of the probability of a wake epoch to co-occur with BINAURAL BEATS stimulation, we assume that only BINAURAL BEATS stimulation-induced arousals. A possible explanation why BINAURAL BEATS more likely woke or aroused participants could be that although we used 45 dB for all conditions of study 1, BINAURAL BEATS was the only one condition playing a continuous, pure-frequency sound for 10 s, most other conditions only used short, 50 ms bursts of 45 dB pink noise.

Stimulations towards the end of the night are less efficient in enhancing SWA

Many of the previously published studies only applied auditory stimulations for a certain amount of time during the beginning of the night [9, 23, 27, 66], whereas others stimulated during the complete night [10, 22]. Thus, we wanted to elucidate whether there is a different SWA response to auditory stimulation depending on the time during the night. Our results revealed that stimulation is more efficient during the first four hours of stimulation compared to the remaining hours of stimulation. Interestingly, there was no significant slow-wave enhancement during the late night when considering the ON and OFF window together. Additionally, there was a shorter increase in slow-wave dynamics during late nights. This less efficient stimulation towards the end of the night might be caused by generally decreased SWA [75] and less occurring high amplitude slow waves [76] leading to less slow waves that could be stimulated. However, only the UP and DOWN condition depended on the presence of slow waves within the stimulation ON window, whereas for the ISI1High condition, the same number of stimulations within the stimulation ON was applied in early and late nights. Therefore, the presence of slow waves per se cannot be the cause for the decrease in stimulation efficacy. In line with recent findings of Lustenberger et al. [29] showing that efficacy auditory slow-wave modulation can be predicted by baseline SWA, we also found that on a within-participant level baseline SWA directly before the start of a stimulation window significantly predicted the percentage of SWA increase during stimulation. SWA generally declines with the duration of the night [75] and thus may explain the observed decrease in in efficacy in enhancing SWA at later times during the night. Furthermore, the decrease in sleep pressure reflected by decreased SWA [77] accompanied by a reduction in synaptic strength [78] might also indicate slow-wave saturation effects, meaning that with duration of the night, the ability of the system to enhance slow waves might be limited. However, the generation of large subcortico-cortical neuronal synchronization reflected by K-complexes is not perturbed with duration of the night and K-complexes even tend to become larger during later parts of the night [68]. In contrast, slow-wave amplitude and slow-wave slopes are generally decreased in later parts of the night [76], and thus, only the cortico-cortical slow waves might be influenced by those possible saturation effects. Altogether, we provide further evidence to restrict slow-wave sleep modulation to NREM sleep during the early times of the night, where stable NREM sleep is dominating, and therefore provide efficient slow-wave modulation. Nevertheless, establishing the mechanisms that explain the change of response to auditory stimulation across the sleep period needs further investigation.

Auditory slow-wave enhancement alters cardiovascular features

IHR time-course reflects parasympathetic activity changes

We wanted to elucidate whether we detect changes in cardiovascular dynamics time-locked to the auditory stimulation windows. Considering the course of the IHR, we observe clear differences between the auditory stimulation conditions and SHAM. The time-course might indicate at first sight that there was a first acceleration of the heart rate that would indicate an arousal on a cardiovascular level. This initial visually observed increase in IHR could indicate an increase in sympathetic activity at first sight. However, sympathetic modulation needs more than 5 s until it changes cardiovascular functions [37]. On the other hand, parasympathetic activity rapidly changes the heart rate, therefore, being the main contributor to beat-to-beat variations of the heart rate [79]. Thus, the initial IHR course may rather be the outcome of decreased parasympathetic activation with the beginning of the stimulation. This might be an undesired cardiovascular response; however, we found no statistical evidence of a shortening of the shortest RR interval for all conditions, except the BINAURAL BEATS. Altogether, future studies should not only consider that the effects of auditory stimulation are not limited to the brain only, but that they might even entail unfavorable changes on the cardiovascular system as seen by the BINAURAL BEATS stimulation. The initial acceleration of IHR was followed by a longer-lasting period of deceleration as reflected by a prolongation of the longest RR interval for all conditions, and even a significant prolongation of the mean RR interval for the DOWN and ISI1High stimulation indicates a significant slowing of the heart rate. This change in cardiovascular dynamics was also reflected by the underlying change in cardiac autonomic regulation as reflected by the HRV changes. The observed increased SDNN for all conditions (except only trend-level for ENVELOPE) points towards generally increased autonomic activation [79] and the significantly increased RMSSD for the UP, DOWN, BINAURAL BEATS, and a trend for ISI1High indicates increased parasympathetic activation. Our finding is in line with correlational evidence showing increased HRV parameters with increased SWA [33, 80, 81], and two studies demonstrating effects of auditory stimulation on 5 min HRV measurements [22, 30]. However, the HRV results should always be interpreted with caution, as those measurements only indirectly reflect underlying ANS activity. But by taking into account the IHR or heart rate over the complete stimulation window, we can verify whether the observed HRV changes also correspond with the (I)HR as the outcome of the cardiac autonomic regulation reflected by HRV. As for instance seen with BINAURAL BEATS, we observed a significant increase in RMSSD, alongside with a significant shortening of the shortest RR interval. Together with the higher probability of inducing arousal-alike brain traits, we suggest that interpreting the increased RMSSD with caution. Nevertheless, for all other conditions, the detected changes in HRV match with the IHR and RR interval findings.

Slow waves are likely the driver of the induced cardiovascular changes

As a next step, we showed that the auditory evoked SWA enhancements directly translates to temporally coupled changes in the cardiac autonomic regulation, pointing towards more restful cardiovascular conditions when slow waves are prevailing. Although the induced changes on the heart are rather small (e.g. prolongation of the longest RRI by around 20 ms), we also need to consider the short time scale of only 10 s stimulation for this cardiovascular change to occur. Because we included all conditions in this correlation and not all conditions caused a similar effect on SWA, our results may indicate the more SWA we induce, the stronger the slowing of the heart. The effect of increased SWA over a certain time may also accumulate to stronger slowing of the heart, as it can be observed in normal sleep as well. Therefore, effects of the auditory slow-wave enhancement may accumulate over time to increase parasympathetic predominance and lower the HR more substantially. Autonomic imbalance caused by sympathetic overactivity or parasympathetic under-activity has been related to many unfavorable effects on the brain and body (e.g. it is a common mechanism of a majority of risks factors to develop cardiovascular diseases [82], the number one cause of death in the world [83]). Additionally, as aging, which is the number one risk factor for developing cardiovascular diseases [84], is accompanied with a significant decrease in SWA [85], auditory slow-wave enhancement with optimal settings may be a mean to counteract those unfavorable changes. However, applying one optimal auditory stimulation condition is needed to elucidate possible effects accumulated over stimulation time on heart rate and whether there are additional cardiovascular benefits for instance lowered blood pressure that could potentially be related to slow waves.

Limitations

Because we wanted to maximize the number of stimulation windows but still reliably compare HRV features, we calculated HRV for ~20 s windows. This is a very short time based on official guidelines provided by the Task Force of The European Society of Cardiology and The North American Society of Pacing and Electrophysiology [34]. Thus, we cannot generalize our HRV findings to clinical HRV analysis as e.g. 24 hr HRV recordings because the much longer time duration does not allow for this comparison. However, to overcome this issue, we analyzed the time-domain HRV measurements RMSSD and SDNN and compare the stimulation effects between the conditions only. Additionally, RMSSD was already shown to highly correlate in 10 s recordings compared to standard 5 min measurements [86]. Furthermore, the here presented results are intended to be used as guidelines for stimulation effects during the stimulation window itself. Therefore, we cannot draw any conclusion whether the same effects will be obtained if any of the here reported stimulation condition is applied as single condition across a complete night. Such a study would also allow to test behavioral and functional effects of the stimulations, as for instance overnight memory consolidation or ANS testing. Furthermore, we cannot assess whether the auditory stimulation could have altered normal sleep architecture or micro-architecture patterns. Finally, we measured participants with a broad age range, however, the participants were mainly middle-aged men. Schneider et al. [87] demonstrated that susceptibility to auditory stimulation might change with age. Although in study 1, 10 out of 23 were young (<30 years old) participants, the remaining studies could be repeated with younger participants to interpret their results in comparison to other studies using mainly young participants.

Conclusion

Here, we aimed to provide a framework for comparing auditory stimulation conditions in a novel within-night protocol on brain and cardiovascular responses. First, we found no evidence that phase of stimulation influences SWA enhancement and were thus not able to replicate previous research showing decreases in SWA upon down-phase stimulation. Additionally, we showed that stimulation induced SWA enhancement is most pronounced at the beginning of a stimulation window and decreases with the time of stimulation. Moreover, we also demonstrated that by changing the volume of stimulation the SWA enhancement can be modulated and that stimulation towards the end of the night gets less successful in enhancing SWA compared to earlier times of the night. However, the underlying causes for the decline in the SWA enhancement with the duration of stimulation and with time of the night remain to be elucidated. Finally, we provide first evidence that auditory stimulation directly affects the heart on a stimulation window level pointing towards more restful cardiovascular conditions as indicated by a prolongation of the longest RR interval, reflecting a slowing of the heart rate. Most importantly, these cardiovascular changes are correlated with auditory stimulation-induced SWA enhancement indicating increased restorative functions when slow waves are prevailing.

Funding

The corresponding author Caroline Lustenberger is supported by the Swiss National Science Foundation (PZ00P3_179795) and this work was conducted as part of the SleepLoop Flagship of Hochschulmedizin Zürich. This work was also supported by the Swiss National Science Foundation Grant 32003B_207719 and by the National Research Foundation, Prime Minister’s Office, Singapore under its Campus for Research Excellence and Technological Enterprise (CREATE) program (FHT). Reto Huber is supported by the Swiss National Science Foundation (320030_179443). Click here for additional data file.
  74 in total

1.  Reduced sleep spindle activity in schizophrenia patients.

Authors:  Fabio Ferrarelli; Reto Huber; Michael J Peterson; Marcello Massimini; Michael Murphy; Brady A Riedner; Adam Watson; Pietro Bria; Giulio Tononi
Journal:  Am J Psychiatry       Date:  2007-03       Impact factor: 18.112

2.  A novel slow (< 1 Hz) oscillation of neocortical neurons in vivo: depolarizing and hyperpolarizing components.

Authors:  M Steriade; A Nuñez; F Amzica
Journal:  J Neurosci       Date:  1993-08       Impact factor: 6.167

Review 3.  A quantitative systematic review of normal values for short-term heart rate variability in healthy adults.

Authors:  David Nunan; Gavin R H Sandercock; David A Brodie
Journal:  Pacing Clin Electrophysiol       Date:  2010-11       Impact factor: 1.976

4.  The Neuroprotective Aspects of Sleep.

Authors:  Andy R Eugene; Jolanta Masiak
Journal:  MEDtube Sci       Date:  2015-03

Review 5.  Short sleep duration predicts risk of metabolic syndrome: a systematic review and meta-analysis.

Authors:  Bo Xi; Dan He; Min Zhang; Jian Xue; Donghao Zhou
Journal:  Sleep Med Rev       Date:  2013-07-23       Impact factor: 11.609

6.  Old Brains Come Uncoupled in Sleep: Slow Wave-Spindle Synchrony, Brain Atrophy, and Forgetting.

Authors:  Randolph F Helfrich; Bryce A Mander; William J Jagust; Robert T Knight; Matthew P Walker
Journal:  Neuron       Date:  2017-12-14       Impact factor: 17.173

7.  Sleep homeostasis and cortical synchronization: III. A high-density EEG study of sleep slow waves in humans.

Authors:  Brady A Riedner; Vladyslav V Vyazovskiy; Reto Huber; Marcello Massimini; Steve Esser; Michael Murphy; Giulio Tononi
Journal:  Sleep       Date:  2007-12       Impact factor: 5.849

Review 8.  The Potential Role of Sleep in Promoting a Healthy Body Composition: Underlying Mechanisms Determining Muscle, Fat, and Bone Mass and Their Association with Sleep.

Authors:  Fabia M Stich; Stephanie Huwiler; Gommaar D'Hulst; Caroline Lustenberger
Journal:  Neuroendocrinology       Date:  2021-07-27       Impact factor: 4.914

9.  Tuning arousal with optogenetic modulation of locus coeruleus neurons.

Authors:  Matthew E Carter; Ofer Yizhar; Sachiko Chikahisa; Hieu Nguyen; Antoine Adamantidis; Seiji Nishino; Karl Deisseroth; Luis de Lecea
Journal:  Nat Neurosci       Date:  2010-10-31       Impact factor: 24.884

Review 10.  An Overview of Heart Rate Variability Metrics and Norms.

Authors:  Fred Shaffer; J P Ginsberg
Journal:  Front Public Health       Date:  2017-09-28
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.