Hypnopedia, or the capacity to learn during sleep, is debatable. De novo acquisition of reflex stimulus-response associations has been shown to be possible in both humans and animals. Whether sleep allows more sophisticated forms of learning remains unclear. We recorded during diurnal Non-Rapid Eye Movement (NREM) sleep auditory magnetoencephalographic (MEG) frequency-tagged responses mirroring ongoing statistical learning. While in NREM sleep, participants were exposed at non-awakening thresholds to fast auditory streams of pure tones, either randomly organized or structured in such a way that the stream statistically segmented into sets of 3 elements (tritones). During NREM sleep, only tone-related frequency-tagged MEG responses were observed, evidencing successful perception of individual tones. No participant showed tritone-related frequency-tagged responses, suggesting lack of segmentation. In the ensuing wake period however, all participants exhibited robust tritone-related responses during exposure to statistical (but not random) streams. Our data suggest that associations embedded in statistical regularities remain undetected during NREM sleep, although implicitly learned during subsequent wakefulness. These results suggest intrinsic limitations in de novo learning during NREM sleep that might confine the NREM sleeping brain’s learning capabilities to simple, elementary associations. It remains to be ascertained whether this similarly applies to REM sleep.
A popular idea in the 1950s, hypnopedia was rapidly questioned by controlled experiments that concluded that no learning occurs during sleep1,2,3. Successful recall of information presented during sleep was associated with increased electroencephalographic (EEG) alpha activity shortly after item presentation1,4,5, suggesting that delayed retrieval in earlier experiments was due to increased alertness and transition toward wakefulness after stimulus presentation. Lack of learning during sleep is compatible with a well-documented decreased responsiveness to external stimulation. Notwithstanding, residual sensory perception during sleep exists in olfactory6, tactile7,8,9, visual8 and auditory8,10,11,12,13,14,15,16 modalities. Short-latency event-related potential (ERP) components (i.e., mirroring primary cortical activity) especially bear similarities with those recorded during wakefulness, whereas middle and long-latency components reflecting higher-level processes are more often distorted8. Furthermore, experimental data indicate that higher cognitive processes are not entirely inefficient during sleep. Residual cognitive abilities have been evidenced in the domains of semantic processing11,17, working memory18, detection of irregularities19 and extraction of information for motor response preparation20,21. Targeted memory reactivation (TMR) during sleep was also found to promote memory consolidation processes. Auditory or olfactory presentation during post-training sleep of items previously associated with the learned material improved memory retention above expected sleep-related consolidation benefits, thus suggesting interactions between memory content and sensory stimulation during sleep (see22,23,24 for reviews).
Recently, conditioning of an odour-related respiratory reflex to tones was successfully established during sleep in young adults, and the association was preserved at wake25,26. These results provided a first rigorous experimental demonstration of the possibility to learn simple associations during human sleep, as in rodents27. Additionally, perceptual auditory learning that transferred to wakefulness was found efficient during REM sleep and light NREM2 sleep, but detrimental in deep NREM3 sleep28. Whether more sophisticated forms of learning are possible during sleep remains an open question. In infants, prior studies evidenced vowel detection training in seemingly sleeping newborns29, statistical language learning in newborns exhibiting active sleep30, and anticipatory auditory responses in 3-month-old infants in behavioural quiet sleep31. However, in the absence of polysomnographic measurements, the possibility that learning occurred during arousals or wake-like periods cannot be excluded. Furthermore, the infant’s sleep is immature and disorganised32, and exhibits noticeable functional differences from adult sleep33. Altogether, available evidence suggests that acquisition of novel information during adult human sleep might be restricted to elementary discriminative or conditioning processes34,35.
Statistical learning is the detection and retention of statistical regularities, such as the segmentation of continuous streams of syllables based on transitional probabilities between successive elements. This ability is present at wake in adults and children36,37, and in seemingly sleeping newborns30. Detecting statistical information within a flow of syllables is thought to be one of the key mechanisms subtending first-language acquisition in children37. It occurs incidentally, i.e. without explicit instructions about the detection of patterns, but can be impaired when attention is diverted toward a concurrent task38. This makes statistical learning an interesting candidate to investigate learning abilities during sleep. Indeed, demonstrating that sleeping humans can detect statistical regularities would suggest the possibility to learn novel stimulus-stimulus associations during sleep. In this framework, we aimed at determining whether young adults can detect auditory statistical regularities during NREM sleep, and whether such learning can transfer to a subsequent wakefulness period. At the behavioural level, statistical learning is usually assessed using two-alternative forced choice (2AFC) tests36. That is, after passive exposure to the regularities embedded in the material, participants must discriminate between items constructed using high versus low (or null) transitional probabilities. Statistical learning is evidenced when participants exhibit a trend to select high transitional probability items, even if unaware of the presence of the regularities. However, behavioural tests often lack sensitivity39,40,41,42. Electrophysiological39 (EEG) and magnetoencephalographic40 (MEG) frequency-tagged responses have been shown to mirror the ongoing detection of statistical regularities.
The principle of frequency tagging is that if a stimulus is systematically presented in a periodic, regular manner, then the neural population coding for that stimulus will be entrained to the same period43. The frequency of stimulus occurrence thus provides a target frequency tag to identify the associated brain response, which emerges as a peak in the power spectrum of the brain signal with a strong signal-to-noise ratio (SNR) as compared to neighbouring frequencies. This approach allows investigating stimulus-related neural responses at the individual level44, a particularly interesting feature for sleep learning studies3. Frequency-tagged responses also overcome the problems of overlapping ERPs in fast continuous streams of stimuli39,45,46, and possibly the overlap between auditory responses and sleep-specific oscillations (such as slow waves, K-complexes or spindles) during sleep.
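The SNR logic described in this paragraph can be illustrated on simulated data. The following Python sketch is not the analysis pipeline used in this study: the sampling rate, duration, amplitudes, noise level, and the choice of 10 neighbouring bins on each side (skipping the adjacent bin) are assumptions made for illustration only. The two tagged frequencies are the tone and tritone rates used in this study.

```python
import cmath
import math
import random

TONE_HZ = 5.505            # tone presentation rate used in this study
TRITONE_HZ = TONE_HZ / 3   # one tritone every three tones (1.835 Hz)
FS = 50.0                  # assumed sampling rate (Hz), illustration only
DURATION = 200.0           # assumed duration (s); both tags fall on exact bins


def simulate(components, noise_sd, seed):
    """Sum of sinusoids given as (frequency, amplitude) pairs plus Gaussian noise."""
    rng = random.Random(seed)
    n = int(round(FS * DURATION))
    return [
        sum(a * math.sin(2 * math.pi * f * k / FS) for f, a in components)
        + rng.gauss(0.0, noise_sd)
        for k in range(n)
    ]


def power_at(signal, freq):
    """Power at a single frequency, by direct projection on a complex exponential."""
    step = -2j * math.pi * freq / FS
    coeff = sum(x * cmath.exp(step * k) for k, x in enumerate(signal))
    return abs(coeff) ** 2 / len(signal) ** 2


def snr_at(signal, target, n_side=10, skip=1):
    """Power at `target` divided by the mean power of neighbouring bins (assumed definition)."""
    df = FS / len(signal)  # frequency resolution of the simulated recording
    neighbours = [
        target + sign * (skip + j) * df
        for j in range(1, n_side + 1)
        for sign in (-1, 1)
    ]
    noise = sum(power_at(signal, f) for f in neighbours) / len(neighbours)
    return power_at(signal, target) / noise


# RDM-like signal: tone-rate response only.
# STAT-like signal: adds a weaker (hypothetical) tritone-rate response.
signals = {
    "RDM": simulate([(TONE_HZ, 1.0)], noise_sd=1.0, seed=1),
    "STAT": simulate([(TONE_HZ, 1.0), (TRITONE_HZ, 0.2)], noise_sd=1.0, seed=2),
}
snr = {
    name: {"tone": snr_at(sig, TONE_HZ), "tritone": snr_at(sig, TRITONE_HZ)}
    for name, sig in signals.items()
}
for name, values in snr.items():
    print(name, {k: round(v, 1) for k, v in values.items()})
```

In this toy setting, both signals show a large SNR at the tone rate, whereas only the STAT-like signal shows an SNR clearly above 1 at the tritone rate. An SNR close to 1 means that power at the tagged frequency does not stand out from its spectral neighbourhood.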
In this study, we recorded magnetoencephalographic (MEG) frequency-tagged brain responses previously shown to mirror the covert acquisition of statistical regularities at wake40 to test whether young adults can learn high-order auditory regularities during sleep, and transfer this information to subsequent wakefulness. All participants gave written informed consent prior to this experiment, which was performed in accordance with the relevant guidelines and regulations and approved by the ULB-Erame Ethics committee. Twenty-one young adults had a 90-minute afternoon nap opportunity in the MEG scanner. While in unequivocal Non-Rapid Eye Movement sleep stages NREM2 or NREM3 47, they were exposed at non-awakening thresholds to alternating 5-minute blocks of statistical (STAT) pure tone streams, in which tones statistically grouped as triplets, and random (RDM) streams, in which the succession of tones was random (see Fig. 1 and Methods).
Stimulus (tone) presentation rate was 5.505 Hz. If statistical regularities (tritones) embedded in the auditory stream are detected during sleep, a brain response will emerge at the frequency of occurrence of each tritone onset (1.835 Hz, i.e. one third of the tone rate) in the STAT stream, alongside tone frequency-tagged auditory responses (5.505 Hz) present both in STAT and RDM conditions that reflect the mere detection of auditory stimulations. About half of the participants (n = 11) reached sufficient levels of NREM sleep to receive at least 5 minutes of STAT auditory stimulation. In a second step, all participants (previously exposed vs. not) were scanned again in the wakefulness state while exposed to the STAT and RDM auditory streams. At this stage, frequency-tagged MEG analyses aimed at testing whether participants previously exposed to the STAT stream during sleep would exhibit better learning (as compared to those not exposed), i.e. would detect the tritone structure in the material faster, which would evidence a transfer of the learned information from sleep to wakefulness.
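The relation between the two tagged frequencies and the transitional-probability structure of the streams can be sketched in a few lines of Python. The tone labels, the number of tritones (four), and the unconstrained random ordering are assumptions for illustration; the actual stimulus set and ordering constraints are described in Methods and may differ.

```python
import random
from collections import Counter

TONE_RATE_HZ = 5.505                  # reported tone presentation rate
TRITONE_RATE_HZ = TONE_RATE_HZ / 3    # one tritone onset every three tones

# Hypothetical tone labels grouped into four tritones (not the study's stimuli).
TRITONES = [("A", "B", "C"), ("D", "E", "F"), ("G", "H", "I"), ("J", "K", "L")]


def stat_stream(n_tritones, rng):
    """Concatenate randomly chosen tritones: fixed order within each tritone."""
    stream = []
    for _ in range(n_tritones):
        stream.extend(rng.choice(TRITONES))
    return stream


def rdm_stream(n_tones, rng):
    """Draw every tone independently: no grouping structure."""
    tones = [tone for tritone in TRITONES for tone in tritone]
    return [rng.choice(tones) for _ in range(n_tones)]


def transitional_probabilities(stream):
    """Estimate P(next | current) for every adjacent pair observed in the stream."""
    pair_counts = Counter(zip(stream, stream[1:]))
    first_counts = Counter(stream[:-1])
    return {pair: c / first_counts[pair[0]] for pair, c in pair_counts.items()}


rng = random.Random(0)
tp_stat = transitional_probabilities(stat_stream(1000, rng))
tp_rdm = transitional_probabilities(rdm_stream(3000, rng))

within_pairs = {pair for tri in TRITONES for pair in zip(tri, tri[1:])}
within_tp = [tp_stat[pair] for pair in within_pairs]
across_tp = [p for pair, p in tp_stat.items() if pair not in within_pairs]

print("tritone rate (Hz):", round(TRITONE_RATE_HZ, 3))
print("STAT, within tritones:", min(within_tp), "to", max(within_tp))
print("STAT, across boundaries: max", round(max(across_tp), 2))
print("RDM: max", round(max(tp_rdm.values()), 2))
```

By construction, transitions inside a tritone are fully predictable in the STAT-like stream, whereas transitions across tritone boundaries and all transitions in the RDM-like stream are much less predictable. This contrast is the only cue available for segmenting the stream into three-tone units.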
Fatigue and sleepiness
Subjective fatigue and sleepiness scores before and after the nap opportunity phase are reported in Table 1. A repeated measures ANOVA conducted on subjective sleepiness scores with the within-subject factor Moment (pre-Nap vs. post-Nap) and the between-subjects factor Group (NREM/Exposure vs. No NREM/No Exposure) failed to disclose significant main or interaction effects (ps > 0.09), suggesting similar sleepiness levels before and after the nap in both groups. A similar repeated measures ANOVA conducted on subjective fatigue scores disclosed a main Moment effect (F(1,16) = 5.0, p = 0.04), with higher subjective fatigue after than before the nap episode. Main Group and Moment by Group interaction effects were non-significant (ps > 0.5).
Frequency-tagged responses to STAT and RDM streams during NREM sleep
Eleven participants were included in the NREM/Exposure condition as they were exposed to at least 5 minutes of the STAT auditory stream during NREM2 and/or NREM3 sleep stages (2 participants were exposed only to the STAT, but not to the RDM stream). On average, participants were exposed to STAT streams during 4.1 +/− 2.3 minutes in stage NREM2 and 5.9 +/− 4.7 min in stage NREM3 (Wilcoxon signed-rank test, W = 48.0, p = 0.40), and to RDM streams during 3.0 +/− 3.8 minutes in stage NREM2 and 7.2 +/− 5.7 min in stage NREM3 (W = 33.5, p = 0.073). Global sleep parameters derived from polysomnographic recordings are reported in Table 2. For the NREM/Exposure condition, individual hypnograms illustrating the timing of exposure to STAT and RDM streams are reported in Supplementary Material Fig. S1, and individual duration of exposure per sleep stage and stream is reported in Supplementary Material Table T1. For the No NREM/No Exposure group, individual duration of (insufficient) exposure to RDM and STAT streams is reported in Supplementary Material Table T2.
First, we assessed global/overall segmentation during the sleep exposure. To do so, power was computed on averaged continuous 5-min streams. Topographies of tone- and tritone-related responses as well as SNR and power spectra are illustrated in Fig. 2. Individual topographies are available as Supplementary Material Fig. S2. Note that comparisons between STAT and RDM streams were conducted on 9 participants, excluding the two participants exposed to STAT streams only during sleep. Neural responses tagged at the tone presentation rate during sleep were found in all participants but one (who was excluded from further analyses), reflecting auditory stimulation processing. Tone-SNR was particularly strong (SNR > 10) in bilateral temporal sensors, and did not differ between STAT and RDM sequences (cluster-based permutation test; p > 0.1; Fig. 2, top panel). At the tritone presentation rate however, no frequency-tagged response was evidenced either for STAT or RDM streams, and tritone-related SNR did not differ between streams (cluster-based permutation test; p > 0.09; Fig. 2, bottom panel). Lack of between-group differences at the tritone level was substantiated by a Bayesian paired sample t-test in favour of the null hypothesis (BF10 = 0.33). At the tone level, the Bayes factor for between-group differences was inconclusive (BF10 = 0.43), although far from the BF value > 3 needed to support the alternative hypothesis of between-group differences. Similar sensor space analyses performed at the first harmonic of each frequency of interest (FOI) disclosed similar results (see Supplementary Material Fig. S3). SNR and power spectra (Fig. 2, right column) disclosed an emergent peak at the tone-related frequency in both RDM and STAT streams. In contrast, no SNR or power spectra peaks emerged at the tritone-related frequency or its first harmonic.
These results evidence the successful auditory perception of individual tones during sleep, but no learning of the auditory regularities.
Second, we examined using linear mixed models the temporal evolution of tone- and tritone-related responses during the first 5 minutes of continuous STAT and RDM streams. The time course of the tritone- and the tone-related frequency-tagged responses is illustrated in Fig. 3. For tritone-related responses, random effects included the intercepts. The model that best accounted for tritone-related SNR included the predictor STREAM, i.e., tritone-related SNR ~ STREAM (notation A ~ B indicates A depends on B), but the effect was not significant (p > 0.1). For tone-related responses, random effects included the intercepts and the slopes for the MINUTES predictor. The most appropriate model was the one including the continuous MINUTES predictor (i.e., SNR ~ MINUTES), but the effect was also not significant (p > 0.2).
Frequency-tagged responses to STAT and RDM streams during subsequent Wake
One could argue that the subset of participants stimulated during sleep merely had poor learning abilities. Also, it cannot be excluded that even in the absence of overt brain responses during sleep, exposure to auditory regularities during sleep would facilitate subsequent learning. To address these issues, all participants went back in the MEG scanner 30 minutes after the end of the nap session. They were instructed to stay awake, keep the eyes on a fixation point, and quietly listen to auditory streams (2 times 5 minutes of STAT and RDM streams, counterbalanced). No mention was made of regularities.
Tone- and tritone-related frequency-tagged responses (whole population)
In a first step, we assessed overall segmentation over the entire population (n = 21), pooled, during awake passive listening of STAT and RDM streams, to identify sensors of interest (SOIs). Segmentation corresponded to the averaged 5-min periods for each stream type. Overall power spectra and topographies are illustrated in Fig. 4.
The analysis disclosed a robust tone frequency-tagged response for both STAT and RDM streams. Tone-related SNR was particularly high (SNR > 40) in bilateral temporal sensors. No tritone frequency-tagged response was evidenced during RDM streams. In contrast, tritone-related SNR for STAT streams was locally increased. A cluster-based permutation test disclosed differences in tritone-related SNR between STAT and RDM streams (p < 0.05) in left and right temporal sensors. SNR and power spectra (Fig. 4, right column) disclosed an emergent peak at the tone-related frequency (and its first harmonic) in both RDM and STAT streams. An emergent peak at the tritone-related frequency (and its first harmonic) was only visible in STAT streams. Sensor space analyses performed on the first harmonic for each frequency of interest (FOI) disclosed similar results (Supplementary Material Fig. S4), reflecting the successful detection of statistical auditory regularities at wake.
Tone- and tritone-related frequency-tagged responses in Sensors of Interest [SOIs] at Wake in NREM/Exposure vs. No NREM/No Exposure participants
Subsequent analyses compared frequency-tagged responses in temporal sensors of interest (SOIs) between participants exposed to the STAT streams during the prior sleep episode (n = 11) and participants without prior exposure due to lack of stable NREM sleep (n = 10). Topographies of tritone-related responses for each group are illustrated in Fig. 5.
Tritone-related SNR was averaged within the sensors indicating a global response (i.e., in the clusters identified in the preceding step; see Fig. 4). Mean tritone-related SNR in STAT streams was 1.85 +/− 0.67, greater than the baseline value of 1 (t(20) = 5.8, p < 0.001). Mean tritone-related SNR in RDM streams was 1.09 +/− 0.38, not different from baseline (t(20) = 1.1, p = 0.3). A repeated measures ANOVA with within-subject factor STREAM (STAT vs. RDM) and between-subjects factor GROUP (Prior Sleep/Exposure vs. Prior Wake/No Exposure) disclosed a main effect of STREAM (F(1,19) = 19.4, p < 0.001), no main GROUP effect (F(1,19) = 0.24, p = 0.4) and no STREAM by GROUP interaction effect (F(1,19) = 0.23, p = 0.6). Separate analyses conducted in left and right clusters disclosed similar results. To probe hemispheric laterality effects, a repeated measures ANOVA with within-subject factors SEQUENCE (STAT vs. RDM) and CLUSTER (LEFT vs. RIGHT) and between-subjects factor GROUP (NREM/no NREM exposure) was conducted. It again disclosed a main effect of SEQUENCE (F = 5.78, p = 0.0016), but no main effect of GROUP or CLUSTER (ps > 0.5), nor any interaction effect (ps > 0.1). Altogether, these results suggest that prior exposure to the STAT stream during sleep did not facilitate the detection of statistical regularities during subsequent wakefulness.
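As a quick arithmetic check, the two one-sample t statistics reported above can be recovered from the summary values alone. This sketch assumes that the +/− values are sample standard deviations and that n = 21 (consistent with 20 degrees of freedom); the function name is introduced here for illustration.

```python
import math


def one_sample_t(mean, sd, n, baseline=1.0):
    """t statistic for testing a sample mean against a baseline, from summary values."""
    return (mean - baseline) / (sd / math.sqrt(n))


# Summary values as reported in the text (SD interpretation is an assumption).
t_stat = one_sample_t(mean=1.85, sd=0.67, n=21)
t_rdm = one_sample_t(mean=1.09, sd=0.38, n=21)

print("STAT:", round(t_stat, 1))
print("RDM:", round(t_rdm, 1))
```

Under these assumptions, the rounded values match the reported t(20) = 5.8 for STAT streams and t(20) = 1.1 for RDM streams.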
Finally, frequency-tagged responses to regularities were previously shown to develop at wake within the first 3 minutes of exposure40. Using linear mixed models, we tested for potential between-group differences in the temporal dynamics of tone- and tritone-related responses, i.e. whether the detection of regularities at wake developed faster in participants previously exposed to the STAT stream during sleep. Results show that the temporal evolution (from first to fifth minute) was not different between participants previously exposed vs. not exposed to the STAT streams during sleep (see Supplementary Material Results Section R1 and Supplementary Material Figs S5-7). Thus, the analyses consistently show that complex stimulus-stimulus associations embedded in statistical regularities remain undetected during sleep, whereas their brain tagging robustly emerges in the same individuals when awake (see individual topographies in Supplementary Material Fig. S2a).
Behavioural results (2AFC test)
At the 2AFC test immediately following the Wake (2nd) session, participants’ mean (+/− SD) recognition performance for both groups pooled was 46.4 +/− 15.0% (range: 25–88%), not different from the 50% chance level (Wilcoxon test, V = 37, p = 0.20). Additionally, we computed a Bayesian unilateral one-sample t-test with chance level as the test value. BF10 was 0.12, providing evidence in favour of the null hypothesis. Mean recognition scores were 49.4 +/− 19.4% in the No NREM/No Exposure group and 43.8 +/− 9.7% in the NREM/Exposure group (Fig. 6). Scores were not different between the two groups (Wilcoxon test, W = 61.5, p = 0.67). Lack of between-group differences was substantiated by a Bayesian paired sample t-test in favour of the null hypothesis (BF10 = 0.24). Participants reported no awareness of the presence of tritones during the stream exposure phase. Additionally, none of the participants reported hearing the streams during sleep.
We took advantage of the power of MEG frequency-tagged responses analyses adapted to the study of statistical learning to investigate whether adults in NREM sleep can detect, and learn, statistical regularities embedded within auditory streams. Frequency-tagged analyses evidenced robust frequency-tagged responses at the tone presentation rate during NREM sleep, but no evidence of segmentation, i.e. no tritone frequency-tagged response during sleep. Nonetheless, all participants successfully segmented the STAT streams during the ensuing wake period, and the temporal evolution of the segmentation process was not different between participants previously exposed vs. not exposed to the STAT streams during NREM sleep. Hence, our results indicate that complex stimulus-stimulus associations embedded in statistical regularities might remain undetected during NREM sleep.
Exposure to auditory streams during NREM sleep elicited robust temporal bilateral responses tagged at the tone presentation rate for both STAT and RDM streams, corroborating studies showing auditory steady-state responses during sleep48,49. Tone-related SNR topographies were closely similar to those disclosed at exposure during wakefulness, suggesting that similar neural resources, most likely auditory areas, support the auditory processing of tones during sleep and wakefulness. There is evidence for similar responsiveness and properties of cortical activity in primary auditory cortices during wake and sleep states13,16,50,51,52,53. Note that SNR calculations do not allow direct comparisons of amplitude between sleep and wake periods, because background EEG/MEG noise changes with the state48. Our use of robust frequency-tagged MEG responses (as compared to ERP experiments) allowed not only group-level statistics but also the assessment of auditory responses at the individual level. All participants but one exhibited clear auditory responses at the tone frequency level during NREM sleep both in STAT and RDM streams. The reason for the lack of auditory response in this participant is unclear. One possibility is that the MEG signal was poor (although the same participant exhibited clear auditory responses during the wake period). Another possibility would be that this participant had elevated arousal thresholds, precluding correct information transmission to auditory cortices. Indeed, levels of information transfer from thalamus to cortex are influenced by sound intensity in rats54, and auditory thresholds may vary from one person to another. Notwithstanding, individual tone-related frequency-tagged auditory responses ensured that deficient auditory processing at lower, primary processing levels was not the cause for lack of learning during sleep.
Despite preserved processing of auditory tones, our analyses failed to evidence segmentation at the tritone frequency of STAT auditory streams during NREM sleep, which would have suggested preserved learning capabilities. More precisely, tritone frequency-tagged responses were at baseline level (SNR = 1) during both STAT and RDM streams. Although a null result could always be accounted for by underpowered samples, or by a too weak SNR in slow frequencies (especially taking into account that during sleep, background EEG/MEG power of low frequencies <2 Hz is drastically increased, and may contribute to noise levels in our SNR calculations), we are confident that the absence of tritone-related responses genuinely reflects lack of segmentation, for several reasons. First, data inspection shows that noise levels in power spectra were similar between sleep and wake conditions, as can be seen in our figures. Second, we also observed a lack of frequency-tagged responses at the first harmonic (F = 3.67 Hz) of the tritone presentation rate, i.e. outside the NREM3 slow oscillation range. Third, tone-related SNR was not different between STAT and RDM streams. Finally, a high inter-individual variability (which is often the case in implicit learning paradigms) cannot be excluded, with the consequence that some participants learned while others did not. However, once again the high SNR of frequency-tagged responses allowed us to carefully examine responses individually, and in no participant was a peak visible in the spectra at the tritone frequency (whereas tone-related responses were clearly present), even in those exposed up to 20 minutes to the STAT stream. Also, there was no evidence for differences in the evolution of the tritone segmentation process during the ensuing wake period. Altogether, this supports the conclusion that participants were unable to segment streams based on transitional probabilities during their sleep.
Why would statistical learning not be possible during sleep? Learning simple stimulus-response associations25,26,55 was already evidenced during NREM sleep in adults, as well as vowels and regularity detection in seemingly sleeping infants29,30,31. In rodents, electrical stimulation of the reward-related medial forebrain bundle during sleep following spontaneous activity in place cells was shown to lead to goal-directed behaviours during subsequent wakefulness56. Although this suggests that some forms of learning are possible during sleep, other studies reported no traces of memory formation during sleep for more complex learning abilities such as retention of lists of words3 or real sounds words34. Abolished awareness during sleep does not seem to be a sufficient explanation for lack of higher-order learning, since unconscious learning of more complex forms than conditioning was evidenced during wakefulness, for instance the retention of sequences of symbols unconsciously perceived57. Intriguingly, tone-induced conditioned respiratory responses transferable to wakefulness were evidenced during NREM but not REM sleep25. This is the opposite of a perceptual noise-memory paradigm in which participants detect repeating noise segments58, where exposure during REM sleep was beneficial but exposure during deep NREM3 sleep exerted a detrimental effect on subsequent performance at wake28. As we only investigated NREM sleep in the present study, it remains to be ascertained whether statistical learning is also hindered during REM sleep, or can take place as in this latter study28.
However, another study investigating high-level speech parsing during sleep found comparable neural tracking of stimulus acoustics across sleep and wakefulness regardless of speech intelligibility, whereas neural tracking of higher-order linguistic constructs such as words or sentences was observed for intelligible speech during wakefulness only, and was not detectable at all during either NREM or REM sleep59. As our paradigm similarly involves the high-order detection of statistical regularities embedded within auditory streams, we may expect lack of learning during REM sleep as well, a prediction that remains to be tested.
In this experiment, we provided evidence that statistical learning seems not to develop during NREM sleep. However, statistical learning is a process that occurs at least partially implicitly60,61 and learning statistical regularities between syllables was evidenced in REM-like active sleep in newborns30. Previous studies evidenced learning during human sleep restricted to perceptual learning28,29 or conditioning25,26, the latter being subtended by diencephalic and/or subcortical structures (e.g. hippocampus or amygdala). Instead, statistical learning is mostly subtended by medial temporal lobe activity62. Although the directionality of the information flow reverses during NREM sleep (from cortex toward hippocampus during wakefulness; from hippocampus toward cortex during NREM sleep63), memory-related sound presentation during NREM sleep elicits hippocampal activity in humans64 and rodents65. Furthermore, trace conditioning (such as tone-odour associations in the Arzi et al. study25) also relies on hippocampal activity. This makes it unlikely that a disconnection between the hippocampus and sensory areas could account for lack of statistical learning. However, neural structures underlying statistical learning encompass not only medial temporal regions but also a large fronto-parieto-temporal network66,67,68. Reduced metabolic activity in these regions during NREM sleep69 might contribute to the difficulty of forming novel representations in a statistical learning paradigm during sleep.
In a local-global oddball paradigm, top-down detection of global changes in tone streams was abolished during NREM sleep, whereas bottom-up local changes in auditory stimulation remained partially detected19. Similarly, exposure to either pseudo-words or comprehensible sentences during NREM sleep elicited increased activity in thalamus and primary auditory cortex as during wakefulness, but there was no activation of higher-order cortical areas involved in language processing (e.g., Broca’s area)70. Wernicke’s area was nevertheless still activated during NREM sleep exposure in the latter study70, but brain responses were similar between pseudo-words and comprehensible sentences, contrary to wakefulness. The authors suggested, in line with Strauss et al.19, that bottom-up processes are partially preserved during sleep, but that top-down feedback processes are abolished. Makov et al.59 also showed that alongside preserved low-level auditory processing, higher-level hierarchical linguistic parsing is severely disrupted during sleep. Finally, it was proposed that reduced cortico-cortical connectivity during NREM sleep drastically reduces the brain’s ability to integrate information71. Altogether, deactivation of higher processing brain structures, disrupted top-down processes and reduction of cortico-cortical connectivity during NREM sleep may explain the impaired ability to learn novel high-order statistical regularities.
In the studies mentioned above as well as in the present one, auditory stimulation was delivered regardless of synchronicity with sleep oscillations. Sleep is not a uniform state, and slow waves or sleep spindles are known to strongly affect sound processing. Although it was suggested that sleep spindles originating from the thalamus disrupt auditory perception72, rodent data challenged this thalamic gating hypothesis by showing preservation of auditory responses in primary auditory cortices during spindles73. Similarly in humans, thalamic and primary auditory activation patterns seem similar during sleep and wakefulness70. Still, evidence suggests that auditory processing in higher cortical regions is modulated by the occurrence of sleep spindles, but also by neural bi-stability occurring during NREM sleep, i.e. the fact that sounds are differentially processed during the up and down phases of the slow oscillations16,34. These studies indicate that the “gating” may originate more at the cortical than the thalamic level. Such modulations, even if not blocking the primary processing of tones, may impede activity in the higher cortical areas necessary for detection and retention of statistical regularities. Finally, the K-complexes observed in response to sensory stimulation74, viewed as mechanisms against cortical arousals75, are also known to feature extended cognitive processing of salient stimuli. For instance, activation of the primary auditory cortex increases in response to tones followed vs. not followed by K-complexes13. Although this is only a tentative explanation, interactions between spindles, slow oscillations and K-complexes might also prevent or enhance auditory processing in a non-linear fashion, precluding the learning of statistical regularities displayed in a continuous manner.
Finally, another possibility to account for a lack of learning of statistical regularities during NREM sleep would be that the continuous and rapid succession of stimuli in the auditory streams did not provide sufficient time for plasticity to occur. Indeed, it was shown in TMR studies that tones delivered during a spindle not only result in a disruption of sound perception at higher cortical levels but also disrupt the spindle itself73,76,77. TMR benefits were abolished when two successive sounds were presented at a short interstimulus interval (<1.2 sec), suggesting that a minimal duration is needed to integrate novel information before being able to process the next one during sleep76,77. Spindles are also known to promote plasticity in cortical regions78, and are related to sleep-dependent memory consolidation processes79,80,81,82,83,84. Interestingly, in the Arzi et al.25 tone-odour conditioning study during sleep, exposure to tones during learning triggered increases in the spindle-related frequency band, again suggesting that spindles play an active role in the encoding of new information. Relatedly, the percentage of trials containing slow frontal spindles correlated with the neurophysiological markers of perceptual learning upon awakening in the Andrillon et al. study28. It is thus possible that the continuous and fast (approximately 5.5 Hz) delivery of tones in the present study disrupted sleep spindles and prevented memory formation.
Although no participant successfully segmented STAT streams during NREM sleep, frequency-tagged responses were significantly increased at the tritone frequency during exposure to STAT (vs. RDM) streams in the Wake condition. Enhanced tritone-related neural responses developed linearly over time in the first continuous five minutes of STAT streams. Topographies of tritone responses were closely similar to those of our prior study40, with clearly increased responses in bilateral temporal sensors and particularly strong effects in left temporal sensors. Finally, linear increases were also similar (slopes and values) between our two studies. At variance with this prior study40, however, tritone responses were no longer present at the second exposure to STAT streams in the Wake session (thus after exposure to RDM streams following the first exposure to STAT streams). This suggests that whereas participants were sensitive to statistical regularities during the first exposure, there was no learning/relearning at the second exposure to the STAT streams. One hypothesis to account for this lost effect is that exposure to the RDM stream between the two STAT sequences exerted an interference effect on the learned material and prevented relearning, which would however contradict our previous report40. Dissimilarities between the prior and current experiments might account for the different effects. In our first study, participants were allowed a short break at their convenience after the first two 5-minute streams. In the present study, the 20-minute session was continuous. A 20-minute stimulation sequence may have been overwhelming for our participants, especially considering that they spent more than 4 hours in the MEG scanner environment and listened to habituation streams for a large part of the time when not exposed to STAT or RDM streams. In our first study, only 30 minutes were spent in the MEG scanner.
Sleepiness, fatigue and decreased attention paid to the auditory stream may thus have played a role. Accordingly, subjective fatigue paradoxically increased in both groups after the nap opportunity (sleepiness being unchanged). Also, participants were lying in a supine position, whereas they were seated in the first study, which may have induced position-related increases in sleepiness and inattentiveness to the auditory streams. Notwithstanding, the expected learning effects were found during the first presentation of the STAT stream. If attention had decreased or fluctuated with time, then our MEG index should also have progressively decreased or fluctuated. This was not the case, as it increased linearly during these first 5 minutes, suggesting that this marker is sensitive to learning.
At the behavioural level, performance on the 2AFC task was again at chance level, suggesting that learning of the statistical regularities as evidenced by tritone frequency-tagged responses was implicit. Recent work suggests that learning as assessed by the 2AFC task is largely contaminated by explicit knowledge42,85, whereas indirect measures of learning (such as target detection tasks or ERP measures, and in our case tritone frequency-tagged responses) are more sensitive to implicit learning mechanisms42. Altogether, this suggests that tritone frequency-tagged responses during waking exposure reflect implicit learning processes. Regarding tone frequency-tagged responses, although we did not find a significant decrease in the global response at the tone frequency, minute-by-minute analysis showed that the tone response significantly decreased in the STAT (as compared to RDM) streams during the first minutes of exposure, replicating our initial findings40. By contrast, during sleep, no difference was found between exposure to STAT and RDM streams. Whereas the relationship between tritone response and stream segmentation is fairly straightforward, interpreting a decrease in tone-related responses is more ambiguous. Several hypotheses can be proposed. First, decreased tone-related SNR in STAT streams may be related to basic (likely bottom-up) processes underlying neural sensitivity to distance lag or repetition (i.e., the time lag that separates two occurrences of the same stimulus40). Indeed, time lags are inevitably different between STAT (lag range 6–12) and RDM (lag range 2–12) streams, and sensitivity was previously shown to be higher at smaller distance lags86, possibly enhancing tone-related responses in RDM streams. An alternative but not exclusive possibility is that diminished tone-related SNR in STAT streams reflects a form of sensitivity to statistical regularities that is not sufficient to perform stream segmentation efficiently.
For instance, participants could detect that the tone C# is always followed by the tone G, but not segregate tones into tritones. Decreasing tone-related responses would then be related to a repetition-suppression effect driven by top-down expectations87. Nonetheless, tone-related response differences between STAT and RDM streams were abolished during NREM sleep. Future studies assessing statistical learning using frequency-tagged responses should investigate in detail the mechanisms and implications of tone-related frequency changes.
To sum up, we showed in the present study that in young healthy participants who proved able to implicitly learn to segment auditory streams during wakefulness, segmentation processes seem abolished during NREM sleep. Analysis of frequency-tagged responses showed that sleeping participants kept processing auditory information at the tone level, but that auditory processing was not influenced by the presence of statistical regularities in the auditory stream. Therefore, the lack of tritone-related frequency-tagged magnetic responses suggests that statistical regularities remained undetected during NREM sleep.
Twenty-six participants (5 males, mean age: 23.7 +/− 2.7 years, range: 19–28) were recruited through public announcement at the Université libre de Bruxelles (ULB, Belgium). The ULB-Hospital Erasme Ethics Committee (Accreditation 021/406) approved the study and all participants gave written, signed informed consent prior to the experiment. All methods were performed in accordance with the relevant guidelines and regulations for the safe use of magnetoencephalography and electroencephalography. Participants were screened for their ability to sleep in an unfamiliar environment, and must not have already been recruited for a similar study. Five participants were excluded: 2 due to technical problems during the experiment, 2 for poor EEG signal and 1 due to a lack of tone-related auditory responses during exposure. Descriptions and analyses are restricted to the 21 remaining valid participants. All were in good health with reported normal hearing and no history of neurological or psychiatric disorders, non-musicians, right-handed (Edinburgh Handedness Inventory88 mean laterality score +/− sd = 78 +/− 20) with normal anxiety levels (State-Trait Anxiety Inventory89 French version A mean score +/− sd = 28 +/− 5; version B mean score +/− sd = 38 +/− 7), satisfactory sleep quality (Pittsburgh Sleep Quality Index90 global score +/− sd = 3.9 +/− 1.7), and neutral or intermediate chronotype (Morningness-Eveningness Questionnaire91 mean score +/− sd = 51.6 +/− 12.1). During the experiment, participants’ sleep quality and quantity for the preceding night were evaluated using the St-Mary’s Hospital sleep questionnaire92. They were not sleep-deprived and were asked to keep regular sleep habits the night before, and to avoid energizing drinks on the day of the experiment.
Participants were assigned a posteriori to one of two groups depending on their level of exposure to the auditory material during the proposed experimental nap. Participants exposed to at least 5 minutes of the statistical (STAT) stream during sleep were assigned to the NREM/Exposure group (N = 11), whereas participants not exposed, or exposed for less than 5 minutes, to the STAT stream were assigned to the No NREM/No Exposure group (N = 10). Chronotype, anxiety, usual sleep quality (PSQI) and laterality levels did not differ between groups (Wilcoxon-Mann-Whitney test; ps > 0.1).
Three different types of auditory streams were constructed: statistical (STAT), random (RDM) and habituation (HAB) streams. All streams were composed of twelve sinusoidal pure tones generated with Matlab 2011 (Mathworks Inc., Natick, USA). Tones were all of 150 ms duration (5 ms rise and fall) with a 25 ms inter-stimulus interval, recorded at a 96 kHz sampling rate, and ranged by half tones from C to B in a single octave in the English musical notation scheme (A = 440 Hz). Stimulus delivery and response collection were controlled using Psychtoolbox-393 running on Matlab 2011 (Mathworks Inc., Natick, USA). Sounds were delivered via a 60 × 60 cm2 MEG-compatible high-quality flat panel loudspeaker (Panphonics SSH sound shower, Panphonics, Espoo, Finland).
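As an illustration of the tone parameters given above, the following minimal Python sketch synthesizes one of the twelve tones. It is not the authors' Matlab code. Three points are assumptions not stated in the text: equal-temperament tuning, the octave running from C4 to B4 (so that A is 440 Hz), and a linear shape for the 5 ms rise and fall. The function names are hypothetical.

```python
import math

SAMPLE_RATE = 96_000    # Hz, sampling rate given in the text
TONE_DURATION = 0.150   # s, tone duration given in the text
RAMP_DURATION = 0.005   # s, rise and fall time given in the text
A_REFERENCE = 440.0     # Hz, tuning reference given in the text
NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def note_frequencies():
    """Frequencies of the twelve half tones from C to B.

    Assumes equal temperament and the octave that contains A = 440 Hz;
    A lies 9 half tones above C within that octave.
    """
    return {name: A_REFERENCE * 2 ** ((i - 9) / 12)
            for i, name in enumerate(NOTE_NAMES)}


def synthesize_tone(frequency):
    """Return the samples of one pure tone with linear onset/offset ramps."""
    n_samples = round(SAMPLE_RATE * TONE_DURATION)
    n_ramp = round(SAMPLE_RATE * RAMP_DURATION)
    samples = []
    for n in range(n_samples):
        # Gain rises linearly over the first 5 ms and falls over the last 5 ms
        # (the linear ramp shape is an assumption).
        gain = min(1.0, n / n_ramp, (n_samples - 1 - n) / n_ramp)
        samples.append(gain * math.sin(2 * math.pi * frequency * n / SAMPLE_RATE))
    return samples


frequencies = note_frequencies()
tone_c = synthesize_tone(frequencies["C"])
```

Each tone would then be followed by 25 ms of silence to form the inter-stimulus interval.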
Statistical and random streams
Construction of statistical and random streams is described elsewhere40. Only essential information is reported here. Statistical (STAT) streams composed of a set of four possible tritones (G#CD, AC#G, FA#D# and EF#B) were used both in the Nap opportunity and subsequent Wake exposure phases. Tritones were pseudo-randomly determined, with the constraints that no transitional probabilities (TPs) were shared between two sets of tritones, and that tritones neither started nor ended with similar tones. The tritones did not sound melodious with regard to harmonious musical standards, to prevent the mental construction of a tune while listening. Within a STAT stream, the four possible tritones were randomly delivered, with the constraint that immediate repetition of the same tritone was proscribed. TPs between individual tones within a tritone and between tritones were thus 100% and 33% respectively (see streams in Fig. 1C, and transition probabilities matrices in Supplementary Material Fig. S8). Random (RDM) streams were composed of the same individual tones concatenated in random order, with the only constraint that a tone was not repeated twice in a row. TPs between tones were thus 9%. In both STAT and RDM streams, an additional blank interval of 20 ms was introduced every three tones (i.e., between tritones in statistical streams) to enhance frequency-tagged responses39. Notably, this additional interval was shown to be unconsciously perceived94, and no frequency-tagged response at the blank interval presentation rate (i.e., identical to the tritone rate) was found in RDM streams in our prior study using the same material40, which supports the assumption that these 20 ms blanks are not consciously detected. Stimulus presentation rates were 5.505 Hz for individual tones (i.e., tone duration + inter-stimulus interval + a third of the subliminal pause) and 1.835 Hz for tritones (i.e., three times smaller than the tone rate).
Tone and tritone presentation rates determined the frequencies of interest (FOIs) for the steady-state analysis of the MEG signal (see below).
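The transitional probabilities and presentation rates described above can be checked with a short Python sketch. This is an illustration built only from the constraints stated here (no immediate tritone repetition in STAT streams, no immediate tone repetition in RDM streams), not the authors' stimulus code; the original streams may have obeyed additional constraints described in the prior study40. The function names, stream lengths and random seed are hypothetical.

```python
import random
from collections import Counter, defaultdict

TRITONES = [("G#", "C", "D"), ("A", "C#", "G"), ("F", "A#", "D#"), ("E", "F#", "B")]
TONES = sorted({tone for tritone in TRITONES for tone in tritone})


def make_stat_stream(n_tritones, rng):
    """Concatenate randomly drawn tritones, never repeating one immediately."""
    stream, previous = [], None
    for _ in range(n_tritones):
        current = rng.choice([t for t in TRITONES if t != previous])
        stream.extend(current)
        previous = current
    return stream


def make_rdm_stream(n_tones, rng):
    """Concatenate randomly drawn tones, never repeating one immediately."""
    stream = [rng.choice(TONES)]
    while len(stream) < n_tones:
        stream.append(rng.choice([t for t in TONES if t != stream[-1]]))
    return stream


def transitional_probabilities(stream):
    """Empirical P(next tone | current tone) estimated from a stream."""
    counts = defaultdict(Counter)
    for current, following in zip(stream, stream[1:]):
        counts[current][following] += 1
    return {current: {nxt: c / sum(successors.values())
                      for nxt, c in successors.items()}
            for current, successors in counts.items()}


rng = random.Random(0)  # fixed seed so the sketch is reproducible
stat_stream = make_stat_stream(5000, rng)
rdm_stream = make_rdm_stream(15000, rng)
stat_tp = transitional_probabilities(stat_stream)
rdm_tp = transitional_probabilities(rdm_stream)

# Presentation rates: tone + inter-stimulus interval + one third of the 20 ms blank.
tone_period_ms = 150 + 25 + 20 / 3
tone_rate_hz = 1000 / tone_period_ms
tritone_rate_hz = tone_rate_hz / 3
```

In such a simulation, within-tritone transitions (e.g. G# to C) have a probability of 1, the last tone of a tritone is followed by one of three possible tones with a probability close to 1/3, and each tone in the RDM stream is followed by any of the 11 other tones with a probability close to 1/11 (about 9%).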
During the nap opportunity in the MEG scanner, participants were presented with habituation (HAB) streams when not exposed to STAT or RDM streams. The same twelve tones were continuously played following a natural and expected ascending then descending musical scale succession order (e.g. C/C#/D/D#/E/F/F#…; see Fig. 1C). In 30% of the cases (on average every 12.5 +/− 11.3 seconds, range 2–60 seconds), the ascending scale was repeated, in which case participants had to press a button box. The task was very simple, and participants’ performance was 100% while awake. The purpose of HAB streams was twofold. First, continuous presentation of HAB streams aimed at acclimatizing participants to the acoustic environment and at diminishing the probability of arousal when presenting the experimental STAT and RDM streams during sleep. Second, HAB streams aimed at inducing boredom and focusing the participants’ attention on the tones in a monotonous setting to facilitate sleep onset.
Two alternative forced choice (2AFC) task
The 2AFC task aimed at behaviourally probing knowledge of the embedded regularities. It was identical to the one used in our previous study at wake40. A concurrent set of four possible tritones (TEST set; DG#F/D#A#A/BC#F#/GCE) was built using the same constraints as for the STAT stream. For each of the 16 trials, two tritones (one from the STAT set and one from the TEST set) were successively presented with a 1-sec interstimulus interval. The 4 tritones of the STAT set were pseudo-randomly combined with the 4 tritones of the TEST set. Hence, each tritone was presented 4 times, each time in a different combination. The order of presentation (first or second within the pair) and the association between tritones from the STAT and TEST sets were counterbalanced.
Participants entered the MEG laboratory around 12:30 and started by filling in questionnaires about sleep quality, chronotype, laterality and anxiety (see above), then were prepared for MEG and polysomnography (PSG) recordings (see below).
Sleep opportunity session
At about 2:00 pm, participants were installed in the MEG scanner in a comfortable supine position, and informed that they would stay in the MEG scanner for about 90 minutes. They were instructed to stay still with their eyes closed while listening to the HAB streams, and to respond by pressing a button box (fORP; Current Designs Inc.) when required (i.e., when there was a repetition of an ascending scale) until they fell asleep. The loudness of the habituation stream was individually adjusted at their convenience, and they were informed that if auditory streams hindered their sleep, they were allowed to ask for the volume to be lowered (which was the case for one participant). Ambient lights were either totally switched off or kept at very low intensity at the participant’s convenience. They were not informed that STAT and RDM streams would be presented during sleep. Polysomnography (EEG, EOG, EMG) was monitored online throughout the 90-minute period by the main experimenter (JF). When the participant exhibited stable and unequivocal NREM2 or NREM3 sleep, presentation of STAT and RDM streams was launched (STAT-RDM-STAT-RDM or RDM-STAT-RDM-STAT, counterbalanced between subjects). Successive intermixed short (5 minutes each) periods of STAT and RDM streams were delivered to minimize possible differences in the evolution of sleep stages between stream types (e.g., to avoid comparison of STAT stream-related neural activity in NREM2 sleep with RDM stream-related neural activity in NREM3 sleep). If a (micro-) arousal or the onset of a REM sleep episode was detected on the online polysomnographic recording, the STAT or RDM stream was immediately replaced by the HAB stream, without transition, then resumed as soon as stable NREM2/NREM3 sleep conditions were restored. The maximal number of presentations during sleep was 8 × 5-minute streams (i.e. 4 STAT and 4 RDM streams).
If participants were still not sleeping after 60 minutes, they were allowed to leave the MEG room. Before and after the nap opportunity period in the MEG scanner, fatigue and sleepiness levels were assessed using the Visual Analogue Scale of Fatigue and Sleepiness95 and the Karolinska Sleepiness Scale96.
After the nap opportunity period (at about 4:00 pm), participants left the MEG room to fully wake up and dissipate sleep inertia97. After a maximum of 30 minutes, they were once again installed in the MEG scanner in a supine position, with the difference that they were required to keep their eyes open and stay awake, with the ambient lights on. They were then presented with 4 × 5 minutes of STAT/RDM streams (STAT-RDM-STAT-RDM or RDM-STAT-RDM-STAT, counterbalanced). They were instructed to quietly listen to the auditory streams while fixating a cross on the ceiling, but did not receive any explicit instructions to detect regularities.
After this second exposure session, participants were informed about the presence of regularities and presented with the 2AFC test (see above). Pairs of tritones (n = 16) were presented aurally, and participants had to indicate aloud which one of the two tritones was part of the exposure streams, either because they recognized it or because it sounded most familiar. The 2AFC test lasted around 5 minutes. Due to the small number of trials, MEG data were not recorded during this recognition task.
Polysomnography (PSG) and MEG data acquisition
PSG was acquired to determine (on-line and off-line) sleep stages. The EEG setting included 3 derivations (C3-A2, C4-A1 and Fz-A1) positioned according to the 10–20 electrode placement system47. MEG-compatible single gold-plated electrodes (Reference E6650 IMMED, Belgium) were secured on the scalp using collodion, and impedance was kept below 10 kΩ. Bipolar EOG (recording both vertical and horizontal ocular movements), bipolar EMG on the chin and bipolar ECG were additionally recorded using similar electrodes. All electrodes were directly connected to the MEG setting to ensure synchronization between EEG and MEG signals. Online monitoring allowed delivering auditory streams in correct (NREM2 or deeper) sleep conditions. Accuracy of online detection was verified, and additional analyses were conducted offline on EEG and EMG data band-pass filtered at 0.30–30 Hz and 10–100 Hz respectively, with sleep staging performed on 30-s epochs according to the AASM criteria47.
MEG data were recorded using a whole-scalp-covering 306-channel neuromagnetometer (102 sensor chipsets, each comprising one magnetometer and two orthogonal planar gradiometers) installed in a light-weight magnetically shielded room (Vectorview and MaxShield; Elekta Oy, Helsinki, Finland), the characteristics of which are described elsewhere98. The MEG signal was recorded at a sampling rate of 1 kHz using a band-pass filter set at 0.1–330 Hz. Head position was continuously monitored using four head-tracking coils. The locations of the coils and at least 150 head-surface (on scalp, nose, and face) points with respect to anatomical fiducials were digitized using an electromagnetic tracker (Fastrak, Polhemus, Colchester, VT).
To ensure accurate synchronization for frequency-tagged analyses (see below), a trigger was sent in the MEG recording every 3 tones via a parallel port Arduino system (https://www.arduino.cc). Triggers were also sent into the MEG recording when there was a repetition during HAB streams and when participants pressed the button box.
MEG data pre-processing
External interferences and head movements were corrected offline using the signal space separation (SSS) method99. In addition, MEG data of each participant were realigned using the SSS method on the first MEG run of the first participant, to align the recordings of all subjects into a common sensor space.
All reported analyses were conducted in the sensor space using Matlab R2011a and Fieldtrip100. Separate but identical analyses of MEG data were conducted during the Sleep opportunity session for participants exposed to at least 5 min of the STAT stream (NREM/Exposure group; N = 11), and during the subsequent Wake sessions for both the NREM/Exposure (N = 11) and the No NREM/No Exposure (N = 10) groups.
Overall exposure to STAT and RDM streams: In a first step, frequency-tagged analyses aimed at evidencing global segmentation-related responses during exposure to RDM (tone-related) and STAT (tone- and tritone-related) streams. These analyses were conducted only in participants continuously exposed to at least 5 minutes (i.e., 560 cycles of the tritone frequency, skipping the first 12 tones to avoid transient effects) of both STAT and RDM streams. In the Wake (second) session, all participants were by design exposed to 4 × 5 minutes of STAT/RDM streams. In the Sleep nap opportunity (first) session, the duration of exposure to STAT and RDM streams varied from one participant to the other (see Supplementary Material Fig. S1 and Table T1) in the NREM/Exposure group, with two participants exposed to 5 minutes of STAT but not of RDM streams.
In each stream condition, the 5-min data were averaged to increase the signal-to-noise ratio (SNR), thus dampening signal not phase-locked with auditory stimulus delivery. Computations on long 5-min epochs (frequency resolution 0.0033 Hz) ensured a high spectral resolution. MEG power spectra were computed using a Hanning taper Fast Fourier Transform (FFT) in the 1.5–12 Hz range. The SNR of spectral power was computed for each frequency bin as the ratio between the power value of the bin and the averaged power of the 200 neighbouring frequency bins (skipping the two closest neighbouring frequency bins), calculated on combined gradiometers (i.e. the sum of power frequencies of both radial orientations). Tone-related SNR responses during sleep were visually inspected for each participant separately (individual topographies are available in Supplementary Material Fig. S2). Grand average tritone- and tone-related SNR across all participants were calculated for each condition (STAT and RDM streams) in each session.
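The neighbour-based SNR described above can be sketched in a few lines of Python operating on an already computed power spectrum. This is an illustration, not the Fieldtrip/Matlab code used in the study. It assumes that the 200 neighbouring bins are split evenly (100 below and 100 above the bin of interest) and that "the two closest" bins means one skipped bin on each side; both are interpretations of the text. The function name, its parameters and the toy spectrum are hypothetical.

```python
import math


def snr_spectrum(power, n_neighbours=200, n_skip_each_side=1):
    """Ratio of each bin's power to the mean power of its neighbouring bins.

    Bins too close to the edges of the spectrum to have a full set of
    neighbours are left as NaN.
    """
    half = n_neighbours // 2
    snr = [math.nan] * len(power)
    for i in range(len(power)):
        first = i - n_skip_each_side - half
        last = i + n_skip_each_side + half
        if first < 0 or last >= len(power):
            continue  # not enough neighbours on one side
        lower = power[first:i - n_skip_each_side]
        upper = power[i + n_skip_each_side + 1:last + 1]
        noise = (sum(lower) + sum(upper)) / (len(lower) + len(upper))
        snr[i] = power[i] / noise
    return snr


# Toy spectrum: flat background with a single peak five times larger.
toy_power = [1.0] * 1000
toy_power[500] = 5.0
toy_snr = snr_spectrum(toy_power)
```

With this definition, an SNR of 1 means that power at the frequency of interest does not differ from the surrounding background, which is why values above 1 are taken as evidence of a frequency-tagged response.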
Statistical analyses focused on the two frequencies of interest (FOI; i.e., the tritone presentation rate at 1.835 Hz and the tone rate at 5.505 Hz), separately for the Sleep nap opportunity and the Wake sessions. To assess differences in tone- and tritone-related SNR between RDM and STAT streams, statistical analyses were conducted using non-parametric Monte-Carlo estimates from the permutation method (10,000 permutations, paired comparisons, computed for two levels of significance: alpha = 0.05 and alpha = 0.025, unilateral) and cluster-based statistics (alpha cluster = 0.05 and 0.025, unilateral) to control for the multiple comparisons problem, as implemented in Fieldtrip100. Cluster-based analyses were performed using the neighbours definition template based on the triangulation method for Neuromag systems as provided by Fieldtrip (see template in Supplementary Material Fig. 9). Similar analyses were also conducted on the first harmonic of each FOI (see Supplementary Material Figs S3 and S4).
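The logic of a paired, one-sided Monte-Carlo permutation test can be sketched as follows in Python for a single sensor. This is a simplified illustration only: it uses the mean paired difference as test statistic and random sign flipping, and it does not implement the cluster-based correction across sensors performed with Fieldtrip in the study. The function name and the SNR values are hypothetical and are not data from the study.

```python
import random


def paired_permutation_pvalue(stat_values, rdm_values, n_permutations=10_000, seed=0):
    """One-sided p-value for mean(STAT - RDM) > 0 by random sign flipping.

    Under the null hypothesis the STAT and RDM labels are exchangeable
    within each participant, so the sign of each paired difference is
    flipped at random.
    """
    rng = random.Random(seed)
    diffs = [s - r for s, r in zip(stat_values, rdm_values)]
    observed = sum(diffs) / len(diffs)
    n_extreme = 0
    for _ in range(n_permutations):
        permuted = sum(d if rng.random() < 0.5 else -d for d in diffs) / len(diffs)
        if permuted >= observed:
            n_extreme += 1
    # Counting the observed statistic itself avoids a p-value of exactly 0.
    return (n_extreme + 1) / (n_permutations + 1)


# Hypothetical tritone-related SNR values for 11 participants.
snr_stat = [1.4, 1.6, 1.3, 1.5, 1.7, 1.2, 1.4, 1.6, 1.5, 1.3, 1.8]
snr_rdm = [1.0, 1.1, 0.9, 1.0, 1.2, 1.0, 0.9, 1.1, 1.0, 1.0, 1.1]
p_value = paired_permutation_pvalue(snr_stat, snr_rdm)
```

In the cluster-based extension, a statistic of this kind is computed at every sensor, neighbouring sensors exceeding a threshold are grouped into clusters, and the permutation distribution is built on the largest cluster-level statistic, which controls for multiple comparisons across sensors.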
Temporal evolution of tone and tritone frequency-tagged responses: In a second step, frequency-tagged analyses aimed at evidencing the temporal evolution of segmentation-related responses during the 5 minutes of exposure to the RDM and the STAT streams, as a function of the experimental group (NREM/Exposure vs. No NREM/No Exposure). For this purpose, MEG data were cropped into 1-min time windows (i.e. 61.04 seconds containing 112 cycles of the tritone frequency). FFT analyses similar to those described above were applied to each 1-min block (frequency resolution 0.0164 Hz). Fourier spectra values were successively averaged over two minutes to increase statistical power (i.e. minutes 1–2, 2–3, 3–4 and 4–5). Finally, SNR was calculated from FFT power as the ratio between the power at a given frequency and the mean power of its 40 neighbouring frequencies (skipping the two closest).
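The window durations and frequency resolutions quoted in this section follow directly from the stimulus timing, as the short Python check below illustrates. It assumes only the timing values given in the text (150 ms tones, 25 ms inter-stimulus interval, 20 ms blank after every third tone); the function names are hypothetical.

```python
TONE_MS = 150   # tone duration
ISI_MS = 25     # inter-stimulus interval
BLANK_MS = 20   # additional blank after every third tone

# One tritone cycle: three tones with their intervals plus one blank (545 ms).
tritone_period_ms = 3 * (TONE_MS + ISI_MS) + BLANK_MS


def window_duration_s(n_tritone_cycles):
    """Duration of an analysis window holding a whole number of tritone cycles."""
    return n_tritone_cycles * tritone_period_ms / 1000


def frequency_resolution_hz(n_tritone_cycles):
    """Spacing between FFT bins, i.e. the inverse of the window duration."""
    return 1 / window_duration_s(n_tritone_cycles)


one_minute_window = window_duration_s(112)   # "1-min" blocks
five_minute_window = window_duration_s(560)  # full "5-min" streams
```

Because each window contains a whole number of tritone cycles (and therefore of tone cycles), both frequencies of interest fall exactly on an FFT bin, which avoids spectral leakage at the tagged frequencies.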
Sensors of interest: We also determined sensors of interest (SOIs) for subsequent linear mixed model analyses (see next subsection). In the Wake (2nd) session, SOIs for tritone-related responses were selected as sensors in which tritone-related SNR was significantly above the value 1 (10,000 permutations, alpha = 0.05, uncorrected) at the end of the two 5-min STAT streams. All but one of those sensors also had an SNR > 1.25, i.e. a minimum of a 25% power increase as compared to neighbouring frequencies. For the tone-related responses, almost all sensors had SNR > 1. Thus, we selected SOIs as sensors in which averaged tone-SNR during minutes 1–2 of RDM streams was >5 (i.e. tone frequency power 5 times greater than in neighbouring frequencies). SNR values were averaged across SOIs for each FOI. In the Sleep nap opportunity (first) session, no sensor exhibited tritone-SNR > 1 at the end of STAT streams (ps > 0.05, uncorrected). Therefore, SOIs were selected as those identified during the subsequent Wake period. For the tone-related responses, SOIs were selected as sensors in which SNR was >2 during minutes 1–2 of RDM streams.
Linear mixed models: Linear mixed models (LMM) were used to model the temporal dynamics of the tone and tritone responses in SOIs, reflecting the gradual evolution of tone detection and segmentation (tritone detection) processes during the two sessions. We used the “lme4” and “lmerTest” packages in the R environment (https://www.r-project.org/). In the present study, LMM featured several advantages as compared to repeated measures ANOVA or linear regressions. First, LMM deal with heterogeneous covariance between the different levels of temporal evolution across successive recorded minutes within a stream (i.e. factor MINUTES). Indeed, it is typical in learning paradigms (and even more so in the present case since time slots were averaged) that the correlation differs between data collected at different time points (for instance, tritone-SNR in statistical streams during the Wake session was more correlated between minutes 2/3 and 3/4 [R = 0.6, p < 0.001] than between minutes 2/3 and 4/5 [R = 0.2, p = 0.2]). Similarly, LMM deal with unequal variance between different levels of the factor MINUTES. Again, larger SNR variance was expected (and observed) at the end of the exposure phase than at the beginning. Finally, LMM provide indications about the relationships between dependent measures and predictors (i.e., they indicate whether the relationship is categorical, linear, quadratic,…).
Sleep nap opportunity session: In the NREM/Exposure group, the first continuous 5-min RDM and STAT streams were selected for each participant. Model building was performed in two steps and separately for both tone- and tritone- responses. Fixed effects were the predictors MINUTES (1–2 vs. 2–3 vs. 3–4 vs. 4–5; categorical, continuous or quadratic) and STREAM type (STAT vs. RDM).
In the first step, the most appropriate model of the covariance structure of the data was selected. To do so, we defined all the possible combinations of the random effects accounting for the data (i.e., we specified random intercepts representing the correlation within participant, and random slopes of the MINUTES and STREAM predictors to account for the variation between participants in the effects). We used the most saturated model for the fixed effects (i.e., containing all the categorical predictors MINUTES and STREAM and the corresponding interactions) and the Residual Maximum Likelihood (REML) method, while varying the structure of the covariance (i.e., the specific combination of random effects). The most appropriate model for the covariance structure was selected based on the Akaike Information Criterion (AIC) corrected for small sample sizes. The AIC is a model selection criterion typically used for covariance model selection (and also fixed effect model selection) when the different models compared are not nested. This criterion measures the relative quality of statistical models for a given data set: the smaller the AIC, the more parsimonious and appropriate the model.
In the second step of model building, using the selected covariance structure identified in the first step and the Maximum Likelihood (ML) method, we selected the most appropriate model for the fixed effects. We derived all the possible fixed effect models, i.e., all the possible combinations of predictors STREAM and the categorical, continuous or quadratic predictor MINUTES. The model that best accounted for the data was chosen based on the AIC. Therefore, predictors and/or interactions terms not included in the final model did not add information that could improve model fitting.
Wake session: In the subsequent Wake session, model building was similarly performed in two steps and separately for the first and second five minutes streams for both tone- and tritone- responses. The between-subjects predictor GROUP (NREM/Exposure vs. No NREM/No Exposure) was added to the predictors MINUTES (1/2 to 4/5) and STREAM (STAT vs. RDM streams), and accounted for in all combinations as explained above. Incorporation of the GROUP factor aimed at determining whether exposure to the STAT auditory streams during the prior sleep opportunity in the NREM/Exposure group led to a faster detection/segmentation of the tritones during the Wake session, as compared to the No NREM/No Exposure group, which would indicate that subjects exposed to STAT streams during their sleep were sensitized to their underlying tritone structure.
Statistical analyses on behavioural performance at the 2AFC test were conducted using R (https://www.r-project.org/). A correct response was defined as the accurate recognition of a tritone being part of the exposed STAT stream. Total scores in percentage (/16*100) were computed for each participant. Since values were not normally distributed, performances of participants from both the No NREM/No Exposure and the NREM/Exposure groups were first pooled together and performance was tested against chance level (50%) using the non-parametric Wilcoxon rank sum test. In a second step, between-group differences were assessed using an independent Wilcoxon test between scores.
Bayesian analyses were computed where needed to provide additional characterization in the context of statistically null results. Bayesian analyses were computed using the free software JASP (Version 0.9) with default priors (JASP Team, 2018). By convention, a Bayes Factor (BF) > 3 is considered as substantial evidence for the alternative hypothesis (H1), BF values < 0.333 indicate substantial evidence for the null (H0), and BF values between 0.333 and 3 are inconclusive101.
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
Simon, C. W. & Emmons, W. H. Responses to material presented during various levels of sleep. Journal of Experimental Psychology 51, 89–97, https://doi.org/10.1037/h0043637 (1956).
Bruce, D. J., Evans, C. R., Fenwick, P. B. & Spencer, V. Effect of presenting novel verbal material during slow-wave sleep. Nature 225, 873–874 (1970).
Wood, J. M., Bootzin, R. R., Kihlstrom, J. F. & Schacter, D. L. Implicit and explicit memory for verbal information presented during sleep. Psychological Science 3, 236–239 (1992).
Tani, K. & Yoshii, N. Efficiency of verbal learning during sleep as related to the EEG pattern. Brain Res 17, 277–285 (1970).
Shimizu, A. et al. Memory retention of stimulations during REM and NREM stages of sleep. Electroencephalogr Clin Neurophysiol 43, 658–665 (1977).
Badia, P., Wesensten, N., Lammers, W., Culpepper, J. & Harsh, J. Responsiveness to olfactory stimuli presented in sleep. Physiol Behav 48, 87–90 (1990).
Bastuji, H., Perchet, C., Legrain, V., Montes, C. & Garcia-Larrea, L. Laser evoked responses to painful stimulation persist during sleep and predict subsequent arousals. Pain 137, 589–599, https://doi.org/10.1016/j.pain.2007.10.027 (2008).
Kakigi, R. et al. Sensory perception during sleep in humans: a magnetoencephalograhic study. Sleep Med 4, 493–507 (2003).
Sato, Y., Fukuoka, Y., Minamitani, H. & Honda, K. Sensory stimulation triggers spindles during sleep stage 2. Sleep 30, 511–518 (2007).
Atienza, M., Cantero, J. & Escera, C. Auditory information processing during human sleep as revealed by event-related brain potentials. Clin Neurophysiol 112, 2031–2045 (2001).
Bastuji, H., Perrin, F. & Garcia-Larrea, L. Semantic analysis of auditory input during sleep: studies with event related potentials. International Journal of Psychophysiology 46, 243–255 (2002).
Ibanez, A. M., Martin, R. S., Hurtado, E. & Lopez, V. ERPs studies of cognitive processing during sleep. Int J Psychol 44, 290–304, https://doi.org/10.1080/00207590802194234 (2009).
Dang-Vu, T. et al. Interplay between spontaneous and induced brain activity during human non-rapid eye movement sleep. Proc Natl Acad Sci USA 108, 15438–15443, https://doi.org/10.1073/pnas.1112503108 (2011).
Nashida, T. et al. Automatic auditory information processing in sleep. Sleep 23, 821–828 (2000).
Perrin, F., Bastuji, H. & Garcia-Larrea, L. Detection of verbal discordances during sleep. Neuroreport 13, 1345–1349 (2002).
Schabus, M. et al. The Fate of Incoming Stimuli during NREM Sleep is Determined by Spindles and the Phase of the Slow Oscillation. Front Neurol 3, 40, https://doi.org/10.3389/fneur.2012.00040 (2012).
Perrin, F., Garcia-Larrea, L., Mauguiere, F. & Bastuji, H. A differential brain response to the subject’s own name persists during sleep. Clin Neurophysiol 110, 2153–2164 (1999).
Daltrozzo, J., Claude, L., Tillmann, B., Bastuji, H. & Perrin, F. Working memory is partially preserved during sleep. PLoS One 7, e50997, https://doi.org/10.1371/journal.pone.0050997 (2012).
Strauss, M. et al. Disruption of hierarchical predictive coding during sleep. Proc Natl Acad Sci USA 112, E1353–1362, https://doi.org/10.1073/pnas.1501026112 (2015).
Kouider, S., Andrillon, T., Barbosa, L. S., Goupil, L. & Bekinschtein, T. A. Inducing task-relevant responses to speech in the sleeping brain. Curr Biol 24, 2208–2214, https://doi.org/10.1016/j.cub.2014.08.016 (2014).
Andrillon, T., Poulsen, A. T., Hansen, L. K., Leger, D. & Kouider, S. Neural Markers of Responsiveness to the Environment in Human Sleep. J Neurosci 36, 6583–6596, https://doi.org/10.1523/JNEUROSCI.0902-16.2016 (2016).
Farthouat, J. & Peigneux, P. in Analysis and Modeling of Coordinated Multi-neuronal Activity Vol. 12 Springer Series in Computational Neuroscience (ed Masami Tatsuno) Ch. 11, 225–243 (Springer New York, 2015).
Schouten, D. I., Pereira, S. I. R., Tops, M. & Louzada, F. M. State of the art on targeted memory reactivation: sleep your way to enhanced cognition. Sleep Medicine Reviews, https://doi.org/10.1016/j.smrv.2016.04.002 (2016).
Oudiette, D. & Paller, K. Upgrading the sleeping brain with targeted memory reactivation. Trends Cogn Sci 17, 142–149, https://doi.org/10.1016/j.tics.2013.01.006 (2013).
Arzi, A. et al. Humans can learn new information during sleep. Nat Neurosci 15, 1460–1465, https://doi.org/10.1038/nn.3193 (2012).
Arzi, A. et al. Olfactory aversive conditioning during sleep reduces cigarette-smoking behavior. J Neurosci 34, 15382–15393, https://doi.org/10.1523/JNEUROSCI.2291-14.2014 (2014).
Hennevin, E. & Hars, B. Second-order conditioning during sleep. Psychobiology 20, 166–176 (1992).
Andrillon, T., Pressnitzer, D., Leger, D. & Kouider, S. Formation and suppression of acoustic memories during human sleep. Nature communications 8, 179, https://doi.org/10.1038/s41467-017-00071-z (2017).
Cheour, M. et al. Speech sounds learned by sleeping newborns. Nature 415, 599–600, https://doi.org/10.1038/415599b (2002).
Teinonen, T., Fellman, V., Naatanen, R., Alku, P. & Huotilainen, M. Statistical language learning in neonates revealed by event-related brain potentials. BMC Neurosci 10, 21, https://doi.org/10.1186/1471-2202-10-21 (2009).
Nakano, T., Homae, F., Watanabe, H. & Taga, G. Anticipatory cortical activation precedes auditory events in sleeping infants. PLoS One 3, e3912, https://doi.org/10.1371/journal.pone.0003912 (2008).
Huber, R. & Born, J. Sleep, synaptic connectivity, and hippocampal memory during early development. Trends Cogn Sci 18, 141–152, https://doi.org/10.1016/j.tics.2013.12.005 (2014).
Martynova, O., Kirjavainen, J. & Cheour, M. Mismatch negativity and late discriminative negativity in sleeping human newborns. Neuroscience Letters 340, 75–78 (2003).
Cox, R., Korjoukov, I., de Boer, M. & Talamini, L. M. Sound asleep: processing and retention of slow oscillation phase-targeted stimuli. PLoS One 9, e101567, https://doi.org/10.1371/journal.pone.0101567 (2014).
Feld, G. B. & Diekelmann, S. Sleep smart-optimizing sleep for declarative learning and memory. Front Psychol 6, 622, https://doi.org/10.3389/fpsyg.2015.00622 (2015).
Saffran, J. R., Johnson, E. K., Aslin, R. N. & Newport, E. L. Statistical learning of tone sequences by human infants and adults. Cognition 70, 27–52 (1999).
Saffran, J. R., Aslin, R. N. & Newport, E. L. Statistical learning by 8-month-old infants. Science 274, 1926–1928 (1996).
Toro, J. M., Sinnett, S. & Soto-Faraco, S. Speech segmentation by statistical learning depends on attention. Cognition 97, B25–34, https://doi.org/10.1016/j.cognition.2005.01.006 (2005).
Buiatti, M., Pena, M. & Dehaene-Lambertz, G. Investigating the neural correlates of continuous speech computation with frequency-tagged neuroelectric responses. Neuroimage 44, 509–519, https://doi.org/10.1016/j.neuroimage.2008.09.015 (2009).
Farthouat, J. et al. Auditory Magnetoencephalographic Frequency-Tagged Responses Mirror the Ongoing Segmentation Processes Underlying Statistical Learning. Brain Topogr 30, 220–232, https://doi.org/10.1007/s10548-016-0518-y (2017).
Turk-Browne, N. B., Scholl, B. J., Chun, M. M. & Johnson, M. K. Neural evidence of statistical learning: efficient detection of visual regularities without awareness. J Cogn Neurosci 21, 1934–1945, https://doi.org/10.1162/jocn.2009.21131 (2009).
Batterink, L. J., Reber, P. J., Neville, H. J. & Paller, K. A. Implicit and explicit contributions to statistical learning. J Mem Lang 83, 62–78, https://doi.org/10.1016/j.jml.2015.04.004 (2015).
Nozaradan, S. Exploring how musical rhythm entrains brain activity with electroencephalogram frequency-tagging. Philos Trans R Soc Lond B Biol Sci 369, 20130393, https://doi.org/10.1098/rstb.2013.0393 (2014).
Rossion, B. & Boremanse, A. Robust sensitivity to facial identity in the right human occipito-temporal cortex as revealed by steady-state visual-evoked potentials. J Vis 11, https://doi.org/10.1167/11.2.16 (2011).
Sussman, E., Steinschneider, M., Gumenyuk, V., Grushko, J. & Lawson, K. The maturation of human evoked brain potentials to sounds presented at different stimulus rates. Hearing research 236, 61–79, https://doi.org/10.1016/j.heares.2007.12.001 (2008).
Teinonen, T. & Huotilainen, M. Implicit segmentation of a stream of syllables based on transitional probabilities: an MEG study. J Psycholinguist Res 41, 71–82, https://doi.org/10.1007/s10936-011-9182-2 (2012).
Iber, C., Ancoli-Israel, S., Chesson, A. & Quan, S. (ed The American Academy of Sleep Medicine) (2007).
Picton, T. W., John, M. S., Purcell, D. W. & Plourde, G. Human auditory steady-state responses: the effects of recording technique and state of arousal. Anesth Analg 97, 1396–1402 (2003).
Tlumak, A. I., Durrant, J. D., Delgado, R. E. & Boston, J. R. Steady-state analysis of auditory evoked potentials over a wide range of stimulus repetition rates in awake vs. natural sleep. Int J Audiol 51, 418–423, https://doi.org/10.3109/14992027.2011.645509 (2012).
Hennevin, E., Huetz, C. & Edeline, J. M. Neural representations during sleep: from sensory processing to memory traces. Neurobiol Learn Mem 87, 416–440 (2007).
Issa, E. & Wang, X. Sensory responses during sleep in primate primary and secondary auditory cortex. J Neurosci 28, 14467–14480, https://doi.org/10.1523/JNEUROSCI.3086-08.2008 (2008).
Nir, Y., Vyazovskiy, V. V., Cirelli, C., Banks, M. I. & Tononi, G. Auditory responses and stimulus-specific adaptation in rat auditory cortex are preserved across NREM and REM sleep. Cereb Cortex 25, 1362–1378, https://doi.org/10.1093/cercor/bht328 (2015).
Portas, C. M. et al. Auditory processing across the sleep-wake cycle: simultaneous EEG and fMRI monitoring in humans. Neuron 28, 991–999 (2000).
Issa, E. B. & Wang, X. Altered neural responses to sounds in primate primary auditory cortex during slow-wave sleep. J Neurosci 31, 2965–2973, https://doi.org/10.1523/JNEUROSCI.4920-10.2011 (2011).
Beh, H. C. & Barratt, P. E. H. Discrimination and conditioning during sleep as indicated by the electroencephalogram. Science 147, 1470–1471, https://doi.org/10.1126/science.147.3664.1470 (1965).
de Lavilleon, G., Lacroix, M. M., Rondi-Reig, L. & Benchenane, K. Explicit memory creation during sleep demonstrates a causal role of place cells in navigation. Nat Neurosci 18, 493–495, https://doi.org/10.1038/nn.3970 (2015).
Atas, A., Faivre, N., Timmermans, B., Cleeremans, A. & Kouider, S. Nonconscious learning from crowded sequences. Psychol Sci 25, 113–119, https://doi.org/10.1177/0956797613499591 (2014).
Agus, T. R., Thorpe, S. J. & Pressnitzer, D. Rapid formation of robust auditory memories: insights from noise. Neuron 66, 610–618, https://doi.org/10.1016/j.neuron.2010.04.014 (2010).
Makov, S. et al. Sleep Disrupts High-Level Speech Parsing Despite Significant Basic Auditory Processing. J Neurosci 37, 7772–7781, https://doi.org/10.1523/JNEUROSCI.0168-17.2017 (2017).
Perruchet, P. & Pacton, S. Implicit learning and statistical learning: one phenomenon, two approaches. Trends in Cognitive Sciences 10, 233–238 (2006).
Turk-Browne, N. B., Yi, D.-J. & Chun, M. M. Linking Implicit and Explicit Memory: Common Encoding Factors and Shared Representations. Neuron 49, 917–927 (2006).
Schapiro, A. C., Gregory, E., Landau, B., McCloskey, M. & Turk-Browne, N. B. The necessity of the medial temporal lobe for statistical learning. J Cogn Neurosci 26, 1736–1747, https://doi.org/10.1162/jocn_a_00578 (2014).
Buzsaki, G. Two-stage model of memory trace formation: a role for “noisy” brain states. Neuroscience 31, 551–570 (1989).
Rudoy, J., Voss, J., Westerberg, C. & Paller, K. Strengthening individual memories by reactivating them during sleep. Science 326, 1079, https://doi.org/10.1126/science.1179013 (2009).
Vinnik, E., Antopolskiy, S., Itskov, P. M. & Diamond, M. E. Auditory stimuli elicit hippocampal neuronal responses during sleep. Front Syst Neurosci 6, 49, https://doi.org/10.3389/fnsys.2012.00049 (2012).
Cunillera, T. et al. Time course and functional neuroanatomy of speech segmentation in adults. Neuroimage 48, 541–553, https://doi.org/10.1016/j.neuroimage.2009.06.069 (2009).
Karuza, E. A. et al. The neural correlates of statistical learning in a word segmentation task: An fMRI study. Brain Lang 127, 46–54, https://doi.org/10.1016/j.bandl.2012.11.007 (2013).
McNealy, K., Mazziotta, J. C. & Dapretto, M. Cracking the language code: neural mechanisms underlying speech parsing. J Neurosci 26, 7629–7639, https://doi.org/10.1523/JNEUROSCI.5501-05.2006 (2006).
Peigneux, P., Urbain, C. & Schmitz, R. (eds Colin Espie & Charles Morin) 11–37 (Oxford University Press, 2011).
Wilf, M. et al. Diminished Auditory Responses during NREM Sleep Correlate with the Hierarchy of Language Processing. PLoS One 11, e0157143, https://doi.org/10.1371/journal.pone.0157143 (2016).
Massimini, M. et al. Breakdown of cortical effective connectivity during sleep. Science 309, 2228–2232 (2005).
Cote, K. A., Epps, T. M. & Campbell, K. B. The role of the spindle in human information processing of high-intensity stimuli during sleep. J Sleep Res 9, 19–26 (2000).
Sela, Y., Vyazovskiy, V. V., Cirelli, C., Tononi, G. & Nir, Y. Responses in Rat Core Auditory Cortex are Preserved during Sleep Spindle Oscillations. Sleep 39, 1069–1082, https://doi.org/10.5665/sleep.5758 (2016).
Colrain, I. M. The K-complex: a 7-decade history. Sleep 28, 255–273 (2005).
Forget, D., Morin, C. M. & Bastien, C. H. The role of the spontaneous and evoked k-complex in good-sleeper controls and in individuals with insomnia. Sleep 34, 1251–1260, https://doi.org/10.5665/SLEEP.1250 (2011).
Schreiner, T., Lehmann, M. & Rasch, B. Auditory feedback blocks memory benefits of cueing during sleep. Nature communications 6, 8729, https://doi.org/10.1038/ncomms9729 (2015).
Farthouat, J., Gilson, M. & Peigneux, P. New evidence for the necessity of a silent plastic period during sleep for a memory benefit of targeted memory reactivation. Sleep Spindles & Cortical Up States 1, 14–26, https://doi.org/10.1556/2053.1.2016.002 (2017).
Rosanova, M. & Ulrich, D. Pattern-specific associative long-term potentiation induced by a sleep spindle-related spike train. J Neurosci 25, 9398–9405 (2005).
Born, J. & Wilhelm, I. System consolidation of memory during sleep. Psychol Res 76, 192–203, https://doi.org/10.1007/s00426-011-0335-6 (2012).
Cox, R., Hofman, W. & Talamini, L. Involvement of spindles in memory consolidation is slow wave sleep-specific. Learn Mem 19, 264–267, https://doi.org/10.1101/lm.026252.112 (2012).
Fogel, S. & Smith, C. The function of the sleep spindle: a physiological index of intelligence and a mechanism for sleep-dependent memory consolidation. Neurosci Biobehav Rev 35, 1154–1165, https://doi.org/10.1016/j.neubiorev.2010.12.003 (2011).
Gais, S., Molle, M., Helms, K. & Born, J. Learning-dependent increases in sleep spindle density. J Neurosci 22, 6830–6834 (2002).
Schabus, M. et al. Sleep spindles and their significance for declarative memory consolidation. Sleep 27, 1479–1485 (2004).
Tamminen, J., Lambon Ralph, M. A. & Lewis, P. A. The role of sleep spindles and slow-wave activity in integrating new information in semantic memory. J Neurosci 33, 15376–15381, https://doi.org/10.1523/JNEUROSCI.5093-12.2013 (2013).
Bertels, J., Franco, A. & Destrebecqz, A. How implicit is visual statistical learning? J Exp Psychol Learn Mem Cogn 38, 1425–1431, https://doi.org/10.1037/a0027210 (2012).
Boyer, M., Destrebecqz, A. & Cleeremans, A. Processing abstract sequence structure: learning without knowing, or knowing without learning? Psychol Res 69, 383–398, https://doi.org/10.1007/s00426-004-0207-4 (2005).
Todorovic, A., van Ede, F., Maris, E. & de Lange, F. P. Prior expectation mediates neural adaptation to repeated sounds in the auditory cortex: an MEG study. J Neurosci 31, 9118–9123, https://doi.org/10.1523/JNEUROSCI.1425-11.2011 (2011).
Oldfield, R. C. The assessment and analysis of handedness: the Edinburgh inventory. Neuropsychologia 9, 97–113 (1971).
Spielberger, C. D., Gorsuch, R. L., Lushene, R., Vagg, P. R. & Jacobs, G. A. Manual for the State-Trait Anxiety Inventory. (Consulting Psychologists Press, 1983).
Buysse, D. J., Reynolds, C. F. III, Monk, T. H., Berman, S. R. & Kupfer, D. J. The Pittsburgh Sleep Quality Index: a new instrument for psychiatric practice and research. Psychiatry Res 28, 193–213 (1989).
Horne, J. & Ostberg, O. A self-assessment questionnaire to determine morningness-eveningness in human circadian rhythms. Int J Chronobiol 4, 97–110 (1976).
Ellis, B. W. et al. The St. Mary’s Hospital sleep questionnaire: a study of reliability. Sleep 4, 93–97 (1981).
Brainard, D. H. The Psychophysics Toolbox. Spat Vis 10, 433–436 (1997).
Pena, M., Bonatti, L. L., Nespor, M. & Mehler, J. Signal-driven computations in speech processing. Science 298, 604–607, https://doi.org/10.1126/science.1072901 (2002).
Lee, K. A., Hicks, G. & Nino-Murcia, G. Validity and reliability of a scale to assess fatigue. Psychiatry Res 36, 291–298 (1991).
Akerstedt, T. & Gillberg, M. Subjective and objective sleepiness in the active individual. Int J Neurosci 52, 29–37 (1990).
Hofer-Tinguely, G. et al. Sleep inertia: performance changes after sleep, rest and active waking. Cognitive Brain Research 22, 323–331 (2005).
De Tiege, X. et al. Recording epileptic activity with MEG in a light-weight magnetic shield. Epilepsy Res 82, 227–231 (2008).
Taulu, S. & Simola, J. Spatiotemporal signal space separation method for rejecting nearby interference in MEG measurements. Phys Med Biol 51, 1759–1768, https://doi.org/10.1088/0031-9155/51/7/008 (2006).
Oostenveld, R., Fries, P., Maris, E. & Schoffelen, J. M. FieldTrip: Open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data. Comput Intell Neurosci 2011, 156869, https://doi.org/10.1155/2011/156869 (2011).
Dienes, Z. Bayesian Versus Orthodox Statistics: Which Side Are You On? Perspect Psychol Sci 6, 274–290 (2011).
The Belgian National Fund for Scientific Research (FRS-FNRS) funded JF, AA, WM and XdT, in full or in part, during the study. Experimental costs were supported by FRS-FNRS grant T.109.13 to PP.
The authors declare no competing interests.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Cite this article
Farthouat, J., Atas, A., Wens, V. et al. Lack of frequency-tagged magnetic responses suggests statistical regularities remain undetected during NREM sleep. Sci Rep 8, 11719 (2018). https://doi.org/10.1038/s41598-018-30105-5