Introduction

Sleep is widely assumed to support memory consolidation1,2,3,4. The beneficial function of sleep for memory has been proposed to rely on the reactivation of newly encoded memory traces during subsequent sleep. During a night of sleep, the brain undergoes different sleep stages that alternate in a cyclic manner. The different sleep stages may play distinct roles in memory reactivation and consolidation5. Human sleep is composed of rapid eye movement (REM) sleep and non-REM (NREM) sleep, which includes light sleep (stages S1 and S2) and deep sleep, i.e., slow wave sleep (SWS, stages S3 and S4). An increasing number of studies implicates mainly NREM sleep in the reactivation and consolidation of declarative memories, i.e., memory for facts and events6. NREM sleep is characterized by specific electrophysiological brain oscillations, particularly neocortical slow oscillations (SO, 0.5–1 Hz), thalamocortical spindles (9–15 Hz), and hippocampal ripples (80–200 Hz)7,8,9. The precise temporal coordination between these oscillations is assumed to constitute a crucial mechanism for memory consolidation10,11,12. SO provide a temporal frame for the occurrence of spindles mainly in the excitatory up-state of the SO, with ripples in turn being preferentially nested in the spindle troughs9. This temporal coupling orchestrates the reactivation of hippocampal memory representations to promote their redistribution and integration into neocortical networks for long-term storage1,13,14,15.

Although there is convincing evidence of the importance of these mechanisms for the reactivation and consolidation of memory, it is still a matter of debate whether the different sleep stages of NREM sleep, particularly S2 and SWS, play distinct roles in these processes1,16. SO are evident during both S2 and SWS, but to a much higher degree in SWS17. In fact, the amount of slow wave activity (SWA, 0.5–4 Hz) constitutes the main sleep feature to distinguish between S2 and SWS, with more than 20% of slow waves per sleep epoch indicating SWS according to standard criteria by Rechtschaffen & Kales18. A number of studies found the amount and intensity of SWS as well as of SWA and SO to be associated with better memory consolidation19,20,21,22. Sleep spindles are present during both S2 and SWS to a similar degree, and stage 2 spindle activity was also found to be related to memory consolidation23,24,25,26. Since S2 covers a higher percentage of the sleep night, some researchers pointed out the relevance of lighter sleep stage S2 for memory consolidation16,24. Others have argued that it is mainly the temporal coupling of spindles to SO that is essential for the reactivation of memories during sleep, and since there are more SO and more frequent SO-spindle coupling during SWS27, SWS could be assumed to be more important for the processing of memories than S2.

While memory reactivation occurs spontaneously in the sleeping brain, it can also be externally manipulated. Targeted Memory Reactivation (TMR) is a technique that allows the selective reactivation of specific memories during sleep. The basis of TMR is the association of sensory cues, such as sounds or odors, with learning content during the encoding phase. During subsequent sleep, the same cues are then presented again, which is assumed to trigger and/or facilitate endogenous memory reactivation mechanisms, thereby fostering memory consolidation28,29,30,31. Previous studies using TMR showed that the electrophysiological signatures of memory reactivation are characterized by a temporary increase in theta activity (4–8 Hz) as well as of spindle activity after re-exposure to the cues during NREM sleep32,33. Many previous studies on TMR during sleep presented reminder cues exclusively during SWS31,34,35,36,37,38. Others have applied TMR during the entire NREM sleep period32,39,40,41,42 or during the entire night irrespective of sleep stage43,44,45,46. According to a recent meta-analysis, TMR was effective during both S2 and SWS, although there were only very few studies for TMR applied only during S2 compared to a large number of studies on TMR during SWS or S2 + SWS28. Importantly, this meta-analysis did not differentiate between cueing that took place exclusively during SWS and cueing during both S2 + SWS.

In the present study, we directly compared TMR applied during S2 with TMR during SWS for declarative memory consolidation in the same participants. Based on the assumption that sleep-dependent memory reactivation strongly relies on the temporal coupling of spindles and ripples to SO, which is more pronounced during SWS, we hypothesized that TMR would be more effective during SWS than S2.

Methods

Participants

A total of 29 German native speakers took part in the study. Sample size was based on power analyses from our previous experiments using the same memory task37,47, and considering recommendations by a recent meta-analysis (n = 20 for within-subject designs)28. Nine participants were excluded from the final analysis due to an incorrect audio recording (3 subjects), an incomplete reactivation session (2 subjects), an unfinished task set due to technical issues (1 subject), or an extended period of wakefulness between the auditory cues (2 subjects) or during the night (1 subjects), resulting in 20 remaining subjects (Mage = 21.6 ± 0.5, 13 female). Participants reported having a regular sleep–wake cycle and not carrying out shift work at least 6 weeks before the experiments. They further did not have any history of neurological, psychiatric or endocrine disorder, did not take any medication at the time of the experiments, and were non-smokers. All participants had an adaptation night and gave informed consent before participation in the experiment. Participants received a financial compensation for their participation in the study. The study was approved by the ethics committee of the Medical Faculty of Tübingen University. All methods were performed in accordance with the regulations of the ethics committee of the Medical Faculty of Tübingen University as well as with the guidelines of the German Research Foundation (DFG) and the Declaration of Helsinki.

General procedure and design

The experiment started in the evening at ~ 21:30 h with placement of the electrodes for polysomnographic recording. The training session for the sound-syllable-word association memory task started at ~ 22:30 h (Fig. 1b). Directly afterwards, subjects slept 8 h with targeted memory reactivation during the first sleep cycle. In the next morning, subjects learned an interference task (starting 1 h after awakening). About 30 min after the end of interference learning they were tested for both tasks.

Figure 1
figure 1

Memory task and experimental design. (a) During training, 30 sound-syllable-word associations were presented (German words). For each association, the sound was presented first for 2.9 s and continued accompanied by the word written on the screen and spoken aloud for 1.5 s. After a 4 s break, the next association appeared. In the cued recall test, for each association, the sound was presented for 2.9 s and continued accompanied by the first syllable of the associated word for 1.5 s. Afterwards, a microphone appeared on the screen and participants had 5 s to say the word aloud (sound continued during the entire period). Independently of their answer, correct feedback was given with the word spoken and written on the screen for 1.5 s. During Targeted memory reactivation (TMR, Reactivation), each sound was first presented alone for an average of 2.9 s, then the syllable was played once with the sound continuing in the background for another 1.5 s. After a 7 s break, the next cue was presented (until each cue was presented once). In the interference task, subjects performed the same task as for the training session with the same sounds associated to new words. During testing, each sound was presented for 0.5 s and then subjects were asked to say the word aloud. (b) Training and the cued recall test took place in the evening (22:30 h) and subjects went to bed at ~ 23:00 h. TMR took place during the first sleep cycle, with half of the cues being presented in SWS and the other half in S2 (yellow and blue lines) in counterbalanced order. In the next morning, subjects learned an interference task after 1 h of waking up. Another 30 min later, they took part in the testing session.

Memory task

The task consisted of 30 associations between sounds and semantically related German words (for example, the sound of dropping coins associated with the word KASINO [casino]). Each sound had a duration of between 2,855 and 2,940 ms (on average 2.9 s). All words had three syllables and were pre-recorded by a female voice.

Training session: Each trial started with the sound presentation for 2.9 s, followed by the presentation of the associated word written on the screen and spoken aloud once via headphones, meanwhile the sound continued being played in the background for 1.5 s (see Fig. 1a). After 4 s break, the next association appeared. After all 30 associations were presented once, subjects performed a cued recall test to get an immediate measure of learning. For each association, subjects listened to the sound and the first syllable of the associated word, and were asked to say the complete word aloud. After 5 s, and independently of their answer, the sound plus the correct word was presented both written on the screen and aloud via headphones to provide correct feedback (1.5 s). Subjects that did not reach 40% correct responses (12 correct answers) during immediate recall were excluded from the analysis.

Interference task: The same sounds from the training session were associated with new words for the interference task. The words were also semantically related to the sounds and likewise consisted of three syllables, but with a different first syllable than the words of the original task. The procedure was exactly the same as for the training session.

Testing session: For each sound-syllable-word association, the sound was played for 0.5 s while a microphone´s image appeared on the screen notifying the subjects to say the associated word aloud. After a break of 4 s, the procedure continued with the next sound until all 30 associations were presented.

Targeted memory reactivation

Participants were asked to wear earphones just before going to sleep and to confirm they could hear white noise (43 dB), which was presented from going to bed until the reactivation session finished. Cueing was performed during the first hour of sleep: half of the reminder cues (i.e., sounds followed by the first syllable of the associated words, similar to the cued recall procedure) were presented in SWS and the other half in S2. SWS-cues and S2-cues were presented blockwise, i.e., either all SWS-cues first and then all S2-cues or vice versa. The sleep stage order was counterbalanced across subjects. Each sound (45 dB) was presented only once for 2.9 s and was then accompanied by the first syllable of the word for 1.5 s. After a 7 s interval, the next reminder cue was presented. Reactivation could be paused whenever signs of arousal or changes in sleep stage were detected, and restarted upon detection of the corresponding sleep stage.

Heard/not-heard task

Immediately after participants finished the last part of the testing session in the morning, they were asked to perform a control task. In this ‘Heard/not-heard task’ participants had to report whether they had heard any sounds or words while they were sleeping. The 30 sounds and corresponding first syllables were presented again via the same earphones used for TMR and subjects were asked to indicate for each sound-syllable association whether they had ‘Heard’ or ‘Not-Heard’ the association during sleep.

Sleep recording

Standard polysomnography was obtained including electroencephalographic (EEG), electromyographic (EMG), and electrooculographic (EOG) recordings with BrainAmp amplifiers (Brain Products, Munich, Germany). EEG was recorded from six scalp electrodes (F3, F4, C3, C4, P3 and P4 according to the International 10–20 System). Two electrodes on the left and right mastoids were used as combined reference and a ground electrode was placed on the forehead. Sampling rate was 200 Hz and raw data were recorded unfiltered. Different filters were applied subsequently for different purposes. A bandpass-filter between 0.16 and 35 Hz was applied for online visualization and online sleep stage detection. Polysomnographic recordings were scored offline as wake, stage 1, stage 2, stages 3 and 4 (SWS), and REM sleep according to standard criteria by Rechtschaffen & Kales18.

Time–frequency analyses and Evoked Response Potentials (ERPs)

EEG data was analyzed using custom-made codes in MATLAB 2020a (Mathworks) and the Fieldtrip toolbox55. Data was zero-phase filtered in the bands from 1 to 35 Hz (Butterfilter). Each reactivation event was cut into 12 s trials (− 7 to 5 s with t = 0 referring to the syllable cue onset). For one subject, reactivation markers were missing, thus leaving 19 subjects for the EEG analysis. From the remaining subjects, in total 15 trials with artifacts were excluded from the analysis. Trials were averaged per subject and finally a grand average for each condition was calculated (SWS-cueing vs. S2-cueing).

To evaluate the brain´s response to auditory cueing, evoked response potentials (ERP) were calculated aligned to the syllable cue onset and averaged for each condition. For the peak-to-peak amplitude calculation the temporal location of the negative and positive peaks were visually extracted from the grand average of each subject’s ERP. Time-frequency analyses were performed for each sleep stage separately. Power was calculated using a Morlet wavelet of 7 cycles every 0.005 seconds, relative to a baseline from -4 and -3 s (i.e., right before the sound onset). This processing resulted in the relative power of each frequency at each time point. ERP and time-frequency analyses were performed on central electrodes (i.e., average of C3 and C4).

SO, spindle and SO-Spindle coupling analyses

We further detected discrete slow oscillations (SO) and sleep spindles within three periods of interest: (1) during the cueing period (“React”), (2) a period immediately before the cueing period (“Pre”) and (3) a period immediately after the cueing period (“Post”). For those subjects, who had the entire cueing session without arousals and interruptions, the session took 2.85 min, i.e., ~3 min (for the 15 cues). Thus, we took 3 min as a standard and considered 3 min before the reactivation session and 3 min after the reactivation as pre-cueing and post-cueing periods, respectively. Since some subjects had interruptions in the cueing period, we only considered spindles and SO in those epochs with cueing markers for the cueing period.

The detection algorithm we used to identify SO and spindles is based on the methods of Mölle and collaborators56. Filters were applied over raw data for SO and spindle detection. A SO was detected when the peak-to-peak amplitude was larger than 75 µV in the filtered EEG (0.05-3.5 Hz). For fast spindle detection, the EEG was filtered between 12 and 15 Hz, then we calculated the root mean square (RMS) and applied a moving average of 200 ms. Using this smoothed RMS, a spindle was detected when this value was above the threshold of 1.5 standard deviation values from the mean, between 0.5 and 3 s. We also detected slow spindles (8-12 Hz), however, since there were no differences in any of the measures, we do not further report these findings and all reported analyses refer to fast spindles.

To examine the coupling of fast spindles to SO, we identified SO-spindle complexes for the three periods of interest defined above. We calculated the density of SO-spindle complexes (events/minute) during “Pre”, “React” and “Post” periods. SOs, fast spindles and SO-Spindle complexes density was calculated for the average of central, frontal and parietal channels. In the main text, we only report results of central channels for SO and SO-Spindle complexes, and parietal channels for spindles. However, analyses for the other channels show very similar patterns of findings (see Supplementary Information). Note that for the “Pre” and “Post” periods, all epochs were selected post-hoc as SWS in the SWS condition and S2 in the S2 condition based on the offline scoring. For the “React” periods, we found that 95.79 ± 15.27 % (mean ± SD) of cues were actually presented in S2 for the S2 condition, and 97.54 ± 5.45 % of cues were actually presented in SWS for the SWS condition, suggesting that there was very little deviation between online and offline scoring.

Statistics

Statistical analyses for behavioral and EEG analyses were performed in SPSS 28.0.1.1 and Matlab 2020a (Mathworks). Memory change (i.e., number of correct responses at testing minus the number of correct responses at training) was used as measure for memory performance. Statistical comparisons between SWS-cueing and S2-cueing conditions were done with paired-sample t-tests for memory change, learning measures, the interference task, and the different sleep parameters such as ERP amplitude. Additionally, a repeated-measures ANOVA with within-subjects factor “sleep stage” and between-subjects factor “order” was performed to check for the effect of the reactivation order (first reactivation in S2 vs. first reactivation in SWS).

For the EEG analyses, repeated-measures ANOVAs were applied to compare SO density, fast spindles and SO-spindle complexes between SWS and S2 across the different periods of interest (“Pre”, “React” and “Post”). In case of significant ANOVA effects, post-hoc t-Tests were calculated and Bonferroni correction for multiple comparisons was performed.

A value of P < 0.05 was considered significant. Effect size analyses were computed in Gpower 3.1. Bayesian analyses were performed in JASP 0.17.3.0 to formally test whether, in the absence of significant differences between conditions, the data support the null hypothesis.

Results

Participants learned 30 sound-syllable-word associations in the evening, with the sounds and words being semantically related (e.g., the sound of coins dropping plus the German word for ‘casino’)37,47 (Fig. 1a). During subsequent polysomnographically recorded sleep, auditory cues (sounds plus the first syllable of the words) were presented via earphones, with half of the cues being presented during SWS, and the other half during S2 (in counterbalanced order) during the first sleep cycle (Fig. 1b). Cued recall was tested in the next morning after learning of an interference task. Memory performance was assessed as ‘memory change’ from training to testing to control for individual differences at learning (i.e., correct responses at testing minus correct responses at training). 20 subjects were included in the final analyses (see “Methods” section for exclusion criteria).

Memory performance

To assess the differences in memory performance between cues presented in S2 and SWS, we performed a repeated-measures ANOVA with the within-subjects factor “sleep stage” (S2 vs. SWS) and the between-subjects factor “order” (first TMR in S2 vs. first TMR in SWS). Results revealed that memory performance was not more effective during SWS than S2 (Figure 2a). Performance was even slightly but non-significantly better for S2 cueing (main effect sleep stage: F1,18 = 4.05, P = 0.059). The order of conditions did not affect memory performance (main effect order: F1,18 = 0, P = 1; interaction effect: F1,18 = 2.45, P = 0.135). When comparing memory change directly between SWS and S2 (not taking order of conditions into account), similar results were obtained (P = 0.067, Table 1). The study yielded a statistical power of 0.83 for a one-tailed comparison (given our hypothesis of S2 < SWS), with an effect size of 0.6 and a sample size of 20, indicating a relatively high likelihood of detecting the specified effect in the analysis. Given the absence of significant differences between conditions, we formally tested whether our data support the null hypothesis of SWS not providing better cueing effects than S2. A Bayesian paired t-Test of our original hypothesis (i.e., S2 < SWS) using a standard prior (Cauchy distribution with scale factor 0.707) yielded strong evidence for the null model assuming equivalent performance between conditions (BF01 = 11.06; see Supplementary Figure S1). This was confirmed by robustness checks across different prior widths, as well as a sequential analysis across participants.

Figure 2
figure 2

Memory performance and changes in oscillatory activity upon cueing during SWS and S2. (a) Participants' performance was not significantly different for cues presented in S2 or SWS. Memory change: number of correct words at testing minus number of correct words at training. Means ± SEM (vertical bars) are shown. (b) Time–frequency representations (TFR) for cues presented during S2 and SWS, each with their corresponding event-related potential (ERP, black line). Color maps show power changes relative to a 1 s baseline right before sound onset (i.e. − 4 to − 3 s). TFR were aligned at time-point zero to the syllable cue onset. Lower panel indicates timeline for the sound (symbolized by musical notes) and the syllable cue (symbolized by speaker and ‘KA’ syllable). (c) Cues presented in SWS elicited larger ERP amplitudes than those in S2. Average ERP amplitudes for single subjects as well as the means ± SEM are shown. * P < 0.05.

Table 1 Memory measures.

Initial learning during the Training session differed unexpectedly for S2 and SWS cues (P = 0.025, Table 1), which was controlled for by calculating the memory change from Training to Testing. Performance for the Interference Task (i.e., same sounds associated to new words) was comparable between S2 and SWS cues for the Training session (P = 0.6), Testing session (P = 0.6) and for memory change (P = 0.4, Table 1).

To control for possible awareness of cue presentation during sleep, in the end of the experiment, subjects were presented with all reminder cues again and were asked to report whether they think that they have heard the cues during the night (‘Heard/not-heard task’). On average, they correctly recognized 6.2 ± 1.5 cues, with no difference between the SWS and S2 cueing conditions (P = 0.26).

Sleep analyses

Sleep scoring was performed according to standard criteria by Rechtschaffen & Kales18. All subjects included in the final analysis showed normal sleep patterns, with a total sleep time of 490.0 ± 3.1 min, and average time spent in the sleep stages (in min, mean ± SEM) of S1 (15.4 ± 2.1), S2 (290.4 ± 7.4), S3 (65.1 ± 5), S4 (13.8 ± 2.4), REM sleep (87.8 ± 6.3) and wake (19.9 ± 8.8).

To test for differences in sleep oscillatory activity when cues were presented during S2 and SWS, we performed time-frequency analyses upon cue presentation separately for S2 and SWS (Fig. 2b). First, we looked at the event-related potentials (ERP) upon the onset of the syllable cue, considering that the syllable (and not the sound) was found to be the relevant cue triggering memory reactivation in previous studies37,47. In both sleep stages, there was an elicited response upon syllable cue onset, however, the ERP amplitude was significantly higher when cues were presented in SWS compared to S2 (P = 0.012) (Figure 2c).

Next, we compared the density of slow oscillations (SOs), fast spindles, and SO-spindle complexes (i.e., SO and fast spindles occurring together) during three periods of interest: the entire period of cue presentation (“React”, 3 min for 15 cues), a pre-cueing period immediately before the cueing period (“Pre”, 3 min), and a post-cueing period immediately after the end of the cueing period (“Post”, 3 min). Repeated-measures ANOVAs (“sleep stage” x “Pre/React/Post”) revealed significant interaction effects for all three measures (all P < 0.002). SO density in frontal electrodes significantly increased from Pre to React (P < 0.001) and from Pre to Post for SWS periods (P < 0.001, Figure 3). Moreover, when comparing equivalent periods in S2 and SWS, SO density was significantly higher in SWS for all time windows (all P < 0.001). Spindle density in parietal electrodes was significantly higher during the Pre period in SWS than in S2 (P = 0.007), while a spindle density increase was observed from Pre to Post in S2 (P = 0.001). The density of SO-fast spindle complexes (as obtained in central electrodes) followed a similar pattern as SO density, with a higher density of SO-spindle complexes in SWS compared to S2 (all P < 0.001). However, during SWS, the density of SO-spindle complexes decreased from Pre to React (P < 0.001), and from Pre to Post (P < 0.001) (see Supplementary Table S1 for complete sleep oscillatory activity in central, frontal and parietal channels). There were no significant correlations between any of these measures and memory change. Bayesian correlations yielded moderate evidence for the null model (3 < BF01 < 4).

Figure 3
figure 3

Density of slow oscillations (SO), fast spindles and SO-Spindle complexes. Each bar indicates density calculated within a period of interest: “Pre”, a 3-min period immediately before the cueing period; “React”, during the 3-min cueing period; and “Post”, a 3-min period immediately after the cueing period. Yellow bars correspond to S2 and blue bars to SWS. SO density is shown for frontal electrodes, fast spindle density for parietal electrodes and SO-spindle complexes density is shown for central electrodes. Means ± SEM (vertical bars) are shown. Significances indicate results of Post-hoc t-tests corrected for multiple comparisons. ** P < 0.01, *** P < 0.001.

Discussion

In the present study, we compared the effectiveness of targeted memory reactivation (TMR) for the strengthening of declarative memories during the two main NREM sleep stages, S2 and SWS. Our results suggest that, at least for the time intervals analyzed between reactivation and testing in this study, TMR is not more effective during SWS, but both light and deep NREM sleep offer comparably good windows of opportunity for TMR. We explored reactivation-related brain activity for each sleep stage, confirming the expected electrophysiological responses upon cueing during NREM sleep. Furthermore, we observed higher ERP amplitudes for the cue-evoked responses during SWS compared to S2. Reactivation during SWS induced increases in slow oscillation (SO) density and decreases in the density of SO-fast spindle complexes during the reactivation period and thereafter. Reactivation during S2, on the other hand, induced increases in fast spindle density when comparing periods before and after reactivation. Overall, the density of SO and SO-fast spindle complexes was higher during SWS when compared to equivalent periods in S2. Interestingly, the reactivation-induced changes in oscillatory patterns were not correlated with memory performance.

The finding that TMR during SWS is not better than during S2 was in contrast with our hypothesis; we expected higher memory performance when TMR was performed during SWS than during S2. The findings suggest that, overall, subjects showed forgetting of the sound-syllable-word associations, and this was consistent for cueing during S2 and SWS. Previous studies argued that declarative memory consolidation is associated with SWS and processes characteristic of SWS, such as SO as well as the coupling between SO and spindles7,19,20,21,48. However, others have argued that lighter sleep stage S2 may be more important for active processes of memory reactivation than SWS16. Our results speak in favour of the importance of both NREM sleep stages, at least concerning externally triggered reactivation processes in the time frame examined here. Although any inference with regard to spontaneous reactivation during sleep remains tentative, we conclude that the brain seems to be equally susceptible for the induction of reactivation processes triggered by external memory cues in SWS and S2. This conclusion is in line with recent findings showing no differences in memory performance when TMR was performed during S2 and SWS for a vocabulary task49.

In the present study, we even observed descriptively stronger effects of TMR during S2, which should be explored in future research. Although we cannot say whether TMR may be more effective during S2, our results provide convincing evidence against our original hypothesis of stronger TMR effects in SWS. Effect size analyses together with Bayesian statistics indicate a relatively high likelihood of detecting the specified effect in the analyses and provide positive evidence for the null results.

Inspection of EEG activity for cueing during SWS and S2 confirmed the expected electrophysiological responses upon TMR during NREM sleep32,50,51. In both sleep stages, time frequency representations showed that there was an evoked response upon the presentation of the cue syllable characterized by a negative peak with power increases in theta/slow spindle frequencies and a positive peak with increased power in the fast spindle band. These findings suggest that cues were processed in the brain in both sleep stages as observed in previous studies32,33. Interestingly, the ERP amplitude was significantly higher during SWS than S2, although it is unclear whether higher ERP amplitudes reflect better responses towards cueing during sleep. There are findings showing that the amplitude of SO, akin to ERPs, is generally higher during SWS relative to S217. However, in the present study, we did not find an association between ERP amplitude and memory performance. This question should be subject to further investigation.

When TMR was applied during SWS, we observed an increase in SO density during the cueing period as well as shortly after the cueing period, when compared to a pre-cueing period of equivalent length. SO are the main feature of SWS, with about four times as many SO during SWS than during S217. Interestingly, this pattern of increase from pre to post cueing was not observed for S2, despite the presence of ERPs. On the other hand, when looking at the density of fast spindles during the same periods, we found a significant increase from pre to post cueing during S2 but not in SWS. These observations are in line with previous studies mainly implying SO for the effects of SWS19,20,21,22 and sleep spindles for the effects of S2 for declarative memory consolidation23,24,25,26. Surprisingly, the temporal coupling between SO and spindles, which is assumed to be functionally related to memory consolidation during sleep13,14,27,52, decreased from before to after the cueing period in SWS. This finding may explain the lack of stronger cueing effects during SWS compared to S2. However, considering that none of the observed changes in electrophysiological responses were associated with memory performance, the functional relevance of these responses remains to be elucidated.

Although we did not observe any differences between cueing during SWS and S2 in the present study, we would like to point out that we cannot make any conclusions with regard to the effectiveness of TMR per se. We only compared TMR in different sleep stages, but we did not include a condition without reactivation. The most likely explanation of our findings is that TMR was effective (as would be expected from our previous studies using the same paradigm37,47), with equal effectiveness for TMR during SWS and S2. It could be speculated that differential mechanisms support TMR during SWS and S2, resulting in the same behavioral outcome. However, considering that we did not observe any correlations between any of the analyzed sleep parameters and memory performance, other parameters may be relevant here (especially for memory reactivation during S2). Alternatively, it is also possible that, unexpectedly, there was no TMR effect in the present study at all. Although this possibility cannot be ruled out due to the lack of a no reactivation condition, two previous studies from our group observed robust TMR effects with the same paradigm37,47, rendering this explanation unlikely. Future studies should take potential mechanistic differences between reactivation in SWS and S2 into account and examine this question more systematically.

Our study has some limitations. First, the memory task and the reactivation protocol were mainly designed to test our behavioral hypothesis but were not ideally suited for EEG analyses. We used a low number of reactivation cues, i.e., just one repetition of each of the 30 associations, based on the positive results from our previous TMR studies using the same behavioral paradigm37,47. It is an open question in TMR research whether the number of cue repetitions during sleep is associated with reactivation success. Most TMR studies have presented multiple repetitions of each reminder cue32,53, with some of these studies observing improved memory performance while others found no effects54. Future studies should systematically explore the number of cue repetitions needed for successful TMR. Second, the sound-syllable cues of 3.5 s length were played upon detection of stable S2 or SWS. This means that TMR was not time-locked to endogenously detected SO, neither to a specific phase of the SO. The length of the cues was also longer than an average SO, in some cases leading to more than one evoked response per trial. Additionally, the cues included both verbal information and non-verbal acoustic stimuli and it is unclear whether the sound and the syllable cue elicited differential cortical processes. Another limitation is related to the fact that our SO analyses, i.e., SO density and SO-spindle complexes, resulted in low numbers or even values of zero during S2 in some cases, due to the low number of SO in S2 for the short reactivation periods. Moreover, we observed differences in learning levels for cues to be presented in SWS vs. S2. Although we corrected for this difference by calculating the memory change from training to testing, the findings should be interpreted with caution.