- 1Department of Psychological Sciences, University of Connecticut, Storrs, CT, United States
- 2Department of Biomedical Engineering, University of Connecticut, Storrs, CT, United States
The role of slow oscillations and spindles during sleep on memory retention has become an area of great interest in the recent decade. Accordingly, there are multiple studies that examine the efficacy of acoustic stimulation during sleep to facilitate slow oscillations and associated memory retention. Here, we run meta-analyses on a current set of 14 studies that use audible noise-burst sound stimulation to modulate overnight retention of word pairs (kS = 12 studies, kES = 14 effect sizes, n = 206 subjects). Our meta-analyses demonstrate a steady, yearly decline in effect size that accounts for 91.8% of the heterogeneity between studies. We find that the predicted effect on memory retention in 2013 favored the acoustic stimulation condition at dδ = 0.99 (95% CI [0.49, 1.49]), while the predicted effect in 2021 declined to a moderate and significant effect favoring no acoustic stimulation at dδ = −0.39 (95% CI [−0.73, −0.05]). Our meta-regression model finds no coded study-level characteristics could account for the decline in effect sizes over time other than the publication date alone. Using available data, we estimate that 34% of subjects are not actually blind to the acoustic stimulation condition due to hearing acoustic stimulation during sleep. In addition, we find that the test-retest reliability of memory retention scores is nearly zero (ρd = 0.01, 95% CI [−0.18, 0.21]), and through simulation demonstrate the impact this has on statistical power and observed effect sizes. Based on our analyses, we discuss the need for larger sample sizes, true placebo controls, age range restrictions, open-data sharing, and improvements in the reliability of memory retention tasks.
1. Introduction
Converging lines of research support the theory that synchronous slow waves, spindles, and ripple oscillations and their coupling reflect the memory consolidation process that happens during sleep. In general, performance on declarative (verbal, event, and place) and non-declarative (sensory and motor skill) memory tasks improves following sleep (Barrett and Ekstrand, 1972; Plihal and Born, 1997; Mednick et al., 2003, 2011; Ellenbogen et al., 2007; Nishida and Walker, 2007; Miyamoto et al., 2017). An increase in the density of synchronous slow cortical oscillations (0.5–1.0 Hz) and thalamic spindles (12–15 Hz) during sleep is correlated with performance on various memory tasks following sleep (De Gennaro and Ferrara, 2003; Fogel and Smith, 2011; Niknazar et al., 2015; Cowan et al., 2020). The density of slow waves is highest during stages 2 and 3 of non-rapid eye movement (NREM) sleep (Stokes et al., 2022). In contrast, the density of fast spindle frequency oscillations is highest during stage 2 sleep when spindles are phase coupled with slow waves creating temporally coherent k-complex events (De Gennaro and Ferrara, 2003; Fogel and Smith, 2011; Stokes et al., 2022). Intracranially, slow waves arise from alternating phases of high and low spike-rate output from cortical neurons (Massimini et al., 2004; Steriade, 2006; Sanchez-Vives, 2020). In contrast, spindles are driven by thalamocortical circuits (Steriade, 2006). Thalamocortical spindles are phase coupled to the positive phase of cortical slow waves (Steriade, 2006; Diekelmann and Born, 2010). This coupling between slow waves and spindles positively correlates with improved sleep-dependent memory over the course of development and in adulthood (Hahn et al., 2020; Kurz et al., 2020). Computational models support a physiological mechanism where cortical output during the negative phase of the slow wave suppresses thalamic spindles and release from suppression in transition to the positive phase of slow waves promotes synchronous spindles (Mak-McCully et al., 2014). At the synaptic level, slow waves and spindles are thought to have complementary effects on synaptic plasticity which ultimately improves the signal-to-noise ratio of synapses and networks mediating memory (Tononi and Cirelli, 2006; Miyamoto et al., 2017). Spindles in turn are coupled with faster synchronous ripple oscillations (80–250 Hz) generated by hippocampal networks during non-declarative memory acquisition and during sleep (Chrobak and Buzsáki, 1998; Diekelmann and Born, 2010; Miyamoto et al., 2017) The increase in coupled activity between recurrently connected hippocampal and neocortical brain networks during sleep is thought to mediate the transfer or consolidation of memories. Accordingly, phase-coupling of slow wave, spindle, and ripple oscillations across cortical, thalamic, and hippocampal networks all are considered biomarkers of memory consolidation (Isomura et al., 2006; Staresina et al., 2015; Rothschild et al., 2017; Navarrete et al., 2020).
There is a growing interest in developing effective approaches to enhance slow oscillations and their coupling with spindles to improve memory consolidation in various clinical populations. Populations with altered spindle activity include schizophrenia (Wamsley et al., 2012; Stokes et al., 2022), Alzheimer's disease (De Gennaro and Ferrara, 2003; Rauchs et al., 2008; Weng et al., 2020), Autism Spectrum Disorders (Limoges et al., 2005; Tessier et al., 2015), sleep disorders (Leong et al., 2022) as well as natural aging (Martin et al., 2013; Mander et al., 2017; Purcell et al., 2017; Helfrich et al., 2018; Djonlagic et al., 2021). Historically, drugs have been developed to facilitate sleep physiology and memory. Multiple studies find the sleep aid, zolpidem, increases the density and power of sleep spindles (Dijk et al., 2010; Lundahl et al., 2012; Mednick et al., 2013; Niknazar et al., 2015; Zhang et al., 2020) and the coupling of slow wave and spindle oscillations (Niknazar et al., 2015; Zhang et al., 2020; Leong et al., 2022). Moreover, zolpidem strengthens hippocampal-prefrontal network coupling (Kersanté et al., 2023) and is correlated with improved memory task performance on the following day (Niknazar et al., 2015; Zhang et al., 2020; Leong et al., 2022). Unfortunately, zolpidem also enhances negative emotional memory consolidation and a meta-study finds zolpidem use is associated with increased suicide rates (Simon et al., 2021; Khan et al., 2022). Thus, zolpidem may only be an effective approach for enhancing slow waves, spindles, and memory in certain subpopulations. Multiple non-pharmacological approaches are under investigation including non-phase-locked or feedback-controlled transcranial electrical stimulation to increase slow wave oscillation amplitudes and improve overnight memory consolidation (Marshall et al., 2006; Lustenberger et al., 2016). Another approach is acoustic stimulation phase-locked to slow oscillations which enhances slow wave and spindle oscillations during sleep and can improve memory consolidation (Ngo et al., 2013b). Finally, a variation of the latter approach is to play sound as a contextual cue while people are learning and then play the same sound for “targeted memory reactivation” and consolidation during sleep (Cairney et al., 2016; Hu et al., 2020).
Over the past 10 years, fourteen studies have examined the efficacy of using acoustic stimulation during sleep to enhance slow wave and spindle oscillations and memory. One of the first studies to examine both the physiological and behavioral effects of overnight acoustic stimulation is a seminal study by Ngo et al. (2013b). This study plays sequences of two pink noise sound bursts phase-locked to the positive phase of an ongoing slow oscillation (SO) during the first 2 h of overnight NREM (stage 2, 3) sleep and measures how this impacts performance on a declarative word-pair memory task the next day (Ngo et al., 2013b). Here, the inter-stimulus interval between two noise bursts is set to each individual subject's average slow wave interval. Two outcome measures include the physiologic metric of SO amplitude and the behavioral metric of performance on a memory task. Positive outcomes include a significant increase in SO and spindle amplitudes and improved performance on a word-pair memory task with acoustic stimulation vs. control condition without stimulation (Ngo et al., 2013b). In this initial study, there is a substantial effect size (d = 1.08) supporting this non-pharmacological approach for improving verbal memory performance. In the past 10 years, seven clinical studies in total have examined the effects of phase-locked acoustic stimulation during sleep on word-pair memory in young adults or children (Ngo et al., 2013b, 2015; Ong et al., 2016; Leminen et al., 2017; Prehn-Kristensen et al., 2020). Five out of these seven studies find phase-locked acoustic stimulation during sleep increases SO amplitudes and improves performance on word-pair memory tasks in children (Prehn-Kristensen et al., 2020) and young adults that do not have diagnosed clinical conditions (Ngo et al., 2013b, 2015; Ong et al., 2016; Leminen et al., 2017). Two of the seven studies examining phase-locked acoustic stimulation effects on adult populations do not find significant improvements in word-pair memory (Henin et al., 2019; Schneider et al., 2020; Harrington et al., 2021). Similar to some prior studies (Henin et al., 2019; Diep et al., 2020; Prehn-Kristensen et al., 2020), the 2021 study by Harrington and colleagues uses a different type of word-pair memory test which requires subjects to remember unfamiliar word pairs. Based on this, they suggest that overnight SO phase-locked acoustic stimulation is more effective for consolidation (or reconsolidation) of familiar word pairs (Harrington et al., 2021). Similarly, other studies find phase-locked acoustic stimulation that significantly increases slow wave and spindle amplitudes does not improve performance on spatial memory (Henin et al., 2019), visual object and facial memory, or non-declarative finger tapping procedural memory tasks (Leminen et al., 2017). Given that different types of memory engage distinct brain areas and neuronal networks, it is possible that the current acoustic stimulation design is optimal for certain types of memory consolidation. To explore this possibility, Henin et al. (2019) set out to compare how phase-locked acoustic stimulation impacts both word-pair and spatial memory tasks using the same word-pair memory task employed by prior studies including that of Ngo and colleagues. Surprisingly, the Henin et al. (2019) study did not find a significant increase in performance with congruent or incongruent word-pairs in spite of observing significant increases in SO and spindle amplitudes for both. Collectively, there are five studies that support a phase-locked acoustic stimulation approach for enhancing declarative verbal memory but there remains considerable variability across all the studies completed to date.
Two recent meta-analytic studies (Wunderlin et al., 2021; Stanyer et al., 2022) examine potential study design moderators that could account for variability in effects across studies using the acoustic stimulation approach to enhance memory consolidations. Collectively, these meta-analytic studies consider age, phase-locking, and the type of memory task as potential contributors to variations in the efficacy of acoustic stimulation on memory. It is challenging to attribute the low effect size across studies to age alone because there are only three out of 14 current studies using older cohorts. For example, Papalambros et al. (2017) find a cohort of older adults (mean age 75.2 years) benefits from phase-locked acoustic stimulation showing improved verbal memory and substantial effect size (d = 0.63) on the order of that found with younger cohorts (Ngo et al., 2013b). In contrast, Schneider et al. (2020) find a cohort of older adults (mean age = 54.6 years) does not show enhanced memory performance following phase-locked acoustic stimulation. A post-hoc comparison by Schneider and colleagues finds that with or without acoustic stimulation SO amplitudes are smaller for older (mean age = 54.6 years) vs. younger (age 24.2 years) cohorts. Based on this result, Schneider and colleagues suggest the physiological capacity to generate and augment slow-wave oscillations in older populations may be reduced. The use of phase-locked vs. non-phase-locked acoustic stimulation is another potential contributor. However, there are not many studies probing memory effects with non-phase-locked stimulation or with older age cohorts. Indeed, Wunderlin and colleagues consider the combination of age and phase-locking as potential moderators for study outcomes. Accordingly, when combining studies using phase-locked acoustic stimulation and younger cohorts, the memory performance effect size increases almost two-fold larger than the overall mean (d = 0.25, 95% CI [−0.02, 0.53 ] to d = 0.44, 95% CI [0.09, 0.79]). This suggests that phase-locked acoustic stimulation in younger adults could be an effective tool to modulate memory retention. Toward this end, Wunderlin and colleagues compare portable systems that allow for multiple nights of electroencephalogram (EEG) recordings and phase-locked acoustic stimulation delivery with the goal of collecting quality data over multiple nights to strengthen future studies (Zeller et al., 2023). A meta-analysis by Stanyer and colleagues reports a memory performance effect size (d = 0.68, 95% CI [0.06, 1.30]) that is almost three-fold larger than reported for a similar set of studies in the meta-analysis by Wunderlin and colleagues. Nevertheless, Stanyer and colleagues also recommend future studies should include repeated measures and larger sample sizes to reduce between-study heterogeneity and help the future development of this novel approach. Collectively, these recent meta-analyses point toward potential ways to optimize future acoustic stimulation studies designed to improve memory and other cognitive operations.
In this review, we perform a meta-analysis of fourteen clinical studies completed between 2013 and 2023 that deliver acoustic stimulation during NREM sleep and measure the effects on slow wave amplitude and memory task performance. Specifically, we aimed to address the following four research questions:
R1. What is the average effect of acoustic stimulation during slow-wave sleep on sleep-dependent changes in word-pair retention?
R2. Does the effect size change with subsequent replications?
R3. If there are changes in the reported effects in subsequent replications, is this attributable to any codable study characteristics (e.g., mean age of sample, semantic congruence of word pairs, blinding procedures, change in SO power)?
R4. Are word-pair cued-recall tasks reliable enough to demonstrate changes in memory retention if they are present?
In order to address these questions, first we reproduce and extend prior meta-analyses by including three additional datasets from two new publications. We confirm the positive memory effects reported previously by Wunderlin et al. (2021) when including the same subset of studies using phase-locked acoustic stimulation and non-elderly adults. Upon including two new studies, one of which includes children and a second that includes an incongruent word-pair task, we find a significant downward trend in the effects of phase-locked stimulation on memory performance between 2013 and 2023. We run a meta-regression model examining eight potential moderators driving this downward trend including age, gender, phase-locked condition, properties of memory tasks (word count and word congruence), sleep condition (overnight vs. nap), double-blind procedures, and physiological increase in slow waves. None of these study-level characteristics account for this trend. In addition, difference scores are known to be highly unreliable in cognitive performance assessments, including the Stroop test (Hedge et al., 2018). Here, we examine the test-retest reliability of the difference scores (pre-sleep vs. post-sleep) used in all these studies to measure changes in memory retention. We find that these difference scores have low reliability and a strong potential to decrease statistical power. Accordingly, we hypothesize that unreliable memory task performance difference scores likely contribute a large degree of heterogeneity across studies testing acoustic stimulus effects on memory. Finally, we discuss how the memory task score, statistical approach, sample size, and other study design features may be improved to strengthen future research investigating the efficacy of acoustic stimulation to boost slow waves, spindle oscillations, and improve memory consolidation.
2. Methods and materials
2.1. Study selection
The goal of the current meta-analysis is to confirm and extend the work of two previous meta-analytic reviews (Wunderlin et al., 2021; Stanyer et al., 2022). Here, we combine studies included in these two meta-analyses (Figure 1A). Additionally, we compare the statistical measures across our study and the two prior meta-analyses and confirm a high correspondence with Wunderlin and colleagues (Figure 1B) but less so with Stanyer and colleagues (Table 1 and Figure 1C). To expand on these prior studies, we conducted a supplemental google scholar search of the last 3 years (2020–2023), using the following key-word search: (“acoustic stimulation” OR “auditory stimulation”) AND (“slow wave sleep” OR “slow oscillations”) AND (“memory” OR “consolidation”). The first 50 most relevant results are given a full-text screening for inclusion. This results in two relevant articles (Prehn-Kristensen et al., 2020; Harrington et al., 2021). Harrington et al. (2021) is likely excluded from previous meta-analyses (Wunderlin et al., 2021; Stanyer et al., 2022) as a consequence of being published after literature review. Prehn-Kristensen et al. (2020) may have been excluded as it compares subjects with and without attention deficit hyperactivity disorder (ADHD). Here, we extract the necessary effect size for the healthy control subjects for both the rewarded and non-rewarded conditions, which are included in the present meta-analysis. Additionally, we searched for studies that had cited the already-included articles and found no additional studies. In sum, we obtained kS = 12 studies (kES = 14 effect sizes), and N = 206 healthy subjects of variable age (Supplementary Figure 1).
Figure 1. Standardized mean differences in the current meta-analysis and previous meta-analyses. (A) All effect sizes in the current meta-analysis estimated using Glass' estimator. (B, C) Standardized mean differences (SMD) reported from previous meta-analyses (Wunderlin et al., 2021; Stanyer et al., 2022) plotted against reproduced estimates using their respective methodologies. Purple lines indicate the mean of the reproduced and reported SMD where the intersection demonstrates how they diverge from equality.
Table 1. Standardized mean difference estimates and reproduced estimates from previous meta-analyses.
2.2. Experimental study designs
Key study design features were aligned across all studies included. First, all studies included examined the efficacy of noise-burst acoustic stimulation during slow-wave sleep on overnight memory retention in the form of a word-pair cued recall task. Two experimental sessions consisted of a SHAM condition (control, no acoustic stimulation) and a STIM condition (with acoustic stimulation) during sleep (nap or overnight). The studies all had repeated measures and cross-over designs. That is, the same subjects participated in the control (SHAM, no audio) and acoustic stimulation (STIM) conditions. Ordering of STIM and SHAM conditions was appropriately counterbalanced in all studies. Word pairs were shown to participants, and they were subsequently assessed on their immediate recall prior to the sleep session. After waking, memory was assessed on the same word pairs. Memory retention scores were the difference in recall accuracy between pre-sleep and post-sleep assessments in all studies included. Memory retention scores were then compared between STIM and SHAM conditions.
2.3. Data collection
Data included in this meta-analysis was obtained from each publication or by contacting the corresponding authors. Relevant study characteristics were recorded (see Table 2) alongside effect size information. Fortunately, we were able to collect all characteristics of each moderator of each study. Most coded characteristics were directly reported in the respective manuscripts. However, slow oscillation power was not, therefore we calculated the standardized mean difference (SMD) in slow oscillation power/amplitude between STIM and SHAM. To quantify the effect of acoustic stimulation on memory retention, the standardized mean differences in word-pair retention scores between STIM and SHAM conditions were recorded. To obtain repeated measures effect sizes, correlation coefficients must be calculated between pre-sleep and post-sleep measurements. Since most studies did not report pre-sleep vs. post-sleep correlations, corresponding authors were contacted in order to acquire raw data, or data was extracted from figures containing sufficient information using WebPlotDigitizer. We were able to obtain raw data sets for seven independent samples from five studies (Ngo et al., 2013b, 2015; Weigenand et al., 2016; Henin et al., 2019; Schneider et al., 2020). Two samples were included from Ngo et al. (2015), but the second experiment had no SHAM condition therefore this sample is not used in the effect size calculations, however it is included in the reliability section of the meta-analysis. Since each study implemented a crossover design, we were able to estimate reliability coefficients for each of the seven raw data sets. Additionally, five studies reported the proportion of subjects that were aware of the stimulus at some point during the STIM condition (Ngo et al., 2013b, 2015; Weigenand et al., 2016; Prehn-Kristensen et al., 2020; Schneider et al., 2020). This proportion of subjects aware of the acoustic stimulus at any point was extracted from these five studies to examine the possibility that subjects were potentially not entirely blind to study conditions due to the no audio vs. acoustic stimulation designs employed by all (Figure 2C).
Figure 2. Effect decline and heterogeneity. (A) Meta-regression for publication date to observed effects across studies. Data-labels indicate first author of respective publication. (B) Results of a leave-one-out cross-validation (leave-one-out-cross-validation) where each regression coefficient is the fitted slope estimate when the corresponding study is removed. Note the resulting regression coefficients are similar indicating that no one study has major influence on the regression slope parameter. (C) The proportion of people reported hearing the auditory stimuli during the night when it was truly the STIM condition.
2.4. Effect-size calculation
Previous meta-analyses reported different formulations of standardized mean differences (SMD) in word-pair retention scores between STIM and SHAM conditions. Wunderlin et al. (2021) used a repeated measures estimator (drm), whereas Stanyer et al. (2022) presumably used an average variance estimator (dav). Our meta-analysis finds that variances in STIM and SHAM conditions violate the assumption of equal variances. This is evidenced by the average (n-weighted) variance ratio of 1.38 (). This indicates that the STIM condition has a 38% higher variance than the SHAM condition. Therefore, we use the Glass' estimator (dδ) to calculate standardized mean difference (SMD) because it does not assume equal variances. An in-depth description of various effect-size calculations can be seen in Lakens (2013). All the estimators, drm, dav, and dδ only differ in how they are standardized. Thus, they all take the form of:
Where J(n) is the small sample correction factor, , and Msham and Mstim correspond to the mean of the SHAM and STIM condition, respectively. The differences between estimators are in their calculation of the standardizer, S*, such that:
Where Sstim and Ssham indicates the standard deviation for STIM and SHAM, respectively. The associated standard errors (se) for each SMD estimator are as follows:
Where n indicates the sample size. The point estimate and standard error of the repeated measures estimator (drm) requires the correlation between STIM and SHAM retention scores (rd) which are rarely reported in repeated measure studies. Therefore, based on acquired raw data sets and plot digitizing (extracting data from figures), we calculated all available correlations. For the remaining studies where correlations were not available, the weighted mean correlation (i.e., random effects) is used to populate missing values.
2.5. Random-effects modeling
To perform our new meta-analysis and confirm prior meta-analyses and assess potential sources of variation in our regression model we estimate the true effect size using a Random-effects model approach (Figure 1A). Random-effects modeling is used to estimate the mean of true effect sizes. Random-effects models allow for variation between true effect sizes (heterogeneity) across studies as should be the case when studies vary in their population, methodology, or design. We use the restricted maximum likelihood estimator to calculate between-study heterogeneity, that is, the standard deviation of true effect sizes (τ). All meta-analytic modeling is completed in R using the metafor package (Viechtbauer, 2010).
2.6. Mixed-effects (meta-regression) modeling
A meta-regression model was implemented to quantify the potential moderating effects of multiple study-level characteristics on the effect size (dδ). Because our initial analysis identified a downward trend in effect size with publication date (Figure 2A), the regression model was logistically set up to examine whether independent moderators could account for this decline. Thus, we built nine different regression models to investigate whether the effect size was conditionally dependent on the study's publication date or other moderating variables (e.g., age; see Tables 2, 3). For interpretability, we also report the effects for various subgroups based off of categorical moderators (Table 4). The first model consists of a single moderator (publication date) while the other models consist of both publication date and one other moderating variable. As detailed above, the restricted maximum likelihood estimation method was used for all meta-analytic models. Additionally, we utilized the moving constant technique (Johnson and Huedo-Medina, 2011) by setting the initial moderator value to zero, this allows the intercept to be interpreted as the predicted effect size (dδ) at the date of the original publication (Ngo et al., 2013b). To compare the nine regression models, each model is re-fit with maximum likelihood (as opposed to restricted maximum likelihood) which is necessary to conduct a likelihood ratio test between models. The likelihood ratio test assesses whether the addition of a given model parameter is warranted.
In addition, the sparsity of study-level characteristics could potentially drive our analyses to be insensitive to additive contributions of study-level characteristics to the observed heterogeneity in effect sizes. Many studies differed from Ngo et al. (2013b) methodologically, and some of those studies differed in multiple facets. To address this, we have constructed a “Methodological Deviation” factor consisting of the sum score of four other coded study design characteristics. Specifically, the sum score is taken as the sum of how study-level characteristics diverge from the seminal publication (Ngo et al., 2013b). The four characteristics used are age, semantic congruence, word count, and phase-locking. For example, if a study included subjects with similar mean age as Ngo et al. (2013b) (young adults), that component of general risk “Methodological Deviation” was coded as zero. The maximum “Methodological Deviation” is four, being relatively methodologically inconsistent with Ngo et al. (2013b), and the minimum zero being highly methodologically consistent with Ngo et al. (2013b). Subsequently, the “Methodological Deviation” factor was run with the omission of Ngo et al. (2013b). A major limitation to this index is that, since it is simply a sum score, it assumes that each methodological deviation from Ngo et al. (2013b) has identical weights.
2.7. Sensitivity analysis
Outliers can cause spurious results in small sample meta-regressions, therefore a leave-one-out cross-validation procedure was used to assess the robustness of the meta-regression model. For each iteration, one effect size is removed from the data set and the model is fit to the remaining effect sizes (Figure 2B). If the meta-regression model is to be considered robust, then each iteration of the leave-one-out-cross-validation should not greatly impact the model parameters. Specifically, each iteration of the leave-one-out-cross-validation should yield a significant slope coefficient, and no slope coefficient should significantly differ from the full model.
2.8. Meta-analysis of reliability coefficients
Reliability is a psychometric property that indexes the precision of a measurement instrument. A test-retest reliability coefficient is a method of estimating the reliability of a task by administering the same task on two separate occasions and then calculated the correlation between time points (see a diagrammatic representation in Figures 3A, B). In all the experiments within this meta-analysis, subjects underwent a cross-over design, that is, the same participants were in the control (SHAM) and treatment (STIM) conditions. Since there are multiple measurements for each participant, this allows for the opportunity to assess the test-retest reliability of the word-pair recall task used to measure memory retention. The word-pair cued recall task is characterized by three phases: (1) A learning phase where participants are presented with a list of word pairs, (2) a pre-sleep (immediate) recall phase where participants are assessed on their word-pair recall accuracy and scored on the number of word-pairs they correctly recall, (3) a post-sleep recall phase where participants are assessed on their word-pair recall accuracy and scored on the number of word-pairs they correctly recall. Memory retention is scored by the number of word pairs correctly recalled post-sleep subtracted from the number of word pairs correctly recalled pre-sleep (see Figures 3A, B):
Figure 3. The effect of reliability on significance tests and observed effect sizes. (A) Measurement model of experimental paradigm indicate that true pre and post-sleep scores (unobserved scores uncontaminated by measurement error) underlies the observed pre-sleep and post-sleep scores. Differences in true scores between delayed and immediate recall are indicative of memory retention. (B) Venn diagram representation of how pre-post test correlation effects the reliability of difference scores. (C) For variable true effect sizes with a median sample size of ñ = 15, the statistical power increases with the reliability of the measure. The dashed lines indicate the estimated reliability of the word-pair recall difference scores. (D) The biasing effect of low reliability on observed SMDs (dδ) demonstrates that decreases in reliability bias SMDs toward zero. The effect of this bias is stronger in higher SMDs. The dashed lines indicate the estimated reliability of the word-pair recall difference scores.
Where the difference score, Xd is used as the final memory retention score. However, word-pair recall is an imperfect measure of memory retention, therefore we can calculate the observed difference scores of Xd in terms of true scores (T; scores indicative of actual memory retention) and error variance (E; scores indicative of random error):
In classical test theory, the reliability of a measure is a ratio of true variance to observed variance such that:
To compute the reliability of a difference score we need the reliability of pre-sleep scores (ρpre), reliability of post-sleep scores (ρpost) and the correlation between pre and post-sleep scores (rpp). The reliability of pre-sleep (ρpre) is the test-retest correlation between the SHAM condition's pre-sleep score and the STIM condition's pre-sleep scores. The reliability of post-sleep (ρpost) is the test-retest correlation between the SHAM condition's post-sleep score and the STIM condition's post-sleep scores. Lastly, the pre/post correlation (rpp) is estimated from the correlation between pre and post-sleep scores of the SHAM condition:
The estimates from each study are pooled to estimate the reliability of word-pair recall task as an indicator of memory retention (Table 5). A convenient property of Glass' estimator (dδ) is the simplicity of the correction for task reliability. For a discussion on corrections for repeated measures effect sizes see the blog post by Pustejovsky (2023). For dδ, the unreliability of the task introduces error variance into the total observed variance, thus inflating the observed standard deviation and subsequently reducing the observed standardized mean difference (dδ,obs). This attenuation can be formally defined as:
Where dδ, true is the true standardized mean difference (see Figure 3C). This attenuation is also carried over to the standard error such that:
The attenuation of the observed effect size (dδ, obs) and the corresponding change in the standard error (seδ, obs) altogether results in decreased statistical power with lower reliability (see Figure 3C).
3. Results
The aim of this meta-analysis is to compare and extend two previous meta-analyses (Wunderlin et al., 2021; Stanyer et al., 2022) with the goal of identifying ways to improve future studies on acoustic stimulation for memory enhancement. Our full analysis includes fourteen effect sizes from 12 studies that test how acoustic stimulation during sleep impacts post-sleep memory task performance (see Section 2, Figure 1). Our full analysis includes three additional effect sizes from two publications that followed the initial meta-analyses by Wunderlin and colleagues (Prehn-Kristensen et al., 2020; Harrington et al., 2021) (Figure 1A, boxes indicate distinct studies included for each meta-analysis). As a starting point, we first analyze a subset of 11 studies and confirm that our standardized mean difference effects are virtually identical to effect sizes reported by Wunderlin and colleagues (mean absolute difference in SMD estimates = 0.02; Table 1 and Figure 1B). Moreover, we confirm that the corresponding pooled effects for these eleven studies are small (drm = 0.25, 95% CI [−0.02, 0.53], p = 0.211), consistent with the prior report by Wunderlin and colleagues (drm = 0.25, 95% CI [−0.02, 0.53]). When analyzing a subset of six studies that employ phase-locked acoustic stimulation and include non-elderly adults, Wunderlin and colleagues report almost a two-fold increase in effect size (drm = 0.44, 95% CI [0.09, 0.79]). Accordingly, we confirm that phase-locked acoustic stimulation protocol and age have a significant positive result for this same subset of combined studies (drm = 0.44, 95% CI [0.07, 0.80]). However, this effect is no longer significant (dδ = 0.23, 95% CI [−0.12, 0.58], p = 0.206) when we include three additional effect sizes for the current meta-analysis (Prehn-Kristensen et al., 2020; Harrington et al., 2021). As detailed below, our regression models find a progressive decline in effects over the past 10 years. This trend also explains away the subgroup effect previously reported by Wunderlin and colleagues for phase-locked stimulation in young adults (Wunderlin studies: p = 0.512; All current studies: p = 0.642), while the effect of publication date remains significant (Wunderlin studies: p = 0.002, All current studies: p < 0.0001). The Stanyer meta-analysis (Stanyer et al., 2022) examines the possible differences in effects for studies testing declarative vs. non-declarative memory. Stanyer and colleagues find no significant differences across memory task type and suggest that future studies should include more subjects and potentially more overnight measures which we concur. However, we find our standardized mean difference effects are not strongly correlated with those reported by Stanyer and colleagues (mean absolute difference in SMD estimates = 0.84; Figure 1C and Table 1). For our full meta-analysis, we examine the possibility that the inclusion of several recent studies that use double-blind procedures could account for the progressive decline in effect size (see Figure 2A, blue symbols). As observed for other moderators, double-blind procedures did not account for the decline in effect size (Table 3). Finally, we calculated the percentage of subjects that reported hearing acoustic stimuli in five independent studies (Section 2, Figure 2C). We estimate that 34% of participants report hearing acoustic stimuli at some point during their sleep session (kS = 5 studies, 27 of 80 participants). This raises the possibility that many subjects in these studies are indeed not blinded to the experimental condition which could create placebo, or nocebo, effects (Petersen et al., 2014; Lindheimer et al., 2015; Faraone et al., 2022). This leads us to suggest that future studies should include a true placebo acoustic stimulation condition, as has been done for previous physiological studies (Ngo et al., 2013a).
3.1. Decline in observed effect sizes over time
Our meta-analysis examining a total of 14 studies finds a progressive annual decline in reported memory effects with acoustic stimulation during sleep. Accordingly, there is a decline in reported effects (standardized mean difference) between 2013 and 2023 (Figure 2A). Our meta-regression models examine multiple potential sources of variability contributing to this decline (Section 2) including publication date, age, gender, phase-locking, word count, and congruence of words in memory tasks, overnight sleep condition, double-blind procedures, and the physiological index of slow oscillation power (see Table 3). For effect sizes within categorical moderators, see Table 4. Using a meta-regression model (Section 2, Model 1) where publication date is the sole moderator, we find that the date of publication demonstrates a negative trend that accounts for 91.8% of the heterogeneity between studies (B = −0.166 [−0.254, −0.078], Qm = 13.72, p = 0.0002, kES = 14 effect sizes, n = 206, τresid = 0.110, see Figure 2A and Table 3). Given the small number of effect sizes in this literature, we looked to evaluate the robustness of this trend with leave-one-out cross-validation (Section 2.7, Figure 2A) to ensure highly influential studies do not artifactually produce this trend. Our leave-one-out cross-validation (Section 2) analysis finds that the regression coefficient (beta) in each iteration remains significant and unchanged from the full model coefficient (Figure 2B). This indicates that the decline in effect size is not likely driven by outliers.
When adding one variable at a time using a step-wise procedure (Section 2), we find that no other study-level characteristic accounts for the annual decline in study effects (see Table 3, Models 2–9). Corrected Akaike and Bayesian Information Criterion (AICc and BIC, respectively) tests confirm that the publication date (Model 1) is the most parsimonious (according to BIC), explanatory model of the set (according to AICc). Additionally, log-likelihood test demonstrates that the addition of any alternative moderators does not improve the model fit. When we look at the predicted effect size (based on Model 1) across time, the first study (Ngo et al., 2013b) publication date has a large effect size dδ = 0.99 (95% CI [0.49, 1.49]) that declines to a net negative effect of dδ = −0.39 (95% CI [−0.73, −0.05]) in the year of the most recent publication (see Harrington et al., 2021, Figure 2A). This indicates that recent studies show a markedly reduced effect of acoustic stimulation on memory task performance, in spite of having similar cohort sizes. Next, we consider the possibility that there are cumulative methodological deviations accounting for the publication date effect. The relatively low number of studies deviating from Ngo et al. (2013b) in certain study characteristics (age, phase-locking, whole night) could lead to such analyses being underpowered for observing unique moderator effects. Likewise, combinations of study characteristics could drive heterogeneity in observed effect sizes. However, our “Methodological Deviation” moderator model (Section 2) indicates that cumulative methodological deviation does not account for any of the heterogeneity in effect sizes (B = −0.034 [−0.257, 0.190], p = 0.768). This indicates that a cumulative methodological deviation from Ngo et al. (2013b) does not explain the decline in effect sizes across studies or years.
3.2. Reliability of word-pair association retention task
In neuroscience, psychometric evaluations and measurement errors for assessing cognitive processes such as working memory and memory consolidation are rarely investigated. A potential shortcoming of standard test vs. retest memory task difference scores used in all the studies included in our meta-analysis is the loss of shared variability (Figure 3). For example, when there is no correlation between a first test (e.g., pre-sleep) vs. a second test (e.g., post-sleep) condition there is no shared variability resulting in a highly reliable difference score (Figure 3B, top row). In contrast, if subjects have similar variability across test and retest conditions there can be highly overlapped variability that is lost in the difference score measure (Figure 3B, rows 2 and 3). We estimate the reliability of the memory task difference scores as the ratio of true variance to total observed variance (Section 2, true plus error variance, Equation 10). If there is a loss in total observed variance due to overlapped variability for test vs. retest conditions this will result in low reliability for the memory performance difference score. The reliability of the word-pair association task used to measure memory retention (and consolidation) in this literature has yet to be evaluated. Thankfully, we can analyze this as investigators have provided seven raw data sets from a subset of five studies in the current meta-analysis (Section 2, Table 5). Each of these studies includes independent measures of memory task performance before and after sleep that can be used to calculate pre-sleep and post-sleep test-retest reliability (Section 2, Figure 3A). Using this model, we find that the pre-sleep memory task performance has fair reliability with a ρpre = 0.711 (95% CI [0.615, 0.807]). The post-sleep memory task performance shows a slightly better reliability of ρpost = 0.835 (95% CI [0.775, 0.896]). However, the correlation between pre-sleep and post-sleep scores from the control sleep condition (SHAM acoustic stimulation) is notably strong at rpp = 0.949 (95% CI [0.920, 0.979]). This high correlation between test-retest scores reduces the reliability of the difference score, as described above (Figure 3B). Additionally, we compute correlations for pre-sleep vs. post-sleep tests which are delayed by one week by calculating the correlation between pre-sleep scores from the STIM condition and post-sleep scores for the SHAM condition (Section 2). This delayed test-retest correlation is = 0.742 (95% CI [0.655, 0.830]). Based on classical test theory (Section 2, Equation 7) our estimates indicate a near zero reliability for difference scores (ρd = 0.006, 95% CI [−0.188, 0.199]). Even if the delayed pre-sleep vs. post-sleep correlation is used (a more liberal estimate), the reliability of the difference score is still close to zero, = 0.015 (95% CI [−0.178, 0.208]). This indicates that the observed changes in word-pair retention may be attributable to noise. This is surprising considering that the correlation between difference scores for STIM vs. SHAM conditions is rd = 0.465 (95% CI [0.247, 0.682]), as the difference score correlation should be bounded by the reliability estimate. However, even if we treat this correlation as an estimate of the reliability of difference scores, the reliability would still be considered very poor. In summary, measurement error in the task used to measure memory retention is expected to bias effect sizes and decrease statistical power (Figures 3C, D). It is possible that the poor reliability of the memory task difference scores in conjunction with low sample sizes and variable study-level characteristics could drive the observed heterogeneity in effect sizes.
4. Discussion
The use of acoustic simulation during slow-wave sleep to modulate slow waves and overnight word pair memory retention has been the focus of several studies, meta-analyses, and reviews within the last decade. Within this body of literature, there exists a variety of reported effects on memory performance, with meta-analyses suggesting small-to-moderate overall effect sizes and high degrees of variability across studies (Wunderlin et al., 2021; Stanyer et al., 2022). Previously, Wunderlin et al. (2021) found a larger memory enhancement effect when pooling studies that had included phase-locked acoustic stimulation and non-elderly participant age. For this same pool of studies, we confirm this result. Additionally, we confirm their report of a low overall effect and a large degree of heterogeneity even when controlling for acoustic stimulation and age. In addition, we extend the Wunderlin study by analyzing three additional recent studies and find there is no longer a significant pooled effect. Moreover, our meta-regression models examine multiple potential sources of variability including publication date, age, gender, phase-locking, word count and congruence of words in memory tasks, overnight sleep, double-blind procedures, and the physiological index of slow oscillation power (see Tables 2, 3). We find that none of these variables, except for publication date, accounts for the cumulative decline of memory effects with acoustic stimulation (see Table 3).
4.1. Publication date
Previous meta-analyses (Wunderlin et al., 2021; Stanyer et al., 2022) reported small-to-moderate effects of acoustic stimulation on sleep-dependent changes in word-pair memory retention as well as a large degree of heterogeneity unexplained. We demonstrate a yearly decline in reported effects, explaining 91.8% of the heterogeneity. In light of these findings, we also presented updated analyses of moderators examined in previous meta-analyses such as age and semantic congruence in word-pair memory tasks (Table 3). As expected, the addition of all other coded study characteristics did not explain heterogeneity above solely publication date. These trends describe a yearly decline in the reported efficacy of acoustic stimulation to modulate sleep-related word-pair memory consolidation (or retention). We also report that studies consistently demonstrate significant modulation of slow oscillation power ranging from dδ = 0.48 to dδ = 2.91. Heterogeneity in observed effects on memory performance was not attributable to that of observed effects of acoustic stimulation on slow oscillation power.
Decline effects are a common phenomenon in psychological sciences (Schooler, 2011; Pietschnig et al., 2019; Schimmack, 2020) and other disciplines (Munafò et al., 2007). The decline effect describes the phenomena where effect sizes decrease upon subsequent replications of the initial study. The yearly decline presented here likewise explains previously reported significant findings (Wunderlin et al., 2021) for acoustic stimulation overnight memory retention. The inclusion of publication date as a covariate in Model 6 by Wunderlin et al. (2021) diminished the effect of phase-locked stimulation in non-elderly where the regression coefficient is no longer significant. A finding which, is consistent still with the addition of recently published works with phase-locked acoustic stimulation in non-elderly (children and adults; Prehn-Kristensen et al., 2020; Harrington et al., 2021). This is further demonstrated to be robust to the contribution of outliers via our leave-one-out-cross-validation analysis (Figure 2B). Highlighting the consistency of the significant effect of publication date on the heterogeneity in observed effect sizes across studies.
There are many reported mechanisms for the Decline Effect. Recent work (Pietschnig et al., 2019) has demonstrated that underpowered designs of initial studies drive observed effect declines in intelligence literature. Contrary to this, as demonstrated through our leave-one-out-cross-validation analysis of the effect of publication date on the observed effect sizes across studies, we see that no single study drives this trend. A relevant factor here is that the presently analyzed studies do not vary greatly in their sample size (n range = 11–24) and therefore statistical power. In addition, other work has argued that unpublished findings contribute to effect declines across studies (Schooler, 2011), as well as for selective reporting (Schimmack, 2020). However, while we can not confirm any previously purported mechanisms driving effect declines here, we do suspect that the poor reliability of the task being used in the present literature does decrease statistical power and bias the effect size estimates (Figures 3C, D).
4.2. Task reliability
The measurement properties of behavioral and cognitive tests are rarely investigated in behavioral and cognitive neuroscience. In this literature, memory retention is measured using difference scores of word-pair recall tests (post-sleep scores minus pre-sleep scores). Difference scores are sometimes necessary to isolate cognitive processes such as inhibitory control which is often measured using a color-word congruence or Stroop test where the reaction times of congruent trials are subtracted from the reaction time of the incongruent word pairs to create a difference score. However, difference scores are notoriously unreliable (Rogosa and Willett, 1983; Hedge et al., 2018), especially when the pre-test and post-test is highly correlated, which is the case in this literature. Here, we find that the word-pair task expressed as a difference score is not a reliable index of memory retention. When calculating statistical power at different “true” effect sizes, we find that low difference score reliability is associated with lower power (Figure 3B, vertical dotted line). Additionally, this low difference score reliability can result in variable and biased observed effect size estimates as illustrated for a range of simulated true effect sizes (Figure 3D). With lower reliability, effect size estimates for dδ and dav are consistently biased toward zero. Taken together, it is evident that the reliability of overnight changes in word-pair retention is inadequate, and drives biased effect size estimates and decreases statistical power. Given we are only able to obtain reliability estimates for a small portion of the presently analyzed study, we can not with confidence evaluate study-level differences in reliability as a potential moderator. However, low task reliability could potentially account for the large heterogeneity in effects across studies.
4.3. Limitations
While the above analyses substantially explain a large degree of heterogeneity found in this body of literature, there remain other factors that could be of interest. There is great evidence suggesting that the rhythmic coordination of oscillations during slow-wave sleep is an essential component of memory processes during slow-wave sleep (Isomura et al., 2006; Staresina et al., 2015; Rothschild et al., 2017; Navarrete et al., 2020). It is our view that this is the theoretical and physiological basis for all the studies analyzed in the present study: Ngo et al. (2013b), Ngo et al. (2015), Ong et al. (2016), Leminen et al. (2017), Henin et al. (2019), Choi et al. (2019), Prehn-Kristensen et al. (2020), Schneider et al. (2020), Diep et al. (2020), and Harrington et al. (2021). Accordingly, the degree to which acoustic stimulation enhances slow-oscillation power and its temporal coupling to spindles could determine the corresponding memory consolidation. The frequency distribution of spindle oscillations and coupling of slow waves and spindle oscillations change during development through adulthood (Purcell et al., 2017; Hahn et al., 2020; Kurz et al., 2020). Additionally, slow wave activity declines with aging in adults (Mander et al., 2013, 2017). In theory, the efficacy of acoustic and other forms of stimulation to enhance slow waves, spindles, and memory consolidation could vary across development and aging. Hence, there is a strong rationale to constrain age to strengthen initial future studies validating an approach. Studies such as that of Schneider and colleagues find older adults can have reduced pre-stimulus slow wave oscillations compared with young adults and from a physiological standpoint this may impact the ability to enhance the slow waves (Schneider et al., 2020). However, the vast majority of studies fail to report the phase-locking estimates and their standard deviations for the arrival of acoustic stimuli relative to slow-oscillations, with the exception of three studies (Ong et al., 2016; Leminen et al., 2017; Papalambros et al., 2017). Similarly, sleep spindles are known to be coupled with slow oscillations and spindle density correlates with memory retention and general cognitive ability (Reynolds et al., 2018; Ujma, 2018) and spindles are considered biomarkers of the memory reconsolidation process (Hennies et al., 2016; Denis et al., 2021). Additionally, studies indicate slow and fast spindles play distinct roles in memory consolidation (Barakat et al., 2011; Mölle et al., 2011; Ayoub et al., 2013; Cox et al., 2017). In line with this, several studies included in this meta-analysis find distinct time scales of effects on slow vs. fast spindles (Ngo et al., 2013b; Ong et al., 2016; Weigenand et al., 2016; Henin et al., 2019) while other studies simply reported effects on fast spindles (Ngo et al., 2015; Leminen et al., 2017; Schneider et al., 2020; Harrington et al., 2021). However, several studies also reported reductions in sleep spindle power during periods of slow-wave sleep without acoustic stimulation (Weigenand et al., 2016; Diep et al., 2020; Schneider et al., 2020). Taken together, studies varied greatly in the extent to which they broadly characterized changes in these features during sleep. Inconsistent reporting of characteristic changes in slow wave and spindle oscillations and their relationships to one another and behavioral measures makes meta-analytically addressing these problems infeasible with such small sample sizes. In line with this, the sparse distribution of study-level characteristics in combination with these small sample sizes could lead to a lack of clear variability between study subgroups, and moderator analyses being insensitive to contributions of study-level characteristics to the heterogeneity. Thus, better tracking and reporting of these metrics in future studies with larger sample sizes would greatly facilitate future studies and meta-analyses.
4.4. Conclusions and recommendations for future research
In conclusion, our results demonstrate a yearly decline in the reported efficacy of acoustic stimulation to modulate overnight word-pair retention. The presence of such a trend in small bodies of studies investigating novel interventions is unsurprising (Schimmack, 2020). For the studies presented here, this finding is accompanied by another highly relevant characteristic across studies: the memory tasks used are largely unfit for the present study designs. While several study characteristics were unable to be accounted for in this meta-analysis, and the present literature analyzed is small, we find the results presented here suggestive of the following five recommendations:
• Current studies are underpowered due to small sample sizes. Future research should utilize larger samples to increase statistical power.
• Reliability of the memory task should be strengthened. Increasing the number of sleep and memory test sessions will provide repeated measurements that will enhance reliability, as reliability increases with the number of retests. Additionally, future studies could utilize latent change score modeling using data from three or more sessions, as this affords the ability to dissociate variance and to estimate the change in “true scores” between intervention and control conditions, thereby mitigating the statistical unreliability of memory task difference scores (Hedge et al., 2018).
• Provided steps are taken to ensure statistically robust measures and reliability, multiple forms of memory tasks may be suitable. For example, a meta-analysis finds acoustic, cue-related targeted memory reactivation enhances memory performance across 91 experiments that employ a variety of declarative and skill acquisition memory tasks. This approach likely taps into similar physiological circuits, given that targeted acoustic cues enhance memory when played during stages 2 and 3 of non-rapid-eye-movement (NREM) sleep but not during other stages of sleep (Hu et al., 2020).
• Our estimates suggest ~34% of subjects are not blind to the study conditions as they report hearing acoustic stimulation during the STIM condition. Many studies reported varying proportions of the participants becoming aware of acoustic stimuli during the evening. This raises the risk, as each study consisted of only a single STIM and SHAM session, that at least some subjects were cognoscente of when they were undergoing the experimental condition. The incorporation of a random-phase stimulation condition, similar to Ngo et al. (2013a), would be a simple yet effective way to provide a placebo-control in this regard.
• Mixed-reporting of any behavioral data or EEG features and characteristics hinders accurate accounting for variation in many characteristics (e.g., spindle features) across studies. All of which are research trajectories that could be greatly informative to a variety of disciplines. Future work should prioritize transparency by making their code and data (physiological and behavioral) publicly available (Wilkinson et al., 2016).
The application of these recommendations to future studies would greatly reduce the risk of small-sample bias, low statistical power, and confounding psychological effects. They would facilitate future research-synthesis efforts by providing the greater statistical power necessary for analyses of behavioral and physiological effects of acoustic stimulation of slow oscillations.
Data availability statement
The original contributions presented in the study are publicly available. This data can be found here: https://osf.io/8b49k/.
Author contributions
TH: conceptualization, data curation, methodology, formal analysis, writing—original draft, and writing—reviewing and editing. MJ: conceptualization, data curation, visualization, methodology, formal analysis, software, and writing—reviewing and editing. HR: funding, senior-authorship, and writing—reviewing and editing. JC: funding, senior-authorship, supervision, and writing—reviewing and editing. All authors contributed to the article and approved the submitted version.
Funding
This study was supported by Brain-Computer Interface Core, State of Connecticut (grant 2806580), HR (PI). The University of Connecticut Institute for Brain and Cognitive Science (IBACS grant 2541260), JC (PI), HR (Co-PI), TH (Graduate Fellow), and MJ (Graduate Fellow). NIH, Collaborative Research and Computational Neuroscience (CRCNS), (grant 1R01DC020097-01) HR (Co-PI). The University of Connecticut College of Liberal Arts and Sciences (CLAS), (grant 2539120), HR (PI) and TH (Graduate Fellow).
Conflict of interest
HR has ownership interest in Elemind Technologies, Inc. and this private company did not sponsor this research.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/frsle.2023.1082253/full#supplementary-material
References
Ayoub, A., Aumann, D., Hörschelmann, A., Kouchekmanesch, A., Paul, P., Born, J., et al. (2013). Differential effects on fast and slow spindle activity, and the sleep slow oscillation in humans with carbamazepine and flunarizine to antagonize voltage-dependent Na+ and Ca2+ channel activity. Sleep 36, 905–911. doi: 10.5665/sleep.2722
Barakat, M., Doyon, J., Debas, K., Vandewalle, G., Morin, A., Poirier, G., et al. (2011). Fast and slow spindle involvement in the consolidation of a new motor sequence. Behav. Brain Res. 217, 117–121. doi: 10.1016/j.bbr.2010.10.019
Barrett, T. R., and Ekstrand, B. R. (1972). Effect of sleep on memory: III. Controlling for time-of-day effects. J. Exp. Psychol. 96, 321–327. doi: 10.1037/h0033625
Cairney, S. A., Lindsay, S., Sobczak, J. M., Paller, K. A., and Gaskell, M. G. (2016). The benefits of targeted memory reactivation for consolidation in sleep are contingent on memory accuracy and direct cue-memory associations. Sleep 39, 1139–1150. doi: 10.5665/sleep.5772
Choi, J., Won, K., and Jun, S. C. (2019). Acoustic stimulation following sleep spindle activity may enhance procedural memory consolidation during a nap. IEEE Access 7, 56297–56307. doi: 10.1109/ACCESS.2019.2913457
Chrobak, J. J., and Buzsáki, G. (1998). Operational dynamics in the hippocampal-entorhinal axis. Neurosci. Biobehav. Rev. 22, 303–310. doi: 10.1016/S0149-7634(97)00016-X
Cowan, E., Liu, A., Henin, S., Kothare, S., Devinsky, O., and Davachi, L. (2020). Sleep spindles promote the restructuring of memory representations in ventromedial prefrontal cortex through enhanced hippocampal-cortical functional connectivity. J. Neurosci. 40, 1909–1919. doi: 10.1523/JNEUROSCI.1946-19.2020
Cox, R., Schapiro, A. C., Manoach, D. S., and Stickgold, R. (2017). Individual differences in frequency and topography of slow and fast sleep spindles. Front. Hum. Neurosci. 11:433. doi: 10.3389/fnhum.2017.00433
De Gennaro, L., and Ferrara, M. (2003). Sleep spindles: an overview. Sleep Med. Rev. 7, 423–440. doi: 10.1053/smrv.2002.0252
Denis, D., Mylonas, D., Poskanzer, C., Bursal, V., Payne, J. D., and Stickgold, R. (2021). Sleep spindles preferentially consolidate weakly encoded memories. J. Neurosci. 41, 4088–4099. doi: 10.1523/JNEUROSCI.0818-20.2021
Diekelmann, S., and Born, J. (2010). The memory function of sleep. Nat. Rev. Neurosci. 11, 114–126. doi: 10.1038/nrn2762
Diep, C., Ftouni, S., Manousakis, J. E., Nicholas, C. L., Drummond, S. P. A., and Anderson, C. (2020). Acoustic slow wave sleep enhancement via a novel, automated device improves executive function in middle-aged men. Sleep 43:zsz197. doi: 10.1093/sleep/zsz197
Dijk, D. J., James, L. M., Peters, S., Walsh, J. K., and Deacon, S. (2010). Sex differences and the effect of gaboxadol and zolpidem on EEG power spectra in NREM and REM sleep. J. Psychopharmacol. 24, 1613–1618. doi: 10.1177/0269881109105788
Djonlagic, I., Mariani, S., Fitzpatrick, A. L., Van Der Klei, V. M. G. T. H., Johnson, D. A., Wood, A. C., et al. (2021). Macro and micro sleep architecture and cognitive performance in older adults. Nat. Hum. Behav. 5, 123–145. doi: 10.1038/s41562-020-00964-y
Ellenbogen, J. M., Hu, P. T., Payne, J. D., Titone, D., and Walker, M. P. (2007). Human relational memory requires time and sleep. Proc. Natl. Acad. Sci. U.S.A. 104, 7723–7728. doi: 10.1073/pnas.0700094104
Faraone, S. V., Newcorn, J. H., Cipriani, A., Brandeis, D., Kaiser, A., Hohmann, S., et al. (2022). Placebo and nocebo responses in randomised, controlled trials of medications for ADHD: a systematic review and meta-analysis. Mol. Psychiatry 27, 212–219. doi: 10.1038/s41380-021-01134-w
Fogel, S. M., and Smith, C. T. (2011). The function of the sleep spindle: a physiological index of intelligence and a mechanism for sleep-dependent memory consolidation. Neurosci. Biobehav. Rev. 35, 1154–1165. doi: 10.1016/j.neubiorev.2010.12.003
Hahn, M. A., Heib, D., Schabus, M., Hoedlmoser, K., and Helfrich, R. F. (2020). Slow oscillation-spindle coupling predicts enhanced memory formation from childhood to adolescence. eLife 9:e53730. doi: 10.7554/eLife.53730
Harrington, M. O., Ngo, H.-V. V., and Cairney, S. A. (2021). No benefit of auditory closed-loop stimulation on memory for semantically-incongruent associations. Neurobiol. Learn. Mem. 183:107482. doi: 10.1016/j.nlm.2021.107482
Hedge, C., Powell, G., and Sumner, P. (2018). The reliability paradox: why robust cognitive tasks do not produce reliable individual differences. Behav. Res. Methods 50, 1166–1186. doi: 10.3758/s13428-017-0935-1
Helfrich, R. F., Mander, B. A., Jagust, W. J., Knight, R. T., and Walker, M. P. (2018). Old brains come uncoupled in sleep: slow wave-spindle synchrony, brain atrophy, and forgetting. Neuron 97, 221–230.e4. doi: 10.1016/j.neuron.2017.11.020
Henin, S., Borges, H., Shankar, A., Sarac, C., Melloni, L., Friedman, D., et al. (2019). Closed-loop acoustic stimulation enhances sleep oscillations but not memory performance. eNeuro 6. doi: 10.1523/ENEURO.0306-19.2019
Hennies, N., Ralph, M. A. L., Kempkes, M., Cousins, J. N., and Lewis, P. A. (2016). Sleep spindle density predicts the effect of prior knowledge on memory consolidation. J. Neurosci. 36, 3799–3810. doi: 10.1523/JNEUROSCI.3162-15.2016
Hu, X., Cheng, L. Y., Chiu, M. H., and Paller, K. A. (2020). Promoting memory consolidation during sleep: a meta-analysis of targeted memory reactivation. Psychol. Bull. 146, 218–244. doi: 10.1037/bul0000223
Isomura, Y., Sirota, A., Özen, S., Montgomery, S., Mizuseki, K., Henze, D. A., et al. (2006). Integration and segregation of activity in entorhinal-hippocampal subregions by neocortical slow oscillations. Neuron 52, 871–882. doi: 10.1016/j.neuron.2006.10.023
Johnson, B. T., and Huedo-Medina, T. B. (2011). Depicting estimates using the intercept in meta-regression models: the moving constant technique. Res. Synth. Methods 2, 204–220. doi: 10.1002/jrsm.49
Kersanté, F., Purple, R. J., and Jones, M. W. (2023). The GABAA receptor modulator zolpidem augments hippocampal-prefrontal coupling during non-REM sleep. Neuropsychopharmacology 48, 594–604. doi: 10.1038/s41386-022-01355-9
Khan, H., Garg, A., Yasmeen, N., Agarwal, N. B., Yadav, D. K., Ashif Khan, M., et al. (2022). Zolpidem use and risk of suicide: a systematic review and meta-analysis. Psychiatry Res. 316:114777. doi: 10.1016/j.psychres.2022.114777
Kurz, E.-M., Conzelmann, A., Barth, G. M., Renner, T. J., Zinke, K., and Born, J. (2020). How do children with autism spectrum disorder form gist memory during sleep? A study of slow oscillation-spindle coupling. Sleep 44:zsaa290. doi: 10.1093/sleep/zsaa290
Lakens, D. (2013). Calculating and reporting effect sizes to facilitate cumulative science: a practical primer for t-tests and anovas. Front. Psychol. 4:863. doi: 10.3389/fpsyg.2013.00863
Leminen, M. M., Virkkala, J., Saure, E., Paajanen, T., Zee, P. C., Santostasi, G., et al. (2017). Enhanced memory consolidation via automatic sound stimulation during non-REM sleep. Sleep 40:zsx003. doi: 10.1093/sleep/zsx003
Leong, C. W. Y., Leow, J. W. S., Grunstein, R. R., Naismith, S. L., Teh, J. Z., D'Rozario, A. L., et al. (2022). A systematic scoping review of the effects of central nervous system active drugs on sleep spindles and sleep-dependent memory consolidation. Sleep Med. Rev. 62:101605. doi: 10.1016/j.smrv.2022.101605
Limoges, l., Mottron, L., Bolduc, C., Berthiaume, C., and Godbout, R. (2005). Atypical sleep architecture and the autism phenotype. Brain 128, 1049–1061. doi: 10.1093/brain/awh425
Lindheimer, J. B., O'Connor, P. J., and Dishman, R. K. (2015). Quantifying the placebo effect in psychological outcomes of exercise training: a meta-analysis of randomized trials. Sports Med. 45, 693–711. doi: 10.1007/s40279-015-0303-1
Lundahl, J., Deacon, S., Maurice, D., and Staner, L. (2012). EEG spectral power density profiles during NREM sleep for gaboxadol and zolpidem in patients with primary insomnia. J. Psychopharmacol. 26, 1081–1087. doi: 10.1177/0269881111424457
Lustenberger, C., Boyle, M. R., Alagapan, S., Mellin, J. M., Vaughn, B. V., and Fröhlich, F. (2016). Feedback-controlled transcranial alternating current stimulation reveals a functional role of sleep spindles in motor memory consolidation. Curr. Biol. 26, 2127–2136. doi: 10.1016/j.cub.2016.06.044
Mak-McCully, R. A., Deiss, S. R., Rosen, B. Q., Jung, K.-Y., Sejnowski, T. J., Bastuji, H., et al. (2014). Synchronization of isolated downstates (K-complexes) may be caused by cortically-induced disruption of thalamic spindling. PLoS Comput. Biol. 10:e1003855. doi: 10.1371/journal.pcbi.1003855
Mander, B. A., Rao, V., Lu, B., Saletin, J. M., Lindquist, J. R., Ancoli-Israel, S., et al. (2013). Prefrontal atrophy, disrupted NREM slow waves and impaired hippocampal-dependent memory in aging. Nat. Neurosci. 16, 357–364. doi: 10.1038/nn.3324
Mander, B. A., Winer, J. R., and Walker, M. P. (2017). Sleep and human aging. Neuron 94, 19–36. doi: 10.1016/j.neuron.2017.02.004
Marshall, L., Helgadóttir, H., Mölle, M., and Born, J. (2006). Boosting slow oscillations during sleep potentiates memory. Nature 444, 610–613. doi: 10.1038/nature05278
Martin, N., Lafortune, M., Godbout, J., Barakat, M., Robillard, R., Poirier, G., et al. (2013). Topography of age-related changes in sleep spindles. Neurobiol. Aging 34, 468–476. doi: 10.1016/j.neurobiolaging.2012.05.020
Massimini, M., Huber, R., Ferrarelli, F., Hill, S., and Tononi, G. (2004). The sleep slow oscillation as a traveling wave. J. Neurosci. 24, 6862–6870. doi: 10.1523/JNEUROSCI.1318-04.2004
Mednick, S., Nakayama, K., and Stickgold, R. (2003). Sleep-dependent learning: a nap is as good as a night. Nat. Neurosci. 6, 697–698. doi: 10.1038/nn1078
Mednick, S. C., Cai, D. J., Shuman, T., Anagnostaras, S., and Wixted, J. (2011). An opportunistic theory of cellular and systems consolidation. Trends Neurosci. 34, 504–514. doi: 10.1016/j.tins.2011.06.003
Mednick, S. C., McDevitt, E. A., Walsh, J. K., Wamsley, E., Paulus, M., Kanady, J. C., et al. (2013). The critical role of sleep spindles in hippocampal-dependent memory: a pharmacology study. J. Neurosci. 33, 4494–4504. doi: 10.1523/JNEUROSCI.3127-12.2013
Miyamoto, D., Hirai, D., and Murayama, M. (2017). The roles of cortical slow waves in synaptic plasticity and memory consolidation. Front. Neural Circuits 11:92. doi: 10.3389/fncir.2017.00092
Mölle, M., Bergmann, T. O., Marshall, L., and Born, J. (2011). Fast and slow spindles during the sleep slow oscillation: disparate coalescence and engagement in memory processing. Sleep 34, 1411–1421. doi: 10.5665/SLEEP.1290
Munafò, M. R., Matheson, I. J., and Flint, J. (2007). Association of the DRD2 gene Taq1A polymorphism and alcoholism: a meta-analysis of case-control studies and evidence of publication bias. Mol. Psychiatry 12, 454–461. doi: 10.1038/sj.mp.4001938
Navarrete, M., Valderrama, M., and Lewis, P. A. (2020). The role of slow-wave sleep rhythms in the cortical-hippocampal loop for memory consolidation. Curr. Opin. Behav. Sci. 32, 102–110. doi: 10.1016/j.cobeha.2020.02.006
Ngo, H.-V. V., Claussen, J. C., Born, J., and Mölle, M. (2013a). Induction of slow oscillations by rhythmic acoustic stimulation. J. Sleep Res. 22, 22–31. doi: 10.1111/j.1365-2869.2012.01039.x
Ngo, H.-V. V., Martinetz, T., Born, J., and Mölle, M. (2013b). Auditory closed-loop stimulation of the sleep slow oscillation enhances memory. Neuron 78, 545–553. doi: 10.1016/j.neuron.2013.03.006
Ngo, H.-V. V., Miedema, A., Faude, I., Martinetz, T., Mölle, M., and Born, J. (2015). Driving sleep slow oscillations by auditory closed-loop stimulation-a self-limiting process. J. Neurosci. 35, 6630–6638. doi: 10.1523/JNEUROSCI.3133-14.2015
Niknazar, M., Krishnan, G. P., Bazhenov, M., and Mednick, S. C. (2015). Coupling of thalamocortical sleep oscillations are important for memory consolidation in humans. PLoS ONE 10:e0144720. doi: 10.1371/journal.pone.0144720
Nishida, M., and Walker, M. P. (2007). Daytime naps, motor memory consolidation and regionally specific sleep spindles. PLoS ONE 2:e341. doi: 10.1371/journal.pone.0000341
Ong, J. L., Lo, J. C., Chee, N. I. Y. N., Santostasi, G., Paller, K. A., Zee, P. C., et al. (2016). Effects of phase-locked acoustic stimulation during a nap on EEG spectra and declarative memory consolidation. Sleep Med. 20, 88–97. doi: 10.1016/j.sleep.2015.10.016
Papalambros, N. A., Santostasi, G., Malkani, R. G., Braun, R., Weintraub, S., Paller, K. A., et al. (2017). Acoustic enhancement of sleep slow oscillations and concomitant memory improvement in older adults. Front. Hum. Neurosci. 11:109. doi: 10.3389/fnhum.2017.00109
Petersen, G. L., Finnerup, N. B., Colloca, L., Amanzio, M., Price, D. D., Jensen, T. S., et al. (2014). The magnitude of nocebo effects in pain: a meta-analysis. Pain 155, 1426–1434. doi: 10.1016/j.pain.2014.04.016
Pietschnig, J., Siegel, M., Eder, J. S. N., and Gittler, G. (2019). Effect declines are systematic, strong, and ubiquitous: a meta-meta-analysis of the decline effect in intelligence research. Front. Psychol. 10:2874. doi: 10.3389/fpsyg.2019.02874
Plihal, W., and Born, J. (1997). Effects of early and late nocturnal sleep on declarative and procedural memory. J. Cogn. Neurosci. 9, 534–547. doi: 10.1162/jocn.1997.9.4.534
Prehn-Kristensen, A., Ngo, H.-V. V., Lentfer, L., Berghäuser, J., Brandes, L., Schulze, L., et al. (2020). Acoustic closed-loop stimulation during sleep improves consolidation of reward-related memory information in healthy children but not in children with attention-deficit hyperactivity disorder. Sleep 43:zsaa017. doi: 10.1093/sleep/zsaa017
Purcell, S. M., Manoach, D. S., Demanuele, C., Cade, B. E., Mariani, S., Cox, R., et al. (2017). Characterizing sleep spindles in 11,630 individuals from the National Sleep Research Resource. Nat. Commun. 8:15930. doi: 10.1038/ncomms15930
Pustejovsky, J. E. (2023). Cohen's DZ Makes Me Dizzy When Considering Measurement Error. Available online at: jepusto.com
Rauchs, G., Schabus, M., Parapatics, S., Bertran, F., Clochon, P., Hot, P., et al. (2008). Is there a link between sleep changes and memory in Alzheimer's disease? Neuroreport 19, 1159–1162. doi: 10.1097/WNR.0b013e32830867c4
Reynolds, C. M., Short, M. A., and Gradisar, M. (2018). Sleep spindles and cognitive performance across adolescence: a meta-analytic review. J. Adolesc. 66, 55–70. doi: 10.1016/j.adolescence.2018.04.003
Rogosa, D. R., and Willett, J. B. (1983). Demonstrating the reliability of the difference score in the measurement of change. J. Educ. Measure. 20, 335–343. doi: 10.1111/j.1745-3984.1983.tb00211.x
Rothschild, G., Eban, E., and Frank, L. M. (2017). A cortical-hippocampal-cortical loop of information processing during memory consolidation. Nat. Neurosci. 20, 251–259. doi: 10.1038/nn.4457
Sanchez-Vives, M. V. (2020). Origin and dynamics of cortical slow oscillations. Curr. Opin. Physiol. 15, 217–223. doi: 10.1016/j.cophys.2020.04.005
Schimmack, U. (2020). A meta-psychological perspective on the decade of replication failures in social psychology. Can. Psychol. 61, 364–376. doi: 10.1037/cap0000246
Schneider, J., Lewis, P. A., Koester, D., Born, J., and Ngo, H.-V. V. (2020). Susceptibility to auditory closed-loop stimulation of sleep slow oscillations changes with age. Sleep 43:zsaa111. doi: 10.1093/sleep/zsaa111
Schooler, J. (2011). Unpublished results hide the decline effect. Nature 470, 437–437. doi: 10.1038/470437a
Simon, K. C., Whitehurst, L. N., Zhang, J., and Mednick, S. C. (2021). Zolpidem maintains memories for negative emotions across a night of sleep. Affect. Sci. 3, 389–399. doi: 10.1007/s42761-021-00079-1
Stanyer, E. C., Baniqued, P. D. E., Awais, M., Kouara, L., Davies, A. G., Killan, E. C., et al. (2022). The impact of acoustic stimulation during sleep on memory and sleep architecture: a meta-analysis. J. Sleep Res. 31:e13385. doi: 10.1111/jsr.13385
Staresina, B. P., Bergmann, T. O., Bonnefond, M., van der Meij, R., Jensen, O., Deuker, L., et al. (2015). Hierarchical nesting of slow oscillations, spindles and ripples in the human hippocampus during sleep. Nat. Neurosci. 18, 1679–1686. doi: 10.1038/nn.4119
Steriade, M. (2006). Grouping of brain rhythms in corticothalamic systems. Neuroscience 137, 1087–1106. doi: 10.1016/j.neuroscience.2005.10.029
Stokes, P. A., Rath, P., Possidente, T., He, M., Purcell, S., Manoach, D. S., et al. (2022). Transient oscillation dynamics during sleep provide a robust basis for electroencephalographic phenotyping and biomarker identification. Sleep 46:zsac223. doi: 10.1093/sleep/zsac223
Tessier, S., Lambert, A., Chicoine, M., Scherzer, P., Soulières, I., and Godbout, R. (2015). Intelligence measures and stage 2 sleep in typically-developing and autistic children. Int. J. Psychophysiol. 97, 58–65. doi: 10.1016/j.ijpsycho.2015.05.003
Tononi, G., and Cirelli, C. (2006). Sleep function and synaptic homeostasis. Sleep Med. Rev. 10, 49–62. doi: 10.1016/j.smrv.2005.05.002
Ujma, P. P. (2018). Sleep spindles and general cognitive ability–A meta-analysis. Sleep Spindles Cortical Up States 2, 1–17. doi: 10.1556/2053.1.2016.001
Viechtbauer, W. (2010). Conducting meta-analyses in R with the metafor package. J. Stat. Softw. 36, 1–48. doi: 10.18637/jss.v036.i03
Wamsley, E. J., Tucker, M. A., Shinn, A. K., Ono, K. E., McKinley, S. K., Ely, A. V., et al. (2012). Reduced sleep spindles and spindle coherence in schizophrenia: mechanisms of impaired memory consolidation? Biol. Psychiatry 71, 154–161. doi: 10.1016/j.biopsych.2011.08.008
Weigenand, A., Mölle, M., Werner, F., Martinetz, T., and Marshall, L. (2016). Timing matters: open-loop stimulation does not improve overnight consolidation of word pairs in humans. Eur. J. Neurosci. 44, 2357–2368. doi: 10.1111/ejn.13334
Weng, Y.-Y., Lei, X., and Yu, J. (2020). Sleep spindle abnormalities related to Alzheimer's disease: a systematic mini-review. Sleep Med. 75, 37–44. doi: 10.1016/j.sleep.2020.07.044
Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., et al. (2016). The FAIR guiding principles for scientific data management and stewardship. Sci. Data 3:160018.
Wunderlin, M., Züst, M. A., Hertenstein, E., Fehér, K. D., Schneider, C. L., Klöppel, S., et al. (2021). Modulating overnight memory consolidation by acoustic stimulation during slow-wave sleep: a systematic review and meta-analysis. Sleep 44:zsaa296. doi: 10.1093/sleep/zsaa296
Zeller, C. J., Züst, M. A., Wunderlin, M., Nissen, C., and Klöppel, S. (2023). The promise of portable remote auditory stimulation tools to enhance slow-wave sleep and prevent cognitive decline. J. Sleep Res. 2023:e13818. doi: 10.1111/jsr.13818
Keywords: meta-analysis, slow-wave sleep, slow oscillations, acoustic stimulation, memory, reliability, decline effect
Citation: Harlow TJ, Jané MB, Read HL and Chrobak JJ (2023) Memory retention following acoustic stimulation in slow-wave sleep: a meta-analytic review of replicability and measurement quality. Front. Sleep 2:1082253. doi: 10.3389/frsle.2023.1082253
Received: 28 October 2022; Accepted: 11 April 2023;
Published: 11 May 2023.
Edited by:
Dan Denis, University of York, United KingdomReviewed by:
Marina Wunderlin, Universitäre Psychiatrische Dienste Bern, SwitzerlandMiguel Navarrete, St. Jude Children's Research Hospital, United States
Copyright © 2023 Harlow, Jané, Read and Chrobak. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Tylor J. Harlow, dHlsb3IuaGFybG93JiN4MDAwNDA7dWNvbm4uZWR1; Matthew B. Jané, bWF0dGhldy5qYW5lJiN4MDAwNDA7dWNvbm4uZWR1
†These authors share first authorship