- 1Department of Speech, Language and Hearing Sciences, Indiana University Bloomington, Bloomington, IN, United States
- 2School of Communication Sciences and Disorders, The University of Memphis, Memphis, TN, United States
- 3Rotman Research Institute, Baycrest Centre, Toronto, ON, Canada
- 4Department of Psychology, University of Toronto, Toronto, ON, Canada
- 5Department of Psychiatry, University of Toronto, Toronto, ON, Canada
- 6Institute of Medical Science, University of Toronto, Toronto, ON, Canada
- 7Department of Audiology, San José State University, San Jose, CA, United States
- 8Music and Health Science Research Collaboratory, University of Toronto, Toronto, ON, Canada
Emerging evidence suggests transcranial direct current stimulation (tDCS) can improve cognitive performance in older adults. Similarly, music listening may improve arousal and stimulate subsequent performance on memory-related tasks. We examined the synergistic effects of tDCS paired with music listening on auditory neurobehavioral measures to investigate causal evidence of short-term plasticity in speech processing among older adults. In a randomized sham-controlled crossover study, we measured how combined anodal tDCS over dorsolateral prefrontal cortex (DLPFC) paired with listening to autobiographically salient music alters neural speech processing in older adults compared to either music listening (sham stimulation) or tDCS alone. EEG assays included both frequency-following responses (FFRs) and auditory event-related potentials (ERPs) to trace neuromodulation-related changes at brainstem and cortical levels. Relative to music without tDCS (sham), we found tDCS alone (without music) modulates the early cortical neural encoding of speech in the time frame of ∼100–150 ms. Whereas tDCS by itself appeared to largely produce suppressive effects (i.e., reducing ERP amplitude), concurrent music with tDCS restored responses to those of the music+sham levels. However, the interpretation of this effect is somewhat ambiguous as this neural modulation could be attributable to a true effect of tDCS or presence/absence music. Still, the combined benefit of tDCS+music (above tDCS alone) was correlated with listeners’ education level suggesting the benefit of neurostimulation paired with music might depend on listener demographics. tDCS changes in speech-FFRs were not observed with DLPFC stimulation. Improvements in working memory pre to post session were also associated with better speech-in-noise listening skills. Our findings provide new causal evidence that combined tDCS+music relative to tDCS-alone (i) modulates the early (100–150 ms) cortical encoding of speech and (ii) improves working memory, a cognitive skill which may indirectly bolster noise-degraded speech perception in older listeners.
Introduction
Transcranial direct current stimulation (tDCS) is a type of neuromodulation technique that exerts effects over a specified cortical target region of the brain. Anodal and cathodal electrodes are placed on the scalp and low intensity direct current is applied resulting in polarity-dependent changes in cortical activity (Neuling et al., 2012). Though the mechanisms by which tDCS-modulated brain activity are debated (Stagg et al., 2009), it is thought anodal stimulation elicits depolarization of neurons in cortical target regions, increasing the probability of action potentials, whereas cathodal stimulation acts to hyperpolarize surrounding membrane potentials thereby decreasing the likelihood of neuronal firing (Stagg et al., 2009; Thair et al., 2017; Wang et al., 2020). Given its ability to upregulate/downregulate neuronal signaling, tDCS is being actively explored as a means to enhance behavioral functions via selected targeting of brain areas associated with certain perceptual-cognitive processes.
Transcranial direct current stimulation studies have overwhelmingly focused on improving memory functions (for review, see Thair et al., 2017). Fewer studies have examined neuroplastic effects of tDCS on aspects of audition and sensory processing (cf. Antal et al., 2004; Bolognini et al., 2011; Castaño-Castaño et al., 2019). Generally in these studies, 1–2 mA of direct current is applied over auditory cortex for 10–20-min intervals (Wang et al., 2020). Changes in brain activity can then be tracked non-invasively via scalp-recorded electroencephalography (EEG) and event-related brain potentials (ERPs). Using these techniques, two studies found that tDCS influenced the strength of the auditory P50 (frontocentral positive wave peaking at ∼50 ms), suggesting tDCS can modulate auditory stimulus gating (Zaehle et al., 2011; Terada et al., 2015). However, Kunzelmann et al. (2018) noted that electrode location, current intensity, and stimulation interval have dramatic effects on tDCS effects and corresponding neural responses, which may account for the highly variable and subtle effects of neurostimulation on auditory perception reported in prior work.
Nevertheless, a handful of studies have shown positive effects of tDCS on auditory processing. For instance, cathodal stimulation over auditory temporal cortex improves phonetic categorization and enhanced the amplitude of the P50 ERP in response to consonant vowel sounds (Heimrath et al., 2016). Anodal tDCS has also been used to modulate auditory stream segregation (Deike et al., 2016) and improve performance on sound discrimination tasks (Ladeira et al., 2011; Impey and Knott, 2015). Both positive and negative results have been reported for pitch discrimination tasks with stimulation over Heschl’s gyrus (Mathys et al., 2010; Matsushita et al., 2021). For instance, Matsushita et al. (2021) showed that pitch discrimination learning was disrupted when anodal was applied over the right (but not left) auditory cortex. Mixed results have been reported in other sensory domains including vision (Antal et al., 2004; Bolognini et al., 2011; Castaño-Castaño et al., 2019). In relation to speech perception, cathodal tDCS over right inferior frontal gyrus (i.e., right homolog of canonical Broca’s area) increases prosodic comprehension during dichotic listening (Alexander et al., 2012). Thus, while there is some discrepancy on the direction of tDCS effects (i.e., enhancement or suppression of brain function) and whether anodal vs. cathodal stimulation is more effective, the mere presence of these neuromodulatory effects on auditory brain activity demonstrates tDCS can be used to identify causal relationships between stimulating neural pathways inducing neuroplasticity and behavior (Boroda et al., 2020). Although tDCS may be effective for alleviating some deficits in aberrant cognitive aging (e.g., dementias; Hsu et al., 2015), it is not yet clear whether such benefits extend to the healthy aging population (Nilsson et al., 2015; Berryhill and Martin, 2018). If tDCS does facilitate neuroplasticity and influence perceptual processing, then it could be offered in clinical settings to enhance or at least offset declines in receptive hearing abilities that normally decline with age.
Listening to music is another potential activity that may aid perceptual-cognitive functions. Indeed, enjoyable music can heighten arousal and induce positive affect, thereby influencing subsequent cognitive performance (Thompson et al., 2001; Husain et al., 2002). Autobiographically salient music (i.e., music evoking salient personal memories), in particular, can evoke strong positive affect and heighten arousal (Belfi et al., 2016). In older adults, music listening can boost various cognitive domains related to memory including autobiographical memory recall (El Haj et al., 2015), working memory (WM) (Chow et al., 2021), and semantic memory (Bottiroli et al., 2014). Relative to our interest in auditory processing, both long- and short-term music engagement in older adults has also been associated with neural and behavioral improvements in sensory and speech processing (e.g., Parbery-Clark et al., 2012; Alain et al., 2014, 2019; Bidelman and Alain, 2015). In line with the arousal-and-mood hypothesis (Thompson et al., 2001; Husain et al., 2002), we have proposed that listening to autobiographically salient music may have the potential to improve short-term performance in certain cognitive domains, including speech processing, by improving positive affect and heightening arousal state (Chow et al., 2021).
To this end, we recently investigated the synergistic effects of brief (20 min) tDCS and autobiographically salient music listening on cognitive skills in healthy older adults (Chow et al., 2021). EEGs recorded during word recognition recall tracked changes in brain function following tDCS neurostimulation over dorsolateral prefrontal cortex (DLPFC). DLPFC was chosen as the stimulation site to investigate improvements in performance for WM and episodic memory tasks as suggested by prior tDCS studies (Berryhill and Jones, 2012; Manenti et al., 2013). We found changes in several neural signatures underlying recognition memory following tDCS+Music compared to sham stimulation. Additionally, we found improvements in backward memory span and emotional affect under the paired tDCS+Music stimulation. Our findings suggested that listening to autobiographically salient music amplifies the effects of tDCS on cognitive skills and corresponding brain function. Yet, whether the synergistic effects of combined tDCS and music similarly enhance older adults’ auditory speech processing was not formally evaluated.
Here, we report on new EEG data from the sample of older adults reported in Chow et al. (2021). We aimed to determine whether anodal tDCS over DLPFC alters the brain’s early auditory sensory processing of speech. DLPFC is considered a higher-order (non-auditory) brain region and was targeted in previous study to impact memory-related functions. However, several lines of evidence suggest DLPFC might also exert influences on auditory-sensory processing. For example, direct connections between prefrontal and primary auditory cortex have been identified in neuroanatomical tracing studies (Hackett et al., 1999; Plakke and Romanski, 2014). Lesion and non-invasive imaging studies in humans and other mammals also demonstrate that frontal areas modulate auditory activity in superior temporal gyrus (Knight et al., 1989; Fritz et al., 2010; Price et al., 2019; Bidelman and Myers, 2020) including the inputs to primary auditory cortex (Knight et al., 1989). Based on these findings, we reasoned tDCS stimulation over DLPFC might causally impact sound processing in early auditory cortex (and perhaps even earlier in auditory brainstem) as measured by auditory ERPs. In addition, given the strong links between WM skills and speech processing (particularly in noise; Füllgrabe and Rosen, 2016a; Vermeire et al., 2019; Yeend et al., 2019), we reasoned that if tDCS indeed modulates WM performance (cf., Chow et al., 2021), this might relate to older adults’ performance on clean and noise-degraded speech processing tasks.
To this end, we measured frequency-following responses (FFRs) and auditory ERPs to trace tDCS-related changes at both brainstem (FFRs) and cortical (ERPs) levels of the auditory system. We also focused on auditory behavioral measures including hearing thresholds, speech-in-noise processing (QuickSIN) (Killion et al., 2004), and WM (digit span) performance, as these factors are important determinants of day-to-day speech recognition abilities in older adults (e.g., Killion et al., 2004; Moore et al., 2014; Füllgrabe and Rosen, 2016a,b; Bidelman et al., 2019; Vermeire et al., 2019; Yeend et al., 2019). We recorded speech-evoked ERPs after anodal tDCS was concurrently administered to participants listening to autobiographically salient music. These measures were compared to those recorded either after sham stimulation with music listening, or after tDCS administered in silence (Chow et al., 2021). We hypothesized that anodal tDCS paired with music listening would amplify speech-evoked neural responses, providing causal evidence for short-term neuroplasticity in speech processing among older adults.
Materials and Methods
Participants
The sample is identical to that reported in our companion paper on the effects of neurostimulation on cognitive processing in older adults (Chow et al., 2021). Results of tDCS and music listening on recognition memory, WM, self-reported ratings of emotional affect [measured by the Positive Affect and Negative Affect Schedule (PANAS)] (Watson et al., 1988), and cognitive ERPs related to word recognition were reported previously (Chow et al., 2021). Here, we investigated the impact of tDCS and music stimulation on speech-evoked ERPs and auditory-related behavioral measures (i.e., audiometric thresholds, speech-in-noise perception; auditory verbal WM).
Briefly, participants were older adults (N=14; 72.6 ± 5.03 years; 11 females) who reported normal or corrected-to-normal vision and hearing, with no history of neuropsychiatric illness or disorder. This sample size was similar to those of past tDCS and music listening studies (Picazio et al., 2015; Mansouri et al., 2017). Additionally, we used a within-subject design so that all subjects participated in each type of session, and optimizing the possibility of detecting tDCS effects. All participants were right-handed, had a collegiate level of education (16.8 ± 4.4 years), and had no prior experience with tDCS or any of the experimental tasks. Audiometric testing indicating normal hearing bilaterally [average pure-tone audiometric (PTA) thresholds (500, 1,000, and 2,000 Hz): 17.0 ± 5.8 dB HL]. Details of other exclusion criteria are provided in Chow et al. (2021). Individuals were recruited from the participant database at the Rotman Research Institute as well as local advertisements. Written informed consent was obtained in compliance with a protocol approved by the Research Ethics Board of the Rotman Research Institute at Baycrest Centre. All participants were informed about the procedures and possible risks of tDCS prior to starting and received monetary compensation for their time.
Pre-experiment Musical Listening Interview
Prior to study participation, participants were asked about musical listening preferences and past musical experiences to ensure autobiographical saliency of the music to be played during the study (Chow et al., 2021). During the interview, participants were probed for their preferred genres of music and asked to share their favorite songs and artists from their past and present. Participants were also probed for music listening preferences throughout each decade of their lifespan starting from childhood and teenage years, and for specific songs (both song names and their respective artists) that held considerable idiosyncratic meaning or evoked personal memories of the past. This information was then used to compile a personalized playlist of songs for each participant lasting ∼21 min (i.e., the duration of the tDCS stimulation phase). Specific songs that were probed through the interview were prioritized for inclusion in each playlist, and the remaining time allotted to the playlist included songs that represented their favorite musical artists and genres from their past. Each playlist thus contained both participant-selected music and music selected by experimenters based on participants’ individual preferences. Playlists were curated from Spotify. Where feasible, study recordings were selected over live versions of each song. Songs varied in time period and musical genre, spanning classical, rock, jazz, folk, pop, country, and film scores. Playlists included both vocal and instrumental-only songs, although most were vocal in nature.
Experimental Design and Procedure
We used a counterbalanced, crossover (repeated-measures) design in which participants underwent three testing sessions each featuring a different neurostimulation condition. The entire protocol, including measures reported previously in Chow et al. (2021) is summarized in Figure 1. Sessions were conducted ∼1 week apart scheduled at the same time of day to prevent carry-over effects (Thair et al., 2017) and circadian influences on performance (Krause and Cohen Kadosh, 2014; Li et al., 2015). In the tDCS-only condition, participants received anodal tDCS in silence. In the tDCS+Music condition, participants received anodal tDCS while simultaneously listening to autobiographically salient music. In the Sham+Music condition, participants received sham tDCS stimulation while simultaneously listening to autobiographically salient music. Critically, participants were blind to session type (although participants were no doubt aware of the presence/absence of music). Sessions were counterbalanced across participants.
Figure 1. Summary of experimental protocol and measures. Participants first completed audiometric and speech-in-noise testing. They then participated in pre- and post-test behavioral and tDCS sessions, followed by EEG recording. Note that all participants cycled through each of the three tDCS session types separated by ∼1 week (within-subject design). PANAS and word recognition memory ERP data are reported in Chow et al. (2021). Here, we report on auditory data (pure-tone audiogram, QuickSIN, speech-ERPs/FFRs) and their relation to changes in working memory scores pre-to-post stimulation.
During the first session, participants completed demographic questionnaires, audiometric testing, and the QuickSIN test (Killion et al., 2004) to ensure normal hearing thresholds and assess complex (i.e., speech-in-noise) listening abilities. Participants then completed measures of self-reported emotional affect (PANAS; Watson et al., 1988) and WM. The latter was assessed pre- and post-tDCS via the forward and backward digit spans tests from the Wechsler Memory Scale (WMS-III) (Wechsler, 1997). Both the PANAS and digit span tests were assessed pre- and post-tDCS.
The stimulation phase followed, involving one of tDCS-only, tDCS+Music, or Sham+Music conditions. In the conditions involving music listening, each song from the personalized playlist was played in its entirety. The order of songs was randomized for each participant’s playlist between the tDCS+Music and Sham+Music sessions. Autobiographically salient music was played through soundbooth speakers at an individual comfortable listening level that was kept constant for each participant’s two music listening conditions. Immediately following tDCS, participants completed another measure of emotional affect and WM.
After behavioral assays and a word recall EEG task (25 min) [reported in Chow et al. (2021)], participants completed EEG testing in which we recorded auditory ERPs and FFRs to a train of rapid speech phonemes. Individual tokens were a 100-ms English consonant-vowel /ba/ from the UCLA Nonsense Syllable Test (Dubno and Schaefer, 1992). The mean fundamental frequency (F0) was 150 Hz. Other acoustic characteristics are reported elsewhere (Bidelman et al., 2019). A total of 2,400 trials (alternating polarity; SOA = 250 ms) were collected. This same speech token was used to record both ERPs and FFRs simultaneously (e.g., Bidelman et al., 2013). Stimuli were presented binaurally through ER-3A insert earphones (Etymotic Research Inc., Elk Grove, IL, United States) at 76 dB SPL (A-weighted). Participants completed the remaining stimulation conditions on separate visits with the same procedure but without the initial hearing assessments from the first study visit. All testing sessions were conducted while the participant was seated comfortably in a recliner inside an electromagnetically shielded double-walled sound-attenuated booth.
Neurostimulation
Transcranial direct current stimulation was performed in accordance with the relevant guidelines and regulations, including safety protocols based on established guidelines on tDCS administration (Rossi et al., 2009). A constant direct current (2 mA, 20 min) was administered by a battery-driven constant current stimulator from TCT Research (TCT Research Limited, Kowloon, Hong Kong) through 35 cm2 saline-soaked synthetic sponge electrodes (Figure 2A). The anode was placed over the left DLPFC, located over the F3 electrode location according to the International 10–20 system of EEG electrode placement. Evidence for using a F3 anode for targeting the left DLPFC is supported by prior studies (Seibt et al., 2015). The cathode was placed over the contralateral supraorbital region. We further confirmed the foci of stimulation via model simulations in the COMETS2 toolbox (Lee et al., 2017), which showed substantial current flow across DLPFC under this montage (Figure 2B).
Figure 2. tDCS stimulation. (A) Montage of the stimulating electrodes. Green points show standard 10–20 electrode locations Lower traces show the time course (not to scale) of current stimulation for actual (tDCS) vs. sham sessions. (B) COMETS2 simulation (Lee et al., 2017) of the tDCS-induced current activation that targeted dorsolateral prefrontal cortex.
Transcranial Direct Current Stimulation
Actual neurostimulation lasted 21 min. At the start of the stimulation phase, the current gradually increased in a ramp-like fashion from 0 to 2 mA over 60 s, remained constant for 19 min and 50 s, then was ramped down to 0 mA over 10 s (Figure 2A).
Sham
Current also gradually increased to 2 mA over a time window of 60 s, then immediately ramped down to 0 mA over 10 s period where it was maintained at that level of current for 20 min until the end of the stimulation phase. To ensure that no participant experienced adverse side effects, a post-stimulation questionnaire was administered that solicited listeners’ perception of several side effects (Likert scale: “0” being absent, “4” being severe).
Electroencephalography Recording and Preprocessing
Neuroelectric brain activity (EEG) was recorded from eight electrodes (Fpz, Fz, Cz, Pz, P3/4, and TP9/10) at standard 10–20 locations on the scalp (Oostenveld and Praamstra, 2001) using BioSemi ActiveTwo amplifiers (BioSemi V.O.F., Amsterdam, Netherlands). This system used a pair of common mode sense and driven right leg reference electrodes. EEGs were digitized online at 16,384 Hz. EEG pre-processing was done using the Brainstorm MATLAB toolbox (Tadel et al., 2011). Traces were then downsampled to 5,000 Hz and re-referenced to linked mastoids (i.e., average of TP9/10 electrodes) for subsequent ERP analysis. Ocular artifacts (saccades and blinks) were corrected in the continuous EEG using a principal component analysis (Picton et al., 2000). Cleaned EEGs were then bandpass filtered (1–40 Hz), epoched (−50–200 ms), baseline corrected to the pre-stimulus period, and trial-wise averaged to obtained speech-evoked ERPs for each tDCS session per listener. ERPs and scalp topographies were visualized using the MNE toolbox (v0.23) (Gramfort et al., 2013) and BESA® Research 7.1 (BESA, GmbH).
Cluster-Based Permutation Analysis (Topographic ANOVA)
We used cluster-based permutation statistics (Maris and Oostenveld, 2007) to test for session-related changes in the cortical ERPs. A two-stage analysis was conducted in BESA Statistics 2.1 (BESA, GmbH). First, we computed an omnibus repeated measures (rm) ANOVA (F-test) contrasting ERP amplitudes between sessions at every time sample in the epoch window. This preliminary step identified contiguous clusters of data points both in time (adjacent samples) and electrodes where responses differed between tDCS sessions (i.e., fell below the cluster-building alpha, p < 0.05). For each spatiotemporal cluster, the sum of t-values of sampling points within each cluster then form the cluster-level statistics (or so called “cluster values”). In the subsequent analysis phase, permutation testing was then conducted using a Monte-Carlo resampling technique, which involved comparing the observed cluster value with random cluster values drawn from a permutation distribution. This permutation distribution was created by randomly assigning levels of the factor of interest (here, session) and iteratively conducting the same test (N = 1,000 resamples), retrieving the maximum cluster value for each permutation. The largest absolute cluster value is subsequently compared to the permutation distribution of maximal cluster values. If the maximum cluster value from the observed data is larger than 95% of the maximum cluster values in the permutation distribution, then the null hypothesis (i.e., that the sessions are sampled from the same distribution) is rejected. This identified significant post hoc differences by permuting between session conditions (e.g., Oostenveld et al., 2011). These contrasts were corrected with Scheffe’s test, using Holm–Bonferroni adjustments. This identified contiguous time samples for which the session conditions were not interchangeable. Importantly, this clustering process corrects for multiple comparisons across the aggregate of all time points and electrodes by controlling the familywise error rate. For an in-depth overview of permutation statistics as implemented in BESA Statistics, see Maris and Oostenveld (2007).
Brainstem Frequency-Following Responses
Our high sample rate allowed us to recover the FFR, a neurophonic response that reflects phase-locked activity to speech with dominant sources in the brainstem when recorded via EEG (Bidelman, 2018; Ross et al., 2020). To this end we measured the root mean squared (RMS) amplitude of brainstem FFRs, computed during the steady-state portion (20–120 ms) of the response waveform. FFRs were analyzed at the Fz electrode, referenced to average mastoids (i.e., Fz – TP9/TP10). This montage is optimal for recording FFRs of midbrain origin (Bidelman, 2015). FFR RMS amplitudes were analyzed via a one-way mixed-model ANOVA (fixed factor = session type; subjects = random effect) in R (lme4 package; Bates et al., 2015).
Results
Brainstem Frequency-Following Responses
Frequency-following responses are shown in Figure 3. Recent studies have shown FFRs are modulated when tDCS stimulation is applied over the right temporal lobe (Mai and Howell, 2021). A one-way ANOVA including the factor tDCS session (tDCS+Music; tDCS alone; Sham+Music) and the dependent variable FFR RMS amplitude indicated FFRs were invariant across tDCS sessions [F(2,26) = 0.49, p = 0.61] (Figure 3C). Given the poor morphology and null session effects, we did not pursue FFRs further and focus only on analysis of the cortical ERPs hereafter.
Figure 3. Speech-evoked FFRs (brainstem responses) across tDCS sessions. Grand averaged FFRs (mean of 2400 trials per participant) are shown at electrode Fz, referenced to linked mastoids (i.e., Fz – TP9/10 montage). (A) FFR time waveforms and (B) response spectra. FFR F0, corresponding to the voice pitch of the speech stimulus (/ba/) is demarcated in the FFRs.(C) FFR RMS showed a trend for decreasing amplitudes with tDCS but the session effect was not significant (p = 0.61). Error bars = 95% CI.
Cortical Auditory Event-Related Potentials
Speech-evoked cortical responses are shown as topographic maps and global field power in Figure 4. Prominent activity was observed within 200 ms of stimulus onset corresponding to the canonical P1-N1-P2 deflections of the auditory ERPs.
Figure 4. Speech-evoked ERPs (cortical responses) across tDCS sessions. Hot colors, increased positive voltage. (Top) Maps are shown at a latency of 75 ms, the approximate latency of the P1 wave. (Bottom) Butterfly overlay of the auditory ERPs (only the Sham+music condition is shown for clarity). Channel colors in the bottom panel correspond with the electrode names marked in the top middle heatmap. Gray area = global field power (GFP).
The topographic omnibus ANOVA revealed session-related modulations in the timeframe of the N1 wave (100–150 ms; Figure 5A). Differences were most prominent at the vertex where the auditory ERPs are maximal on the scalp surface. However, cluster-based permutation testing (corrected for multiple comparisons across time samples and electrodes) revealed this session effect was attributable to modulations within two time windows. The first cluster from 126 to 141 ms showed weaker (i.e., less negative) N1 responses over the P3 electrode in the tDCS-only vs. Sham+music condition (p = 0.018, Figure 5B). The second cluster spanning 100–118 ms showed larger (i.e., more positive) responses at Cz for the tDCS+music compared to the tDCS-only condition (p = 0.001; Figure 5C). Collectively, these findings reveal tDCS and/or music (or a combination of both) modulate in the early cortical processing of speech. Whereas tDCS alone generally weakened neural responses, concurrent music restored responses to nominal activation levels similar to the Sham+Music condition (without tDCS).
Figure 5. tDCS and tDCS+music differentially alters auditory cortical responses to speech. (A) Results of a cluster-based permutation testing (omnibus ANOVA) comparing ERPs across sessions. Speech ERPs are modulated in the 100–150 ms time window depending on session type. (B) Post hoc effect contrasting tDCS-only vs. Sham+music. Music listening (in the absence of tDCS) yielded stronger speech responses in 150 ms timeframe compared to tDCS stimulation alone. (C) Post hoc effect contrasting tDCS+music vs. tDCS alone. Whereas tDCS suppressed the auditory ERPs, concurrent music listening seemingly counteracted the suppressive effect of tDCS on neural responses. **p < 0.01, *p < 0.05, error bars = ±1 SEM.
Brain-Behavior Relations
We assessed the behavioral relevance of the ERP changes due to tDCS as well as relations between behavioral and demographic variables via bivariate linear regressions (“fitlm” in MATLAB). Shapiro–Wilk tests confirmed normality of the variables (all Ps > 0.064). We focused on auditory measures including hearing thresholds, speech-in-noise processing (QuickSIN), and working memory (digit span) performance, as these factors are important determinants of everyday speech recognition abilities in older adults (e.g., Killion et al., 2004; Moore et al., 2014; Füllgrabe and Rosen, 2016a,b; Bidelman et al., 2019; Vermeire et al., 2019; Yeend et al., 2019). We computed the music-induced modulation of the ERP during tDCS above and beyond tDCS neurostimulation alone as the difference in ERP response amplitudes identified via the permutation statistics (i.e., tDCS+music – tDCS-only; see bars in Figure 5C) and regressed this ERP measure against each of the behavioral and demographic measures. Analysis showed that the tDCS+music changes in ERP amplitude was not associated with QuickSIN, PTA thresholds, nor WM performance (all ps > 0.05; data not shown). Similarly, a linear model incorporating the combination of variables shown in Figure 6 did not reveal any interaction or multivariate factors as significant predictors.
Figure 6. tDCS induced changes in neuro-behavioral responses depend on education and relate to speech in noise listening skills. (A) The music-induced changes during tDCS above and beyond neurostimulation alone (i.e., tDCS+music – tDCS-only) is negatively related to education; listeners with more educational attainment experience less music-induced change in the ERPs during tDCS. (B) Improvements in WM (backward digit span) pre-to-post tDCS stimulation is related to better SIN listening skills. Pre/post test scores are collapsed across sessions. Dotted lines = 95% CI.
Because educational level moderates some of the benefits from tDCS (Berryhill and Jones, 2012; Krebs et al., 2020), we further evaluated correlations between tDCS neuromodulation and listeners’ education. We found a negative association between education attainment and the music-related modulation in ERPs during tDCS; listeners with more formal education showed lower neuroplastic change in the N1 than those with lesser education (R2 = 0.31, p = 0.039; Figure 6A). Additionally, performance on the QuickSIN averaged 2.09 ± 1.31 dB SNR loss, consistent with scores in this age bracket (Zendel and Alain, 2012; Hutka et al., 2013; Bidelman et al., 2019). While QuickSIN was not correlated with neural measures per se (all ERPs were evoked by clean speech), we did find that the change in WM1 performance pre- to post-tDCS (averaged across sessions) was associated with QuickSIN scores. Namely, larger behavioral gains in backward digit span pre-to-post tDCS were associated with better speech-in-noise recognition (i.e., lower QuickSIN scores) (R2 = 0.34, p = 0.028; Figure 6B).
Discussion
We assessed whether combining anodal tDCS over DLPFC paired with listening to autobiographically salient music alters neural speech processing in older adults compared to either music listening under sham stimulation or tDCS alone. Our findings reveal tDCS and/or music induced modulations in the early neural encoding of speech between 100 and 150 ms after stimulus onset. Whereas tDCS effects appeared largely suppressive (i.e., reducing ERP amplitude as compared to Music+sham), concurrent music with tDCS restored ERP amplitude to baseline levels. However, we note this neural effect could be attributable to a true effect of tDCS or presence/absence music. Changes in WM were also observed from pre- to post intervention session, and were related to better speech-in-noise listening skills. Our findings provide new causal evidence that tDCS stimulation modulates the early neural encoding of speech as indexed by the auditory ERPs. Moreover, we show that tDCS-related improvements in WM [previously reported in Chow et al. (2021)] are also related to noise-degraded speech perception in older adults.
In the same sample of listeners, we have previously shown that tDCS while listening to autobiographically salient music amplifies gains in WM relative to either tDCS or music listening alone (Chow et al., 2021). Several studies have reported tDCS-related improvements in WM for younger adults (Andrews et al., 2011; Jeon and Han, 2012) but similar effects in older adults have been equivocal. Moreover, to our knowledge, only one other study (Mansouri et al., 2017) has examined effects of anodal tDCS on executive functioning by measuring response inhibition in younger adults pre- and post-tDCS in the presence of background music. That study found that high-tempo background music interacts with tDCS on response inhibition. Our dataset extends this work to the domain of WM and healthy older adults. We also extend our previous findings on the effects of tDCS on older adults’ memory recognition (Chow et al., 2021) by demonstrating tDCS over DLPFC has a modulatory effect on the neural encoding of speech sounds.
Literature on the effects of tDCS on the auditory ERPs have been equivocal (for review, see Wang et al., 2020). Heimrath et al. (2016) found anodal tDCS over temporal lobe increased the P1 (∼50 ms) response to consonant-vowel speech tokens similar to those used here. However, other studies have failed to observe significant ERP changes with similar tDCS stimulation protocols (Kunzelmann et al., 2018; López-Caballero et al., 2020). Our study differs from prior work in that our stimulation targets were located over frontal (rather than auditory temporal) cortical sites. We found that tDCS suppressed the auditory ERPs in time range of the N1, but that concurrent music reinvigorated neural responses and seemingly counteracted the suppressive tDCS effect. By stimulating DLPFC, we therefore find small but measurable tDCS-induced changes in the brain’s early auditory-cortical encoding of speech. Presumably, these DLPFC changes in auditory responses result from the top-down functional projections from prefrontal areas to auditory cortex (Hackett et al., 1999; Plakke and Romanski, 2014). Moreover, we note that EEG was recorded ∼25 min after the neurostimulation sessions. Thus, the observed effects might actually underestimate the true neuroplastic effects had FFRs/ERPs been conducted immediately after tDCS intervention. Still, even after 25 min, tDCS effects on the brain appear to persist long enough to observe modulations in the cortical ERPs (but not brainstem FFRs) post-stimulation.
What might be the mechanistic account of these data? The putative generator(s) of the N1 are thought to lie near the auditory cortices within the Sylvian fissure (Näätänen and Picton, 1987; Scherg et al., 1989; Picton et al., 1999). Functionally, N1 is associated with the formation of perceptual object representations and auditory feature coding (Näätänen and Picton, 1987; Alain and Arnott, 2000). The auditory N1 also receives top-down influences from prefrontal brain regions (Chao and Knight, 1997; Knight et al., 1999) and is often exaggerated with aging, which has been taken as evidence for reduced inhibition from distal (frontal) areas (Chao and Knight, 1997; Alain et al., 1999, 2022; Caspary et al., 2008; Bidelman et al., 2014). More robust neural connectivity between frontal and temporal cortices is also related to better speech-in-noise processing in older adults (Price et al., 2019). Presumably, the largely suppressive tDCS-related changes in N1 observed here may result from similar efferent mechanisms, by which increased involvement of the frontal cortices due to anodal (excitatory) stimulation of DLPFC (Thair et al., 2017) modulates the early sensory encoding of speech sounds. Alternatively, we cannot rule out the possibility that the effects of tDCS on the auditory ERPs might result from other mechanisms. For instance, the N1 is modulated by attention and arousal state (Hillyard et al., 1973; Coull, 1998); thus, it is conceivable that the observed tDCS changes in auditory responses may have resulted from subtle (covert) changes in attention to the speech stimuli. However, we note our ERP paradigm featured strictly passive listening, so this explanation appears insufficient.
Transcranial direct current stimulation did not have an appreciable effect on speech-evoked FFRs. We offer several explanations for these null findings. First, FFRs are dominantly of brainstem origin when recorded via EEG (Bidelman, 2018; Ross et al., 2020). As such, the response may be too deep (peripheral) in the brain to show sensitivity to distal scalp stimulation. Relatedly, our tDCS paradigm targeted frontal lobe sites. Thus, the orientation and location of intracranial current densities induced by tDCS were not optimized for neuromodulation of lower auditory brain areas (i.e., brainstem). However, we note that even with more optimal foci for auditory neuromodulation (e.g., superior temporal lobe), the effects of magnetic (TMS) and current (tDCS) stimulation on FFRs remains equivocal (López-Caballero et al., 2020; Mai and Howell, 2021). For example, continuous theta burst TMS over right auditory cortex fails to yield measurable changes in FFRs suggesting deeper, more peripheral sources (i.e., brainstem) which are insensitive to cortical stimulation (López-Caballero et al., 2020). In contrast, Mai and Howell (2021) showed that tDCS neurostimulation over right auditory cortex can cause subtle decreases in speech-FFR amplitudes (i.e., F0 coding), though apparently without concomitant changes in pitch discrimination behavior (Mai and Howell, 2021). Thus, it is possible that more sessions or more targeted, high-definition (i.e., focal) tDCS stimulation sites proximal to auditory cortex may have been more successful in altering FFRs, either directly by altering phase-locked FFR activity stemming from auditory cortex and/or top-down modulation of brainstem FFR components via corticofugal efferent pathways (e.g., Price and Bidelman, 2021). Lastly, our stimulus design may have weakened possible tDCS-FFR effects. Indeed, there is some suggestion that binaural stimulus presentation (as used here) might smear tDCS-induced changes in FFR, which are more prominent for monaural (contralateral) stimulus delivery (Mai and Howell, 2021). Our consonant-vowel speech tokens evoke weaker FFRs given their short duration of their vowel periodicity (<40 ms) (Bidelman et al., 2019). Future studies with more optimal FFR-evoking stimuli (e.g., sustained vowels) and stimulation sites (e.g., temporal lobe) are needed to fully evaluate the effects of tDCS on speech-FFRs (cf. Mai and Howell, 2021).
Our correlational findings offer preliminary evidence for tDCS-induced modulation of cognitive skills in the form of better WM post-intervention [as also reported in Chow et al. (2021)]. Notable in present study, we show these WM improvements attributable to tDCS may have a potential transfer effect, bolstering degraded speech perception performance. Still, since we only collected baseline QuickSIN scores, a way to directly test this hypothesis in future studies would be to compare pre- and post-tDCS stimulation QuickSIN performance. Difficulty understanding speech-in-noise is highly prevalent among the aging population and older adults exhibit greater listening effort in adverse listening situations (e.g., Helfer and Wilber, 1990; Wong et al., 2010). We found pre- to post-tDCS session changes in WM were correlated with better (i.e., lower threshold) QuickSIN performance. WM capacity is highly predictive of degraded speech-listening skills in younger adults, and especially in older adults (e.g., Füllgrabe and Rosen, 2016a,b; Alain et al., 2018; Vermeire et al., 2019; Yoo and Bidelman, 2019; Bidelman and Yoo, 2020). Thus, our behavioral data suggest tDCS paired with music listening may be a viable intervention to boost cognitive and WM performance, and in turn, receptive communication skills that decline during the lifespan. Still, the generalizability of these findings may be limited by the high level of education and greater number of females in our cohort. Indeed, there is some indication that responsivity to tDCS is larger in females than males (Dedoncker et al., 2016). Although our sample size was similar to those of past tDCS and music listening studies (Picazio et al., 2015; Mansouri et al., 2017), we note our dataset (Chow et al., 2021) is small in size, which is a limitation of our study. Still, our data show that the degree of neuroplasticity induced by tDCS+Music is correlated with listeners’ years of formal education. Consequently, those with less scholastic achievement may have “more to gain” from tDCS interventions. tDCS effects are also variable across cognitive domains and are highly dependent on stimulation parameters (e.g., stimulation current intensity, duration, and site) (Huo et al., 2021). Additionally, future studies could assess whether the type (genre, content) and degree of familiarity of music listeners self-select differentially alters auditory responses (cf. Brown and Bidelman, 2022). Our design also did not include a tDCS sham without music listening, which would have allowed for more robust inference regarding the independent effects of tDCS and music on the auditory ERPs. There is a limitation of our study which warrants future investigation. Longitudinal and larger sample studies are also needed to identify the proper dosage of neurostimulation that might produce optimal neuro-behavioral plasticity at the individual level.
We should also emphasize that the present data did not show a link between neural and behavioral outcomes. That is, we did not observe correlations between tDCS+music benefits and auditory measures including QuickSIN, hearing thresholds, and WM performance. Thus, like our previous report examining tDCS and recognition memory (Chow et al., 2021), the behavioral relevance of our findings remains open. It is possible the effects of tDCS+music pairing are indirect and somewhat epiphenomenal. Indeed, TMS stimulation of left DLPFC increases musical reward sensitivity through fronto-striatal pathways (Mas-Herrero et al., 2018). tDCS modulation of these pathways may heighten arousal from music listening and thereby also heighten the effects of cognitive performance and auditory processing as observed here. Pairing tDCS with personally meaningful music may also make the experience of tDCS more tolerable and enjoyable, allowing more focused task performance after stimulation (see Chow et al., 2021).
Regardless of underlying mechanism(s), our study affirms that listening to music, particularly to songs that evoke autobiographical memories, may heighten responsivity to the effects of tDCS for healthy older adults. The larger effect of music+tDCS than music or tDCS alone supports general findings in the literature of an advantage of combining tDCS with other cognitive tasks to maximize neuroplastic effects (e.g., Assecondi and Shapiro, 2018). Moreover, we demonstrate tDCS over DLPFC causes small but measurable changes in neuroplasticity related to WM and the neural encoding of speech sounds. These neurobehavioral changes may have indirect benefits to speech-in-noise processing in older adults. Our findings help inform future work aimed at tailoring personalized neurostimulation interventions in older adults to bolster cognitive skills and complex auditory processing.
Data Availability Statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics Statement
The studies involving human participants were reviewed and approved by the Research Ethics Board of the Rotman Research Institute at Baycrest Centre. The participants provided their written informed consent to participate in this study.
Author Contributions
AN-G, JR, and CA contributed to the conception and design. AN-G and RC conducted the data acquisition. RC, KB, and GB conducted the data processing and analysis. All authors contributed to interpreting the results and writing the manuscript.
Funding
This work was partially supported by the Lorraine Johnston Foundation, National Institute on Deafness and Other Communication Disorders, and National Institute on Aging of the National Institutes of Health (R01DC016267).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
We thank Aline Moussard and Victoria Nieborowska for assistance in participant recruitment and data collection, and Alexandra Beltran-Montoya, Vanessa Chan, Manda Fischer, Gabrielle Katz, Shahier Paracha, Karishma Ramdeo, and Parnian Tajbakhsh for assistance with data preparation and preprocessing.
Footnotes
- ^ As reported in Chow et al. (2021), an ANOVA conducted on digit span backward WM scores showed a tDCS session × time interaction [F(2,26) = 4.12, p = 0.028]. Pairwise comparisons revealed a significant increase in WM scores pre-to-posttest in the tDCS+Music condition (p = 0.012) but no change in performance for the tDCS-only (p = 0.671) or Sham+Music (p = 0.418) conditions.
References
Alain, C., Achim, A., and Woods, D. L. (1999). Separate memory-related processing for auditory frequency and patterns. Psychophysiology 36, 737–744.
Alain, C., and Arnott, S. R. (2000). Selectively attending to auditory objects. Front. Biosci. 5:D202–212. doi: 10.2741/alain
Alain, C., Chow, R., Lu, J., Rabi, R., Sharma, V. V., Shen, D., et al. (2022). Aging enhances neural activity in auditory, visual, and somatosensory cortices: the common cause revisited. J. Neurosci. 42, 264–275. doi: 10.1523/JNEUROSCI.0864-21.2021
Alain, C., Du, Y., Bernstein, L. J., Barten, T., and Banai, K. (2018). Listening under difficult conditions: an activation likelihood estimation meta-analysis. Hum. Brain Mapp. 39, 2695–2709. doi: 10.1002/hbm.24031
Alain, C., Moussard, A., Singer, J., Lee, Y., Bidelman, G. M., and Moreno, S. (2019). Music and visual art training modulate brain activity in older adults. Front. Neurosci. 13:182. doi: 10.3389/fnins.2019.00182
Alain, C., Zendel, B. R., Hutka, S., and Bidelman, G. M. (2014). Turning down the noise: the benefit of musical training on the aging auditory brain. Hear. Res. 308, 162–173. doi: 10.1016/j.heares.2013.06.008
Alexander, T., Avirame, K., and Lavidor, M. (2012). Improving emotional prosody detection in the attending ear by cathodal tDCS suppression of the competing channel. Neurosci. Lett. 508, 52–55. doi: 10.1016/j.neulet.2011.12.017
Andrews, S. C., Hoy, K. E., Enticott, P. G., Daskalakis, Z. J., and Fitzgerald, P. B. (2011). Improving working memory: the effect of combining cognitive activity and anodal transcranial direct current stimulation to the left dorsolateral prefrontal cortex. Brain Stimul. 4, 84–89. doi: 10.1016/j.brs.2010.06.004
Antal, A., Kincses, T. Z., Nitsche, M. A., Bartfai, O., and Paulus, W. (2004). Excitability changes induced in the human primary visual cortex by transcranial direct current stimulation: direct electrophysiological evidence. Invest. Ophthalmol. Vis. Sci. 45, 702–707.
Assecondi, S., and Shapiro, K. (2018). The benefits of combined brain stimulation and cognitive training: a pilot study. J. Vis. 18, 119–119.
Bates, D., Mächler, M., Bolker, B., and Walker, S. (2015). Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48.
Belfi, A. M., Karlan, B., and Tranel, D. (2016). Music evokes vivid autobiographical memories. Memory 24, 979–989.
Berryhill, M. E., and Jones, K. T. (2012). tDCS selectively improves working memory in older adults with more education. Neurosci. Lett. 521, 148–151.
Berryhill, M. E., and Martin, D. (2018). Cognitive effects of transcranial direct current stimulation in healthy and clinical populations: an overview. J. ECT 34, e25–e35. doi: 10.1097/YCT.0000000000000534
Bidelman, G. M. (2015). Multichannel recordings of the human brainstem frequency-following response: scalp topography, source generators, and distinctions from the transient ABR. Hear. Res. 323, 68–80. doi: 10.1016/j.heares.2015.01.011
Bidelman, G. M. (2018). Subcortical sources dominate the neuroelectric auditory frequency-following response to speech. Neuroimage 175, 56–69. doi: 10.1016/j.neuroimage.2018.03.060
Bidelman, G. M., and Alain, C. (2015). Musical training orchestrates coordinated neuroplasticity in auditory brainstem and cortex to counteract age-related declines in categorical vowel perception. J. Neurosci. 35, 1240–1249. doi: 10.1523/JNEUROSCI.3292-14.2015
Bidelman, G. M., Moreno, S., and Alain, C. (2013). Tracing the emergence of categorical speech perception in the human auditory system. Neuroimage 79, 201–212. doi: 10.1016/j.neuroimage.2013.04.093
Bidelman, G. M., and Myers, M. H. (2020). Frontal cortex selectively overrides auditory processing to bias perception for looming sonic motion. Brain Res. 1726:146507. doi: 10.1016/j.brainres.2019.146507
Bidelman, G. M., Price, C. N., Shen, D., Arnott, S. R., and Alain, C. (2019). Afferent-efferent connectivity between auditory brainstem and cortex accounts for poorer speech-in-noise comprehension in older adults. Hear. Res. 382:107795. doi: 10.1016/j.heares.2019.107795
Bidelman, G. M., Villafuerte, J. W., Moreno, S., and Alain, C. (2014). Age-related changes in the subcortical-cortical encoding and categorical perception of speech. Neurobiol. Aging 35, 2526–2540. doi: 10.1016/j.neurobiolaging.2014.05.006
Bidelman, G. M., and Yoo, J. (2020). Musicians show improved speech segregation in competitive, multi-talker cocktail party scenarios. Front. Psychol. 11:1927. doi: 10.3389/fpsyg.2020.01927
Bolognini, N., Rossetti, A., Casati, C., Mancini, F., and Vallar, G. (2011). Neuromodulation of multisensory perception: a tDCS study of the sound-induced flash illusion. Neuropsychologia 49, 231–237. doi: 10.1016/j.neuropsychologia.2010.11.015
Boroda, E., Sponheim, S. R., Fiecas, M., and Lim, K. O. (2020). Transcranial direct current stimulation (tDCS) elicits stimulus-specific enhancement of cortical plasticity. Neuroimage 211:116598. doi: 10.1016/j.neuroimage.2020.116598
Bottiroli, S., Rosi, A., Russo, R., Vecchi, T., and Cavallini, E. (2014). The cognitive effects of listening to background music on older adults: processing speed improves with upbeat music, while memory seems to benefit from both upbeat and downbeat music. Front. Aging Neurosci. 6:284. doi: 10.3389/fnagi.2014.00284
Brown, J. A., and Bidelman, G. M. (2022). Song properties and familiarity affect speech recognition in musical noise. Psychomusicology [preprint]. doi: 10.1037/pmu0000284
Caspary, D. M., Ling, L., Turner, J. G., and Hughes, L. F. (2008). Inhibitory neurotransmission, plasticity and aging in the mammalian central auditory system. J. Exp. Biol. Med. 211, 1781–1791.
Castaño-Castaño, S., Feijoo-Cuaresma, M., Paredes-Pacheco, J., Morales-Navas, M., Ruiz-Guijarro, J. A., Sanchez-Santed, F., et al. (2019). tDCS recovers depth perception in adult amblyopic rats and reorganizes visual cortex activity. Behav. Brain Res. 370:111941. doi: 10.1016/j.bbr.2019.111941
Chao, L. L., and Knight, R. T. (1997). Prefrontal deficits in attention and inhibitory control with aging. Cereb. Cortex 7, 63–69.
Chow, R., Noly-Gandon, A., Moussard, A., Ryan, J. D., and Alain, C. (2021). Effects of transcranial direct current stimulation combined with listening to preferred music on memory in older adults. Sci. Rep. 11:12638. doi: 10.1038/s41598-021-91977-8
Coull, J. T. (1998). Neural correlates of attention and arousal: insights from electrophysiology, functional neuroimaging and psychopharmacology. Prog. Neurobiol. 55, 343–361. doi: 10.1016/s0301-0082(98)00011-2
Dedoncker, J., Brunoni, A. R., Baeken, C., and Vanderhasselt, M. A. (2016). A systematic review and meta-analysis of the effects of transcranial direct current stimulation (tDCS) over the dorsolateral prefrontal cortex in healthy and neuropsychiatric samples: influence of stimulation parameters. Brain Stimul. 9, 501–517. doi: 10.1016/j.brs.2016.04.006
Deike, S., Deliano, M., and Brechmann, A. (2016). Probing neural mechanisms underlying auditory stream segregation in humans by transcranial direct current stimulation (tDCS). Neuropsychologia 91, 262–267. doi: 10.1016/j.neuropsychologia.2016.08.017
Dubno, J. R., and Schaefer, A. B. (1992). Comparison of frequency selectivity and consonant recognition among hearing-impaired and masked normal-hearing listeners. J. Acoust. Soc. Am. 91, 2110–2121. doi: 10.1121/1.403697
El Haj, M., Antoine, P., Nandrino, J. L., Gély-Nargeot, M. C., and Raffard, S. (2015). Self-defining memories during exposure to music in Alzheimer’s disease. Int. Psychogeriatr. 27, 1719–1730. doi: 10.1017/S1041610215000812
Fritz, J. B., David, S. V., Radtke-Schuller, S., Yin, P., and Shamma, S. A. (2010). Adaptive, behaviorally gated, persistent encoding of task-relevant auditory information in ferret frontal cortex. Nat. Neurosci. 13, 1011–1019. doi: 10.1038/nn.2598
Füllgrabe, C., and Rosen, S. (2016a). “Investigating the role of working memory in speech-in-noise identification for listeners with normal hearing,” in Physiology, Psychoacoustics and Cognition in Normal and Impaired Hearing, eds P. Van Dijk, D. Başkent, E. Gaudrain, E. De Kleine, A. Wagner, and C. Lanting (Cham: Springer International Publishing), 29–36. doi: 10.1007/978-3-319-25474-6_4
Füllgrabe, C., and Rosen, S. (2016b). On the (un)importance of working memory in speech-in-noise processing for listeners with normal hearing thresholds. Front. Psychol. 7:1268. doi: 10.3389/fpsyg.2016.01268
Gramfort, A., Luessi, M., Larson, E., Engemann, D., Strohmeier, D., Brodbeck, C., et al. (2013). MEG and EEG data analysis with MNE-Python. Front. Neurosci. 7:267. doi: 10.3389/fnins.2013.00267
Hackett, T. A., Stepniewska, I., and Kaas, J. H. (1999). Prefrontal connections of the parabelt auditory cortex in macaque monkeys. Brain Res. 817, 45–58.
Heimrath, K., Fischer, A., Heinze, H. J., and Zaehle, T. (2016). Changed categorical perception of consonant-vowel syllables induced by transcranial direct current stimulation (tDCS). BMC Neurosci. 17:8. doi: 10.1186/s12868-016-0241-3
Helfer, K., and Wilber, L. (1990). Hearing loss, aging, and speech perception in reverberation and in noise. J. Speech Hear. Res. 33, 149–155.
Hillyard, S. A., Hink, R. F., Schwent, V. L., and Picton, T. W. (1973). Electrical signs of selective attention in the human brain. Science 182, 177–180.
Hsu, W.-Y., Zanto, T. P., Anguera, J. A., Lin, Y.-Y., and Gazzaley, A. (2015). Delayed enhancement of multitasking performance: effects of anodal transcranial direct current stimulation on the prefrontal cortex. Cortex 69, 175–185. doi: 10.1016/j.cortex.2015.05.014
Huo, L., Zhu, X., Zheng, Z., Ma, J., Ma, Z., Gui, W., et al. (2021). Effects of transcranial direct current stimulation on episodic memory in older adults: a meta-analysis. J. Gerontol. B Psychol. Sci. Soc. Sci. 76, 692–702.
Husain, G., Thompson, W. F., and Schellenberg, E. G. (2002). Effects of musical tempo and mode on arousal, mood, and spatial abilities. Music Percept. 20, 151–171.
Hutka, S., Alain, C., Binns, M., and Bidelman, G. M. (2013). Age-related differences in the sequential organization of speech sounds. J. Acoust. Soc. Am. 133, 4177–4187. doi: 10.1121/1.4802745
Impey, D., and Knott, V. (2015). Effect of transcranial direct current stimulation (tDCS) on MMN-indexed auditory discrimination: a pilot study. J. Neural Transm. 122, 1175–1185.
Jeon, S. Y., and Han, S. B. (2012). Improvement of the working memory and naming by transcranial direct current stimulation. Ann. Rehabil. Med. 36, 585–595.
Killion, M. C., Niquette, P. A., Gudmundsen, G. I., Revit, L. J., and Banerjee, S. (2004). Development of a quick speech-in-noise test for measuring signal-to-noise ratio loss in normal-hearing and hearing-impaired listeners. J. Acoust. Soc. Am. 116, 2395–2405. doi: 10.1121/1.1784440
Knight, R. T., Scabini, D., and Woods, D. L. (1989). Prefrontal cortex gating of auditory transmission in humans. Brain Res. 504, 338–342.
Knight, R. T., Staines, W. R., Swick, D., and Chao, L. L. (1999). Prefrontal cortex regulates inhibition and excitation in distributed neural networks. Acta Psychol. 101, 159–178.
Krause, B., and Cohen Kadosh, R. (2014). Not all brains are created equal: the relevance of individual differences in responsiveness to transcranial electrical stimulation. Front. Syst. Neurosci. 8:25. doi: 10.3389/fnsys.2014.00025
Krebs, C., Klöppel, S., Heimbach, B., and Peter, J. (2020). Education moderates the effect of tDCS on episodic memory performance in cognitively impaired patients. Brain Stimul. 13, 1396–1398. doi: 10.1016/j.brs.2020.07.008
Kunzelmann, K., Meier, L., Grieder, M., Morishima, Y., and Dierks, T. (2018). No effect of transcranial direct current stimulation of the auditory cortex on auditory-evoked potentials. Front. Neurosci. 12:880. doi: 10.3389/fnins.2018.00880
Ladeira, A., Fregni, F., Campanhã, C., Valasek, C. A., De Ridder, D., Brunoni, A. R., et al. (2011). Polarity-dependent transcranial direct current stimulation effects on central auditory processing. PLoS One 6:e25399. doi: 10.1371/journal.pone.0025399
Lee, C., Jung, Y.-J., Lee, S. J., and Im, C.-H. (2017). COMETS2: an advanced MATLAB toolbox for the numerical analysis of electric fields generated by transcranial direct current stimulation. J. Neurosci. Methods 277, 56–62. doi: 10.1016/j.jneumeth.2016.12.008
Li, L. M., Uehara, K., and Hanakawa, T. (2015). The contribution of interindividual factors to variability of response in transcranial direct current stimulation studies. Front. Cell Neurosci. 9:181. doi: 10.3389/fncel.2015.00181
López-Caballero, F., Martin-Trias, P., Ribas-Prats, T., Gorina-Careta, N., Bartrés-Faz, D., and Escera, C. (2020). Effects of cTBS on the frequency-following response and other auditory evoked potentials. Front. Hum. Neurosci. 14:250. doi: 10.3389/fnhum.2020.00250
Mai, G., and Howell, P. (2021). Causal relationship between the right auditory cortex and speech-evoked envelope-following response: evidence from combined transcranial stimulation and electroencephalography. Cereb. Cortex 2021:bhab298. doi: 10.1093/cercor/bhab298
Manenti, R., Brambilla, M., Petesi, M., Ferrari, C., and Cotelli, M. (2013). Enhancing verbal episodic memory in older and young subjects after non-invasive brain stimulation. Front. Aging Neurosci. 5:49. doi: 10.3389/fnagi.2013.00049
Mansouri, F. A., Acevedo, N., Illipparampil, R., Fehring, D. J., Fitzgerald, P. B., and Jaberzadeh, S. (2017). Interactive effects of music and prefrontal cortex stimulation in modulating response inhibition. Sci. Rep. 7:18096. doi: 10.1038/s41598-017-18119-x
Maris, E., and Oostenveld, R. (2007). Nonparametric statistical testing of EEG- and MEG-data. J. Neurosci. Methods 164, 177–190.
Mas-Herrero, E., Dagher, A., and Zatorre, R. J. (2018). Modulating musical reward sensitivity up and down with transcranial magnetic stimulation. Nat. Hum. Behav. 2, 27–32. doi: 10.1038/s41562-017-0241-z
Mathys, C., Loui, P., Zheng, X., and Schlaug, G. (2010). Non-invasive brain stimulation applied to Heschl’s gyrus modulates pitch discrimination. Front. Psychol. 1:193. doi: 10.3389/fpsyg.2010.00193
Matsushita, R., Puschmann, S., Baillet, S., and Zatorre, R. J. (2021). Inhibitory effect of tDCS on auditory evoked response: simultaneous MEG-tDCS reveals causal role of right auditory cortex in pitch learning. Neuroimage 233:117915. doi: 10.1016/j.neuroimage.2021.117915
Moore, D. R., Edmondson-Jones, M., Dawes, P., Fortnum, H., Mccormack, A., Pierzycki, R. H., et al. (2014). Relation between speech-in-noise threshold, hearing loss and cognition from 40–69 years of age. PLoS One 9:e107720. doi: 10.1371/journal.pone.0107720
Näätänen, R., and Picton, T. (1987). The N1 wave of the human electric and magnetic response to sound: a review and an analysis of the component structure. Psychophysiology 24, 375–425. doi: 10.1111/j.1469-8986.1987.tb00311.x
Neuling, T., Wagner, S., Wolters, C., Zaehle, T., and Herrmann, C. (2012). Finite-element model predicts current density distribution for clinical applications of tDCS and tACS. Front. Psychiatry 3:83. doi: 10.3389/fpsyt.2012.00083
Nilsson, J., Lebedev, A. V., and Lövdén, M. (2015). No significant effect of prefrontal tDCS on working memory performance in older adults. Front. Aging Neurosci. 7:230. doi: 10.3389/fnagi.2015.00230
Oostenveld, R., Fries, P., Maris, E., and Schoffelen, J. M. (2011). Fieldtrip: open source software for advanced analysis of meg, eeg, and invasive electrophysiological data. Comput. Intell. Neurosci. 2011:156869. doi: 10.1155/2011/156869
Oostenveld, R., and Praamstra, P. (2001). The five percent electrode system for high-resolution EEG and ERP measurements. Clin. Neurophysiol. 112, 713–719. doi: 10.1016/s1388-2457(00)00527-7
Parbery-Clark, A., Anderson, S., Hittner, E., and Kraus, N. (2012). Musical experience offsets age-related delays in neural timing. Neurobiol. Aging 33, e1481–e1484. doi: 10.1016/j.neurobiolaging.2011.12.015
Picazio, S., Granata, C., Caltagirone, C., Petrosini, L., and Oliveri, M. (2015). Shaping pseudoneglect with transcranial cerebellar direct current stimulation and music listening. Front. Hum. Neurosci. 9:158. doi: 10.3389/fnhum.2015.00158
Picton, T. W., Alain, C., Woods, D. L., John, M. S., Scherg, M., Valdes-Sosa, P., et al. (1999). Intracerebral sources of human auditory-evoked potentials. Audiol. Neurootol. 4, 64–79.
Picton, T. W., Van Roon, P., Armilio, M. L., Berg, P., Ille, N., and Scherg, M. (2000). The correction of ocular artifacts: a topographic perspective. Clin. Neurophysiol. 111, 53–65. doi: 10.1016/s1388-2457(99)00227-8
Plakke, B., and Romanski, L. M. (2014). Auditory connections and functions of prefrontal cortex. Front. Neurosci. 8:199. doi: 10.3389/fnins.2014.00199
Price, C. N., Alain, C., and Bidelman, G. M. (2019). Auditory-frontal channeling in α and β bands is altered by age-related hearing loss and relates to speech perception in noise. Neuroscience 423, 18–28. doi: 10.1016/j.neuroscience.2019.10.044
Price, C. N., and Bidelman, G. M. (2021). Attention reinforces human corticofugal system to aid speech perception in noise. Neuroimage 235:118014. doi: 10.1016/j.neuroimage.2021.118014
Ross, B., Tremblay, K. L., and Alain, C. (2020). Simultaneous EEG and MEG recordings reveal vocal pitch elicited cortical gamma oscillations in young and older adults. Neuroimage 204:116253. doi: 10.1016/j.neuroimage.2019.116253
Rossi, S., Hallett, M., Rossini, P. M., and Pascual-Leone, A. (2009). Safety, ethical considerations, and application guidelines for the use of transcranial magnetic stimulation in clinical practice and research. Clin. Neurophysiol. 120, 2008–2039.
Scherg, M., Vajsar, J., and Picton, T. W. (1989). A source analysis of the late human auditory evoked potentials. J. Cogn. Neurosci. 1, 336–355.
Seibt, O., Brunoni, A. R., Huang, Y., and Bikson, M. (2015). The Pursuit of DLPFC: non-neuronavigated Methods to Target the Left Dorsolateral Pre-frontal Cortex With Symmetric Bicephalic Transcranial Direct Current Stimulation (tDCS). Brain Stimul. 8, 590–602. doi: 10.1016/j.brs.2015.01.401
Stagg, C. J., Best, J. G., Stephenson, M. C., O’shea, J., Wylezinska, M., Kincses, Z. T., et al. (2009). Polarity-Sensitive Modulation of Cortical Neurotransmitters by Transcranial Stimulation. J. Neurosci. 29, 5202–5206. doi: 10.1523/JNEUROSCI.4432-08.2009
Tadel, F., Baillet, S., Mosher, J. C., Pantazis, D., and Leahy, R. M. (2011). Brainstorm: a user-friendly application for MEG/EEG analysis. Comput. Intell. Neurosci. 2011:879716. doi: 10.1155/2011/879716
Terada, H., Kurayama, T., Nakazawa, K., Matsuzawa, D., and Shimizu, E. (2015). Transcranial direct current stimulation (tDCS) on the dorsolateral prefrontal cortex alters P50 gating. Neurosci. Lett. 602, 139–144. doi: 10.1016/j.neulet.2015.07.003
Thair, H., Holloway, A. L., Newport, R., and Smith, A. D. (2017). Transcranial direct current stimulation (tDCS): a beginner’s guide for design and implementation. Front. Neurosci. 11:641. doi: 10.3389/fnins.2017.00641
Thompson, W. F., Schellenberg, E. G., and Husain, G. (2001). Arousal, mood, and the Mozart effect. Psychol. Sci. 12, 248–251.
Vermeire, K., Knoop, A., De Sloovere, M., Bosch, P., and Van Den Noort, M. (2019). Relationship between working memory and speech-in-noise recognition in young and older adult listeners with age-appropriate hearing. J. Speech Lang. Hear. Res. 62, 3545–3553. doi: 10.1044/2019_JSLHR-H-18-0307
Wang, Y., Shi, L., Dong, G., Zhang, Z., and Chen, R. (2020). Effects of Transcranial Electrical Stimulation on Human Auditory Processing and Behavior-A Review. Brain Sci. 10:531. doi: 10.3390/brainsci10080531
Watson, D., Clark, L. A., and Tellegen, A. (1988). Development and validation of brief measures of positive and negative affect: the PANAS scales. J. Pers. Soc. Psychol. 54, 1063–1070.
Wechsler, D. (1997). ”Wechsler memory scale (WMS-III). San Antonio, TX: The Psychological Corporation.
Wong, P. C., Ettlinger, M., Sheppard, J. P., Gunasekera, G. M., and Dhar, S. (2010). Neuroanatomical characteristics and speech perception in noise in older adults. Ear Hear. 31, 471–479. doi: 10.1097/AUD.0b013e3181d709c2
Yeend, I., Beach, E. F., and Sharma, M. (2019). Working Memory and Extended High-Frequency Hearing in Adults: diagnostic Predictors of Speech-in-Noise Perception. Ear Hear. 40, 458–467. doi: 10.1097/AUD.0000000000000640
Yoo, J., and Bidelman, G. M. (2019). Linguistic, perceptual, and cognitive factors underlying musicians’ benefits in noise-degraded speech perception. Hear. Res. 377, 189–195. doi: 10.1016/j.heares.2019.03.021
Zaehle, T., Beretta, M., Jäncke, L., Herrmann, C. S., and Sandmann, P. (2011). Excitability changes induced in the human auditory cortex by transcranial direct current stimulation: direct electrophysiological evidence. Exp. Brain Res. 215, 135–140.
Keywords: aging, brain stimulation, event-related potential (ERP), frequency-following response (FFR), speech-in-noise (SIN) perception
Citation: Bidelman GM, Chow R, Noly-Gandon A, Ryan JD, Bell KL, Rizzi R and Alain C (2022) Transcranial Direct Current Stimulation Combined With Listening to Preferred Music Alters Cortical Speech Processing in Older Adults. Front. Neurosci. 16:884130. doi: 10.3389/fnins.2022.884130
Received: 25 February 2022; Accepted: 17 June 2022;
Published: 06 July 2022.
Edited by:
Samira Anderson, University of Maryland, United StatesReviewed by:
Adi Lifshitz-Ben-Basat, Ariel University, IsraelBasil Christoph Preisig, University of Zurich, Switzerland
Copyright © 2022 Bidelman, Chow, Noly-Gandon, Ryan, Bell, Rizzi and Alain. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Gavin M. Bidelman, Z2JpZGVsQGluZGlhbmEuZWR1
†These authors have contributed equally to this work