- 1Department of Experimental Psychology, University of Groningen, Groningen, Netherlands
- 2Research School of Behavioural and Cognitive Neurosciences, University of Groningen, Groningen, Netherlands
- 3Department of Psychology, Centre for Applied Behavioural Sciences, School of Social Sciences, Heriot-Watt University, Edinburgh, United Kingdom
- 4Department of Otorhinolaryngology, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
During the normal course of aging, perception of speech-on-speech or “cocktail party” speech and use of working memory (WM) abilities change. Musical training, which is a complex activity that integrates multiple sensory modalities and higher-order cognitive functions, reportedly benefits both WM performance and speech-on-speech perception in older adults. This mini-review explores the relationship between musical training, WM and speech-on-speech perception in older age (> 65 years) through the lens of the Ease of Language Understanding (ELU) model. Linking neural-oscillation literature associating speech-on-speech perception and WM with alpha-theta oscillatory activity, we propose that two stages of speech-on-speech processing in the ELU are underpinned by WM-related alpha-theta oscillatory activity, and that effects of musical training on speech-on-speech perception may be reflected in these frequency bands among older adults.
Introduction: The Ease of Language Understanding Model and Working Memory
Under ideal listening conditions, speech perception is considered automatic (Johnson and Ralston, 1994). With increasing complexity, such as in background noise, it is an active cognitive process (Heald and Nusbaum, 2014; see Mattys et al., 2012), and one’s ability to parse information relies on combined sensory and cognitive capacities (van Knijff et al., 2018). When the background masker is non-speech, such as a steady noise, the speech acoustic cues are limited due to direct obliteration from the masker. In such “energetic masking” (Brungart, 2001), speech perception relies on the construction of the speech meaning from the audible temporal and spectral sections of the target speech, using linguistic and semantic constraints, but otherwise no further interference comes from the masker. In contrast, in speech-on-speech perception — a special case of “informational masking” (Pollack, 1975)— where the masker can be a single-talker, multitalker, or babble (gibberish), further interference can occur due to the overlapping temporal and spectral properties of the target and masking speech, and linguistic and semantic content of the masker speech.
During speech-on-speech perception, spectral and temporal components of the speech streams can be used to segregate target and masker streams to better understand a target speaker. For example, increased spatial separation (Viswanathan et al., 2016) and speaker voice differences (Darwin et al., 2003; Başkent and Gaudrain, 2016) seem to improve performance. In segregating target and masker components, the listener also needs to actively inhibit interference from the masker (Tun et al., 2002). Hence, higher-order cognitive functions like selective attention (Oberfeld and Klöckner-Nowotny, 2016) and working memory (WM; Sörqvist and Rönnberg, 2012) are also engaged when speech is attended in the presence of competing speech.
The Ease of Language Understanding (ELU) model (Rönnberg et al., 2013, 2019, 2021) highlights the importance of sensory-cognitive integration in speech-on-speech perception. An initial sensory module that Rapidly, Automatically, and Multimodally Binds Phonological information (RAMBPHO) allows for easy lexical access and understanding of speech. Presbycusis (age-related hearing loss) or interference from background speech can cause a mismatch in phonological binding of input and the representations in long-term memory. The phonological mismatch can result in increased reliance on WM to process degraded speech through an interplay with semantic long-term memory and episodic long-term memory (ELTM) in which linguistic inferences are formed based on prior knowledge.
Working memory is central to goal-directed behaviors where information must be stored and manipulated (Chai et al., 2018), and contains a general executive attention component (Chung et al., 2018) linked to inhibitory control (Getzmann et al., 2018; Tiego et al., 2018). The model’s conception of WM also assumes a central pool of cognitive resources that may be allocated to either storage or processing of information. Consequently, increased processing demands lead to reduced storage capacity and vice versa. The ELU emphasizes the integral role of WM during speech perception, especially in the presence of interfering sounds, and posits two distinct functional processes that determine successful comprehension: pre-diction and post-diction. The pre-diction aspect of the ELU suggests that WM is automatically and implicitly involved in the top-down modulation of perceptual processing in terms of priming the cognitive system via resource, or attentional allocation and the gating of sensory input. This in turn results in easier RAMBPHO and quicker understanding. Conversely, when a speech-signal is degraded, the post-diction component of WM works to integrate limited perceptual information with accessible representations from semantic long-term memory and ELTM in a more effortful pursuit of understanding (Rönnberg et al., 2021).
Working memory is associated with alpha (8–12 Hz) and theta (4–8 Hz) oscillatory activity (Yurgil et al., 2020) and appears to be underpinned by a fronto-parietal network that incorporates cortical and subcortical regions (Eriksson et al., 2015). In general, alpha-activity underpinned by the prefrontal cortex (PFC) is associated with the gating of sensory information during WM updating (Manza et al., 2014; Misselhorn et al., 2019), and the functional inhibition of activity in brain regions not involved with stimulus processing (Klimesch et al., 2007). Theta-activity is thought to represent interactions between the PFC, posterior regions and hippocampus (Strunk et al., 2017) throughout short-term storage, manipulation and retrieval of information (Hsieh and Ranganath, 2014; Roux and Uhlhaas, 2014). Interestingly, functional differences in WM alpha-theta oscillations reported in the literature seem to coincide with components of the ELU’s explicit processing loop —pre-diction and post-diction— that are triggered by phonological mismatch.
We suggest that alpha-activity may govern pre-diction via increased listening effort, in which the active inhibition of distractor speech gates relevant information, allowing for more successful binding during RAMBPHO. During speech processing, increases in alpha power are linked to heightened listening effort, particularly in adverse conditions where the auditory signal is degraded (Obleser et al., 2012; Dimitrijevic et al., 2019; Paul et al., 2021). This alpha-activity is related to the gating of lexical integration and selective inhibition of task-irrelevant noise (Strauß et al., 2014a,b), which likely becomes more challenging in the presence of background speech.
Meanwhile, when the speech signal is increasingly degraded, enhanced theta-activity represents the manipulation of limited sensory information to successfully trigger and retrieve appropriate lexical and semantic representations in long term memory stores, which are then integrated to form an understanding. Furthermore, fronto-parietal theta-activity has been implicated in lexical access and retrieval (Meyer et al., 2015; Marko et al., 2019) specifically from ELTM (Roberts et al., 2018; Nyhus et al., 2019), as well as being involved in resolving lexical ambiguity (Strauß et al., 2014a).
In sum, we propose that the dissociable roles of alpha and theta activity, apparent in WM subprocesses and highlighted during speech processing, directly underpin the pre-diction and post-diction components illustrated in the ELU (Figure 1), respectively. This pertains especially to speech-on-speech perception, which requires one to process a degraded speech signal and inhibit background speech. While this physiological underpinning seems intuitive and has in part been alluded to in previous ELU publications (Rönnberg et al., 2019), we will show in the coming sections how this phenomenon may be relevant for aging research. Namely, reduced WM-alpha-theta activity that supports pre-diction and post-diction processes after phonological mismatch may explain poorer speech-on-speech perception in older adults. Musical training is suggested in younger adult literature to be associated with increased alpha-theta activity and in older adult literature to enhance working memory. Thus (a background with) musical training in older adults may preserve alpha-theta oscillatory activity associated with working memory longer into older age (i.e., slow the decline). Preserved alpha-theta activity may accordingly preserve the ELU explicit-loop process that can repair phonological mismatch during speech-on-speech perception.
Figure 1. Hypothesized framework. Left, During speech-on-speech perception, the Ease of Language Understanding model (ELU; Rönnberg et al., 2019) describes how mismatch in phonological binding can occur which in turn triggers an explicit processing loop to repair the mismatch. The repair loop is governed by WM, consisting of the subcomponents pre-diction (blue) and post-diction (pink). Post-diction explicitly reevaluates lexical context, and pre-diction informs the (re-)binding of phonological information from the target speech signal in part by inhibiting distractor speech. Here, we suggest that the pre-diction and post-diction subcomponents of the loop are underpinned by alpha- and theta oscillatory activity, respectively. Right, We hypothesize further that any age-related decline in WM-related alpha-theta activity can be delayed if musical training occurred in younger age, or slowed by musical training in older age. In turn, musical training may preserve speech-on-speech perception longer into older age because preserved alpha-theta activity ensures proper repair of mismatched phonological binding. WM = working memory.
Speech on Speech Perception and WM in Older Age
Aging affects sensory and cognitive mechanisms and can make speech-on-speech perception more challenging (Goossens et al., 2017). As the population continues to age and life expectancy becomes longer, addressing these challenges becomes even more relevant. Often these changes occur simultaneously, making it difficult to parse what limitations come from changes in sensory or cognitive mechanisms. On the sensory side, the incidence of hearing loss increases with advanced age (Cunningham and Tucci, 2017). In the population older than 65 years, fewer than 50% have normal hearing, which lowers to 10% in those older than 80 (Homans et al., 2017). Since speech-on-speech perception relies in part on parsing target and masker streams based on the spectrotemporal acoustic cues, e.g., pitch contours, common onset and offset, common modulations, voicing, and similar (Bregman, 1990), presbycusis could have a negative effect by potentially altering these cues. However, even when hearing loss is minimized, speech-on-speech perception has been shown to be difficult in middle-aged (Başkent et al., 2014) and older individuals (Zobel et al., 2019).
On the cognitive side, age-related changes have been observed as decline in specific functions such as speed of processing, WM, long and short-term memory (Salthouse, 1996; Deary et al., 2009; Murman, 2015; Pliatsikas et al., 2019), but not in crystallized functions such as verbal knowledge (Park and Gutchess, 2002), which likely contribute to older individuals compensating for other changes when understanding degraded speech (Saija et al., 2014; Başkent et al., 2016). Specifically related to WM, poorer speech perception in older adults has been linked to age-related changes in WM, and particularly poorer attentional control compared to younger adults (Goossens et al., 2017). Likewise, lower WM and inhibitory control performance for normal hearing older adults is associated with more profound difficulties during speech-on-speech (Vermeire et al., 2019), and auditory working memory scores have been observed as the strongest predictor of performance (Anderson et al., 2013). Thus from the perspective of the ELU, one contribution to impoverished speech-on-speech understanding in older adults could be reduced functionality in the WM-governed explicit processing loop consisting of pre-diction and post-diction that occurs after phonological mismatch (RAMBPHO).
Here we suggest a physiological framework for the ELU such that pre-diction and post-diction are underpinned by alpha and theta-oscillatory activity, respectively. This framework is consistent with neurophysiological accounts of WM decline in older adults as well. Working memory decline in older adults is marked by reductions in myelination in the PFC (Schulze et al., 2011), which in turn is associated with WM-related oscillatory activity in alpha- and theta bands (e.g., Strunk et al., 2017; Misselhorn et al., 2019). Accordingly, in older adults, observed decline in WM positively correlates with alpha-theta activity, and is predicted by parahippocampal, striatal, and superior longitudinal fasciculus white matter integrity (Steiger et al., 2019; Kumral et al., 2020). Moreover, where age-related WM decline is observed, associated modulations of alpha-theta activity could explain difficulties in speech-on-speech perception (see Strauß et al., 2014a,b). Slower resting-state alpha and lower alpha-theta power are reported in high WM-load tasks (Rondina et al., 2019), and alpha-power reduction accompanies increased susceptibility to distractions (ElShafei et al., 2020) such as speech maskers (Tun et al., 2002), which increases the likelihood of phonological mismatch during pre-diction. Older adults reputedly sometimes bind information from target and distractor streams, resulting in poorer recall (Strunk et al., 2017), possibly due to deficits in WM maintenance (Jarjat et al., 2019) and ELTM retrieval (Korkki et al., 2020) that are reflected by changes in frontal theta-activity (Tóth et al., 2014).
Taken together, we suggest that within the framework of the ELU, older adults who show signs of WM decline may have a reduced capacity to gate information and inhibit distractor speech due to reductions in alpha-activity, leading to poorer pre-diction. Additionally, declines in WM and subsequent theta-activity result in a lessened ability to maintain and manipulate a degraded speech signal during lexical retrieval, leading to poorer post-diction.
Musical Training and WM in Older Age
Musical training is a complex activity that integrates multiple sensory modalities and higher-order cognitive functions (Olszewska and Marchewka, 2021). Musical training studies cannot provide definitive evidence as to whether cognitive benefits are exclusively attained through lifelong musical training, or whether they can be achieved through training once already in older age, due to limitations in both cross-sectional (Schellenberg, 2015; Swaminathan and Schellenberg, 2018; see Hanna-Pladdy and MacKay, 2011) and intervention (Román-Caballero et al., 2018) designs. However, studies with both design types contribute to the overall picture of musical training influencing WM and speech-on-speech perception in older age to some degree.
Though effect sizes vary, cross-sectional evidence has revealed better WM performance in older adult musicians compared to non-musicians (see Table 1). In the studies that directly assess WM, several have reported significantly higher digit span (Mansens et al., 2018; Gray and Gow, 2020; Zhang et al., 2021), visuospatial span (Amer et al., 2013) and better performance on measures of WM maintenance and recall with older musicians (Parbery-Clark et al., 2011). There is evidence to also suggest increased efficiency for older musicians in tasks that indirectly measure WM, or in which WM is an important underpinning of task success. Better problem solving and reasoning (Hanna-Pladdy and Gajewski, 2012; Gray and Gow, 2020), faster and more accurate performance during auditory Stroop and reading with distraction tasks (Amer et al., 2013), superior sustained attention (Tierney et al., 2020), as well as advantages in speech-on-speech (Parbery-Clark et al., 2011; Zendel et al., 2019; Zhang et al., 2021) and verbal learning and fluency (Mansens et al., 2018) have all been reported. Additionally, those studies in which no group differences in digit span were found also revealed a musician advantage in short delayed recall tasks and letter-number sequencing in the same population (Hanna-Pladdy and MacKay, 2011; Hanna-Pladdy and Gajewski, 2012). The combined findings could allude to the limitations in relying on span tasks as a measure of WM, given that it is difficult to parse the temporal components of WM processing, within which older musicians may differ depending on the several factors that account for their musical experience and aptitude. This also highlights the difficulties associated with these studies adopting the commonly used “years of experience” (Hanna-Pladdy and MacKay, 2011; Hanna-Pladdy and Gajewski, 2012; Amer et al., 2013; Gray and Gow, 2020), or similarly weak “yes” or “no” musical training survey items (Mansens et al., 2018) as grouping criteria, as studies cannot then draw within-group comparisons between differing engagement in musical training and those temporal components of WM. Nevertheless, the broad cross-sectional data indicates a positive relationship between musical training and components of WM in older age, and studies that included additional grouping criteria based on the age of onset and current musical activity support the findings (Parbery-Clark et al., 2011; Zhang et al., 2021).
Musical intervention studies with older adults often report positive results with regards to WM as well. Following 6 weeks of one-to-one piano lessons combined with 3 h of practice per week, Bugos et al. (2007) reported better digit-span performance for musically trained older adults over controls that was maintained at a 6-month follow-up. Elsewhere, older adults who engaged in 6 months of weekly group piano lessons improved within-group digit-span performance whereas controls did not; there was, however, no between-group difference at post-test (Seinfeld et al., 2013). Although promising in terms of improving WM in older age and maintaining improvements once musical training stopped, more intervention studies are needed to determine causality.
Musical Training and Preserved Alpha-Theta Activity in Older Age?
In older age, benefits in WM related to musical training have been associated with reduced distractor interference and better inhibitory control (Moussard et al., 2016) reputedly originating from the PFC (Alain et al., 2018). As noted, the PFC is thought to underpin alpha recruitment during sensory gating and functional inhibition (Klimesch et al., 2007; Manza et al., 2014; Misselhorn et al., 2019). Beyond this, the relationship between musical training and WM-related alpha-theta activity is still somewhat unclear and to our knowledge, no research has explored musical training-related modulations of alpha recruitment during active inhibition (see Yurgil et al., 2020), especially later in life. However, reported phenomena from younger adults hint that musical training may preserve alpha-theta oscillatory activity. Increased alpha-theta connectivity has been reported in younger adult musicians over non-musicians and is thought to be involved in the propagation of information across long-range brain networks via the PFC (Klein et al., 2016). Evidence also suggests a positive effect of musical training on theta-activity specifically related to verbal WM encoding and retrieval. For instance, recent research saw musically trained younger adults outperform non-musicians during a measure of semantic integration and recall of newly learned words, where better performance correlated with increased theta connectivity between ventral and dorsal speech streams (Dittinger et al., 2018). Moreover, increased theta-activity was observed to correlate with the number of years of musical training. Similarly, younger adult musicians performed better than non-musicians in a verbal WM learning and recall task, and increased theta-activity correlated with better performance (Cheung et al., 2017). Next to this oscillation-specific evidence, recent behavioral studies reported positive association between musical training and inhibition (Moreno and Farzan, 2015), as well as musical training benefits in gating information during WM updating (Okada and Slevc, 2018).
The limited literature available from younger adults seems to align with the observed relationship between musical training and WM in older age. Taken together, this supports the idea that musical training may preserve WM-related alpha-theta activity longer into older age. In turn, age-related decline may be delayed from the perspective of the ELU, we suggest that preserved WM-related oscillatory activity in older adults may allow for better explicit repair of mismatched phonological binding during speech-on-speech perception: Preserved alpha-activity may protect from age-related decline in active inhibition of distractor speech during pre-diction. Similarly, we argue that preserved theta-activity in older adults may contribute to more successful post-diction through protection from declines in storage and manipulation of a degraded speech signal, and retrieval of appropriate semantic and lexical representations to understand target speech.
In line with this idea, there is a small body of research reporting improved speech on speech in older adults with musical training compared to adults with no musical training. Improvements in speech-on-speech perception for a group of 69-year-old adults following 6 months of piano lessons were thought to be mediated by benefits in WM (Worschech et al., 2021). Findings that revealed greater post-intervention allocation of attentional resources during speech-on-speech perception for those who underwent musical training, support this view (represented by enhanced attention-related ERP components; Zendel et al., 2019). Better verbal digit-span performance also reputedly mediated better speech-on-speech performance for older musicians in a recent cross-sectional study (Zhang et al., 2021). In Zhang et al. (2021), differences in speech-on-speech were less pronounced in younger musicians, and correlations between WM metrics and improved speech-on-speech perception were more pronounced in older adults compared to younger adults. Given that WM contains a general executive attention component (Chung et al., 2018), WM differences were perhaps reflected in another study when older musicians reportedly outperformed non-musicians during a speech-on-speech perception task, with measures of attentional control accounting for 54% of variance (Tierney et al., 2020). In one recent study where neither musical training nor WM was found to correlate with speech-on-speech (Mussoi, 2021), the author acknowledged that the study was underpowered.
Limitations
In this mini-review, we do not present comprehensive accounts of speech perception models and make no claims about how the current perspective for the role of WM and related oscillatory activity relates to other models (e.g., Grossberg, 1980; McClelland and Elman, 1986) or neurocognitive frameworks (e.g., Kotz and Schwartze, 2010; Strauß et al., 2014b; Meyer, 2018). We do not fully review all neural mechanisms related to aging (see Cabeza et al., 2018), speech-on-speech processing or hearing loss (e.g., Anderson et al., 2013), nor WM (see Baddeley, 2012; D’Esposito and Postle, 2015; Oberauer, 2019; Xu, 2021). There has been no exhaustive evaluation of WM-related oscillatory activity literature, and the appraisal of the aging, musical training, speech-on-speech, and WM literature is limited to how they intersect. This is in part due to the exploratory nature of this framework, which aims only to provide a direction for future research.
Summary and Research Perspectives
Our mini-review suggests that within the scope of the ELU model, pre-diction and post-diction components may be underpinned by WM-related alpha and theta activity, respectively. As WM function and associated alpha-theta oscillatory activity can decline during normal aging (Rondina et al., 2019), and lower alpha power is related to a lessened ability to inhibit irrelevant information (ElShafei et al., 2020), we argue that musician benefits to WM in older age may relate to preserved top-down modulation of alpha oscillations in sensory regions when understanding speech-on-speech and enhanced theta-activity relating to WM maintenance and retrieval. Specifically, preserved alpha-band activity could thus improve information gating through active inhibition and increased effort during the ELU’s pre-diction, while theta activity underpins lexical retrieval and the reconstruction of limited perceptual information during the ELU’s post-diction.
Our account highlights the need for further understanding of the relationship between WM, speech-on-speech perception, and musical training in older age. We suggest future musical training paradigms targeting speech-on-speech perception in older adults should not only limit to performance measures of music and speech perception, but also measure WM-related alpha-theta oscillations during speech-on-speech tasks. This will determine whether improvements in WM dictate any observed advantage in older musicians over non-musicians in speech-on-speech perception, and whether improvements in WM dictate this in turn. Additionally, the time course of WM-related alpha-theta oscillatory activity may be able to reflect real-time occurrence of the ELU’s pre-diction and post-diction.
Should the positive relationship between musical training, WM and speech-on-speech perception be causal, this would be extremely promising, not only to clarify inconsistencies in the literature, but also at the societal level. With an increasing life expectancy and aging population, the possible benefits of musical training, whether achievable earlier or later in life, are important to society. Future research addressing online recruitment of WM-related neural resources such as alpha-theta activity during speech-on-speech perception tasks could further help to understand this relationship. Causation can be established by musical intervention studies, and musician/non-musician cross-sectional studies should carefully address heterogeneity in WM capabilities and musical backgrounds of older adults. The knowledge of what effects occur due to age-related sensory changes and what effects due to age-related cognitive changes can be gained with experimental designs that utilize different groups of older individuals with varying hearing status, ranging from age minimal to age typical hearing loss. This new information from such studies can lead to better ways of utilizing successful sensory-cognitive integration during speech-on-speech perception in terms of cognitive compensation. This may also provide an avenue for future customized training and rehabilitation tools for older adults with or without presbycusis.
Author Contributions
RG, AS, and EH conceived of the research. All authors contributed to the manuscript and contributed to the article and approved the submitted version.
Funding
This study was funded by the Dorhout Mees Foundation, the Gratama Foundation, Heinsus Houbolt Funds, and the VICI Grant No. 016.VICI.170.111 from the Netherlands Organization for Scientific Research (NWO) and the Netherlands Organization for Health Research and Development (ZonMw).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnagi.2022.806439/full#supplementary-material
References
Alain, C., Khatamian, Y., He, Y., Lee, Y., Moreno, S., Leung, A. W. S., et al. (2018). Different neural activities support auditory working memory in musicians and bilinguals: neural resources in musicians and bilinguals. Ann. N. Y. Acad. Sci. 1423, 435–446. doi: 10.1111/nyas.13717
Amer, T., Kalender, B., Hasher, L., Trehub, S. E., and Wong, Y. (2013). Do Older professional musicians have cognitive advantages? PLoS One 8:e71630. doi: 10.1371/journal.pone.0071630
Anderson, S., Parbery-Clark, A., White-Schwoch, T., Drehobl, S., and Kraus, N. (2013). Effects of hearing loss on the subcortical representation of speech cues. J. Acoust. Soc. Am. 133, 3030–3038. doi: 10.1121/1.4799804
Baddeley, A. (2012). Working memory: theories, models, and controversies. Annu. Rev. Psychol. 63, 1–29. doi: 10.1146/annurev-psych-120710-100422
Başkent, D., and Gaudrain, E. (2016). Musician advantage for speech-on-speech perception. J. Acoust. Soc. Am. 139, EL51–EL56. doi: 10.1121/1.4942628
Başkent, D., Clarke, J., Pals, C., Benard, M. R., Bhargava, P., Saija, J., et al. (2016). Cognitive compensation of speech perception with hearing impairment, cochlear implants, and aging: how and to what degree can it be achieved? Trends Hearing 20, 1–16.
Başkent, D., van Engelshoven, S., and Galvin, J. J. (2014). Susceptibility to interference by music and speech maskers in middle-aged adults. J. Acoust. Soc. Am. 135, EL147–EL153. doi: 10.1121/1.4865261
Bregman, A. S. (1990). Auditory Scene Analysis: The Perceptual Organization of Sound. Cambridge, MA: The MIT Press.
Brungart, D. S. (2001). Informational and energetic masking effects in the perception of two simultaneous talkers. J. Acoust. Soc. Am. 109, 1101–1109. doi: 10.1121/1.1345696
Bugos, J. A., Perlstein, W. M., McCrae, C. S., Brophy, T. S., and Bedenbaugh, P. H. (2007). Individualized Piano Instruction enhances executive functioning and working memory in older adults. Aging Mental Health 11, 464–471. doi: 10.1080/13607860601086504
Cabeza, R., Albert, M., Belleville, S., Craik, F., Duarte, A., Grady, C., et al. (2018). Cognitive neuroscience of healthy aging: maintenance, reserve, and compensation. Nat. Rev. Neurosci. 19:701. doi: 10.1038/s41583-018-0068-2
Chai, W. J., Abd Hamid, A. I., and Abdullah, J. M. (2018). Working memory from the psychological and neurosciences perspectives: a review. Front. Psychol. 9:401. doi: 10.3389/fpsyg.2018.00401
Cheung, M. C., Chan, A. S., Liu, Y., Law, D., and Wong, C. W. (2017). Music training is associated with cortical synchronization reflected in EEG coherence during verbal memory encoding. PLoS One 12:e0174906. doi: 10.1371/journal.pone.0174906
Chung, S., Fieremans, E., Kucukboyaci, N. E., Wang, X., Morton, C. J., Novikov, D. S., et al. (2018). Working memory and brain tissue microstructure: white matter tract integrity based on multi-shell diffusion MRI. Sci. Rep. 8:3175. doi: 10.1038/s41598-018-21428-4
Cunningham, L. L., and Tucci, D. L. (2017). Hearing loss in adults. N. Engl. J. Med. 377, 2465–2473. doi: 10.1056/NEJMra1616601
Darwin, C. J., Brungart, D. S., and Simpson, B. D. (2003). Effects of fundamental frequency and vocal- tract length changes on attention to one of two simultaneous talkers. J. Acoust. Soc. Am. 114, 2913–2922. doi: 10.1121/1.1616924
Deary, I. J., Corley, J., Gow, A. J., Harris, S. E., Houlihan, L. M., Marioni, R. E., et al. (2009). Age-associated cognitive decline. Br. Med. Bull. 92, 135–152. doi: 10.1093/bmb/ldp033
D’Esposito, M., and Postle, B. R. (2015). The cognitive neuroscience of working memory. Annu. Rev. Psychol. 66, 115–142.
Dimitrijevic, A., Smith, M. L., Kadis, D. S., and Moore, D. R. (2019). Neural indices of listening effort in noisy environments. Sci. Rep. 9:11278. doi: 10.1038/s41598-019-47643-1
Dittinger, E., Valizadeh, S. A., Jäncke, L., Besson, M., and Elmer, S. (2018). Increased functional connectivity in the ventral and dorsal streams during retrieval of novel words in professional musicians. Hum. Brain Mapp. 39, 722–734. doi: 10.1002/hbm.23877
ElShafei, H. A., Fornoni, L., Masson, R., Bertrand, O., and Bidet-Caulet, A. (2020). Age-related modulations of alpha and gamma brain activities underlying anticipation and distraction. PLoS One 15:e0229334. doi: 10.1371/journal.pone.0229334
Eriksson, J., Vogel, E. K., Lansner, A., Bergström, F., and Nyberg, L. (2015). Neurocognitive architecture of working memory. Neuron 88, 33–46. doi: 10.1016/j.neuron.2015.09.020
Getzmann, S., Wascher, E., and Schneider, D. (2018). The role of inhibition for working memory processes: ERP evidence from a short-term storage task. Psychophysiology 55:e13026. doi: 10.1111/psyp.13026
Goossens, T., Vercammen, C., Wouters, J., and van Wieringen, A. (2017). Masked speech perception across the adult lifespan: impact of age and hearing impairment. Hear. Res. 344, 109–124. doi: 10.1016/j.heares.2016.11.004
Gray, R., and Gow, A. J. (2020). How is musical activity associated with cognitive ability in later life? Aging Neuropsychol. Cogn. 27, 617–635. doi: 10.1080/13825585.2019.1660300
Grossberg, S. (1980). Intracellular mechanisms of adaptation and self-regulation in self-organizing networks:the role of chemical transducers. Bull. Math. Biol. 42, 365–396.
Hanna-Pladdy, B., and Gajewski, B. (2012). Recent and past musical activity predicts cognitive aging variability: direct comparison with general lifestyle activities. Front. Hum. Neurosci. 6:198. doi: 10.3389/fnhum.2012.00198
Hanna-Pladdy, B., and MacKay, A. (2011). The relation between instrumental musical activity and cognitive aging. Neuropsychology 25, 378–386. doi: 10.1037/a0021895
Heald, S. L. M., and Nusbaum, H. C. (2014). Speech perception as an active cognitive process. Front. Systems Neurosci. 8:35. doi: 10.3389/fnsys.2014.00035
Homans, N. C., Metselaar, R. M., Dingemanse, J. G., van der Schroeff, M. P., Brocaar, M. P., Wieringa, M. H., et al. (2017). Prevalence of age-related hearing loss, including sex differences, in older adults in a large cohort study. Laryngoscope 127, 725–730. doi: 10.1002/lary.26150
Hsieh, L. T., and Ranganath, C. (2014). Frontal midline theta oscillations during working memory maintenance and episodic encoding and retrieval. Neuroimage 85, 721–729.
Jarjat, G., Portrat, S., and Hot, P. (2019). Aging influences the efficiency of attentional maintenance in verbal working memory. J. Gerontol. Series B 74, 600–608. doi: 10.1093/geronb/gby067
Johnson, K., and Ralston, J. V. (1994). Automaticity in speech perception: some speech/nonspeech comparisons. Phonetica 51, 195–209. doi: 10.1159/000261975
Klein, C., Diaz Hernandez, L., Koenig, T., Kottlow, M., Elmer, S., and Jäncke, L. (2016). The influence of pre-stimulus EEG activity on reaction time during a verbal sternberg task is related to musical expertise. Brain Topogr. 29, 67–81. doi: 10.1007/s10548-015-0433-7
Klimesch, W., Sauseng, P., and Hanslmayr, S. (2007). EEG alpha oscillations: the inhibition-timing hypothesis. Brain Res. Rev. 53, 63–88. doi: 10.1016/j.brainresrev.2006.06.003
Korkki, S. M., Richter, F. R., Jeyarathnarajah, P., and Simons, J. S. (2020). Healthy ageing reduces the precision of episodic memory retrieval. Psychol. Aging 35, 124–142. doi: 10.1037/pag0000432
Kotz, S. A., and Schwartze, M. (2010). Cortical speech processing unplugged: a timely subcortico-cortical framework. Trends Cogn. Sci. 14, 392–399. doi: 10.1016/j.tics.2010.06.005
Kumral, D., Cesnaite, E., Beyer, F., Hensch, T., Sander, C., Hegerl, U., et al. (2020). Relationship between regional white matter hyperintensities and alpha oscillations in older adults. Neurobiol. Aging. Online ahead of print. doi: 10.1016/j.neurobiolaging.2021.10.006
Mansens, D., Deeg, D. J. H., and Comijs, H. C. (2018). The association between singing and/or playing a musical instrument and cognitive functions in older adults. Aging Mental Health 22, 970–977. doi: 10.1080/13607863.2017.1328481
Manza, P., Hau, C. L. V., and Leung, H. C. (2014). Alpha power gates relevant information during working memory updating. J. Neurosci. 34, 5998–6002.
Marko, M., Cimrová, B., and Riečanský, I. (2019). Neural theta oscillations support semantic memory retrieval. Sci. Rep. 9:17667. doi: 10.1038/s41598-019-53813-y
Mattys, S. L., Davis, M. H., Bradlow, A. R., and Scott, S. K. (2012). Speech recognition in adverse conditions: a review. Lang. Cogn. Processes 27, 953–978. doi: 10.1080/01690965.2012.705006
McClelland, J. L., and Elman, J. L. (1986). The TRACE model of speech perception. Cognit. Psychol. 18, 1–86. doi: 10.1016/0010-0285(86)90015-0
Meyer, L. (2018). The neural oscillations of speech processing and language comprehension: state of the art and emerging mechanisms. Eur. J. Neurosci. 48, 2609–2621. doi: 10.1111/ejn.13748
Meyer, L., Grigutsch, M., Schmuck, N., Gaston, P., and Friederici, A. D. (2015). Frontal-posterior theta oscillations reflect memory retrieval during sentence comprehension. Cortex 71, 205–218. doi: 10.1016/j.cortex.2015.06.027
Misselhorn, J., Friese, U., and Engel, A. K. (2019). Frontal and parietal alpha oscillations reflect attentional modulation of cross-modal matching. Sci. Rep. 9:5030. doi: 10.1038/s41598-019-41636-w
Moreno, S., and Farzan, F. (2015). Music training and inhibitory control: a multidimensional model. Ann. N. Y. Acad. Sci. 1337, 147–152. doi: 10.1111/nyas.12674
Moussard, A., Bermudez, P., Alain, C., Tays, W., and Moreno, S. (2016). Life-long music practice and executive control in older adults: an event-related potential study. Brain Res. 1642, 146–153. doi: 10.1016/j.brainres.2016.03.028
Murman, D. (2015). The impact of age on cognition. Sem. Hearing 36, 111–121. doi: 10.1055/s-0035-1555115
Mussoi, B. S. (2021). The impact of music training and working memory on speech recognition in older age. J. Speech Lang. Hear Res. 64, 4524–4534. doi: 10.1044/2021_JSLHR-20-00426
Nyhus, E., Engel, W. A., Pitfield, T. D., and Vakkur, I. M. W. (2019). Increases in theta oscillatory activity during episodic memory retrieval following mindfulness meditation training. Front. Hum. Neurosci. 13:311. doi: 10.3389/fnhum.2019.00311
Oberauer, K. (2019). Working memory and attention - a conceptual analysis and review. J. Cogn. 2:36. doi: 10.5334/joc.58
Oberfeld, D., and Klöckner-Nowotny, F. (2016). Individual differences in selective attention predict speech identification at a cocktail party. eLife 5:e16747. doi: 10.7554/eLife.16747
Obleser, J., Wöstmann, M., Hellbernd, N., Wilsch, A., and Maess, B. (2012). Adverse listening conditions and memory load drive a common α oscillatory network. J. Neurosci. 32, 12376–12383. doi: 10.1523/JNEUROSCI.4908-11.2012
Okada, B. M., and Slevc, L. R. (2018). Individual differences in musical training and executive functions: a latent variable approach. Mem. Cogn. 46, 1076–1092. doi: 10.3758/s13421-018-0822-8
Olszewska, A. M., and Marchewka, A. (2021). How musical training shapes the adult brain: predispositions and neuroplasticity. Front. Neurosci. 15:630829. doi: 10.3389/fnins.2021.630829
Parbery-Clark, A., Strait, D. L., Anderson, S., Hittner, E., and Kraus, N. (2011). Musical experience and the aging auditory system: implications for cognitive abilities and hearing speech in noise. PLoS One 6:e18082. doi: 10.1371/journal.pone.0018082
Park, D. C., and Gutchess, A. H. (2002). Aging, cognition, and culture: a neuroscientific perspective. Neurosci. Biobehav. Rev. 26, 859–867. doi: 10.1016/s0149-7634(02)00072-6
Paul, B. T., Chen, J., Le, T., Lin, V., and Dimitrijevic, A. (2021). Cortical alpha oscillations in cochlear implant users reflect subjective listening effort during speech-in-noise perception. PLoS One 16:e0254162. doi: 10.1371/journal.pone.0254162
Pliatsikas, C., Veríssimo, J., Babcock, L., Pullman, M. Y., Glei, D. A., Weinstein, M., et al. (2019). Working memory in older adults declines with age, but is modulated by sex and education. Quarterly J. Exp. Psychol. 72, 1308–1327. doi: 10.1177/1747021818791994
Roberts, B. M., Clarke, A., Addante, R. J., and Ranganath, C. (2018). Entrainment enhances theta oscillations and improves episodic memory. Cogn. Neurosci. 9, 181–193. doi: 10.1080/17588928.2018.1521386
Román-Caballero, R., Arnedo, M., Triviño, M., and Lupiáñez, J. (2018). Musical practice as an enhancer of cognitive function in healthy aging—a systematic review and meta-analysis. PLoS One 13:e0207957. doi: 10.1371/journal.pone.0207957
Rondina, R. II, Olsen, R. K., Li, L., Meltzer, J. A., and Ryan, J. D. (2019). Age-related changes to oscillatory dynamics during maintenance and retrieval in a relational memory task. PLoS One 14:e0211851. doi: 10.1371/journal.pone.0211851
Rönnberg, J., Holmer, E., and Rudner, M. (2019). Cognitive hearing science and ease of language understanding. Int. J. Audiol. 58, 247–261. doi: 10.1080/14992027.2018.1551631
Rönnberg, J., Holmer, E., and Rudner, M. (2021). Cognitive hearing science: three memory systems, two approaches, and the ease of language understanding model. J. Speech Lang. Hear Res. 64, 359–370. doi: 10.1044/2020_JSLHR-20-00007
Rönnberg, J., Lunner, T., Zekveld, A., Sörqvist, P., Danielsson, H., Lyxell, B., et al. (2013). The Ease of Language Understanding (ELU) model: theoretical, empirical, and clinical advances. Front. Systems Neurosci. 7:31. doi: 10.3389/fnsys.2013.00031
Roux, F., and Uhlhaas, P. J. (2014). Working memory and neural oscillations: Alpha-gamma versus theta-gamma codes for distinct WM information? Trends Cogn. Sci. 18, 16–25. doi: 10.1016/j.tics.2013.10.010
Saija, J. D., Akyürek, E. G., Andringa, T. C., and Başkent, D. (2014). Perceptual restoration of degraded speech is preserved with advancing age. J. Assoc. Res. Otolaryngol. 15, 139–148. doi: 10.1007/s10162-013-0422-z
Salthouse, T. A. (1996). The processing-speed theory of adult age differences in cognition. Psychol. Rev. 103, 403–428. doi: 10.1037/0033-295X.103.3.403
Schellenberg, E. G. (2015). Music training and speech perception: a gene-environment interaction: music training and speech perception. Ann. N. Y. Acad. Sci. 1337, 170–177. doi: 10.1111/nyas.12627
Schulze, E. T., Geary, E. K., Susmaras, T. M., Paliga, J. T., Maki, P. M., and Little, D. M. (2011). Anatomical correlates of age-related working memory declines. J. Aging Res. 2011:606871. doi: 10.4061/2011/606871
Seinfeld, S., Figueroa, H., Ortiz-Gil, J., and Sanchez-Vives, M. V. (2013). Effects of music learning and piano practice on cognitive function, mood and quality of life in older adults. Front. Psychol. 4:810. doi: 10.3389/fpsyg.2013.00810
Sörqvist, P., and Rönnberg, J. (2012). Episodic long-term memory of spoken discourse masked by speech: what is the role for working memory capacity? J. Speech Lang. Hear. Res. 55, 210–218. doi: 10.1044/1092-4388(2011/10-0353)
Steiger, T. K., Herweg, N. A., Menz, M. M., and Bunzeck, N. (2019). Working memory performance in the elderly relates to theta-alpha oscillations and is predicted by parahippocampal and striatal integrity. Sci. Rep. 9:706. doi: 10.1038/s41598-018-36793-3
Strauß, A., Kotz, S. A., Scharinger, M., and Obleser, J. (2014a). Alpha and theta brain oscillations index dissociable processes in spoken word recognition. Neuroimage 97, 387–395. doi: 10.1016/j.neuroimage.2014.04.005
Strauß, A., Wöstmann, M., and Obleser, J. (2014b). Cortical alpha oscillations as a tool for auditory selective inhibition. Front. Hum. Neurosci. 8:350. doi: 10.3389/fnhum.2014.00350
Strunk, J., James, T., Arndt, J., and Duarte, A. (2017). Age-related changes in neural oscillations supporting context memory retrieval. Cortex 91, 40–55. doi: 10.1016/j.cortex.2017.01.020
Swaminathan, S., and Schellenberg, E. G. (2018). Musical competence is predicted by music training. cognitive abilities, and personality. Sci. Rep. 8:9223. doi: 10.1038/s41598-018-27571-2
Tiego, J., Testa, R., Bellgrove, M. A., Pantelis, C., and Whittle, S. (2018). A hierarchical model of inhibitory control. Front. Psychol. 9:1339. doi: 10.3389/fpsyg.2018.01339
Tierney, A., Rosen, S., and Dick, F. (2020). Speech-in-speech perception, non-verbal selective attention, and musical training. J. Exp. Psychol. Learn. Mem. Cogn. 46, 968–979. doi: 10.1037/xlm0000767
Tóth, B., Kardos, Z., File, B., Boha, R., Stam, C. J., and Molnár, M. (2014). Frontal midline theta connectivity is related to efficiency of WM maintenance and is affected by aging. Neurobiol. Learn. Mem. 114, 58–69. doi: 10.1016/j.nlm.2014.04.009
Tun, P., O’Kane, G., and Wingfield, A. (2002). Distraction by competing speech in young and older adult listeners. Psychol. Aging 17, 453–467. doi: 10.1037//0882-7974.17.3.453
van Knijff, E. C., Coene, M., and Govaerts, P. J. (2018). Speech understanding in noise in elderly adults: the effect of inhibitory control and syntactic complexity: speech in noise understanding in elderly adults. Int. J. Lang. Commun. Disord. 53, 628–642. doi: 10.1111/1460-6984.12376
Vermeire, K., Knoop, A., De Sloovere, M., Bosch, P., and van den Noort, M. (2019). Relationship between working memory and speech-in-noise recognition in young and older adult listeners with age-appropriate hearing. J. Speech Lang. Hear. Res. 62, 3545–3553. doi: 10.1044/2019_JSLHR-H-18-0307
Viswanathan, N., Kokkinakis, K., and Williams, B. T. (2016). Spatially separating language masker from target results in spatial and linguistic masking release. J. Acoust. Soc. Am. 140, EL465–EL470. doi: 10.1121/1.4968034
Worschech, F., Marie, D., Jünemann, K., Sinke, C., Krüger, T. H. C., Großbach, M., et al. (2021). Improved speech in noise perception in the elderly after 6 months of musical instruction. Front. Neurosci. 15:696240. doi: 10.3389/fnins.2021.696240
Xu, Y. (2021). Towards a better understanding of information storage in visual working memory. Visual Cogn. 29, 437–445. doi: 10.1080/13506285.2021.1946230
Yurgil, K. A., Velasquez, M. A., Winston, J. L., Reichman, N. B., and Colombo, P. J. (2020). Music training, working memory, and neural oscillations: a review. Front. Psychol. 11:266. doi: 10.3389/fpsyg.2020.00266
Zendel, B. R., West, G. L., Belleville, S., and Peretz, I. (2019). Musical training improves the ability to understand speech-in-noise in older adults. Neurobiol. Aging 81, 102–115. doi: 10.1016/j.neurobiolaging.2019.05.015
Zhang, L., Fu, X., Luo, D., Xing, L., and Du, Y. (2021). Musical experience offsets age-related decline in understanding speech-in-noise: type of training does not matter. working memory is the key. Ear Hear. 42, 258–270. doi: 10.1097/AUD.0000000000000921
Keywords: older adults, alpha, theta, working memory, musical training, speech-on-speech perception
Citation: Gray R, Sarampalis A, Başkent D and Harding EE (2022) Working-Memory, Alpha-Theta Oscillations and Musical Training in Older Age: Research Perspectives for Speech-on-speech Perception. Front. Aging Neurosci. 14:806439. doi: 10.3389/fnagi.2022.806439
Received: 31 October 2021; Accepted: 24 March 2022;
Published: 11 May 2022.
Edited by:
Jennifer Campos, University Health Network, CanadaReviewed by:
Alexandra Parbery-Clark, Swedish Medical Center, United StatesSebastian Puschmann, University of Oldenburg, Germany
Copyright © 2022 Gray, Sarampalis, Başkent and Harding. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Eleanor E. Harding, e.e.harding@rug.nl