- 1Max Planck Research Group “Auditory Cognition”, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
- 2International Max Planck Research School on Neuroscience of Communication, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
Listening to speech is often demanding because of signal degradations and the presence of distracting sounds (i.e., “noise”). The question how the brain achieves the task of extracting only relevant information from the mixture of sounds reaching the ear (i.e., “cocktail party problem”) is still open. In analogy to recent findings in vision, we propose cortical alpha (~10 Hz) oscillations measurable using M/EEG as a pivotal mechanism to selectively inhibit the processing of noise to improve auditory selective attention to task-relevant signals. We review initial evidence of enhanced alpha activity in selective listening tasks, suggesting a significant role of alpha-modulated noise suppression in speech. We discuss the importance of dissociating between noise interference in the auditory periphery (i.e., energetic masking) and noise interference with more central cognitive aspects of speech processing (i.e., informational masking). Finally, we point out the adverse effects of age-related hearing loss and/or cognitive decline on auditory selective inhibition. With this perspective article, we set the stage for future studies on the inhibitory role of alpha oscillations for speech processing in challenging listening situations.
1. Introduction
In ecological listening situations, auditory signals are rarely perceived in quiet due to the presence of different auditory maskers such as distracting background speech or environmental noise. Thus, sounds from different sources greatly overlap spectro-temporally at the level of the listener's ear. What are the neural correlates that facilitate selective listening to relevant target signals despite irrelevant auditory input (i.e., the “cocktail party problem”; Cherry, 1953)? At the central neural level, two complementary mechanisms of top–down control (i.e., regulation of subsidiary cognitive processes) should be considered: First, top–down selective attention to relevant information (Fritz et al., 2007) could facilitate target processing by enhancing the neural response to the attended stream (i.e., gain control; Lee et al., 2013). Second, top–down selective inhibition of maskers (Melara et al., 2002) could help to direct limited processing capacities away from irrelevant information (Desimone and Duncan, 1995), thereby avoiding full processing of distractors (Foxe and Snyder, 2011).
In this regard, interference of auditory maskers might be the result of both insufficient attention to the target and poor inhibition of noise and distractors. In this perspective article we focus on the latter, that is, neural mechanisms of auditory selective inhibition. We propose that cortical alpha (~10 Hz) oscillations are an important tool for top–down control as they regulate the inhibition of masker information during speech processing in challenging listening situations.
2. The Functional Significance of Alpha Oscillations
Neural oscillations in the alpha frequency range (~10 Hz) are the most dominant signal measurable in the human magneto- and electroencephalogram (M/EEG), going back to their first description by Berger (1931). The earliest observations of the alpha rhythm revealed that its amplitude is enhanced in humans who are awake but not actively engaged in any task. This finding led initially to the view that high alpha power might simply reflect the default state of brain inactivity or “cortical idling” (for a review, see Pfurtscheller et al., 1996).
Only within the last two decades, the functional significance of alpha oscillations has been recognized and furthermore its ubiquitous role across sensory modalities (visual: for review see Mathewson et al., 2011; sensorimotor: e.g., Haegens et al., 2012; auditory: e.g., Hartmann et al., 2012) and cognitive tasks (working memory: e.g., Jensen et al., 2002; attention: for a review see Klimesch, 2012; decision making: e.g., Cohen et al., 2009). One unifying mechanism suggested for alpha rhythms across modalities and brain areas is that it provides a neural means to functionally inhibit the processing of currently task-irrelevant or task-detrimental information (Jensen and Mazaheri, 2010; Foxe and Snyder, 2011). Please note that the opposite mechanism also has been proposed where higher inter-areal alpha phase synchronization does not index cortical inhibition but increased information processing such as for internal (working memory related) information processes (Palva and Palva, 2011). The functional inhibition hypothesis, though, has received neurophysiological support. For example, both alpha power (i.e., squared amplitude) and alpha phase modulate neuronal spike rate (Haegens et al., 2011) and thus can directly affect the efficiency of neural information flow. In future, the alpha network needs to be further characterized by its phase–amplitude coupling to gamma oscillations (Jensen et al., 2012) and its role in top–down control as implemented in different cortical layers (Buffalo et al., 2011; Spaak et al., 2012) or in thalamico-cortical communication (Strauss et al., 2010; Roux et al., 2013).
Despite the abundance of studies on the role of alpha activity for visual selective inhibition, there are currently few studies that directly examine the role of alpha activity in the auditory modality. Recently, a series of studies found modulations in alpha power in a variety of auditory tasks prompted by degraded spectral detail (Obleser and Weisz, 2012), missing temporal expectations (Wilsch et al., 2014), working memory load (Leiberg et al., 2006; Obleser et al., 2012), or syntactic complexity (Meyer et al., 2013). Together, these findings provide good evidence that alpha oscillatory power can be a reliable indicator of auditory cognitive load (see also Luo et al., 2005; Kaiser et al., 2007). In the following section, we argue that part of this cognitive load occurs due to auditory selective inhibition as a compensatory mechanism for demanding listening situations and manifests in enhanced alpha power.
3. Alpha Oscillations as a Tool for Auditory Selective Inhibition
A common observation from our laboratory is a prominent increase in alpha power when participants listen to auditory materials presented against background noise (e.g., Wilsch et al., 2014). Figure 1A, for example, shows the grand average alpha power of 11 participants during a lexical decision task on isolated words presented in quiet (published in Strauß et al., 2014) and in white noise. For words in quiet, alpha power at around 10 Hz did not considerably increase after word onset. However, when words were presented in noise, alpha power was increased during the first 500 ms after word onset corresponding to the first two thirds of the average word duration. This effect was strongest over temporal and occipital sites (topography in Figure 1A) suggesting the inhibition of the task irrelevant visual modality but also compensatory mechanisms within speech-related areas. Critically, alpha power difference did not depend on ITPC (inter-trial phase coherence) differences, as indicated by the absence of a stronger ITPC in noise compared to quiet (Figure 1B). In fact, no significant ITPC differences were observed between 0.2 and 0.5 s. We therefore presume that induced (i.e., not strictly stimulus-locked; Freunberger et al., 2009) alpha power is crucial for speech processing in challenging listening conditions as it suppresses irrelevant information.
Figure 1. The proposed role of alpha activity for speech processing in noise. (A) Average absolute alpha power of 11 participants performing a lexical decision task on words in quiet (top) and in white noise (bottom). SNRs were titrated individually using a two-down-one-up staircase adaptive tracking procedure. Average SNR was −10.22 dB ±1.95 (SD) such that participants performed about 71% correct. Speech onset is indicated by the black vertical line at 0 s; average word length = 750 ms; EEG recorded from 61 scalp electrodes; time-frequency analysis using Morlet wavelets. Plots show measures of absolute power averaged over all scalp electrodes. Topography depicts the alpha power difference for speech in noise–quiet. Data were SCD (source current density)-transformed before power estimation to improve spatial resolution. (B) Inter-trial phase coherence (ITPC) as a measure of phase-locking of oscillations over trials. ITPC is bound between 0 and 1; higher ITPC values indicate stronger phase alignment across trials. (C) A simple framework of alpha oscillations for speech processing in noise. Acoustic signals overlap energetically as they enter the ear. At the brain level, features of speech and noise are processed as far as possible in distinct processing channels (depicted here with arrows; for details see text). High alpha power inhibits channels processing noise features to allow for an optimal task performance with minimized noise interference.
Figure 1C illustrates a tentative framework for how alpha oscillations could support auditory selective inhibition. Sounds arriving at the listener's ear must be further processed in the brain to extract task-relevant information. One way to think about the proposed mechanism is in terms of auditory object selection which requires object formation in the first place (Shinn-Cunningham, 2008). An auditory object might be formed on the basis of common spectro-temporal features, harmonicity, simultaneous onsets, or spatial grouping (Griffiths and Warren, 2004; Bizley and Cohen, 2013). We refer to all these different features used to form auditory objects as “channels” of auditory information represented by the arrows in Figure 1C. The concept of channels has a long tradition (Broadbent, 1958) and is inspired by the most clear distinction of target and distractor used in many dichotic listening paradigms where left and right ear channel need to be separated. Nevertheless, channels in our framework should be conceived as functional auditory processing units rather than anatomical pathways. As soon as these channels are defined, attention or inhibition can be selectively applied, given attentionally flexible fields in the auditory cortices (Petkov et al., 2004). Note that even though in the visual modality claims about alpha oscillations in feature-based (Romei et al., 2012) and object-based (Kinsey et al., 2011) attention have been made, we do not make any assumption about this distinction in our framework and use the term “channels” for both features and objects, or early and late selection.
If speech is presented in quiet (Figure 1C, top panel), alpha power is low in channels processing features of the speech signal to support processing of task-relevant information. Accordingly, the net resulting alpha power in the M/EEG would continue on baseline level (Figure 1A) and decrease during word integration (>400 ms). If, however, speech is presented in the presence of maskers (e.g., environmental noise, distracting talkers; Figure 1C, bottom panel), alpha power needs to be up-regulated first in those channels processing noise features before it is going to be suppressed during word integration (Figure 1A). Enhanced alpha activity inhibits processing of noise and thereby “protects” (Klimesch, 1999; Roux and Uhlhaas, 2014) the task- or performance-relevant information in the speech signal from noise interference.
Importantly, the up-regulation of alpha power in channels that process noise is not an automatic (“bottom–up”) process but critically depends on “top–down” attentional control. For instance, in a multi-talker situation, target and distracting talker switch roles permanently, as the listener decides to change the conversational partner. In such a situation, M/EEG alpha power would be constantly at a high level; however, the deployment of alpha power onto the different processing channels would be changing continuously.
What is the functional role of high alpha activity for word processing in noise? To answer this question, it is essential to distinguish between interpretations in which alpha activity is related to target processing from these related to noise processing. It is possible that the reduced intelligibility of words in noise leads to sub-optimal word processing and thus to less alpha suppression in brain areas relevant for speech processing (Strauß et al., 2014). The inverse mechanism, as we put forward in the current framework, is equally likely by which alpha power is enhanced for temporarily irrelevant information and thereby compensates for perceived cognitive effort (increased when listening to speech in noise: Larsby et al., 2005; Helfer et al., 2010; Zekveld et al., 2011). In this regard, alpha would “protect” the lexical processes from noise interference. The challenge will be to experimentally dissect these (not mutually exclusive) mechanisms. We now review initial evidence for alpha's inhibitory role in audition.
Currently, there are only few studies that show alpha power modulations when participants simultaneously listen to two auditory streams, that is, one signal and one masker. In one study by Kerlin et al. (2010), participants were simultaneously listening to two spatially separated speech streams. On each trial, an initial visual cue indicated whether they were supposed to attend the left or right stream. During speech presentation, EEG alpha power was enhanced over the cerebral hemisphere contralateral to the masker, while alpha power was reduced contralateral to the to-be-attended stream. The authors concluded that this alpha lateralization indexes the direction of auditory attention to speech in space. Importantly, this finding corroborates our view that enhanced alpha power in brain areas engaged in distractor processing decreases further processing of the distractor and hence, facilitates processing of the target signal. However, two questions arise from this study: First, as the direction of auditory attention was cued visually in this study, it might be that the alpha lateralization indicates the allocation of supramodal rather than auditory selective attention (Farah et al., 1989). Second, spatial attention may play a special role not least because of auditory processing models suggesting separate what- and where-pathways (Rauschecker and Scott, 2009).
In three other recent studies, alpha power modulations were consistently found during the anticipation of auditory target signals from the left or right (Banerjee et al., 2011; Müller and Weisz, 2012; Ahveninen et al., 2013). In these studies, participants were cued to attend either the auditory event on the left or right, and to ignore the distractor on the other side. Alpha power was enhanced during the anticipation of auditory stimulation contralateral to the distractor. These results demonstrate alpha lateralization effects already during the preparation for an auditory selective listening task. This is in line with studies reporting high pre-stimulus alpha power when participants are about to miss a (visual) target (van Dijk et al., 2008; Busch et al., 2009; Romei et al., 2010). In terms of our framework (Figure 1C), anticipatory high alpha power successfully blocks in-depth processing of sensory information that might lead to missing the target.
However, interpretations of these studies are limited for our model, since alpha power modulations were found only during the anticipation but not during the actual processing of competing auditory streams. More data are clearly needed on the peri-stimulus alpha dynamics. As the spatial resolution of M/EEG is limited, prospective experiments could induce alpha oscillations over specific brain areas using transcranial alternating current stimulation (tACS) to assess the influence of alpha modulations on listening success under adverse acoustic conditions. Moreover, future studies could record the electrocorticogram (ECoG) directly from the cortical surface to track alpha sources and reveal the interplay between frequency bands. Such higher spatial resolution would allow to differentiate between alpha activity in brain regions associated with processing the masker or the signal. As of now, we are left to speculate how spatially specific alpha oscillations might operate, for example along a cochleotopic gradient in primary auditory cortex. The best data to infer from stems from visual cortex, where for example Buffalo and colleagues recorded with two electrode tips in attended vs. non-attended receptive fields less than a millimeter apart and report attention-dependent opposing, and deep-layer-specific alpha changes (expressed as alpha spike-field coherence; Buffalo et al., 2011). Comparable data are, to our knowledge, still missing for auditory areas.
In the next two sections, we will elaborate first, at which levels of auditory processing alpha power might be deployed for the inhibition of different kinds of auditory maskers, and second, how age and hearing loss might affect auditory selective inhibition.
4. Masking Release Via Alpha Enhancement Along the Auditory Pathway
So far, we have shown that alpha oscillations are an attractive neural candidate mechanism of selective auditory inhibition. There are different aspects which need to be systematically investigated in order to determine the role of alpha: Which neural circuits “deploy” or trigger high-alpha states? And in terms of the current framework: What kind of channels can be attenuated by enhanced alpha power?
Currently, there are few studies mapping the sources of alpha power during masked auditory processing. Some evidence has accumulated showing noise-invariant representations of the signal in auditory cortices (Chang et al., 2010; Ding and Simon, 2012) with the degree of invariance increasing from peripheral to cortical processing stages (Rabinowitz et al., 2013). If we assume that alpha is an important central mechanism to inhibit various types of maskers, these studies suggest that masking release via alpha enhancement might occur as early as in primary auditory cortex. A first direct hint to this idea might be the case of an illusory sound percept like tinnitus, which can be centrally suppressed by means of increasing alpha power in primary auditory cortex (Leske et al., 2013; Weisz et al., 2014). This is in line with research showing that attention modulates activity in sensory cortices corresponding to the modality of the stimulus (e.g., Heinrich et al., 2011; Wild et al., 2012). Thus, alpha activity in primary auditory cortex might be crucially contributing to inhibiting the formation of auditory objects.
In future studies investigating underlying alpha sources, a distinction between energetic and informational masking might be crucial (Brungart et al., 2001; Mattys et al., 2009; Scott and McGettigan, 2013; for a more comprehensive overview of potential adverse listening conditions see Mattys et al., 2012). Energetic masking describes the competition of auditory target and masker in the auditory periphery due to spectro-temporal overlay of the two signals, causing an overlap of excitation patterns in the cochlea and auditory nerve (Durlach et al., 2003). One type of background signal often assumed to cause primarily energetic masking is white noise (e.g., Arbogast et al., 2005) which is quasi-stationary and has high energy in a broad frequency range (for discussion see Stone et al., 2012). Although informational masking is sometimes defined only negatively as all masking effects not accounted for by energetic masking (cf. Gutschalk et al., 2008), a more refined definition is required, especially when it comes to speech processing. When target speech is masked by a competing talker, it is not just the energetic overlap of the two signals that causes masker interference. Rather, the speech masker initiates phonetic and semantic processing that interferes with the linguistic processing of the target (Schneider et al., 2007). Thus, informational masking describes the interference of target and masker at a more central, cognitive level, whereas energetic masking refers to energetic overlap in the auditory periphery.
According to the framework described above, alpha oscillations might be important for inhibition of both types of maskers, however, in different brain areas. We presume that energetic maskers are inhibited by enhanced alpha activity in auditory cortex (Müller and Weisz, 2012). In contrast, processing of informational maskers like competing speech should rather be inhibited by alpha activity in higher auditory areas such as posterior superior temporal gyrus (pSTG) and beyond, relevant for linguistic processing (Scott et al., 2004, 2009). In addition to the proposed inhibition of auditory input, alpha oscillations are involved in supramodal or crossmodal inhibition of the currently task-irrelevant modality (Banerjee et al., 2011).
5. Effects of Age and Hearing Loss on Auditory Distractor Inhibition
In acoustically demanding multi-talker situations, older listeners typically experience more difficulties compared with younger adults. It is however unclear, in how far these difficulties are caused by age-related decline in perceptual auditory acuity (hearing loss or loss of temporal and spectral resolution; Fostick and Babkoff, 2013), decline of cognitive functioning with age, or both (Wingfield et al., 2005). Crucial for the present framework, however, both auditory perceptual and cognitive decline could lead to insufficient masker inhibition. First, compared with normal-hearing controls, listeners with hearing loss are less successful in utilizing spectral (Lorenzi et al., 2006), temporal (Tremblay et al., 2003), and spatial auditory cues (Neher et al., 2009) important for the perceptual segregation of different sound sources. Thus, attending to relevant and inhibiting irrelevant sound sources is impaired, as auditory features are lacking to distinguish the different sound sources in the first place (Shinn-Cunningham and Best, 2008). Second, age negatively affects many aspects of cognitive functioning (Park et al., 2003), amongst it the ability to suppress irrelevant but salient auditory distractors (Chao and Knight, 1997; Tun et al., 2002; Passow et al., 2014). Thus, even if the perceptual segregation of sound sources is accomplished successfully, the insufficient inhibition of maskers may cause interference.
In line with prior studies that found age effects on brain oscillatory activity in the alpha frequency range (Yordanova et al., 1998; Klimesch, 1999; Böttger et al., 2002), we consider it valuable to investigate alpha oscillations in demanding listening tasks as an indicator of age-dependent auditory cognitive effort of masker inhibition. We presume that auditory selective inhibition, realized by alpha activity in channels relevant for masker processing (Figure 1C), serves as a compensatory mechanism as multi-talker listening conditions become more demanding, for instance due to a decreasing signal-to-noise ratio (SNR). The study of alpha oscillations could help to reveal how listeners of different age exert top–down attentional control to facilitate processing of task-relevant signals and inhibit processing of interfering maskers. In particular, this line of research might foster the understanding of why older listeners find it more exhausting to participate in cocktail party-like listening situations compared with younger listeners (Pichora–Fuller, 2003).
6. Conclusions
In this perspective article, we have presented a framework for studying alpha oscillations as a tool for auditory selective inhibition in challenging listening situations. We have presented initial evidence qualifying alpha oscillations as a pivotal mechanism affecting listening in multi-talker situations. Future studies could expand these findings and study the role of alpha oscillations (1) during speech perception in ecologically valid listening situations, (2) in the presence of energetic and informational maskers, and (3) for aging and hearing-impaired listeners.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
Antje Strauß, Malte Wöstmann, and Jonas Obleser are supported by a Max Planck Research Grant to Jonas Obleser. The authors are grateful for in-depth discussions with the members of the Max Planck Research Group “Auditory Cognition” during manuscript preparation.
References
Ahveninen, J., Huang, S., Belliveau, J. W., Chang, W.-T., and Hämäläinen, M. (2013). Dynamic oscillatory processes governing cued orienting and allocation of auditory attention. J. Cogn. Neurosci. 25, 1926–1943. doi: 10.1162/jocn_a_00452
Arbogast, T. L., Mason, C. R., and Kidd, G. Jr. (2005). The effect of spatial separation on informational masking of speech in normal-hearing and hearing-impaired listeners. J. Acoust. Soc. Am. 117, 2169–2180. doi: 10.1121/1.1861598
Banerjee, S., Snyder, A. C., Molholm, S., and Foxe, J. J. (2011). Oscillatory alpha-band mechanisms and the deployment of spatial attention to anticipated auditory and visual target locations: supramodal or sensory-specific control mechanisms? J. Neurosci. 31, 9923–9932. doi: 10.1523/JNEUROSCI.4660-10.2011
Berger, H. (1931). Über das Elektrenkephalogramm des Menschen. Arch. Psychiatr. Nervenkr. 94, 16–60. doi: 10.1007/BF01835097
Bizley, J. K., and Cohen, Y. E. (2013). The what, where and how of auditory-object perception. Nat. Rev. Neurosci. 14, 693–707. doi: 10.1038/nrn3565
Böttger, D., Herrmann, C. S., and von Cramon, D. Y. (2002). Amplitude differences of evoked alpha and gamma oscillations in two different age groups. Int. J. Psychophysiol. 45, 245–251. doi: 10.1016/S0167-8760(02)00031-4
Broadbent, D. E. (1958). Perception and Communication. Oxford, UK: Pergamon Press. doi: 10.1037/10037-000
Brungart, D. S., Simpson, B. D., Ericson, M. A., and Scott, K. R. (2001). Informational and energetic masking effects in the perception of multiple simultaneous talkers. J. Acoust. Soc. Am. 110, 2527–2538. doi: 10.1121/1.1408946
Buffalo, E. A., Fries, P., Landman, R., Buschman, T. J., and Desimone, R. (2011). Laminar differences in gamma and alpha coherence in the ventral stream. Proc. Natl. Acad. Sci. U.S.A. 108, 11262–11267. doi: 10.1073/pnas.1011284108
Busch, N. A., Dubois, J., and VanRullen, R. (2009). The phase of ongoing EEG oscillations predicts visual perception. J. Neurosci. 29, 7869–7876. doi: 10.1523/JNEUROSCI.0113-09.2009
Chang, E. F., Rieger, J. W., Johnson, K., Berger, M. S., Barbaro, N. M., and Knight, R. T. (2010). Categorical speech representation in human superior temporal gyrus. Nat. Neurosci. 13, 1428–1432. doi: 10.1038/nn.2641
Chao, L. L., and Knight, R. T. (1997). Prefrontal deficits in attention and inhibitory control with aging. Cereb. Cortex 7, 63–69. doi: 10.1093/cercor/7.1.63
Cherry, E. C. (1953). Some experiments on the recognition of speech, with one and with two ears. J. Accoust. Soc. Am. 25, 975–979. doi: 10.1121/1.1907229
Cohen, M. X., Elger, C. E., and Fell, J. (2009). Oscillatory activity and phase-amplitude coupling in the human medial frontal cortex during decision making. J. Cogn. Neurosci. 21, 390–402. doi: 10.1162/jocn.2008.21020
Desimone, R., and Duncan, J. (1995). Neural mechanisms of selective visual attention. Annu. Rev. Neurosci. 18, 193–222. doi: 10.1146/annurev.ne.18.030195.001205
Ding, N., and Simon, J. Z. (2012). Emergence of neural encoding of auditory objects while listening to competing speakers. Proc. Natl. Acad. Sci. U.S.A 109, 11854–11859. doi: 10.1073/pnas.1205381109
Durlach, N. I., Mason, C. R. Jr., G. K., Arbogast, T. L., Colburn, H. S., and Shinn-Cunningham, B. G. (2003). Note on informational masking (l). J. Acoust. Soc. Am. 113, 2984–2987. doi: 10.1121/1.1570435
Farah, M. J., Wong, A. B., Monheit, M. A., and Morrow, L. A. (1989). Parietal lobe mechanisms of spatial attention: modality-specific or supramodal? Neuropsychologia 27, 461–470. doi: 10.1016/0028-3932(89)90051-1
Fostick, L., and Babkoff, H. (2013). Temporal and non-temporal processes in the elderly. J. Basic Clin. Physiol. Pharmacol. 24, 191–199. doi: 10.1515/jbcpp-2013-0049
Foxe, J. J., and Snyder, A. C. (2011). The role of alpha-band brain oscillations as a sensory suppression mechanism during selective attention. Front. Psychol. 2:154. doi: 10.3389/fpsyg.2011.00154
Freunberger, R., Fellinger, R., Sauseng, P., Gruber, W., and Klimesch, W. (2009). Dissociation between phase-locked and nonphase-locked alpha oscillations in a working memory task. Hum. Brain Mapp. 30, 3417–3425. doi: 10.1002/hbm.20766
Fritz, J. B., Elhilali, M., David, S. V., and Shamma, S. A. (2007). Auditory attention-focusing the searchlight on sound. Curr. Opin. Neurobiol. 17, 437–455. doi: 10.1016/j.conb.2007.07.011
Griffiths, T. D., and Warren, J. D. (2004). What is an auditory object? Nat. Rev. Neurosci. 5, 887–892. doi: 10.1038/nrn1538
Gutschalk, A., Micheyl, C., and Oxenham, A. J. (2008). Neural correlates of auditory perceptual awareness under informational masking. PLoS Biol. 6:e138. doi: 10.1371/journal.pbio.0060138
Haegens, S., Luther, L., and Jensen, O. (2012). Somatosensory anticipatory alpha activity increases to suppress distracting input. J. Cogn. Neurosci. 24, 677–685. doi: 10.1162/jocn_a_00164
Haegens, S., Nácher, V., Luna, R., Romo, R., and Jensen, O. (2011). α-oscillations in the monkey sensorimotor network influence discrimination performance by rhythmical inhibition of neuronal spiking. Proc. Natl. Acad. Sci. U.S.A. 108, 19377–19382. doi: 10.1073/pnas.1117190108
Hartmann, T., Schlee, W., and Weisz, N. (2012). It's only in your head: expectancy of aversive auditory stimulation modulates stimulus-induced auditory cortical alpha desynchronization. Neuroimage 60, 170–178. doi: 10.1016/j.neuroimage.2011.12.034
Heinrich, A., Carlyon, R. P., Davis, M. H., and Johnsrude, I. S. (2011). The continuity illusion does not depend on attentional state: FMRI evidence from illusory vowels. J. Cogn. Neurosci. 23, 2675–2689. doi: 10.1162/jocn.2011.21627
Helfer, K. S., Chevalier, J., and Freyman, R. L. (2010). Aging, spatial cues, and single- versus dual-task performance in competing speech perception. J. Acoust. Soc. Am. 128, 3625–3633. doi: 10.1121/1.3502462
Jensen, O., Bonnefond, M., and VanRullen, R. (2012). An oscillatory mechanism for prioritizing salient unattended stimuli. Trends Cogn. Sci. 16, 200–206. doi: 10.1016/j.tics.2012.03.002
Jensen, O., Gelfand, J., Kounios, J., and Lisman, J. E. (2002). Oscillations in the alpha band (9–12 hz) increase with memory load during retention in a short-term memory task. Cereb. Cortex 12, 877–882. doi: 10.1093/cercor/12.8.877
Jensen, O., and Mazaheri, A. (2010). Shaping functional architecture by oscillatory alpha activity: gating by inhibition. Front. Hum. Neurosci. 4:186. doi: 10.3389/fnhum.2010.00186
Kaiser, J., Heidegger, T., Wibral, M., Altmann, C. F., and Lutzenberger, W. (2007). Alpha synchronization during auditory spatial short-term memory. Neuroreport 18, 1129–1132. doi: 10.1097/WNR.0b013e32821c553b
Kerlin, J. R., Shahin, A. J., and Miller, L. M. (2010). Attentional gain control of ongoing cortical speech representations in a “cocktail party”. J. Neurosci. 30, 620–628. doi: 10.1523/JNEUROSCI.3631-09.2010
Kinsey, K., Anderson, S. J., Hadjipapas, A., and Holliday, I. E. (2011). The role of oscillatory brain activity in object processing and figure-ground segmentation in human vision. Int. J. Psychophysiol. 79, 392–400. doi: 10.1016/j.ijpsycho.2010.12.007
Klimesch, W. (1999). EEG alpha and theta oscillations reflect cognitive and memory performance: a review and analysis. Brain Res. Rev. 29, 169–195. doi: 10.1016/S0165-0173(98)00056-3
Klimesch, W. (2012). Alpha-band oscillations, attention, and controlled access to stored information. Trends Cogn. Sci. 16, 606–617. doi: 10.1016/j.tics.2012.10.007
Larsby, B., Hällgren, M., Lyxell, B., and Arlinger, S. (2005). Cognitive performance and perceived effort in speech processing tasks: effects of different noise backgrounds in normal-hearing and hearing-impaired subjects. Int. J. Audiol. 44, 131–143. doi: 10.1080/14992020500057244
Lee, A. K. C., Larson, E., Maddox, R. K., and Shinn-Cunningham, B. G. (2013). Using neuroimaging to understand the cortical mechanisms of auditory selective attention. Hear. Res. 307, 111–120. doi: 10.1016/j.heares.2013.06.010
Leiberg, S., Lutzenberger, W., and Kaiser, J. (2006). Effects of memory load on cortical oscillatory activity during auditory pattern working memory. Brain Res. 1120, 131–140. doi: 10.1016/j.brainres.2006.08.066
Leske, S., Tse, A., Oosterhof, N. N., Hartmann, T., Müller, N., Keil, J., et al. (2013). The strength of alpha and beta oscillations parametrically scale with the strength of an illusory auditory percept. Neuroimage 88C, 69–78. doi: 10.1016/j.neuroimage.2013.11.014
Lorenzi, C., Gilbert, G., Carn, H., Garnier, S., and Moore, B. C. J. (2006). Speech perception problems of the hearing impaired reflect inability to use temporal fine structure. Proc. Natl. Acad. Sci. U.S.A. 103, 18866–18869. doi: 10.1073/pnas.0607364103
Luo, H., Husain, F. T., Horwitz, B., and Poeppel, D. (2005). Discrimination and categorization of speech and non-speech sounds in an MEG delayed-match-to-sample study. Neuroimage 28, 59–71. doi: 10.1016/j.neuroimage.2005.05.040
Mathewson, K. E., Lleras, A., Beck, D. M., Fabiani, M., Ro, T., and Gratton, G. (2011). Pulsed out of awareness: EEG alpha oscillations represent a pulsed-inhibition of ongoing cortical processing. Front. Psychol. 2:99. doi: 10.3389/fpsyg.2011.00099
Mattys, S. L., Brooks, J., and Cooke, M. (2009). Recognizing speech under a processing load: dissociating energetic from informational factors. Cogn. Psychol. 59, 203–243. doi: 10.1016/j.cogpsych.2009.04.001
Mattys, S. L., Davis, M. H., Bradlow, A. R., and Scott, S. K. (2012). Speech recognition in adverse conditions: a review. Lang. Cogn. Process. 27, 953–978. doi: 10.1080/01690965.2012.705006
Melara, R. D., Rao, A., and Tong, Y. (2002). The duality of selection: excitatory and inhibitory processes in auditory selective attention. J. Exp. Psychol. Hum. Percept. Perform. 28, 279–306. doi: 10.1037/0096-1523.28.2.279
Meyer, L., Obleser, J., and Friederici, A. D. (2013). Left parietal alpha enhancement during working memory-intensive sentence processing. Cortex 49, 711–721. doi: 10.1016/j.cortex.2012.03.006
Müller, N., and Weisz, N. (2012). Lateralized auditory cortical alpha band activity and interregional connectivity pattern reflect anticipation of target sounds. Cereb. Cortex 22, 1604–1613. doi: 10.1093/cercor/bhr232
Neher, T., Behrens, T., Carlile, S., Jin, C., Kragelund, L., Petersen, A. S., et al. (2009). Benefit from spatial separation of multiple talkers in bilateral hearing-aid users: effects of hearing loss, age, and cognition. Int. J. Audiol. 48, 758–774. doi: 10.3109/14992020903079332
Obleser, J., and Weisz, N. (2012). Suppressed alpha oscillations predict intelligibility of speech and its acoustic details. Cereb. Cortex 22, 2466–2477. doi: 10.1093/cercor/bhr325
Obleser, J., Wöstmann, M., Hellbernd, N., Wilsch, A., and Maess, B. (2012). Adverse listening conditions and memory load drive a common alpha oscillatory network. J. Neurosci. 32, 12376–12383. doi: 10.1523/JNEUROSCI.4908-11.2012
Palva, S., and Palva, J. M. (2011). Functional roles of alpha-band phase synchronization in local and large-scale cortical networks. Front. Psychol. 2:204. doi: 10.3389/fpsyg.2011.00204
Park, H. L., O'Connell, J. E., and Thomson, R. G. (2003). A systematic review of cognitive decline in the general elderly population. Int. J. Geriatr. Psychiatry 18, 1121–1134. doi: 10.1002/gps.1023
Passow, S., Westerhausen, R., Hugdahl, K., Wartenburger, I., Heekeren, H. R., Lindenberger, U., et al. (2014). Electrophysiological correlates of adult age differences in attentional control of auditory processing. Cereb. Cortex 24, 249–260. doi: 10.1093/cercor/bhs306
Petkov, C. I., Kang, X., Alho, K., Bertrand, O., Yund, E. W., and Woods, D. L. (2004). Attentional modulation of human auditory cortex. Nat. Neurosci. 7, 658–663. doi: 10.1038/nn1256
Pfurtscheller, G., Stancák, Jr., A., and Neuper, C. (1996). Event-related synchronization (ERS) in the alpha band – an electrophysiological correlate of cortical idling: a review. Int. J. Psychophysiol. 24, 39–46. doi: 10.1016/S0167-8760(96)00066-9
Pichora–Fuller, M. K. (2003). Cognitive aging and auditory information processing. Int. J. Audiol. 42, 2S26–2S32. doi: 10.3109/14992020309074641
Rabinowitz, N. C., Willmore, B. D. B., King, A. J., and Schnupp, J. W. H. (2013). Constructing noise-invariant representations of sound in the auditory pathway. PLoS Biol. 11:e1001710. doi: 10.1371/journal.pbio.1001710
Rauschecker, J. P., and Scott, S. K. (2009). Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat. Neurosci. 12, 718–724. doi: 10.1038/nn.2331
Romei, V., Gross, J., and Thut, G. (2010). On the role of prestimulus alpha rhythms over occipito-parietal areas in visual input regulation: correlation or causation? J. Neurosci. 30, 8692–8697. doi: 10.1523/JNEUROSCI.0160-10.2010
Romei, V., Thut, G., Mok, R. M., Schyns, P. G., and Driver, J. (2012). Causal implication by rhythmic transcranial magnetic stimulation of alpha frequency in feature-based local vs. global attention. Eur. J. Neurosci. 35, 968–974. doi: 10.1111/j.1460-9568.2012.08020.x
Roux, F., and Uhlhaas, P. J. (2014). Working memory and neural oscillations: alpha–gamma versus theta–gamma codes for distinct WM information? Trends Cogn. Sci. 18, 16–25. doi: 10.1016/j.tics.2013.10.010
Roux, F., Wibral, M., Singer, W., Aru, J., and Uhlhaas, P. J. (2013). The phase of thalamic alpha activity modulates cortical gamma-band activity: evidence from resting-state MEG recordings. J. Neurosci. 33, 17827–17835. doi: 10.1523/JNEUROSCI.5778-12.2013
Schneider, B. A., Li, L., and Daneman, M. (2007). How competing speech interferes with speech comprehension in everyday listening situations. J. Am. Acad. Audiol. 18, 559–572. doi: 10.3766/jaaa.18.7.4
Scott, S. K., and McGettigan, C. (2013). The neural processing of masked speech. Hear. Res. 303, 58–66. doi: 10.1016/j.heares.2013.05.001
Scott, S. K., Rosen, S., Beaman, C. P., Davis, J. P., and Wise, R. J. S. (2009). The neural processing of masked speech: evidence for different mechanisms in the left and right temporal lobes. J. Acoust. Soc. Am. 125, 1737–1743. doi: 10.1121/1.3050255
Scott, S. K., Rosen, S., Wickham, L., and Wise, R. J. S. (2004). A positron emission tomography study of the neural basis of informational and energetic masking effects in speech perception. J. Acoust. Soc. Am. 115, 813–821. doi: 10.1121/1.1639336
Shinn-Cunningham, B. G. (2008). Object-based auditory and visual attention. Trends Cogn. Sci. 12, 182–186. doi: 10.1016/j.tics.2008.02.003
Shinn-Cunningham, B. G., and Best, V. (2008). Selective attention in normal and impaired hearing. Trends Amplif. 12, 283–299. doi: 10.1177/1084713808325306
Spaak, E., Bonnefond, M., Maier, A., Leopold, D. A., and Jensen, O. (2012). Layer-specific entrainment of gamma-band neural activity by the alpha rhythm in monkey visual cortex. Curr. Biol. 22, 2313–2318. doi: 10.1016/j.cub.2012.10.020
Stone, M. A., Füllgrabe, C., and Moore, B. C. J. (2012). Notionally steady background noise acts primarily as a modulation masker of speech. J. Acoust. Soc. Am. 132, 317–326. doi: 10.1121/1.4725766
Strauß, A., Kotz, S. A., Scharinger, M., and Obleser, J. (2014). Alpha and theta brain oscillations index dissociable processes in spoken word recognition. Neuroimage. doi: 10.1016/j.neuroimage.2014.04.005. [Epub ahead of print].
Strauss, D. J., Corona-Strauss, F. I., Trenado, C., Bernarding, C., Reith, W., Latzel, M., et al. (2010). Electrophysiological correlates of listening effort: neurodynamical modeling and measurement. Cogn. Neurodyn. 4, 119–131. doi: 10.1007/s11571-010-9111-3
Tremblay, K. L., Piskosz, M., and Souza, P. (2003). Effects of age and age-related hearing loss on the neural representation of speech cues. Clin. Neurophysiol. 114, 1332–1343. doi: 10.1016/S1388-2457(03)00114-7
Tun, P. A., O'Kane, G., and Wingfield, A. (2002). Distraction by competing speech in young and older adult listeners. Psychol. Aging 17, 453–467. doi: 10.1037/0882-7974.17.3.453
van Dijk, H., Schoffelen, J.-M., Oostenveld, R., and Jensen, O. (2008). Prestimulus oscillatory activity in the alpha band predicts visual discrimination ability. J. Neurosci. 28, 1816–1823. doi: 10.1523/JNEUROSCI.1853-07.2008
Weisz, N., Lüchinger, C., Thut, G., and Müller, N. (2014). Effects of individual alpha rTMS applied to the auditory cortex and its implications for the treatment of chronic tinnitus. Hum. Brain Mapp. 35, 14–29. doi: 10.1002/hbm.22152
Wild, C. J., Yusuf, A., Wilson, D. E., Peelle, J. E., Davis, M. H., and Johnsrude, I. S. (2012). Effortful listening: the processing of degraded speech depends critically on attention. J. Neurosci. 32, 14010–14021. doi: 10.1523/JNEUROSCI.1528-12.2012
Wilsch, A., Henry, M. J., Herrmann, B., Maess, B., and Obleser, J. (2014). Alpha oscillatory dynamics index temporal expectation benefits in working memory. Cereb. Cortex. doi: 10.1093/cercor/bhu004. [Epub ahead of print].
Wingfield, A., Tun, P. A., and McCoy, S. L. (2005). Hearing loss in older adulthood what it is and how it interacts with cognitive performance. Curr. Direct. Psychol. Sci. 14, 144–148. doi: 10.1111/j.0963-7214.2005.00356.x
Yordanova, J. Y., Kolev, V. N., and Başar, E. (1998). EEG theta and frontal alpha oscillations during auditory processing change with aging. Electroencephalogr. Clin. Neurophysiol. 108, 497–505. doi: 10.1016/S0168-5597(98)00028-8
Keywords: alpha, neural oscillations, effortful listening, inhibition, masking, speech, aging, hearing loss
Citation: Strauß A, Wöstmann M and Obleser J (2014) Cortical alpha oscillations as a tool for auditory selective inhibition. Front. Hum. Neurosci. 8:350. doi: 10.3389/fnhum.2014.00350
Received: 27 February 2014; Accepted: 08 May 2014;
Published online: 28 May 2014.
Edited by:
Carolyn McGettigan, Royal Holloway University of London, UKReviewed by:
Johanna M. Zumer, University of Birmingham, UKRebecca E. Millman, York NeuroImaging Centre, UK
Copyright © 2014 Strauß, Wöstmann and Obleser. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Antje Strauß, Max Planck Research Group “Auditory Cognition”, Max Planck Institute for Human Cognitive and Brain Sciences, Stephanstraße 1A, 04103 Leipzig, Germany e-mail: strauss@cbs.mpg.de
†These authors have contributed equally to this work.