- 1Chair of Cognitive and Clinical Neuroscience, Faculty of Psychology, Technische Universität Dresden, Dresden, Germany
- 2Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
Predictive processing, a leading theoretical framework for sensory processing, suggests that the brain constantly generates predictions on the sensory world and that perception emerges from the comparison between these predictions and the actual sensory input. This requires two distinct neural elements: generative units, which encode the model of the sensory world; and prediction error units, which compare these predictions against the sensory input. Although predictive processing is generally portrayed as a theory of cerebral cortex function, animal and human studies over the last decade have robustly shown the ubiquitous presence of prediction error responses in several nuclei of the auditory, somatosensory, and visual subcortical pathways. In the auditory modality, prediction error is typically elicited using so-called oddball paradigms, where sequences of repeated pure tones with the same pitch are at unpredictable intervals substituted by a tone of deviant frequency. Repeated sounds become predictable promptly and elicit decreasing prediction error; deviant tones break these predictions and elicit large prediction errors. The simplicity of the rules inducing predictability make oddball paradigms agnostic about the origin of the predictions. Here, we introduce two possible models of the organizational topology of the predictive processing auditory network: (1) the global view, that assumes that predictions on the sensory input are generated at high-order levels of the cerebral cortex and transmitted in a cascade of generative models to the subcortical sensory pathways; and (2) the local view, that assumes that independent local models, computed using local information, are used to perform predictions at each processing stage. In the global view information encoding is optimized globally but biases sensory representations along the entire brain according to the subjective views of the observer. The local view results in a diminished coding efficiency, but guarantees in return a robust encoding of the features of sensory input at each processing stage. Although most experimental results to-date are ambiguous in this respect, recent evidence favors the global model.
1. Introduction
The massive bundle of corticofugal fibers stemming from auditory cortex and targeting nuclei of the subcortical auditory pathway (Winer, 1984, 2005b; Schofield, 2011) have posed a puzzling problem to the auditory neuroscience community for decades (Syka et al., 1988; Winer, 2005a; Robinson and McAlpine, 2009; He and Yu, 2010). Sensory processing is classically understood as a bottom up problem, where increasingly complex features are read-out in a hierarchical constructive manner (Epstein, 1993; Martin, 1994; DeCharms and Zador, 2000). But then, what is the corticofugal system good for, and why is it that massive?
One possibility is that sensory processing is not a purely bottom-up process, but that top-down information is used proactively to encode sensory input (Mumford, 1992; Rao and Ballard, 1999; Friston, 2003, 2005). This is the thesis of the predictive processing framework (PPF) (Heeger, 2017; Spratling, 2017; Keller and Mrsic-Flogel, 2018; Walsh et al., 2020): that higher-level regions of the cerebral cortex keep and update a model of the sensory world that is used to predict, in a generative manner, the sensory input at lower-level regions of the cerebral cortex; and that neurons at those lower-level regions encode prediction error: the difference between the predictions and the actual input. Prediction error is further conveyed to the higher-level representation and used there to adjust the generative model. Extending this role to the corticofugal system between cerebral cortex and subcortical sensory pathway nuclei suggests that predictions drawn by generative models in cerebral cortex are conveyed to subcortical sensory neurons that encode prediction error (Von Kriegstein et al., 2008; Diaz et al., 2012; Malmierca et al., 2015). Many authors have argued that the PPF might underlay cognitive processes beyond perception including (e.g.,): vocalization in humans (Okada et al., 2018) and birds (Yildiz and Kiebel, 2011), learning in cognitive development (Nagai, 2019), episodic memory (Barron et al., 2020), abstract cognition and reasoning (Spratling, 2016), inculturation (Fabry, 2018), and even the emergence of faith (Andersen, 2019). Here we focus on sensory processing and, in particular, on auditory perception.
Over the last decade the auditory neuroscience community has robustly shown the predominance of neurons encoding prediction error neurons in subcortical sensory pathway nuclei (Anderson et al., 2009; Malmierca et al., 2009, 2014, 2019; Grimm et al., 2016; Parras et al., 2017; Carbajal and Malmierca, 2018). Although these results are often taken as proof that the corticofugal system is indeed transmitting predictions, most experimental paradigms control predictability using simple rules that can be readily encoded at the same processing stage as the prediction error (Eytan et al., 2003; Mill et al., 2011; Wang et al., 2014; May et al., 2015); i.e., without needing a top-down system. We will call this the ‘local model’ of predictive coding in the following. Conversely, we refer to predictive coding as a ‘global model’ if a generative model at higher-levels of the processing hierarchy generates predictions for the lower levels. The distinction between local and global models of predictive coding is important for understanding the function of the corticofugal pathway. It is also important for the understanding of the nature of sensory processing: if predictions are computed at higher stages of the processing hierarchy and transmitted downwards, that would mean that the auditory system can only make sense of stimuli that are conceivable at these higher-level representations.
Predictability plays an important role in sensory processing: there are many benefits of predictability on behavioral performance in the neurotypical brain (e.g., Davis and Johnsrude, 2007; Jaramillo and Zador, 2011; Sohoglu and Davis, 2016; Mazzucato et al., 2019). If using such predictability for understanding the world is not possible, this likely results in dysfunction. Deficits in the predictive elements of the PPF have already been suggested to explain a number of symptoms in neuropsychiatric conditions, including in schizophrenia (Horga et al., 2014; Sterzer et al., 2018, 2019), autism-spectrum disorders (van de Cruys et al., 2014; van Schalkwyk et al., 2017), attention deficit and hyperactivity (Gonzalez-Gadea et al., 2015), and mood and eating disorders (Frank et al., 2016; Clark et al., 2018). Deficits in the predictive elements of the PPF have also been directly linked to dysfunction of cortico-thalamic pathways and sensory auditory thalamus in developmental dyslexia (Diaz et al., 2012; Müller-Axt et al., 2017; Tschentscher et al., 2019). Understanding the computational mechanism for encoding predictability and the role of the corticofugal system in predictive sensory processing is a necessary prerequisite for a mechanistic understanding of these disorders.
In the following, we review the existing literature on predictive processing in the auditory sensory system with a special focus on the potential role of the corticofugal pathway. We focus on audition because it is the modality where subcortical predictive processing has been explored the most in the last decade (Nelken and Ulanovsky, 2007; Garrido et al., 2009; Grimm et al., 2011; Escera and Malmierca, 2014; Malmierca et al., 2015; Heilbron and Chait, 2017; Carbajal and Malmierca, 2018).
2. Global and Local Models of the Predictive Processing Framework
A longstanding question on sensory processing is whether perception is purely exploratory or rather a process of inference (Von Helmholtz, 1867; Atick, 1992; Bejjanki et al., 2011; Lochmann and Deneve, 2011; Purves et al., 2015; de Lange et al., 2018). In the exploratory view, observers passively receive information from their senses and construct a representation of their sensory surrounds based on a lump of perceptual objects (Epstein, 1993; Martin, 1994; Quiroga et al., 2005; Chechik et al., 2006; Wood et al., 2019). The exploratory view is implemented by the so-called representational framework of sensory processing (Epstein, 1993; DeCharms and Zador, 2000; DiCarlo et al., 2012). The representational framework sees perception as a constructive process carried out by a cascade of feature detectors: neurons that analyse neural activity at the immediately lower hierarchical stage and respond selectively to certain activation patterns (Figure 1A). For instance, a neuron that responds selectively to the word percept integrates inputs from neurons encoding the syllables per and cept; the neuron encoding per receives inputs from neurons encoding p-, e, and -r; and, if the word is decoded from the auditory modality, the neuron encoding p- receives inputs from the neurons encoding each of the formant transitions (frequency-modulated sweeps) that characterize the consonant.
Figure 1. Representational and predictive processing frameworks. (A) Schematic view of a possible implementation of the representational framework during the decoding of the syllable /per-/. (B) Schematic view of a possible implementation of the predictive processing framework while performing predictions on the incoming syllable /-cept/. Purple features are predictions. The middle panel depicts a simulation of the neural activity in the auditory nerve across the tonotopic axis (y-axis) elicited by the spoken word percept. (C) Example implementation of the local model: whereas predictions at the word level are used to encode prediction error at the immediately lower level as formulated by the PPF, predictions are not used to calculate the predictive model of the lower stages, and thus the remaining levels depend on bottom-up information only. (D) Example implementation of the global model: predictions at the word level propagate downwards affecting the predictive model of all stages (dashed descending arrows), so that prediction error at the lowest level is encoded with respect to a model that is ultimately defined in the highest level. Intermediate implementations where interactions between the models exist up to a certain level only are also possible but not displayed here. Note that (B) depicts the global model.
The PPF (Heeger, 2017; Spratling, 2017; Keller and Mrsic-Flogel, 2018; Walsh et al., 2020) presents the same hierarchical organization as the representational framework, but the feature detectors are used for inference rather than exploration. In the PPF, a feature detector needs two ingredients: a (generative) model that builds hypotheses about the sensory world, and a prediction error unit that tests these hypotheses against the actual sensory input. The PPF is intimately linked with Bayesian inference (Friston, 2005; Kiebel et al., 2008), where the posterior conclusions drawn from the data are amplifications or reductions of a prior belief. This means that according to the PPF we are more likely to perceive what we expect. In an extreme interpretation of the framework, it implies that if we do not have an implicit prior belief that a perceptual object might exist, we cannot perceive its existence at all. Today, evidence from psychophysics (de Lange et al., 2018), human neuroimaging (Siman-Tov et al., 2019; Walsh et al., 2020), animal neurophysiology (Bendixen et al., 2012; Carbajal and Malmierca, 2018; Pakan et al., 2018), and theoretical neuroscience (Brenner et al., 2000; Fairhall et al., 2001; Huang and Rao, 2011; Badcock et al., 2019), converges in the idea that predictions on the sensory world are constantly used to encode sensory input.
To-date there are at least three different algorithms describing how the PPF could be implemented in the brain (for a review, see Spratling, 2017). All of them hypothesize the existence of two kinds of sensory neurons: those that encode the generative model, and those that encode prediction error. A neuron encoding the generative model at a certain stage of the processing hierarchy k receives inputs from its associated prediction error units, that signal if the model is correct or incorrect. It also receives input from generative models at higher stages l > k, that guide the generation of predictions at level k. A prediction error unit at stage k compares the predictions of its associated generative model at stage k+1 with the sensory inputs incoming from the immediately lower processing stage (Figure 1B).
Although the PPF was first formulated as a theory on sensory processing in the cerebral cortex (Rao and Ballard, 1999; Friston, 2005; Shipp, 2016), the existence of potential prediction error units in the subcortical auditory pathway has been reported widely during the last decade (Anderson et al., 2009; Malmierca et al., 2009, 2014, 2019; Grimm et al., 2016; Parras et al., 2017; Carbajal and Malmierca, 2018). Whether this prediction error is, as proposed by the PPF, a signature of active inference, is still unclear (Carbajal and Malmierca, 2018). If that was the case, prediction error in subcortical sensory structures should signal error with respect to global hypotheses of the sensory world. This means that, if after hearing per- we expect the word to be completed with a -cept (Figure 1B), an auditory signal breaking such prediction (like, for instance, -meable) should elicit prediction errors in those neurons encoding the syllable -cept, but also errors on the neurons encoding the -c- and its corresponding spectrotemporal properties such as formant transitions. This schema assumes that predictions are transmitted downwards through an inverse hierarchical structure (Figure 1D). We call this the global model, because it assumes that predictions at the highest stage in the processing hierarchy are used to inform generative models globally across the brain (Kiebel et al., 2008; Malmierca et al., 2015; Siman-Tov et al., 2019; Casado-Román et al., 2020).
An alternative possibility is that predictions and its associated errors are computed locally (Eytan et al., 2003; Mill et al., 2011; Wang et al., 2014; May et al., 2015). We call this scenario the local model. Predictions at each stage are performed accordingly to the level of abstraction of the local representation and its local time constant of integration. In this scenario, violations to the prediction -cept would elicit prediction error in the populations encoding the syllable -cept, but not in the populations encoding the formant transitions of the syllable -c- (Figure 1C). Although not strictly adherent to the principles of the PPF, this local strategy presents its own advantages. First, it still optimizes the neural code by encoding only those parts of the stimulus that are not predictable. Second, it keeps robust representations of the stimuli that are independent of each other across stages of the processing hierarchy - this has the advantage that it could be used to simultaneously test multiple hypotheses. Third, it does not require a constant top-down transmission of expectations.
3. Prediction Error Responses Are Ubiquitous in the Auditory Thalamus and Midbrain
Prediction error responses in the mammal subcortical auditory pathway (Malmierca et al., 2015; Parras et al., 2017) have been mostly investigated through stimulus-specific adaptation (SSA). SSA is a phenomenon where individual sensory neurons adapt to specific stimulus properties (Ulanovsky et al., 2003, 2004). SSA is typically shown in passive listening conditions (and often in anesthetized animals) using some variation of the classical oddball paradigm: sequences of several repetitions of a standard tone that are interrupted by rarely occurring deviants. Deviants typically differ from the standards only in the tone frequency. In oddball sequences, pure tones are separated by fixed inter-stimulus-intervals (ISI), so that the onset of the next tone in the sequence is always predictable. By repetition of the standard tones, oddball paradigms induce predictions on the frequency of the next tone. The experimenter can control the amount of prediction error elicited by the deviants with two variables: the frequency difference between deviant and standard (which controls the amount of error of the prediction with respect to the actual sensory input) and the probability of occurrence of the deviant (which controls how certain the model is about the prediction that the next stimulus will be a standard). SSA to frequency deviants has been consistently found in the auditory thalamus (medial geniculate body, MGB) (Anderson et al., 2009; Antunes et al., 2010; Richardson et al., 2013; Duque et al., 2014; Parras et al., 2017) and midbrain (inferior colliculus, IC) (Malmierca et al., 2009; Zhao et al., 2011; Duque et al., 2012; Pérez-González et al., 2012; Ayala et al., 2013, 2015, 2016; Ayala and Malmierca, 2015, 2018; Duque and Malmierca, 2015; Parras et al., 2017; Valdés-Baizabal et al., 2017, 2020) of non-human mammals, as well as in the human IC and MGB (Cacciaglia et al., 2015; Tabas et al., 2020). Several studies have failed to detect any SSA in neurons or populations of the first stage of the auditory subcortical pathway: the auditory brainstem (cochlear nucleus, CN) (Duque et al., 2012, 2018; Ayala et al., 2013, 2015; Parras et al., 2017). Although SSA has been mostly investigated using frequency deviants, similar adaptation dynamics have been demonstrated to amplitude modulation deviants (Gao et al., 2014) and, in bats, to frequency modulation deviants (Thomas et al., 2012). However, there seems to be no SSA to loudness deviants (Duque et al., 2016). The SSA magnitude is typically measured with the SSA index, a ratio that compares the neuronal responses to a tone when used as a standard with the responses to the same tone when used as a deviant.
Although positive SSA indices are often taken as an indication that the neuron encodes prediction error (i.e., surprise to the violation of a prediction), positive SSA indices could also result from simple repetition suppression to the standard (Taaseh et al., 2011; Parras et al., 2017; Carbajal and Malmierca, 2018). Parras et al. developed a novel approach to disentangle repetition suppression from prediction error by comparing the responses to deviants in classical oddball sequences with the responses to the same sounds when embedded in control sequences that contain varying non-predictable stimuli. They argued that, if the responses to deviants encoded prediction error, these responses should be stronger when a precise prediction on the incoming stimuli is available (as in oddball sequences) than when predictions are broad (as in the control sequences, where all control stimuli have the same likelihood of occurrence). They termed the difference of the responses to the deviant tone in oddball and control sequences the index of prediction error (iPE), and demonstrated that neurons showing SSA do typically show positive iPEs. However, iPE>0 is not a sufficient indication of prediction error because, as modeling studies have shown (Eytan et al., 2003; Mill et al., 2011, 2012), positive iPEs can also arise due to simple repetition suppression due to suppressed inhibition. In any case, it is useful to consider SSA as an aggregation of two separate phenomena: the suppression of the responses to the standards, and the recovery of the responses to the deviant.
The IC, MGB, and also auditory cortex are typically subdivided in primary and secondary subdivisions. The bulk of primary subdivisions, consisting of the entire CN, the central nucleus of the IC (cIC), and the ventral section of the MGB (vMGB), constitute the so-called lemniscal pathway, characterized by narrow frequency tuning bands and faithful encoding of the stimulus properties (Lee and Sherman, 2011).
The bulk of secondary subdivisions, consisting of the cortex of the IC (xIC), and the medial (mMGB) and dorsal (dMGB) sections of the MGB, constitute the non-lemniscal pathway, characterized by wider or absent frequency tuning and stronger corticofugal projections (Lee and Sherman, 2011). While the primary or lemniscal subdivision is attributed with the task of transmitting sensory information directly to the cerebral cortex, the secondary or non-lemniscal subvidision is thought to play a secondary role (Lee and Sherman, 2011). If the PPF is the main mechanism for sensory processing, it should govern sensory coding in the lemniscal pathway.
Animal studies seem to converge in that SSA is more prevalent (i.e., present in a larger fraction of neurons) and stronger (i.e., showing larger SSA indices) in non-lemniscal sections of IC and MGB (Anderson et al., 2009; Ayala et al., 2016; Parras et al., 2017). SSA neurons in the non-lemniscal pathways also show larger iPEs than their lemniscal counterparts. This finding is, however, not backed by studies in humans, which found no topological organization of SSA across IC or MGB (Cacciaglia et al., 2015; Tabas et al., 2020), or even reported comparable SSA indices in lemniscal and non-lemniscal MGB (Tabas et al., 2020).
SSA is elicited in IC and MGB in both, awake and anesthetized animals (Richardson et al., 2013; Duque and Malmierca, 2015; Parras et al., 2017), and under passive and active listening in humans (Cacciaglia et al., 2015; Tabas et al., 2020). One study reported higher SSA indices under anesthesia due to a global reduction of spontaneous activity (Duque and Malmierca, 2015); another study reported generally higher iPEs in the awake condition (Parras et al., 2017). Therefore, although SSA might be modulated by awareness, it is also present in states of reduced consciousness. This is fully in line with the principles of the PPF, where inference on the sensory world is computed autonomously as a coding strategy, rather than as a conscious inference effort.
In IC and MGB, the SSA index always increases with increasing frequency difference between deviant and standard, with decreasing ISI, and with decreasing probability of occurrence of the deviant. This phenomenology seems to indicate that neurons showing SSA do encode prediction error with respect to the hypothesis that the next tone will be a standard, and that this error is larger when there is a precise hypothesis than when there is none. Whether this model is computed locally (in the IC or the MGB) or globally and projected across the hierarchy (see Figures 1C,D) is still unclear. Early studies interpreted the fact that SSA is more prominent in non-lemniscal subdivisions of the rodent auditory pathway as evidence of global computation (Malmierca et al., 2015; Ayala et al., 2016). Later, evidence that both SSA indices and iPE increase along the rodent ascending auditory pathway led to the interpretation that prediction error is also computed locally at each stage (Parras et al., 2017; Carbajal and Malmierca, 2018). Functional MRI (fMRI) studies in humans also indicated that IC and MGB showed stronger responses to sounds that broke the predictions than to sounds for which predictions were not available. They did, however, not find that these effects were more prominent in the non-lemniscal subdivisions (Cacciaglia et al., 2015; Tabas et al., 2020).
4. Encoding Fidelity in the Auditory Brainstem Is Enhanced by Repetition and Predictability
Electroencephalographic (EEG) methods present a much higher temporal resolution than fMRI (Buxton, 2009), allowing to measure directly the responses to each individual tone in the sequence. However, fine temporal resolution is offered at the expense of spatial precision: triangulating the origin of the evoked potentials in the brain is generally an ill-posed problem. This difficulty makes measuring responses from subcortical nuclei, particularly because they are located centrally in the brain, especially challenging (Boston and Moller, 1985; Coffey et al., 2019).
To-date, the only non-invasive measurements of subcortical auditory evoked activity are the auditory brainstem response (ABR) and the frequency-following response (FFR). The ABR (Jewett et al., 1970; Parkkonen et al., 2009) consist of a series of short transient auditory evoked potentials peaking within 8 ms after tone onset with sources ranging from the auditory nerve up to the MGB. Human ABRs do not seem to show SSA to broadband spectrum deviants (Slabu et al., 2010) nor loudness deviants (Althen et al., 2011).
The FFR (Gerken et al., 1975; Boston and Moller, 1985; Chandrasekaran and Kraus, 2010) is a component of the auditory evoked fields that is synchronized to the acoustical signal. Although the FFR partially stems from sources in cerebral cortex (Coffey et al., 2016), most generators seem to be subcortical (Bidelman, 2018; Coffey et al., 2019), especially when it is synchronized to stimulus frequencies above the cortical limit for phase-locking (estimated to be between 50 and 250 Hz).
Two studies have shown SSA of the absolute power of the FFR in a neighborhood of the frequencies characterizing the stimuli (Shiga et al., 2015; Alho et al., 2019). However, the entrainment of the FFR to the stimulus waveform seems to follow the exact opposite trend than SSA: an increased entrainment to standards as compared to deviants (Chandrasekaran et al., 2009; Strait et al., 2011; Slabu et al., 2012; Skoe et al., 2014; Lau et al., 2017; Font-Alaminos et al., 2020). We call this phenomenology repetition entrainment enhancement. The repetition entrainment enhancement of the FFR seems independent of stimulus class: it has been shown for syllables (Chandrasekaran et al., 2009; Strait et al., 2011; Slabu et al., 2012; Gorina-Careta et al., 2016; Lau et al., 2017; Alho et al., 2019), amplitude modulated tones (Shiga et al., 2015), pitch contours in Mandarin syllables (Skoe et al., 2014), and pure tones (Font-Alaminos et al., 2020). One of these studies showed that the repetition entrainment enhancement is present even when the onset of the sounds is not predictable (i.e., with jittered ISIs) (Slabu et al., 2012), although predictable onsets do result in lower FFR power and higher FFR entrainment (Gorina-Careta et al., 2016). Moreover, two studies showed that the magnitude of the repetition entrainment enhancement correlates with the ability of the subjects to recognize speech in noise (Chandrasekaran et al., 2009; Strait et al., 2011). The fact that the FFR adapts its properties to the stimulation history is contrary to the predictions of a representational framework. The FFR repetition entrainment enhancement is, however, also not straightforward to interpret within the PPF. If the FFR represented prediction error, we would expect a gradual decay of the signal (and with it its SNR and quality of the entrainment) with each repetition of a standard. Another possibility is that the FFR encodes the generative model of the sensory input, which becomes more and more precise with each repetition. However, if that was the case the FFR would always represent expectations, which means that we would expect an FFR tuned to the standard during the presentation of a deviant. It is possible that the generative model corrects itself quickly after detecting that the stimulus is not a standard, which would result in a reduction of the FFR entrainment to the deviant. If that was the case the entrainment to the deviants would be much weaker than to the first standards in the sequence; however, the literature reports that the deviant and first standard elicit the same FFR entrainment (Font-Alaminos et al., 2020). A last possibility is that the FFR has contributions from both, prediction error and generative model units, and that the balance between the contribution of one and the other results in the observed phenomenology.
5. Mixed Evidence on the Global Model Based on Deactivation of High-Order Processing Structures
Studies reviewed so far have established that sensory processing in the auditory pathway cannot be explained by a representational framework. The studies suggested that computation of expectations and prediction error is common to many mammals, and that it occurs even during sleep and anesthesia. However, all these studies use paradigms that have as a core feature repetition to induce predictions on the sensory input. This means that prediction error is computed with respect to a model of the sensory world that could have been generated locally, at the level of the IC and MGB, or globally, at a higher level of the processing hierarchy. Thus, results reviewed so far are ambivalent with respect to the actual organization of the PPF and are compatible with both, the local and global models. The next sections of this review focus on studies that tried to disentangle between these two possibilities.
Neural populations encoding higher levels of abstraction are thought to be encoded at successively higher stages in the processing hierarchy (Kiebel et al., 2008). This hierarchical organization is exquisitely presented in the auditory system, where the CN encodes a faithful representation of raw sensory input (Rhode and Smith, 1986), the IC and MGB encode intermediate features like formant transitions (Kuo and Wu, 2012), and auditory cortex encodes the identity of sounds as complex auditory objects (Chechik and Nelken, 2012). One way to test whether expectations are computed globally and transmitted downwards through the auditory hierarchy is to study whether SSA in IC and MGB depends on the cerebral cortex.
Three animal studies have compared SSA in subcortical sensory pathway nuclei before and during reversible deactivation of the ipsilateral auditory cortex. Two of the studies used a cryoloop to temporarily deactivate rat's auditory cortex, and measured SSA in neurons of the IC (Anderson and Malmierca, 2013) and MGB (Antunes and Malmierca, 2011). Both studies reported that SSA in single neurons was affected by deactivation of the cerebral cortex. The overall averaged amount of SSA in IC and MGB did, however, not significantly change during deactivation. The authors concluded that although cerebral cortex may modulate subcortical SSA, it does not generate it. This means that SSA cannot be solely driven by the global model of the PPF. In contrast, a third study (Bauerle et al., 2011) used muscimol to deactivate auditory cortex and measured SSA in neurons of vMGB (i.e., in the lemniscal section) in gerbils. The authors found that SSA was completely abolished after muscimol application, concluding that SSA indeed depends on cerebral cortex function, supporting the global model.
Divergences between the three studies could be caused by: (1) different deactivation methods, (2) different species, or (3) differences between the lemniscal and non-lemniscal pathways. Whereas deactivation by cryoloop allowed the investigators to show recovered responses after cortical inactivation, the longer recovery periods and possible diffusion of muscimol (Lomber, 1999) prevented Bauerle et al. from recording post-inactivation responses. Thus, Bauerle et al. could not completely rule out that the abolition of SSA after drug administration was due to irreversible damages induced during the application of the drug or diffusion of the drug into thalamus (Bauerle et al., 2011). Although the authors claim that side effects of muscimol were unlikely, reproduction of the results are needed to confirm that deactivation of auditory cortex abolishes SSA in vMGB.
Although the studies using the cryoloop (Antunes and Malmierca, 2011; Anderson and Malmierca, 2013) do not report whether neurons belong to the lemniscal or non-lemniscal subdivisions of the IC and MGB, the relatively elevated SSA indices [SSAi > 0.18 in IC (Anderson and Malmierca, 2013) and average SSAi ~ 0.31 in MGB (Antunes and Malmierca, 2011)] indicate that most recorded neurons in the cryoloop experiments belonged to the non-lemniscal subdivisions. In comparison, SSA indices from the vMGB in Bauerle et al. (2011) were around SSAi = 0.07, even though they used shorter ISIs and higher intensity levels than the cryoloop studies, which potentially elicits higher levels of SSA. One possibility is that the cerebral cortex triggers SSA only in the lemniscal pathway. This would be surprising, given that most corticofugal fibers target neurons in the non-lemniscal subdivisions of the IC and MGB (Lee and Sherman, 2011). Another possibility is that cortical control of non-lemniscal areas depends on the stimuli used and the specific experimental task and that the conditions used in the animal experiments so far do not elicit top-down control of SSA.
The thalamic reticular nucleus (TRN) is a laminar GABAergic nucleus that covers large parts of the thalamus and serves as interface to the cerebral cortex (Ohara and Lieberman, 1985; Pinault, 2004). TRN neurons show even stronger SSA than nuclei of the auditory sensory pathway, with SSA indices that double those of the (non-lemniscal) MGB (Yu et al., 2009). Moreover, TRN deactivation has been shown to affect the responses on MGB after (not during) the presentation of a deviant (Yu et al., 2009). This suggests that the deactivation does not influence the prediction error component of MGB responses, but potentially rather the encoding of the generative model of the sensory world. However, Yu et al. measured the effect of TRN deactivation in just one MGB neuron so these results should be interpreted with caution until replications are available.
In summary, there are only very few studies investigating corticofugal influences on presumed prediction error responses in IC and MGB. Only two studies show that SSA in IC and MGB is driven by top-down control (Yu et al., 2009; Bauerle et al., 2011).
6. Favoring Evidence for the Global Model Based on Manipulation of High-Order Expectations
An alternative approach to study the computational principles of the subcortical sensory pathway nuclei is to measure adaptation in subcortical sensory nuclei while manipulating predictions that are unlikely to stem directly from subcortical processing. Such predictions can be derived either from complex statistical regularities that are unlikely to be encoded in subcortical sensory structures or from cognitive representations that are characterized by high levels of abstraction.
One first step toward such an approach is to use paradigms that tap into so-called meta-adaptation. Meta-adaptation is a phenomenon where adaptation dynamics themselves adapt depending on changes in the context in which the adaptation dynamics occur (Robinson et al., 2016): Robinson and colleagues exposed Guinea pigs to repeated switches between quiet and loud environments. They observed that the adjustment in the dynamic range of neurons in IC accelerated after repeated exposure to the two different environments. Thus, the adaptation of the dynamic range adapted to the novel but familiar environmental context. This meta-adaptation effect largely attenuated after the experimenters deactivated auditory cortex using a cryoloop. Under the light of the PPF, the faster adaptation dynamics would result from the prediction that switches occur often. The result that meta-adaptation on IC depends on the integrity of the cerebral cortex can thus be interpreted as evidence that the generative model is encoded in auditory cortex, favoring the global model.
Malmierca et al. (2019) used an elegant paradigm with complex statistical regularities to investigate responses in the anesthetized rat's IC. The authors used as predictable entity a pattern of two tones that was presented in a repetitive manner (i.e., A-B-A-B-A-B…). To elicit prediction error, the pattern was rarely violated by a repetition of one of the tones (…A-B-A-B-B). The rationale was that the representation of the tone dyad A-B is putatively encoded at higher processing levels than the representation of a single tone typically exploited in SSA experiments. Neurons encoding prediction error in IC would therefore only respond to violations of the pattern if predictions encoded in higher levels are used to predict sensory input in the IC. The authors reported that only 14 of 281 measured samples of IC neurons, located in both lemniscal and non-lemniscal subdivisions of the nucleus, showed statistically significant prediction errors to violation of the patterns. The study was the first to investigate SSA in the subcortical sensory pathway with a paradigm that it likely represented in complex generative models in the brain. However, since the fraction of neurons with significant prediction error reported in the study (14/281≃4.98%) was close to the false-discovery rate of the study (α = 0.05), replications would need to confirm this effect unequivocally.
Yu et al. (2009) used a different approach to control predictability: They used a light to cue the onset of the auditory stimuli while recording from neurons of the anesthetized rat's MGB. They found that the visual cue resulted in significantly suppressed responses in 20 of 118 (≃17%) measured neurons and significantly enhanced responses in 23 of them (≃19.5%), both way above the false discovery rate of the study. Assuming that the causal link connecting the visual cue to the expectations on the auditory input is computed at a processing stage other than the MGB, we interpret these results as evidence for the global model. Favoring this interpretation, the authors show that deactivation of the TRN suppresses the effects of cuing in both directions; however, this result is once again shown in a single neuron and should be interpreted with caution until replications are available.
Lau et al. (2017) showed entrainment enhancement of the FFR in humans driven by high-order predictability using pitch contours of Mandarin syllables. The authors presented a target syllable in three different contexts: an unpredictable context, where the likelihood of the target was 1/3; a repetitive context, where all stimuli were repetitions of the target; and a patterned context, where the target was presented in a pattern of three syllables that was repeated over and over. The results demonstrated that the FFR entrainment was enhanced by predictability (i.e., that the FFR was more correlated to the stimulus waveform in the two predictable contexts than in the unpredictable context). In addition, the entrainment was stronger for the high-order predictability (i.e., in the patterned context) than when predictability was dictated by repetition. Although predictability enhancement cannot be interpreted as prediction error dynamics within the PPF, the result that predictability stemming from a higher level of abstraction has a stronger weight than predictability stemming from repetition in the strength of the FFR supports the global model.
The most recent evidence for the global model comes from a study in humans from our lab (Tabas et al., 2020) where we manipulated high-order predictions while preserving local stimulus statistics. We used fMRI to measure responses in the IC and MGB to a variation of the classical oddball sequence where the predictability of the deviants was manipulated using abstract rules. We disclosed to the participants that in each oddball sequence one of the standards at positions 4, 5, or 6 will be substituted by a deviant (Figure 2A). Since each position was equally likely across the experiment, after 3 repetitions of the standard subjects expect a deviant in position 4 with a likelihood of p = 1/3, after hearing 4 standards they expect a deviant in position 5 with p = 1/2, and after 5 standards subjects fully expect a deviant in position 6. According to the local model of the PPF, only the ratio between deviants and standards will have an effect on the strength of the responses to the deviant tones (Figure 2A, blue); according to the global model of the PPF, the responses will be the weaker the higher the likelihood of the tone according to the abstract rules (Figure 2A, red).
Figure 2. Evidence for the global model in the human IC and MGB. (A) Schematic view of the expected responses for the different trials by the local (blue) and the global (red) models. (B) Posterior probability map of the global (red) and local (blue) models. z-coordinates correspond to the MNI space.
Using Bayesian Model comparison, we showed that responses in the IC and MGB (Figure 2B) were far more likely to be produced by a mechanism following the principles of the global model (where the magnitude of the response decreased with predictability) than by a mechanism following the principles of the local model (where the magnitude of the responses decreased with the number of times the stimulus has been repeated before). The global model was similarly prominent in both lemniscal and non-lemniscal sections of the MGB, revealing once again no particular functional organization of the human auditory pathway in respect of the PPF.
7. Subcortical Predictive Processing in Other Sensory Modalities
Although here we have focused on the auditory modality, it is likely that the processing architecture of other sensory modalities follows similar principles. Indeed, analogous functional and anatomical organizations have been found between the auditory and visual (Rauschecker, 2015), and visual and somatosensory (Pack and Bensmaia, 2015) systems. Moreover, if the auditory pathway is organized according to a global PPF, this organization should necessarily extend to other sensory modalities: otherwise the predictive potential of the global model would be largely under-exploited. There is indeed plenty of evidence that information across modalities is integrated and applied to sensory coding according to the principles of the PPF (see reviews, von Kriegstein, 2012; van Wassenhove, 2013; Talsma, 2015). In this section, we describe a few examples of predictive processing in the visual and somatosensory subcortical pathways.
Predictive coding was originally enunciated as a visual theory (Rao and Ballard, 1999). Most literature on visual predictive processing is concerned with the problem of extra-classical receptive field properties in response to concurrent stimulation (e.g., Aitchison and Lengyel, 2017). Some studies have, however, also considered how predictions on future events are used during the encoding of visual information in the subcortical visual pathway. Evidence for predictive processing of this kind has been reported in the retina (Hosoya et al., 2005; Kastner and Baccus, 2013; Howlett et al., 2017; Johnston et al., 2019; Kastner et al., 2019), including a study demonstrating SSA to movement in retinal bipolar cells (Ölveczky et al., 2007); in the superior colliculus, in the form of SSA to Gabor patterns (Jin and Glickfeld, 2020) and luminance (Boehnke et al., 2011); and in visual thalamus to location and polarity of light bars (Dhruv and Carandini, 2014). In the visual thalamus, predictive feedback has been suggested to stem from corticofugal efferents from primary visual cortex (Jehee and Ballard, 2009; Zabbah et al., 2014), but has yet not been demonstrated empirically. Thus, evidence to-date in the visual subcortical pathway is compatible with both, the global and local models of the PPF.
Adaptation to local stimulus statistics has also been reported in the mammal (Khatri et al., 2004; Mohar et al., 2013; Liu et al., 2017) and human somatosensory thalamus (Allen et al., 2016), but results are compatible with both, the local and global models of the PPF. One of these studies (Mohar et al., 2013) found a functional subdivision of the somatosensory thalamus similar to that of the animal literature in the auditory modality: non-lemniscal subdivisions showed stronger adaptation dynamics than their lemniscal counterparts. Evidence for the global model was provided by a study (Pais-Vieira et al., 2013) that considered the effect of anticipation on somatosensory thalamus during the activation of the facial whiskers of the rat. Pais-Vieira et al. found that effects of anticipation clearly present in somatosensory thalamus vanished after deactivation of the somatosensory cortex with muscimol. Perhaps the most compelling demonstration that the somatosensory pathway is organized according to the PPF is the common placebo effect, described by the PPF as a drastic reduction of pain sensation by the imposition of a strong analgesic prior (Büchel et al., 2014). Favoring the global model, reduction of activity to painful stimulation after the administration of a placebo has been found in the medulla (Matre et al., 2006; Eippert et al., 2009).
8. Conclusion
Converging evidence indicates that hierarchical predictive processing is a key feature, if not the principal encoding strategy, of subcortical sensory pathways. In the auditory modality, it is clear that encoding in the IC and MGB are strongly driven by expectations on the incoming stimuli. There is, however, still mixed evidence about the underlying mechanism of these expectations. We have summarized the two extreme possible architectures of the predictive processing network in two opposing views: a local model, where each stage in the hierarchy encodes its own representation of the stimulus and performs predictions on the representation of the immediately lower stage; and the global model, where a global prediction, encoded at higher processing stages, propagates downwards generating local predictions at all subsequent cortical and subcortical stages. In our review of the literature we have found a few studies favoring the local model, several studies favoring the global model, and a large number of studies whose results are agnostic to the architecture of the predictive processing network.
One possibility is that the feedback propagation of the global model is adapted according to the specific context. Electrophysiological experiments in animals are typically performed under anesthesia. This work has impressively shown that SSA is a fundamental automatic reaction of sensory systems rather than a phenomenon triggered by particular cognitive actions or arousal. In anesthesia, however, animals experience sounds under the same context: that of drug-induced coma. The two studies that investigated pure tone SSA on awake animals (Duque and Malmierca, 2015; Parras et al., 2017) demonstrated that SSA is pervasive in alert states, but used passive listening conditions. Whether the cognitive context and behavioral relevance of the stimuli might have deeper repercussions when more complex models of the sensory world are necessary in order to compute expectations has not being investigated yet. This possibility could explain why evidence for the global model in the IC of anesthesized rats was inconclusive (Malmierca et al., 2019), while there was strong evidence for the global model in awake human participants (Tabas et al., 2020). It is also possible that not all processing stages conform under the same context to the same model: different stages of the processing hierarchy might depend to a greater or smaller degree on high-level expectations. Differences between the two studies could, however, also be explained by the many methodological differences between animal and human studies and by potential species-specific differences in rodent and human sensory systems.
An important open question is exactly where the neural units that encode the model of the sensory world used to compute prediction error in the subcortical nuclei are located. Although in theory there is no reason why prediction error units could not encode the generative model in a multiplexed code, prediction error and the model are usually argued to be encoded in distinct populations (e.g., Friston, 2003, 2005; Bastos et al., 2012; Spratling, 2017; Keller and Mrsic-Flogel, 2018). However, in comparison with the evidence for the ubiquity of prediction error units, evidence for the existence of the generative model units is scarce in the cerebral cortex (Bell et al., 2016; Fiser et al., 2016; Walsh et al., 2020), and practically non-existent in subcortical areas. There is weak evidence that TRN neurons might have an active role on applying these models in MGB (Yu et al., 2009), but the fact that TRN neurons themselves show SSA render this hypothesis unlikely. According to the existing formulations of the PPF (Kiebel et al., 2008; Spratling, 2017; Keller and Mrsic-Flogel, 2018), each representational level should have a corresponding local model (see Figure 1B). This means that, if we accept that the IC and MGB encode different stages of the processing hierarchy, the MGB should have a local population of neurons encoding predictions that are transferred to the IC. Some algorithms actually locate the predictive model at the same processing stage, meaning that the populations encoding the model would actually reside in the IC (Spratling, 2017). There is, however, still no evidence for the encoding of these models in subcortical stages. Direct corticofugal connections exist all the way down to the cochlear nucleus (Winer, 2005a), so it is theoretically possible that all subcortical nuclei are located at the same hierarchical stage with respect to the PPF and that their corresponding model is located in primary auditory cortex. However, the presence of thalamo-collicular, thalamo-cochlear, and colliculo-cochlear efferents (Schofield, 2011) indicate that predictions are most likely also conveyed across different subcortical stages.
Another key ingredient necessary to understand the architecture of the PPF is the exact mechanism underlying the computation of prediction error and generation of predictions at each stage. Some PPF algorithms have suggested that prediction error might be computed by subtracting the predictions from the sensory input via inhibition (Wacongne et al., 2012). However, predictability leads to behavioral benefits and wrong predictions can sometime bias perception toward incorrect percepts (de Lange et al., 2018); an inhibitory account of the computation of prediction error would not be able to account for any of these effects. Moreover, the dependence of the repetition entrainment enhancement of the FFR on predictability and the enhancement of the responses in the MGB by visual cues (Yu et al., 2009) seem to indicate that predictions can enhance the sensory representation, rather than inhibiting it. Future models of the PPF face the challenge of reconciling these findings with the repetition/predictability suppression characteristic of prediction error.
Understanding the neural mechanisms underlying sensory processing is the only robust approach to understand perception. If, as enunciated by the global PPF model, sensory processing is a process of inference, we should remove all claims of perceptual objectivity, pay close attention to our priors and our internal models of the world, and question ourselves about the realities we cannot perceive just because they are not part of our model space. Although absolute inference is an unlikely scenario, since we must have formed our current models based on empirical experiences, it is possible that our reliance on inference grows more and more as we age (Lucas et al., 2014; Sherratt and Morand-Ferron, 2018; Cohen et al., 2020). SSA and the discovery of the encoding of prediction in subcortical sensory pathways have opened the gates to a deep exploration on sensory organization that might have strong philosophical repercussions on the way we understand what we call reality. If future work departs from paradigms that are unspecific about the underlying model of the sensory world, research on the PPF could lead us to the roots of the mechanisms that make us see, hear, and feel.
Author Contributions
AT and KK reviewed the literature and wrote the manuscript. Both authors contributed to the article and approved the submitted version.
Funding
This study was funded by the European Research Council ERC Consolidator Grant SENSOCOM (647051).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
Aitchison, L., and Lengyel, M. (2017). With or without you: predictive coding and Bayesian inference in the brain. Curr. Opin. Neurobiol. 46, 219–227. doi: 10.1016/j.conb.2017.08.010
Alho, K., Żarnowiec, K., Gorina-Careta, N., and Escera, C. (2019). Phonological task enhances the frequency-following response to deviant task-irrelevant speech sounds. Front. Hum. Neurosci. 13:245. doi: 10.3389/fnhum.2019.00245
Allen, M., Fardo, F., Dietz, M. J., Hillebrandt, H., Friston, K. J., Rees, G., et al. (2016). Anterior insula coordinates hierarchical processing of tactile mismatch responses. Neuroimage 127, 34–43. doi: 10.1016/j.neuroimage.2015.11.030
Althen, H., Grimm, S., and Escera, C. (2011). Fast detection of unexpected sound intensity decrements as revealed by human evoked potentials. PLoS ONE 6:e28522. doi: 10.1371/journal.pone.0028522
Andersen, M. (2019). Predictive coding in agency detection. Religion Brain Behav. 9, 65–84. doi: 10.1080/2153599X.2017.1387170
Anderson, L. A., Christianson, G. B., and Linden, J. F. (2009). Stimulus-specific adaptation occurs in the auditory thalamus. J. Neurosci. 29, 7359–7363. doi: 10.1523/JNEUROSCI.0793-09.2009
Anderson, L. A., and Malmierca, M. S. (2013). The effect of auditory cortex deactivation on stimulus-specific adaptation in the inferior colliculus of the rat. Eur. J. Neurosci. 37, 52–62. doi: 10.1111/ejn.12018
Antunes, F. M., and Malmierca, M. S. (2011). Effect of auditory cortex deactivation on stimulus-specific adaptation in the medial geniculate body. J. Neurosci. 31, 17306–17316. doi: 10.1523/JNEUROSCI.1915-11.2011
Antunes, F. M., Nelken, I., Covey, E., and Malmierca, M. S. (2010). Stimulus-specific adaptation in the auditory thalamus of the anesthetized rat. PLoS ONE 5:e14071. doi: 10.1371/journal.pone.0014071
Atick, J. J. (1992). Could information theory provide an ecological theory of sensory processing? Network Comput. Neural Syst. 3, 213–251. doi: 10.1088/0954-898X_3_2_009
Ayala, Y. A., and Malmierca, M. S. (2015). Cholinergic modulation of stimulus-specific adaptation in the inferior colliculus. J. Neurosci. 35, 12261–12272. doi: 10.1523/JNEUROSCI.0909-15.2015
Ayala, Y. A., and Malmierca, M. S. (2018). The effect of inhibition on stimulus-specific adaptation in the inferior colliculus. Brain Struct. Funct. 223, 1391–1407. doi: 10.1007/s00429-017-1546-4
Ayala, Y. A., Pérez-gonzález, D., Duque, D., Nelken, I., and Malmierca, M. S. (2013). Frequency discrimination and stimulus deviance in the inferior colliculus and cochlear nucleus. Front. Neural Circ. 6:119. doi: 10.3389/fncir.2012.00119
Ayala, Y. A., Pérez-González, D., and Malmierca, M. S. (2016). Stimulus-specific adaptation in the inferior colliculus: the role of excitatory, inhibitory and modulatory inputs. Biol. Psychol. 116, 10–22. doi: 10.1016/j.biopsycho.2015.06.016
Ayala, Y. A., Udeh, A., Dutta, K., Bishop, D., Malmierca, M. S., and Oliver, D. L. (2015). Differences in the strength of cortical and brainstem inputs to SSA and non-SSA neurons in the inferior colliculus. Sci. Rep. 5:10383. doi: 10.1038/srep10383
Badcock, P. B., Friston, K. J., and Ramstead, M. J. (2019). The hierarchically mechanistic mind: a free-energy formulation of the human psyche. Phys. Life Rev. 31, 104–121. doi: 10.1016/j.plrev.2018.10.002
Barron, H. C., Auksztulewicz, R., and Friston, K. (2020). Prediction and memory: a predictive coding account. Prog. Neurobiol. 192:101821. doi: 10.1016/j.pneurobio.2020.101821
Bastos, A. M., Usrey, W. M., Adams, R. A., Mangun, G. R., Fries, P., et al. (2012). Canonical microcircuits for predictive coding. Neuron 76, 695–711. doi: 10.1016/j.neuron.2012.10.038
Bauerle, P., von der Behrens, W., Kossl, M., and Gaese, B. H. (2011). Stimulus-specific adaptation in the gerbil primary auditory thalamus is the result of a fast frequency-specific habituation and is regulated by the corticofugal system. J. Neurosci. 31, 9708–9722. doi: 10.1523/JNEUROSCI.5814-10.2011
Bejjanki, V. R., Beck, J. M., Lu, Z. L., and Pouget, A. (2011). Perceptual learning as improved probabilistic inference in early sensory areas. Nat. Neurosci. 14, 642–650. doi: 10.1038/nn.2796
Bell, A. H., Summerfield, C., Morin, E. L., Malecek, N. J., and Ungerleider, L. G. (2016). Encoding of stimulus probability in macaque inferior temporal cortex. Curr. Biol. 26, 2280–2290. doi: 10.1016/j.cub.2016.07.007
Bendixen, A., SanMiguel, I., and Schröger, E. (2012). Early electrophysiological indicators for predictive processing in audition: a review. Int. J. Psychophysiol. 83, 120–131. doi: 10.1016/j.ijpsycho.2011.08.003
Bidelman, G. M. (2018). Subcortical sources dominate the neuroelectric auditory frequency-following response to speech. Neuroimage 175, 56–69. doi: 10.1016/j.neuroimage.2018.03.060
Boehnke, S. E., Berg, D. J., Marino, R. A., Baldi, P. F., Itti, L., and Munoz, D. P. (2011). Visual adaptation and novelty responses in the superior colliculus. Eur. J. Neurosci. 34, 766–779. doi: 10.1111/j.1460-9568.2011.07805.x
Boston, J. R., and Moller, A. R. (1985). Brainstem auditory-evoked potentials. Crit. Rev. Biomed. Eng. 13, 97–123.
Brenner, N., Bialek, W., and de Ruyter van Steveninck, R. (2000). Adaptive rescaling maximizes information transmission. Neuron 26, 695–702. doi: 10.1016/S0896-6273(00)81205-2
Büchel, C., Geuter, S., Sprenger, C., and Eippert, F. (2014). Placebo analgesia: a predictive coding perspective. Neuron 81, 1223–1239. doi: 10.1016/j.neuron.2014.02.042
Buxton, R. B. (2009). Introduction to Functional Magnetic Resonance Imaging. Cambridge: Cambridge University Press. doi: 10.1017/CBO9780511605505
Cacciaglia, R., Escera, C., Slabu, L., Grimm, S., Sanjuán, A., Ventura-Campos, N., et al. (2015). Involvement of the human midbrain and thalamus in auditory deviance detection. Neuropsychologia 68, 51–58. doi: 10.1016/j.neuropsychologia.2015.01.001
Carbajal, G. V., and Malmierca, M. S. (2018). The neuronal basis of predictive coding along the auditory pathway: from the subcortical roots to cortical deviance detection. Trends Hear. 22:233121651878482. doi: 10.1177/2331216518784822
Casado-Román, L., Carbajal, G. V., Pérez-González, D., and Malmierca, M. S. (2020). Prediction error signaling explains neuronal mismatch responses in the medial prefrontal cortex. PLoS Biol. 18:e3001019. doi: 10.1371/journal.pbio.3001019
Chandrasekaran, B., Hornickel, J., Skoe, E., Nicol, T., and Kraus, N. (2009). Context-dependent encoding in the human auditory brainstem relates to hearing speech in noise: implications for developmental dyslexia. Neuron 64, 311–319. doi: 10.1016/j.neuron.2009.10.006
Chandrasekaran, B., and Kraus, N. (2010). The scalp-recorded brainstem response to speech: neural origins and plasticity. Psychophysiology 47, 236–246. doi: 10.1111/j.1469-8986.2009.00928.x
Chechik, G., Anderson, M. J., Bar-Yosef, O., Young, E. D., Tishby, N., and Nelken, I. (2006). Reduction of information redundancy in the ascending auditory pathway. Neuron 51, 359–368. doi: 10.1016/j.neuron.2006.06.030
Chechik, G., and Nelken, I. (2012). Auditory abstraction from spectro-temporal features to coding auditory entities. Proc. Natl. Acad. Sci. U.S.A. 109, 18968–18973. doi: 10.1073/pnas.1111242109
Clark, J. E., Watson, S., and Friston, K. J. (2018). What is mood? A computational perspective. Psychol. Med. 48, 2277–2284. doi: 10.1017/S0033291718000430
Coffey, E. B., Nicol, T., White-Schwoch, T., Chandrasekaran, B., Krizman, J., Skoe, E., et al. (2019). Evolving perspectives on the sources of the frequency-following response. Nat. Commun. 10, 1–10. doi: 10.1038/s41467-019-13003-w
Coffey, E. B. J., Herholz, S. C., Chepesiuk, A. M. P., Baillet, S., and Zatorre, R. J. (2016). Cortical contributions to the auditory frequency-following response revealed by MEG. Nat. Commun. 7:11070. doi: 10.1038/ncomms11070
Cohen, A. O., Nussenbaum, K., Dorfman, H. M., Gershman, S. J., and Hartley, C. A. (2020). The rational use of causal inference to guide reinforcement learning strengthens with age. NPJ Sci. Learn. 5, 1–9. doi: 10.1038/s41539-020-00075-3
Davis, M. H., and Johnsrude, I. S. (2007). Hearing speech sounds: Top-down influences on the interface between audition and speech perception. Hear. Res. 229, 132–147. doi: 10.1016/j.heares.2007.01.014
de Lange, F. P., Heilbron, M., and Kok, P. (2018). How do expectations shape perception? Trends Cogn. Sci. 22, 764–779. doi: 10.1016/j.tics.2018.06.002
DeCharms, C., and Zador, A. (2000). Neural representation andthe cordical code. Annu. Rev. Neurosci. 23, 613–647. doi: 10.1146/annurev.neuro.23.1.613
Dhruv, N. T., and Carandini, M. (2014). Report cascaded effects of spatial adaptation in the early visual system. Neuron 81, 529–535. doi: 10.1016/j.neuron.2013.11.025
Diaz, B., Hintz, F., Kiebel, S. J., and von Kriegstein, K. (2012). Dysfunction of the auditory thalamus in developmental dyslexia. Proc. Natl. Acad. Sci. U.S.A. 109, 13841–13846. doi: 10.1073/pnas.1119828109
DiCarlo, J. J., Zoccolan, D., and Rust, N. C. (2012). How does the brain solve visual object recognition? Neuron 73, 415–434. doi: 10.1016/j.neuron.2012.01.010
Duque, D., and Malmierca, M. S. (2015). Stimulus-specific adaptation in the inferior colliculus of the mouse: anesthesia and spontaneous activity effects. Brain Struct. Funct. 220, 3385–3398. doi: 10.1007/s00429-014-0862-1
Duque, D., Malmierca, M. S., and Caspary, D. M. (2014). Modulation of stimulus-specific adaptation by GABA(A) receptor activation or blockade in the medial geniculate body of the anaesthetized rat. J. Physiol. 592(Pt 4), 729–743. doi: 10.1113/jphysiol.2013.261941
Duque, D., Pais, R., and Malmierca, M. S. (2018). Stimulus-specific adaptation in the anesthetized mouse revealed by brainstem auditory evoked potentials. Hear. Res. 370, 294–301. doi: 10.1016/j.heares.2018.08.011
Duque, D., Pérez-González, D., Ayala, Y. A., Palmer, A. R., and Malmierca, M. S. (2012). Topographic distribution, frequency, and intensity dependence of stimulus-specific adaptation in the inferior colliculus of the rat. J. Neurosci. 32, 17762–17774. doi: 10.1523/JNEUROSCI.3190-12.2012
Duque, D., Wang, X., Nieto-Diego, J., Krumbholz, K., and Malmierca, M. S. (2016). Neurons in the inferior colliculus of the rat show stimulus-specific adaptation for frequency, but not for intensity. Sci. Rep. 6, 1–15. doi: 10.1038/srep24114
Eippert, F., Finsterbusch, J., Bingel, U., and Büchel, C. (2009). Direct evidence for spinal cord involvement in Placebo Analgesia. Science 326:404. doi: 10.1126/science.1180142
Epstein, W. (1993). The representational framework in perceptual theory. Percept. Psychophys. 53, 704–709. doi: 10.3758/BF03211747
Escera, C., and Malmierca, M. S. (2014). The auditory novelty system: an attempt to integrate human and animal research. Psychophysiology 51, 111–123. doi: 10.1111/psyp.12156
Eytan, D., Brenner, N., and Marom, S. (2003). Selective adaptation in networks of cortical neurons. J. Neurosci. 23, 9349–9356. doi: 10.1523/JNEUROSCI.23-28-09349.2003
Fabry, R. E. (2018). Betwixt and between: the enculturated predictive processing approach to cognition. Synthese 195, 2483–2518. doi: 10.1007/s11229-017-1334-y
Fairhall, A. L., Lewen, G. D., Bialek, W., and de Ruyter van Steveninck, R. R. (2001). Efficiency and ambiguity in an adaptive neural code. Nature 412, 787–792. doi: 10.1038/35090500
Fiser, A., Mahringer, D., Oyibo, H. K., Petersen, A. V., Leinweber, M., and Keller, G. B. (2016). Experience-dependent spatial expectations in mouse visual cortex. Nat. Neurosci. 19, 1658–1664. doi: 10.1038/nn.4385
Font-Alaminos, M., Ribas-Prats, T., Gorina-Careta, N., and Escera, C. (2020). Emergence of prediction error along the human auditory hierarchy. Hear. Res. 399:107954. doi: 10.1016/j.heares.2020.107954
Frank, G. K., Collier, S., Shott, M. E., and O'Reilly, R. C. (2016). Prediction error and somatosensory insula activation in women recovered from anorexia nervosa. J. Psychiatry Neurosci. 41, 304–311. doi: 10.1503/jpn.150103
Friston, K. (2003). Learning and inference in the brain. Neural Netw. 16, 1325–1352. doi: 10.1016/j.neunet.2003.06.005
Friston, K. (2005). A theory of cortical responses. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 360, 815–836. doi: 10.1098/rstb.2005.1622
Gao, P. P., Zhang, J. W., Cheng, J. S., Zhou, I. Y., and Wu, E. X. (2014). The inferior colliculus is involved in deviant sound detection as revealed by BOLD fMRI. Neuroimage 91, 220–227. doi: 10.1016/j.neuroimage.2014.01.043
Garrido, M. I., Kilner, J. M., Stephan, K. E., and Friston, K. J. (2009). The mismatch negativity: a review of underlying mechanisms. Clin. Neurophysiol. 120, 453–463. doi: 10.1016/j.clinph.2008.11.029
Gerken, G. M., Moushegian, G., Stillman, R. D., and Rupert, A. L. (1975). Human frequency-following responses to monaural and binaural stimuli. Electroencephalogr. Clin. Neurophysiol. 38, 379–386. doi: 10.1016/0013-4694(75)90262-X
Gonzalez-Gadea, M. L., Chennu, S., Bekinschtein, T. A., Rattazzi, A., Beraudi, A., Tripicchio, P., et al. (2015). Predictive coding in autism spectrum disorder and attention deficit hyperactivity disorder. J. Neurophysiol. 114, 2625–2636. doi: 10.1152/jn.00543.2015
Gorina-Careta, N., Zarnowiec, K., Costa-Faidella, J., and Escera, C. (2016). Timing predictability enhances regularity encoding in the human subcortical auditory pathway. Sci. Rep. 6, 1–9. doi: 10.1038/srep37405
Grimm, S., Escera, C., and Nelken, I. (2016). Early indices of deviance detection in humans and animal models. Biol. Psychol. 116, 23–27. doi: 10.1016/j.biopsycho.2015.11.017
Grimm, S., Escera, C., Slabu, L., and Costa-Faidella, J. (2011). Electrophysiological evidence for the hierarchical organization of auditory change detection in the human brain. Psychophysiology 48, 377–384. doi: 10.1111/j.1469-8986.2010.01073.x
He, J., and Yu, Y. (2010). Role of descending control in the auditory pathway. Oxford Handb. Audit. Neurosci. 2, 247–268. doi: 10.1093/oxfordhb/9780199233281.013.0011
Heeger, D. J. (2017). Theory of cortical function. Proc. Natl. Acad. Sci. U.S.A. 114, 1773–1782. doi: 10.1073/pnas.1619788114
Heilbron, M., and Chait, M. (2017). Great expectations: Is there evidence for predictive coding in auditory cortex? Neuroscience. 389, 54–73. doi: 10.1016/j.neuroscience.2017.07.061
Horga, G., Schatz, K. C., Abi-Dargham, A., and Peterson, B. S. (2014). Deficits in predictive coding underlie hallucinations in schizophrenia. J. Neurosci. 34, 8072–8082. doi: 10.1523/JNEUROSCI.0200-14.2014
Hosoya, T., Baccus, S. A., and Meister, M. (2005). Dynamic predictive coding by the retina. Nature 436, 71–77. doi: 10.1038/nature03689
Howlett, M. H., Smith, R. G., and Kamermans, M. (2017). A novel mechanism of cone photoreceptor adaptation. PLoS Biol. 15:e2001210. doi: 10.1371/journal.pbio.2001210
Huang, Y., and Rao, R. P. N. (2011). Predictive coding. WIREs Cogn. Sci. 2, 580–593. doi: 10.1002/wcs.142
Jaramillo, S., and Zador, A. M. (2011). The auditory cortex mediates the perceptual effects of acoustic temporal expectation. Nat. Neurosci. 14, 246–253. doi: 10.1038/nn.2688
Jehee, J. F., and Ballard, D. H. (2009). Predictive feedback can account for biphasic responses in the lateral geniculate nucleus. PLoS Comput. Biol. 5:e1000373. doi: 10.1371/journal.pcbi.1000373
Jewett, D. L., Romano, M. N., and Williston, J. S. (1970). Human auditory evoked potentials: possible brain stem components detected on the scalp. Science 167, 1517–1518. doi: 10.1126/science.167.3924.1517
Jin, M., and Glickfeld, L. L. (2020). Magnitude, time course, and specificity of rapid adaptation across mouse visual areas. J. Neurophysiol. 124, 245–258. doi: 10.1152/jn.00758.2019
Johnston, J., Seibel, S. H., Darnet, L. S. A., Renninger, S., Orger, M., and Lagnado, L. (2019). A retinal circuit generating a dynamic predictive code for oriented features. Neuron 102, 1211.e3–1222.e3. doi: 10.1016/j.neuron.2019.04.002
Kastner, D. B., and Baccus, S. A. (2013). Spatial segregation of adaptation and predictive sensitization in retinal ganglion cells. Neuron 79, 541–554. doi: 10.1016/j.neuron.2013.06.011
Kastner, D. B., Ozuysal, Y., Panagiotakos, G., and Baccus, S. A. (2019). Adaptation of inhibition mediates retinal sensitization. Curr. Biol. 29, 2640.e4–2651.e4. doi: 10.1016/j.cub.2019.06.081
Keller, G. B., and Mrsic-Flogel, T. D. (2018). Predictive processing: a canonical cortical computation. Neuron 100, 424–435. doi: 10.1016/j.neuron.2018.10.003
Khatri, V., Hartings, J. A., and Simons, D. J. (2004). Adaptation in thalamic barreloid and cortical barrel neurons to periodic whisker deflections varying in frequency and velocity. J. Neurophysiol. 92, 3244–3254. doi: 10.1152/jn.00257.2004
Kiebel, S. J., Garrido, M. I., Moran, R. J., and Friston, K. J. (2008). Dynamic causal modelling for EEG and MEG. Cogn. Neurodyn. 2, 121–136. doi: 10.1007/s11571-008-9038-0
Kuo, R. I., and Wu, G. K. (2012). The generation of direction selectivity in the auditory system. Neuron 73, 1016–1027. doi: 10.1016/j.neuron.2011.11.035
Lau, J. C., Wong, P. C., and Chandrasekaran, B. (2017). Context-dependent plasticity in the subcortical encoding of linguistic pitch patterns. J. Neurophysiol. 117, 594–603. doi: 10.1152/jn.00656.2016
Lee, C. C., and Sherman, S. M. (2011). On the classification of pathways in the auditory midbrain, thalamus, and cortex. Hear. Res. 276, 79–87. doi: 10.1016/j.heares.2010.12.012
Liu, C., Foffani, G., Scaglione, A., Aguilar, J., and Moxon, K. A. (2017). Adaptation of thalamic neurons provides information about the spatiotemporal context of stimulus history. J. Neurosci. 37, 10012–10021. doi: 10.1523/JNEUROSCI.0637-17.2017
Lochmann, T., and Deneve, S. (2011). Neural processing as causal inference. Curr. Opin. Neurobiol. 21, 774–781. doi: 10.1016/j.conb.2011.05.018
Lomber, S. G. (1999). The advantages and limitations of permanent or reversible deactivation techniques in the assessment of neural function. J. Neurosci. Methods 86, 109–117. doi: 10.1016/S0165-0270(98)00160-5
Lucas, C. G., Bridgers, S., Griffiths, T. L., and Gopnik, A. (2014). When children are better (or at least more open-minded) learners than adults: developmental differences in learning the forms of causal relationships. Cognition 131, 284–299. doi: 10.1016/j.cognition.2013.12.010
Malmierca, M. S., Anderson, L. A., and Antunes, F. M. (2015). The cortical modulation of stimulus-specific adaptation in the auditory midbrain and thalamus: a potential neuronal correlate for predictive coding. Front. Syst. Neurosci. 9:19. doi: 10.3389/fnsys.2015.00019
Malmierca, M. S., Cristaudo, S., Pérez-González, D., and Covey, E. (2009). Stimulus-specific adaptation in the inferior colliculus of the anesthetized rat. J. Neurosci. 29, 5483–5493. doi: 10.1523/JNEUROSCI.4153-08.2009
Malmierca, M. S., Ni no-Aguillón, B. E., Nieto-Diego, J., Porteros, Á., Pérez-González, D., and Escera, C. (2019). Pattern-sensitive neurons reveal encoding of complex auditory regularities in the rat inferior colliculus. Neuroimage 184, 889–900. doi: 10.1016/j.neuroimage.2018.10.012
Malmierca, M. S., Sanchez-Vives, M. V., Escera, C., and Bendixen, A. (2014). Neuronal adaptation, novelty detection and regularity encoding in audition. Front. Syst. Neurosci. 8:111. doi: 10.3389/fnsys.2014.00111
Martin, K. A. (1994). A brief history of the “feature detector”. Cereb. Cortex 4, 1–7. doi: 10.1093/cercor/4.1.1
Matre, D., Casey, K. L., and Knardahl, S. (2006). Placebo-induced changes in spinal cord pain processing. J. Neurosci. 26, 559–563. doi: 10.1523/JNEUROSCI.4218-05.2006
May, P. J., Westö, J., and Tiitinen, H. (2015). Computational modelling suggests that temporal integration results from synaptic adaptation in auditory cortex. Eur. J. Neurosci. 41, 615–630. doi: 10.1111/ejn.12820
Mazzucato, L., Camera, G., and Fontanini, A. (2019). Expectation-induced modulation of metastable activity underlies faster coding of sensory stimuli. Nat. Neurosci. 22, 787–796. doi: 10.1038/s41593-019-0364-9
Mill, R., Coath, M., Wennekers, T., and Denham, S. L. (2011). A neurocomputational model of stimulus-specific adaptation to oddball and Markov sequences. PLoS Comput. Biol. 7:e1002117. doi: 10.1371/journal.pcbi.1002117
Mill, R., Coath, M., Wennekers, T., and Denham, S. L. (2012). Characterising stimulus-specific adaptation using a multi-layer field model. Brain Res. 1434, 178–188. doi: 10.1016/j.brainres.2011.08.063
Mohar, B., Katz, Y., and Lampl, I. (2013). Opposite adaptive processing of stimulus intensity in two major nuclei of the somatosensory brainstem. J. Neurosci. 33, 15394–15400. doi: 10.1523/JNEUROSCI.1886-13.2013
Müller-Axt, C., Anwander, A., and von Kriegstein, K. (2017). Altered structural connectivity of the left visual thalamus in developmental dyslexia. Curr. Biol. 27, 3692.e4–3698.e4. doi: 10.1016/j.cub.2017.10.034
Mumford, D. (1992). On the computational architecture of the neocortex II: The role o f cortico-cortical loops. Biol. Cybernet. 66, 241–251. doi: 10.1007/BF00198477
Nagai, Y. (2019). Predictive learning: its key role in early cognitive development. Philos. Trans. R. Soc. B Biol. Sci. 374:20180030. doi: 10.1098/rstb.2018.0030
Nelken, I., and Ulanovsky, N. (2007). Mismatch negativity and stimulus-specific adaptation in animal models. J. Psychophysiol. 21, 214–223. doi: 10.1027/0269-8803.21.34.214
Ohara, P. T., and Lieberman, A. R. (1985). The thalamic reticular nucleus of the adult rat: experimental anatomical studies. J. Neurocytol. 14, 365–411. doi: 10.1007/BF01217752
Okada, K., Matchin, W., and Hickok, G. (2018). Neural evidence for predictive coding in auditory cortex during speech production. Psychon. Bull. Rev. 25, 423–430. doi: 10.3758/s13423-017-1284-x
Ölveczky, B. P., Baccus, S. A., and Meister, M. (2007). Retinal adaptation to object motion. Neuron 56, 689–700. doi: 10.1016/j.neuron.2007.09.030
Pack, C. C., and Bensmaia, S. J. (2015). Seeing and feeling motion: canonical computations in vision and touch. PLoS Biol. 13:e1002271. doi: 10.1371/journal.pbio.1002271
Pais-Vieira, M., Lebedev, M. A., Wiest, M. C., and Nicolelis, M. A. (2013). Simultaneous top-down modulation of the primary somatosensory cortex and thalamic nuclei during active tactile discrimination. J. Neurosci. 33, 4076–4093. doi: 10.1523/JNEUROSCI.1659-12.2013
Pakan, J. M., Francioni, V., and Rochefort, N. L. (2018). Action and learning shape the activity of neuronal circuits in the visual cortex. Curr. Opin. Neurobiol. 52, 88–97. doi: 10.1016/j.conb.2018.04.020
Parkkonen, L., Fujiki, N., and Mäkelä, J. P. (2009). Sources of auditory brainstem responses revisited: contribution by magnetoencephalography. Hum. Brain Mapp. 30, 1772–1782. doi: 10.1002/hbm.20788
Parras, G. G., Nieto-Diego, J., Carbajal, G. V., Valdés-Baizabal, C., Escera, C., and Malmierca, M. S. (2017). Neurons along the auditory pathway exhibit a hierarchical organization of prediction error. Nat. Commun. 8:2148. doi: 10.1038/s41467-017-02038-6
Pérez-González, D., Hernández, O., Covey, E., and Malmierca, M. S. (2012). GABA A-mediated inhibition modulates stimulus-specific adaptation in the inferior colliculus. PLoS ONE 7:e34297. doi: 10.1371/journal.pone.0034297
Pinault, D. (2004). The thalamic reticular nucleus: structure, function and concept. Brain Res. Rev. 46, 1–31. doi: 10.1016/j.brainresrev.2004.04.008
Purves, D., Morgenstern, Y., and Wojtach, W. T. (2015). Perception and reality: Why a wholly empirical paradigm is needed to understand vision. Front. Syst. Neurosci. 9:156. doi: 10.3389/fnsys.2015.00156
Quiroga, R. Q., Reddy, L., Kreiman, G., Koch, C., and Fried, I. (2005). Invariant visual representation by single neurons in the human brain. Nature 435, 1102–1107. doi: 10.1038/nature03687
Rao, R. P. N., and Ballard, D. H. (1999). Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat. Neurosci. 2, 79–87. doi: 10.1038/4580
Rauschecker, J. P. (2015). Auditory and visual cortex of primates: a comparison of two sensory systems. Eur. J. Neurosci. 41, 579–585. doi: 10.1111/ejn.12844
Rhode, W. S., and Smith, P. H. (1986). Encoding timing and intensity in the ventral cochlear nucleus of the cat. J. Neurophysiol. 56, 261–286. doi: 10.1152/jn.1986.56.2.261
Richardson, B. D., Hancock, K. E., and Caspary, D. M. (2013). Stimulus-specific adaptation in auditory thalamus of young and aged awake rats. J. Neurophysiol. 110, 1892–1902. doi: 10.1152/jn.00403.2013
Robinson, B. L., Harper, N. S., and McAlpine, D. (2016). Meta-adaptation in the auditory midbrain under cortical influence. Nat. Commun. 7:13442. doi: 10.1038/ncomms13442
Robinson, B. L., and McAlpine, D. (2009). Gain control mechanisms in the auditory pathway. Curr. Opin. Neurobiol. 19, 402–407. doi: 10.1016/j.conb.2009.07.006
Schofield, B. R. (2011). “Central descending auditory pathways,” in Auditory and Vestibular Efferents, eds D. Ryugo and R. Fay (New York, NY: Springer), 261–290. doi: 10.1007/978-1-4419-7070-1_9
Sherratt, T. N., and Morand-Ferron, J. (2018). The adaptive significance of age-dependent changes in the tendency of individuals to explore. Anim. Behav. 138, 59–67. doi: 10.1016/j.anbehav.2018.01.025
Shiga, T., Althen, H., Cornella, M., Zarnowiec, K., Yabe, H., and Escera, C. (2015). Deviance-related responses along the auditory hierarchy: combined FFR, MLR and MMN evidence. PLoS ONE 10:e0136794. doi: 10.1371/journal.pone.0136794
Shipp, S. (2016). Neural elements for predictive coding. Front. Psychol. 7:1792. doi: 10.3389/fpsyg.2016.01792
Siman-Tov, T., Granot, R. Y., Shany, O., Singer, N., Hendler, T., and Gordon, C. R. (2019). Is there a prediction network? Meta-analytic evidence for a cortical-subcortical network likely subserving prediction. Neurosci. Biobehav. Rev. 105, 262–275. doi: 10.1016/j.neubiorev.2019.08.012
Skoe, E., Chandrasekaran, B., Spitzer, E. R., Wong, P. C., and Kraus, N. (2014). Human brainstem plasticity: the interaction of stimulus probability and auditory learning. Neurobiol. Learn. Mem. 109, 82–93. doi: 10.1016/j.nlm.2013.11.011
Slabu, L., Escera, C., Grimm, S., and Costa-Faidella, J. (2010). Early change detection in humans as revealed by auditory brainstem and middle-latency evoked potentials. Eur. J. Neurosci. 32, 859–865. doi: 10.1111/j.1460-9568.2010.07324.x
Slabu, L., Grimm, S., and Escera, C. (2012). Novelty detection in the human auditory brainstem. J. Neurosci. 32, 1447–1452. doi: 10.1523/JNEUROSCI.2557-11.2012
Sohoglu, E., and Davis, M. H. (2016). Perceptual learning of degraded speech by minimizing prediction error. Proc. Natl. Acad. Sci. U.S.A. 113, E1747E1756. doi: 10.1073/pnas.1523266113
Spratling, M. W. (2016). Predictive coding as a model of cognition. Cogn. Process. 17, 279–305. doi: 10.1007/s10339-016-0765-6
Spratling, M. W. (2017). A review of predictive coding algorithms. Brain Cogn. 112, 92–97. doi: 10.1016/j.bandc.2015.11.003
Sterzer, P., Adams, R. A., Fletcher, P., Frith, C., Lawrie, S. M., Muckli, L., et al. (2018). The predictive coding account of psychosis. Biol. Psychiatry 84, 634–643. doi: 10.1016/j.biopsych.2018.05.015
Sterzer, P., Voss, M., Schlagenhauf, F., and Heinz, A. (2019). Decision-making in schizophrenia: a predictive-coding perspective. Neuroimage 190, 133–143. doi: 10.1016/j.neuroimage.2018.05.074
Strait, D. L., Hornickel, J., and Kraus, N. (2011). Subcortical processing of speech regularities underlies reading and music aptitude in children. Behav. Brain Funct. 7, 1–11. doi: 10.1186/1744-9081-7-44
Syka, J., Popelá, J., Druga, R., and Vlková, A. (1988). Descending central auditory pathway - structure and function. Audit. Pathway 279–292. doi: 10.1007/978-1-4684-1300-7
Taaseh, N., Yaron, A., and Nelken, I. (2011). Stimulus-specific adaptation and deviance detection in the rat auditory cortex. PLoS ONE 6:e23369. doi: 10.1371/journal.pone.0023369
Tabas, A., Mihai, G., Kiebel, S., Trampel, R., and von Kriegstein, K. (2020). Predictive coding underlies adaptation in the subcortical sensory pathway. arXiv preprint arXiv:2003.11328. doi: 10.7554/eLife.64501
Talsma, D. (2015). Predictive coding and multisensory integration: an attentional account of the multisensory mind. Front. Integr. Neurosci. 9:19. doi: 10.3389/fnint.2015.00019
Thomas, J. M., Morse, C., Kishline, L., O'Brien-Lambert, A., Simonton, A., Miller, K. E., et al. (2012). Stimulus-specific adaptation in specialized neurons in the inferior colliculus of the big brown bat, Eptesicus fuscus. Hear. Res. 291, 34–40. doi: 10.1016/j.heares.2012.06.004
Tschentscher, N., Ruisinger, A., Blank, H., Diaz, B., and von Kriegstein, K. (2019). Reduced structural connectivity between left auditory thalamus and the motion-sensitive planum temporale in developmental dyslexia. J Neurosci. 27, 1720–1732. doi: 10.1523/JNEUROSCI.1435-18.2018
Ulanovsky, N., Las, L., Farkas, D., and Nelken, I. (2004). Multiple time scales of adaptation in auditory cortex neurons. J. Neurosci. 24, 10440–10453. doi: 10.1523/JNEUROSCI.1905-04.2004
Ulanovsky, N., Las, L., and Nelken, I. (2003). Processing of low-probability sounds by cortical neurons. Nat. Neurosci. 6, 391–398. doi: 10.1038/nn1032
Valdés-Baizabal, C., Carbajal, G. V., Pérez-González, D., and Malmierca, M. S. (2020). Dopamine modulates subcortical responses to surprising sounds. PLoS Biol. 18:e3000744. doi: 10.1371/journal.pbio.3000744
Valdés-Baizabal, C., Parras, G. G., Ayala, Y. A., and Malmierca, M. S. (2017). Endocannabinoid modulation of stimulus-specific adaptation in inferior colliculus neurons of the rat. Sci. Rep. 7:6997. doi: 10.1038/s41598-017-07460-w
van de Cruys, S., Evers, K., van der Hallen, R., van Eylen, L., Boets, B., De-Wit, L., et al. (2014). Precise minds in uncertain worlds: Predictive coding in autism. Psychol. Rev. 121, 649–675. doi: 10.1037/a0037665
van Schalkwyk, G. I., Volkmar, F. R., and Corlett, P. R. (2017). A predictive coding account of psychotic symptoms in autism spectrum disorder. J. Autism Dev. Disord. 47, 1323–1340. doi: 10.1007/s10803-017-3065-9
van Wassenhove, V. (2013). Speech through ears and eyes: interfacing the senses with the supramodal brain. Front. Psychol. 4:388. doi: 10.3389/fpsyg.2013.00388
Von Helmholtz, H. (1867). Handbuch der physiologischen Optik: mit 213 in den Text eingedruckten Holzschnitten und 11 Tafeln. Leipzig: Voss.
von Kriegstein, K. (2012). “A multisensory perspective on human auditory communication,” in The Neural Bases of Multisensory Processes, eds M. M. Murray and M. T. Wallace (Boca Raton, FL: CRC Press), 683–700. doi: 10.1201/9781439812174-43
Von Kriegstein, K., Dogan, Ö., Grüter, M., Giraud, A. L., Kell, C. A., Grüter, T., et al. (2008). Simulation of talking faces in the human brain improves auditory speech recognition. Proc. Natl. Acad. Sci. U.S.A. 105, 6747–6752. doi: 10.1073/pnas.0710826105
Wacongne, C., Changeux, J.-P., and Dehaene, S. (2012). A neuronal model of predictive coding accounting for the mismatch negativity. J. Neurosci. 32, 3665–3678. doi: 10.1523/JNEUROSCI.5003-11.2012
Walsh, K. S., McGovern, D. P., Clark, A., and O'Connell, R. G. (2020). Evaluating the neurophysiological evidence for predictive processing as a model of perception. Ann. N. Y. Acad. Sci. 1464, 242–268. doi: 10.1111/nyas.14321
Wang, H., Han, Y. F., Chan, Y. S., and He, J. (2014). Stimulus-specific adaptation at the synapse level in vitro. PLoS ONE 9:e114537. doi: 10.1371/journal.pone.0114537
Winer, J. A. (1984). The human medial geniculate body. Hear. Res. 15, 225–280. doi: 10.1016/0378-5955(84)90031-5
Winer, J. A. (2005a). Decoding the auditory corticofugal systems. Hear. Res. 207, 1–9. doi: 10.1016/j.heares.2005.06.007
Winer, J. A. (2005b). “Three systems of descending projections to the inferior colliculus,” in The Inferior Colliculus, eds J. A. Winer and C. E. Schreiner (New York, NY: Springer-Verlag), 231–247. doi: 10.1007/0-387-27083-3_8
Wood, K. C., Town, S. M., and Bizley, J. K. (2019). Neurons in primary auditory cortex represent sound source location in a cue-invariant manner. Nat. Commun. 10, 1–15. doi: 10.1038/s41467-019-10868-9
Yildiz, I. B., and Kiebel, S. J. (2011). A hierarchical neuronal model for generation and online recognition of birdsongs. PLoS Comput. Biol. 7:e1002303. doi: 10.1371/journal.pcbi.1002303
Yu, X.-J., Xu, X.-X., He, S., and He, J. (2009). Change detection by thalamic reticular neurons. Nat. Neurosci. 12, 1165–1170. doi: 10.1038/nn.2373
Zabbah, S., Rajaei, K., Mirzaei, A., Ebrahimpour, R., and Khaligh-Razavi, S. M. (2014). The impact of the lateral geniculate nucleus and corticogeniculate interactions on efficient coding and higher-order visual object processing. Vis. Res. 101, 82–93. doi: 10.1016/j.visres.2014.05.006
Keywords: predictive coding, medial geniculate body, inferior colliculus, abstract processing, sensory coding, auditory processing, subcortical sensory pathway
Citation: Tabas A and von Kriegstein K (2021) Adjudicating Between Local and Global Architectures of Predictive Processing in the Subcortical Auditory Pathway. Front. Neural Circuits 15:644743. doi: 10.3389/fncir.2021.644743
Received: 21 December 2020; Accepted: 16 February 2021;
Published: 12 March 2021.
Edited by:
Livia de Hoz, Charité - Universitätsmedizin Berlin, GermanyReviewed by:
Manuel S. Malmierca, University of Salamanca, SpainKasia M. Bieszczad, Rutgers, The State University of New Jersey, United States
Copyright © 2021 Tabas and von Kriegstein. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Alejandro Tabas, YWxlamFuZHJvLnRhYmFzJiN4MDAwNDA7dHUtZHJlc2Rlbi5kZQ==