Neural field theory of adaptive effects on auditory evoked responses and mismatch negativity in multifrequency stimulus sequences

Babaie-Janvier, Tahereh; Gabay, Natasha C.; McInnes, Alexander; Robinson, Peter A.

doi:10.3389/fnhum.2023.1282924

HYPOTHESIS AND THEORY article

Front. Hum. Neurosci., 03 January 2024

Sec. Brain Imaging and Stimulation

Volume 17 - 2023 | https://doi.org/10.3389/fnhum.2023.1282924

Neural field theory of adaptive effects on auditory evoked responses and mismatch negativity in multifrequency stimulus sequences

Tahereh Babaie-Janvier^1,2

Natasha C. Gabay^1,2

Alexander McInnes¹

Peter A. Robinson^1,2^*

¹School of Physics, The University of Sydney, Sydney, NSW, Australia
²Center of Excellence for Integrative Brain Function, The University of Sydney, Sydney, NSW, Australia

Physiologically based neural field theory (NFT) of the corticothalamic system, including adaptation, is used to calculate the responses evoked by trains of auditory stimuli that differ in frequency. In oddball paradigms, fully distinguishable frequencies lead to different standard (common stimulus) and deviant (rare stimulus) responses; the signal obtained by subtracting the standard response from the deviant is termed the mismatch negativity (MMN). In this analysis, deviant responses are found to correspond to unadapted cortex, whereas the part of auditory cortex that processes the standard stimuli adapts over several stimulus presentations until the final standard response form is achieved. No higher-order memory processes are invoked. In multifrequency experiments, the deviant response approaches the standard one as the deviant frequency approaches that of the standard and analytic criteria for this effect to be obtained. It is shown that these criteria can also be used to understand adaptation in random tone sequences. A method of probing MMNs and adaptation in random tone sequences is suggested to makes more use of such data.

1 Introduction

Neural processing of sensory information in normal and abnormal states is commonly investigated using evoked responses (ERs) to impulsive stimuli, measured via non-invasive methods of electroencephalography (EEG) or magnetoencephalography (MEG) (Näätänen and Alho, 1997; Tervaniemi et al., 1997; Luck and Kappenman, 2011; Niedermeyer and Lopes da Silva, 2011; Luck, 2014). A so-called oddball paradigm is widely used to analyze the effects evoked by any violation of regularity—e.g., changes in frequency, location, duration, or intensity (Näätänen, 2003; Luck and Kappenman, 2011; Luck, 2014), in which a train of so-called standard ( $S$ ) stimuli is interrupted by rarer stimuli, termed deviant ( $D$ ), as seen in Figure 1. Such irregularities elicit very different responses when the two types of stimuli are fully discriminable, whereas marginally discriminable stimuli give an intermediate response (Sams et al., 1985; Näätänen, 2003; Garrido et al., 2009a, 2013). Throughout this article, we denote stimuli with calligraphic font to distinguish them from responses, written in italic.

Figure 1

Figure 1. Schematic examples of typical standard (blue) and deviant (red) evoked responses to auditory stimuli in an auditory oddball experiment. Traditional phenomenological “components” (peaks and troughs) are labeled N1, N2, and P3. Note that negative signals correspond to the upward direction in these plots, in accord with convention.

Significantly, the first stimulus in a stimulus train always evokes a D response, whereas standard responses only emerge after a few successive presentations, approaching their limiting form S_∞ over an adaptation timescale of several seconds (Näätänen, 2003; Garrido et al., 2009a, 2013; Luck and Kappenman, 2011; Luck, 2014). Involvement of adaptation is also inferred because a pause in stimulation causes S responses to relax to the D form over a few seconds (Cowan, 1984; Winkler et al., 1993; Loveless et al., 1996; Näätänen, 2003). Similarly, when two $D$ stimuli occur consecutively or an $S$ follows two $D$ s, both D and S responses differ from their prototypical forms (Sams et al., 1984). This implies that adaptation to recent stimuli is at least partly responsible for the different responses. It thus counts against interpretations that assert that the system establishes expectations of the long-term statistical properties of incoming stimuli (Näätänen et al., 1978, 1989, 1993, 2005, 2010; Tiitinen et al., 1993; Kraus et al., 1995, 1996; Tervaniemi et al., 1997; Atienza et al., 2001; Näätänen, 2003; Garrido et al., 2009b; Luck and Kappenman, 2011; Luck, 2014), although it does not rule out some contribution from such effects.

A D response is also evoked by other irregularities within a train of stimuli, including a repeated tone in an otherwise descending sequence where no prior tone is repeated (Näätänen et al., 1989; Tervaniemi et al., 1997; Näätänen, 2003; Garrido et al., 2009b, 2013); after a stimulus that is omitted or changed in duration or intensity (Näätänen et al., 1989, 2007; Yabe et al., 1997; Näätänen, 2003; Salisbury, 2012); or when the overall frequency range of an ensemble of random stimuli exceeds the discriminability threshold (Sams et al., 1985; Garrido et al., 2013).

ERs are most commonly phenomenologically parameterized by the timings and amplitudes of so-called components, which approximately correspond to peaks and troughs in the waveform (Luck and Kappenman, 2011; Luck, 2014). It is widely assumed that each component has a fixed timing (latency) and polarity (positive or negative) in normal subjects and that cognitive processes only change their amplitudes (Hillyard and Anllo-Vento, 1998; Hillyard et al., 1998). In this vein, the S response is often subtracted from the D response to compute the so-called mismatch negativity (MMN), which has been argued to be a separate component that results from top-down memory-based comparison processes in higher-order cortical areas that flag deviance from a pre-established regularity (Näätänen et al., 1978, 1989, 1993, 2005, 2010; Tiitinen et al., 1993; Kraus et al., 1995, 1996; Tervaniemi et al., 1997; Atienza et al., 2001; Näätänen, 2003; Garrido et al., 2009b; Luck and Kappenman, 2011; Luck, 2014). In the present study, we base our description of ERs on the underlying physical brain activity that they reflect, and only use component terminology as a convenient shorthand to designate timings and polarities of peaks and troughs. In this notation, N1 and N2 denote negative peaks at around 100 and 200 ms post-stimulus, and P3 denotes a positive peak at 300 ms.

An alternative to the above view is that the auditory MMN is the result of cortical adaptation to repeated $S$ stimuli that changes the S response at the relevant point of the tonotopic map, whereas the point that corresponds to the $D$ stimuli undergoes little adaptation, which mostly relaxes before the next such stimulus arrives (Atienza et al., 2001; Jääskeläinen et al., 2004). This does not preclude contributions from higher-order memory processes; however, basic biophysics, the evolution of S and D responses during long trains, the decay of their distinction during a few-second stimulation pause, and the existence of MMN in coma (during which there is arguably no higher order processing) all imply a role for adaptation (Schröger, 1998; Näätänen, 2003; Jääskeläinen et al., 2004; Sussman et al., 2014). Ruusuvirta (2021) pointed out that there are still uncertainties about the precise mechanisms of adaptation and its importance in determining ER structure, which reinforces the need to test the extent to which adaptation can even potentially account for ER structure.

Our approach to testing the potential role of adaptation is to model ERs mechanistically in terms of the response of cortical activity to incoming stimuli, focusing on the effects of slow adaptation and adding frequency dependence to our prior neural field theory (NFT; Robinson et al., 2021) to see whether they can account for the evolution of ERs from deviant to standard form when driven by multiple stimuli, and for frequency-dependent features. NFT has been extensively used to model ERs, ongoing EEG characteristics, and other phenomena (Rennie et al., 2002; Robinson et al., 2002; Kerr et al., 2008, 2009, 2011; Babaie-Janvier and Robinson, 2018, 2019, 2020; Mukta et al., 2020). In particular, Kerr et al. (2011) successfully fitted NFT impulse-response models of S and D ERs to data from cohorts of up to nearly 1,500 subjects, albeit without including adaptation. They showed that inferred prestimulus parameters for S and D responses could be significantly different from each other and from those of background EEG (van Albada et al., 2010; Kerr et al., 2011). Our recent study (Babaie-Janvier and Robinson, 2018, 2019, 2020) also showed that stimulus-driven gain changes occur as part of ERs and affect their form. Most recently, Robinson et al. (2021) incorporated adaptation into the NFT model of ERs and used it to calculate S and D responses to sequences of simple stimuli, including the development of distinct response characteristics. This provides a means by which a wide range of experimental outcomes can be reproduced using a single model, including the entire waveform, not just its peaks and troughs. Moreover, it predicts observed changes in amplitudes and timings of oscillations due to changes in corticothalamic parameters, implying that fixed-latency components do not best reflect the underlying dynamics. This quantitative approach also enables one to determine how much of the dynamics can be accounted for by adaptation and what remainder might be due to higher-order top-down memory-related stimulus-comparison processes. Using these physically based approaches, they showed that the building blocks of responses are the same damped corticothalamic oscillations that account for ongoing EEG characteristics and other phenomena.

In the present study, we use our recent NFT model (Babaie-Janvier and Robinson, 2020; Robinson et al., 2021) and generalize it to incorporate the auditory tonotopic map to allow for stimuli to overlap in their adaptive effects, instead of being assumed to be entirely distinct. This method enables us to treat responses to a train of stimuli in which frequent and infrequent tones are not fully distinguishable (Robinson et al., 2021), and to relate the response characteristics to the probability distribution of random stimuli (Garrido et al., 2013). We thus aim to explore the extent to which adaptation can account for the occurrence of standard, deviant, and intermediate responses to trains of stimuli that can differ in frequency.

The structure of the article is as follows: Section 2 provides an overview of the necessary background theory for an interdisciplinary readership, followed by extension of the NFT of ERs with adaptation to also include tonotopy, in the absence of higher-order feedbacks. In Section 3, we use the model to predict the MMN as a function of stimulus discriminability in an oddball paradigm and predict responses in random tone experiments. In each case, the results are compared with experimental outcomes in the literature. Section 4 summarizes the main findings and outlines directions for future study.

2 Materials and methods

This material summarizes and further develops the necessary theory for our analysis. Section 2.1 briefly summarizes the relevant background aspects of the use of physiologically based neural field theory in modeling large-scale brain activity and reviews the essential components of our specific corticothalamic model, which has previously been successfully tested against experimental results in other contexts (Rennie et al., 2002; Kerr et al., 2008, 2011; Babaie-Janvier and Robinson, 2020; Robinson et al., 2021). To avoid undue repetition, we refer the reader particularly to the study by Robinson et al. (2021) for further details of the model and its mathematical treatment in both time and frequency domains, so that new aspects can be focused on here. Section 2.2 then discusses how the tonotopic map and auditory inputs are treated and establishes criteria for significant adaptation. The connection to measured ERs is then discussed in Section 2.3.

2.1 NFT of corticothalamic evoked responses

Cortical evoked responses (ERs), as measured by EEG or MEG techniques, are generated primarily by perturbations in the activity ϕ_e arriving at synapses of pyramidal excitatory cells due to dynamics in the corticothalamic system (Nunez and Cutillo, 1995). Our corticothalamic model, shown in Figure 2, incorporates the cortex and thalamus and their connectivities; each includes distinct population of neurons: cortical excitatory (e) and inhibitory (i) neurons, the thalamic reticular nucleus (TRN; r), thalamic relay neurons (s), and non-corticothalamic neurons that provide external inputs (n). In this study, the relevant relay nucleus is the medial geniculate nucleus (MGN), whose projections are to the primary auditory cortex (A1). The model incorporates the auditory projection system with reciprocal corticothalamic feedback projections, excitatory projections to the TRN from MGN-A1 feedforward axons and A1-MGN feedback axons, and inhibitory projections from the TRN onto MGN relay neurons.

Figure 2

Figure 2. (A) Physiologically based corticothalamic model in which the arrows represent excitatory effects and the circles depict inhibitory ones. The populations are cortical excitatory (e) and inhibitory (i) neurons, the thalamic reticular nucleus (r), thalamic relay neurons (s) that project to the cortex, and non-corticothalamic neurons responsible for external inputs (n). (B) Schematic of propagation of neural activity to population a from population b, where both fast (through η) and slow (through μ) modulation of the neuronal gain by local feedback is given by Equation (9).

The NFT discussed by Robinson et al. (2021) yields partial differential equations for the mean firing rates ϕ_a of neurons in the various structures mentioned in the previous paragraph, with a = e, i, r, s, n. The firing rate ϕ_e in cortical pyramidal neurons has previously been shown to be the one most closely related to EEG signals (Nunez and Cutillo, 1995; Nunez and Srinivasan, 2006), a relationship we continue to assume here.

Solution of physiologically based NFT equations of the corticothalamic model first yields spatially uniform steady states of the system, which are interpreted as characterizing the baseline of normal activity, with firing rates ϕ_a that are in accord with experiment (Robinson et al., 2002, 2004). Linear perturbations from these steady states have been shown to correspond to time dependent brain activity, leading to successful comparisons with numerous experimental phenomena, including evoked responses (Robinson et al., 1997, 2002, 2004, 2005; Rennie et al., 2002; O'Connor and Robinson, 2004; Kerr et al., 2008; van Albada et al., 2010; Roberts and Robinson, 2012; Abeysuriya et al., 2015). In this study, we consider only the large-scale global ER because it has been shown to dominate the measurable signal, as seen in Figure 12 of Mukta et al. (2020).

Application of the Laplace transform

\begin{array}{l} L [f (t)] (s) = f (s) = \int_{0}^{\infty} f (t) e^{- s t} d t, & (1) \end{array}

to the NFT equations yields the following equation for the activity ϕ_a at each neural population a in terms of activity arriving from other populations b:

\begin{array}{l} {\hat{D}}_{a} (s) [ϕ_{a}^{(0)} + ϕ_{a}^{(1)} (s)] = \hat{L} (s) \sum_{b} G_{a b} [ϕ_{b}^{(0)} + ϕ_{b}^{(1)} (s) e^{(- s τ_{a b})}], & (2) \end{array}

where we retain first order perturbations (superscript 1) from the steady state (superscript 0), and

\begin{array}{l} {\hat{D}}_{a} (s) = {(1 + s / γ_{a})}^{2}, & (3) \end{array}

\begin{array}{l} \hat{L} (s) = α β / [(s + α) (s + β)], & (4) \end{array}

where $\hat{L} (s)$ is the operator that embodies the temporal response of cell-body potentials to afferent pulse rate fields ϕ_b by encapsulating the rates β and α of the response's rise and fall, ${\hat{D}}_{a} (s)$ corresponds to a damped wave operator (Jirsa and Haken, 1996; Robinson et al., 1997) with the damping rate γ_a satisfying γ_a = v_a/r_a, where r_a and v_a are the characteristic range and conduction velocity of axons of type a (in the corticothalamic system, only the axons of excitatory cortical neurons are long enough to cause significant propagation effects on large scales; in the other populations, we assume the axonal length to be small enough that it can be neglected, whence r_a ≈ 0 and ${\hat{D}}_{a} \approx 1$ ), and the gains G_ab, in general, are the response in neuron a due to unit input from neuron b; i.e., the number of additional pulses out for each additional pulse in.

For first-order perturbations, we can write

\begin{array}{l} ϕ_{e}^{(1)} (t) = \int_{- \infty}^{t} T_{e n} (t - t^{'}) ϕ_{n}^{(1)} (t^{'}) d t^{'}, & (5) \end{array}

for a purely temporal response, where the other linear perturbations have been eliminated from the equations, T_en is the resulting linear response function, which embodies the system linear response to a perturbation, with $T_{e n} (t - t^{'}) = 0$ for t < t′ to preserve causality. In Equation (5), $ϕ_{n}^{(1)}$ is the incoming non-corticothalamic stimulus to the corticothalamic system. The form in Equation (1) can be generalized to include spatial aspects, but here we focus on the temporal domain to bring out the main aspects without undue complexity (Kerr et al., 2008). Equation (1) can be Laplace transformed to yield

\begin{array}{l} ϕ_{e}^{(1)} (s) = T_{e n} (s) ϕ_{n}^{(1)} (s), & (6) \end{array}

which expresses the transfer function as the ratio of output to input in the Laplace domain. If the input in Equation (6) is a delta function $ϕ_{n} (t^{'}) = δ (t^{'} - t_{0})$ , one finds

\begin{array}{l} ϕ_{e}^{(1)} (t) = T_{e n} (t - t_{0}), & (7) \end{array}

whence we see that the transfer function and the ER to a delta input are one and the same. More generally, subsequent physical phenomena such as volume conduction, measurement effects, and postprocessing should be included in the overall transfer function from stimulus to measurement, but we omit discussion of these issues because they do not strongly affect the time course of large-scale ERs, which is our focus here.

The transfer function itself can be changed by the stimulus, owing to a variety of fast and slow dynamical effects that cause the gains G_ab to evolve in time (Koch, 1999; Rennie et al., 1999, 2000, 2002; Robinson and Roy, 2015; Babaie-Janvier and Robinson, 2019) due to current or recent activity, including plasticity, long-term potentiation/depression, adaptation, facilitation, habituation, and sensitization (Koch, 1999; Rennie et al., 2000; Robinson and Roy, 2015; Babaie-Janvier and Robinson, 2019). Rennie et al. (1999), Koch (1999), Robinson et al. (2002), and Robinson and Roy (2015) introduced a general mathematical form for gain changes that are driven by local activity and that relax toward equilibrium with a characteristic timescale that can be applied to a broad range of local feedback mechanisms in which presynaptic neuronal activity modulates neuronal gains. For moderate perturbations, it yields

\begin{array}{l} G_{a b} (s) = G_{a b}^{(0)} + G_{a b}^{(1)} (s), & (8) \end{array}

\begin{array}{l} G_{a b}^{(1)} (s) = [g_{a b} F (s) + h_{a b} H (s)] ϕ_{b}^{(1)} (s), & (9) \end{array}

where $G_{a b}^{(0)}$ is the static gain and $G_{a b}^{(1)}$ is the gain perturbation caused by local feedback. Here, F(t) describes the temporal dynamics of fast gain modulation on timescales of up to a few hundred ms and g_ab is its strength, whereas H(t) is a slow adaptation process on timescales of 5 – 10 s, with h_ab the corresponding strength; g_ab and h_ab are assumed constant in the present study. Figure 2 depicts this modulation schematically. Gain dynamics driven by postsynaptic firing is postponed to future study, but can be treated similarly (Rennie et al., 1999; Robinson and Roy, 2015; Robinson et al., 2021). For simplicity, we use the forms

\begin{array}{l} F (s) = η / (s + η), & (10) \end{array}

\begin{array}{l} H (s) = μ / (s + μ) . & (11) \end{array}

In the time domain, F(t) = H(t) = 0 for t < 0 to enforce causality, while the positive rate constants η and μ are the inverse timescales of the modulatory processes and the forms (10) and (11) are normalized to unit integral over time. Previous study found η = 25 s⁻¹ (Rennie et al., 1999; Babaie-Janvier and Robinson, 2019), and later study set μ = 0.65 s⁻¹ because of the several-second timescales over which S response characteristics develop and decay (Robinson et al., 2021). Substituting the dynamic form of G_ab from Equations (8) and (9) into Equation (2), one finds

\begin{array}{l} {\hat{D}}_{a} (s) ϕ_{a}^{(1)} (s) = \hat{L} (s) \sum_{b} [G_{a b}^{(0)} e^{- s τ_{a b}} + ϕ_{b}^{(0)} {g_{a b} F (s) + h_{a b} H (s)}] ϕ_{b}^{(1)} (s), & (12) \end{array}

the right side of which expresses two types of first-order responses: the first term in the square brackets is the response that would occur without change to the steady-state gains, while the second term is the response due to stimulus-induced gain changes acting on the steady-state activity (Robinson et al., 2021).

It is straightforward to eliminate the other first-order quantities to obtain the transfer function to excitatory cortical activity from auditory signals that reach the thalamus (Babaie-Janvier and Robinson, 2019, 2020), giving

\begin{array}{l} T_{e n} (s) = \frac{ϕ_{e}^{(1)} (s)}{ϕ_{n}^{(1)} (s)} = \frac{χ_{e s n} (s)}{M_{c} (s) P_{t} (s) - P_{c} (s)}, & (13) \end{array}

which expresses the ratio of the response change $ϕ_{e}^{(1)}$ to a change in the input $ϕ_{n}^{(1)}$ (i.e., to a stimulus). The full analysis shows that the various terms in this equation have the specific forms (Robinson et al., 2021)

\begin{array}{l} χ_{a b} (s) = \hat{L} (s) [G_{a b}^{(0)} e^{- s τ_{a b}} + ϕ_{b}^{(0)} {g_{a b} F (s) + h_{a b} H (s)}], & (14) \end{array}

\begin{array}{l} M_{c} (s) = {\hat{D}}_{e} (1 - χ_{e i}) - χ_{e e}, & (15) \end{array}

\begin{array}{l} P_{t} (s) = 1 - χ_{s r s},, & (16) \end{array}

\begin{array}{l} P_{c} (s) = χ_{e s e} + χ_{e s r e}, & (17) \end{array}

χ_abc = χ_ab χ _bc. Table 1 lists nominal values of model parameters for resting EEG (Robinson et al., 2004) and gain modulation parameters calibrated and used in previous studies (Babaie-Janvier and Robinson, 2019, 2020; Robinson et al., 2021). These values were estimated for normal adults and have been extensively used and verified in comparisons with experiments, as mentioned in Section 1.

Table 1

Table 1. Estimated brain parameters for normal adults in the alert eyes-open state.

2.2 Stimulus profile at auditory cortex

In our previous study (Robinson et al., 2021), we assumed that $S$ and $D$ stimuli could be clearly distinguished via a large frequency separation and that EEG electrodes just responded to the total response without distinguishing spatial locations. In the current study, we relax the first assumption but retain the second. This requires us to examine the frequency content of the stimulus and its mapping to auditory cortex.

2.2.1 Tone-burst stimulus

ER experiments typically use short tone bursts of sinusoidal waves of frequency f₀ and duration τ. Such a burst can be written

\begin{array}{l} X (t) = sin (2 π f_{0} t) W (t) [H (t) - H (t - τ)], & (18) \end{array}

where Θ(u) is the step function

\begin{array}{l} Θ (u) = {\begin{matrix} 1, & 0 \leq u, \\ 0, & u < 0 . \end{matrix} & (19) \end{array}

In Equation (18), the difference of the two step functions restricts the stimulus to the interval 0 < t < τ and we use the notation $X$ to indicate that this is the externally applied stimulus, which still has to be transduced by auditory pathways before arriving at the cortex as ϕ_n. The remaining factor in Equation (18) is the window function W(t) that determines the shape of the burst within the overall interval τ. To minimize the generation of side-lobes at frequencies far from f₀, we use the Tukey window

\begin{array}{l} W (u) = {\begin{array}{l} {sin}^{2} [\frac{π t}{2 p τ}], & u < p τ; \\ 1, & ρ τ < u < (1 - p) τ; \\ {sin}^{2} [\frac{π (τ - t)}{2 p τ}], & (1 - p) τ < u; \end{array} & (20) \end{array}

which smooths the burst over an interval pτ at each end of the interval, with p < 0.5; we use p = 0.2. Fourier or Laplace transformation of Equation (18) implies that a frequency range Δf_u ≈ 2/[(1 − ρ)τ] is present in a tone burst (Bracewell, 1986), which expresses the frequency–time uncertainty relation. Figure 3 shows a tone burst of f₀ = 200 Hz and τ = 50 ms with p = 0.2, along with its Fourier spectrum.

Figure 3

Figure 3. Tone burst of the form in Equation (18) at f₀ = 200 Hz and τ = 50 ms with p = 0.2. (A) Time series showing the sinusoidal signal modulated by the Tukey window. (B) Corresponding frequency spectrum.

2.2.2 Transfer to the auditory cortex via the tonotopic map

When an auditory stimulus arrives at the ear, it passes via the eardrum and stapes to the cochlea, which narrows progressively with distance. Cilia near the entrance respond most strongly to low-frequency signals, while those further in respond to higher frequencies. Each group of cilia stimulates neurons that correspond to a narrow range of frequencies around its preferred one, with a firing rate that is proportional to the logarithm of the intensity (Pickles, 2013). Thus, when a frequency is present, the corresponding neurons are active with a firing rate that depends on the intensity of the signal at that frequency.

Neurons at various stages of the auditory pathway remain topographically arranged according to their optimal frequency response. This tonotopic organization mirrors the distribution of receptors in the cochlea, with a gradient extending between neurons that preferentially respond to high frequencies and those that respond best to low frequencies. Tonotopy is preserved via the medial geniculate nucleus (MGN) of the thalamus to the primary auditory cortex, where frequencies are approximately logarithmically spaced in a one-dimensional tonotopic map, in which each frequency f is mapped to a position x(f) (Talavage et al., 2004; Herdener et al., 2013; Saenz and Langers, 2014).

There is some spreading of neural projections in the pathways to the auditory cortex. This means that a pure tone of a certain frequency f₀ stimulates cortical neurons across a small range of adjacent locations around x(f₀), corresponding to a frequency range Δf_nat, which is around 0.3% of f₀ for frequencies of order 1–2 kHz, which are typical in ER experiments, and about 3 Hz for frequencies below 1,000 Hz.

Any sinusoidal wave train that is cut short to a time interval of length ΔT to make a tone burst has an unavoidable spread in frequency Δf_u ≈ 1/T, via the uncertainty principle. Hence, the total spread of cortical stimulation corresponds to a spread of frequencies Δf, with

\begin{array}{l} Δ f \approx \sqrt{{(Δ f_{nat})}^{2} + {(Δ f_{u})}^{2}} . & (21) \end{array}

Since it is mathematically impossible to say simultaneously exactly which frequencies are present at which times during a short burst, due to the uncertainty principle, we approximate their effect on the cortex by assuming that they are all present throughout the interval of the burst. This approximation is well justified for durations of only a few tens of ms because the dynamics of the cortical response effectively integrate over the burst which appears like a delta-function in time if it is sufficiently short. Therefore, the stimulus that arrives at the primary auditory cortex can be approximated as

\begin{array}{l} ϕ_{n} (x, s) = \int_{- \infty}^{+ \infty} w (x - x_{0}) S (f_{0}, s) d x, & (22) \end{array}

where x₀ = x(f₀) and ∫w(x − x₀)dx = 1. A suitable approximate form is

\begin{array}{l} w (x - x_{0}) = \frac{1}{Δ f \sqrt{2 π}} exp [- \frac{{(x - x_{0})}^{2}}{2 {(Δ f)}^{2}}] . & (23) \end{array}

It is worth noting that the stimulus profile at the cortex can be calculated in one of two nearly equivalent ways: (i) first calculate the stimulus spectrum for given f₀ and τ, giving its intrinsic width ~Δf_u and then convolve the spectrum with the function that governs the spread Δf_nat of afferents to the auditory cortex to yield the total spread on the tonotopic map, or (ii) first specify f₀ and Δf_u and then use the total spread from the above equation to estimate the spread on the tonotopic map directly. Here, we use the latter approach, which leads to a more straightforward implementation.

2.2.3 Criteria for significant adaptive effects

Now that Δf has been defined, we can now easily define criteria that characterize when significant adaptive effects due to one stimulus will affect another to produce a more S-like second response.

If a tone burst of central frequency f = f₀ and duration τ occurs at t = t₀, it causes adaptive changes within a neighborhood of f₀ of width Δf. These changes last a time t_H ≈ 5 − 10 s, so any stimulus occurring at f₁ in that frequency range and t₁ in that time interval will encounter a region of auditory cortex that has undergone adaptation due to the first stimulus. It is convenient to use the following two parameters when investigating adaptive effects:

\begin{array}{l} ρ = \frac{| f_{0} - f_{1} |}{Δ f}, & (24) \end{array}

\begin{array}{l} ζ = R (f_{0}, Δ f) t_{H} . & (25) \end{array}

The discriminability ρ is the ratio of the frequency separation to the spectral width Δf of $ϕ_{n}^{(1)}$ and is large when the two spectra stimulate quite different parts of the auditory cortex. The quantity ζ is the product of the rate R(f₀, Δf) at which stimuli arrive within Δf of f₀ and the adaptation timescale t_H and represents the mean number of stimuli that might potentially be affected by the first stimulus, or equivalently, the number of previous stimuli that might affect it. Significant adaptive effects will only occur if ρ ≲ 1 and ζ ≳ 1.

2.3 Measured ER

Once the stimulus $ϕ_{n}^{(1)} (f, t)$ is known as a function of frequency and time on the primary auditory cortex, its local contributions to the ER, including adaptation, can be calculated using Equations (5) and (13). In the present case, we assume that the recording electrode responds to the whole stimulated cortical area, so the measured ER is obtained by integrating over all frequencies.

2.3.1 ER: first stimulus

Let us consider the first ER in a sequence, so there has been no prior adaptation and the transfer function does not depend on position x (i.e., on frequency). The cortical response to $ϕ_{n}^{(1)} (x, s)$ is

\begin{array}{l} ϕ_{e}^{(1)} (x, s) = T_{e n} (s) ϕ_{n}^{(1)} (x, s), & (26) \end{array}

\begin{array}{l} = T_{e n} (s) \int w (x - x^{'}) ϕ_{c}^{(1)} (x^{'}, s) d x^{'}, & (27) \end{array}

because

\begin{array}{l} ϕ_{n}^{(1)} (x, s) = \int w (x - x^{'}) ϕ_{c}^{(1)} (x^{'}, s) d x^{'} . & (28) \end{array}

Here, we have written $ϕ_{c}^{(1)}$ for the auditory neural signal that would arrive from the cochlea if there were no spreading due to anatomical effects or the finite duration of the tone burst, while the weight function w(x − x′) is used to incorporate both these effects. This function has central peak and a characteristic width Δx that corresponds to Δf, and is normalized to satisfy

\begin{array}{l} \int w (x - x^{'}) d x^{'} = 1; & (29) \end{array}

We also assume that w is symmetric, with w(x − x′) = w(x′ − x), as in the example in Equation (23). This formulation is simpler than, but not quite as accurate as, the alternative of w representing only the spread due to Δf_nat and using the actual spectral profile of the tone burst, as discussed in Section 2.2.2.

Because we assume that the electrode that detects the ER does not resolve the fine spatial scales of the tonotopic map, we must integrate the response over x, so we find, aside from an overall normalization,

\begin{array}{l} ER (s) = \int ER (x, s) d x, & (30) \end{array}

\begin{array}{l} ER (x, s) = ϕ_{e}^{(1)} (x, s), & (31) \end{array}

\begin{array}{l} = T_{e n} (s) \int w (x - x^{'}) ϕ_{c}^{(1)} (x^{'}) d x^{'}, & (32) \end{array}

from Equations (26) and (28). Hence, upon substituting Equation (32) into Equation (30), we obtain

\begin{array}{l} ER (s) = \int T_{e n} (s) [\int w (x - x^{'}) ϕ_{c}^{(1)} (x^{'}, s) d x^{'}] d x, & (33) \end{array}

\begin{array}{l} = T_{e n} (s) \int [\int w (x - x^{'}) d x] ϕ_{c}^{(1)} (x^{'}, s) d x^{'}, & (34) \end{array}

\begin{array}{l} = T_{e n} (s) \int ϕ_{c}^{(1)} (x^{'}, s) d x^{'}, & (35) \end{array}

from Equation (29). Hence, the ER is the response to the total integrated signal that arrives at the auditory cortex at the frequency represented by s.

2.3.2 ER: subsequent stimuli

Because cortical stimulation from a tone burst centered at f₀ is strongest at x₀ = x(f₀), this point will experience the strongest adaptation, with adaptive changes falling off with distance (i.e., with the frequency difference). As a result, we must replace T(s) by T(x, s) in Equation (33) when the next stimulus arrives, as in previous studies where x was ignored. Specifically, T(x, s) is calculated by inserting the instantaneous values of the G_ab(t) into Equation (13), which will include long-lasting adaptive changes in general. This yields

\begin{array}{l} ER (x, s) = T_{e n} (x, s) \int w (x - x^{'}) ϕ_{c}^{(1)} (x^{'}, s) d x^{'}, & (36) \end{array}

in place of Equation (33). Hence, upon substituting Equation (36) into Equation (30), interchanging the order of integration, and recalling that w is symmetric, we find

\begin{array}{l} ER (s) = \int \int w (x - x^{'}) T_{e n} (x, s) ϕ_{c}^{(1)} (x^{'}, s) d x^{'} d x, & (37) \end{array}

\begin{array}{l} = \int [\int w (x^{'} - x) T_{e n} (x, s) d x] ϕ_{c}^{(1)} (x^{'}, s) d x^{'}, & (38) \end{array}

\begin{array}{l} = \int T_{eff} (x^{'}, s) ϕ_{c}^{(1)} (x^{'}, s) d x^{'} . & (39) \end{array}

Here, an effective transfer function has been defined to be

\begin{array}{l} T_{eff} (x^{'}, s) = \int w (x^{'} - x) T_{e n} (x, s) d x . & (40) \end{array}

This implies that the effective transfer function at x′ is a weighted average of those at neighboring points. Hence, if the core of a stimulated region has undergone strong adaptation, its effects will be mixed with those of edge regions where adaptation is weaker, thus leading to a mixture of dominant $S$ -like and weaker $D$ -like features in the ER. The result (39) reproduces our previous study if w is approximated as being very narrow. Then, Equation (40) yields $T_{eff} (x^{'}, s) \approx T_{e n} (x^{'}, s)$ . If w is a delta function (which is not possible in the real system), then $T_{e n} (x^{'}, s) \approx T_{e n} (x_{0}, s)$ , and we recover Equation (33) and our previous results if the redundant first argument is omitted. The fact that w always has a non-zero width implies that some mixing of characteristics will always occur.

3 Results

We now apply the above theory to model ERs in two studies from the literature: (i) an oddball paradigm in which the frequency offset between standard and deviant stimuli is varied to examine how discriminability affects the MMN (Sams et al., 1985) and (ii) a series of fixed-frequency probe tones inserted into a random-frequency tone sequence, in which the ER has been shown to depend on the probability of background tones in the vicinity of the probe frequency (Garrido et al., 2013). These are illustrated in Figures 4A, B, respectively.

Figure 4

Figure 4. Stimulus sequences used in the experiments analyzed, with central stimulus frequencies indicated by dots at the stimulus onset times. (A) Oddball paradigm. (B) Random-frequency sequence with probe stimuli shown as triangles and squares.

3.1 Oddball sequence

Figure 4A schematically shows the auditory oddball paradigm used by Sams et al. (1985) to investigate the effects of frequency discriminability on the difference between D and S responses. It is common to term the difference between the two responses the mismatch negativity (MMN), with

\begin{array}{l} MMN (D, S, t) = D (t) - S (t) . & (41) \end{array}

The MMN can be defined for any pair of responses, but it is most common to use the limiting form of S(t) after a long sequence of identical stimuli as the reference. We write this form as S_∞(t).

In the experiments of Sams et al. (1985), a series of 1,000 Hz standard tones ( $S$ ) of duration τ = 50 ms with ~1 ms rise and fall times was presented with an interstimulus interval of 1 s. These were replaced by deviant tones $D$ with a probability of 0.2, which differed only in their frequency, which was fixed at 1,002, 1,004, 1,008, 1,016, or 1,032 Hz, respectively, in each of five trials.

At 1,000 Hz, Δf_nat ≈ 3 Hz, while τ = 50 ms implies Δf_u ≈ 20 Hz, so Δf ≈ 20 Hz, dominated by the spectral width of the tone burst. Equations (24) and (25) then imply that ρ = 0.1, 0.2, 0.4, 0.8, 1.6 and ζ = 5 − 10. Hence, we predict that deviant frequencies of 1,002 and 1,004 Hz would produce S-like responses, those of 1,016 and 1,032 Hz would be D-like, and those of 1,008 Hz would be intermediate with significant D-like characteristics.

In this study, we denote the unadapted deviant response as $D_{1}$ . Therefore, a baseline MMN₀ can be defined as

\begin{array}{l} {MMN}_{0} (t) = MMN (D_{1}, S_{\infty}, t) = D_{1} (t) - S_{\infty} (t) . & (42) \end{array}

Figure 5 compares experimental results with the results of our numerical calculations for the above parameters; we use regularly spaced $D$ stimuli, with one every 5 stimuli, to avoid the need to average. Figure 5A shows that the D responses for 1, 002 Hz are almost equal to the S response, with the D response slightly sharper and only a very small MMN. These findings are in agreement with experimental results by Sams et al. (1985) and are as expected because the 2 Hz frequency offset is much smaller than Δf ≈ 20 Hz, so all stimuli cause adaptation in overlapping regions of the cortex. As the frequency offset increases through Figures 5B–E, the response progressively evolves away from S_∞ toward D₁, and is nearly identical to the latter for offsets of 16 and 32 Hz, with a correspondingly larger MMN that approximates MMN₀. All these results are in accord with the experimental findings of Sams et al. (1985) and imply that our estimate of Δf provides a good estimate of when the transition is complete; at half this value, an intermediate form is seen, as is evident in Figure 5C. Note that the residual differences between the theoretical and experimental curves cannot be considered to be significant because only six subjects' data were averaged to obtain these curves and ERs typically exhibit significant intersubject variation.

Figure 5

Figure 5. Model S and D responses for different deviant frequencies compared with experimental results adapted from Sams et al. (1985). Each row presents the results for a particular deviant frequency, as labeled in the second column. For each frequency, the first column shows model predictions for ERs excited with the deviant frequency (red) compared with the baseline D₁ response (black solid) and the fully adapted S_∞ response (black dashed); the second column shows the corresponding experimental result for deviant (heavy line) and standard (light line) stimuli; the third column shows the model MMN (solid) compared with MMN₀ (dotted), and the final column shows the experimental MMN. (A) ERs and MMNs for deviant frequency f_D = 1, 002 Hz. (B) Same as (A) for f_D = 1, 004 Hz. (C) Same as (A) for f_D = 1, 008 Hz. (D) Same as (A) for f_D = 1, 016 Hz. (E) Same as (A) for f_D = 1, 032 Hz.

Individual ERs occur on timescales of a few hundred ms, which are much shorter than the adaptation timescale of 5–10 s. Hence, they can be viewed as being the impulse responses of a cortical region that has adapted from having its initial gains [_{G_ab]i} (see Table 1) to gains determined by the slow adaptation parameters h_ab, at the end of a long sequence of standard stimuli. In our case, we write the latter gains as [_{G_ab]f} and state their values after 10 stimuli in Table 1. Comparison of the initial and final gains in Table 1 shows that the largest fractional changes within the corticothalamic system involve increases in the magnitudes of inhibitory gains (especially, ei and sr) and reductions in excitation (especially, es and se); there is a countervailing increase in the sn gain where stimuli enter the system. Overall, this is consistent with the overall level of activity being approximately maintained, but there being a substantially lower positive feedback in the corticothalamic loop that is comprised of the es and se connections. This loop is chiefly responsible for generating ~10 Hz alpha oscillations, so the reduction in its loop gain due to adaptation is consistent with the lower amplitude of such oscillations in the adapted (standard) response than in the initial (deviant) one. These results accord with our previous findings (Robinson et al., 2021), but with simultaneous and improved matches to typical standard and deviant responses.

3.2 Random-frequency sequence with probe stimuli

In the experiment of Garrido et al. (2013), illustrated in Figure 4B, subjects were presented with a random-frequency sequence of tones that could have either a narrow overall frequency distribution or a broad one. Superposed on this were two sequences of randomly spaced, fixed-frequency probe tones, one at the 500 Hz mean frequency of the random distribution, termed standard, $S$ , and one at four times that frequency, termed deviant, $D$ . Each probe sequence contained 10% of the overall number of stimuli. A key aim of the experiment was to explore how the S and D responses depended on the breadth of the background random frequency distribution and the frequency of the probe relative to its center—i.e., on the relative probability that random stimuli were in the vicinity of a given probe frequency.

Garrido et al. (2013) used stimuli with τ = 50 ms, rise and fall times of 10 ms (hence, an effective duration of 40 ms between half-maximum points), and interstimulus interval of 500 ms. The mean frequency of the Gaussian random distribution was 500 Hz, and it had a logarithmic standard deviation σ of either 0.5 octaves (narrow distribution, $N$ ) or 1.5 octaves (broad distribution, $B$ ). The probe frequencies were 500 and 2,000 Hz, and the timescale of their results implied t_H ≈ 10 s.

Garrido et al. (2013) published average responses, binned according to the quantity N_a, which was the number of immediately preceding tones that all fell outside a frequency window of width Δx of 1/3 octave (i.e., about 130 Hz each side of the 500 Hz probe tones, and 520 Hz each side of the 2,000 Hz probe tones). Large N_a was thus a very coarse-grained proxy for not being a recent stimulus at a nearby frequency, but the 1/3 octave range was not chosen based on the cortical response properties. We note that the experimental parameters give Δf ≈ 20 Hz, which corresponds to spreads of only Δx = 0.07 octaves at 500 Hz and 0.018 octaves at 2,000 Hz, so the bandwidth involved in calculating N_a is too wide for precise comparisons.

When considering a given probe frequency corresponding to x_p, we can rewrite Equation (25), for the typical number of prior stimuli that affect a given response, as

\begin{array}{l} ζ \approx R t_{H} \frac{2 Δ x}{σ \sqrt{2 π}} exp [- \frac{{(x_{p} - \bar{x})}^{2}}{2 σ^{2}}], & (43) \end{array}

where $\bar{x}$ corresponds to 500 Hz and 2Δx appears because this is the total range around x_p that drives adaptation at x_p. Using the above parameter values, we find ζ ≈ 2.3 for the $S N$ condition (standard probe amid a narrow background distribution), ζ = 0.75 for the $S B$ condition, ζ = 1.8 × 10⁻⁴ for the $D N$ condition, and ζ = 0.075 for the $D B$ condition. This implies that probe responses should be close to the fully adapted S form for the $S N$ and $S B$ conditions and close to the unadapted D form for the $D B$ and $D N$ conditions. Figure 6 illustrates the adaptation window that underlies Equation (43). The red square shows the arrival of a probe stimulus. Prior stimuli within a time t_H seconds before and within a frequency range of ±Δf adaptively affect the ER to the probe, particularly if are recent and close in frequency. The more prior stimuli lie in the window, the closer the ER will be to a fully adapted standard S_∞, whereas if the window is empty of prior stimuli, the ER will be close to the deviant D₁.

Figure 6

Figure 6. Schematics of adaptive window for a given stimulus of a particular frequency. The red square shows the arrival of a stimulus and the window determines the adaptive interval within which previous stimuli can have adaptive effects on the response. Similarly, the stimulus in question can affect responses in a similar window that follows it.

The above values of ζ for the four conditions studied by Garrido et al. (2013) shows that the predictions are consistent with the experimental results: in the cases where ζ≪1, which correlates with large N_a, Garrido et al. (2013) found $D$ -like (unadapted) responses, whereas for ζ ≳ 1, the responses were more $S$ -like, owing to greater adaptation. These results thus accord with our expectation that there should be little adaptation if there have been few or no stimuli within the window shown in Figure 6. However, the 1/3 octave bandwidth used in defining N_a is much larger than the physical width, so the correlation is weakened because many data points with moderate N_a do not involve significant adaptation, but were included along with strongly adapted cases in the experimental averages.

The above results suggest a more efficient use of data, and a streamlined experimental procedure, in ER experiments on random frequency stimuli. Instead of using separate fixed-frequency probes of the responses to random stimuli, the random stimuli can be used to probe one another. In this case, the responses would simply be binned and averaged according to the value of ζ, and N_a would not be used; one could even use the actual value of the number of stimuli in the window shown in Figure 6, with a weight function to smooth the edges of the window. Moreover, to improve statistics toward the edges of the frequency distribution, a uniform distribution in frequency or its logarithm, rather than a Gaussian, could be employed.

4 Summary and conclusion

In this study, we have generalized our previous theory of evoked responses with adaptation to allow for frequency-dependent responses, to obtain criteria for when significant adaptation occurs, and to determine whether adaptation suffices to reproduce standard (adapted), deviant (unadapted), and intermediate responses. The results have been applied to explain the response dynamics seen in experiments in which standard and deviant tones differ only in frequency, and in which random-frequency tones are presented. The main results are as follows:

(i) Extension to frequency-dependent adaptation was achieved by allowing for the intrinsic spread in frequency of a tone burst due to its finite duration plus the known spread due to the divergence in projections to the auditory cortex via the tonotopic map. These effects mean that adaptation at the nominal tonotopic location of a given frequency is affected by stimuli at nearby frequencies.

(ii) Stimuli cause adaptation at neighboring frequencies and subsequent times, causing later stimuli in the affected zone to produce evoked responses more like standards than deviants. The quantities ρ and ζ defined in Equations (24) and (25) can be used to quantify the affected frequency–time range: when numerous stimuli are received, significant adaptation occurs for ρ ≲ 1 and ζ ≳ 1. Typically, these correspond to adaptive effects from one stimulus affecting the responses to other stimuli within a few Hz and ~5 s.

(iii) The main gain changes tended to increase cortical inhibition and reduce positive corticothalamic feedback, while maintaining overall mean brain activity levels by increasing the gain where external stimuli enter the thalamus. However, positive feedback via the corticothalamic loop was significantly reduced, leading to lower amplitude ~10 Hz oscillations in the adapted (standard) response than in the initial (deviant) one. These results were consistent with our previous study (Robinson et al., 2021). Good matches to both standard and deviant responses, and during adaptation driven by a sequence of stimuli, were obtained using a single set of parameters.

(iv) The results were found to be consistent with experimental results for oddball sequences in which the deviant stimuli differed only in their frequency relative to the standards (Sams et al., 1985).

(v) In the case of random-frequency stimulation (Garrido et al., 2013), the criteria mentioned in (ii) were found to be consistent with the experimental results. Specifically, significant adaptation occurs if expected number of stimuli within the adaptation time–frequency window exceeds about 1, as expressed by Equation (43). By using the present criteria and binning according to the number of prior stimuli in the window shown in Figure 6, every stimulus can be used as a probe of the adaptive effects due to prior stimuli at nearby frequencies and times, rather than having to rely on probe stimuli at specific frequencies. This approach would make fuller use of such data, thereby enabling shorter experimental protocols.

Overall, these results significantly extend the range of experiments on evoked response sequences that can be explained by adaptive effects in sensory cortex within a neural field theory framework, showing that many mismatch negativity findings can be explained by adaptation at relevant points in the tonotopic map, so long as adaptation exists and notwithstanding some debate as to its exact mechanisms (Ruusuvirta, 2021). Future study could usefully apply similar methods to investigate deviant stimuli that differ only in intensity or duration, sequences of descending tones in which one tone is repeated, or more abstract deviance rules. Such analyses will help to distinguish local adaptive effects from those of top-down feedbacks from higher cortical areas—an essential contribution toward probing the levels at which different aspects of stimuli are processed.

Data availability statement

Information for existing publicly accessible datasets is contained within the article.

Author contributions

TB-J: Conceptualization, Formal analysis, Investigation, Methodology, Software, Visualization, Writing - original draft, Writing - review & editing. NG: Formal analysis, Investigation, Methodology, Software, Visualization, Writing - original draft, Writing - review & editing. AM: Formal analysis, Investigation, Methodology, Software, Writing - original draft, Writing - review & editing. PR: Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Writing - original draft, Writing - review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was supported by the Australian Research Council under Center of Excellence grant CE140100007 and Laureate Fellowship grant FL140100025.

Acknowledgments

The authors thank M. Garrido for stimulating discussions.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Abeysuriya, R. G., Rennie, C. J., and Robinson, P. A. (2015). Physiologically based arousal state estimation and dynamics. J. Neurosci. Methods 253, 55–69. doi: 10.1016/j.jneumeth.2015.06.002

PubMed Abstract | Crossref Full Text | Google Scholar

Atienza, M., Cantero, J. L., and Escera, C. (2001). Auditory information processing during human sleep as revealed by event-related brain potentials. Clin. Neurophysiol. 112, 2031–2045. doi: 10.1016/S1388-2457(01)00650-2

PubMed Abstract | Crossref Full Text | Google Scholar

Babaie-Janvier, T., and Robinson, P. A. (2018). Neural field theory of corticothalamic prediction with control systems analysis. Front. Hum. Neurosci. 12:334. doi: 10.3389/fnhum.2018.00334

PubMed Abstract | Crossref Full Text | Google Scholar

Babaie-Janvier, T., and Robinson, P. A. (2019). Neural field theory of corticothalamic attention with control systems analysis. Front. Neurosci. 13:1240. doi: 10.3389/fnins.2019.01240

Crossref Full Text | Google Scholar

Babaie-Janvier, T., and Robinson, P. A. (2020). Neural field theory of evoked response potentials with attentional gain dynamics. Front. Hum. Neurosci. 14:293. doi: 10.3389/fnhum.2020.00293

PubMed Abstract | Crossref Full Text | Google Scholar

Bracewell, R. N. (1986). The Fourier Transform and Its Applications. New York, NY: McGraw-Hill.

Google Scholar

Cowan, N. (1984). On short and long auditory stores. Psychol. Bull. 96:341. doi: 10.1037/0033-2909.96.2.341

Crossref Full Text | Google Scholar

Garrido, M. I., Kilner, J. M., Kiebel, S. J., Stephan, K. E., Baldeweg, T., and Friston, K. J. (2009a). Repetition suppression and plasticity in the human brain. NeuroImage 48, 269–279. doi: 10.1016/j.neuroimage.2009.06.034

PubMed Abstract | Crossref Full Text | Google Scholar

Garrido, M. I., Kilner, J. M., Stephan, K. E., and Friston, K. J. (2009b). The mismatch negativity: a review of underlying mechanisms. Clin. Neurophysiol. 120, 453–463. doi: 10.1016/j.clinph.2008.11.029

PubMed Abstract | Crossref Full Text | Google Scholar

Garrido, M. I., Sahani, M., and Dolan, R. J. (2013). Outlier responses reflect sensitivity to statistical structure in the human brain. PLoS Comput. Biol. 9:e1002999. doi: 10.1371/journal.pcbi.1002999

PubMed Abstract | Crossref Full Text | Google Scholar

Herdener, M., Esposito, F., Scheffler, K., Schneider, P., Logothetis, N. K., Uludag, K., et al. (2013). Spatial representations of temporal and spectral sound cues in human auditory cortex. Cortex 49, 2822–2833. doi: 10.1016/j.cortex.2013.04.003

PubMed Abstract | Crossref Full Text | Google Scholar

Hillyard, S. A., and Anllo-Vento, L. (1998). Event-related brain potentials in the study of visual selective attention. Proc. Natl. Acad. Sci. U.S.A. 95, 781–787. doi: 10.1073/pnas.95.3.781

PubMed Abstract | Crossref Full Text | Google Scholar

Hillyard, S. A., Vogel, E. K., and Luck, S. J. (1998). Sensory gain control (amplification) as a mechanism of selective attention: electrophysiological and neuroimaging evidence. Philos. Trans. R. Soc. B 353, 1257–1270. doi: 10.1098/rstb.1998.0281

PubMed Abstract | Crossref Full Text | Google Scholar

Jääskeläinen, I. P., Ahveninen, J., Bonmassar, G., Dale, A. M., Ilmoniemi, R. J., Levänen, S., et al. (2004). Human posterior auditory cortex gates novel sounds to consciousness. Proc. Natl. Acad. Sci. U.S.A. 101, 6809–6814. doi: 10.1073/pnas.0303760101

PubMed Abstract | Crossref Full Text | Google Scholar

Jirsa, V. K., and Haken, H. (1996). Field theory of electromagnetic brain activity. Phys. Rev. Lett. 77, 960–963. doi: 10.1103/PhysRevLett.77.960

PubMed Abstract | Crossref Full Text | Google Scholar

Kerr, C. C., Rennie, C. J., and Robinson, P. A. (2008). Physiology-based modeling of cortical auditory evoked potentials. Biol. Cybern. 98, 171–184. doi: 10.1007/s00422-007-0201-1

PubMed Abstract | Crossref Full Text | Google Scholar

Kerr, C. C., Rennie, C. J., and Robinson, P. A. (2009). Deconvolution analysis of target evoked potentials. J. Neurosci. Methods 179, 101–110. doi: 10.1016/j.jneumeth.2009.01.003

PubMed Abstract | Crossref Full Text | Google Scholar

Kerr, C. C., Rennie, C. J., and Robinson, P. A. (2011). Model-based analysis and quantification of age trends in auditory evoked potentials. Clin. Neurophysiol. 122, 134–147. doi: 10.1016/j.clinph.2010.05.030

PubMed Abstract | Crossref Full Text | Google Scholar

Koch, C. (1999). Biophysics of Computation. Oxford: Oxford University Press.

Google Scholar

Kraus, N., McGee, T., Carrell, T. D., and Sharma, A. (1995). Neurophysiologic bases of speech discrimination. Ear Hear. 16, 19–37. doi: 10.1097/00003446-199502000-00003

PubMed Abstract | Crossref Full Text | Google Scholar

Kraus, N., McGee, T. J., Carrell, T. D., Zecker, S. G., Nicol, T. G., and Koch, D. B. (1996). Auditory neurophysiologic responses and discrimination deficits in children with learning problems. Science 273, 971–973. doi: 10.1126/science.273.5277.971

PubMed Abstract | Crossref Full Text | Google Scholar

Loveless, N., Levänen, S., Jousmäki, V., Sams, M., and Hari, R. (1996). Temporal integration in auditory sensory memory: neuromagnetic evidence. Electroencephalogr. Clin. Neurophysiol. 100, 220–228. doi: 10.1016/0168-5597(95)00271-5

PubMed Abstract | Crossref Full Text | Google Scholar

Luck, S. J. (2014). An Introduction to the Event-Related Potential Technique. Cambridge, MA: MIT Press.

Google Scholar

Luck, S. J., and Kappenman, E. S. (2011). The Oxford Handbook of Event-Related Potential Components. New York, NY: Oxford University Press. doi: 10.1093/oxfordhb/9780195374148.001.0001

Crossref Full Text | Google Scholar

Mukta, K. N., Robinson, P. A., Pagès, J. C., Gabay, N. C., and Gao, X. (2020). Evoked response activity eigenmode analysis in a convoluted cortex via neural field theory. Phys. Rev. E 102:062303. doi: 10.1103/PhysRevE.102.062303

PubMed Abstract | Crossref Full Text | Google Scholar

Näätänen, R. (2003). Mismatch negativity: clinical research and possible applications. Int. J. Psychophysiol. 48, 179–188. doi: 10.1016/S0167-8760(03)00053-9

Crossref Full Text | Google Scholar

Näätänen, R., and Alho, K. (1997). Mismatch negativity-the measure for central sound representation accuracy. Audiol. Neurotol. 2, 341–353. doi: 10.1159/000259255

PubMed Abstract | Crossref Full Text | Google Scholar

Näätänen, R., Astikainen, P., Ruusuvirta, T., and Huotilainen, M. (2010). Automatic auditory intelligence: an expression of the sensory-cognitive core of cognitive processes. Brain Res. Rev. 64, 123–136. doi: 10.1016/j.brainresrev.2010.03.001

PubMed Abstract | Crossref Full Text | Google Scholar

Näätänen, R., Gaillard, A. W., and Mäntysalo, S. (1978). Early selective-attention effect on evoked potential reinterpreted. Acta Psychol. 42, 313–329. doi: 10.1016/0001-6918(78)90006-9

PubMed Abstract | Crossref Full Text | Google Scholar

Näätänen, R., Jacobsen, T., and Winkler, I. (2005). Memory-based or afferent processes in mismatch negativity (MMN): a review of the evidence. Psychophysiology 42, 25–32. doi: 10.1111/j.1469-8986.2005.00256.x

PubMed Abstract | Crossref Full Text | Google Scholar

Näätänen, R., Paavilainen, P., and Reinikainen, K. (1989). Do event-related potentials to infrequent decrements in duration of auditory stimuli demonstrate a memory trace in man? Neurosci. Lett. 107, 347–352. doi: 10.1016/0304-3940(89)90844-6

PubMed Abstract | Crossref Full Text | Google Scholar

Näätänen, R., Paavilainen, P., Rinne, T., and Alho, K. (2007). The mismatch negativity (MMN) in basic research of central auditory processing: a review. Clin. Neurophysiol. 118, 2544–2590. doi: 10.1016/j.clinph.2007.04.026

Crossref Full Text | Google Scholar

Näätänen, R., Paavilainen, P., Titinen, H., Jiang, D., and Alho, K. (1993). Attention and mismatch negativity. Psychophysiology 30, 436–450. doi: 10.1111/j.1469-8986.1993.tb02067.x

Crossref Full Text | Google Scholar

Niedermeyer, E., and Lopes da Silva, F. H. (2011). Electroencephalography: Basic Principles, Clinical Applications, and Related Fields. Baltimore, MD: Lippincott Williams & Wilkins.

Google Scholar

Nunez, P. L., and Cutillo, B. A. (1995). Neocortical Dynamics and Human EEG Rhythms. Oxford: Oxford University Press.

Google Scholar

Nunez, P. L., and Srinivasan, R. (2006). Electric Fields of the Brain: The Neurophysics of EEG. Oxford: Oxford University Press. doi: 10.1093/acprof:oso/9780195050387.001.0001

Crossref Full Text | Google Scholar

O'Connor, S. C., and Robinson, P. A. (2004). Spatially uniform and nonuniform analyses of electroencephalographic dynamics, with application to the topography of the alpha rhythm. Phys. Rev. E 70:011911. doi: 10.1103/PhysRevE.70.011911

PubMed Abstract | Crossref Full Text | Google Scholar

Pickles, J. (2013). An Introduction to the Physiology of Hearing. Leiden: Brill.

Google Scholar

Rennie, C. J., Robinson, P. A., and Wright, J. J. (1999). Effects of local feedback on dispersion of electrical waves in the cerebral cortex. Phys. Rev. E 59, 3320–3329. doi: 10.1103/PhysRevE.59.3320

Crossref Full Text | Google Scholar

Rennie, C. J., Robinson, P. A., and Wright, J. J. (2002). Unified neurophysical model of EEG spectra and evoked potentials. Biol. Cybern. 86, 457–471. doi: 10.1007/s00422-002-0310-9

PubMed Abstract | Crossref Full Text | Google Scholar

Rennie, C. J., Wright, J. J., and Robinson, P. A. (2000). Mechanisms of cortical electrical activity and emergence of gamma rhythm. J. Theor. Biol. 205, 17–35. doi: 10.1006/jtbi.2000.2040

PubMed Abstract | Crossref Full Text | Google Scholar

Roberts, J. A., and Robinson, P. A. (2012). Quantitative theory of driven nonlinear brain dynamics. NeuroImage 62, 1947–1955. doi: 10.1016/j.neuroimage.2012.05.054

PubMed Abstract | Crossref Full Text | Google Scholar

Robinson, P. A., Gabay, N. C., and Babaie-Janvier, T. (2021). Neural field theory of evoked response sequences and mismatch negativity with adaptation. Front. Hum. Neurosci. 15:655505. doi: 10.3389/fnhum.2021.655505

PubMed Abstract | Crossref Full Text | Google Scholar

Robinson, P. A., Rennie, C. J., and Rowe, D. L. (2002). Dynamics of large-scale brain activity in normal arousal states and epileptic seizures. Phys. Rev. E 65:041924. doi: 10.1103/PhysRevE.65.041924

PubMed Abstract | Crossref Full Text | Google Scholar

Robinson, P. A., Rennie, C. J., Rowe, D. L., and O'Connor, S. C. (2004). Estimation of multiscale neurophysiologic parameters by electroencephalographic means. Hum. Brain Mapp. 23, 53–72. doi: 10.1002/hbm.20032

PubMed Abstract | Crossref Full Text | Google Scholar

Robinson, P. A., Rennie, C. J., Rowe, D. L., O'Connor, S. C., and Gordon, E. (2005). Multiscale brain modelling. Philos. Trans. R. Soc. Lond. B Biol. Sci. 360, 1043–1050. doi: 10.1098/rstb.2005.1638

Crossref Full Text | Google Scholar

Robinson, P. A., Rennie, C. J., and Wright, J. J. (1997). Propagation and stability of waves of electrical activity in the cerebral cortex. Phys. Rev. E 56, 826–840. doi: 10.1103/PhysRevE.56.826

Crossref Full Text | Google Scholar

Robinson, P. A., and Roy, N. (2015). Neural field theory of nonlinear wave-wave and wave-neuron processes. Phys. Rev. E 91:062719. doi: 10.1103/PhysRevE.91.062719

PubMed Abstract | Crossref Full Text | Google Scholar

Ruusuvirta, T. (2021). The release from refractoriness hypothesis of N1 of event-related potentials needs reassessment. Hear. Res. 399:107923. doi: 10.1016/j.heares.2020.107923

PubMed Abstract | Crossref Full Text | Google Scholar

Saenz, M., and Langers, D. R. (2014). Tonotopic mapping of human auditory cortex. Hear. Res. 307, 42–52. doi: 10.1016/j.heares.2013.07.016

PubMed Abstract | Crossref Full Text | Google Scholar

Salisbury, D. F. (2012). Finding the missing stimulus mismatch negativity (MMN): Emitted MMN to violations of an auditory gestalt. Psychophysiology 49, 544–548. doi: 10.1111/j.1469-8986.2011.01336.x

PubMed Abstract | Crossref Full Text | Google Scholar

Sams, M., Alho, K., and Näätänen, R. (1984). Short-term habituation and dishabituation of the mismatch negativity of the ERP. Psychophysiology 21, 434–441. doi: 10.1111/j.1469-8986.1984.tb00223.x

PubMed Abstract | Crossref Full Text | Google Scholar

Sams, M., Hämäläinen, M., Antervo, A., Kaukoranta, E., Reinikainen, K., and Hari, R. (1985). Cerebral neuromagnetic responses evoked by short auditory stimuli. Electroencephalogr. Clin. Neurophysiol. 61, 254–266. doi: 10.1016/0013-4694(85)91092-2

PubMed Abstract | Crossref Full Text | Google Scholar

Schröger, E. (1998). Measurement and interpretation of the mismatch negativity. Behav. Res. Methods Instrum. Comput. 30, 131–145. doi: 10.3758/BF03209423

Crossref Full Text | Google Scholar

Sussman, E. S., Chen, S., Sussman-Fort, J., and Dinces, E. (2014). The five myths of MMN: redefining how to use MMN in basic and clinical research. Brain Topogr. 27, 553–564. doi: 10.1007/s10548-013-0326-6

PubMed Abstract | Crossref Full Text | Google Scholar

Talavage, T. M., Sereno, M. I., Melcher, J. R., Ledden, P. J., Rosen, B. R., and Dale, A. M. (2004). Tonotopic organization in human auditory cortex revealed by progressions of frequency sensitivity. J. Neurophysiol. 91, 1282–1296. doi: 10.1152/jn.01125.2002

PubMed Abstract | Crossref Full Text | Google Scholar

Tervaniemi, M., Ilvonen, T., Karma, K., Alho, K., and Näätänen, R. (1997). The musical brain: brain waves reveal the neurophysiological basis of musicality in human subjects. Neurosci. Lett. 226, 1–4. doi: 10.1016/S0304-3940(97)00217-6

PubMed Abstract | Crossref Full Text | Google Scholar

Tiitinen, H., Alho, K., Huotilainen, M., Ilmoniemi, R. J., Simola, J., and Näätänen, R. (1993). Tonotopic auditory cortex and the magnetoencephalographic (MEG) equivalent of the mismatch negativity. Psychophysiology 30, 537–540. doi: 10.1111/j.1469-8986.1993.tb02078.x

PubMed Abstract | Crossref Full Text | Google Scholar

van Albada, S. J., Kerr, C. C., Chiang, A. K. I., Rennie, C. J., and Robinson, P. A. (2010). Neurophysiological changes with age probed by inverse modeling of EEG spectra. Clin. Neurophysiol. 121, 21–38. doi: 10.1016/j.clinph.2009.09.021

PubMed Abstract | Crossref Full Text | Google Scholar

Winkler, I., Reinikainen, K., and Näätänen, R. (1993). Event-related brain potentials reflect traces of echoic memory in humans. Percept. Psychophys. 53, 443–449. doi: 10.3758/BF03206788

PubMed Abstract | Crossref Full Text | Google Scholar

Yabe, H., Tervaniemi, M., Reinikainen, K., and Näätänen, R. (1997). Temporal window of integration revealed by MMN to sound omission. Neuroreport 8, 1971–1974. doi: 10.1097/00001756-199705260-00035

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: evoked responses, mismatch negativity, neural field theory, adaptation, oddball paradigm, stimulus discriminability

Citation: Babaie-Janvier T, Gabay NC, McInnes A and Robinson PA (2024) Neural field theory of adaptive effects on auditory evoked responses and mismatch negativity in multifrequency stimulus sequences. Front. Hum. Neurosci. 17:1282924. doi: 10.3389/fnhum.2023.1282924

Received: 25 August 2023; Accepted: 27 October 2023;
Published: 03 January 2024.

Edited by:

Changming Wang, Capital Medical University, China

Reviewed by:

Gerald Cooray, Karolinska Institutet (KI), Sweden
Timo Ruusuvirta, University of Turku, Finland

Copyright © 2024 Babaie-Janvier, Gabay, McInnes and Robinson. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Peter A. Robinson, cGV0ZXIucm9iaW5zb25Ac3lkbmV5LmVkdS5hdQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Neural field theory of adaptive effects on auditory evoked responses and mismatch negativity in multifrequency stimulus sequences

1 Introduction

2 Materials and methods

2.1 NFT of corticothalamic evoked responses

2.2 Stimulus profile at auditory cortex

2.2.1 Tone-burst stimulus

2.2.2 Transfer to the auditory cortex via the tonotopic map

2.2.3 Criteria for significant adaptive effects

2.3 Measured ER

2.3.1 ER: first stimulus

2.3.2 ER: subsequent stimuli

3 Results

3.1 Oddball sequence

3.2 Random-frequency sequence with probe stimuli

4 Summary and conclusion

Data availability statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Publisher's note

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good