A Phenomenological Model of the Electrically Stimulated Auditory Nerve Fiber: Temporal and Biphasic Response Properties

Horne, Colin D. F.; Sumner, Christian J.; Seeber, Bernhard U.

doi:10.3389/fncom.2016.00008

ORIGINAL RESEARCH article

Front. Comput. Neurosci., 08 February 2016

Volume 10 - 2016 | https://doi.org/10.3389/fncom.2016.00008

Colin D. F. Horne¹

Christian J. Sumner¹*

Bernhard U. Seeber²

¹Medical Research Council Institute of Hearing Research, University Park, Nottingham, UK
²Audio Information Processing, Department of Electrical and Computer Engineering, Technische Universität München, Munich, Germany

We present a phenomenological model of electrically stimulated auditory nerve fibers (ANFs). The model reproduces the probabilistic and temporal properties of the ANF response to both monophasic and biphasic stimuli presented in isolation. The main contribution of the model lies in its ability to reproduce statistics of the ANF response (mean latency, jitter, and firing probability) under both monophasic and cathodic-anodic biphasic stimulation, without changing the model's parameters. The response statistics of the model depend on stimulus level and duration of the stimulating pulse, reproducing trends observed in the ANF. In the case of biphasic stimulation, the model reproduces the effects of pseudomonophasic pulse shapes and also the dependence on the interphase gap (IPG) of the stimulus pulse, an effect that is quantitatively reproduced. The model is fitted to ANF data using a procedure that uniquely determines each model parameter. It is thus possible to rapidly parameterize a large population of neurons to reproduce a given set of response statistic distributions. Our work extends the stochastic leaky integrate and fire (SLIF) neuron, a well-studied phenomenological model of the electrically stimulated neuron. We extend the SLIF neuron so as to produce a realistic latency distribution by delaying the moment of spiking. During this delay, spiking may be abolished by anodic current. By this means, the probability of the model neuron responding to a stimulus is reduced when a trailing phase of opposite polarity is introduced. By introducing a minimum wait period that must elapse before a spike may be emitted, the model is able to reproduce the differences in the threshold level observed in the ANF for monophasic and biphasic stimuli. Thus, the ANF responses to a large variety of pulse shapes are reproduced correctly by this model.

Introduction

Cochlear implants restore the perception of sound to deafened individuals. The speech processor maps acoustic waveforms to trains of electrical pulses at each electrode of an array inserted in the cochlea, which directly stimulate auditory nerve fibers (ANFs). Cochlear implantees often achieve high levels of speech understanding in quiet single-talker situations. However, they are significantly disadvantaged compared with normal hearing and even moderately hearing impaired listeners in complex and noisy acoustic environments (e.g., Cullington and Zeng, 2008; Wilson and Dorman, 2008; Kerber and Seeber, 2012).

To restore hearing, the cochlear implant must convey sufficient information about the acoustic scene to the central auditory system. It is less clear how much information would be required to restore “normal” levels of functionality and how to encode information optimally given the limitations of the electrode-nerve interface. It is clear that only some of the normal acoustic “cues” are available with contemporary cochlear implants. In the predominant coding strategy, individual electrodes carry discrete current pulses at a fixed rate, and the amplitude of the current is modulated according to the extracted envelope of sound in a fixed frequency range (continuous interleaved sampling strategy, CIS; Wilson et al., 1991). Fine temporal information is removed by the speech processor and not coded in pulse timings. This information is known to be important for the perception of pitch and sound localization, both of which strongly influence the process of forming discrete perceptual acoustic objects from a mixture in normal hearing.

How the information should be encoded, given the limitations of the devices, is a difficult question. One potential method for manipulating ANF responses is via the shapes of electrical pulses. When stimulated with an electrical current pulse, the ANF may generate an action potential after a stochastic delay. The shape of the stimulating pulse affects the probability of the ANF generating an action potential in response to it, and the temporal distribution of the action potential, if generated. Further, temporally-separated pulses may interact within short time windows, blurring the distinction between individual pulses. Current implants use only biphasic pulses with identically shaped phases. Transforming a cathodic monophasic pulse into a cathodic-anodic biphasic pulse by introducing a trailing, anodic phase is necessary to achieve charge balance, a prerequisite for long-term use in patients. The additional anodic phase decreases the probability of the stimulus evoking an action potential in the ANF, but less so if a delay, or interphase gap (IPG), is introduced between the two opposite-polarity phases (Shepherd and Javel, 1999). The requirement for charge balance can be met by a wide range of pulse shapes.

Future stimulation strategies might manipulate pulse-shape to improve information transmission or to reduce power consumption. For example, pseudomonophasic pulses with a short cathodic stimulating phase followed by a longer anodic phase of lower, charge-balanced amplitude are more efficient for stimulation and yield a larger dynamic range than biphasic pulses (Macherey et al., 2006), phase duration and interphase-gap have a pronounced effect on loudness (Carlyon et al., 2005), and the polarity order of multiphasic pulses can alter perceived pitch (van Wieringen et al., 2008; Carlyon et al., 2013). Further, strategies with novel pulse shapes have the potential to control the negative impact of current spread, e.g., by varying interphase gap, phase duration and the relative amplitude of the second phase, thereby changing firing probability of neurons in a larger region around the electrode. Moreover, when attempting to code fine temporal information, e.g., binaural cues needed for sound localization, the exact timing of pulses in the auditory nerve becomes crucial.

It is hard to evaluate the effectiveness of a stimulation strategy. It would be useful to observe the responses of the individual fibers of the stimulated auditory nerve. However, recording from single nerve fibers is an invasive procedure that is not possible in patients. Measures like neural response telemetry cannot be used with regular stimulation strategies and auditory brainstem responses cannot give insight into the responses of individual nerve fibers. Computational neural models can help predict the neural response when stimulating with changing pulse patterns and shapes, and hence help with the development of future stimulation strategies. Models could be used to find a stimulation pattern for which the neural response matches a target response as closely as possible, or to maximize information transmission. To this end, we have developed a model that simulates the auditory nerve fiber response to an electrical stimulus, which is sensitive to pulse shape parameters.

To be useful for developing stimulation strategies that manipulate pulse shape, a model must be capable of realistically responding to a stimulus pulse of complex shape, with varying phase durations and interphase gaps. One method by which to achieve this is to directly model the biophysics of the neuron. Biophysical models have been developed which are successful in reproducing the response characteristics of the ANF (e.g., Rubinstein, 1995; Cartee, 2000; Rattay et al., 2001; Negm and Bruce, 2008; Woo et al., 2010). However, while they have previously been used to study the responses of large populations of ANFs (e.g., Imennov and Rubinstein, 2009), they are difficult to use: the parameter-space of a biophysical model is vast and the individual parameters affect the response of the neuron in complex ways. There has been no procedure published for systematically parameterizing a biophysical model to reproduce a desired set of response statistics.

Phenomenological models provide an alternative to biophysical models. Phenomenological models reproduce only the statistics of the response, without explicitly modeling the biophysics of the ANF. By doing so, the parameter-space is reduced and it is possible to directly and independently control individual response characteristics via the model parameters. A variety of phenomenological models have been developed to reproduce the responses to sensory inputs or synaptic input (e.g., McGregor, 1987; Gerstner and Kistler, 2002; Izhikevich, 2003). They rely on the fact that many of the complexities of neuronal behavior, such as spike generation, are stereotypical. Perhaps the most commonly used model is the leaky-integrate-and-fire (LIF) model (for a review see Gerstner and Kistler, 2002). It applies linear subthreshold filtering to the inputs, uses a fixed spike threshold, and dispenses with all the dynamics of spike generation.

Phenomenological models have previously been used to model the electrically stimulated ANF (e.g., Bruce et al., 1999; Hamacher, 2004; Carlyon et al., 2005; Chen and Zhang, 2007; Macherey et al., 2007; Cohen, 2009a,b,c,d; Chen, 2012; Goldwyn et al., 2012). The required constraints on these models are different to those of other domains. Whereas models with deterministic intrinsic properties (e.g., Rothman and Manis, 2003; Laudanski et al., 2010) are adequate to explain the responses to intracellular current injection or synaptic input in the auditory brainstem, and many other central neurons, modeling ANF responses to electrical stimulation requires a stochastic model. Models that incorporate noise into the firing threshold (Bruce et al., 1999; Gerstner and Kistler, 2002) allow for realistic firing probabilities for some stimulation protocols. However, the latency of firing does not emerge naturally in these models, which requires still further sources of stochasticity (Hamacher, 2004), and neither does the sensitivity to pulse shape.

The focus of this study has been to develop a model capable of reproducing the statistics of the ANF's response to both monophasic and biphasic stimuli of arbitrary amplitude, phase duration and interphase gaps. The model presented is the first phenomenological model to respond directly to a range of current pulse shapes and reproduce the effect that an immediate or delayed trailing, anodic phase has on the probability of a cathodic stimulus evoking an action potential in the ANF. The model is computationally efficient and easily parameterized, making it suitable for simulating the response of a large population of fibers.

Our model is based on the stochastic leaky integrate and fire (SLIF) neuron, a well-studied phenomenological model of the electrically stimulated neuron. The SLIF neuron discretizes the action potential as a single moment of spiking. In our model, the membrane potential of the ANF is modeled by processing the stimulus current with a leaky integrator. As in the SLIF neuron, excitation occurs when the membrane potential exceeds a stochastic threshold. Unlike the SLIF neuron, we add a delay between the moment at which the membrane potential exceeds the threshold and the moment at which the resulting spike is emitted. This emulates the delay in the generation of the action potential that is present in the ANF. Further, inspired by empirical observations, we allow the spike to be canceled if sufficient anodic stimulation occurs before the spike is emitted. By doing so, we are able to reproduce the effect of the interphase gap on the probability of a cathodic-anodic biphasic stimulus evoking an action potential in the ANF.

The description of our model is split into three sections. First, we introduce the existing SLIF neuron, describing its parameterization and summarizing its capabilities and limitations (Section Stochastic Leaky Integrate and Fire Neuron). We then extend the SLIF neuron to introduce a delay between the moment at which the membrane potential exceeds the threshold and the moment at which the resulting spike is elicited. This forms a self-contained model in itself, reproducing temporal properties of the ANF's response to a monophasic stimulus (Section Temporal Leaky Integrate and Fire Neuron). Finally, we further extend the model so that a spike may be canceled by anodic current, comparing its results against those from cat ANFs (Section Biphasic Leaky Integrate and Fire Neuron).

Stochastic Leaky Integrate and Fire Neuron

The stochastic leaky integrate-and-fire (SLIF) neuron provides a simple model of the electrically stimulated neuron. In the model, the neural membrane is considered to be a leaky integrator of current, with an associated membrane potential.

Model Description

The stimulus signal I(t) is processed by a leaky integrator to give V(t), which can be interpreted as the membrane potential of the model neuron (Abbott and Kepler, 1990; Gerstner, 1995). The stimulus signal and the membrane potential are related by the ordinary differential equation

\begin{matrix} τ \frac{d V}{d t} = - R I - V, & (1) \end{matrix}

where τ is the time constant of the neural membrane and R is its resistance, arbitrarily assumed to be 1 Ω. A spike is generated at the moment V(t) first exceeds a threshold value θ, an event that we refer to as threshold crossing. Throughout the paper, we use t₀ to denote the time of threshold crossing. In order to reproduce the stochastic properties of excitation, θ is a normally-distributed random variable with mean μ and standard deviation σ. Bruce et al. (1999) have demonstrated that this form of stochasticity provides excellent fits to the input-output functions of individual nerve fibers. Integrating Equation (1), we obtain an expression for the firing probability:

\begin{matrix} P_{S L I F} = Φ (\frac{- I [1 - e^{- d / τ}] - μ}{σ}) & (2) \end{matrix}

for a cathodic pulse of duration d, where Φ is the cumulative distribution function (CDF) of the Gaussian distribution. The model was implemented in Matlab, with leaky integration implemented via the filter function, with a sample rate of 1 MHz. Table 1 gives an overview of model parameters of the three models presented in this article and Table 2 summarizes all model variables.
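As a rough illustration of Equations (1) and (2), the following Python sketch integrates Equation (1) with a one-pole recursion at the 1 MHz sample rate mentioned above and compares the peak potential with the closed form used in Equation (2). It is not the authors' Matlab implementation: the function names are hypothetical, and the values of τ, pulse duration and amplitude are placeholders rather than values from Table 1.

```python
import math
from statistics import NormalDist

FS = 1_000_000  # sample rate in Hz (1 MHz, as stated in the text)

def leaky_integrate(current, tau, r=1.0, fs=FS):
    """Discretise tau*dV/dt = -R*I - V for a piecewise-constant input.

    Uses the exact one-pole update for a constant input over each sample.
    """
    a = math.exp(-1.0 / (fs * tau))  # per-sample decay factor
    v, out = 0.0, []
    for i_n in current:
        v = a * v + (1.0 - a) * (-r * i_n)
        out.append(v)
    return out

def p_slif(i_amp, d, tau, mu, sigma):
    """Equation (2): firing probability for a single pulse of amplitude
    i_amp (negative = cathodic) and duration d, with R = 1."""
    v_peak = -i_amp * (1.0 - math.exp(-d / tau))
    return NormalDist().cdf((v_peak - mu) / sigma)

# Illustrative values only (assumptions, not taken from Table 1).
TAU = 250e-6    # membrane time constant, s
D = 40e-6       # pulse duration, s
I_AMP = -1.0    # cathodic pulse amplitude, arbitrary units

v_trace = leaky_integrate([I_AMP] * round(D * FS), TAU)
v_closed_form = -I_AMP * (1.0 - math.exp(-D / TAU))
```

Under these assumptions the last sample of `v_trace` agrees with `v_closed_form`, which is the quantity compared against the stochastic threshold in Equation (2).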

Table 1. Full set of model parameters and their values.

Table 2. Overview of model variables.

Model Response Properties

The SLIF neuron has three parameters: μ, σ, and τ. In this section, we show how these can be uniquely determined to reproduce data from cat ANFs. We then outline the shortfalls of the SLIF neuron that will be addressed by the models presented in the remainder of the paper.

Excitation

As defined in this paper, the SLIF neuron is excited by negative, or cathodic, current. Positive, or anodic, current hyperpolarises the SLIF neuron, driving it further from excitation.

Input-Output Function

The input-output function of a neuron relates stimulus level to firing probability, for some stimulus pulse of fixed duration. It has been found that the input-output function of the ANF stimulated with a monophasic current pulse can be well approximated by the CDF of the Gaussian distribution (Dynes, 1996). The probability of a stimulus of current level l evoking an action potential is thus given by

\begin{matrix} Φ (\frac{l - m}{s}), & (3) \end{matrix}

where Φ is the Gaussian CDF and m and s are the mean and standard deviation of the input-output function, respectively. The mean corresponds to the threshold level of the neuron—the level at which the neuron responds to the stimulus with a probability of 0.5. The standard deviation is a measure of the width of the input-output function, and thus, the dynamic range of the neuron. It is convenient to quantify the dynamic range as the ratio of the standard deviation and the mean (Verveen, 1961), giving relative spread (RS):

\begin{matrix} R S = \frac{s}{m} . & (4) \end{matrix}

The input-output function of the SLIF neuron (2) has the same form as Equation (3). Thus, by equating (2) and (3) and since l = −I,

\begin{matrix} m = \frac{μ}{1 - \exp (- d / τ)} & (5) \end{matrix}

and

\begin{matrix} s = \frac{σ}{1 - \exp (- d / τ)} . & (6) \end{matrix}

Inverting these equations gives the values for the model parameters μ and σ that are needed for the SLIF neuron to reproduce the input-output function of an arbitrary ANF with threshold m and RS s/m. Increasing μ decreases the SLIF neuron's excitability and increasing σ increases its dynamic range. Figure 1A shows the input-output function of the SLIF neuron when parameterized to reproduce data for a cat ANF (Miller et al., 1999).
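The inversion of Equations (5) and (6) can be sketched in a few lines of Python. The function name and the example numbers are hypothetical; the only relations used are Equations (4) to (6).

```python
import math

def slif_params_from_io(m, rs, d, tau):
    """Invert Equations (5) and (6).

    m   : threshold level of the input-output function (mean)
    rs  : relative spread, RS = s / m (Equation 4)
    d   : pulse duration used when measuring the input-output function, s
    tau : membrane time constant, s
    Returns (mu, sigma) for the SLIF neuron.
    """
    gain = 1.0 - math.exp(-d / tau)  # common factor in Equations (5) and (6)
    mu = m * gain
    sigma = rs * m * gain            # s = RS * m
    return mu, sigma

# Example with placeholder values (assumptions, not measured data).
mu, sigma = slif_params_from_io(m=1.0, rs=0.05, d=40e-6, tau=250e-6)
```

Because the same factor multiplies both parameters, the ratio σ/μ returned by this sketch equals the requested RS for any pulse duration.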

Figure 1. The SLIF neuron may be parameterized to quantitatively reproduce input-output and strength-duration data. (A) The input-output function of the SLIF neuron (solid line) fitted to data (open circles) from a cat ANF (Miller et al., 1999). The stimulus is a monophasic pulse (40 μs duration) presented in isolation. (B) The monophasic strength-duration function of the SLIF neuron (solid line) fitted to data (open circles) from a cat ANF (van den Honert and Stypulkowski, 1984).

In the case of a monophasic stimulus, it has been hypothesized that the RS is a characteristic of the neuron and does not depend on stimulus duration (Verveen and Derksen, 1965). Like the real neuron, the RS of the SLIF neuron does not depend on stimulus duration.

Strength-Duration Function

The threshold level of a monophasic stimulus pulse depends on its duration, with greater durations incurring lower thresholds. The strength-duration function relates stimulus duration to threshold level and is often summarized by two measures: rheobase and chronaxie. As the stimulus duration increases, the threshold level reaches an asymptotic value—the rheobase. The stimulus duration that has a threshold level of twice the rheobase is the chronaxie. Measures of the chronaxie and strength-duration functions of cat ANFs were made by van den Honert and Stypulkowski (1984). They found that the threshold level I_thr, when measured in amperes, was well predicted by the equation

\begin{matrix} I_{t h r} = \frac{I_{0}}{1 - \exp (- k d)}, & (7) \end{matrix}

where d is the stimulus duration, in seconds, I₀ is the rheobase, in amperes, and log(2)/k is the chronaxie, in seconds (log denotes the natural logarithm). The form of Equation (7) is consistent with other studies of neurons (e.g., Lapicque, 1907; Dean and Lawrence, 1985). The strength-duration function of the SLIF neuron has the same form, with k = 1∕τ (Hill, 1936). Inverting the equation gives the model parameter τ in terms of the chronaxie, allowing the model to reproduce the chronaxie of an arbitrary ANF. Figure 1B shows the strength-duration function of the SLIF neuron when parameterized to reproduce data from a cat ANF (van den Honert and Stypulkowski, 1984).
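The relation between chronaxie and τ can be checked numerically. In the sketch below, the function names and the example chronaxie and rheobase are assumptions made for illustration; the formulas are Equation (7) with k = 1/τ.

```python
import math

def tau_from_chronaxie(chronaxie):
    """With k = 1/tau and chronaxie = ln(2)/k, tau = chronaxie / ln(2)."""
    return chronaxie / math.log(2.0)

def threshold_level(d, rheobase, tau):
    """Equation (7) with k = 1/tau: threshold level for pulse duration d."""
    return rheobase / (1.0 - math.exp(-d / tau))

# Placeholder values (assumptions, not fitted to the cat ANF data).
CHRONAXIE = 250e-6  # s
RHEOBASE = 1.0      # arbitrary units

tau = tau_from_chronaxie(CHRONAXIE)
level_at_chronaxie = threshold_level(CHRONAXIE, RHEOBASE, tau)
```

By construction, `level_at_chronaxie` is twice the rheobase, and the threshold approaches the rheobase as the duration grows.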

Temporal Response Properties

The latency of the ANF's response to a stimulus is defined as the delay between the onset of the stimulus and the observation of the action potential by the recording electrode. It is stimulus-dependent and stochastic in nature. The jitter of the ANF's response is defined as the standard deviation of the latency. Figure 2 plots mean latency (Figure 2A) and jitter (Figure 2B) for a cat ANF's response to a brief (40 μs) monophasic stimulus (Miller et al., 1999). Increasing stimulus level reduces both the mean latency and the jitter of the response. Also plotted are the mean latency and jitter of the SLIF neuron under identical conditions. The SLIF neuron lacks the extent of temporal stochasticity that is observed in the ANF (jitter at threshold level is 1 μs for the model and 112 μs for the ANF). Further, the mean latency is under-predicted by the SLIF neuron (latency at threshold level is 38 μs for the model and 681 μs for the ANF) and does not show the dependence on stimulus level that is seen in the ANF. It is not possible to parameterize the SLIF neuron to reproduce these temporal response properties whilst simultaneously maintaining the input-output and strength-duration functions that have already been fitted to data from cat ANFs. These failings of the SLIF neuron have been noted previously (Hamacher, 2004; Fredelake and Hohmann, 2012; Goldwyn et al., 2012) and are addressed by our extension to the SLIF neuron in Section Temporal Leaky Integrate and Fire Neuron.
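To see why the SLIF latency is bounded by the pulse duration, the sketch below samples the stochastic threshold and solves for the threshold-crossing time during a single cathodic pulse. It assumes that the SLIF latency equals the crossing time t₀ and uses l = −I for the (positive) stimulus level; the function names, parameter values and sampling scheme are illustrative assumptions, not the procedure used to produce Figure 2.

```python
import math
import random
import statistics

def crossing_time(level, theta, d, tau):
    """Time at which V(t) = level * (1 - exp(-t / tau)) reaches theta
    during a cathodic pulse of magnitude `level` (> 0) and duration d.
    Returns None if the threshold is not reached before the pulse ends."""
    if theta <= 0.0:
        return 0.0
    if theta >= level * (1.0 - math.exp(-d / tau)):
        return None
    return -tau * math.log(1.0 - theta / level)

def slif_latency_stats(level, d, tau, mu, sigma, n=20000, seed=0):
    """Monte Carlo estimate of (mean latency, jitter, firing probability)."""
    rng = random.Random(seed)
    times = [crossing_time(level, rng.gauss(mu, sigma), d, tau)
             for _ in range(n)]
    spikes = [t for t in times if t is not None]
    return statistics.mean(spikes), statistics.stdev(spikes), len(spikes) / n
```

Since a crossing can only occur while the cathodic current is flowing, every sampled latency lies between 0 and d, so both the mean latency and the jitter of this sketch stay below the pulse duration whatever the stimulus level.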

Figure 2. The SLIF neuron does not reproduce the temporal response statistics of the ANF or their dependence on stimulus level. (A) Mean latency, (B) Jitter of the responses to a monophasic stimulus (40 μs duration) for the SLIF neuron (solid lines) and a cat ANF (Miller et al., 1999; open circles). The stimulus levels span the dynamic range of the ANF. The cat ANF data used in Figure 1A and in this figure all come from the same ANF. Note the change in ordinate scale.

Biphasic Response Properties

The threshold level of a cathodic pulse is elevated by the inclusion of a trailing anodic phase, transforming it into a cathodic-anodic biphasic pulse (Gorman and Mortimer, 1983; Shepherd and Javel, 1999; Miller et al., 2001). As the IPG is increased, the threshold level tends toward that of the cathodic phase alone, reaching its asymptote after ~250 μs (Shepherd and Javel, 1999). The SLIF neuron is fundamentally unable to reproduce this increase in threshold level associated with cathodic-anodic biphasic stimulation. A threshold crossing, if one occurs, will always occur during the excitatory, cathodic current. If the threshold crossing occurs, then it cannot be undone by the trailing, anodic phase. If a threshold crossing does not occur during the leading cathodic phase, then it cannot occur during the trailing anodic phase. Thus, any trailing, anodic current present in a stimulus has no effect on the threshold level of that stimulus in the SLIF neuron.
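The argument above can be illustrated numerically: in a leaky integrator, the peak potential reached by a cathodic-anodic biphasic pulse equals that reached by the cathodic phase alone, whatever the IPG. The sketch below uses hypothetical helper names and placeholder parameter values, with durations expressed in samples at 1 MHz.

```python
import math

def membrane_trace(current, tau, fs=1_000_000, r=1.0):
    """One-pole discretisation of tau*dV/dt = -R*I - V."""
    a = math.exp(-1.0 / (fs * tau))
    v, out = 0.0, []
    for i_n in current:
        v = a * v + (1.0 - a) * (-r * i_n)
        out.append(v)
    return out

def monophasic_pulse(amp, phase_samples, tail_samples=200):
    """Cathodic (negative) pulse followed by silence."""
    return [-amp] * phase_samples + [0.0] * tail_samples

def biphasic_pulse(amp, phase_samples, ipg_samples, tail_samples=200):
    """Cathodic phase, interphase gap, anodic phase, then silence."""
    return ([-amp] * phase_samples + [0.0] * ipg_samples
            + [amp] * phase_samples + [0.0] * tail_samples)

TAU = 250e-6  # placeholder time constant, s (an assumption)
peak_mono = max(membrane_trace(monophasic_pulse(1.0, 40), TAU))
peaks_biphasic = {ipg: max(membrane_trace(biphasic_pulse(1.0, 40, ipg), TAU))
                  for ipg in (0, 10, 100)}
```

Because the SLIF neuron fires if and only if the peak potential exceeds θ, identical peaks imply identical firing probabilities, and hence identical thresholds, for the monophasic and biphasic stimuli.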

Summary

This section has introduced the SLIF neuron and shown that it may be analytically parameterized to reproduce the strength-duration and input-output functions of the ANF's response to a monophasic stimulus. The ease with which these important response statistics may be fitted to data makes the SLIF neuron an attractive candidate for modeling the response of the electrically stimulated ANF. However, we have also shown that the latency distribution of the SLIF neuron does not reproduce that of the ANF and that the SLIF neuron is unable to respond to a cathodic-anodic biphasic stimulus in a way that mimics the ANF.

Temporal Leaky Integrate and Fire Neuron

In this section, we extend the SLIF neuron to reproduce the temporal properties of the ANF's response to a monophasic stimulus. We do so by introducing a stochastic delay between the time of threshold crossing and the time of spiking. The delay has no effect on the probability of the neuron responding to a stimulus, which is unchanged from that of the SLIF neuron. As such, the input-output and strength-duration functions of the SLIF neuron are preserved. We refer to the resulting model as the temporal LIF (TLIF) neuron.

Model Assumptions

The TLIF neuron makes a number of assumptions regarding how the ANF responds to the stimulus. We introduce these assumptions here, prior to providing a description of the model.

Predicting the Latency Distribution from the Firing Probability

We assume that the latency distribution of the ANF's response to a stimulus is well predicted by the probability of the stimulus obtaining a response. Thus, any change in latency with the stimulus follows directly from the change in firing probability. We further assume that the latency distribution is well approximated by a Gaussian distribution.

The Action Potential Initiation Period

When a neuron is depolarized sufficiently to evoke an action potential, a delay occurs between the membrane being depolarized by the stimulus and the action potential being generated. During this delay, further stimulation can continue to affect the time at which the action potential is generated (van den Honert and Mortimer, 1979; Miller et al., 2001). We refer to this delay as the action potential initiation period. We assume that the duration of the action potential initiation period is stochastic and stimulus-dependent, with its variability equal to the variability of the spike timing that is observed by the recording electrode.