Behavioral Measures of Cochlear Gain Reduction Depend on Precursor Frequency, Bandwidth, and Level

DeRoy Milvae, Kristina; Strickland, Elizabeth A.

doi:10.3389/fnins.2021.716689

ORIGINAL RESEARCH article

Front. Neurosci., 04 October 2021

Sec. Auditory Cognitive Neuroscience

Volume 15 - 2021 | https://doi.org/10.3389/fnins.2021.716689

This article is part of the Research TopicDescending Control in the Auditory SystemView all 20 articles

Behavioral Measures of Cochlear Gain Reduction Depend on Precursor Frequency, Bandwidth, and Level

Kristina DeRoy Milvae^*†

Elizabeth A. Strickland

Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, IN, United States

Sensory systems adjust to the environment to maintain sensitivity to change. In the auditory system, the medial olivocochlear reflex (MOCR) is a known physiological mechanism capable of such adjustment. The MOCR provides efferent feedback between the brainstem and cochlea, reducing cochlear gain in response to sound. The perceptual effects of the MOCR are not well understood, such as how gain reduction depends on elicitor characteristics in human listeners. Physiological and behavioral data suggest that ipsilateral MOCR tuning is only slightly broader than it is for afferent fibers, and that the fibers feed back to the frequency region of the cochlea that stimulated them. However, some otoacoustic emission (OAE) data suggest that noise is a more effective elicitor than would be consistent with sharp tuning, and that a broad region of the cochlea may be involved in elicitation. If the elicitor is processed in a cochlear channel centered at the signal frequency, the growth of gain reduction with elicitor level would be expected to depend on the frequency content of the elicitor. In the current study, the effects of the frequency content and level of a preceding sound (called a precursor) on signal threshold was examined. The results show that signal threshold increased with increasing precursor level at a shallower slope for a tonal precursor at the signal frequency than for a tonal precursor nearly an octave below the signal frequency. A broadband noise was only slightly more effective than a tone at the signal frequency, with a relatively shallow slope similar to that of the tonal precursor at the signal frequency. Overall, these results suggest that the excitation at the signal cochlear place, regardless of elicitor frequency, determines the magnitude of ipsilateral cochlear gain reduction, and that it increases with elicitor level.

Introduction

An impressive feat that the human auditory system achieves is the ability to hear sounds that range from low to extremely high intensities. Most neurons in the auditory system respond sensitively to changes over a dynamic range of 30–40 dB, yet we are able to hear over a dynamic range of approximately 120 dB (Viemeister, 1988). This discrepancy between the dynamic range of nerve fibers and the dynamic range of hearing is referred to as the “dynamic range problem” (Evans, 1981; Viemeister, 1988). One way that the auditory system may overcome the dynamic range problem is by adapting its dynamic range based on the environment. Greater understanding of the adaptive nature of the auditory system has the potential to inform future treatments for hearing loss.

Efferent projections along the entire auditory pathway provide a possible means to adjust the dynamic range. A specific known physiological mechanism that is consistent with this function is the medial olivocochlear reflex (MOCR). The MOCR is an efferent pathway between the brainstem and cochlear outer hair cells that is elicited by sound and acts to decrease cochlear gain, with an onset delay of approximately 25 ms (James et al., 2005; Backus and Guinan, 2006). This gain reduction has been well documented physiologically in neural responses (Winslow and Sachs, 1987; Guinan and Gifford, 1988) and basilar membrane responses (Cooper and Guinan, 2003) in animal models, and in otoacoustic emission (OAE) responses (Backus and Guinan, 2006; Lilaonitkul and Guinan, 2009b) in humans. The MOCR is a bilateral reflex, with evidence suggesting that the ipsilateral pathway, where gain reduction is elicited by preceding sound in the same ear, may be stronger (Lilaonitkul and Guinan, 2012; but see Guinan et al., 2003). This makes the ipsilateral evoked response of interest and the focus of this paper.

The ipsilateral MOCR is elicited by preceding sound, but the frequency of the elicitor affects the magnitude of cochlear gain reduction. Neural measurements in cats have shown that olivocochlear bundle (OCB) fibers have tuning curves that are on average slightly broader than auditory nerve tuning curves and that the feedback loop is frequency-specific, such that preceding sound leads to larger reductions in gain near the cochlear place associated with that frequency (Liberman and Brown, 1986). Bonfils and Puel (1987) examined frequency selectivity of the MOCR by measuring forward masking of compound action potentials (CAPs), the synchronized response of the auditory nerve, in anesthetized guinea pigs to tone pips. These measurements were made with an intact and sectioned crossed (ipsilateral) OCB. Sectioning the crossed OCB caused a decrease in forward masking that occurred when the masker-onset to probe-onset was 40 ms, but not when that same duration was reduced to 30 ms. This suggests efferent contributions to forward masking that occur with a time delay between 30 and 40 ms. Functional tuning curves derived from the decrease in masking were relatively sharp (Q₁₀ of 6.6) and centered on the probe frequency, suggesting again that the ipsilateral pathway is elicited in a frequency-specific way and that tuning is similar to that of afferent fibers (Q₁₀ of 5–7.3; Bonfils et al., 1986). Similarly, tuning of ipsilateral MOCR effects is sharp when measured with stimulus frequency otoacoustic emissions (SFOAEs) in humans. In SFOAE measurements, the effects of preceding sound may be measured as the combined change in magnitude and phase of the SFOAE, or with magnitude and phase separated. It is not clear what measure is most relevant for the effects of the MOCR on perception. Tuning curves derived from ipsilateral elicitors, with magnitude and phase combined, showed sharp tuning for narrowband or tonal elicitors, with a tip near the probe frequency (Lilaonitkul and Guinan, 2009b). When magnitude and phase were separated, tuning for equal-input elicitors was sharp for magnitude, and more broadly distributed for phase (Lilaonitkul and Guinan, 2012).

In summary, both neural and SFOAE tuning data suggest that ipsilateral elicitation of the MOCR at a cochlear place is primarily driven by energy entering the auditory filter at that cochlear place. However, bandwidth effects have also been measured using SFOAEs that challenge this conclusion. MOCR effects increase with elicitor bandwidth and fixed overall level in a way not explained by additional excitation in the tails of the auditory filter, suggesting that there is integration of elicitation across almost the entire cochlea (Lilaonitkul and Guinan, 2009a) and that broadband noise stimuli are stronger elicitors of cochlear gain reduction than narrowband stimuli (e.g., Guinan, 2018). It is not clear if this bandwidth effect reflects a true difference between the MOCR in human and animal models, or if anesthesia or measurement techniques have led to these differences. Psychoacoustic methods provide an alternative approach to study decreases in cochlear gain in humans which may be due to the MOCR; behavioral measures could provide additional evidence for or against integration of elicitation with wider bandwidths.

Forward masking is a psychoacoustic method to explore cochlear gain reduction with eliciting preceding sound, called a precursor (Krull and Strickland, 2008; Jennings et al., 2009; Roverud and Strickland, 2014; Yasin et al., 2014; DeRoy Milvae and Strickland, 2018). Experimental design can be tailored to the time course of activation of the MOCR to estimate cochlear gain reduction with forward masking (e.g., Yasin et al., 2014; DeRoy Milvae and Strickland, 2018). With this approach (see example paradigm used in this experiment in Figure 1), the frequency content of the precursor can be varied to examine how frequency content of the elicitor affects gain reduction. Robust gain reduction has been measured with tonal (Jennings et al., 2009; Roverud and Strickland, 2014) and broadband noise (Yasin et al., 2014; DeRoy Milvae and Strickland, 2018) precursors, but comparisons have not yet been made within-subject to examine if the broadband noise precursors are more effective elicitors of cochlear gain reduction.

FIGURE 1

Figure 1. Schematic of the temporal masking paradigm used in this experiment, including a 50-ms precursor, 20-ms masker, and 6-ms signal. The precursor or masker is removed in some experiments, but the temporal relationships are not changed. The frequency content of the precursors and maskers also vary across experiments, but the signal is always presented at 4 kHz. The gray dotted line shows a schematic of the timecourse of forward masking due to neural excitation. The gray solid line shows a schematic of the timecourse of forward masking due to cochlear gain reduction with a precursor present.

However, cochlear gain reduction is not the only mechanism for forward masking. Neural excitation also plays a role in forward masking (see dotted line in Figure 1), and models based on this mechanism suggest additivity of masking, meaning that once compression is applied, the intensities of maskers add in their impact on the threshold of a closely following sound (Penner and Shiffrin, 1980; Oxenham and Moore, 1994; Plack et al., 2006). These models assume a static cochlear input-output function, but cochlear gain reduction occurs over time, affecting the cochlear non-linearity (Krull and Strickland, 2008; Roverud and Strickland, 2010). Previous work has shown that models including gain reduction fit data as well or better than those modeled with a static cochlear non-linearity (Jennings and Strickland, 2012; Roverud and Strickland, 2014). In one paradigm with a noise precursor, on-frequency masker, and 4-kHz signal, the signal level was fixed at 15–20 dB SL (sensation level) and masker threshold was measured for a range of precursor levels. The masker level had to be increased to effectively mask the signal with a precursor, more consistent with forward masking due to gain reduction than additivity of masking (Strickland et al., 2018). In this experiment, additivity of masking and gain reduction will again be compared, to establish that the forward masking in this experiment is more consistent with cochlear gain reduction. The paradigm to test this and the predicted results are shown in Figure 2. The Power Spectrum Model of masking is used in these predictions, such that detection occurs at a constant effective signal-to-masker ratio at the output of a single auditory filter at the signal frequency [for a review, see Jennings (2021)]. As in a similar paradigm at a lower frequency (DeRoy Milvae and Strickland, 2018), on- and off-frequency maskers will be obtained that elicit the same signal threshold (column 1 of Figure 2). An on-frequency precursor will then be added to each condition with the same temporal paradigm shown in Figure 1. Predictions are in the second two columns of Figure 2, for additivity of masking and gain reduction, respectively. If the additional masking is additive and does not change the cochlear non-linearity, a similar shift in threshold is expected with the addition of the precursor, not dependent on the frequency of the masker (arrows in column 2 of Figure 2). However, if the additional masking is related to cochlear gain reduction, no change in threshold is expected with an on-frequency masker, since the signal and masker are on the same function and are equally affected, but a large shift in threshold is expected with an off-frequency masker, since the signal is affected by the gain reduction and the masker is not (Cooper and Guinan, 2006; arrow in column 3 of Figure 2).

FIGURE 2

Figure 2. Schematic of cochlear input-output functions and threshold predictions in Experiment 1. Signal threshold (S) occurs at a criterion signal-to-masker ratio (SMR), in this case 0 dB (first column) for an on-frequency (top row) and off-frequency (bottom row) masker (M). With the addition of a precursor (P), predictions differ for forward masking due to additivity of masking or gain reduction. With additivity of masking (second column), a similar shift in signal threshold is expected when the same precursor is presented with equally effective on- and off-frequency maskers (arrows in second column). With gain reduction (third column), a larger shift in signal threshold is expected in the off-frequency case, since the masker is not affected by gain reduction at the signal frequency place (arrow in third column). The input-output functions, S, and M from the first column are repeated in gray in the second and third columns to illustrate the predicted changes with the introduction of a precursor.

If the masking associated with the precursor is more consistent with cochlear gain reduction, the effects of precursor frequency content can be explored and interpreted in terms of gain reduction. In the case of tonal elicitors, it was hypothesized that gain reduction would occur in a frequency-specific way, as observed with tonal elicitors in previous physiological studies in both animal models and humans (Liberman and Brown, 1986; Bonfils and Puel, 1987; Lilaonitkul and Guinan, 2009b, 2012). In this case, an on-frequency precursor should be a more effective masker than an off-frequency precursor at the same level. Examination of forward masking with increasing precursor level also provides further evidence about tuning; because an on-frequency precursor grows compressively in the auditory channel at the signal frequency place, gain reduction should increase at a slower rate than 1 dB/dB with increasing precursor level. Because an off-frequency precursor should grow linearly in the auditory channel at the signal place, gain reduction should increase at a rate of approximately 1 dB/dB with increasing precursor level. Support for these hypotheses also comes from previous modeling of forward masking data. Modeling off-frequency-elicited gain reduction with level increasing with a slope of 1 dB/dB and on-frequency-elicited gain reduction with level with a shallower slope has predicted forward-masking data well (Roverud and Strickland, 2014). In addition, on- and off-frequency forward masking has been measured previously by Oxenham and Plack (2000), but not interpreted with consideration of cochlear gain reduction.

In the case of broadband noise elicitors, it was hypothesized that they would be more effective elicitors of cochlear gain reduction than tones, as observed in human SFOAE data (Lilaonitkul and Guinan, 2009a). To compare gain reduction with tones and noises, masking at the level of the noise entering an equivalent rectangular bandwidth (ERB; Glasberg and Moore, 1990), an estimated cochlear filter, will be compared to masking at the level of the tonal precursors. Greater masking with the noise would suggest integration across frequency to elicit gain reduction. If, however, the masking with the noise is similar to an on-frequency tone, it would suggest that integration across frequency does not take place and instead that ipsilateral cochlear gain reduction has similar tuning to that seen with afferent nerve fibers.

In this experiment, estimates of cochlear input-output functions were measured for individual participants using a forward masking technique. We hypothesized that shifts in input-output functions with preceding sound at 4 kHz are more consistent with cochlear gain reduction than additivity of masking, as observed previously with a similar paradigm at 1 kHz (DeRoy Milvae and Strickland, 2018). Cochlear gain reduction was examined as a function of the level and frequency content of preceding sound in an effort to examine how the peripheral auditory system remains sensitive across a wide range of input signals, and to examine how elicitation of cochlear gain reduction is tuned. We hypothesized that gain reduction would increase with precursor level, but the slope of increasing gain reduction with increasing precursor level would be shallower with an on-frequency precursor than with an off-frequency precursor, due to cochlear compression of the precursor at the signal place. This would suggest that gain reduction from an ipsilateral elicitor is driven by excitation in an auditory filter at or near the signal frequency, like other forms of forward masking. With a broadband noise precursor, we hypothesized that stronger gain reduction would be elicited than seen with tonal stimuli, as seen with SFOAE measurements in humans. The outcome of this research is an estimate of cochlear gain reduction in decibels, obtained through perceptual measures in humans.

Experiment 1: Forward Masking With a Precursor Is More Consistent With Cochlear Gain Reduction Than Additivity of Masking

Growth-of-masking (GOM) functions were measured to obtain an estimate of each participant’s cochlear input-output function (Oxenham and Plack, 1997; Plack and Oxenham, 1998) with and without preceding stimulation, a precursor (Krull and Strickland, 2008; Jennings et al., 2009; Roverud and Strickland, 2010) under our temporal paradigm (see Figure 1). The additional masking with preceding sound could be interpreted as a decrease in cochlear gain, but there are other possible explanations, such as masking due to neural excitation, which predicts additivity of masking given a correction for peripheral compression (Penner and Shiffrin, 1980; Oxenham and Moore, 1994, 1995). A gain reduction hypothesis was tested against additivity of masking using on- and off-frequency forward maskers that resulted in the same signal threshold, making them equally effective maskers of the signal. When the same precursor is added to each condition, additivity of forward masking predicts a similar shift in threshold, regardless of masker frequency. However, gain reduction predicts that the addition of a precursor before an off-frequency masker will lead to a larger shift in threshold (see Figure 2). Because an off-frequency masker is processed linearly at the signal place at basal frequencies (Cooper and Guinan, 2006), its gain is not reduced by preceding on-frequency sound, and it is predicted to be a more effective forward masker.

Methods

Participants

Seven young adults (P1–P7) between the ages of 19 and 26 years (median: 21 years) participated in this experiment. All were female except for P5, who was male. All participants had normal audiometric thresholds (15 dB HL or less) at octave frequencies from 0.25 to 8 kHz and present distortion product otoacoustic emissions from 1.5 to 10 kHz. Some participants did not take part in all experiments.

Stimuli

Growth of Masking

Two types of GOM functions were measured for each participant in a forward masking paradigm to estimate the cochlear input-output function at full gain (without reduction in cochlear gain associated with prior sound stimulation) and reduced gain. For the full-gain GOM function, stimuli consisted of a 20-ms, 2.4-kHz tonal masker (including 5-ms cos² onset and offset ramps) followed by a 6-ms, 4-kHz tonal signal (including 3-ms cos² onset and offset ramps) with no time delay between masker and signal. As in previous studies (e.g., DeRoy Milvae and Strickland, 2018), this masker and signal duration were chosen to be near the estimated onset delay of 20–25 ms for the MOCR (James et al., 2005; Backus and Guinan, 2006), so that there is very little MOCR activation, if any, in this condition. Masker level was fixed between 30 and 95 dB SPL in order to trace out a GOM function for each individual. Signal level was varied to determine the signal level at masking threshold.

A second GOM function at reduced gain was measured for each individual using the same masker and signal, but with the addition of preceding sound before the masker, called a precursor (see Figure 1 for temporal paradigm). This function was measured with a 50-ms, 40 dB SPL, 4-kHz tonal precursor (including 5-ms cos² onset and offset ramps) presented prior to the masker and signal. The precursor duration was 50 ms, as this has been found to be the most effective duration for an on-frequency precursor to shift threshold given this temporal paradigm (Roverud and Strickland, 2014). A level of 40 dB SPL for a tonal on-frequency precursor has been found to produce robust gain reduction in previous studies (Roverud and Strickland, 2010; Jennings and Strickland, 2012).

In addition to the GOM functions, gain reduction was estimated by comparing the signal threshold in quiet to the signal threshold preceded by the precursor and no masker, with a 20-ms silent gap between precursor and signal (in place of the masker). This estimate has shown to be consistent with gain reduction estimates measured with a masker present (DeRoy Milvae and Strickland, 2018; DeRoy Milvae et al., 2021) for listeners with normal thresholds in quiet.

Equally Effective Maskers

On-frequency maskers were identified that were equally effective (produced the same signal threshold) as off-frequency maskers used to measure GOM functions. The 6-ms, 4-kHz signal (including 3-ms cos² onset and offset ramps) was fixed at the threshold level obtained when it was preceded by a 20-ms, 2.4-kHz masker (including 5-ms cos² onset and offset ramps). The level of a 20-ms, 4-kHz masker (including 5-ms cos² onset and offset ramps) was then varied to measure threshold and find the lowest masker level where the signal could be detected. This level was then confirmed to produce the same signal threshold as the off-frequency masker by fixing the masker level and varying the signal level. This was done for two points on the lower leg of the GOM function for each participant, although an effect of masker frequency with the addition of a precursor was expected as long as the point chosen was not affected by compression.

To examine whether shifts in forward masking with a precursor were more consistent with gain reduction than additivity of masking, an identical precursor was presented before the two equally effective maskers and signal threshold was measured in each condition (measurements from the GOM function used for the off-frequency conditions). Additivity of masking predicts that adding a 50-ms, 40 dB SPL, 4-kHz precursor (including 5-ms cos² onset and offset ramps) before the on-frequency masker and off-frequency masker that produce the same signal threshold should cause an identical shift in threshold (see column 2 of Figure 2). This method does not rely on the measurement of the input-output function for interpretation. It was hypothesized that a larger shift in signal threshold would be seen for the off-frequency condition, more consistent with precursor masking related to cochlear gain reduction (see column 3 of Figure 2).

Procedure

The experiment took place in a double-walled sound-attenuating booth (IAC, Bronx, NY, United States). Tucker–Davis Technologies (TDT, Alachua, FL, United States) hardware was used. Stimuli were digitally generated at a sampling rate of 25 kHz. They were then sent to four separate digital-to-analog channels (TDT DA3-4, 16-bit), low pass filtered at 10 kHz (TDT FT5 and FT6-2), mixed (TDT SM3), buffered (TDT HB6), and output to the participant’s right ear via an ER-2 (Etymotic Research, Inc., Elk Grove Village, IL, United States) insert earphone. This insert earphone has a flat frequency response at the eardrum for frequencies from 0.25 to 8 kHz. Participants wore both the left and right earphones, even though sound was not presented to the left ear, to reduce interference from ambient noise.

Participants performed a three-interval forced choice task. Intervals were separated by 500 ms of silence and participants indicated the interval containing the signal by pressing a key. Visual indicators were used to identify the intervals and feedback was given to indicate the veracity of the participant’s choice. The signal level was adjusted while the masker level was held constant to approximate a detection threshold of 70.7% correct on the psychometric function (Levitt, 1971) for the range of masker levels tested. To determine the on-frequency masker levels needed to elicit a similar signal threshold as off-frequency maskers, the masker level was adjusted while the signal level was held constant. Participants completed 2–5 h of training on GOM tasks to control for learning effects and 1–3 h of training with on-frequency masker conditions for the equally effective maskers task. Less training was needed on this task because participants were already familiar with the general forward masking task. Two runs for each condition collected on the final day of participation are included in the experimental data. However, on-frequency masked thresholds of P2 continued to show high variability after training. For this reason, more than two estimates of each threshold were attempted for this participant, with an average of 3.5 threshold estimates measured per condition that did not have to be removed from experimental data due to high standard deviations. Off-frequency conditions were also repeated for this subject instead of using the measurements from the GOM function, so that measurements with equally effective maskers were collected at a similar point in time for this highly variable listener. In addition, an experimenter error led to three thresholds collected for P5 in the 65 dB SPL off-frequency masker and precursor condition (reduced gain GOM function and off-frequency equally effective masker condition with a precursor), but this additional threshold was similar to the first two measured and was not believed to influence the results.

During each masked trial, high pass noise was presented to limit off-frequency listening (Nelson et al., 2001). It began 50 ms before the first stimulus and ended 50 ms after the signal. The noise was presented at a spectrum level 50 dB below the signal level (varying adaptively with the signal level), and had 5-ms cos² onset and offset ramps and a bandwidth of 4.8–8.0 kHz. Because P2 demonstrated difficulty with the tasks when the high pass noise was present, resulting in inconsistent thresholds across trials, the noise was removed during testing for this participant.

Each run consisted of 50 trials. The step size was 5 dB before the second reversal in signal (or masker) level, and then the step size decreased to 2 dB. Runs were excluded if the standard deviation was greater than 5 dB for one or two final runs or if less than six reversals were present. The final even number of reversals at the 2-dB step size were averaged to estimate threshold for each run.

Results

Growth of Masking

Growth-of-masking functions without a precursor (open circles) and with a precursor (filled circles) are plotted in Figure 3. Open triangles represent the signal threshold when the signal is presented alone. Filled triangles represent the signal threshold when the precursor is present but there is no masker (20-ms gap of silence between precursor and signal). As shown in previous work (Krull and Strickland, 2008; Jennings et al., 2009; Roverud and Strickland, 2010), the precursor shifted the lower leg of the GOM function to higher signal levels (a rightward shift). This shift is consistent with a decrease in cochlear gain. P2 had a limited number of thresholds for the precursor condition because this participant’s runs often resulted in standard deviations that were above 5 dB, and those thresholds were not included. It was observed that the masker-absent gain reduction estimate (difference between open and filled triangles in Figure 3) was a reasonable estimate for the gain reduction observed by the shift in the GOM function, as shown previously (DeRoy Milvae and Strickland, 2018; DeRoy Milvae et al., 2021).

FIGURE 3

Figure 3. Individual GOM functions and masker-absent gain reduction estimates. Signal thresholds for the masker-alone condition are plotted as open circles and signal thresholds with the addition of a precursor are plotted as filled circles. Signal threshold without a preceding masker is plotted as an open triangle, and signal threshold with a precursor and 20-ms delay is plotted as a filled triangle. The difference between the triangles is the masker-absent gain reduction estimate. Arrows indicate the off-frequency masker levels used in the equally-effective-masker conditions. Error bars represent one standard deviation.

Equally Effective Maskers

Individual signal thresholds are shown in Table 1 and average threshold shifts with the addition of a precursor at two masker frequencies are shown in Figure 4. As was shown in Figure 3, the precursor shifted signal thresholds to higher levels when the masker was 2.4 kHz. In the 4 kHz masker case, there was a much smaller shift in threshold. One-tailed t-tests (with a Holm-Bonferroni correction) were performed to test for significance that the threshold for the off-frequency condition with an added precursor was higher than that of the on-frequency condition with an added precursor at the individual level, and significant differences (p < 0.05) are noted by asterisks in Table 1. P1, P2, and P3 showed a significantly higher threshold for the off-frequency condition with an added precursor at one level of matched threshold, t(2) = 8.03, p = 0.046; t(6) = 8.43, p < 0.001; and t(2) = 5.23, p = 0.041; respectively. P5 showed this same effect at two levels of matched threshold. For a matched threshold of 27 dB SPL, t(3) = 6.98, p = 0.027, and for a matched threshold of 29 dB SPL, t(2) = 9.57, p = 0.043. Other participants showed a similar trend that did not reach significance. In addition, the data were averaged across participants by taking the average difference between the precursor condition and the masker-alone condition for each masker frequency (averaging the two levels for each participant). A one-tailed t-test was performed for these data and there was a significant difference between the average change in threshold for a 2.4-kHz masker and a 4-kHz masker when an identical precursor is added, t(8) = 4.91, p = 0.006.

TABLE 1

Table 1. Individual data with equally effective maskers.

FIGURE 4

Figure 4. Bars indicate the group average increase in signal threshold with a precursor preceding equally effective off-frequency (2.4-kHz) and on-frequency (4-kHz) maskers. Signal threshold shift with a precursor was averaged for two matched signal levels for each participant (symbols). Error bars represent one standard deviation.

Discussion

The shift in signal threshold with a precursor and no masker (difference between open and filled triangles in Figure 3) was demonstrated to be a reasonable estimate of gain reduction, as observed previously (Roverud and Strickland, 2010; DeRoy Milvae and Strickland, 2018). However, in some cases it was lower than that observed in the GOM function; for example, the masker-absent threshold shift was smaller than that with a masker present for P3. Lower estimates may be found with this method since the MOCR can reduce the spontaneous rate of auditory nerve fibers (Guinan and Gifford, 1988). Therefore, the masker-absent estimate of gain reduction may sometimes underestimate gain reduction.

With equally effective maskers that differed in frequency, a larger shift in threshold was induced for the off-frequency masker condition than for the on-frequency masker condition with the introduction of an on-frequency precursor (Figure 4). Since the change in threshold depended on masker frequency, the masking provided by the precursor was more consistent with gain reduction than neural excitation alone. Additivity of masking would predict a similar change in threshold, regardless of masker frequency. The current data show that when the effects of a precursor on an on-frequency and off-frequency masking condition are compared, the change in signal threshold is not easily explained by additivity of masking. This difference in threshold shift measured was consistent with gain reduction in that the precursor in both cases elicits gain reduction at the 4-kHz place, differentially affecting the on- and off-frequency maskers. Since the 2.4-kHz masker should have an approximately linear response at the 4-kHz place, it is not affected by the gain reduction elicited by the precursor and is thus a more effective masker than the 4-kHz masker in this condition. This leads to a greater shift in threshold for the off-frequency masker condition. Even with this differential effect, some change in threshold can be seen for the on-frequency masker. This effect is still consistent with gain reduction. It can occur if the gain is decreased enough that the signal becomes inaudible. Alternatively, residual additivity of masking, after accounting for gain reduction, could also explain the increase in threshold with an on-frequency masker.

This result is similar to that observed previously at 4 kHz (Jennings et al., 2009) and at a lower signal frequency (DeRoy Milvae and Strickland, 2018). A differential effect of a precursor on masking of a signal by on- and off-frequency maskers below the signal frequency has also been seen in studies in which the signal level was fixed and the masker level was varied to measure a psychoacoustic tuning curve or a temporal masking curve. In these cases, the addition of the precursor decreases the masker level needed to mask the signal for the off-frequency masker, but not for the on-frequency one. This has been seen with a contralateral precursor (Kawase et al., 2000; Fletcher et al., 2016) and an ipsilateral precursor (Jennings and Strickland, 2012).

Additional evidence supporting a gain reduction explanation comes from Roverud and Strickland (2014), a study exploring differences in forward masking with on- and off-frequency precursors. They measured the shift in threshold following an off-frequency masker produced by an on- or off-frequency precursor, as a function of precursor duration. For the 2.4-kHz precursor, threshold increased with precursor duration for durations up to 160 ms. For the 4-kHz precursor, however, threshold increased with precursor duration up to 50 ms, but then either plateaued or in some cases oscillated. This was modeled using a temporal window model combined with gain reduction elicited by the precursor. For an on-frequency precursor, the precursor itself was affected by gain reduction, and thus effectiveness fluctuated with duration. The off-frequency precursor was not affected by gain reduction within the signal channel, and thus effectiveness continued to grow with duration.

Experiment 2: Signal Threshold With Increasing Level of Tonal Precursors

The results of Experiment 1 support the theory that a shift in signal threshold with a precursor reflects gain reduction. In that case, it is of interest to examine gain reduction as a function of precursor frequency and level, to examine the tuning of cochlear gain reduction elicitation. The results of previous studies suggest that gain reduction may increase at a slope of approximately 1 dB/dB of increasing precursor level for a masker well below the signal frequency (Oxenham and Plack, 2000; Roverud and Strickland, 2014), and increase at a shallower slope for a masker at the signal frequency (Plack and Oxenham, 1998; Oxenham and Plack, 2000; Roverud and Strickland, 2014). This experiment replicates and builds on aspects of the design of Oxenham and Plack (2000), and results are interpreted taking into account a gain reduction hypothesis.