Familiar Tonal Context Improves Accuracy of Pitch Interval Perception

Graves, Jackson E.; Oxenham, Andrew J.

doi:10.3389/fpsyg.2017.01753

ORIGINAL RESEARCH article

Front. Psychol. , 09 October 2017

Sec. Auditory Cognitive Neuroscience

Volume 8 - 2017 | https://doi.org/10.3389/fpsyg.2017.01753

Familiar Tonal Context Improves Accuracy of Pitch Interval Perception

$\r\nJackson E. Graves*$ Jackson E. Graves^*

Andrew J. Oxenham

Department of Psychology, University of Minnesota, Minneapolis, MN, United States

A fundamental feature of everyday music perception is sensitivity to familiar tonal structures such as musical keys. Many studies have suggested that a tonal context can enhance the perception and representation of pitch. Most of these studies have measured response time, which may reflect expectancy as opposed to perceptual accuracy. We instead used a performance-based measure, comparing participants’ ability to discriminate between a “small, in-tune” interval and a “large, mistuned” interval in conditions that involved familiar tonal relations (diatonic, or major, scale notes), unfamiliar tonal relations (whole-tone or mistuned-diatonic scale notes), repetition of a single pitch, or no tonal context. The context was established with a brief sequence of tones in Experiment 1 (melodic context), and a cadence-like two-chord progression in Experiment 2 (harmonic context). In both experiments, performance significantly differed across the context conditions, with a diatonic context providing a significant advantage over no context; however, no correlation with years of musical training was observed. The diatonic tonal context also provided an advantage over the whole-tone scale context condition in Experiment 1 (melodic context), and over the mistuned scale or repetition context conditions in Experiment 2 (harmonic context). However, the relatively small benefit to performance suggests that the main advantage of tonal context may be priming of expected stimuli, rather than enhanced accuracy of pitch interval representation.

Introduction

Pitch, a primary dimension of auditory sensation, is an attribute closely related to the fundamental frequency (F0) or overall periodicity of a sound. In speech, rising and falling pitch contours serve as cues to a speaker’s emotions, intentions, and emphasis, and as cues to semantic meaning in tonal languages. In music, sequences of pitch define melody and simultaneous combinations of pitch define the harmony of chords. In Western music, as in many other traditions, pitches are organized into discrete categories within a tonal hierarchy such as a musical key. Listeners, especially those with musical training, are sensitive to these hierarchies, rating some notes as better “completions” than others following a musical scale (Krumhansl and Shepard, 1979), or following a single chord or sequence of chords (Krumhansl and Kessler, 1982). The resulting “tone profiles” of perceived pitch relationships within the key cannot be predicted simply from proximal stimulus similarities, and instead are thought to reflect prior knowledge and exposure (Parncutt and Bregman, 2000). Tonal structure is a strong factor influencing psychological expectancies for both melody and harmony (Schmuckler, 1989). For melodies, listener expectations are also heavily influenced by contour (Cuddy and Lunney, 1995), in accordance with contour-based models (Narmour, 1990). Thus, to fully describe listener expectations for melodic continuation, it is necessary to consider both tonal structure and melodic contour as separate influences (Graves et al., 2014).

Sensitivity to tonal hierarchies may be the result of a process of statistical learning, wherein listeners come to expect musical patterns to which they have been frequently exposed. Statistical learning for pitch patterns has been observed on a small scale in both infants and adults (Saffran et al., 1999), in a process analogous to learning of word segmentation in language development. On a larger scale, tonal expectations in Western listeners are well explained by statistical regularities in familiar Western music such as folk songs and chorales (Pearce and Wiggins, 2006). This learning likely occurs very early in life, as infants as young as 7 months are sensitive to familiar tonal structures (Cohen et al., 1987). However, specialization for a particular tonal hierarchy may take time to develop fully: while 6–8 month-old infants are equally able to detect violations of various tonal structures, Western adults are most sensitive to violations of the Western diatonic scale (Lynch et al., 1990; Trainor and Trehub, 1992).

Once learned, tonal sensitivity is a robust phenomenon. Familiar melodies are stored in long-term memory based on tonal structure, not only contour (Dowling and Fujitani, 1971), and even short-term memory for novel melodies is influenced by tonality (Dowling, 1978; Boltz, 1991). In fact, for musically trained listeners, tonal hierarchies need not even be cued physically: tone profiles of pitch relationships within a musical key can also be measured following imagined (not physically presented) tonal hierarchies (Vuvan and Schmuckler, 2011).

Accessing these overlearned tonal hierarchies can facilitate pitch processing when the relevant pitches are highly expected within the tonal structure. For various pitch processing tasks, response times are faster for expected than for unexpected chords, based on the preceding harmonic progressions (Bharucha, 1987; Bigand and Pineau, 1997; Tillmann et al., 2008; Tillmann and Marmel, 2013), as well as for notes primed by melodic context (Marmel et al., 2008, 2011). The mechanism of this facilitation of processing may be either priming of expected pitches, or enhanced perception and representation of important pitches or harmonies within the tonal hierarchy. Under the former expectation-based explanation, a pitch that is predicted or expected by a tonal hierarchy may produce a faster response time simply because less time is required to react to an unsurprising or predictable event. This explanation is favored by most reaction-time studies, e.g., Bigand and Pineau (1997). Under the second perceptual-accuracy-based explanation, however, response times could be faster with tonal context because the representation of pitch at some level in the auditory system becomes more accurate, rendering the task easier. This may take the form of anticipatory activation of expected pitches (e.g., Bharucha, 1987). Increased event-related potential (ERP) amplitudes to pitches high in the tonal hierarchy provide additional evidence for enhanced neural representation of these pitches (e.g., Krohn et al., 2007).

If the decreased response times in these studies reflect an enhanced sensory representation of pitch, we might expect to observe improvements in measures of performance or accuracy as well. One such measure is pitch discrimination, where the listener directly compares two pitches presented in sequence. For this task, tonal context has been found to improve accuracy, but the observed effects have been small relative to effects on response time, and in some cases may be modulated by differences in timbre between tones (Warrier and Zatorre, 2002; Marmel et al., 2008; Borchert and Oxenham, 2010). Harmonic priming studies have used a dissonance detection task in which the listener must detect the presence of an augmented root or augmented fifth (both highly dissonant chord members in Western music). Tonal context also has small and inconsistent effects on accuracy for this task (Bigand and Pineau, 1997; Tillmann and Marmel, 2013), in contrast with robust effects on response time. However, in these tasks, the mistuning can be detected also by the presence of acoustics beats in the waveform of the dissonant interval (McDermott et al., 2010b), meaning that the pitch interval itself need not be discriminated by the participants. Thus, the lack of a robust effect of tonal context on task accuracy in these situations does not necessarily imply a lack of pitch enhancement through tonal context.

It is possible that tonal context effects are stronger for pitch interval discrimination than for simple pitch discrimination. Pitch intervals determine tonal hierarchies and set pitch apart from other auditory dimensions such as brightness and loudness (McDermott et al., 2008; Graves et al., 2014). Interval discrimination may be a more difficult task, due to the higher cognitive load required to represent distances (intervals) as opposed to individual values (pitches) in working memory. This could be the reason that discrimination thresholds, or difference limens (DLs), for pitch intervals are large compared to basic pitch DLs, which are exceptionally low among auditory dimensions (McDermott et al., 2010a). With more room for improvement, one might expect that any enhancement of the sensory representation of pitch would be especially beneficial on a pitch interval perception task. In addition, one known effect of tonal structure on pitch interval perception is that tonality allows for categorical perception of discrete intervals, as opposed to a continuous range. There is some evidence that musicians may more accurately discriminate pitch intervals at category boundaries than within an interval category, although this effect is not robust, and is sensitive to differences in experimental methodology (Burns and Ward, 1978). However, the effect was not observed at all in non-musicians, suggesting that categorical perception, if present, is learned. In a convergent finding, small frequency oscillations are more easily detected when centered around perfect octaves and fifths than neighboring intervals (Demany and Semal, 1992). The subjective “octave” category is slightly stretched relative to a physical octave (doubling in frequency), but approaches the physical octave when tonal context is introduced (Cuddy and Dobbins, 1988). Activating a tonal hierarchy could potentially enhance pitch interval perception by sharpening distinctions between primed interval categories. In other words, within a tonal context, a musical interval that is larger or smaller than expected may result in the second note being perceived as a “sour note” with respect to its expected pitch value, rather than in terms of the interval size between it and the preceding note.

A previous study found that the discrimination of musical intervals was better following a short melody than for intervals presented in isolation, suggesting that tonal context does enhance perception of pitch intervals (Wapnick et al., 1982). However, certain aspects of that study’s methodology leave its results open to interpretation. Firstly, only participants with a very high degree of musical experience were tested, and these participants received additional extensive training on an interval labeling task before completing the interval discrimination task. Many of them reported having absolute pitch, and all of them showed some degree of absolute pitch labeling ability, making it unclear whether the participants even needed to compare the two tones in each trial to complete the task. Although benefit from melodic context should not depend on absolute pitch possession, this may have transformed the putative relative-pitch task into functionally an absolute-pitch task. Secondly, the first pitch of the first interval on discrimination trials was always held constant, potentially allowing participants to use absolute pitch, instead of relative pitch, and so employ basic pitch discrimination instead of pitch interval discrimination. Thirdly, no distinction was made between a musical context that defines a congruent tonal hierarchy (such as a major key) and a tonally incongruent musical context: participants heard either a familiar melody or nothing. Thus, the benefit of a tonal context may be due to the reinforcement of tonality, or simply due to the presence of any context pitches, regardless of their tonal congruence.

The present study sought to determine whether a prior tonal context enhances pitch representations in a way that improves pitch interval discrimination. In order to ensure that participants were perceiving relative pitch intervals, we roved all absolute pitches in the study across a continuous range of fundamental frequencies. To dissociate various potential interpretations of a difference between familiar melodic context and no context, we also included three control conditions: a Repetition condition to test the effect of simply reinforcing the target pitch without any reference to a tonal (e.g., major or minor) center, and two unfamiliar melodic contexts (Mistuned and Whole-Tone Scales) for comparison with the more familiar (diatonic, Major Scale) context. If familiar tonal hierarchies do in fact facilitate pitch processing by enhancing the sensory representation of pitch or pitch intervals, we would predict that tonal context improves performance in an interval discrimination task, but only in cases in which the context provides congruent tonal cues. The first experiment established context using a melodic sequence of single pitches. The second experiment established context using a harmonic sequence of multiple pitches in the form of an authentic cadence. In all cases, care was taken to ensure that none of the context tones was of the same pitch class as the test tone itself, to avoid the possibility that participants were making a direct comparison between the test tone and one of the context tones.

Experiment 1: Melodic Context for Pitch Intervals

Materials and Methods

Stimuli

Participants heard sequences of pitches carried by harmonic complex tones. The tones were generated with all harmonics lower than the Nyquist frequency (22.05 kHz), and were lowpass filtered with a cutoff frequency of 200 Hz and a -12 dB/octave slope. The overall level of each tone after filtering was 60 dB SPL. The tones were generated within MATLAB (The Mathworks, Natick, MA, United States), using a 24-bit L22 soundcard (Lynx Studio Technology, Costa Mesa, CA, United States), presented diotically through HD650 headphones (Sennheiser United States, Old Lyme, CT, United States) at a sampling rate of 44.1 kHz.

Figure 1 shows the paradigm for stimulus presentation in the five melodic context conditions. The task-relevant stimuli on each trial were two tones presented sequentially, each with a duration of 400 ms, including 10-ms raised-cosine rise and fall ramps, separated by a gap of 100 ms. Trials in the No Context condition consisted only of the test interval formed by these two test tones. The F0 of the first test tone was randomly chosen from a uniform distribution within a 1.5-octave range from 200 to 565.69 Hz (approximately G3 to C#5). On half of the trials, the second test tone’s F0 was higher than that of the first by a ratio exactly equal to a standard interval in the diatonic equal-tempered scale. On the other half of trials, the second test tone’s F0 was higher than the frequency that would be chosen by the standard interval size; we termed the ratio of this discrepancy ΔF0. Thus, if the test tone was one semitone higher than the frequency that would have been selected in the standard interval, ΔF0 would be approximately 6% (2^1/12). Participants were instructed to judge whether each interval was “small, in tune” (when the F0 difference was exactly the standard interval size) or “large, mistuned” (when the F0 difference was greater than the standard interval size by ΔF0). In this way, participants had the option of using either the size cue (“small” or “large”) or the tuning cue (“in tune” or “mistuned”) to complete this task. Two standard interval sizes were tested in two separate phases of the experiment. These were two semitones (a major second) or five semitones (a perfect fourth) in the equal-temperament tuning system. We chose common intervals in Western tonal music because our participants were more likely to have been exposed to Western tonal music than other musical styles. By avoiding standard intervals larger than five semitones, we avoided repeating any pitch classes from the context sequence.

FIGURE 1

FIGURE 1. Schematic diagrams in spectrogram form of a single trial in each of the five conditions for Experiment 1 (melodic context). Pitch distances are labeled in semitones (ST). Context conditions are also illustrated with musical notation.

On trials in the other four conditions, the test interval was preceded by a melodic context sequence, consisting of four tones with durations of 400 ms each (including 10-ms raised-cosine rise and fall ramps), separated by 100-ms gaps, with 600 ms silence between the context sequence and the final two test tones. In each context condition, the F0 of the final tone in the context sequence was equal to the F0 of the first tone in the test interval. In the Repetition condition, all four context tones had the same F0 as the first test tone. In the Mistuned condition, each tone in the context sequence was exactly 1.5 semitones higher than the previous tone. In the Whole Tone condition, each tone in the context sequence was exactly two semitones higher than the previous tone. Finally, in the Major condition, the context sequence corresponded to a major (diatonic) scale ascending from the dominant (5th) scale degree to the tonic, with successive interval sizes of two semitones, two semitones, and one semitone. These four context conditions were designed to dissociate the effects of pitch reinforcement, directional context, tuning cues, and tonal hierarchy, respectively. All four context conditions provide some pitch reinforcement: additional examples of the first pitch of the test interval may be helpful. Mistuned, Whole Tone, and Major conditions all provide directional context: upward intervals of fixed size are presented, against which the test interval could be compared. Only Whole Tone and Major conditions fit within the Western 12-tone chromatic scale, and only the Major condition fits within the Western hierarchical diatonic scale.

Participants

Twenty-one participants, 9 male and 12 female, were recruited from the Twin Cities campus of the University of Minnesota. They ranged from 18 to 25 years of age (Mean = 19.8, SD = 1.9), and from 0 to 15 reported years of musical experience (Mean = 5.9, SD = 5.2), with musical experience defined as regularly playing any musical instrument. All participants were screened for normal audiometric hearing thresholds, defined as not exceeding 20 dB hearing level (HL) for frequencies between 250 and 8000 Hz. All protocols were approved by the University of Minnesota Institutional Review Board. The experiment was completed in a single 2-h session per participant. All participants provided written informed consent, in accordance with the Declaration of Helsinki, and were compensated for their participation.

Procedure

To allow the participants to gain familiarity with the standard interval size (two or five semitones), each participant completed the entire experiment for one standard interval before being tested on the other standard interval. The order of the standard interval presentation was counterbalanced between participants, such that 11 participants completed the procedure for the two-semitone standard first, while 10 participants completed the procedure for the five-semitone standard first.

Because the task was novel and not intuitive for many participants, each participant began with orientation and training before moving on to the testing phase. The orientation phase consisted of listening to five labeled examples of the small, in-tune interval and five examples of the large, mistuned interval. For this demonstration, the ΔF0 ratio was fixed at 8% (larger than a semitone). During the orientation phase, participants did not respond, but merely listened to the labeled examples.

The training phase consisted of 3 blocks of 40 trials each in the No Context condition. For the first block, ΔF0 was fixed at 12.6% (just larger than two semitones). For the second block, ΔF0 was 8%, and for the third it was 5% (just under a semitone). No time limit was imposed on responses during this training period. Participants generally performed near ceiling during this training phase, making few errors, as the ΔF0s used were large.

Following training, each participant’s DL for ΔF0 in the No Context condition was estimated in a pilot phase of the experiment using an adaptive tracking procedure. The geometric mean estimated DL was 3.1% for the two-semitone standard, 95% CI [2.3% 4.2%], and 2.8% for the five-semitone standard, 95% CI [2.0% 4.0%]. This wide range of thresholds is typical for frequency discrimination tasks, as recently illustrated in a study of 100 participants with normal hearing (Whiteford and Oxenham, 2015).

The estimated DLs, determined for each participant individually, were used to set the ΔF0 in the main testing phase of the experiment. Based on pilot testing, participants were expected to perform at sensitive levels (below ceiling and above chance) when tested with ΔF0 set to roughly 1/4 the threshold estimated by the adaptive tracking procedure. This discrepancy may be due to learning occurring over the course of the experiment. Accordingly, each participant was tested with ΔF0 set to 25% of his or her initially estimated threshold. Thus, ΔF0 was constant for each participant for each standard interval size, but different across participants and standard interval sizes according to the estimated DL.

The testing phase for each standard interval condition consisted of 25 blocks of 20 trials each. Each block contained trials with one of the five context conditions. On each trial, participants were presented with the stimulus and asked “Which kind of interval – small, in-tune or large, mistuned?” Participants were required to indicate their response via key press within 1 s of stimulus offset. The time limit was introduced in order to prevent mental rehearsal of the stimulus following the presentation. If a participant failed to respond within this time limit, the experiment program recorded a response of “small, in-tune” and proceeded to the next trial. Since this was the correct response on half of the trials, running out of time gave the participant a 50% chance of being correct. Participants were instructed to avoid running out of time, and accordingly this happened rarely: the percentage of trials on which a participant ran out of time ranged from 0.2 to 6.4% (Mean = 1.79%, SD = 1.50%).

Each participant completed five blocks for each of the five context conditions during this phase. The context condition varied from block to block, with the order of context conditions determined randomly for each consecutive set of five blocks. Participants were instructed to focus only on the final two tones (the test interval) if a context sequence was present. After the testing phase was completed for one standard interval condition, the procedure was repeated in its entirety for the other standard interval condition, starting with new orientation, training, and DL estimation periods.

Analysis

Individual participants’ sensitivity (d′) was estimated by subtracting the z-scored (the inverse cumulative normal distribution function) false alarm rate from the z-scored hit rate. In this calculation, a hit was defined as correctly detecting the large, mistuned interval, while a false alarm was defined as incorrectly responding “large, mistuned” when the small, in-tune interval was presented. Figure 2 shows the pattern of performance across conditions for standard interval size. Supplementary Table S1 shows the degree of benefit for each condition relative to No Context, for each participant.

FIGURE 2

FIGURE 2. Interval discrimination performance from Experiment 1 (melodic context). Performance in d′ is shown for the two-semitone (left) and five-semitone (right) standard interval sizes. Performance in the No Context condition was treated as a baseline (horizontal dashed line). Error bars represent ±1 standard error of the mean across participants. Horizontal solid lines with asterisks show significant pairwise comparisons between conditions for each standard interval size.

The d′ values in the No Context condition, averaged across all participants, were between 0.5 and 1, indicating that our estimates from the pilot phase successfully produced performance that was well above chance (d′ = 0) but below ceiling (d′ > ∼2.5). A paired-samples t-test comparing d′ values in the No Context condition for the two standard interval sizes was not significant (p = 0.52), suggesting that our pilot estimates of DLs in the baseline condition had been successful at targeting roughly equal levels of performance between the two standard interval sizes. Beyond that, since participants were tested at different ΔF0 levels according to their individual estimated DLs, comparisons of absolute d′ values between participants are uninformative. We analyzed the effect of all five context conditions with a repeated-measures ANOVA on these d′ values, and ran post hoc pairwise comparisons to determine the advantage of each condition over the baseline No Context, as well as benefit of one condition over another.

Results

The repeated-measures ANOVA on d′ values, with two within-subjects factors of standard interval size and context condition, revealed a main effect of context condition, F(4,80) = 10.26, p < 0.001, η² = 0.339. Post hoc pairwise comparisons of context conditions with Bonferroni correction (criterion p = 0.05/10 = 0.005) showed significant benefit over No Context for Major Scale (mean difference = 0.47, p < 0.001, Cohen’s d = 1.02) (Cohen, 1988) and Repetition (mean difference = 0.45, p < 0.001, Cohen’s d = 0.96) contexts, as well as an advantage of Major Scale over Whole-Tone Scale context (mean difference = 0.28, p = 0.001, Cohen’s d = 0.53). No other pairwise comparisons reached significance.

We also observed a significant interaction between context and standard interval size, F(4,80) = 6.049, p < 0.001, η² = 0.232. The interaction reflects in part the difference in the benefit from the Major Scale and Repetition contexts for the two- and five-semitone standard interval sizes. We performed 25 post hoc pairwise comparisons to investigate this interaction: 10 comparisons between context conditions for each of the 2 standard interval sizes, and 1 comparison between standard interval sizes for each of the 5 conditions. With Bonferroni correction (criterion p = 0.05/25 = 0.002), for the two-semitone standard interval size, d′ values were higher in the Major Scale context than No Context (mean difference = 0.73, p < 0.001, Cohen’s d = 1.09) or Whole Tone (mean difference = 0.43, p = 0.001, Cohen’s d = 0.90) conditions. For the five-semitone standard, d′ values were higher in the Repetition context than No Context (mean difference = 0.54, p < 0.001, Cohen’s d = 1.04), Mistuned (mean difference = 0.43, p < 0.001, Cohen’s d = 1.28), or Whole Tone (mean difference = 0.45, p = 0.001, Cohen’s d = 0.82) conditions. No other pairwise comparisons between conditions, nor comparisons between standard intervals within conditions, reached significance. We observed no main effect of standard interval size.

Discussion

The results of Experiment 1 suggest that performance on an interval discrimination task is significantly affected by the tonal context in which the task is performed. The Major Scale melodic context provided an advantage over the No Context or Whole-Tone Scale conditions, but no advantage over the Repetition or Mistuned-Scale conditions. Thus, no clear evidence was obtained for the benefit of establishing an over-learned (major-scale) tonal context over a simple repetition of the reference tone.

The interaction effect between context condition and standard interval size suggests that the pattern of improvement from context was different for the two- and five-semitone standard tasks. One evident difference between these patterns of results is the effect of the Repetition context and the Major Scale context in the two tasks. The best performance in the two-semitone-standard task was from the Major Scale context, whereas the best performance in the five-semitone standard task was from the Repetition context. In interpreting this difference, it is worth considering possible unintended tonal implications of the melodic context sequences. The intended interpretation of the Major Scale context was as the final four notes of an ascending major scale, ending on the tonic. Under this interpretation, both the two-semitone interval and the five-semitone interval fit in the established key. However, participants may have interpreted this sequence instead as the first four notes of an ascending major scale, beginning on the tonic. Under this interpretation, only the two-semitone interval fits in the established key. This ambiguity may explain the reduced improvement of this context sequence in the five-semitone-standard task.

The Repetition context, though intended as one level in a series of control conditions (disambiguating the effect of reiterating a reference pitch), could be interpreted as a repeating 5th scale degree (the dominant), anticipating the arrival of the tonic, which is exactly five-semitones higher. This is a common pattern in traditional Western music, and the effect may have been enhanced by the rhythmic pattern established by the temporal paradigm of this experiment, such that the final tone of the test interval can be heard to fall on a downbeat. This interpretation may explain the heightened improvement of the Repetition context sequence in the five-semitone-standard task.

If the Repetition condition had only the simple effect we intended, to reinforce the reference pitch, the simplest interpretation would be that the familiar tonal context provided an advantage over unfamiliar tonal context, but it did not provide an advantage over simple repetition of the first pitch in the test interval. This would suggest that the benefit of melodic context observed by Wapnick et al. (1982) can be disrupted with unfamiliar tonality, but may have more to do with repetition and reinforcement of target pitches than with the establishment of tonal structure. However, if we do interpret the Repetition context as inducing an accidental “tonal context” itself, these results are reasonably consistent with Wapnick et al. (1982).

Experiment 2: Harmonic Context for Pitch Intervals

Rationale

The results of Experiment 1 were mixed: familiar diatonic tonal context improved performance on pitch interval discrimination over no context and one unfamiliar context, Whole Tone, but not over the other unfamiliar context, Mistuned, or over simple tone repetition. Specifically for the five-semitone standard interval, familiar diatonic context provided no significant advantage over no context. One possible explanation of the small degree of benefit over no context, and the lack of benefit of the familiar tonal context with the five-semitone standard interval, is that the context of a sequence of four single tones did not establish a sufficiently strong and unambiguous sense of tonality. Indeed, many past studies have used chord progressions, rather than individual notes, to establish a clear tonality (Krumhansl and Kessler, 1982; Bharucha, 1987; Bigand and Pineau, 1997; Parncutt and Bregman, 2000; Tillmann et al., 2008). These studies have generally found stronger effects of tonality on response time than studies that used single notes (Krumhansl and Shepard, 1979; Warrier and Zatorre, 2002; Marmel et al., 2008).

To address this concern, we used chords to provide a more robust and unambiguous establishment of tonal context and to remove the potential ambiguities of the contexts used in Experiment 1. We also redefined the No Context condition in Experiment 2 to include noise bursts preceding the test interval, in order to preserve attentional and temporal cuing without pitch reinforcement.

Since musically trained listeners are more sensitive to tonal hierarchies than listeners without musical training (Krumhansl and Shepard, 1979), any effect of context may be greater in musicians than non-musicians. Indeed, for relative pitch tasks, listeners with musical experience may be uniquely sensitive to preceding context that induces tonality (Dowling, 1986). Using the results from both Experiment 1 and Experiment 2, we also investigated whether participants with musical training were more likely to see an advantage from Major Scale context.