- 1Department of Psychological Sciences, University of Connecticut, Storrs, CT, USA
- 2Department of Physics, University of Connecticut, Storrs, CT, USA
- 3Department of Music, Center for Computer Research in Music and Acoustics, Stanford University, Stanford, CA, USA
- 4Center for Complex Systems and Brain Sciences, Florida Atlantic University, Boca Raton, FL, USA
Entrainment of cortical rhythms to acoustic rhythms has been hypothesized to be the neural correlate of pulse and meter perception in music. Dynamic attending theory first proposed synchronization of endogenous perceptual rhythms nearly 40 years ago, but only recently has the pivotal role of neural synchrony been demonstrated. Significant progress has since been made in understanding the role of neural oscillations and the neural structures that support synchronized responses to musical rhythm. Synchronized neural activity has been observed in auditory and motor networks, and has been linked with attentional allocation and movement coordination. Here we describe a neurodynamic model that shows how self-organization of oscillations in interacting sensory and motor networks could be responsible for the formation of the pulse percept in complex rhythms. In a pulse synchronization study, we test the model's key prediction that pulse can be perceived at a frequency for which no spectral energy is present in the amplitude envelope of the acoustic rhythm. The result shows that participants perceive the pulse at the theoretically predicted frequency. This model is one of the few consistent with neurophysiological evidence on the role of neural oscillation, and it explains a phenomenon that other computational models fail to explain. Because it is based on a canonical model, the predictions hold for an entire family of dynamical systems, not only a specific one. Thus, this model provides a theoretical link between oscillatory neurodynamics and the induction of pulse and meter in musical rhythm.
1. Perception of Pulse and Meter in Musical Rhythms
The sounds that humans use for communication are temporally structured sequences of events, such as musical notes or speech syllables. Rhythm refers to the pattern of timing and stress in the amplitude envelope of an acoustic sequence. Musical rhythms are usually perceived to have a pulse, or basic beat, in the approximate frequency range 0.5–4 Hz (London, 2004). Meter is a perceived temporal structure that includes the pulse frequency, as well as slower beat frequencies that accent pulse cycles (< 2 Hz), and higher beat frequencies that subdivide the pulse (4–8 Hz) (Lerdahl and Jackendoff, 1983; London, 2004). Neural resonance theory (Large and Snyder, 2009) hypothesizes that pulse and meter correspond to neural rhythms that synchronize with acoustic rhythms, influencing temporal expectancy, attention, and movement coordination. Theoretical approaches to understanding perceived structure have been based on neurodynamic models of neural oscillation (for a review, see Large, 2008). However, only recently has a clear picture begun to emerge regarding synchronization of neural oscillations and the neural structures that support responses to musical rhythm.
In this paper, we summarize current knowledge about the synchronization of neural rhythms to musical rhythms and outline a neurodynamic model of pulse perception based on entrainment of neural oscillation. First, in §2, we present a brief overview of the main theories and experimental findings related to musical pulse and meter. We discuss the potential function of neural oscillations in establishing the perceived temporal structure of complex musical rhythms. In §3, we sketch a neurodynamic model of pulse perception based on the interaction between oscillatory neural networks. The model incorporates the basic findings of the past 20 or so years and makes a key prediction about the formation of the pulse percept. In §4, we evaluate the fundamental prediction of the theory, that perceived temporal structures may correspond to frequencies that are not physically present in the amplitude envelope. Finally, in §5, we discuss remaining important questions regarding the link between pulse perception and neural resonance.
2. Neural Resonance to Musical Rhythms
Neural population rhythms—as observed in the local field potential (LFP), the electroencephalogram (EEG), and the magnetoencephalogram (MEG)—are cyclical fluctuations of baseline neuronal activity that can be observed in neocortical and thalamic regions of the brain (Steriade et al., 1993; Buzsáki and Draguhn, 2004; Slézia et al., 2011). Thalamocortical oscillations exhibit 1∕f frequency spectra with peaks in specific frequency bands, including delta (~ 1–4 Hz), theta (~ 4–8 Hz), beta (~ 13–30 Hz), and gamma (~ 30–70 Hz) (Buzsáki, 2006). The frequency range of musical pulse (London, 2004) corresponds nicely with the delta band, while faster metrical frequencies fall within the theta band, and slower metrical frequencies occupy the sub-delta range (Musacchia et al., 2014, see Figure 1). Jones originally hypothesized that endogenous perceptual rhythms synchronize with temporally structured sequences, generating expectancies for future events (Jones, 1976). Dynamic attending theory (DAT; Jones, 1976; Jones and Boltz, 1989; Large and Jones, 1999) addressed the issue of how neural rhythms may be exploited by an organism to enable attentional coordination with the dynamic external world (for a review, see Jones, 2008). The sensory-motor theory of rhythm perception (Todd, 1994, 1995; Todd and Lee, 2015) hypothesized that rhythm and pulse perception involve a sensory representation of the input as well as a motor representation of the body. A related theoretical framework that accounts for the predictive role of the motor system has recently been proposed (Schubotz, 2007). Converging lines of investigation have provided evidence supporting both theoretical perspectives, and suggest that interaction between sensory and motor regions of the brain may provide a mechanism for predicting sequence timing (Schroeder et al., 2010). In this paper, we further develop a neural resonance theory of rhythm perception, hypothesizing that oscillatory interactions between sensory and motor areas are sufficient to give rise to percepts of pulse and meter in complex musical rhythms (Large and Snyder, 2009).
Figure 1. (A) Piano score of Isaac Albeniz's Iberia II, Triana. (B) Annotation showing pulse (black dots), metrical structure (all dots), and phrasing (brackets). Pulse frequencies overlap with cortical oscillations in the delta range, while other metrical frequencies extend into theta and sub-delta ranges. Frequency relationships among metrical levels include harmonics, subharmonics, and other integer ratios. Adapted from Large (2014).
Behavioral evidence for dynamic attending in adults and infants comes from time discrimination and pitch discrimination studies that revealed an advantage for auditory sequences that induce a metric percept, using both sensitivity and reaction time measures (Yee et al., 1994; McAuley and Kidd, 1995; Jones and Yee, 1997; Large and Jones, 1999; Barnes and Jones, 2000; Jones and McAuley, 2005; Bergeson and Trehub, 2006; Jones et al., 2006). Such findings are consistent with early-developing predispositions for temporal regularity, and a preference for more stable temporal organizations, as predicted by nonlinear resonance (Large, 2008). In support of sensory-motor theory, a fundamental sound-movement interaction was demonstrated in infants and adults such that vestibular stimulation can influence whether an ambiguous pattern is perceived in duple or triple meter (Phillips-Silver and Trainor, 2005, 2007). The top-down influence of motor networks on sensory representations has been recently studied, showing how the former can “sharpen” the temporal selection of auditory information (Morillon et al., 2014, 2015). In addition, the finding of perceptual narrowing in infants demonstrated that meter perception is plastic and depends upon musical enculturation (Hannon and Trehub, 2005).
EEG and MEG studies have directly tested the role of endogenous oscillation. One EEG study revealed that fluctuations in induced beta- and gamma-band power synchronized with periodic and metrical rhythms, revealing both sensory-driven and anticipatory responses to tones (Snyder and Large, 2005; Figure 2A). A related MEG study also found sensory and anticipatory responses in the beta- and gamma-band (Fujioka et al., 2009). When subjects were instructed to impose a subjective meter on a periodic stimulus, subharmonic responses were observed in induced beta band MEG activity, closely resembling those produced by physical accents (Iversen et al., 2009). One EEG study reported synchronization of delta rhythms, phasic responses in theta, and augmented phase synchronization throughout the beta/gamma range, modulated by the stimulus periodicity (Will and Berg, 2007). Another study revealed entrainment of the delta rhythm to the onset of target tones, and reaction times that correlated with the phase of the delta band oscillation at target onset, directly supporting the role of neural oscillation in attending (Stefanics et al., 2010). The steady-state evoked potential (SS-EP) technique revealed that a periodic rhythm elicited a sustained response in the delta band, and meter imagery elicited an additional subharmonic resonance corresponding to the metric interpretation (Nozaradan et al., 2011; Figure 2B). Another study showed that complex rhythms elicited multiple SS-EPs in the EEG spectrum at frequencies corresponding to the rhythmic pattern envelope, and the amplitude of the SS-EPs at pulse and meter frequencies was selectively enhanced, suggesting a role for neural oscillations in pulse and meter induction (Nozaradan et al., 2012). Thus, delta, beta and gamma band responses to auditory rhythms observed in both in EEG and MEG in humans have confirmed predictions of dynamic attending theory. Given the role of beta in motor processing and long-range intra-cortical interaction, these findings are consistent with the idea that the motor system influences the perception of sound, even in the absence of overt movement.
Figure 2. (A) EEG revealed synchronized fluctuations in induced beta- and gamma-band power that anticipated tone onsets, were sensitive to intensity accents, and persisted when expected tones were omitted. Evoked activity occurred after tone onsets and were strongly diminished after tone omissions (Snyder and Large, 2005; reprinted with permission). (B) A periodic rhythm elicited a steady-state evoked potential (SS-EP) at the stimulus repetition frequency, and meter imagery elicited subharmonic resonances corresponding to the metric interpretation of this periodic rhythm (Nozaradan et al., 2011; reprinted with permission). (C) fMRI showed that listening to musical rhythms recruits both auditory and motor areas of the brain even in perception tasks without a motor component (Chen et al., 2008a; reprinted with permission). (D) Functional connectivity analysis revealed a cortico-subcortical network including the putamen, SMA, and PMC under conditions that may require internal generation of the pulse (Grahn and Rowe, 2009; reprinted with permission). (E) MEG revealed oscillatory interactions in a striato-thalamo-cortical network (Fujioka et al., 2012; reprinted with permission).
Functional magnetic resonance imaging (fMRI) studies of musical rhythm perception have clearly demonstrated that listening to musical rhythms recruits both auditory and motor areas of the brain (Chen et al., 2008a; Figure 2C). A functionally connected network is implicated in extracting higher-order features of a rhythm's temporal structure, with the dorsal premotor cortex mediating auditory-motor interactions, and functional coupling of auditory-motor networks observed even in perception tasks without a motor component (Chen et al., 2008b). Such studies have implicated the basal ganglia (Grahn and Rowe, 2009), cerebellum and supplementary motor area (Chen et al., 2008a; Bengtsson et al., 2009), as well as the dorsal premotor cortex and right frontal lobe (Bengtsson et al., 2009). Functional connectivity analysis implicates a cortico-subcortical network including the putamen, SMA, and PMC for the analysis of temporal sequences, especially under conditions that may require internal generation of the pulse (Grahn and Rowe, 2009; Figure 2D). Basal ganglia may play an important functional role in the formation of the pulse percept (Chapin et al., 2010; Kung et al., 2013) or pulse prediction (Grahn and Rowe, 2013). Although functional imaging cannot resolve rhythmic time scales, MEG source analysis has revealed beta band interactions in auditory and motor networks during musical rhythm processing, implying oscillatory interactions among auditory and motor cortices as well as the cerebellum, thalamus, and parahippocampal gyrus (Fujioka et al., 2012; Figure 2E). Synchronization of beta suppression with the auditory input (Fujioka et al., 2012) and the spatial overlap of the beta network with the striato-thalamo-cortical network implicated by functional imaging suggests that the mechanism of pulse and meter perception is fundamentally oscillatory. Moreover, in macaque monkeys, beta- and gamma-band oscillations measured using LFPs during synchronization-continuation tasks suggest differential roles in rhythmic tapping versus stimulus processing (Bartolo et al., 2014). Observations of beta oscillations in basal ganglia during synchronization and continuation further support their role in the striato-thalamocortical circuit during control of rhythm (Bartolo and Merchant, 2015; for a recent review, see Merchant et al., 2015a). Interaction between sensory and motor regions of the brain has been found not only when listening to music, but also in speech perception and production (Rauschecker and Scott, 2009).
A summary of this scenario is outlined in Figure 3. Interaction of excitatory and inhibitory neuronal populations gives rise to population rhythms throughout the brain (Brunel, 2003; Börgers and Kopell, 2003; Buzsáki and Draguhn, 2004; Stefanescu and Jirsa, 2008), including sensory and motor networks. When sensory stimuli are presented in a periodic pattern, ambient delta-band oscillations entrain to the structure of the stimulus stream (Will and Berg, 2007; Nozaradan et al., 2011, 2012). Fluctuations in beta- and gamma-band rhythms synchronize as well, consistent with an oscillatory hierarchy in auditory cortex (Lakatos et al., 2005). Neuronal entrainment emerges rapidly and facilitates behavioral responses (Stefanics et al., 2010). The perception of pulse and meter involves broadly distributed motor systems (Chen et al., 2008a; Grahn and Rowe, 2009, 2013; Kung et al., 2013) which respond with synchronized fluctuations in gamma- and beta-band amplitude (Fujioka et al., 2012), enabling coordination of perception and rhythmic movements with musical rhythms (Chen et al., 2008b; Nozaradan et al., 2013). Auditory-motor coupling is reciprocal (Phillips-Silver and Trainor, 2005, 2007), and connections within and between sensory and motor systems are assumed to be plastic (Hannon and Trehub, 2005).
Figure 3. Interaction of excitatory and inhibitory neuronal populations gives rise to population rhythms in sensory and motor networks. When sensory stimuli are presented in a periodic pattern, auditory cortical oscillations entrain to the structure of the stimulus stream. The perception of pulse and meter in music also involves broadly distributed motor systems. Auditory-motor coupling is reciprocal, as vestibular stimulation can influence auditory rhythm perception. Connections within and between sensory and motor systems are assumed to be plastic.
3. Oscillatory Network Interactions Could Give Rise to Musical Pulse Perception
A fundamental challenge to understanding a complex neural system such as the one just described lies in the integration of data from multiple modalities and multiple levels of observation into a coherent systems model of perception, attention and behavior. Musical rhythm provides a unique opportunity in this regard, because acoustic signals are thought to drive entrainment of neuronal population oscillations. Nonlinear oscillation in general, and neural oscillation in particular, have been widely studied in the mathematical and physical sciences over the past 50 years, putting us in a unique position to link levels of observation from spiking neurons to population dynamics and from population dynamics to perception and behavior, because sophisticated mathematical tools are available for understanding the behavior of such systems. Models of neuronal oscillations arising from the interaction of excitatory and inhibitory populations of neurons can be used to study mechanisms of neural oscillation (Whittington et al., 2000), with implications for many different aspects of neural information processing (Wilson and Cowan, 1973; Hoppensteadt and Izhikevich, 1999; Varela et al., 2001; Koepsell et al., 2010; Ainsworth et al., 2012). Models may be driven with external input (Brunel, 2003; Large et al., 2010), may include synaptic plasticity (Hoppensteadt and Izhikevich, 1996b; Brunel, 2003), and predict a surprisingly rich repertoire of behaviors, including steady states, Hopf bifurcations, double limit cycle bifurcations, bursting, and chaotic dynamics (Brunel, 2000; Stefanescu and Jirsa, 2008; Ledoux and Brunel, 2011).
Mathematical models can be useful in linking oscillation of high dimensional neuronal populations with lower dimensional population-level models that capture much of the behavioral richness observed in high dimensional systems (Wilson and Cowan, 1973; Stefanescu and Jirsa, 2008), and are amenable to theoretical and computational analysis (Aronson et al., 1990; Hoppensteadt and Izhikevich, 1996a). We have proposed one such population-level model (1; Large et al., 2010) to describe the dynamics of networks of neural oscillators with different natural frequencies responding to external stimulation:
Here, the variables xi and yi represent excitatory and inhibitory activities in the ith neuronal population, respectively. The nonlinear functions fi and gi describe the intrinsic (uncoupled) dynamics of the excitatory and inhibitory subpopulations, and pi and qi describe interactions between neuronal subpopulations with external input s(t). λ is a set of model parameters and τi is varied to create a frequency gradient as is found in many parts of the auditory system (Langner, 1992). ϵ is a small number that represents weak interaction (Hoppensteadt and Izhikevich, 1996a; Large et al., 2010).
This very general model implies a set of generic predictions about emergent neuronal oscillations under the influence of time-varying external input (e.g., musical rhythms) that hold under a broad set of assumptions (Hoppensteadt and Izhikevich, 1996a; Large et al., 2010). Equation (2) describes a canonical gradient frequency neural oscillator network derived from Equation (1) using normal form theory (Large et al., 2010), that has been used to model entrainment of perception, attention and behavior to rhythmic stimuli (Large, 2010).
Here the roman i denotes the imaginary unit and zi is the complex-valued state variable for the ith neural oscillator whose real and imaginary parts can be thought of as the activation of the excitatory and inhibitory subpopulations, respectively. The ith oscillator's natural frequency is given by fi = 1∕τi. The β's are nonlinear damping parameters, and the complete expansion of intrinsic terms describes a fully saturating nonlinearity (Murdock, 2003; Large, 2010). The parameter α controls the system's intrinsic behavior. α = 0 is the critical point, above which the system exhibits spontaneous oscillation and below which damped oscillation, through an Andorov-Hopf bifurcation, illustrated in Figure 4A. The interaction of the external signal with the intrinsic oscillatory dynamics makes the key predictions in this model.
Figure 4. (A) Intrinsic oscillatory dynamics can take the form of a Hopf bifurcation, where a value α = 0 is the critical point, between damped (left) and spontaneous (right) oscillation. (B) Mode locking in the canonical model. Within each resonance region (shaded), the canonical model mode-locks to input at the ratio shown in figure (c: coupling strength, f: oscillator's intrinsic frequency, f0: input frequency). Insets show the inputs and traces produced by a canonical model. (C) Architecture of a model that captures interacting oscillatory dynamics in sensory and motor networks. A rhythm is input to a sensory network; sensory and motor networks are reciprocally connected, providing input to one another.
Nonlinear stimulus coupling predicts mode-locked responses of neural oscillators to the stimulus. Mode-locking is a generalization of phase-locking in which a periodic stimulus interacts with an intrinsic oscillatory dynamics of a neuron or neural circuit, causing k cycles of an oscillation to lock to m cycles of the stimulus, where k and m are integers, as shown in Figure 4B (for phase-locking, k = m = 1). Mode-locking predicts neural responses at harmonics, subharmonics, integer ratios, and combination frequencies of those present in a given rhythmic stimulus. Mode-locking in the canonical model is captured using a full expansion of resonant monomials (Large et al., 2010) which is expressed as a “passive” nonlinear function of input from another oscillator, zj, multiplied by an “active” nonlinear function of the current state, zi:
We use two gradient frequency networks to model the functional coupling of auditory-motor networks observed in perception tasks without a motor component (Chen et al., 2008a; Velasco and Large, 2011). The sensory network is intended to capture auditory cortical entrainment, while the motor network is intended to capture the dynamics of a broadly distributed network including basal ganglia and cortical areas. As illustrated in Figure 4C, the sensory network takes a rhythmic input, sends output to a motor network, and the motor network send input back to the sensory network (for a detailed description, see Velasco and Large, 2011). Connections within and between networks are assumed to be plastic and tuned by musical enculturation (cf., Hannon and Trehub, 2005; Large, 2010; Velasco and Large, 2011).
Although the model makes only very general assumptions regarding underlying neural structures (e.g., Chen et al., 2008a), it makes strong commitments about the oscillatory dynamics of auditory-motor interactions (Will and Berg, 2007; Fujioka et al., 2012; Nozaradan et al., 2013). The sensory oscillators are tuned to operate near a Hopf bifurcation (Hoppensteadt and Izhikevich, 1997), as shown in Figure 4A; the motor oscillators are tuned to operate near a double limit cycle bifurcation (Izhikevich, 2000; Velasco and Large, 2011). The double limit cycle regime of the motor network means that the model can capture synchronization-continuation behavior, continuing to produce rhythmic behavior after the stimulus ceases (Wing and Kristofferson, 1973a,b). Here, however, we are interested in evaluating predictions about synchronization (Velasco and Large, 2011). To predict mean field time series as observed in EEG recordings (e.g., Will and Berg, 2007; Stefanics et al., 2010), we sum the output of all oscillators in each network (Figure 5, left). To predict steady state evoked potentials (SS-EPs, e.g., Berens and Velasco, 2009) we take a frequency analysis (DFT) of the mean field (Figure 5, right).
Figure 5. The network of Figure 4C was stimulated with (A) an isochronous rhythm and (B) a “missing pulse” rhythm. Output of the sensory network is in green and output of the motor network is in blue. The mean field time series (left) was obtained by summing the output of all the oscillators in the network over time. The SS-EP is obtained by Fourier analysis (DFT) of the mean field time series. DFT of the stimulus envelope is shown in black on the sensory SS-EP axis.
As shown in Figure 5A, for a periodic stimulus, both sensory and motor networks produce synchronized oscillations at the pulse frequency, and generate harmonics (Repp, 2008) and subharmonics (Vos, 1973; Bolton, 1894; Nozaradan et al., 2011). In the case of a complex rhythm, however, it becomes clear that the two networks are doing something quite different from one another. The mean field time trace for the sensory network represents the input rhythm rather faithfully, producing well-defined pulses at input event times. By contrast, the motor network entrains at the pulse frequency. The rhythm itself contains no energy at the pulse frequency (or its second subharmonic; DFT in Figure 5B, SS-EP, solid black), however, in the motor network the strongest response is found at the pulse frequency. In other words, the development of the pulse percept depends on the interaction of these two oscillatory systems.
This predicts that an oscillatory network interaction can lead to spontaneous pulse induction in complex rhythms—even in the most extreme case of a rhythm for which there is no energy at the pulse frequency. Thus, the theoretical prediction is that pulse may be perceived at a frequency that is not physically present in the rhythmic stimulus (Large, 2010; Velasco and Large, 2011). The prediction for an isochronous rhythm (Figure 5A) is consistent with observations of subharmonic resonance in EEG (Nozaradan et al., 2011), however, in that EEG study subharmonic resonance was observed only when subjects were instructed to imagine it. The behavior of the sensory network in response to the complex rhythm (Figure 5B) is consistent with the observation of the enhancement of metrical frequencies in the SS-EP (Nozaradan et al., 2012). However, the prediction of network interaction goes further in that the motor network produces metrical frequencies that are completely absent from the rhythm. This is a strong prediction that has never been tested.
This points up a critical issue in the interpretation of most of the empirical studies that have been conducted to date. The problem that arises in the theoretical interpretation of synchronized periodic movements or population-level neural recordings is that of linking observations of synchrony with intrinsic dynamics of emergent population oscillations (Whittington et al., 2000). EEG oscillations, for example, represent a remote consequence of periodic activity in the area of the brain between recording electrodes and a reference electrode. To generate a signal distinct from background noise, this activity must be manifest in a large enough proportion of neurons proximal to the electrode, implying a certain degree of synchronization among neurons. However, there are at least two distinct mechanisms by which neuronal synchrony can occur: it can be a consequence of an emergent population oscillation within the network of cells under the electrode—with its own intrinsic dynamics—or it could be merely a consequence of a common rhythmic input (Whittington et al., 2000). In neural resonance theory it is postulated that both are present, i.e., a common rhythmic input entrains an emergent population oscillation. In this case, interpretation of a synchronized neural responses is especially tricky, because a synchronized response itself does not imply interaction of the input with an intrinsic oscillatory dynamics. However, demonstration of a pulse percept at a frequency that is not physically present in a stimulus rhythm would directly implicate intrinsic oscillatory dynamics in the perception of pulse.
4. Does Rhythmic Input Interact with Intrinsic Dynamics to Form The Pulse Percept?
Thus, the fundamental prediction to be tested is whether people will perceive a pulse frequency that is not present in a rhythmic stimulus. This is a strong prediction, because if pulse perception could be experimentally observed at a missing frequency, it would rule out the interpretation of synchronization due merely to a common input frequency, and equivalently models of pulse induction based on linear resonance. On the other hand, if a predicted missing pulse were not observed, it would falsify the hypothesis that pulse induction occurs due interaction between a rhythmic stimulus and the intrinsic dynamics of endogenous oscillatory networks.
Pulse is operationally defined as the frequency at which a listener coordinates a periodic movement with a complex rhythm (Large, 2008). Thus, we asked participants to listen to eleven rhythms ranging from isochronous to highly complex, where the most complex rhythms contained no spectral energy at the pulse frequency predicted by the model, as shown in Figure 6. Participants were instructed to listen to each rhythm until they heard a steady pulse, and then tap along with the rhythm at that rate. The fourteen participants were primarily undergraduates from Florida Atlantic University, who received partial credit in fulfillment of undergraduate psychology courses, one participant was a volunteer, and none were professional musicians (8 males, 6 females; mean age 25.3; mean years of musical training: 3.2). All participants provided informed consent, following guidelines set by the Institutional Review Board of Florida Atlantic University.
Figure 6. Each stimulus pattern contained eight events distributed over eight pulse cycles. Complexity was varied by manipulating the number of events that fell in phase versus anti-phase with the intended pulse. At complexity level 0, all eight events were in-phase, at complexity level 1, one event was anti-phase, at complexity level 2, two events were anti-phase, and so on. Level 4 was the most complex, with four events in-phase and four events anti-phase. One isochronous control (level 0), two level 1 patterns, two level 2 patterns, two level 3 patterns, and four level 4 patterns were used, for a total of 11 rhythms.
Rhythms were presented at five levels of complexity (0–4) and at five different tempi (i.e., pulse frequencies): 2.28 Hz (420 ms), 2.17 Hz (460 ms), 2 Hz (500 ms), 1.85 Hz (540 ms), and 1.72 Hz (580 ms). The location of acoustic events on a metric grid was used to manipulate the level of rhythmic complexity (i.e., syncopation, see Figure 6). Counting quarter-note beats as “strong” and eighth-note subdivisions as “weak” (Lerdahl and Jackendoff, 1983), the level of syncopation was varied by manipulating the number of events that fell on weak beats, as illustrated in Figure 6. Weak beats (subdivisions) can also be thought of as existing in anti-phase relation to the in-phase strong beats. At complexity level 0, all eight events fell on strong beats, at complexity level 1, one event fell on a weak beat and seven events fell on strong beats, and so on. Level 4 was the greatest level of syncopation, with four events on weak beats and four events on strong beats. Each pattern consisted of 8 sounded events (262 Hz sine waves, 50 ms duration, 5-ms rise and fall times) laid out on an eight-beat metrical grid with binary subdivisions. Thus, the total duration of each pattern was sixteen (eighth-note) beats, equivalent to eight quarter-note beats or two four-beat measures. Each trial consisted of six pattern repetitions. Different combinations of tempo and rhythm were presented in a pseudorandom order such that consecutive trials always had different rhythms, and different tempos. Thus, participants were forced to find both the frequency and the phase of the pulse anew for each rhythm; they could not simply tap at the same tempo throughout the experiment.
We measured instantaneous tapping frequency distributions to determine whether subjects induced a pulse at the intended frequency. Instantaneous tapping frequency was computed as 1∕ITI (ITI = inter-tap interval in seconds) and tapping frequencies were normalized to a frequency of 2 Hz so they could be combined into a single distribution at each level of complexity. Spectral analysis (DFT) of the stimulus rhythms (Figure 7A, black) shows that at the hypothetical pulse frequency amplitude decreases with increasing complexity. At complexity level 4, the amplitude is precisely zero at 2 and 1 Hz for each rhythm. Normalized instantaneous tapping frequency (Figure 7A, red histogram) displays a main peak at the normalized pulse frequency of 2 Hz for all rhythms at all levels of complexity, with lesser peaks at 1 and 0.5 Hz, and for some rhythms, a diffuse peak around 4 Hz. Thus, the participants most often tapped the predicted pulse frequency even for the most complex rhythms, which had no spectral amplitude at that frequency. The distributions for the higher levels of complexity (levels 3 and 4) were significantly different from the distributions for the lower complexity levels [Kolmogorov-Smirnov, D(N = 101) = 0.24, p < 0.01]. The change appeared to be due to a decrease in the proportion of taps at 2 Hz and an increase in the proportion of (unsynchronized) taps between 1 and 2 Hz.
Figure 7. (A) Tapping frequencies were normalized to a 2 Hz (120 bpm) tempo to allow comparison between trials at different tempos. Tapping frequency distributions (red histograms) were computed by binning normalized instantaneous tapping frequencies from 0 to 5.00 Hz in bin widths of 0.05 Hz. Distributions were computed for each rhythm separately, including every tap interval across trials. Black lines show amplitude spectrum of the stimulus envelope for comparison. (B) Circular means of tap phases for each trial (blue circles) and grand mean for each complexity level (red line).
Next, we examined synchronization for each trial. The sequence of tap times was converted into a sequence of phases relative to the predicted pulse frequency, and the circular mean was computed for each trial (Batschelet, 1981; Figure 7B; blue circles). The grand mean was then computed for each complexity level (Figure 7B, red line). The length, r, of the mean vector is the synchronization coefficient, a measure of tapping variability (1 corresponds to no variability, and 0 corresponds to maximum variability), and the angle, ϕ, of the mean vector indicates the relationship to the hypothetical pulse, where positive values of ϕ indicate late tapping and negative values indicate early taps, on average (Batschelet, 1981).
At the lowest level of complexity, mean vectors clustered near the unit circle, indicating a high degree of synchronization, and near a phase of 0, indicating in-phase tapping. As complexity increased, a growing number of vectors clustered near zero, indicating lack of synchronization. The percentages of trials that were synchronized at r>0.5 (a highly conservative criterion Fisher, 1993) were 97, 96, 89, 60, and 55% for complexity levels 0–4, respectively. Thus, the more complex rhythms were more difficult to synchronize with, however, the majority of the time participants synchronized at the predicted pulse frequency, even for the rhythms that had no stimulus energy at that frequency. Finally, for the most complex rhythms (levels 3 and 4) a significant number of mean relative phases clustered near a relative phase of π, indicating anti-phase tapping. The proportion of anti-phase tapping changed across the five levels of syncopation [χ2(1, n = 565) = 64.59, p < 0.001], but was equally distributed across tempos, [χ2(1, n = 565) = 3.16, p>0.10]. Although the stimulus tempo range was relatively small, the fact that this analysis showed that the majority of trials were well phase locked to the pulse indicates subjects were not merely tapping near a preferred tempo (e.g., 2 Hz). If this had been the case, these trials would not have shown such strong phase locking.
As predicted by the model, participants synchronized—either in-phase or anti-phase—predominantly at the missing pulse frequency. This behavior is consistent with the prediction that formation of the pulse percept arises due to entrainment of emergent neuronal oscillations. It also rules out the potential alternatives that synchronization is merely a consequence of a common rhythmic input, or that the pulse percept may arise due to linear resonance. Theoretically speaking, it is critical to distinguish the role of a common stimulus frequency from the intrinsic dynamics of a emergent oscillation (Whittington et al., 2000), and the missing pulse rhythms used here enabled us to dissociate the two. Such rhythms may also be useful in studying the development of rhythm perception and synchronization (e.g., Kirschner and Tomasello, 2009), as well as the capabilities and limitations of “beat-deaf” individuals (e.g., Phillips-Silver et al., 2011) and non-human animals (e.g., Patel et al., 2009; Cook et al., 2013; Large and Gray, 2015), as it allows us to tease apart the stimulus from any potential contribution of endogenous neural rhythms. Additionally, more thorough analysis of synchronization behavior will enable us to fit model parameters and evaluate the detailed predictions of the neurodynamic model.
Recent findings using the steady-state evoked potential (SS-EP) approach provide converging evidence for the hypothesis of nonlinear resonance in pulse perception. In one study, meter imagery elicited subharmonic resonance at metric frequencies for a stimulus with no energy at the to-be-imagined subharmonics (Nozaradan et al., 2011). This study provided direct evidence of mode-locking, however, the subharmonic resonance was not spontaneous, it was instructed. Therefore, it does not directly address the issue of spontaneous pulse perception in complex rhythms. In another study, the amplitudes of the SS-EPs at pulse and meter frequencies were shown to be selectively enhanced in syncopated rhythms even though acoustic energy was not predominant at these frequencies (Nozaradan et al., 2012). While the result of that study matched predictions for the sensory network (Large, 2010; Velasco and Large, 2011) such a finding is not inconsistent with linear resonance (Oppenheim and Schafer, 1975), because no responses were found at frequencies that were absent from the stimulus. This study demonstrated perceived pulse at frequencies that were completely absent from the stimulus rhythms.
5. Discussion
The hypothesis of neural resonance to musical rhythms has been supported by a great deal of behavioral and electrophysiological evidence since Jones's original dynamic attending proposal nearly forty years ago. Not only has the framework stood up to empirical testing, but it has turned out to be compatible and complementary in interesting ways with sensory-motor theory, originally proposed by Todd as an alternative conception of musical rhythm perception. It is not surprising that sensory and motor rhythms interact, but what is somewhat unexpected is the possibility that oscillatory sensory-motor interaction itself could give rise to the percepts of pulse and meter. These data do not speak to how the sensory and motor systems interact; for that, additional work will be needed. Moreover, it remains to test the theory at the level of neural population oscillations, using EEG and MEG in humans, which is, in fact, the level of prediction at which the mathematical theory operates. Nevertheless, the fundamental prediction of the theory, that pulse should be perceived in rhythms with no energy in the amplitude envelope at the pulse frequency, is supported by behavioral evidence.
The phenomenon demonstrated empirically here, the perception of a “missing pulse,” is reminiscent in some ways of the “missing fundamental” phenomenon of pitch perception (for a review, see Plack and Oxenham, 2005). If the energy at the fundamental frequency is removed from the complex spectrum of a periodic sound, the perceived pitch remains unchanged, matching the pitch of a sinusoid with the frequency of the missing fundamental (Schouten, 1938; Licklider, 1956). Both phenomena involve the perception of a frequency that is “missing” from the stimulus. The mechanisms of pitch perception are still debated by theorists (Plack and Oxenham, 2005). However, the analogous effect may indicate the involvement of a similar mechanism, i.e., neural oscillation, albeit at different time scales. For example, a model based on the oscillations of cochlear nucleus chopper cells has been proposed to account for complex pitch perception (Meddis and O'Mard, 2006). It is important to note, however, that while these phenomena are similar, they are not the same. Perceived pitch generally corresponds to the fundamental of a pitched sound, whereas the perceived pulse of a musical rhythm is not its fundamental frequency.
A fundamental aspect of neural computation that our model does not explicitly take into account is the discharge rate and the temporal codes of action potentials produced by single cells in the central nervous system. It has been suggested, for example, that oscillatory signals themselves do not carry information, but instead all information is represented in the spiking activity of cells (see e.g., Shadlen and Movshon, 1999). Recent neurophysiological experiments in monkeys have revealed that neurons in SMA dynamically encode duration and serial order elements of the produced intervals in a synchronization-continuation task (Merchant et al., 2013; Crowe et al., 2014; Merchant et al., 2015b). However, oscillations and spikes interact in important ways, and efforts are currently being made to understand how spike discharge information coordinates with local and global neural oscillatory activity (Kayser et al., 2009; Buzsáki et al., 2012; Musacchia et al., 2014). For example, the active sensing framework of Schroeder and colleagues has clear parallels with neural resonance theory that are beginning to be recognized and exploited (Henry and Herrmann, 2014; Musacchia et al., 2014). This approach describes rhythmic attention in terms of excitation-inhibition cycles tied to slow neural oscillations that can be entrained by rhythmic stimulation (Lakatos et al., 2005, 2008; Schroeder et al., 2010). Entrainment of cortical rhythms is thought to engage neurodynamic mechanisms of temporal prediction to segregate incoming information and organize spike timing (Musacchia et al., 2014). Experiments are investigating concepts of attending dynamics at the neuronal level and, in combination with the emerging study of behavioral synchronization in nonhuman animals (Patel et al., 2009; Zarco et al., 2009; Honing et al., 2012; Cook et al., 2013; Large and Gray, 2015), promises insights that are capable of linking neuronal dynamics with perception, attention and behavior.
Recently, Patel and Iversen have claimed that entrainment of neural oscillation to external rhythms is not sufficient to explain beat perception (Patel and Iversen, 2014). In their view, neural oscillation cannot account for the fact that: “pure perception of a musical beat (i.e., listening in the absence of overt movement) strongly engages the motor system,” and they suggest that “beat perception involves more than the passive entrainment of neural responses to sound.” They propose the action simulation for auditory prediction (ASAP) hypothesis as an alternative. However, these objections represent misconceptions regarding the fundamental nature of neural oscillation and the specific predictions of neural resonance theory. Here, we have provided a specific model of pulse and meter perception based on oscillatory interactions between auditory and motor systems, and we have tested the prediction that pulse perception is based on entrainment of active, endogenous neural oscillations. Our result conclusively rules out passive synchronization of neural activity with external rhythms. While the ASAP hypothesis makes predictions regarding the specific neural structures involved in the perception of pulse and meter, it makes no predictions regarding the nature of the computations that take place in these regions. By contrast, the neural resonance approach makes specific predictions about the nature of the neurodynamic processes involved in musical rhythm perception (see Large, 2008; Large and Snyder, 2009) and it explains a phenomenon that other models fail to explain. It does not make strong commitments to specific brain regions, however, and more empirical data will be necessary to understand how the various neural structures involved contribute to observed neurodynamics. Nevertheless, the theory is based on a canonical model of neural oscillation, thus the predictions hold for an entire family of dynamical systems, suggesting that it will scale as we learn more about the dynamical interactions in the striato-thalamo-cortical network underlying rhythm perception. Neural resonance theory is also consistent with Hebbian theory (Large, 2010), suggesting that it will be able to account for findings regarding plasticity and perceptual narrowing in rhythmic enculturation (e.g., Hannon and Trehub, 2005).
Additionally, Patel and Iversen have identified neural resonance theory with Darwin's idea that “The perception …of musical …rhythm is probably common to all animals, and no doubt depends on the common physiological nature of their nervous systems” (Patel and Iversen, 2014). They argue against neural resonance based on their interpretation that “[rhesus monkeys] do not perceive a beat in rhythmic auditory patterns …[and] could not learn to tap in synchrony with an auditory metronome.” However, their fundamental argument appears to be based on the assumption that if neural resonance underlies pulse perception in humans, similar networks must be present in all animals. Moreover, Patel (2014) and Fitch (2012) reason that if that is true, then all animals should be able to synchronize body movements to rhythms. As we have argued elsewhere (Large and Gray, 2015), species differences in perception-action coordination do not speak to the question of whether or not the fundamental neural processes are oscillatory. Species differences could be accommodated within a neural resonance framework, for example, simply in terms of the differences in auditory-motor coupling (see e.g., Merchant and Honing, 2013). Furthermore, to the extent that ASAP is to be identified with Patel's “vocal learning hypothesis,” recent observations of entrainment to musical rhythms in a sea lion (Cook et al., 2013), and entrainment to simpler rhythms in a chimpanzee (Hattori et al., 2013) and a bonobo (Large and Gray, 2015) are providing mounting counter evidence.
An understanding of the relationship between auditory neurophysiology, auditory population dynamics and auditory perception is an elusive goal, due to the intricate circuitry, the many structural levels involved, and the highly nonlinear nature of the neural responses. Significant theoretical advances will be necessary to understand signal processing, pattern formation, and plasticity in this complex and highly nonlinear system. Our approach draws upon the well-established theory of nonlinear dynamical systems to build novel and increasingly comprehensive models, and make predictions for physiological and behavioral experiments. This may allow us to directly link, for the first time, complex neuronal dynamics in animal models, neural population dynamics observed in human EEG and MEG, and perceptual dynamics assessed behaviorally, within an overarching theoretical framework that describes system dynamics at multiple levels of observation.
6. Funding
This work was supported in part by the grant AFOSR, 2012-16, “Signal Processing, Plasticity and Pattern Formation in Networks of Neural Oscillators,” FA9550-12-10388.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
Ainsworth, M., Lee, S., Cunningham, M. O., Traub, R. D., Kopell, N. J., and Whittington, M. A. (2012). Rates and rhythms: a synergistic view of frequency and temporal coding in neuronal networks. Neuron 75, 583. doi: 10.1016/j.neuron.2012.08.004
Aronson, D. G., Ermentrout, G. B., and Kopell, N. (1990). Amplitude response of coupled oscillators. Physica D 41, 449. doi: 10.1016/0167-2789(90)90007-C
Barnes, R., and Jones, M. R. (2000). Expectancy, attention, and time. Cogn. Psychol. 41, 254–311. doi: 10.1006/cogp.2000.0738
Bartolo, R., and Merchant, H. (2015). β oscillations are linked to the initiation of sensory-cued movement sequences and the internal guidance of regular tapping in the monkey. J. Neurosci. 35, 4635–4640. doi: 10.1523/JNEUROSCI.4570-14.2015
Bartolo, R., Prado, L., and Merchant, H. (2014). Information processing in the primate basal ganglia during sensory-guided and internally driven rhythmic tapping. J. Neurosci. 34, 3910–3923. doi: 10.1523/JNEUROSCI.2679-13.2014
Bengtsson, S. L., Ullén, F., Ehrsson, H. H., Hashimoto, T., Kito, T., Naito, E., et al. (2009). Listening to rhythms activates motor and premotor cortices. Cortex 45, 62–71. doi: 10.1016/j.cortex.2008.07.002
Berens, P., and Velasco, M. J. (2009). The Circular Statistics Toolbox for Matlab. Research Group Bethge, Biologische Kybernetik, Max-Planck-Gesellschaft, Max Planck Institute for Biological Cybernetics, Tübingen. Report No: 184. Available online at: http://www.kyb.tuebingen.mpg.de/fileadmin/user_upload/files/publications/MPIK-TR-184_[0].pdf
Bergeson, T. R., and Trehub, S. E. (2006). Infants' perception of rhythmic patterns. Music Percept. 23, 345–360. doi: 10.1525/mp.2006.23.4.345
Börgers, C., and Kopell, N. (2003). Synchronization in networks of excitatory and inhibitory neurons with sparse, random connectivity. Neural Comput. 15, 509–539. doi: 10.1162/089976603321192059
Brunel, N. (2000). Dynamics of sparsely connected networks of excitatory and inhibitory spiking neurons. J. Computat. Neurosci. 8, 183–208. doi: 10.1023/A:1008925309027
Brunel, N. (2003). Dynamics and plasticity of stimulus-selective persistent activity in cortical network models. Cereb. Cortex 13, 1151–1161. doi: 10.1093/cercor/bhg096
Buzsáki, G., Anastassiou, C. A., and Koch, C. (2012). The origin of extracellular fields and currentseeg, ecog, lfp and spikes. Nat. Rev. Neurosci. 13, 407–420. doi: 10.1038/nrn3241
Buzsáki, G., and Draguhn, A. (2004). Neuronal oscillations in cortical networks. Science 304, 1926–1929. doi: 10.1126/science.1099745
Chapin, H. L., Zanto, T., Jantzen, K. J., Kelso, S. J. A., Steinberg, F., and Large, E. W. (2010). Neural responses to complex auditory rhythms: the role of attending. Front. Psychol. 1:224. doi: 10.3389/fpsyg.2010.00224
Chen, J. L., Penhune, V. B., and Zatorre, R. J. (2008a). Listening to musical rhythms recruits motor regions of the brain. Cereb. Cortex 18, 2844–2854. doi: 10.1093/cercor/bhn042
Chen, J. L., Penhune, V. B., and Zatorre, R. J. (2008b). Moving on time: brain network for auditory-motor synchronization is modulated by rhythm lexity and musical training. J. Cogn. Neurosci. 20, 226–239. doi: 10.1162/jocn.2008.20018
Cook, P., Rouse, A., Wilson, M., and Reichmuth, C. (2013). A California sea lion (Zalophus californianus) can keep the beat: motor entrainment to rhythmic auditory stimuli in a non vocal mimic. J. Comp. Psychol. 127, 412–427. doi: 10.1037/a0032345
Crowe, D. A., Zarco, W., Bartolo, R., and Merchant, H. (2014). Dynamic representation of the temporal and sequential structure of rhythmic movements in the primate medial premotor cortex. J. Neurosci. 34, 11972–11983. doi: 10.1523/JNEUROSCI.2177-14.2014
Fitch, W. T. (2012). “The biology and evolution of rhythm: unravelling a paradox,” in Language and Music as Cognitive Systems, eds P. Rebuschat, M. Rohmeier, J. A., Hawkins, and I. Cross (Oxford, UK: Oxford University Press), 73–95.
Fujioka, T., Trainor, L. J., Large, E. W., and Ross, B. (2009). Beta and gamma rhythms in human auditory cortex during musical beat processing. Ann. N.Y. Acad. Sci. 1169, 89–92. doi: 10.1111/j.1749-6632.2009.04779.x
Fujioka, T., Trainor, L. J., Large, E. W., and Ross, B. (2012). Internalized timing of isochronous sounds is represented in neuromagnetic beta oscillations. J. Neurosci. 32, 1791–1802. doi: 10.1523/JNEUROSCI.4107-11.2012
Grahn, J. A., and Rowe, J. B. (2009). Feeling the beat: premotor and striatal interactions in musicians and non-musicians during beat processing. J. Neurosci. 29, 7540–7548. doi: 10.1523/JNEUROSCI.2018-08.2009
Grahn, J. A., and Rowe, J. B. (2013). Finding and feeling the musical beat: striatal dissociations between detection and prediction of regularity. Cereb. Cortex 23, 913–921. doi: 10.1093/cercor/bhs083
Hannon, E. E., and Trehub, S. E. (2005). Tuning in to musical rhythms: infants learn more readily than adults. Proc. Natl. Acad. Sci. U.S.A. 102, 12289–12290. doi: 10.1073/pnas.0504254102
Hattori, Y., Tomonaga, M., and Matsuzawa, T. (2013). Spontaneous synchronized tapping to an auditory rhythm in a chimpanzee. Sci. Rep. 3:1566. doi: 10.1038/srep01566
Henry, M. J., and Herrmann, B. (2014). Low-Frequency Neural Oscillations Support Dynamic Attending in Temporal Context. Timing Time Percept. 2, 62–86. doi: 10.1163/22134468-00002011
Honing, H., Merchant, H., Háden, G., Prado, L., and Bartolo, R. (2012). “Probing beat induction in Rhesus monkeys: is beat induction species-specific,” in Proceedings of the International Conference on Music Perception and Cognition (Thessaloniki), 454–455.
Hoppensteadt, F., and Izhikevich, E. M. (1997). Weakly Connected Neural Networks. Applied Mathematical Sciences. New York, NY: Springer.
Hoppensteadt, F. C., and Izhikevich, E. M. (1996a). Synaptic organizations and dynamical properties of weakly connected neural oscillators I: analysis canonical model. Biol. Cybern. 75, 117–127. doi: 10.1007/s004220050279
Hoppensteadt, F. C., and Izhikevich, E. M. (1996b). Synaptic organizations and dynamical properties of weakly connected neural oscillators II: learning e information. Biol. Cybern. 75, 126–135. doi: 10.1007/s004220050280
Hoppensteadt, F. C., and Izhikevich, E. M. (1999). Oscillatory neurocomputers with dynamic connectivity. Phys. Rev. Lett. 82, 2983–2986. doi: 10.1103/PhysRevLett.82.2983
Iversen, J. R., Repp, B. H., and Patel, A. D. (2009). Top-down control of rhythm perception modulates early auditory responses. Ann. N.Y. Acad. Sci. 1169, 58–73. doi: 10.1111/j.1749-6632.2009.04579.x
Izhikevich, E. M. (2000). Neural excitability, spiking and bursting. Int. J. Bifurcat. Chaos. 10, 1171–1266. doi: 10.1142/s0218127400000840
Jones, M. R. (1976). Time, our lost dimension: toward a new theory of perception, attention, and memory. Psychol. Rev. 83, 323–335. doi: 10.1037/0033-295X.83.5.323
Jones, M. R. (2008). “Musical time,” in Oxford Handbook of Music Psychology, eds S. Hallam, I. Cross, and M. Thaut (Oxford: Oxford University), 81–92.
Jones, M. R., and Boltz, M. (1989). Dynamic attending and responses to time. Psychol. Rev. 96, 459–491. doi: 10.1037/0033-295X.96.3.459
Jones, M. R., Johnston, H. M., and Puente, J. (2006). Effects of auditory pattern structure on anticipatory and reactive attending. Cogn. Psychol. 53, 59–96. doi: 10.1016/j.cogpsych.2006.01.003
Jones, M. R., and McAuley, J. D. (2005). Time judgments in global temporal contexts. Percept. Psychophys. 67, 398–417. doi: 10.3758/BF03193320
Jones, M. R., and Yee, W. (1997). Sensitivity to time change: the role of context and skill. J. Exp. Psychol. Hum. Percept. Perform. 23, 693–709. doi: 10.1037/0096-1523.23.3.693
Kayser, C., Montemurro, M. A., Logothetis, N. K., and Panzeri, S. (2009). Spike-phase coding boosts and stabilizes information carried by spatial and temporal spike patterns. Neuron 61, 597–608. doi: 10.1016/j.neuron.2009.01.008
Kirschner, S., and Tomasello, M. (2009). Joint drumming: Social context facilitates synchronization in preschool children. J. Exp. Psychol. 102, 299–314. doi: 10.1016/j.jecp.2008.07.005
Koepsell, K., Wang, X., Hirsch, J. A., and Sommer, F. T. (2010). Exploring the function of neural oscillations in early sensory systems. Front. Neurosci. 4:53. doi: 10.3389/neuro.01.010.2010
Kung, S.-J., Chen, J. L., Zatorre, R. J., and Penhune, V. B. (2013). Interacting cortical and basal ganglia networks underlying finding and tapping to the musical beat. J. Cogn. Neurosci. 25, 401–420. doi: 10.1162/jocn_a_00325
Lakatos, P., Karmos, G., Mehta, A. D., Ulbert, I., and Schroeder, C. E. (2008). Entrainment of neuronal oscillations as a mechanism of attentional selection. Science 320, 110–113. doi: 10.1126/science.1154735
Lakatos, P., Shah, A. S., Knuth, K. H., Ulbert, I., Karmos, G., and Schroeder, C. E. (2005). An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex. J. Neurophysiol. 94, 1904–1911. doi: 10.1152/jn.00263.2005
Langner, G. (1992). Periodicity coding in the auditory system. Hear. Res. 60, 115–142. doi: 10.1016/0378-5955(92)90015-F
Large, E. W. (2008). “Resonating to musical rhythm: theory and experiment,” in The Psychology of Time, ed S. Grondin (Cambridge: Emerald), 189–231.
Large, E. W. (2010). “Neurodynamics of music,” in Springer Handbook of Auditory Research: Music Perception, eds R. J. Mari and F. R. Richard, and P. N. Arthur (New York, NY: Springer), 201–231.
Large, E. W. (2014). “Rhythm perception: pulse and meter,” in Encyclopedia of Computational Neuroscience, eds D. Jaeger and R. Jung (New York, NY: Springer New York), 1–6. doi: 10.1007/978-1-4614-7320-6_106-2
Large, E. W., Almonte, F., and Velasco, M. (2010). A canonical model for gradient frequency neural networks. Physica D 239, 905–911. doi: 10.1016/j.physd.2009.11.015
Large, E. W., and Gray, P. (2015). Spontaneous tempo and entrainment in a bonobo (pan paniscus). J. Comp. Psychol. doi: 10.1037/com0000011. [Epub ahead of print].
Large, E. W., and Jones, M. R. (1999). The dynamics of attending: how people track time varying events. Psychol. Rev. 106, 119–159. doi: 10.1037/0033-295X.106.1.119
Large, E. W., and Snyder, J. S. (2009). Pulse and meter as neural resonance. Ann. N.Y. Acad. Sci. 1169, 46–57. doi: 10.1111/j.1749-6632.2009.04550.x
Ledoux, E., and Brunel, N. (2011). Dynamics of networks of excitatory and inhibitory neurons in response to time-dependent inputs. Front. Comput. Neurosci. 5:25. doi: 10.3389/fncom.2011.00025
Lerdahl, F., and Jackendoff, R. S. (1983). A Generative Theory of Tonal Music. Cambridge: MIT Press.
Licklider, J. (1956). “Auditory frequency analysis,” in Information Theory, ed C. Cherry (New York, NY: Academic Press), 253–268.
London, J. M. (2004). Hearing in Time: Psychological Aspects of Musical Meter. New York, NY: Oxford University Press.
McAuley, J. D., and Kidd, G. R. (1995). Temporally directed attending in the discrimination of tempo: further evidence for an entrainment model. J. Acoust. Soc. Am. 97, 3278. doi: 10.1121/1.411574
Meddis, R., and O'Mard, L. (2006). Virtual pitch in a computational physiological model. J. Acoust. Soc. Am. 120, 3861–3896. doi: 10.1121/1.2372595
Merchant, H., Grahn, J., Trainor, L., Rohrmeier, M., and Fitch, W. T. (2015a). Finding the beat: a neural perspective across humans and non-human primates. Philos. Trans. R. Soc. B 370:20140093. doi: 10.1098/rstb.2014.0093
Merchant, H., and Honing, H. (2013). Are non-human primates capable of rhythmic entrainment? Evidence for the gradual audiomotor evolution hypothesis. Front. Neurosci. 7:274. doi: 10.3389/fnins.2013.00274
Merchant, H., Pérez, O., Bartolo, R., Méndez, J. C., Mendoza, G., Gámez, J., et al. (2015b). Sensorimotor neural dynamics during isochronous tapping in the medial premotor cortex of the macaque. Eur. J. Neurosci. 41, 586–602. doi: 10.1111/ejn.12811
Merchant, H., Pérez, O., Zarco, W., and Gámez, J. (2013). Interval tuning in the primate medial premotor cortex as a general timing mechanism. J. Neurosci. 33, 9082–9096. doi: 10.1523/JNEUROSCI.5513-12.2013
Morillon, B., Hackett, T. A., Kajikawa, Y., and Schroeder, C. E. (2015). Predictive motor control of sensory dynamics in auditory active sensing. Curr. Opin. Neurobiol. 31, 230–238. doi: 10.1016/j.conb.2014.12.005
Morillon, B., Schroeder, C. E., and Wyart, V. (2014). Motor contributions to the temporal precision of auditory attention. Nat. Commun. 5:5255. doi: 10.1038/ncomms6255
Murdock, J. (2003). Normal Forms and Unfoldings for Local Dynamical Systems. Springer Monographs in Mathematics. New York, NY: Springer.
Musacchia, G., Large, E. W., and Schroeder, C. E. (2014). Thalamocortical mechanisms for integrating musical tone and rhythm. Hear. Res. 308, 50–59. doi: 10.1016/j.heares.2013.09.017
Nozaradan, S., Chemin, B., and Mouraux, A. (2013). From sounds to movement and back: how movement shapes internal representation of musical rhythms. Front. Hum. Neurosci. doi: 10.3389/conf.fnhum.2013.214.00044. Available online at: http://www.frontiersin.org/10.3389/conf.fnhum.2013.214.00044/event_abstract
Nozaradan, S., Peretz, I., Missal, M., and Mouraux, A. (2011). Tagging the neuronal entrainment to beat and meter. J. Neurosci. 31, 10234–10240. doi: 10.1523/JNEUROSCI.0411-11.2011
Nozaradan, S., Peretz, I., and Mouraux, A. (2012). Selective neuronal entrainment to the beat and meter embedded in a musical rhythm. J. Neurosci. 32, 17572–17581. doi: 10.1523/JNEUROSCI.3203-12.2012
Oppenheim, A. V., and Schafer, R. W. (1975). Digital Signal Processing. Englewood Cliffs, NJ: Prentice Hall.
Patel, A. D. (2014). The evolutionary biology of musical rhythm: Was darwin wrong? PLoS Biol. 12:e1001821. doi: 10.1371/journal.pbio.1001821
Patel, A. D., and Iversen, J. R. (2014). The evolutionary neuroscience of musical beat perception: the action simulation for auditory prediction (asap) hypothesis. Front. Syst. Neurosci 8:57. doi: 10.3389/fnsys.2014.00057
Patel, A. D., Iversen, J. R., Bregman, M. R., and Schulz, I. (2009). Experimental evidence for synchronization to a musical beat in a nonhuman animal. Curr. Biol. 19, 830. doi: 10.1016/j.cub.2009.03.038
Phillips-Silver, J., Toiviainen, P., Gosselin, N., Piché, O., Nozaradan, S., Palmer, C., et al. (2011). Born to dance but beat deaf: a new form of congenital amusia. Neuropsychologia 49, 961–969. doi: 10.1016/j.neuropsychologia.2011.02.002
Phillips-Silver, J., and Trainor, L. J. (2005). Feeling the beat: movement influences infant rhythm perception. Science 308, 1430–1430. doi: 10.1126/science.1110922
Phillips-Silver, J., and Trainor, L. J. (2007). Hearing what the body feels: auditory encoding of rhythmic movement. Cognition 105, 546. doi: 10.1016/j.cognition.2006.11.006
Plack, C., and Oxenham, A. (2005). “The psychophysics of pitch,” in Pitch: Neural Coding and Perception, eds C. Plack, R. Fay, A. Oxenham, and A. Popper (New York, NY: Springer), 7–55.
Rauschecker, J. P., and Scott, S. K. (2009). Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat. Neurosci. 12, 718–724. doi: 10.1038/nn.2331
Repp, B. H. (2008). Multiple temporal references in sensorimotor synchronization with metrical auditory sequences. Psychol. Res. 72, 79–98. doi: 10.1007/s00426-006-0067-1
Schouten, J. (1938). “The perception of subjective tones,” in Proceedings of the Koninklijke Nederlandese Akademie van Wetenschapen, Vol. 41 (Eindhoven), 1086–1093.
Schroeder, C. E., Wilson, D. A., Radman, T., Scharfman, H., and Lakatos, P. (2010). Dynamics of active sensing and perceptual selection. Curr. Opin. Neurobiol. 20, 172–176. doi: 10.1016/j.conb.2010.02.010
Schubotz, R. I. (2007). Prediction of external events with our motor system: towards a new framework. Trends Cogn. Sci. 11, 211–218. doi: 10.1016/j.tics.2007.02.006
Shadlen, M. N., and Movshon, J. A. (1999). Synchrony unbound: a critical evaluation of the temporal binding hypothesis. Neuron 24, 67–77. doi: 10.1016/S0896-6273(00)80822-3
Slézia, A., Hangya, B., Ulbert, I., and Acsády, L. (2011). Phase advancement and nucleus-specific timing of thalamocortical activity during slow cortical oscillation. J. Neurosci. 31, 607–617. doi: 10.1523/JNEUROSCI.3375-10.2011
Snyder, J. S., and Large, E. W. (2005). Gamma-band activity reflects the metric structure of rhythmic tone sequences. Cogn. Brain Res. 24, 117–126. doi: 10.1016/j.cogbrainres.2004.12.014
Stefanescu, R. A., and Jirsa, V. K. (2008). A low dimensional description of globally coupled heterogeneous neural networks of excitatory and inhibitory neurons. PLoS Comput. Biol. 4:e1000219. doi: 10.1371/journal.pcbi.1000219
Stefanics, G., Hangya, B., Hernádi, I., Winkler, I., Lakatos, P., and Ulbert, I. (2010). Phase entrainment of human delta oscillations can mediate the effects of expectation on reaction speed. J. Neurosci. 30, 13578–13585. doi: 10.1523/JNEUROSCI.0703-10.2010
Steriade, M., McCormick, D. A., and Sejnowski, T. J. (1993). Thalamocortical oscillations in the sleeping and aroused brain. Science 262, 685. doi: 10.1126/science.8235588
Todd, N. P., and Lee, C. S. (2015). The sensory-motor theory of rhythm and beat induction 20 years on: a new synthesis and future perspectives. Front. Hum. Neurosci. 9:444. doi: 10.3389/fnhum.2015.00444
Todd, N. P. M. (1994). The auditory primal sketch: a multi-scale model of rhythmic grouping. J. New Music Res. 23, 25–69. doi: 10.1080/09298219408570647
Todd, N. P. M. (1995). The kinematics of musical expression. J. Acoust. Soc. Am. 97, 1940–1949. doi: 10.1121/1.412067
Varela, F., Lachaux, J. P., Rodriguez, E., and Martinerie, J. (2001). The brainweb: phase synchronization and large-scale integration. Nat. Rev. Neurosci. 2, 229–239. doi: 10.1038/35067550
Velasco, M. J., and Large, E. W. (2011). “Pulse detection in syncopating rhythms using neural oscillators,” in Proceedings of the 12th Annual Conference of the International Society for Music Information Retrieval (Miami, FL).
Whittington, M. A., Traub, R. D., Kopell, N., Ermentrout, G. B., and Buhl, E. H. (2000). Inhibition-based rhythms: experimental and mathematical observations on network dynamics. Int. J. Psychophysiol. 38, 315–336.
Will, U., and Berg, E. (2007). Brain wave synchronization and entrainment to periodic acoustic stimuli. Neurosci. Lett. 424, 55–60. doi: 10.1016/j.neulet.2007.07.036
Wilson, H. R., and Cowan, J. D. (1973). A mathematical theory of the functional dynamics of cortical and thalamic nervous tissue. Kybernetik 13, 5–80. doi: 10.1007/BF00288786
Wing, A. M., and Kristofferson, A. B. (1973a). Response delays and the timing of discrete motor responses. Percept. Psychophys. 14, 5–12. doi: 10.3758/BF03198607
Wing, A. M., and Kristofferson, A. B. (1973b). The timing of interresponse intervals. Percept. Psychophys. 13, 455–460. doi: 10.3758/BF03205802
Yee, W., Holleran, S., and Jones, M. R. (1994). Sensitivity to event timing in regular and irregular sequences: Influences of musical skill. Percept. Psychophys. 56, 461–471. doi: 10.3758/BF03206737
Keywords: beat perception, neural resonance, musical rhythm, dynamical systems, neural networks
Citation: Large EW, Herrera JA and Velasco MJ (2015) Neural Networks for Beat Perception in Musical Rhythm. Front. Syst. Neurosci. 9:159. doi: 10.3389/fnsys.2015.00159
Received: 02 June 2015; Accepted: 02 November 2015;
Published: 25 November 2015.
Edited by:
Jonathan B. Fritz, University of Maryland, USAReviewed by:
Preston E. Garraghty, Indiana University, USAHugo Merchant, Universidad Nacional Autónoma de México, Mexico
James W. Grau, Texas A&M University, USA
Copyright © 2015 Large, Herrera and Velasco. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Edward W. Large, ZWR3YXJkLmxhcmdlQHVjb25uLmVkdQ==