- 1Neurophysics Group, Department of Neurology, Campus Benjamin Franklin, Charité—Universitätsmedizin Berlin, Berlin, Germany
- 2Department of Psychiatry and Psychotherapy, St. Hedwig Hospital, Charité—Universitätsmedizin Berlin, Berlin, Germany
- 3Department of Neurology, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
- 4Center for Cognition and Decision Making, National Research University Higher School of Economics, Moscow, Russia
- 5Research Group “MEG and Cortical Networks”, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
- 6Department of Psychology, Whitehead Building, Goldsmiths, University of London, London, United Kingdom
Music performance relies on the ability to learn and execute actions and their associated sounds. The process of learning these auditory-motor contingencies depends on the proper encoding of the serial order of the actions and sounds. Among the different serial positions of a behavioral sequence, the first and last (boundary) elements are particularly relevant. Animal and patient studies have demonstrated a specific neural representation for boundary elements in prefrontal cortical regions and in the basal ganglia, highlighting the relevance of their proper encoding. The neural mechanisms underlying the encoding of sequence boundaries in the general human population remain, however, largely unknown. In this study, we examined how alterations of auditory feedback, introduced at different ordinal positions (boundary or within-sequence element), affect the neural and behavioral responses during sensorimotor sequence learning. Analysing the neuromagnetic signals from 20 participants while they performed short piano sequences under the occasional effect of altered feedback (AF), we found that at around 150–200 ms post-keystroke, the neural activities in the dorsolateral prefrontal cortex (DLPFC) and supplementary motor area (SMA) were dissociated for boundary and within-sequence elements. Furthermore, the behavioral data demonstrated that feedback alterations on boundaries led to greater performance costs, such as more errors in the subsequent keystrokes. These findings jointly support the idea that the proper encoding of boundaries is critical in acquiring sensorimotor sequences. They also provide evidence for the involvement of a distinct neural circuitry in humans including prefrontal and higher-order motor areas during the encoding of the different classes of serial order.
Introduction
A broad spectrum of daily tasks, such as preparing a meal or washing hands, requires the learning and production of sequential movements. These processes also support more complex forms of sensorimotor behavior, such as speech or music performance, which additionally require the processing of auditory feedback to control the production of motor output. The unique demands that music performance (including singing) poses on the underlying neural circuitry—namely, higher precision in temporal (i.e., rhythm) and spectral properties (i.e., pitch) than speech—make it a useful model for investigating the neural mechanisms at the base of sensorimotor sequence learning (Natke et al., 2003; Zatorre et al., 2007; Herrojo Ruiz et al., 2009, 2011; Patel, 2011; Zatorre and Baum, 2012).
Sequence learning requires organizing single actions in a specific temporal serial order to build a larger action unit. A large body of evidence suggests the involvement of frontoparietal, basal ganglia, and cerebellar circuits during sequence learning (Mushiake and Strick, 1995; Hikosaka et al., 1999; Averbeck et al., 2002; Fujii and Graybiel, 2003; Kao et al., 2005; Lehéricy et al., 2005; Penhune and Doyon, 2005; Ölveczky et al., 2011; Wymbs et al., 2012). Also, a neural representation of serial order coding has been reported in primate frontal areas—the prefrontal cortex (PFC), supplementary motor area (SMA) and primary motor cortex (M1)—as well as primate and rodent striatal areas (Tanji and Shima, 1994; Procyk and Joseph, 2001; Averbeck et al., 2002; Fujii and Graybiel, 2005; Lu and Ashe, 2005).
Animal studies suggest that in addition to the encoding of serial order, the encoding of boundary elements at the beginning and the end of the sequence is crucial for the acquisition of motor sequences (Fujii and Graybiel, 2003, 2005; Jin and Costa, 2010). In these studies, neuronal ensembles in the PFC and basal ganglia nuclei showed an increased neural response at the boundary elements (“Bo”) of a sequential task. Another study found that in macaque’s PFC, boundaries were associated with stronger neural representations compared to within-sequence (“In”) elements (Averbeck et al., 2002).
Crucially, the findings in non-human animals relating to specific neural representations for first and last sequence elements emphasize the relevance of sequence boundaries during sequence learning, as was also postulated in theoretical models of serial order memory (Dehaene and Changeux, 1997; Henson, 1998; Graybiel, 2008). However, the neural correlates of encoding sequence boundaries in humans have not yet been well studied and understood. Direct local field potential recordings from the human basal ganglia—available in patients with movement disorders—demonstrated differential changes of beta (13–30 Hz) oscillatory activity for boundary and within-sequence elements (Herrojo Ruiz et al., 2014a,b). In addition, one neuroimaging study reported that during a sequential visual task, increased BOLD activity in different prefrontal areas was specifically associated with either encoding boundary elements (the mid-ventrolateral PFC) or within-sequence elements (the mid-dorsolateral PFC and anterior cingulate cortex), respectively (Amiez and Petrides, 2007). Due to their specific characteristics, namely, clinical populations and visual task, these studies provide only fragmentary evidence regarding the neural mechanisms contributing to the encoding of boundary elements during sensorimotor sequence learning in humans.
In the current study, we explored how the encoding of sequence boundaries contributes to the formation of sensorimotor sequence representations during an early phase of learning. To address this question, we recorded the neuromagnetic activity from 21 healthy subjects while they performed short piano sequences. The acquisition of representations of piano sequences relies on the encoding of the precise mapping between the motor commands for the finger movements and the monitoring of the associated auditory feedback. To investigate the encoding of boundary elements in that context, we introduced an experimental manipulation. Specifically, we examined how alterations of feedback (AF), introduced at different ordinal positions (boundaries [Bo] or within-sequence elements [In]), affected the neural and behavioral responses during sensorimotor sequence learning.
Notably, AF introduced at random positions during piano performance elicit a frontocentral negative-going event-related potential (ERP) peaking between 140 ms and 240 ms, termed feedback-error related negativity (fERN), and likely generated by the anterior cingulate cortex (ACC, Maidhof et al., 2010). This component is followed within 280–330 ms by a later positive deflection with fronto-central topography, the P3a, reflecting an involuntary shift in attention towards unexpected stimuli (e.g., Escera et al., 2000). The use of better spatially-resolved techniques such as fMRI to investigate AF during piano performance has confirmed the crucial involvement of the ACC in processing AF (Pfordresher et al., 2014). Additionally, AF during music performance, as well as during singing and speech induced enhanced activation in the superior temporal lobe (Tourville et al., 2008; Zarate and Zatorre, 2008; Chang et al., 2013; Pfordresher et al., 2014; Herrojo Ruiz et al., 2017).
Here, in order to demonstrate a pivotal role of boundary elements encoding in sequence learning in humans, we tested the hypothesis that feedback alterations on boundary elements would disrupt behavioral performance to a greater extent than alterations on within-sequence elements. Parallel to the changes in performance, we anticipated that the brain responses during the early acquisition of auditory-motor representations would reflect different neural processing of the AF when introduced at the start/end or within-sequence elements. Based on the reviewed literature, we hypothesized that at the neural level, changes in processing AF at the different classes of ordinal positions (boundary, within-sequence elements) would be localized in areas of the prefrontal and temporal cortices, pre- and postcentral gyri (sensory and motor areas), and the SMA. Generic processing of AF, regardless of the ordinal position, should elicit a fERN (Maidhof et al., 2010), signaling the processing of error feedback that does not match the expected feedback based on motor prediction, itself related to prediction error (Chase et al., 2011). Initial processing of the error feedback may be followed by a deflection around 300 ms, similar to the P3a, related to the automatic shift of attention to deviant stimuli (Comerchero and Polich, 1999; Maidhof et al., 2010). Thus, a combined effect on behavioral and neuromagnetic responses would underscore the prominent role of boundaries during sensorimotor sequence learning.
Materials and Methods
Participants
Participants in the study included 21 healthy, right-handed subjects (10 females, aged 22–34 years, mean age = 27 years) with no extensive formal piano training (accumulated lifetime practice experience below 500 h). The participants had no history of neurological or psychiatric disorders and were compensated for their participation. They all gave written informed consent, and the study was approved by the ethics committee of the University of Leipzig. The data from one participant were discarded due to bad quality of the MEG recordings. In this study, we focused on evoked magnetic fields while our previous study used the data for the analysis of neuronal oscillations (Herrojo Ruiz et al., 2017).
Material and Procedure
This study is a re-analysis of Herrojo Ruiz et al. (2017) and the paradigm corresponds to the one used in that study, where more details are provided. The participants were asked to perform six sensorimotor sequences on a MIDI piano, using the dominant hand. The sequence patterns had a length of 4–5 notes and were explicitly taught to the participants (Figure 1A). The patterns were constructed to enable varying combinations of the successive finger movements. The keyboard did not have any ferromagnetic component and was tested for MEG and MRI compliance (Bangert et al., 2006). The time delay between keystrokes when registered as MIDI data and the corresponding trigger in the MEG recording was in the range of 20–25 ms. This interval was corrected for the MEG analyses. Accordingly, event-related field (ERF) waveforms at time 0 ms correspond with the keystroke.
Figure 1. Experimental paradigm and behavioral results. (A) Subjects performed piano sequences 1–6 with the dominant hand while listening to the associated normal feedback (NF) and occasionally to altered feedback (AF). The pitch content (and corresponding MIDI note numbers) of our custom-made MEG-compatible keyboard is displayed at the bottom of the schematic piano keys. For instance, as illustrated in the figure, if the key 61 [C] was played, participants could listen either to the NF (61, [C]) or to some AF (e.g., 63, [D]). The AF was presented either at the Boundary (Bo, green) or Within-sequence (In, orange) elements of the sequences. Each type of sequence was performed repeatedly in a block of 15 trials of 23 s duration each. The paradigm corresponds to the one used in Herrojo Ruiz et al. (2017). (B) The error rates after NF (light) and AF (dark) on Bo (green) and on In events (orange). The error rate was calculated as the ratio of the number of NF and AF events that induced an error in the next five key presses over the total number of the NF and AF events, respectively. (C) The timing performance (mean inter-onset interval or IOI, ms) of Bo (green) or In (orange) events with NF (light) or AF (dark) and the subsequent keystroke (+1). (D) The figure depicts the average difference in IOI (ms) between the analyzed events and the first subsequent keystroke for all the experimental conditions. Error bars denote the standard error of the mean; *p < 0.05 (non-parametric permutation test); n.s., not significant.
There were two sessions: familiarization and training. Both these sessions corresponded to an early stage of motor skill learning, which is characterized by rapid improvements in performance (i.e., improved timing and reduced error rates; Dayan and Cohen, 2011).
In both sessions, participants played the sequences and listened to the corresponding auditory feedback, which was delivered through air-conducting plastic ear tubes. During familiarization, participants practiced each sequence type in a block of three trials of 23 s duration. The performance tempo was paced by a metronome on a 200 bpm (beats per minute) speed prior to each trial (see next section).
In the training session participants completed, for each sequence type, a block consisting of 15 trials. In each trial of duration 23 s, participants had to play that sequence type continuously without breaks. As in the familiarization session, the tempo was induced at 200 bpm with the use of a metronome prior to each trial. Participants were instructed to play continuously during the trial’s length without stopping to correct any errors.
During the task, in 12 out of 15 trials for each sequence type, alterations of auditory feedback (termed, alterations of feedback, AF) were introduced randomly between every 8th and 10th produced note (every 8.37 [standard error of the mean or SEM, 0.05] keystrokes on average). We used this design because lower AF rates do not lead to behavioral effects (Maidhof et al., 2010; Pfordresher and Kulpa, 2011). The AF coincided with either boundary (Bo) or within-sequence (In) elements of the sequence. In the events with modified feedback, instead of hearing a tone corresponding to a given pressed key, participants heard either the tone of a different element of the sequence being played (i.e., from a different serial position in the sequence) or a tone that was unrelated to the sequence content. For a differential analysis of the two types of perturbations, see Herrojo Ruiz et al. (2017). Participants were informed, prior to the task, about the occasional occurrence of feedback alterations. Trials 1, 6 and 10 were perturbation-free.
Behavioral Analysis
General performance was evaluated by measuring three parameters, the average timing (time between consecutive keystrokes or inter-onset interval, IOI, in ms), the variability of timing (coefficient of variation for IOI [cvIOI]), and the error rate.
Previous studies focusing on sensorimotor sequence learning during an early training phase support the dissociation of two processes: (a) the encoding of the serial order of the actions (spatial feature), more strictly related to learning and reflected in error rates; (b) the concurrent improvements in performance, as reflected in faster tempo or reaction times and reduced temporal variability with training (Seidler et al., 2002; Kornysheva and Diedrichsen, 2014). Accordingly, to test the effects of AF on sequence learning, we used as main dependent variable the rate of pitch errors induced in the subsequent key presses. Specifically, the error rate was calculated as the ratio of the number of AF events that induced an error in the next five key presses over the total number of AF events. AF events that were followed by another AF event in the considered range—five subsequent keystrokes—were excluded from the error rate analysis, as the error could have been induced by the subsequent AF event. In addition, the effects of feedback alterations on the performance changes that typically accompany sequence learning (i.e., encoding) were assessed in terms average tempo and cvIOI in AF trials, as well as post-feedback slowing (larger IOI in events following the AF event).
MEG Data Acquisition and Pre-processing
Neuromagnetic signals were recorded during the performance session using a 306 sensor Elekta Neuromag system (Elekta Neuromag Oy, Helsinki, Finland) in an electromagnetically shielded room (Vacuumschmelze, Hanau, Germany). The MEG device has 102 triple sensor elements in a head-shaped array, and each of the elements was comprised of one magnetometer and two orthogonal planar gradiometers.
Head Position Indicator (HPI) coils attached to the scalp were used to monitor head movements. Vertical and horizontal bipolar electrooculograms (EOG) and electrocardiograms (ECG) were recorded simultaneously with the MEG recording so that we could control for ocular and cardiac artifacts.
Magnetic signals were recorded at a 1000 Hz sampling rate and a low pass filter of 330 Hz. The signal space separation method (Maxfilter Neuromag; Taulu et al., 2004) was used to suppress extracranial noise and to project individual signal space data to a default head position. This allowed performing statistical analyses across participants in sensor space. An additional correction was applied to one participant (#15) whose head displacement was larger than 5 mm (temporal-spatial filtering algorithm, MC Neuromag, Taulu and Kajola, 2005; Taulu and Simola, 2006). On average, the head displacement in all participants was 1.8 (standard error of the mean or SEM, 2) mm (range 0.5–4 mm; excluding Participant 15).
Further analysis was performed with custom-made Matlab algorithms (The MathWorks Inc., MA, USA) and the Fieldtrip toolbox (Oostenveld et al., 2011). The analysis was restricted to the 204 planar gradiometer sensors because they are more sensitive to cortical sources directly underneath them and less sensitive to extracranial noise sources (Hämäläinen et al., 1993). The continuous MEG data were filtered with a high pass filter of 1 Hz and a low pass filter of 100 Hz to minimize high frequency noise from MEG coils (Linear-phase FIR [Finite Impulse Response] filter as implemented by Fieldtrip with “firls” option, filter order = 6). Ocular and cardiac artifacts were identified and removed using the independent component analysis (FastICA, symmetric approach, with the hyperbolic tangent—tanh—as nonlinear function; Hyvärinen and Oja, 2000).
Sensor Space: Event-Related Fields
To analyze the ERFs in the sensor space, we segmented continuous data into epochs from −1 s to 1 s, time-locked to correctly played keystroke events in the different experimental conditions. The four experimental conditions included (Figure 1A): (i) boundary events with normal feedback (NF Bo); (ii) within-sequence events with NF (NF In), (iii) boundary events with AF (AF Bo); and (iv) within-sequence events with AF (AF In). For the conditions with AF, AF Bo and AF In, approximately 135 and 170 artifact-free epochs were extracted on average, respectively. From the larger pool of events (>1000) corresponding to the conditions with NF (NF Bo, NF In), we extracted the same number of events as those available in the AF conditions (135 and 170 on average, respectively). NF events following AF events (+1, +2) were excluded from this selection process. Importantly, the selected epochs were matched in timing (IOI) and keystroke velocity to the epochs from the AF conditions. These selected epochs were visually inspected for further artifacts. After visual inspection, 123 (SEM, 4) trials on average remained for NF Bo, 123 (4) for AF Bo and 157 (6) and 156 (6) for NF In and AF In, respectively. Finally, the average ERF across trials was estimated relative to a pre-keystroke baseline (−200 to −100 ms) and separately for each condition and participant. The data from the planar gradiometers were then combined at each sensor position by computing the mean square root of the signals (“combined planar” representation in FieldTrip).
Source Reconstruction
To examine between-condition differences at the source level, we calculated the current distribution of the sources with the use of L2-norm minimum-norm estimates (MNEs, Hämäläinen and Ilmoniemi, 1994; Dale et al., 2000). Note that MRI segmentation, coregistration and forward model estimation were performed as in Herrojo Ruiz et al. (2017). In brief, we first processed the individual T1-weighted MRI images (3T Magnetom Trio, Siemens AG, Germany) with the “Freesurfer” software1 for segmentation of the MRI data and cortical surface reconstruction. The “MNE” software2 was then used for the co-registration of the MR and MEG coordinate systems and the construction of boundary element conductivity models (BEM) to use in the forward calculations. We selected the inner skull surface as volume conductor geometry. Then we created a cortical grid in the MNI space template brain (as used in SPM8) with 4 mm resolution as the respective source space. We further warped this grid into the subject-specific space by use of transformation matrices obtained during the normalization of individual MR images. The source space, the volume conductor model, and the position of the planar gradiometer sensors were then used for the calculation of the forward model. In the last step, we computed the inverse solution using the L2-norm MNE method, as implemented in the FieldTrip software (minimum-norm estimate, based on Dale et al., 2000; Lin et al., 2004). MNE sources were estimated for each grid point at the time interval between 0.15 s and 0.37 s post-keystroke, and the individual source solutions were interpolated to a template MNI mesh. The noise-covariance matrix was estimated for each subject using data from the time intervals preceding each performance trial (thus corresponding to periods of no performance, amounting to ~3–5 min). The noise-covariance matrix was scaled using the regularization parameter λ (as in Dale et al., 2000; Lin et al., 2004). Here λ was set to the value recommended in the FieldTrip tutorial (initially, though, we explored different values of λ, which did not change the results qualitatively).
Statistical Analysis
To test for main effects and interactions of factors Feedback (normal, altered) and Position (boundary, within-sequence) in the behavioral data, we first run a 2 × 2 non-parametric factorial analysis (synchronized rearrangements, Good, 2005; Basso et al., 2007). This analysis was complemented with post hoc pair-wise permutation tests across subjects (Good, 2005) to assess significant differences between experimental conditions (e.g., error rates or mean IOI following AF at Bo and In). The difference in sample means was the test statistic. We performed n = 5000 rearrangements, drawn at random from the complete permutation distribution (Monte Carlo permutation test). The p-values were estimated as the percentage of the replications of the test statistic that had absolute values larger than the experimental difference.
To assess differences in the ERFs at the sensor level, we performed a 2 × 2 non-parametric factorial analysis with factors Feedback and Position. This analysis focused on the time interval from 0.15 s to 0.37 s post-keystroke. The choice of this time interval was primarily based on previous research suggesting that the main effects associated with the processing of AF during performance occur approximately between 0.15s and 0.25 s after the AF (Maidhof et al., 2010). However, we extended the window of analysis to 0.37 s because this was the average IOI during performance. Accordingly, the choice of this upper limit enabled the investigation of ERF effects up to the next keystroke.
To correct for multiple comparisons, we controlled the false discovery rate (FDR) at level q = 0.05 by using an adaptive two-stage linear step-up procedure (Benjamini et al., 2006). The result of this procedure is the corrected threshold p-value, which is provided in the text as pthr, when multiple comparisons were performed. Note that as a sanity check we also ran statistical tests in the peri-keystroke interval [−200, +150 ms] and found no significant clusters (p > 0.05).
Post hoc analyses of the ERFs following the 2 × 2 factorial analysis were performed in the same time window 0.15–0.37 s with non-parametric tests based on spatio-temporal clustering, using the FieldTrip software (dependent samples t-test, 1000 iterations; Maris and Oostenveld, 2007). The threshold to control for family-wise error (FWE) was set to p = 0.025 (two-sided test). The test statistic of the observed data was evaluated against the Monte-Carlo permutation distribution in order to test the null hypothesis of no difference between conditions. We applied the cluster-based permutation tests to the following between-condition differences: (i) AF Bo − NF Bo; (ii) AF In − NF In, to test for Feedback effects (Altered − Normal); (iii) NF Bo − NF In; and (iv) AF Bo − AF In to test for differences related to the ordinal Position (boundary − within-sequence).
Following up on the finding of a significant interaction between factors Position and Feedback in the sensor space (see “Results” section), statistical analyses at the source level focused on the pair-wise comparisons outlined in the previous paragraph for the ERF analysis and were carried out with pair-wise permutation tests.
To estimate statistically dynamic changes at the source level, we divided the time interval 0.15–0.37 s into four segments of 55 ms-length each and calculated the average source current distribution for each segment. We then applied the permutation test to assess between-condition source differences in each segment separately. The same design as the one used in the cluster statistics (described in the previous paragraph) was used to examine potential effects on the source level related to AF and ordinal position. The test was repeated for each grid point of the cortical mesh (~8000), and to correct for multiple comparisons, we kept the FDR at level q = 0.05, as indicated above. The anatomical locations with significant differences in source activity were identified by the use of the Automated Anatomical Labeling atlas (AAL). The regions of interest included the pre- and postcentral gyri, corresponding to sensory and motor areas, and the frontal and temporal cortices. Furthermore, the “area of significant activation” was estimated by the ratio of the number of significant grid points over the total number of grid points for each region (in percentage). In Tables 1, 2 we report the cortical regions with an “area of significant activation” equal or larger to 10%.
In addition to significance testing using permutation tests, a nonparametric effect size estimator, the probability of superiority for dependent samples or PSdep (Grissom and Kim, 2012), is reported. PSdep is an estimation of the probability (maximum 1) that in a randomly sampled pair of matched values (from same individual), the value from Condition B will be greater than the value from Condition A: PSdep = Pr (XiB > XiA). Throughout the manuscript, this index will be provided in association with each pair-wise permutation test.
Results
Behavioral Results
Data are provided as the mean and, in parentheses, SEM. Figure 1C depicts the average timing (IOI) of the analyzed events and the subsequent keystroke for all conditions, while Figure 1D depicts the average IOI difference between the two. We first compared the timing performance (mean IOI, ms) between events with NF and AF and, expectedly, found no significant differences (p > 0.05 overall, but also for all positions of 4-note sequences and 5-note sequences). To investigate the effect of AF on the timing of the subsequent keystrokes, and the role of ordinal position, we analyzed the change in the mean IOI (ms) at the keystroke following the feedback. A 2 × 2 factorial analysis of the timing performance at the subsequent (+1) stroke with factors Feedback (NF, AF) and Position (Bo, In) demonstrated a significant main effect for both factors (Feedback, p = 0.0064; Position, p < 0.001) as well as an interaction effect (p = 0.008). With respect to the main effect of Position, post hoc comparisons revealed that IOI change at +1 keystroke was significantly larger for Bo compared to In, both when AF and NF were introduced (AF: 49 [16] ms for Bo and −2.1 [3.7] ms for In, p < 0.001, PSdep = 0.85; NF: 34 [16] ms for Bo and −3.8 [2.4] ms for In, p < 0.001, PSdep = 0.95). Regarding the main effect in Feedback, post hoc comparisons were performed for IOI changes at +1 (Bo: AF vs. NF and In: AF vs. NF) in order to investigate whether the manifestation of AF-induced changes in tempo were different depending on the position in the sequence on which AF was introduced (Bo or In elements). When introduced at Bo elements, AF caused a slowing at the subsequent keystroke (an IOI difference of 49 [16] ms), which was significantly larger than the change in IOI observed at the same position following NF on Bo elements (34 [16] ms; p = 0.009, PSdep = 0.65; Figure 1D. Note that any changes in IOI following boundary elements with NF are due to sequence-position-specific characteristics). A similar analysis performed for In elements under AF showed that the effect of AF on the timing (IOI) of the subsequent keystroke was not significantly different relative to the timing (IOI) change at the same position following NF on In elements (p > 0.05; Figure 1D). The comparison of the difference (AF-NF) between Bo and In did not reach significance level (15 [6.2] ms for Bo and 1.7 [2.7] for In, p = 0.08, PSdep = 0.65). A detailed separate analysis for each ordinal position (Bo: first and last elements; In: each element within the sequence) confirmed this pattern of results and can be found in Supplementary Figure S1. In summary, these results show that the timing performance at the first subsequent keystroke is disturbed only after introducing AF on Bo but not on In elements.
With regard to the error rates, a 2 × 2 analysis on the error rates following Bo and In elements (for AF and NF events) unveiled a significant main effect for factor Feedback (p = 0.009) and a significant interaction (p = 0.013). This main effect was due to larger error rates following AF events (0.011 [0.02] on average) than NF events (0.005 [0.002]). When exploring the interaction effect, we found that AF induced a larger error rate when introduced at Bo elements relative to In elements (0.019 [0.003] for Bo and 0.016 [0.002] for In; p = 0.04, PSdep = 0.74; Figure 1B).
In sum, our findings on timing and error rates suggest that behavioral changes due to AF are sensitive to the class of ordinal position and that feedback modifications affecting start and end (boundary) elements seem to disrupt behavior to a greater extent than when introduced at within-sequence elements.
Lastly, we assessed sequence-specific learning and related changes in performance across time in purely NF trials, which are better suited to test effects of training when AF is not present (see also Herrojo Ruiz et al., 2017, for more details). There was no significant change in error rates from the first (#1) to the last (#11) NF trial (p > 0.05). Regarding changes in timing in NF trials across time, we found no significant changes in tempo but a significant increase in the extent of temporal variability (cvIOI = 0.24 [0.02] in trial 1 and 0.31 [0.03] in trial 11, p < pthr = 0.0028, PSdep = 0.80).
A similar analysis performed across time throughout the experiment, regardless of the sequence that participants played (i.e., comparison of data in the first and second half of the experiment), revealed also no significant changes in error rates in NF trials (p > 0.05). Notably, however, across time throughout the experiment there was a significant improvement in the average timing performance (mean IOI from 395 [10] ms to 374 [9] ms; p = 0.001, PSdep = 0.90).
Sensor Space: Event-Related Fields
The effects of AF and the ordinal position on the averaged ERFs were examined using a 2 × 2 non-parametric permutation-based factorial analysis with factors Feedback (normal, altered) and ordinal Position (Bo, In). This test revealed a significant main effect for both factors and a significant interaction within 150–370 ms post-keystroke (p < pthr = 0.0032 for interaction, 0.0050 for factor feedback and 0.0016 for factor position). Post hoc analyses with cluster-based permutation tests—performed in the same time window (150–370 ms)—revealed for both Feedback comparisons AF Bo − NF Bo and AF In − NF In a significantly larger activation for the AF conditions (p < 0.025). The significant difference was mainly due to enhanced activation over right (AF Bo − NF Bo) and bilateral (AF In − NF In) temporal sensors (Figure 2, Feedback, Figures 3A,B). A further comparison of the AF-NF difference between Bo and In (double difference statistic) was not significant. The cluster analysis for the comparisons NF Bo − NF In and AF Bo − AF In, which was performed to investigate the effect of ordinal position, demonstrated for both comparisons one positive and one negative significant cluster (p < 0.025; positive means Bo > In and negative Bo < In). In both cases, the differences were more pronounced over fronto-temporal and central sensors (Figure 2, Position, Figures 3C–F). Overall, this analysis revealed at the sensor level a differential activation pattern for the contrasted conditions in both the examined factors Feedback (AF vs. NF) and ordinal Position (Bo vs. In).
Figure 2. Sensor-level event-related field (ERF) effects. Each row shows the temporal evolution of the topography of the average ERF differences between the compared conditions in the window 0.15–0.37 ms post-keystroke. Cluster statistics were performed on that time window to explore possible significant effects for all of the four post hoc comparisons (1st row, AF Bo − NF Bo; 2nd AF In − NF In; 3rd NF Bo − NF In; 4th AF Bo − AF In). The lines below each row show the duration of each significant cluster (red denotes a positive and blue a negative effect) and the dots on the topography maps indicate the sensors that contribute to the cluster (black for positive and gray for negative cluster).
Figure 3. ERF waveforms for the significant effects. For all the significant effects revealed by the cluster statistical analysis (2 for Feedback and 4 for Position comparisons, see Figure 2), an exemplary topographical map of the ERF difference between the two compared conditions as well as their ERF waveform, averaged over the significant sensors, is shown here. The non-shaded area in each plot represents the duration of the respective cluster. The first row represents the significant effects in the “Bo: AF—NF” (A) and “In: AF—NF” (B) comparisons. The second and third row correspond to the positive (left) and negative (right) effects found in the “NF: Bo—In” (C,D) and “AF: Bo—In” (E,F) comparisons, respectively.
Source Space Analysis
Using minimum-norm estimates, we then contrasted the experimental conditions to define potential differences at the source level and their spatio-temporal characteristics. The selected time window from 0.15 s to 0.37 s was segmented into four epochs (S1–S4, see “Materials and Methods” section), and the source analysis results for each segment were contrasted between the compared conditions using a non-parametric permutation test. In Table 1 (Feedback effects) and Table 2 (Position effects), we report the cortical regions that showed significantly different activity between the contrasted conditions. The last column of the Tables indicates the time segments in which a particular region showed a significant effect while the sign in parenthesis in Table 2 (Position effects) indicates the direction of the effect. In the cases where an area is marked with “S1(+), S3 (–)”, that means that the particular region showed a Bo > In significant activation in the first segment (S1) while in segment S3, it showed an opposite activation pattern (Bo < In). Figure 4A illustrates the significant brain activations in the contrast between AF and NF (AF Bo − NF Bo, left panel; AF In − NF In, right panel). Table 1 indicates all the activated regions (p < pthr, see Table 3 and area of significant activation larger than 10% of the region’s total number of grid points). Both Feedback contrasts showed differential activation in a number of cortical locations primarily in the last two segments (S3: 260–315 ms, Figure 4A, first and third row for right and left hemisphere, respectively; S4: 315–370 ms, Figure 4A second and fourth rows), being consistent with our findings at the sensor-level (see Figure 2, Feedback). The main brain areas contributing to the significant AF minus NF contrasts were the primary sensorimotor (pre- and postcentral gyri), temporal (e.g., superior temporal gyrus) and frontal (mainly portions of the inferior frontal gyrus [IFG]) cortices. This result reflects a complex activation pattern involved in the processing of AF, which was most prominently right-lateralized for the Bo: AF-NF contrast (bilateral for the In contrast). In the right hemisphere, there was a large overlap in the implicated regions between the AF-NF contrasts for Bo and In elements. An additional comparison of the difference AF-NF between Bo and In revealed no significant effects.
Figure 4. Neural sources of the differential response to AF vs. NF and boundary vs. within-sequence items. (A) Feedback: significant source activity differences for “AF—NF”, separately for boundary (Bo, left) and within-sequence (In, right) elements. The maps depict the cortical sources for the right (top four maps) and left hemisphere (bottom four maps) in the windows 260–315 ms and 315–370 ms. (B) Ordinal Position: significant source activity differences for “Bo—In” (left NF, right AF) occurred at 150–205 ms (Bo > In) and 260–315 ms (Bo < In; criterion for visualization in both figures: p < pthr, non-parametric permutation test, pthr estimated to correct for multiple comparisons, see “Materials and Methods” section).
The analysis of the factor Position revealed a significant Bo—In effect mainly localized to bilateral fronto-temporal areas, including temporal regions and the IFG—similar to the analysis of factor Feedback—but also extending substantially to the dorsolateral and medial prefrontal cortex (DLPFC, MPFC; Figure 4B, Table 2). In the first segment (S1: 150–205 ms, Figure 4B, Table 2), both for AF and NF Bo—In contrasts, frontal regions such as the bilateral IFG, DLPFC, MPFC and the right SMA as well as several temporal areas (e.g., the left middle temporal gyrus) showed a significantly larger response at Bo relative to In elements. In contrast, in the third segment (S3: 260–315 ms, Figure 4B, second and fourth rows, Table 2) of the AF Bo—In contrast, we identified a significantly weaker response at Bo compared to In elements in temporal areas (e.g., the left superior and middle temporal gyri). Overall, the results of the source reconstruction indicate, first, that the processing of feedback alterations was associated with neural activity distributed over a wide cortical network including temporal regions and portions of the IFC. Additionally, the results show that fronto-temporal regions, similar to those found for the feedback analysis but additionally extending to the (DLPFC and MPFC), were differentially activated during the encoding of boundary elements, compared to within-sequence elements. The last finding, in addition to the results from the sensor-level and the behavioral analysis, suggests a distinct role for the encoding of boundaries during sensorimotor sequence learning in humans.
Discussion
Does the encoding of boundary and within-sequence elements differ during sensorimotor sequence learning? To address this question, the current study compared the effects of alterations of auditory feedback when introduced at boundary and within-sequence elements during the learning of short piano sequences. In terms of behavior, we found that AF led to greater disruptive changes in performance when applied at boundary relative to within-sequence elements. At the neural level, we also found a dissociation in the neural responses in the two conditions, demonstrated by the ERF and the source reconstruction analysis. Our findings thus suggest that the encoding of boundary elements plays an essential role during the early-stage acquisition of sensorimotor sequences.
Altered Feedback at Boundary Elements Impairs Performance to a Greater Extent
The analysis of performance revealed that AF when introduced at boundary elements—compared to within-sequence elements—resulted in larger error rates and a larger post-feedback slowing in the subsequent keystroke. This finding indicates that disturbances of the action-perception contingencies during the encoding of boundaries can affect both the accuracy and the timing of behavior. Although the slowing down in timing performance induced by AF might not be a direct indicator of impaired learning, the larger number of errors after AF at boundaries demonstrates that disturbance of the encoding of boundaries leads to greater disruption of sequential learning. This interpretation is aligned with the reported dissociation between error rates—as an index of sequence encoding and learning—and the changes in the timing of performance that occur in parallel (e.g., Seidler et al., 2002). Thus, the evidence supports that the adequate encoding of start and end elements facilitates sensorimotor sequence learning.
Fronto-Temporal and Sensorimotor Responses to Altered Feedback (AF)
With respect to the neuromagnetic processing of AF, the present data showed that AF activated the auditory cortex, the IFG, and the pre- and post-central gyri. The temporal cortex is involved in processing musical sequences (Hickok et al., 2003; Koelsch et al., 2005) and auditory sequence violations (Giard et al., 1995; Uhrig et al., 2014). In addition, it is critical for auditory-motor transformations in speech and aspects of musical abilities (Hickok and Poeppel, 2004). Although a recent fMRI study employing a similar design (Pfordresher et al., 2014) reported Spt (Sylvian parieto-temporal area) but no temporal cortex activation in response to AF, this might be due to the low sensitivity of fMRI in detecting rapid changes in brain activity, which were, however, detectable in the current experiment. The IFG (BA44) is involved in syntax processing during music performance and perception (Maess et al., 2001; Bianco et al., 2016) while music-related auditory-motor transformations involved IFG as well as the premotor cortex (Bangert et al., 2006; Lahav et al., 2007). Significantly, an earlier model of musical processing by Koelsch (2005) suggested that musical syntax engages the inferior frontolateral cortex (BA44), the premotor cortex and STG with a right hemispheric dominance. Because our sequences did not follow any harmonic rules but were instead created as simple associations between movement and pitch values, our findings expand upon the proposal by Koelsch et al. (2005) in demonstrating the involvement of these structures in processing more generic auditory-motor associations. This interpretation thus aligns well with anatomical data from non-human primates showing that STG is directly connected to IFG and the premotor cortex (see Zatorre et al., 2007).
We additionally found simultaneous activation of pre- and post-central gyri, which have been reported in previous studies to be activated during music performance and imagery (Meister et al., 2004) and during auditory feedback control in speech (Tourville et al., 2008). Thus, this finding might reflect the activation of the sensorimotor network that is known to connect sensory and the motor cortex during motor behavior (e.g., Brovelli et al., 2004).
Another interesting finding of the source analysis was a difference in laterality for the effects of AF depending on the position on which AF was introduced. While AF on within-sequence elements resulted in the engagement of bilateral fronto-temporal areas, the processing of feedback alterations on Bo was limited to right fronto-temporal areas. This difference in laterality could reflect a more localized processing of disturbances of boundary encoding as opposed to within-sequence elements. However, clarifying the implications of this laterality effect will require follow up studies aiming to replicate our findings.
DLPFC and SMA Support the Differential Encoding of Ordinal Position
The main aim of the study was to investigate whether different neural mechanisms are involved in the encoding of boundary and within-sequence elements and their disruption by AF. We found evidence supporting differences in the event-related responses to boundaries and within-sequence elements under AF as demonstrated by an early fronto-temporal positivity and a subsequent negativity over mainly temporal sensors. The source reconstruction analysis suggested that the latter effect emerges around 260–315 ms and stems mainly from the left temporal cortex. Although a similar negativity in the ERF was observed also for the NF at the sensor level, the effect was shorter in time and spatially more confined. Note, however, that this effect had no corresponding significant source. This negative ERF pattern found in the Bo—In contrast might be associated with the interaction effect as it extended over a longer period and was more pronounced in AF than NF conditions. Note, however, that our post hoc exploration of the interaction effect (factors Feedback—AF, NF—and Position—Bo, In) did not reveal conclusive results concerning the direction of changes leading to the interaction effects.
In contrast, the early fronto-temporal positivity had the same spatio-temporal characteristics under both NF and AF, suggesting that it might index merely different encoding of boundary and within-sequence elements regardless of the auditory feedback. The source analysis supports this idea by revealing a large overlap of activated regions in the corresponding time window between the NF Bo—NF In and AF Bo—AF In contrasts. This activation was distributed predominantly over the bilateral DLPFC, an area which has been consistently implicated in different aspects of serial order encoding both in human and non-human primate studies (Averbeck et al., 2002; Ninokura et al., 2004; Fujii and Graybiel, 2005; Amiez and Petrides, 2007). The observed additional activation of the right SMA is in agreement with studies linking SMA to sequential behavior (Shima and Tanji, 2006) and proposing a key role for SMA in the encoding of temporal structure (Kotz and Schwartze, 2011). The left IFG and temporal regions that were also activated have been related to auditory chunking (Dehaene et al., 2015) and formation of structural representations (only left IFG; Karuza et al., 2013), and thus they probably act complementarily to DLPFC and SMA. The present findings provide a clear indication for the differential encoding of boundaries and within-sequence elements during the early stage of sensorimotor sequence learning, a process that is supported by the DLPFC and SMA with the IFG and temporal regions having a complementary role.
Previous studies have demonstrated that the basal ganglia are involved in the encoding of sequence boundaries during the acquisition of sequential behavior (Jin and Costa, 2010; Herrojo Ruiz et al., 2014a,b). This finding converges with similar evidence for the frontal (Fujii and Graybiel, 2005) and motor cortex (Santos et al., 2015). We therefore propose that DLPFC and SMA might interact with the basal ganglia via cortico-basal ganglia loops during the differential encoding of boundary and within-sequence elements during the early phase of sequence acquisition. Ultimately, this encoding leads to the concatenation of actions into integrated units of behavior (Graybiel, 2008).
Methodological Considerations
As in any MEG study, the results of the source localization should be interpreted with caution, given the limitations that affect source localization of MEG data (Hari et al., 1988; Hansen et al., 2010). Moreover, it should be noted that minimum-norm estimates are limited to the cortical surface, thereby excluding the possibility of detecting other structures relevant for processing feedback-related errors in our paradigm, such as the cerebellum (Herrojo Ruiz et al., 2017). In addition, a higher sensitivity to neuromagnetic sources in the cerebellum might be better achieved by focusing on high-frequency oscillatory activity (E/MEG: Dalal et al., 2008, 2013).
Interestingly, a few recent studies have reported the involvement of the cingulate cortex in processing AF during piano performance (Maidhof et al., 2010; Pfordresher et al., 2014) or sensorimotor sequence learning (Herrojo Ruiz et al., 2017). The limited sensitivity of the L2-norm MNE to this region might account for this apparent discrepancy. However, our reported sources associated with processing AF (i.e., STG) or differential encoding of boundary elements (DLPFC) might be more consistent with these processes.
Conclusion
The present findings emphasize the importance of boundaries in sensorimotor sequence learning and extend previous studies by showing that the differential encoding of boundaries and within-sequence elements is sensitive to changes in auditory feedback and relies on dorsolateral prefrontal and higher-level motor areas. This finding is particularly relevant for understanding the neural circuitry behind the encoding of serial order position of actions in behavioral sequences that rely on auditory-motor coupling. Thus, it has implications for future studies investigating sequence learning in music, speech and singing.
Author Contributions
GM was involved in the data analysis and the writing of the manuscript. MHR was involved in the design of the study, data acquisition, data analysis and the writing of the manuscript. VVN, GC and BM were involved in the design of the study and a critical revision of the manuscript.
Funding
This research was supported by the German Research Foundation (DFG) through project HE 6013/1-2 to MHR. VVN was supported by the HSE Basic Research Program and the Russian Academic Excellence Project “5–100”. Note that a preprint of this article also exists (Michail et al., 2017).
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We thank Yvonne Wolff for her invaluable help with the MEG data acquisition.
Footnotes
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnhum.2018.00240/full#supplementary-material
FIGURE S1 | The effect of altered feedback (AF) on the timing performance of the subsequent keystroke. The plot shows the timing performance (mean IOI, ms) at the subsequent keystroke, +1, after normal feedback (NF; gray) or altered feedback (AF; black) at each element of the 4-note sequences (Position 1, 2, 3 and 4). When participants heard altered feedback, there was a post-feedback slowing at the next keystroke, which was prominent only after AF on boundary elements (significant for the Position 4, p < pthr = 0.02, PSdep = 0.65; trend relative to pthr for the Position 1, p = 0.06, PSdep = 0.70). *p < pthr, non-parametric permutation test, pthr estimated to correct for multiple comparisons, see “Materials and Methods” section in the main text.
References
Amiez, C., and Petrides, M. (2007). Selective involvement of the mid-dorsolateral prefrontal cortex in the coding of the serial order of visual stimuli in working memory. Proc. Natl. Acad. Sci. U S A 104, 13786–13791. doi: 10.1073/pnas.0706220104
Averbeck, B. B., Chafee, M. V., Crowe, D. A., and Georgopoulos, A. P. (2002). Parallel processing of serial movements in prefrontal cortex. Proc. Natl. Acad. Sci. U S A 99, 13172–13177. doi: 10.1073/pnas.162485599
Bangert, M., Peschel, T., Schlaug, G., Rotte, M., Drescher, D., Hinrichs, H., et al. (2006). Shared networks for auditory and motor processing in professional pianists: evidence from fMRI conjunction. Neuroimage 30, 917–926. doi: 10.1016/j.neuroimage.2005.10.044
Basso, D., Chiarandini, M., and Salmaso, L. (2007). Synchronized permutation tests in replicated I × J designs. J. Stat. Plan. Inference 137, 2564–2578. doi: 10.1016/j.jspi.2006.04.016
Benjamini, Y., Krieger, A. M., and Yekutieli, D. (2006). Adaptive linear step-up procedures that control the false discovery rate. Biometrika 93, 491–507. doi: 10.1093/biomet/93.3.491
Bianco, R., Novembre, G., Keller, P. E., Kim, S.-G., Scharf, F., Friederici, A. D., et al. (2016). Neural networks for harmonic structure in music perception and action. Neuroimage 142, 454–464. doi: 10.1016/j.neuroimage.2016.08.025
Brovelli, A., Ding, M., Ledberg, A., Chen, Y., Nakamura, R., and Bressler, S. L. (2004). β oscillations in a large-scale sensorimotor cortical network: directional influences revealed by Granger causality. Proc. Natl. Acad. Sci. U S A 101, 9849–9854. doi: 10.1073/pnas.0308538101
Chang, E. F., Niziolek, C. A., Knight, R. T., Nagarajan, S. S., and Houde, J. F. (2013). Human cortical sensorimotor network underlying feedback control of vocal pitch. Proc. Natl. Acad. Sci. U S A 110, 2653–2658. doi: 10.1073/pnas.1216827110
Chase, H. W., Swainson, R., Durham, L., Benham, L., and Cools, R. (2011). Feedback-related negativity codes prediction error but not behavioral adjustment during probabilistic reversal learning. J. Cogn. Neurosci. 23, 936–946. doi: 10.1162/jocn.2010.21456
Comerchero, M. D., and Polich, J. (1999). P3a and P3b from typical auditory and visual stimuli. Clin. Neurophysiol. 110, 24–30. doi: 10.1016/s0168-5597(98)00033-1
Dalal, S. S., Osipova, D., Bertrand, O., and Jerbi, K. (2013). Oscillatory activity of the human cerebellum: the intracranial electrocerebellogram revisited. Neurosci. Biobehav. Rev. 37, 585–593. doi: 10.1016/j.neubiorev.2013.02.006
Dalal, S. S., Guggisberg, A. G., Edwards, E., Sekihara, K., Findlay, A. M., Canolty, R. T., et al. (2008). Five-dimensional neuroimaging: localization of the time-frequency dynamics of cortical activity. Neuroimage 40, 1686–1700. doi: 10.1016/j.neuroimage.2008.01.023
Dale, A. M., Liu, A. K., Fischl, B. R., Buckner, R. L., Belliveau, J. W., Lewine, J. D., et al. (2000). Dynamic statistical parametric mapping: combining fMRI and MEG for high-resolution imaging of cortical activity. Neuron 26, 55–67. doi: 10.1016/S0896-6273(00)81138-1
Dayan, E., and Cohen, L. G. (2011). Neuroplasticity subserving motor skill learning. Neuron 72, 443–454. doi: 10.1016/j.neuron.2011.10.008
Dehaene, S., and Changeux, J.-P. (1997). A hierarchical neuronal network for planning behavior. Proc. Natl. Acad. Sci. U S A 94, 13293–13298. doi: 10.1073/pnas.94.24.13293
Dehaene, S., Meyniel, F., Wacongne, C., Wang, L., and Pallier, C. (2015). The neural representation of sequences: from transition probabilities to algebraic patterns and linguistic trees. Neuron 88, 2–19. doi: 10.1016/j.neuron.2015.09.019
Escera, C., Alho, K., Schröger, E., and Winkler, I. W. (2000). Involuntary attention and distractibility as evaluated with event-related brain potentials. Audiol. Neurootol. 5, 151–166. doi: 10.1159/000013877
Fujii, N., and Graybiel, A. M. (2003). Representation of action sequence boundaries by macaque prefrontal cortical neurons. Science 301, 1246–1249. doi: 10.1126/science.1086872
Fujii, N., and Graybiel, A. M. (2005). Time-varying covariance of neural activities recorded in striatum and frontal cortex as monkeys perform sequential-saccade tasks. Proc. Natl. Acad. Sci. U S A 102, 9032–9037. doi: 10.1073/pnas.0503541102
Giard, M., Lavikahen, J., Reinikainen, K., Perrin, F., Bertrand, O., Pernier, J., et al. (1995). Separate representation of stimulus frequency, intensity and duration in auditory sensory memory: an event-related potential and dipole-model analysis. J. Cogn. Neurosci. 7, 133–143. doi: 10.1162/jocn.1995.7.2.133
Good, P. I. (2005). Permutation, Parametric and Bootstrap Tests of Hypotheses: A Practical Guide to Resampling Methods for Testing Hypotheses. 3rd Edn. New York, NY: Springer.
Graybiel, A. M. (2008). Habits, rituals, and the evaluative brain. Annu. Rev. Neurosci. 31, 359–387. doi: 10.1146/annurev.neuro.29.051605.112851
Grissom, R. J., and Kim, J. J. (2012). Effect Sizes for Research: Univariate and Multivariate Applications. New York, NY: Routledge.
Hämäläinen, M. S., and Ilmoniemi, R. J. (1994). Interpreting magnetic fields of the brain: minimum norm estimates. Med. Biol. Eng. Comput. 32, 35–42. doi: 10.1007/bf02512476
Hämäläinen, M., Hari, R., Ilmoniemi, R. J., Knuutila, J., and Lounasmaa, O. V. (1993). Magnetoencephalography—theory, instrumentation, and applications to noninvasive studies of the working human brain. Rev. Mod. Phys. 65, 413–497. doi: 10.1103/revmodphys.65.413
Hansen, P., Kringelbach, M., and Salmelin, R. (2010). MEG: An Introduction to Methods. Oxford: Oxford University Press.
Hari, R., Joutsiniemi, S. L., and Sarvas, J. (1988). Spatial resolution of neuromagnetic records: theoretical calculations in a spherical model. Electroencephalogr. Clin. Neurophysiol. 71, 64–72. doi: 10.1016/0168-5597(88)90020-2
Henson, R. N. (1998). Short-term memory for serial order: the start-end model. Cogn. Psychol. 36, 73–137. doi: 10.1006/cogp.1998.0685
Herrojo Ruiz, M., Brücke, C., Nikulin, V. V., Schneider, G.-H., and Kühn, A. A. (2014a). β-band amplitude oscillations in the human internal globus pallidus support the encoding of sequence boundaries during initial sensorimotor sequence learning. Neuroimage 85, 779–793. doi: 10.1016/j.neuroimage.2013.05.085
Herrojo Ruiz, M., Rusconi, M., Brücke, C., Haynes, J. D., Schönecker, T., and Kühn, A. A. (2014b). Encoding of sequence boundaries in the subthalamic nucleus of patients with Parkinson’s disease. Brain 137, 2715–2730. doi: 10.1093/brain/awu191
Herrojo Ruiz, M., Jabusch, H.-C., and Altenmüller, E. (2009). Detecting wrong notes in advance: neuronal correlates of error monitoring in pianists. Cereb. Cortex 19, 2625–2639. doi: 10.1093/cercor/bhp021
Herrojo Ruiz, M., Maess, B., Altenmüller, E., Curio, G., and Nikulin, V. V. (2017). Cingulate and cerebellar β oscillations are engaged in the acquisition of auditory-motor sequences. Hum. Brain Mapp. 38, 5161–5179. doi: 10.1002/hbm.23722
Herrojo Ruiz, M., Strübing, F., Jabusch, H.-C., and Altenmüller, E. (2011). EEG oscillatory patterns are associated with error prediction during music performance and are altered in musician’s dystonia. Neuroimage 55, 1791–1803. doi: 10.1016/j.neuroimage.2010.12.050
Hickok, G., Buchsbaum, B., Humphries, C., and Muftuler, T. (2003). Auditory-motor interaction revealed by fMRI: speech, music and working memory in area Spt. J. Cogn. Neurosci. 15, 673–682. doi: 10.1162/jocn.2003.15.5.673
Hickok, G., and Poeppel, D. (2004). Dorsal and ventral streams: a framework for understanding aspects of the functional anatomy of language. Cognition 92, 67–99. doi: 10.1016/j.cognition.2003.10.011
Hikosaka, O., Nakahara, H., Rand, M. K., Sakai, K., Lu, X., Nakamura, K., et al. (1999). Parallel neural networks for learning sequential procedures. Trends Neurosci. 22, 464–471. doi: 10.1016/s0166-2236(99)01439-3
Hyvärinen, A., and Oja, E. (2000). Independent component analysis: algorithms and applications. Neural Netw. 13, 411–430. doi: 10.1016/s0893-6080(00)00026-5
Jin, X., and Costa, R. M. (2010). Start/stop signals emerge in nigrostriatal circuits during sequence learning. Nature 466, 457–462. doi: 10.1038/nature09263
Kao, M. H., Doupe, A. J., and Brainard, M. S. (2005). Contributions of an avian basal ganglia-forebrain circuit to real-time modulation of song. Nature 433, 638–643. doi: 10.1038/nature03127
Karuza, E. A., Newport, E. L., Aslin, R. N., Starling, S. J., Tivarus, M. E., and Bavelier, D. (2013). The neural correlates of statistical learning in a word segmentation task: an fMRI study. Brain Lang. 127, 46–54. doi: 10.1016/j.bandl.2012.11.007
Koelsch, S. (2005). Neural substrates of processing syntax and semantics in music. Curr. Opin. Neurobiol. 15, 207–212. doi: 10.1016/j.conb.2005.03.005
Koelsch, S., Fritz, T., Schulze, K., Alsop, D., and Schlaug, G. (2005). Adults and children processing music: an fMRI study. Neuroimage 25, 1068–1076. doi: 10.1016/j.neuroimage.2004.12.050
Kornysheva, K., and Diedrichsen, J. (2014). Human premotor areas parse sequences into their spatial and temporal features. Elife 3:e03043. doi: 10.7554/elife.03043
Kotz, S. A., and Schwartze, M. (2011). Differential input of the supplementary motor area to a dedicated temporal processing network: functional and clinical implications. Front. Integr. Neurosci. 5:86. doi: 10.3389/fnint.2011.00086
Lahav, A., Saltzman, E., and Schlaug, G. (2007). Action representation of sound: audiomotor recognition network while listening to newly acquired actions. J. Neurosci. 27, 308–314. doi: 10.1523/jneurosci.4822-06.2007
Lehéricy, S., Benali, H., Van de Moortele, P.-F., Pélégrini-Issac, M., Waechter, T., Ugurbil, K., et al. (2005). Distinct basal ganglia territories are engaged in early and advanced motor sequence learning. Proc. Natl. Acad. Sci. U S A 102, 12566–12571. doi: 10.1073/pnas.0502762102
Lin, F. H., Witzel, T., Hämäläinen, M. S., Dale, A. M., Belliveau, J. W., and Stufflebeam, S. M. (2004). Spectral spatiotemporal imaging of cortical oscillations and interactions in the human brain. Neuroimage 23, 582–595. doi: 10.1016/j.neuroimage.2004.04.027
Lu, X., and Ashe, J. (2005). Anticipatory activity in primary motor cortex codes memorized movement sequences. Neuron 45, 967–973. doi: 10.1016/j.neuron.2005.01.036
Maess, B., Koelsch, S., Gunter, T. C., and Friederici, A. D. (2001). Musical syntax is processed in Broca’s area: an MEG study. Nat. Neurosci. 4, 540–545. doi: 10.1038/87502
Maidhof, C., Vavatzanidis, N., Prinz, W., Rieger, M., and Koelsch, S. (2010). Processing expectancy violations during music performance and perception: an ERP study. J. Cogn. Neurosci. 22, 2401–2413. doi: 10.1162/jocn.2009.21332
Maris, E., and Oostenveld, R. (2007). Nonparametric statistical testing of EEG-and MEG-data. J. Neurosci. Methods 164, 177–190. doi: 10.1016/j.jneumeth.2007.03.024
Meister, I. G., Krings, T., Foltys, H., Boroojerdi, B., Müller, M., Töpper, R., et al. (2004). Playing piano in the mind—an fMRI study on music imagery and performance in pianists. Cogn. Brain Res. 19, 219–228. doi: 10.1016/j.cogbrainres.2003.12.005
Michail, G., Nikulin, V. V., Curio, G., Maess, B., and Herrojo Ruiz, M. (2017). Encoding of boundaries is critical for sensorimotor sequence learning: an MEG study. Available online at: http://research.gold.ac.uk/22408/1/Michail_etal_Herrojo_NI_underreview.pdf
Mushiake, H., and Strick, P. L. (1995). Pallidal neuron activity during sequential arm movements. J. Neurophysiol. 74, 2754–2758. doi: 10.1152/jn.1995.74.6.2754
Natke, U., Donath, T. M., and Kalveram, K. T. (2003). Control of voice fundamental frequency in speaking versus singing. J. Acoust. Soc. Am. 113, 1587–1593. doi: 10.1121/1.1543928
Ninokura, Y., Mushiake, H., and Tanji, J. (2004). Integration of temporal order and object information in the monkey lateral prefrontal cortex. J. Neurophysiol. 91, 555–560. doi: 10.1152/jn.00694.2003
Ölveczky, B. P., Otchy, T. M., Goldberg, J. H., Aronov, D., and Fee, M. S. (2011). Changes in the neural control of a complex motor sequence during learning. J. Neurophysiol. 106, 386–397. doi: 10.1152/jn.00018.2011
Oostenveld, R., Fries, P., Maris, E., and Schoffelen, J.-M. (2011). FieldTrip: open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data. Comput. Intell. Neurosci. 2011:156869. doi: 10.1155/2011/156869
Patel, A. D. (2011). Why would musical training benefit the neural encoding of speech? The OPERA hypothesis. Front. Psychol. 2:142. doi: 10.3389/fpsyg.2011.00142
Penhune, V., and Doyon, J. (2005). Cerebellum and M1 interaction during early learning of timed motor sequences. Neuroimage 26, 801–812. doi: 10.1016/j.neuroimage.2005.02.041
Pfordresher, P. Q., and Kulpa, J. D. (2011). The dynamics of disruption from altered auditory feedback: further evidence for a dissociation of sequencing and timing. J. Exp. Psychol. Hum. Percept. Perform. 37, 949–967. doi: 10.1037/a0021435
Pfordresher, P. Q., Mantell, J. T., Brown, S., Zivadinov, R., and Cox, J. L. (2014). Brain responses to altered auditory feedback during musical keyboard production: an fMRI study. Brain Res. 1556, 28–37. doi: 10.1016/j.brainres.2014.02.004
Procyk, E., and Joseph, J.-P. (2001). Characterization of serial order encoding in the monkey anterior cingulate sulcus. Eur. J. Neurosci. 14, 1041–1046. doi: 10.1046/j.0953-816x.2001.01738.x
Santos, F. J., Oliveira, R. F., Jin, X., and Costa, R. M. (2015). Corticostriatal dynamics encode the refinement of specific behavioral variability during skill learning. Elife 4:e09423. doi: 10.7554/eLife.09423
Seidler, R. D., Purushotham, A., Kim, S. G., Uğurbil, K., Willingham, D., and Ashe, J. (2002). Cerebellum activation associated with performance change but not motor learning. Science 296, 2043–2046. doi: 10.1126/science.1068524
Shima, K., and Tanji, J. (2006). Binary-coded monitoring of a behavioral sequence by cells in the pre-supplementary motor area. J. Neurosci. 26, 2579–2582. doi: 10.1523/JNEUROSCI.4161-05.2006
Tanji, J., and Shima, K. (1994). Role for supplementary motor area cells in planning several movements ahead. Nature 371, 413–416. doi: 10.1038/371413a0
Taulu, S., and Kajola, M. (2005). Presentation of electromagnetic multichannel data: the signal space separation method. J. Appl. Phys. 97:124905. doi: 10.1063/1.1935742
Taulu, S., Kajola, M., and Simola, J. (2004). Suppression of interference and artifacts by the signal space separation method. Brain Topogr. 16, 269–275. doi: 10.1023/b:brat.0000032864.93890.f9
Taulu, S., and Simola, J. (2006). Spatiotemporal signal space separation method for rejecting nearby interference in MEG measurements. Phys. Med. Biol. 51, 1759–1768. doi: 10.1088/0031-9155/51/7/008
Tourville, J. A., Reilly, K. J., and Guenther, F. H. (2008). Neural mechanisms underlying auditory feedback control of speech. Neuroimage 39, 1429–1443. doi: 10.1016/j.neuroimage.2007.09.054
Uhrig, L., Dehaene, S., and Jarraya, B. (2014). A hierarchy of responses to auditory regularities in the macaque brain. J. Neurosci. 34, 1127–1132. doi: 10.1523/JNEUROSCI.3165-13.2014
Wymbs, N. F., Bassett, D. S., Mucha, P. J., Porter, M. A., and Grafton, S. T. (2012). Differential recruitment of the sensorimotor putamen and frontoparietal cortex during motor chunking in humans. Neuron 74, 936–946. doi: 10.1016/j.neuron.2012.03.038
Zatorre, R. J., and Baum, S. R. (2012). Musical melody and speech intonation: singing a different tune. PLoS Biol. 10:e1001372. doi: 10.1371/journal.pbio.1001372
Zatorre, R. J., Chen, J. L., and Penhune, V. B. (2007). When the brain plays music: auditory-motor interactions in music perception and production. Nat. Rev. Neurosci. 8, 547–558. doi: 10.1038/nrn2152
Keywords: serial order, boundaries, prefrontal cortex, supplementary motor area, sensorimotor learning, sequence learning
Citation: Michail G, Nikulin VV, Curio G, Maess B and Herrojo Ruiz M (2018) Disruption of Boundary Encoding During Sensorimotor Sequence Learning: An MEG Study. Front. Hum. Neurosci. 12:240. doi: 10.3389/fnhum.2018.00240
Received: 21 February 2018; Accepted: 24 May 2018;
Published: 12 June 2018.
Edited by:
Felix Blankenburg, Freie Universität Berlin, GermanyReviewed by:
Rolf Verleger, Universität zu Lübeck, GermanyTakako Fujioka, Stanford University, United States
Copyright © 2018 Michail, Nikulin, Curio, Maess and Herrojo Ruiz. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Vadim V. Nikulin, bmlrdWxpbkBjYnMubXBnLmRl
María Herrojo Ruiz bS5oZXJyb2pvLXJ1aXpAZ29sZC5hYy51aw==