- 1State Key Laboratory of Mechanical Systems and Vibrations, Institute of Robotics, Shanghai Jiao Tong University, Shanghai, China
- 2Department of Neurosurgery of Huashan Hospital, Fudan University, Shanghai, China
- 3Department of Electronic and Electrical Engineering, University of Bath, Bath, United Kingdom
Stereo-electroencephalography (SEEG) utilizes localized and penetrating depth electrodes to directly measure electrophysiological brain activity. The implanted electrodes generally provide a sparse sampling of multiple brain regions, including both cortical and subcortical structures, making the SEEG neural recordings a potential source for the brain–computer interface (BCI) purpose in recent years. For SEEG signals, data cleaning is an essential preprocessing step in removing excessive noises for further analysis. However, little is known about what kinds of effect that different data cleaning methods may exert on BCI decoding performance and, moreover, what are the reasons causing the differentiated effects. To address these questions, we adopted five different data cleaning methods, including common average reference, gray–white matter reference, electrode shaft reference, bipolar reference, and Laplacian reference, to process the SEEG data and evaluated the effect of these methods on improving BCI decoding performance. Additionally, we also comparatively investigated the changes of SEEG signals induced by these different methods from multiple-domain (e.g., spatial, spectral, and temporal domain). The results showed that data cleaning methods could improve the accuracy of gesture decoding, where the Laplacian reference produced the best performance. Further analysis revealed that the superiority of the data cleaning method with excellent performance might be attributed to the increased distinguishability in the low-frequency band. The findings of this work highlighted the importance of applying proper data clean methods for SEEG signals and proposed the application of Laplacian reference for SEEG-based BCI.
Introduction
Brain–computer interface (BCI) has been a promising solution toward bridging the gap between disabled people and the external environment (Collinger et al., 2013). Compared with commonly used electroencephalography (EEG), intracranial electroencephalography (iEEG), such as electrocorticography (ECoG), implants grid or strip electrodes directly on the cortical surface and has the characteristics of higher spatial resolution, higher temporal resolution, and higher signal amplitude in the recorded neural signals (Freeman et al., 2000; Ball et al., 2009; Slutzky et al., 2010; Sheehan et al., 2018). In the past 15 years, ECoG-based BCIs have been paid extensive attention (Lebedev and Nicolelis, 2006; Schalk and Leuthardt, 2012; Volkova et al., 2019). Significant achievements have been made for ECoG-based motor BCIs in fine hand movement decoding, such as individual finger classification or hand kinematics regression (Acharya et al., 2010; Flint et al., 2014; Nakanishi et al., 2014; Hotson et al., 2016). Other ECoG studies decoded commonly used functional gestures directly and attained good decoding performance (Pistohl et al., 2012; Chestek et al., 2013; Spueler et al., 2014; Bleichner et al., 2016; Branco et al., 2017). Note that ECoG provides wide coverage over a large area of cortical regions, whereas another increasingly used iEEG recording technique in recent years, stereo-electroencephalography (SEEG), implant multiple depth electrodes into the brain and thus capture neural activities from both cortical and subcortical regions (Parvizi and Kastner, 2018; Herff et al., 2020). Therefore, SEEG provides the unique opportunity to make use of the neural information that cannot be accessed by other invasive techniques. A growing number of studies aiming at testing the possibilities and performance of building a motor BCI using SEEG signals have been reported (Vadera et al., 2013; Murphy et al., 2016; Fischer et al., 2017; Li et al., 2017; Wang et al., 2020).
For SEEG-based BCI, better decoding performance demands a higher quality of acquired SEEG signals, where data cleaning is essential to clean human electrophysiological recordings with nuisance noises. In literature, besides fundamental methods that exclude certain well-defined noise sources (e.g., 50/60 Hz line noise), there is no widely adopted standard for cleaning noise from iEEG data. Both manual and automated approaches are investigated to clean residual noises from the intracranial neural recordings. Specifically, manual methods (e.g., channels or epochs elimination) are used to remove noisy channels or epochs through manual identification (Nolan et al., 2010; Rangarajan et al., 2014; Fonken et al., 2016; Vass et al., 2016; Indira et al., 2017), whereas automated methods, such as common average referencing (CAR) (Schalk et al., 2007), closest white matter reference (CWM) (Arnulfo et al., 2015), bipolar reference (Kobayashi et al., 2009; Vidal et al., 2012; Burke et al., 2014; Greenberg et al., 2015; Sheehan et al., 2018), and Laplacian reference (Mercier et al., 2016; Li et al., 2018), are routinely used to improving signal quality in a global level. It is worth noting that investigators concentrate more attention on the signal itself, but few people assess the performance in BCI applications for these methods. A recent ECoG study has demonstrated that automated methods can produce substantial influence in improving brain state classification compared with manual methods (Meisler et al., 2019). However, for SEEG-based BCI, whether the automated methods have a contribution consistent with the expectation of investigators remains an open question. Accordingly, this paper aims to explore the effects of different automated data cleaning methods for SEEG recordings through practical assessment of gesture decoding performance and, subsequently, propose the optimal one.
Besides, we also comparatively investigated the differences between cleaned SEEG signals using the optimal method and other automated methods by conducting a multiple domains analysis. Finally, the reasons causing the differentiated effects were explored.
Materials and Methods
Subjects
Eight subjects with intractable epilepsy participated in this study. They had SEEG electrodes implanted for pre-surgical assessment of their seizure focus. The recording information of each subject is shown in Table 1. The subjects performed the task using hand contralateral to the hemisphere with the majority of electrodes implantation. The Ethics Committee of Huashan Hospital approved the study (Shanghai, China, approval ID: KY2019518), and informed consent for the study was obtained from all subjects.
Data Recording
SEEG signals were recorded using a clinical recording system (EEG-1200C, Nihon Kohden, Irvine, CA) system with 1,000- or 2,000-Hz sampling rates (Table 1). The original SEEG signals collected by the system were referenced to the average of two adjacent white matter contacts locating remotely from the suspected seizure focus and gray matter. To monitor the actual movement onset time during the experiment, two surface electromyographic (EMG) electrodes were used to record EMG signals from extensor carpi radialis muscle with the same sampling rate and same recording device as the SEEG signals simultaneously.
Experimental Protocol
The subjects participated in a visually cued hand movement task during the experiment (Figure 1). In detail, at the beginning of each trial, the subject faced an LED screen and took a rest for 4 s first without any hand movement before a cue (white cross, 1 s) appeared to inform them of the upcoming movement. After this, a picture illustrating one of three gestures (scissor, fist, and thumb) appeared on the screen randomly. Then, the subject executed the corresponding movement repeatedly for 5 s as soon as the appearance of a picture. Twenty trials were performed for each gesture, making the data from a total of 60 trials were collected for each subject.
Figure 1. Experimental paradigm. Each trial lasted 10 s (4-s rest, 1-s cue, and 5-s task). Also, one of three hand gestures (scissor, fist, and thumb) randomly appeared to inform participants to execute the corresponding movement during the task stage.
Electrode Localization
The subjects had a total of 88 electrode shafts (rounded mean ± std: 11 ± 4 per subject) and 1,115 contacts (rounded mean ± std: 139 ± 50 per subject) implanted (Figure 2). Each electrode contains 8–16 contacts, and each contact was 2.0 mm long with a 0.8-mm diameter and a 3.5-mm center-to-center spacing distance. We identified the location of all contacts in each individual brain model using pre-surgical magnetic resonance imaging, post-surgical computed tomographic images, Freesurfer software1, and the iEEGview toolbox (Li et al., 2020).
Figure 2. Electrode locations projected on the three-dimensional standard Montreal Neurological Institute brain model. (A–C) Brain model and implanted contacts (small red dots) in the sagittal, coronal, and transverse view, respectively.
Automated Data Cleaning Methods
The main goal of this study was to evaluate the potential influence of different data cleaning methods on decoding performance for SEEG-based BCI. To do this, five previously used automated methods (Li et al., 2018), including common average reference (CAR), gray–white matter reference (GWR), electrode shaft reference (ESR), bipolar reference, and Laplacian reference, were adopted for the evaluation.
For the CAR method, re-referencing was achieved by first creating an average of all channels and then subtracting the average signal from each channel (Kubanek et al., 2009; Miller et al., 2012). Slightly differently, for GWR, we re-referenced each channel located in the gray or white matter to the corresponding average of all gray and white matter channels separately. To do the ESR, we re-referenced each channel to the average signal of all channels located on the same shaft. For bipolar reference (Kobayashi et al., 2009; Vidal et al., 2012; Shirhatti et al., 2016), each channel was re-referenced to its adjacent channel on the same electrode shaft. Finally, to implement the Laplacian reference (He et al., 2008), each contact was re-referenced by the mean value of two adjacent contacts along the electrode shaft.
Besides, to answer the question of whether automated data cleaning can benefit decoding performance, we treated the signal without any re-referencing process (termed as a raw signal in this work) as a benchmark and conducted the same following analysis for these signals as well.
Data Preprocessing
For the collected original SEEG signals, we implemented the following preprocessing procedure. First, we resampled the SEEG signals to 1,000 Hz using the function (resample) in Matlab (MathWorks, Natick, MA, United States) to reduce computational cost and facilitate further computation across subjects. Then, we calculated a measure of line noise to remove channels with excessive line noise. Specifically, a second-order IIR peak filter (iirpeak) at 50 Hz was first applied to retain the 50-Hz frequency component. Then, we concatenated the filtered signals from all channels of each subject to calculate a cutoff threshold for noisy channels. Also, the threshold was defined as the summation of the median of the concatenation output and 10 times of its mean absolute deviation. The channels whose line noise was higher than the cutoff threshold were identified as bad channels and eliminated from further analysis. Then, a 50-Hz comb notch filter was applied to remove the possible line noise and its harmonics. After that, a 0.5–400 Hz band-pass filter (fourth-order Butterworth) was used to filter the resampled signal (Figure 3A). At the last step, the respective data cleaning methods described earlier (Automated data cleaning methods) were applied to clean the filtered signals (Figure 3C).
Figure 3. Framework of this work. (A) Single-trial filtered signals of selected channels. Redline represented task onset (appearance of movement cue, Figure 1). (B) Distribution of movement onsets across trials. Redline represented task onset, and purple asterisks represented detected movement onsets, and blackline represented mean value of movement onsets across trials. (C) A typical example of signals re-referenced by Laplacian reference, CAR, and Raw. Signals from one trial of a typical channel were shown. Blackline represented mean value of movement onsets across trials. (D) Feature extraction. Colored squares represented different frequency band powers. (E) Distribution of implanted channel (small blue dots) and selected channel (big red dots) from all subjects. All electrodes were projected on a standard Montreal Neurological Institute brain model in a transverse view. (F) Illustration of a calculation process for time–frequency difference (TFD). (G) Illustration of single-frequency band-based decoding accuracy (SDA). Single sub-band power of selected channels was taken separately to build the feature vector. (H) Relation to spatial locations. Take mean value of TFD within each frequency band during task period to calculate significance level of spectral difference.
Additionally, the obtained EMG signals were processed as well. In detail, we band-pass filtered (55–145 Hz, sixth-order Butterworth filter) the two EMG channels and subtracted the results from each other. Notably, such an adopted filtering setting can ensure the accurate detection of movement onsets’ use of high-frequency information of EMG signals. Then, for each trial, we detected the point where the absolute amplitude of EMG signals exceeded first time an adaptive threshold using the envelope of the processed EMG activity (Sedghamiz, 2018). Also, we detected the point where absolute EMG activity first time exceeded 1.5 times the average absolute value of EMG activity in the task stage. Finally, the earlier of the two points was treated as movement onset to ensure a robust EMG activity onset detection (Wang et al., 2020).
Finally, the SEEG signals were aligned to the movement onset (mean ± std: 0.72 ± 0.38 s after the onset of gesture picture) for each trial and each subject (Figure 3B). To be concise, the movement onset was defined as time 0. The baseline period and task period were defined as the [−4, −2] s and [0, 4] s around the movement onset, respectively. In each trial, the signals within [−4, 4] s around the movement onset were used in the following decoding analysis.
Performance Index–Decoding Accuracy
To quantify the decoding performance under different data cleaning methods, gesture decoding accuracy (DA) was computed. To do this, we conducted a gesture decoding procedure for each method and each subject separately. In detail, for each channel, we first segmented the cleaned signals from the task period using a time window at a length of 500 ms (overlapping by 250 ms) and then transformed the signals of each time bin into frequency domain with an autoregressive model of order 40 (Li et al., 2017). The spectral power in the sub-bands (e.g., 1–4, 4–8, 8–13, 13–30, 60–75, 75–95, 105–125, 125–145, and 155–195 Hz) were extracted by averaging transformed data within each interval (Figure 3D). After that, we normalized (i.e., Z-score) the obtained spectral power to the average power of the baseline period of all trials. In the last step of feature extraction, the normalized power from all frequency bands was concatenated together to build the feature vector for each channel.
Considering that SEEG electrodes were widely distributed inside the brain, channel selection was extremely necessary for the decoding purpose (Figure 3E). In this step, we used the forward search optimization algorithm to select the optimal channel set for each subject (see also Supplementary Figure 1; Kohavi and John, 1997; Lotte et al., 2018). More specifically, starting from the empty optimal channel set, we successionally selected one from all remaining channels, which produced the highest accuracy with the selected channels and incorporated it into the optimal channel set. To avoid the possible local extreme point during searching, the iteration was stopped when the accuracies reached a peak and did not increase after three more channels were added. Then, for each sample (i.e., time window), the feature vectors from all channels in the optimal channel set were concatenated together to construct the final feature vector for the classifier. Moreover, to achieve a robust classification, each element of the obtained final feature vector was renormalized across trials to maximally eliminate the difference of absolute amplitude among channels (Wang et al., 2020).
Finally, a Support Vector Machine classifier with a linear kernel was used for the classification of multiple hand gestures. Moreover, considering the data overlapping during the feature extraction, to avoid data leakage and make a fair evaluation for this process, we adopted trial-based 10-fold cross-validation. In brief, in each cross-validation, we first randomly selected ninefold (n = 54) of all trials (n = 60) as training set and the remaining onefold as testing set. For each subject and each method, the decoding accuracy was computed as the mean decoding value of all 10-fold cross-validation.
Comparison Indices
To further interpret the reasons causing the differentiated performances between the best-performing method and other data cleaning methods, three comparison indices were proposed and computed: (1) TFD, (2) SDA, and (3) relation to spatial locations.
Time–Frequency Difference
TFD was proposed in this work to quantify the spectral and temporal difference between different data cleaning methods. To compute the TFD, for each selected channel of each subject under each automated data cleaning method, we used the fast Fourier transform together with a Hanning window (timefreq function from EEGlab toolbox; Delorme and Makeig, 2004) for the purpose of time–frequency decomposition of trials (Band limits: 0–195 Hz) and stacked the decomposition of all trials to generate an array with three dimensions along time (t), frequency (f), and trials (i) [termed as TFT(t,f,i) here]. Then, we calculated trial-wise paired t-test of the TFT(t,f,i) between paired data cleaning methods (e.g., the optimal method and the other ones) and took the negative logarithm of the p-value for each element [e.g., −log(p(t, f))] to obtain the TFD of a single channel. Notably, given the multitude of selected channels for each subject and the fact of electrical field spread (leading to source leakage and volume conduction), Bonferroni correction was adopted for the p-values to correct the family-wise error. Finally, the distinguish index TFD (t, f) was calculated as the average across all selected channels of all subjects, where -log(0.05) was taken as the significance level (Figure 3F). Considering that the optimal channel set of different methods might be different, the selected channel set here was the union of the optimal channel sets of the two paired methods.
Single-Frequency Band-Based Decoding Accuracy
After evaluating the changes of data cleaning methods exerted to the signal itself, we further investigated how the induced changes affected the decoding performance. For this purpose, we calculated the SDA for different data cleaning methods. In detail, similar to the calculation of DA (Performance index–decoding accuracy), we separately took the single sub-band power (e.g., 1–4, 4–8, 8–13, 13–30, and 60–195 Hz) of the selected channels used in DA calculation to build the final feature vector (Figure 3G). Among these sub-bands, considering that high gamma band (60–195 Hz) had a comparatively larger frequency span, this band was segmented into multiple bins (e.g., 60–75, 75–95, 105–125, 125–145, and 155–195 Hz) to fully extract the effective neural information during the feature extraction. After this, a Support Vector Machine classifier (same as Performance index–decoding accuracy) was adopted to calculate the SDA for each subject and each data cleaning method.
Relation to Spatial Locations
To further answer the question of whether the differences induced between data cleaning methods have a relationship with spatial locations, we calculated two indices and measured the correlation between these two values. The first one is the significance level of spectral difference between the optimal data cleaning method and the other ones, where this value quantifies how much difference the two data cleaning methods exerting on the SEEG signal itself. The second index is the task relevance, and this value quantifies how much the SEEG channels correlate with the task. These two indices were computed for each channel and each subject. Moreover, to more precisely evaluate the potential relationship, the analysis was conducted using multiple frequency bands (e.g., 1–4, 4–8, 8–13, 13–30, and 60–195 Hz) separately.
To be more specific, for the calculation of the significance level of spectral difference, we took the mean value of the TFD (t, f) within each frequency band during the task period (Time–frequency difference) between the optimal method and the other methods (Figure 3H, see also Supplementary Figure 2). To calculate task relevance, we took the correlation between the single-frequency band power of each channel and the task. In detail, separately for each frequency band, we first applied a band-pass filter to signals processed by the optimal data cleaning method at this frequency band using a sixth-order Butterworth filter. Then, we extracted frequency band power by computing the squared absolute value of the Hilbert transform and obtained the mean frequency band power of the baseline period and task period for each trial separately. Second, we correlated (Spearman’s correlation coefficient) these power values of all trials with the corresponding task/baseline labels (e.g., 0/1) to get an observation of the r value. Third, we performed a permutation test in which we randomly shuffled the task/baseline labels and calculated the corresponding random r value (Schalk et al., 2007). The permutation test was repeated 1,000 times and generated a Gaussian distribution of 1,000 surrogate r values. The channel was considered statistically task-related if the computed channel r belonged to the 95th percentile of the Gaussian distribution (p < 0.05 after Bonferroni correction). Finally, task relevance was taken as the negative logarithm of the p-value [i.e., -log(p)].
After this, to determine whether there was a relationship between the significance level of spectral difference and the task relevance of each channel, we performed a first-order linear fitting using the function (polyfit) in Matlab. Moreover, F-test was used for regression analysis to assess the significance of the fitting equation (Shen and Faraway, 2004; Demidenko et al., 2012), where the function (stats.f.sf) from SciPy toolbox (Jones et al., 2001) was adopted to calculate the p-value of the F-test. Finally, the coefficient of determination (i.e., R-square) was calculated to evaluate the fitness of the line fitting.
Results
Decoding Accuracy of Different Data Cleaning Methods
The DA calculated under different data cleaning methods is shown in Figure 4. Compared with the signals without re-referencing (Raw in Figure 4), all data cleaning methods produced significantly higher DA values (p < 0.05, paired t-test), demonstrating that the adoption of proper data cleaning methods for SEEG signals could effectively enhance the decoding performance.
Figure 4. Decoding accuracy (DA) of different automated data cleaning methods (Laplacian reference, bipolar reference, ESR, CAR, GWR, and Raw). Asterisks denoted statistical significance level of accuracy value between groups (*p < 0.05, paired t-tests). Bars and error bars represented mean (across all subjects) accuracy and standard error, respectively. Bars arranged in descending order according to average accuracy value.
Among all the applied data cleaning methods, Laplacian reference produced the highest DA in average, reaching 85.2 ± 3.8% (mean ± s.e.). Bipolar and ESR ranked the second and third places; their DAs were lower than the Laplacian method by 0.2 and 3.7%, respectively. For CAR and GWR, the enhancing effects were weakest, and their DAs were significantly lower than Laplacian (p < 0.05, paired t-test, Figure 4).
Time–Frequency Difference Between Paired Data Cleaning Methods
We have shown that the Laplacian method produced the best decoding performance in the previous section (Decoding accuracy of different data cleaning methods). To further investigate which factors may contribute to this phenomenon, we calculated the TFD between the Laplacian reference and other automated data cleaning methods (Figure 5). As could be seen from the figure, the significant difference between the Laplacian reference and ESR, CAR, GWR, and Raw was mainly reflected in the low-frequency band (LFB, e.g., delta, theta, and alpha bands), where the TFD (t, f) was higher than the significance level [−log(0.05)], especially in the delta band (1–4 Hz) that was showing a more significant difference [TFD (t, f) > −log(0.01)]. In the temporal domain, the significant difference was mainly distributed in the [−1, 4] s around the movement onset. Comparing Laplacian reference with bipolar reference, the significant difference occurred in delta band during the [−0.5, 0] s around the movement onset (after the gesture picture appeared and before moving), and there was no significant difference during the task period. For all paired methods, no significant difference was seen during the baseline period.
Figure 5. Time–frequency difference (TFD) between paired data cleaning methods. (A) Laplacian reference versus bipolar reference; (B) Laplacian reference versus ESR; (C) Laplacian reference versus CAR; (D) Laplacian reference versus GWR; (E) Laplacian reference versus Raw. Difference in low-frequency region was partially enlarged for visualization purpose (Time, [−1, 4] s; Frequency, [0, 50] Hz). Colors represented TFD value between paired methods. Black dotted line represented movement onset, and [−4, 4] s around movement onset were presented.
Single-Frequency Band Based Decoding Accuracy for Different Data Cleaning Methods
To further explore whether it is the spectral difference that caused the differentiated decoding performance between paired methods, we calculated the SDAs of different methods (Single-frequency band-based decoding accuracy). The results are presented in Figure 6. Overall, the Laplacian method achieved the highest SDAs. More specifically, compared with CAR, GWR, and Raw, the SDAs of Laplacian reference were significantly higher in delta (p < 0.05, Bonferroni corrected, paired t-test, Figures 6C–E) and theta (p < 0.01, Bonferroni corrected, Figures 6C–E). Compared with ESR, the Laplacian method produced higher SDAs on average but without significance. For the comparison between bipolar and Laplacian references, these two methods produced very similar results. No significant difference between the SDAs of Laplacian reference and other methods in a high gamma band was seen. Notably, the statistical properties of SDAs in LFB (e.g., delta and theta bands) and high gamma band (60–195 Hz) were highly consistent with those observed in TFD (Figure 5, Time–frequency difference between paired data cleaning methods).
Figure 6. Single-frequency band-based decoding accuracy (SDA) of data cleaning methods for multiple sub-bands. (A) Laplacian reference versus bipolar reference; (B) Laplacian reference versus ESR; (C) Laplacian reference versus CAR; (D) Laplacian reference versus GWR; (E) Laplacian reference versus Raw. Sub-bands included delta band (1–4 Hz), theta band (4–8 Hz), alpha band (8–13 Hz), beta band (13–30 Hz), and high gamma band (60–195 Hz). Asterisks denoted statistical significance level of SDAs between paired methods (***p < 0.001; **p < 0.01; *p < 0.05, paired t-test). Box and line represented boxplot and mean accuracy across all subjects. (F) Comparison of SDAs of different frequency bands. Bars and error bars represented mean accuracy and standard error calculated across all subjects, respectively. Redline represented average SDAs across all methods for different bands, respectively.
On average, a high gamma band produced higher classification accuracy (69.2%, Figure 6F). On the contrary, the average SDAs across all methods in LFB were 5.7% (delta), 14.8% (theta), 23.3% (alpha), and 23.7% (beta) lower than that of high gamma band. Interestingly, among all automated methods, Laplacian reference and bipolar reference presented higher accuracy in delta band than the high gamma band, where the SDA of these two methods were 1.5 and 2.0% higher than that of high gamma band, respectively (Figure 6F).
Relationship Between Spatial Locations and Significance Level of Spectral Difference for Paired Methods
In the analysis mentioned earlier, we took the channels as a whole and calculated them in the mean sense and, thus, largely ignored the spatial (i.e., channel) information. In this section, we investigated the question of whether the difference between paired data cleaning methods would be more significantly presented on the channels that are more related to the task (Relation to spatial locations). Figure 7 shows the relationship between task relevance of all channels and the corresponding significance level of differences for all paired data cleaning methods (Laplacian versus others) in multiple frequency bands from a typical subject (Subject 5). As could be seen from Figure 7, there existed a significant linear relationship (p < 0.05, F-test) between the two indices for almost all subjects and all frequency bands (see also Supplementary Figure 3). In addition, for Laplacian reference, 85% (i.e., 74/87) of the selected channels across all subjects were task-related in a certain frequency band. Altogether, the results indicated that the Laplacian method could produce more prominent improvements to the signals of the most informative channels compared to other methods.
Figure 7. Relationship between task relevance and significance level of spectral difference for all channels from a typical subject (Subject 5). Frequency bands in delta, theta, alpha, beta, and high gamma band were analyzed, and five paired methods (Laplacian reference versus bipolar reference, ESR, CAR, GWR, and Raw, respectively) were presented. Each red dot represented one channel from the subject, and black dots represented the selected channels used for decoding under Laplacian reference (Performance index–decoding accuracy). In the subgraph, the x-axis represented the task relevance of channels, the y-axis represented the significance level of spectral difference of channels, and the straight gray line was the first order linear fitting of task relevance and significance level of difference. Three indices were presented (R-square, p-value of F-Test, and the number of subjects with significant linear relationship).
Discussion
In this study, we provided the first systematical evaluation of several automated data cleaning methods routinely used in SEEG studies, with the goal of verifying the effects of these methods and proposing the optimal method for SEEG-based BCI.
The results in this study showed that applying automated methods to a clean SEEG signal was able to improve the decoding accuracy. Among the listed methods, the Laplacian reference had the best performance. The current finding was in accordance with the previous SEEG study (Li et al., 2018), where Laplacian reference had also been demonstrated to be optimal in improving global signal quality metrics (e.g., the correlation of signals across channels). One step further, this work answered the subsequent question that the optimal SEEG data cleaning method is targeted for BCI purposes (Figures 5–7). The agreement reached in these two evaluation studies altogether suggested the application of Laplacian reference for further SEEG research.
To interpret the reasons causing the differentiated performances across data cleaning methods, we comparatively investigated the multiple-domain (e.g., spatial, spectral, and temporal domain) changes of SEEG signals. The results in Figure 5 shows that the Laplacian reference had a significant spectral difference in the LFB (e.g., delta, theta, and alpha bands) compared with the other automated methods (such as CAR and GWR, see also Supplementary Figure 2). Furthermore, we decoded the hand gestures based on a single-frequency band and found that the decoding performance of the Laplacian reference in LFB (e.g., delta and theta bands) significantly outperformed CAR and GWR (Figures 6A–E). Current results are also in agreement with the previous study, where Laplacian reference has been shown to be able to enhance the relationship with the task for LFB (e.g., alpha) power activity (Li et al., 2018).
Besides, in the frequency band, the power of the high-frequency band has been demonstrated to be correlated with the population-level cortical activity associated with different motor, sensory, or cognitive tasks (Gaona et al., 2011; Ray and Maunsell, 2011; Miller et al., 2014; Potes et al., 2014; Pesters et al., 2016; Branco et al., 2017) and, therefore, generally yield a higher classification accuracy than the low-frequency range (e.g., 8–32 Hz) in multiple tasks decoding (Ojemann, 2007; Kai et al., 2010; Gruenwald et al., 2017). The consistent results were found in this work (Figure 6F). Notably, several pieces of evidence tended to support the notion that the delta band also has an important effect on motion execution. Specifically, Graimann et al. (2004) showed a clear correlation between the normalized band power changes in the delta range with movement execution. Aleksandra found that the delta band carried significant motor-related information in classifying real wrist movements (Vuckovic and Sepulveda, 2008). In this work, the delta band had been found to be able to achieve the highest accuracy among the LFB of all automated methods, which highlighted the importance of the delta band in motion execution decoding (Schalk et al., 2007; Pistohl et al., 2012). Moreover, the delta band even achieved similar accuracy with the high-frequency band under the Laplacian or bipolar method in this work. One explanation for this result may be that the repetitive hand movement in our experiment required multiple movement initiation, which was modulated by delta band amplitude (Kobler et al., 2020). However, the reasons behind this result still need further investigation. Overall, the findings of this work highlight the importance of taking the high gamma band and delta band into consideration together for further BCI application.
Although we have presented the optimal SEEG data cleaning method for BCI usage based on the data observations, this work also has some limitations. First, for the comparison between Laplacian and other locally processed automated data cleaning methods (e.g., Bipolar reference and ESR), the Laplacian method held advantages on performance indicators on average but without significance (Figures 5, 6). This may be because of the limited number of subjects. Therefore, further study using a large number of subjects is still necessary for the next step. Second, in this study, we only used a DA to measure the effect of different automated methods on SEEG-based BCI without showing the anatomical information of the used key decoding electrodes. Due to the wide coverage in both cortical and subcortical levels for SEEG recordings, the spatial analysis on different brain regions is also of great importance for BCI research, which will be conducted in future work. Third, as the following study of our previous work (Li et al., 2018), we conduct analysis among the same different data cleaning methods and draw the conclusion within these tested methods. Although we also notice that some other data cleaning methods have been reported for SEEG recordings (Arnulfo et al., 2015; Schaworonkow and Voytek, 2021), further comparison between the Laplacian with these methods will be meaningful and thus worth of exploration in the future.
Conclusion
This study mainly explored the influence on the decoding performance of five automated data cleaning methods commonly used in SEEG studies. Moreover, we further investigated why the different methods may result in different decoding performances. The result showed that Laplacian reference produced the best enhancing effect on decoding performance, and such phenomenon may be caused by the increased ability on the task information retainment of low-frequency band activities compared with other data cleaning methods. This study provided practical guidance for the data cleaning method to be used for further BCI applications based on SEEG signals.
Data Availability Statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics Statement
The studies involving human participants were reviewed and approved by Ethics Committee of Huashan Hospital (Shanghai, China, Approval ID: KY2019518). The patients/participants provided their written informed consent to participate in this study.
Author Contributions
SL, GL, LC, and DZ: conceptualization. SL, GL, and XW: methodology. SL and GL: software, writing—original draft, and visualization. SL: formal analysis. SL, GL, SJ, DZ, and LC: investigation. SJ: data curation. SJ, XW, and DZ: writing—review and editing. DZ, LC, and JH: supervision. DZ and LC: project administration. DZ, LC, and GL: funding acquisition. All authors read and approved the final version of the manuscript.
Funding
This work was supported by grants from the National Natural Science Foundation of China (no. 91848112), China Post-doctoral Science Foundation (no. 20Z102060158), National Key R&D Program of China (2018YFB1307301), Medical and Engineering Cross Foundation of Shanghai Jiao Tong University (no. AH0200003), and the Science and Technology Commission of Shanghai Municipality (no. 18JC1410403).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
The authors would like to thank all subjects for their participation and patience in this study.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnins.2021.725384/full#supplementary-material
Footnotes
References
Acharya, S., Fifer, M. S., Benz, H. L., Crone, N. E., and Thakor, N. V. (2010). Electrocorticographic amplitude predicts finger positions during slow grasping motions of the hand. J. Neural Eng. 7:046002. doi: 10.1088/1741-2560/7/4/046002
Arnulfo, G., Hirvonen, J., Nobili, L., Palva, S., and Palva, J. M. (2015). Phase and amplitude correlations in resting-state activity in human stereotactical EEG recordings. Neuroimage 112, 114–127. doi: 10.1016/j.neuroimage.2015.02.031
Ball, T., Kern, M., Mutschler, I., Aertsen, A., and Schulze-Bonhage, A. (2009). Signal quality of simultaneously recorded invasive and non-invasive EEG. Neuroimage 46, 708–716. doi: 10.1016/j.neuroimage.2009.02.028
Bleichner, M. G., Freudenburg, Z. V., Jansma, J. M., Aarnoutse, E. J., Vansteensel, M. J., and Ramsey, N. F. (2016). Give me a sign: decoding four complex hand gestures based on high-density ECoG. Brain Struct. Funct. 221, 203–216. doi: 10.1007/s00429-014-0902-x
Branco, M. P., Freudenburg, Z. V., Aarnoutse, E. J., Bleichner, M. G., Vansteensel, M. J., and Ramsey, N. F. (2017). Decoding hand gestures from primary somatosensory cortex using high-density ECoG. Neuroimage 147, 130–142. doi: 10.1016/j.neuroimage.2016.12.004
Burke, J. F., Long, N. M., Zaghloul, K. A., Sharan, A. D., Sperling, M. R., and Kahana, M. J. (2014). Human intracranial high-frequency activity maps episodic memory formation in space and time. Neuroimage 85, 834–843. doi: 10.1016/j.neuroimage.2013.06.067
Chestek, C. A., Gilja, V., Blabe, C. H., Foster, B. L., Shenoy, K. V., Parvizi, J., et al. (2013). Hand posture classification using electrocorticography signals in the gamma band over human sensorimotor brain areas. J. Neural Eng. 10:026002. doi: 10.1088/1741-2560/10/2/026002
Collinger, J. L., Wodlinger, B., Downey, J. E., Wang, W., and Schwartz, A. B. (2013). High-performance neuroprosthetic control by an individual with tetraplegia. Lancet 381, 557–564. doi: 10.1016/S0140-6736(12)61816-9
Delorme, A., and Makeig, S. (2004). EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J. Neurosci. Methods 134, 9–21. doi: 10.1016/j.jneumeth.2003.10.009
Demidenko, E., Sargent, J., and Onega, T. (2012). Random effects coefficient of determination for mixed and meta-analysis models. Commun. Stat. 41, 953–969. doi: 10.1080/03610926.2010.535631
Fischer, P., Pogosyan, A., Cheeran, B., Green, A. L., Aziz, T. Z., Hyam, J., et al. (2017). Subthalamic nucleus beta and gamma activity is modulated depending on the level of imagined grip force. Exp. Neurol. 293, 53–61. doi: 10.1016/j.expneurol.2017.03.015
Flint, R. D., Wang, P. T., Wright, Z. A., King, C. E., Krucoff, M. O., Schuele, S. U., et al. (2014). Extracting kinetic information from human motor cortical signals. Neuroimage 101, 695–703. doi: 10.1016/j.neuroimage.2014.07.049
Fonken, Y. M., Rieger, J. W., Tzvi, E., Crone, N. E., Chang, E., Parvizi, J., et al. (2016). Frontal and motor cortex contributions to response inhibition: evidence from electrocorticography. J. Neurophysiol. 115, 2224–2236. doi: 10.1152/jn.00708.2015
Freeman, W. J., Rogers, L. J., Holmes, M., and Silbergeld, D. L. (2000). Spatial spectral analysis of human electrocorticograms including the alpha and gamma bands. J. Neurosci. Methods 95, 111–121. doi: 10.1016/S0165-0270(99)00160-0
Gaona, C. M., Sharma, M., Freudenburg, Z. V., Breshears, J. D., Bundy, D. T., Roland, J., et al. (2011). Nonuniform high-gamma (60-500 Hz) power changes dissociate cognitive task and anatomy in human cortex. J. Neurosci. 31, 2091–2100. doi: 10.1523/JNEUROSCI.4722-10.2011
Graimann, B., Huggins, J. E., Levine, S. P., and Pfurtscheller, G. (2004). Toward a direct brain interface based on human subdural recordings and wavelet-packet analysis. IEEE Trans. Biomed. Eng. 51, 954–962. doi: 10.1109/TBME.2004.826671
Greenberg, J. A., Burke, J. F., Haque, R., Kahana, M. J., and Zaghloul, K. A. (2015). Decreases in theta and increases in high frequency activity underlie associative memory encoding. Neuroimage 114, 257–263. doi: 10.1016/j.neuroimage.2015.03.077
Gruenwald, J., Kapeller, C., Guger, C., Ogawa, H., Kamada, K., and Scharinger, J. (2017). “Comparison of alpha/beta and high-gamma band for motor-imagery based BCI control: a qualitative study,” in Proceedings of the 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Banff, AB, 2308–2311. doi: 10.1109/SMC.2017.8122965
He, B. J., Snyder, A. Z., Zempel, J. M., Smyth, M., and Raichle, M. E. (2008). Electrophysiological correlates of the brain’s intrinsic large-scale functional architecture. Proc. Natl. Acad. Sci. U.S.A. 105, 16039–16044. doi: 10.1073/pnas.0807010105
Herff, C., Krusienski, D. J., and Kubben, P. (2020). The potential of stereotactic-EEG for brain-computer interfaces: current progress and future directions. Front. Neurosci. 14:123. doi: 10.3389/fnins.2020.00123
Hotson, G., Mcmullen, D. P., Fifer, M. S., Johannes, M. S., Katyal, K. D., Para, M. P., et al. (2016). Individual finger control of a modular prosthetic limb using high-density electrocorticography in a human subject. J. Neural Eng. 13:026017. doi: 10.1088/1741-2560/13/2/026017
Indira, G. C., Sol, E., Mikulan, E. P., Eugenia, H., Baglivo, F. H., Walter, S., et al. (2017). Attention, in and out: scalp-level and intracranial EEG correlates of interoception and exteroception. Front. Neurosci. 11:411. doi: 10.3389/fnins.2017.00411
Jones, E., Oliphant, T., and Peterson, P. (2001). SciPy: Open Source Scientific Tools for Python. Available online at: http://scipy.org
Kai, J. M., Schalk, G., Fetz, E., Nijs, M., Ojemann, J. G., and Rao, R. (2010). Cortical activity during motor execution, motor imagery, and imagery-based online feedback. Proc. Natl. Acad. Sci. U.S.A. 107, 4430–4435. doi: 10.1073/pnas.0913697107
Kobayashi, K., Jacobs, J., and Gotman, J. (2009). Detection of changes of high-frequency activity by statistical time-frequency analysis in epileptic spikes. Clin. Neurophysiol. 120, 1070–1077. doi: 10.1016/j.clinph.2009.03.020
Kobler, R. J., Kolesnichenko, E., Sburlea, A. I., and Müller-Putz, G. (2020). Distinct cortical networks for hand movement initiation and directional processing: an EEG study. Neuroimage 220:117076. doi: 10.1016/j.neuroimage.2020.117076
Kohavi, R., and John, G. H. (1997). Wrappers for feature subset selection. Artif. Intell. 97, 273–324. doi: 10.1016/S0004-3702(97)00043-X
Kubanek, J., Miller, K. J., Ojemann, J. G., Wolpaw, J. R., and Schalk, G. (2009). Decoding flexion of individual fingers using electrocorticographic signals in humans. J. Neural Eng. 6:066001. doi: 10.1088/1741-2560/6/6/066001
Lebedev, M. A., and Nicolelis, M. (2006). Brain-machine interfaces: past, present and future. Trends Neurosci. 29, 536–546. doi: 10.1016/j.tins.2006.07.004
Li, G., Jiang, S., Chen, C., Brunner, P., Wu, Z., Schalk, G., et al. (2020). iEEGview: an open-source multifunction GUI-based Matlab toolbox for localization and visualization of human intracranial electrodes. J. Neural Eng. 17:016016. doi: 10.1088/1741-2552/ab51a5
Li, G., Jiang, S., Paraskevopoulou, S. E., Wang, M., Xu, Y., Wu, Z., et al. (2018). Optimal referencing for stereo-electroencephalographic (SEEG) recordings. Neuroimage 183, 327–335. doi: 10.1016/J.NEUROIMAGE.2018.08.020
Li, G., Jiang, S., Yang, X., Wu, Z., and Zhang, D. (2017). “A preliminary study towards prosthetic hand control using human stereo-electroencephalography (SEEG) signals,” in Proceedings of the 8th International IEEE EMBS Conference on Neural Engineering, Shanghai.
Lotte, F., Bougrain, L., Cichocki, A., Clerc, M., Congedo, M., Rakotomamonjy, A., et al. (2018). A review of classification algorithms for EEG-based brain-computer interfaces: a 10 year update. J. Neural Eng. 15:031005. doi: 10.1088/1741-2552/aab2f2
Meisler, S. L., Kahana, M. J., and Ezzyat, Y. (2019). Does data cleaning improve brain state classification? J. Neurosci. Methods 328:108421. doi: 10.1016/j.jneumeth.2019.108421
Mercier, M. R., Bickel, S., Megevand, P., Groppe, D. M., Schroeder, C. E., Mehta, A. D., et al. (2016). Evaluation of cortical local field potential diffusion in stereotactic electro-encephalography recordings: a glimpse on white matter signal. Neuroimage 147, 219–232. doi: 10.1016/j.neuroimage.2016.08.037
Miller, K. J., Hermes, D., Honey, C. J., Hebb, A. O., Ramsey, N. F., Knight, R. T., et al. (2012). Human motor cortical activity is selectively phase-entrained on underlying rhythms. PLoS Comput. Biol. 8:e1002655. doi: 10.1371/journal.pcbi.1002655
Miller, K. J., Honey, C. J., Hermes, D., Rao, R. P., Dennijs, M., and Ojemann, J. G. (2014). Broadband changes in the cortical surface potential track activation of functionally diverse neuronal populations. Neuroimage 85, 711–720. doi: 10.1016/j.neuroimage.2013.08.070
Murphy, B. A., Miller, J. P., Kabilar, G., Bolu, A. A., and Gerwin, S. (2016). Contributions of subsurface cortical modulations to discrimination of executed and imagined grasp forces through stereoelectroencephalography. PLoS One 11:e0150359. doi: 10.1371/journal.pone.0150359
Nakanishi, Y., Yanagisawa, T., Shin, D., Chen, C., Kambara, H., Yoshimura, N., et al. (2014). Decoding fingertip trajectory from electrocorticographic signals in humans. Neurosci. Res. 85, 20–27. doi: 10.1016/j.neures.2014.05.005
Nolan, H., Whelan, R., and Reilly, R. B. (2010). FASTER: fully automated statistical thresholding for EEG artifact rejection. J. Neurosci. Methods 192, 152–162. doi: 10.1016/j.jneumeth.2010.07.015
Ojemann, J. G. (2007). Spectral changes in cortical surface potentials during motor movement. J. Neurosci. 27, 2424–2432.
Parvizi, J., and Kastner, S. (2018). Promises and limitations of human intracranial electroencephalography. Nat. Neurosci. 21, 474–483. doi: 10.1038/s41593-018-0108-2
Pesters, A. D., Coon, W. G., Brunner, P., Gunduz, A., Ritaccio, A. L., Brunet, N. M., et al. (2016). Alpha power indexes task-related networks on large and small scales: a multimodal ECoG study in humans and a non-human primate. Neuroimage 134, 122–131. doi: 10.1016/j.neuroimage.2016.03.074
Pistohl, T., Schulze-Bonhage, A., Aertsen, A., Mehring, C., and Ball, T. (2012). Decoding natural grasp types from human ECoG. Neuroimage 59, 248–260. doi: 10.1016/j.neuroimage.2011.06.084
Potes, C., Brunner, P., Gunduz, A., Knight, R. T., and Schalk, G. (2014). Spatial and temporal relationships of electrocorticographic alpha and gamma activity during auditory processing. Neuroimage 97, 188–195. doi: 10.1016/j.neuroimage.2014.04.045
Rangarajan, V., Hermes, D., Foster, B. L., Weiner, K. S., Jacques, C., Grill-Spector, K., et al. (2014). Electrical stimulation of the left and right human fusiform gyrus causes different effects in conscious face perception. J. Neurosci. 34, 12828–12836. doi: 10.1523/JNEUROSCI.0527-14.2014
Ray, S., and Maunsell, J. H. R. (2011). Different origins of gamma rhythm and high-gamma activity in macaque visual cortex. PLoS Biol. 9:e1000610. doi: 10.1371/journal.pbio.1000610
Schalk, G., Kubánek, J., Miller, K. J., Anderson, N. R., Leuthardt, E. C., Ojemann, J. G., et al. (2007). Decoding two-dimensional movement trajectories using electrocorticographic signals in humans. J. Neural Eng. 4, 264–275. doi: 10.1088/1741-2560/4/3/012
Schalk, G., and Leuthardt, E. C. (2012). Brain-computer interfaces using electrocorticographic signals. IEEE Rev. Biomed. Eng. 4, 140–154. doi: 10.1109/RBME.2011.2172408
Schaworonkow, N., and Voytek, B. (2021). Enhancing oscillations in intracranial electrophysiological recordings with data-driven spatial filters. bioRxiv [Preprint] 208. doi: 10.1101/2021.02.26.433127
Sedghamiz, H. (2018). BioSigKit: a Matlab toolbox and interface for analysis of BioSignals. J. Open Sour. Softw. 3:671. doi: 10.21105/joss.00671
Sheehan, T. C., Sreekumar, V., Inati, S. K., and Zaghloul, K. A. (2018). Signal complexity of human intracranial EEG tracks successful associative-memory formation across individuals. J. Neurosci. 38, 1744–1755. doi: 10.1523/JNEUROSCI.2389-17.2017
Shen, Q., and Faraway, J. (2004). An F test for linear models with functional responses. Stat. Sin. 14, 1239–1257. doi: 10.1007/s00440-004-0363-x
Shirhatti, V., Borthakur, A., and Ray, S. (2016). Effect of reference scheme on power and phase of the local field potential. Neural Comput. 28, 882–913. doi: 10.1162/NECO_a_00827
Slutzky, M. W., Jordan, L. R., Krieg, T., Ming, C., Mogul, D. J., and Miller, L. E. (2010). Optimal spacing of surface electrode arrays for brain-machine interface applications. J. Neural Eng. 7:26004. doi: 10.1088/1741-2560/7/2/026004
Spueler, M., Walter, A., Ramos-Murguialday, A., Naros, G., Birbaumer, N., Gharabaghi, A., et al. (2014). Decoding of motor intentions from epidural ECoG recordings in severely paralyzed chronic stroke patients. J. Neural Eng. 11:066008. doi: 10.1088/1741-2560/11/6/066008
Vadera, S., Marathe, A. R., Gonzalez-Martinez, J., and Taylor, D. M. (2013). Stereoelectroencephalography for continuous two-dimensional cursor control in a brain-machine interface. Neurosurg. Focus 34:E3. doi: 10.3171/2013.3.FOCUS1373
Vass, L., Copara, M., Seyal, M., Shahlaie, K., Farias, S. T., Shen, P., et al. (2016). Oscillations go the distance: low-frequency human hippocampal oscillations code spatial distance in the absence of sensory cues during teleportation. Neuron 89, 1180–1186. doi: 10.1016/j.neuron.2016.01.045
Vidal, J. R., Freyermuth, S., Jerbi, K., Hamamé, C. M., and Lachaux, J. P. (2012). Long-distance amplitude correlations in the high gamma band reveal segregation and integration within the reading network. J. Neurosci. 32, 6421–6434. doi: 10.1523/JNEUROSCI.4363-11.2012
Volkova, K., Lebedev, M. A., Kaplan, A., and Ossadtchi, A. (2019). Decoding movement from electrocorticographic activity: a review. Front. Neuroinform. 13:74. doi: 10.3389/fninf.2019.00074
Vuckovic, A., and Sepulveda, F. (2008). Delta band contribution in cue based single trial classification of real and imaginary wrist movements. Med. Biol. Eng. Comput. 46, 529–539. doi: 10.1007/s11517-008-0345-8
Keywords: brain–computer interface, stereo-electroencephalography, data cleaning, re-referencing method, gesture decoding
Citation: Liu S, Li G, Jiang S, Wu X, Hu J, Zhang D and Chen L (2021) Investigating Data Cleaning Methods to Improve Performance of Brain–Computer Interfaces Based on Stereo-Electroencephalography. Front. Neurosci. 15:725384. doi: 10.3389/fnins.2021.725384
Received: 15 June 2021; Accepted: 01 September 2021;
Published: 06 October 2021.
Edited by:
Gabriele Arnulfo, University of Genoa, ItalyReviewed by:
Andrea Canessa, University of Genoa, ItalyJohn Magnotti, University of Pennsylvania, United States
Copyright © 2021 Liu, Li, Jiang, Wu, Hu, Zhang and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Dingguo Zhang, ZC56aGFuZ0BiYXRoLmFjLnVr; Liang Chen, dmhzY2hlbmxpYW5nQGZ1ZGFuLmVkdS5jbg==
†These authors have contributed equally to this work and share first authorship