
ORIGINAL RESEARCH article

Front. Comput. Neurosci., 16 June 2022
This article is part of the Research Topic Brain-inspired Machine Learning and Computation for Brain-Behavior Analysis Vol II.

A P300-Detection Method Based on Logistic Regression and a Convolutional Neural Network

Qi Li1,2*, Yan Wu1, Yu Song1, Di Zhao1, Meiqi Sun1, Zhilin Zhang3*, Jinglong Wu3
  • 1School of Computer Science and Technology, Changchun University of Science and Technology, Changchun, China
  • 2Zhongshan Institute of Changchun University of Science and Technology, Zhongshan, China
  • 3Research Center for Medical Artificial Intelligence, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China

Background: Electroencephalogram (EEG)-based brain-computer interface (BCI) systems are widely utilized in various fields, including health care, intelligent assistance, identity recognition, emotion recognition, and fatigue detection. The P300, a major event-related potential component, is the primary signal detected by many EEG-based BCI systems. Existing algorithms for P300 classification in EEG data usually perform well when trained and tested on a single participant but exhibit significant decreases in accuracy when tested on new participants. We attempted to address this lack of generalizability using a novel convolutional neural network (CNN) model that incorporates logistic regression (LR).

Materials and Methods: We proposed an LR-CNN model comprising two parts: a combined LR-based memory model and a CNN-based generalization model. The LR-based memory model can learn the individual features of participants and addresses the decrease in accuracy caused by individual differences when applied to new participants. The CNN-based generalization model can learn the common features among participants, thereby reducing overall classification bias and improving overall classification accuracy.

Results: We compared our method with existing, commonly used classification methods through three different sets of experiments. The experimental results indicated that our method could learn individual differences among participants. Compared with other commonly used classification methods, our method yielded markedly better classification of data from new participants, maintaining accuracy above 90%.

Conclusion: When applied to new participants, the proposed model achieves better accuracy than existing, commonly used classification methods. Such improvements in cross-subject test accuracy will aid in the development of BCI systems.

Introduction

Electroencephalogram (EEG)-based brain-computer interface (BCI) systems use brain signals to transmit information. Their primary advantage is that self-elicited EEG signals allow patients with loss of motor function in the limbs or with language impairments to communicate with others by controlling external devices. As BCI systems gradually move from the laboratory to the market, they require stronger adaptability (Hochberg et al., 2006; Hoffmann et al., 2008; Xu et al., 2021).

The study of EEG classification problems is important for the application and development of BCI systems (Hu et al., 2019; Hu and Zhang, 2020; Li, 2020; Li et al., 2020). In EEG, event-related potentials (ERPs) are specific voltage signals generated in the brain in response to a task (e.g., gazing at numbers, letters, or pictures). Changes in patients’ ERPs can aid in identifying early cognitive impairment in diseases such as Alzheimer’s disease, and the study of ERPs is important for medical rehabilitation, medical diagnosis, and improving EEG-based communication systems (Luck, 2005; Wang et al., 2017; Yan et al., 2018; Liu et al., 2022). Lu et al. (2020) constructed time-varying networks for ERPs in AV and V spelling paradigms based on the adaptive directed transfer function to investigate the dynamic processes underpinning stimulus processing in the two spelling paradigms. The P300 potential, which is usually detected about 300 ms after the appearance of the target stimulus (Farwell and Donchin, 1988; Rakotomamonjy and Guigue, 2008), is generally used as the primary reference potential. Lu et al. (2019) designed a novel audiovisual P300-speller paradigm to improve the performance of vision-based P300 spelling systems. P300 data are usually represented as a matrix with two dimensions (channel and time), whose values are the true EEG amplitudes recorded from the individual during task performance (Rakotomamonjy and Guigue, 2008). Li et al. (2019) attempted to improve P300-speller performance by increasing the user’s mental workload. The traditional EEG classification pipeline includes preprocessing, feature extraction, and classification (Rashid et al., 2019). Liu et al. (2021) proposed a sparse representation of the P300 spelling paradigm to address the classification of brain signals with high dimensionality and a low signal-to-noise ratio. Preprocessing usually includes filtering, baseline correction, and artifact removal (Kundu and Ari, 2020). Feature extraction obtains the most discriminative features from the raw EEG data (Dodia et al., 2019). Traditional feature extraction typically draws features from the time domain (e.g., variance, mean, and kurtosis) (Martin-Smith et al., 2017), the frequency domain (e.g., fast Fourier transformation) (Park and Chung, 2018), and the time-frequency domain (e.g., discrete wavelet transformation) (Amini et al., 2013).

Machine learning and deep learning have been widely used in image recognition (Liu et al., 2019), medical diagnosis (Zhao et al., 2022), intelligent assistance, and other fields. In recent years, these methods have also become a hot topic in P300 signal detection. The P300 EEG signal is a complex combination of superimposed multi-band waveforms, and a number of classification methods have been used to decode it. For example, principal component analysis (PCA) and local Fisher discriminant analysis (LFDA) are commonly used to reduce the dimensionality of features (Bernat et al., 2007), while linear discriminant analysis (LDA) (Dodia et al., 2019), support vector machine (SVM) (Li et al., 2014), decision tree (DT) (Guan et al., 2019), random forest (RF) (Akram et al., 2015; Masud et al., 2018), AdaBoost (ADB) (Hongzhi et al., 2012; Yildirim and Halici, 2014), and k-nearest neighbor (k-NN) methods (Guney et al., 2021) are commonly used for P300 classification.

Since traditional preprocessing and feature extraction methods are highly complex and time-consuming and can lead to a loss of important information after extraction, automated feature extraction algorithms are considered important (Shan et al., 2018). Deep learning represents a better solution to this problem (Oralhan, 2020), as it provides an excellent algorithm for automatically extracting discriminative features. Deep learning is currently adopted in many studies, as it can learn features from the original data well (Kshirsagar G. and Londhe N., 2019; Kshirsagar G. B. and Londhe N. D., 2019; Borra et al., 2021). At present, the main deep learning algorithms used for EEG data processing include convolutional neural networks (CNNs), recurrent neural networks (RNNs), deep belief networks (DBNs), autoencoders (AEs), and other models (Maddula et al., 2017; Lu et al., 2018). Cecotti et al. introduced a CNN for detecting the P300 ERP in BCIs (Cecotti and Gräser, 2011); they proposed seven CNN-based classifiers and evaluated their performance, with excellent results. Feng et al. used a PCA-based CNN classification algorithm to classify P300 data (Li et al., 2020). Ditthapron et al. used multi-task AE-based feature extraction for EEG classification (Ditthapron et al., 2019). Vařeka et al. used stacked AEs for P300 classification (Vařeka and Mautner, 2017). Since RNN models have achieved good results in sequential recognition tasks such as speech recognition (Lipton et al., 2015), long short-term memory (LSTM) networks have also been applied to EEG recognition. Joshi et al. proposed a neural network model based on convolutional long short-term memory (ConvLSTM), in which a CNN and an LSTM were used to capture spatial and temporal information, respectively; this effective use of temporal as well as spatial features yielded better performance than a single system (Joshi et al., 2018). Kundu et al. proposed a PCA-based ensemble of weighted SVMs (PCA-EWSVM) classifier, in which different weights were assigned to each classifier, with the largest weight assigned to the best classifier so that it had the greatest impact on the final output (Kundu and Ari, 2017). Kshirsagar et al. constructed a weighted ensemble of deep CNNs, which effectively reduced the classification error rate of single models, and adopted a new channel dropout-based character-detection approach, which further reduced the false detection rate arising from a single trial (Krusienski et al., 2008). Feng et al. proposed an automatic channel selection algorithm for P300 EEG signals based on group sparsity Bayesian logistic regression (Feng and Zhang, 2019). Abibullaev et al. proposed multiple network structures for P300 signal analysis in order to select a more accurate classifier to decode the signal; however, the complexity of these network structures increases algorithmic complexity and the number of parameters, making them unsuitable for online applications (Abibullaev and Zollanvari, 2021). Xu et al. proposed an online pre-alignment strategy for motor imagery signals to address the generalization ability of models across datasets and obtained good results; in 2021, the same team combined adaptive batch normalization (AdaBN) with alignment strategies to reduce internal covariate shift between datasets. However, this approach is applied mainly to dataset processing before validation and is insensitive to individual features when used for P300 signal processing (Xu et al., 2020; Xu et al., 2021).

Stochastic gradient descent (SGD) is commonly utilized to optimize deep learning algorithms (Bottou, 2010). In 2009, Duchi and Singer proposed the forward-backward splitting (FOBOS) optimization algorithm (Duchi and Singer, 2009a,b), which divides the regularized gradient descent problem into an empirical-loss gradient descent iteration and an optimization problem. However, FOBOS only uses the gradient of the previous iteration, without accumulation, and does not achieve effective sparsity. In 2010, Xiao proposed the regularized dual averaging (RDA) optimization algorithm (Xiao, 2010), which relies on gradient accumulation: when the average of the accumulated gradients in a dimension falls below a threshold, the weight of that dimension is set to zero, which ensures that the weights are fully trained. In 2011, McMahan et al. proposed the Follow the Regularized Leader (FTRL) algorithm (McMahan, 2011), which has worked well in many recommender systems (McMahan et al., 2013; Kim, 2017). FTRL is a general optimization algorithm for online learning and prediction that inherits the idea of SGD. Based on a truncated gradient, it absorbs the advantages of the FOBOS and RDA optimization algorithms, addressing the sparsity problem while improving accuracy, and thus has great potential for application to online prediction systems.

Previous studies have shown that the amplitude and latency of the P300 component vary across individuals according to sex and age (Carlson and Iacono, 2006; Brumback et al., 2012; Dinteren et al., 2014). Therefore, an EEG-processing algorithm should have certain memorization and generalization capabilities for individuals of different ages and sexes to allow it to discover the common features among participants and effectively aid in subsequent classification. When we tested the existing commonly used algorithms, we found that they performed well on a single dataset (>95% accuracy). However, when we tested them on other available datasets and private datasets, the accuracy dropped significantly to around 80%.

To address this sharp decrease in accuracy when existing algorithms are applied to new datasets, we proposed an LR-CNN model that combines logistic regression (LR) and a CNN. In this model, optimization is achieved via the FTRL algorithm, which addresses the sparsity problem, and the network is applied to P300 detection, which also addresses sparse data features and improves the robustness of the algorithm across participants. We conducted experiments using a private dataset and compared our results with those of previous studies to determine whether our proposed method can achieve better results in the detection of the P300 component between different individuals.

Materials and Methods

Participants

This study was approved by the Ethics Committee of Changchun University of Science and Technology. All participants provided written informed consent after receiving a detailed explanation of the experimental procedure. Eight healthy, right-handed individuals with no mental disorders, normal or corrected-to-normal vision, and a stereopsis acuity better than 60″ participated in the study.

Paradigm Flow

Preparation for the EEG experiment included placing electrode caps on each participant, applying electrode paste to reduce the resistance to a reasonable range, and informing the participants of the experimental procedure and tasks.

We prepared two-dimensional and three-dimensional P300 speller paradigms, which were presented to participants using an NVIDIA stereo display with a resolution of 1,920 × 1,080 and a refresh rate of 120 Hz. Following EEG preparations, participants sat directly in front of the stereo display with their eyes 90 cm from the screen. Each task involved four sessions in which a word containing five characters was presented. Each session contained five runs, and each run output one character. The two tasks included a total of eight sessions. To avoid learning effects, the eight sessions were presented in pseudo-random order for each participant. Because the task paradigm used a 6 × 7 character matrix, each sequence comprised 13 flashes (six rows and seven columns), during which the target character flashed twice (once in its row and once in its column). Each run included eight sequences (i.e., a target character was output after 104 flashes, as shown in Figure 1). During the tasks, participants were required to stare at the target character and silently count the number of times it flashed. They were permitted a 5-min rest period after each session.

FIGURE 1

Figure 1. P300 speller paradigms.

Dataset Processing

The EEG data used in this study were collected during the experiment using 64 active electrodes. Considering the amount of data requiring processing, we selected 30 standard electrodes according to the International 10-20 electrode system: FP1, FPz, FP2, F7, F3, Fz, F4, F8, FC3, FCz, FC4, T7, C3, Cz, C4, T8, CP3, CPz, CP4, P7, P3, Pz, P4, P8, PO5, POz, PO6, O1, Oz, and O2. The raw EEG data of each participant were then preprocessed as follows.

The raw EEG data were bandpass filtered between 0.1 and 30 Hz with a third-order Butterworth filter. The EEG was segmented according to the coding value of each flash marker, from 100 ms before to 800 ms after stimulus presentation, yielding 900-ms epochs for analysis. The data between −100 and 0 ms were used for baseline correction. Based on the characteristics of the brainwaves, a threshold of ±100 μV was used for artifact removal.
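As a rough illustration of this preprocessing pipeline, the sketch below assumes the continuous recording is stored as a NumPy array of shape (channels, samples) at a hypothetical sampling rate of 1,000 Hz (the actual rate is not stated here); the filter order, epoch window, baseline interval, and ±100 μV threshold follow the description above.

```python
import numpy as np
from scipy.signal import butter, filtfilt

FS = 1000  # assumed sampling rate in Hz (not stated in the text)

def bandpass(eeg, fs=FS, low=0.1, high=30.0, order=3):
    """Third-order Butterworth bandpass filter, 0.1-30 Hz, applied along time."""
    b, a = butter(order, [low / (fs / 2), high / (fs / 2)], btype="band")
    return filtfilt(b, a, eeg, axis=-1)

def epoch_and_clean(eeg, flash_onsets, fs=FS):
    """Cut -100..800 ms epochs around each flash marker, baseline-correct on
    -100..0 ms, and reject epochs exceeding +/-100 microvolts."""
    pre, post = int(0.1 * fs), int(0.8 * fs)
    epochs = []
    for onset in flash_onsets:                                  # marker sample indices
        seg = eeg[:, onset - pre:onset + post]                  # 900-ms epoch
        seg = seg - seg[:, :pre].mean(axis=1, keepdims=True)    # baseline correction
        if np.abs(seg).max() <= 100.0:                          # artifact rejection
            epochs.append(seg)
    return np.stack(epochs)

# usage: epochs = epoch_and_clean(bandpass(raw_eeg), flash_onsets)
```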

In the prediction phase, the data for the previous 1 s were collected every 0.5 s and input into the model.
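A minimal sketch of this sliding-window buffering, under the same assumed 1,000 Hz sampling rate; each yielded window is what the classifier receives.

```python
def sliding_windows(stream, fs=1000, win_s=1.0, step_s=0.5):
    """Yield the most recent 1-s buffer from a (channels, samples) stream every 0.5 s."""
    win, step = int(win_s * fs), int(step_s * fs)
    for end in range(win, stream.shape[1] + 1, step):
        yield stream[:, end - win:end]   # each buffer is passed to the model
```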

Model

The model proposed in this study comprised two parts: a memorizing model and a generalizing model. The memorizing model was defined as an information retrieval model, expressed as follows:

$$f(x)=\mathrm{sigmoid}\left(w^{T}x+b\right),\qquad x=\left(x_{1},x_{2},\cdots,x_{d_{1}+d_{2}+d_{\varphi}}\right) \tag{1}$$

where $w^{T}=(w_{1},w_{2},\cdots,w_{d_{1}+d_{2}+d_{\varphi}})$ represents the regression coefficients, $b$ represents the bias, $x=(x_{1},x_{2},\cdots,x_{d_{1}+d_{2}+d_{\varphi}})$ represents the vector of independent variables, $d_{1}$ is the dimension of the dense EEG data, $d_{2}$ is the dimension of the sparse (one-hot encoded) participant feature data, and $d_{\varphi}$ is the dimension of the data after the cross-product transformation, which was defined as follows:

$$\psi_{k}(x)=\prod_{i=1}^{d}x_{i}^{c_{ki}},\qquad c_{ki}\in\{0,1\} \tag{2}$$

In Eq. (2), $c_{ki}$ is a Boolean value indicating whether the $i$-th feature is included in the $k$-th cross-product transformation.

The prediction function is a Sigmoid function as shown in Eq. (3).

$$f(z)=\frac{1}{1+e^{-z}},\qquad z\in(-\infty,+\infty) \tag{3}$$

This function constrains the predicted value to the interval (0, 1) and is used to classify whether a P300 signal is present.
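To make Eqs. (1)–(3) concrete, the sketch below expresses the memorizing (wide) part as a single linear layer followed by a sigmoid in PyTorch. It is our own illustrative reconstruction, not the authors' code: in particular, it assumes the crossed features $d_{\varphi}$ are the products of the dense EEG features with the one-hot participant indicators, which the text does not specify.

```python
import torch
import torch.nn as nn

class WideLR(nn.Module):
    """Memorizing part: LR over dense EEG features (d1), one-hot participant
    features (d2), and their cross products (d_phi = d1 * d2, an assumption)."""
    def __init__(self, d1, d2):
        super().__init__()
        self.linear = nn.Linear(d1 + d2 + d1 * d2, 1)

    def forward(self, x_dense, x_onehot):
        # cross-product transformation (Eq. 2): products of selected feature pairs
        cross = torch.einsum("bi,bj->bij", x_dense, x_onehot).flatten(1)
        x = torch.cat([x_dense, x_onehot, cross], dim=1)
        return torch.sigmoid(self.linear(x))   # Eqs. (1) and (3)
```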

The generalizing model was a CNN, as shown in Figure 2. Tables 1, 2 show the network parameters. The original Wide & Deep (W&D) model used a multilayer perceptron (MLP) network as its deep component. Since the EEG signal is dense, with features in each frequency band, and a CNN can efficiently analyze fixed-length signal data, a multilayer one-dimensional CNN was selected as the generalizing model. The structure with the best result, three layers with 1 × 3 filters, was obtained through exhaustive search (a schematic sketch of this generalizing part is given after Table 2 below).

FIGURE 2

Figure 2. Structure diagrams of the models.

TABLE 1

Table 1. Parameter table for CNN section.

TABLE 2

Table 2. Parameter table for LR section.
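The exact layer settings are given in Tables 1, 2, which are not reproduced in this text, so the following is only a schematic of the generalizing part as described above: a three-layer one-dimensional CNN with 1 × 3 (kernel size 3) filters followed by a linear head. The channel widths, pooling, and use of 30 input channels are our assumptions.

```python
import torch
import torch.nn as nn

class DeepCNN(nn.Module):
    """Generalizing part: three 1-D convolutional layers with kernel size 3
    (the 1 x 3 filters described in the text); channel widths are assumed."""
    def __init__(self, in_channels=30, widths=(16, 32, 64)):
        super().__init__()
        layers, prev = [], in_channels
        for w in widths:
            layers += [nn.Conv1d(prev, w, kernel_size=3, padding=1),
                       nn.ReLU(),
                       nn.MaxPool1d(2)]
            prev = w
        self.features = nn.Sequential(*layers)
        self.pool = nn.AdaptiveAvgPool1d(1)      # collapse the time axis
        self.head = nn.Linear(widths[-1], 1)

    def forward(self, x):                        # x: (batch, channels, time)
        a_lf = self.pool(self.features(x)).squeeze(-1)   # final activations a^{l_f}
        return self.head(a_lf)                   # deep logit, combined with the wide part in Eq. (6)
```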

Optimization Algorithm

The FTRL algorithm was used to optimize our mixed model. The FOBOS algorithm introduces L1 regularization based on gradient descent and sets a threshold for parameter truncation; however, this threshold slowly decreases as the number of iterations increases, and the sparsity it achieves still needs improvement.

The FTRL algorithm combines the advantages of the SGD and FOBOS algorithms, inheriting the accuracy of SGD updates and the good weight sparsity of the FOBOS algorithm (Duchi and Singer, 2009b). This gives our EEG classification model good real-time performance. The weight-updating formula was as follows:

$$W_{t+1}=\arg\min_{W}\left\{G_{1:t}\cdot W+\frac{1}{2}\sum_{s=1}^{t}\sigma_{s}\lVert W-W_{s}\rVert_{2}^{2}+\lambda_{1}\lVert W\rVert_{1}+\frac{1}{2}\lambda_{2}\lVert W\rVert_{2}^{2}\right\} \tag{4}$$

where $G_{1:t}$ represents the sum of the gradients accumulated up to time $t$, $\sigma_{s}$ represents the learning rate (which requires two input parameters and yields per-coordinate learning rates), and $\lambda_{1}\lVert W\rVert_{1}$ and $\frac{1}{2}\lambda_{2}\lVert W\rVert_{2}^{2}$ are the L1 and L2 regularization terms, respectively. The purpose of L1 regularization was to increase the sparsity of the network weights, allowing real-time recognition of the EEG data. The purpose of L2 regularization was to increase the smoothness of the results during optimization. The FTRL algorithm updates each dimension of $W$ separately (i.e., with a different learning rate per dimension), which accounts for the uneven feature distribution of the data. If a feature dimension is rarely active (i.e., its value is usually 0), every sample in which it does appear becomes very important, and the corresponding learning rate for that feature can be kept large. The specific weight-update procedure is shown in Algorithm 1.

Algorithm 1: FTRL with L1 and L2 regularization.

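Algorithm 1 appears only as an image in the original article; as a stand-in, below is a minimal per-coordinate FTRL-Proximal update with L1 and L2 regularization in the standard form of McMahan et al. (2013). It is not necessarily identical to the authors' Algorithm 1, and the hyperparameter names (alpha, beta, l1, l2) follow that paper rather than this one.

```python
import numpy as np

class FTRLProximal:
    """Per-coordinate FTRL-Proximal with L1/L2 regularization (McMahan et al., 2013)."""
    def __init__(self, dim, alpha=0.1, beta=1.0, l1=1.0, l2=1.0):
        self.alpha, self.beta, self.l1, self.l2 = alpha, beta, l1, l2
        self.z = np.zeros(dim)      # accumulated (adjusted) gradients
        self.n = np.zeros(dim)      # accumulated squared gradients
        self.w = np.zeros(dim)      # current weights

    def update(self, grad):
        """One weight update from the gradient of the logistic loss."""
        sigma = (np.sqrt(self.n + grad ** 2) - np.sqrt(self.n)) / self.alpha
        self.z += grad - sigma * self.w
        self.n += grad ** 2
        # closed-form per-coordinate solution of Eq. (4): soft-thresholding by l1
        active = np.abs(self.z) > self.l1
        self.w = np.where(
            active,
            -(self.z - np.sign(self.z) * self.l1)
            / ((self.beta + np.sqrt(self.n)) / self.alpha + self.l2),
            0.0,
        )
        return self.w
```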

Training Phase

In the training phase, FTRL was used as the optimizer, and the logistic loss was used as the loss function. The loss function of logistic regression was formulated as follows:

$$J(w)=-\frac{1}{m}\sum_{i=1}^{m}\left[y^{(i)}\log\!\left(\sigma\!\left(w^{T}x^{(i)}+b\right)\right)+\left(1-y^{(i)}\right)\log\!\left(1-\sigma\!\left(w^{T}x^{(i)}+b\right)\right)\right],\quad x=\left(x_{1},x_{2},\cdots,x_{d_{1}+d_{2}+d_{\varphi}}\right) \tag{5}$$

Suppose there are $m$ samples. Here, $x^{(i)}$ denotes the $i$-th sample, whose dimension is $d_{1}+d_{2}+d_{\varphi}$; $\sigma(\cdot)$ denotes the sigmoid function; and $w$ is the parameter vector to be solved for, such that $\sigma(w^{T}x^{(i)}+b)$ yields the predicted probability. $y^{(i)}$ denotes the label of the $i$-th sample. When $y^{(i)}=1$, the second term is 0; when $y^{(i)}=0$, the first term is 0. The losses for positive and negative predictions are thus combined through $y^{(i)}$, and averaging over the $m$ samples yields the loss function of logistic regression.
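A direct NumPy transcription of Eq. (5) may help; the small clipping constant is added only for numerical stability and is not part of the equation.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def logistic_loss(w, b, X, y, eps=1e-12):
    """Eq. (5): mean negative log-likelihood over m samples.
    X has shape (m, d1 + d2 + d_phi); y holds the 0/1 labels."""
    p = np.clip(sigmoid(X @ w + b), eps, 1.0 - eps)
    return -np.mean(y * np.log(p) + (1.0 - y) * np.log(1.0 - p))
```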

Prediction Phase

Since the prediction phase involved data from a single participant, the embedding layer only needed to be calculated once (because the sparse tensor encoding the participant's features is fixed). Therefore, the proposed model exhibits good real-time performance and predictive robustness.

The prediction formula was as follows:

$$P(Y=1\mid x)=\sigma\left(w_{wide}^{T}\,[x,\phi(x)]+w_{Conv}^{T}\,a^{l_{f}}+b\right) \tag{6}$$

where $Y$ is the binary class label, $\sigma(\cdot)$ is the sigmoid function, $\phi(x)$ denotes the cross-product transformations of the original features $x$, and $b$ is the bias term. Further, $w_{wide}^{T}$ is the vector of all wide-model weights, and $w_{Conv}$ are the weights applied to the final activations $a^{l_{f}}$.
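Read literally, Eq. (6) sums the wide logit, the weighted final CNN activations, and a bias before a single sigmoid; a minimal sketch with hypothetical tensor shapes is shown below.

```python
import torch

def predict_p300(w_wide, x_wide, w_conv, a_lf, b):
    """Eq. (6): x_wide stacks [x, phi(x)] with shape (batch, d1 + d2 + d_phi);
    a_lf holds the final CNN activations a^{l_f} with shape (batch, k)."""
    logit = x_wide @ w_wide + a_lf @ w_conv + b
    return torch.sigmoid(logit)       # P(Y = 1 | x); threshold at 0.5 for a label
```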

Results

We conducted three experiments to verify the effectiveness of the proposed model. We performed 5-fold cross-validation and took the average as the result. In the first set of experiments, data from eight participants were collected every other day to train and test the model. The data of each participant were divided into a training set (80%) and a test set (20%). Each classifier was trained using the training set, following which testing was conducted using the test set. The test results are shown in Table 3. The test accuracy of the DT for each participant was generally low, at only around 80%. The test accuracy of LR was only 82.5% for the first participant, but the test accuracies for the remaining seven participants were all above 90%. The test accuracies of RF, AdaBoost (ADB), MLP, SVM, k-NN, and LDA all exceeded 90% for each participant. The CNN and LSTM approaches performed very well for a single participant, with accuracies of over 95%. Our proposed model achieved an accuracy greater than 90% on the test set for each participant, reaching up to 95.6%.

TABLE 3

Table 3. Test accuracy when training individual participants separately.

In the second set of experiments, data from seven participants were collected every other day to train the model, which was tested on the remaining participant. The test results are shown in Table 4. Since the test participant was never seen in the training set, the test accuracy of the traditional classification methods dropped significantly. The accuracy of DT decreased from around 82% to around 73%, and the accuracies of LR, RF, ADB, MLP, SVM, k-NN, and LDA decreased from over 90% to 80–85%. The accuracy of the CNN and LSTM approaches also decreased from over 95% to 85–90%, with more dramatic decreases observed across participants. Because the classifiers had not learned some individual features, their accuracy decreased significantly when applied to participants they had not encountered previously. However, since the wide part and embedding layer of the proposed LR-CNN model had learned some individual features, the test accuracy did not decrease significantly when the model was applied to new individuals. When data for participant 7 were tested, the accuracy of the model decreased markedly compared with that for the other participants, indicating that the individual features of participant 7 did not appear in the data of the other seven participants.

TABLE 4

Table 4. Test results for the remaining participant after training using data for the other seven participants.

In the third set of experiments, data from four of the eight participants were randomly selected for training; data from these four individuals were collected every other day to train the model, which was then tested on data from the remaining four participants. To avoid randomness in the results, we carried out this process of randomly selecting four participants for training and the remaining four for testing a total of four times, as shown in Tables 5–8 (a sketch of this split protocol is given after the tables below). The overall accuracies of DT, LR, RF, ADB, MLP, SVM, k-NN, LDA, and LSTM decreased slightly further compared with those observed in the second set of experiments, due to the reduction in training data from seven participants to four. However, test accuracy for the four held-out participants did not decrease when our proposed model was applied, and all accuracies exceeded 90% (Table 5).

TABLE 5

Table 5. Test results for the remaining four participants after training using data from four participants (1).

TABLE 6

Table 6. Test results for the remaining four participants after training using data from four participants (2).

TABLE 7

Table 7. Test results for the remaining four participants after training using data from four participants (3).

TABLE 8

Table 8. Test results for the remaining four participants after training using data from four participants (4).
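For reference, here is a minimal sketch of the split protocol of this third experiment (train on four randomly chosen participants, test on the remaining four, repeated four times). It assumes the data are stored as per-participant arrays of epochs and labels; the data layout and seeding are our own scaffolding.

```python
import numpy as np

def cross_subject_splits(data, n_train=4, n_repeats=4, seed=0):
    """`data` maps participant id -> (X, y). Each repeat trains on `n_train`
    randomly chosen participants and tests on the rest."""
    rng = np.random.default_rng(seed)
    ids = sorted(data)
    for _ in range(n_repeats):
        train_ids = set(rng.choice(ids, size=n_train, replace=False).tolist())
        test_ids = [i for i in ids if i not in train_ids]
        X_tr = np.concatenate([data[i][0] for i in sorted(train_ids)])
        y_tr = np.concatenate([data[i][1] for i in sorted(train_ids)])
        X_te = np.concatenate([data[i][0] for i in test_ids])
        y_te = np.concatenate([data[i][1] for i in test_ids])
        yield (sorted(train_ids), test_ids), (X_tr, y_tr), (X_te, y_te)
```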

We plotted the training loss and test accuracy, as shown in Figure 3. Table 6 shows that the test accuracy of the proposed model was significantly reduced for test participants 4, 7, and 8. As noted in the second set of experiments, the individual features of participant 7 did not appear in the data of the other participants. For the analyses summarized in Tables 5, 7, the test accuracies did not significantly decrease for participants 4 and 8, although relatively significant decreases were observed in the analyses summarized in Tables 6, 8. This was because participants 4 and 8 shared relatively similar features that were not observed in the other participants. Therefore, when data for participants 4 and 8 were not included in the training phase but only appeared in the testing phase, the accuracy decreased significantly. However, compared with the traditional learning methods, the test accuracy of our proposed model remained relatively high, indicating that our model also learned some individual features.

FIGURE 3

Figure 3. Accuracy and loss figures for training. (A) The accuracy of the test on the remaining 4 subjects after training using the other 4 subjects. (B) The accuracy of the test on the remaining subjects after training using the other 7 subjects. (C) Test accuracy when training individual subjects separately. (D) The loss of the test on the remaining 4 subjects after training using the other 4 subjects. (E) The loss of the test on the remaining subjects after training using the other 7 subjects. (F) Test loss when training individual subjects separately.

The above experiments demonstrated that the proposed model can better learn individual participant features when compared with traditional classification methods, with increasing generalization capability as the size of the training set (i.e., number of participants) increases.

Discussion

In this study, the proposed LR-CNN model exhibited better accuracy in the test phase than existing methods that are commonly used for P300 classification.

The accuracy rates of each method in the three sets of experiments are shown in the box plots below. Figure 4 shows a box plot of the accuracy for each method in the first set of experiments, Figure 5 shows that for the second set, and Figure 6 shows that for the third set. The accuracies of DT in the three sets of experiments were generally low, whereas those of strong classifiers such as RF, ADB, MLP, and SVM were relatively high in the first set of experiments, with very little fluctuation. This indicated that these classifiers had learned the common features of the eight participants, leading to stable and relatively high accuracy when testing these participants. In particular, the CNN and LSTM approaches, which learn features better than the other classifiers, performed very well when applied to a single participant. However, the overall accuracies of these methods decreased significantly in the second and third sets of experiments, with large fluctuations. Due to the differences across participants, the common features learned by these strong classifiers and by the CNN/LSTM had different applicability to each new participant, leading to large fluctuations in accuracy across testers and an overall decrease in test accuracy. These results suggest that these strong classifiers and the CNN/LSTM had not learned the individual features of the participants.

FIGURE 4

Figure 4. Box plots of test accuracy for different methods in the first set of experiments.

FIGURE 5

Figure 5. Box plots of test accuracy for different methods in the second set of experiments.

FIGURE 6

Figure 6. Box plots of test accuracy for different methods in the third set of experiments.

Outliers were observed when LR and LDA were applied in the first set of experiments, but the overall classification accuracies were relatively high with very little fluctuation when the outliers were excluded. This indicated that these two classifiers may have emphasized certain types of common features and that some individuals' data did not exhibit such features, producing outliers in which the accuracy was much lower than the average. In the second and third sets of experiments, as with the above-mentioned strong classifiers, the overall accuracies of these two classifiers decreased significantly and exhibited large fluctuations across participants. This can also be explained by the failure of the classifiers to learn the individual features of participants.

As shown in Figure 4, the classification accuracy of the CNN and LSTM approaches exceeded that of the traditional approaches for a single participant, slightly outperforming our approach. However, as shown in Figures 5, 6, the classification accuracies of these two methods decreased during cross-participant testing. Moreover, as shown in Figure 6, the classification accuracy of the CNN and LSTM decreased further as the number of training participants decreased relative to that in Figure 5. Thus, the CNN and LSTM approaches are limited in cross-participant testing and by the number of participants available for training.

Our LR-CNN classifier maintained a high level of overall accuracy in the three sets of experiments, with no significant decrease for new participants. In the first set of experiments, when compared with that for other methods, the accuracy of our model fluctuated more across participants with a relatively high average accuracy rate, which suggested that our model learned both common and individual features. Due to the different degrees of learning for individual features, the accuracy rate fluctuated greatly across participants when compared with that observed using other models. However, in the second and third sets of experiments, the overall degree of fluctuation and accuracy rate did not change significantly when compared with those for other models, further suggesting that our model learned both common and individual features. The outliers in the second and third sets of experiments arose from individual features of these participants that differed from those of participants in the training set. Thus, the training samples did not have relevant individual features for our model to learn. This also highlights the adaptability of our model: As the number of individual participants increases, the model can learn more individual features, thus continuously increasing its generalizability.

Limitations

First, our test dataset was small, meaning that our method has a limited ability to learn individual features. A small number of participants in the training set would result in some individual features not being included, reducing our classifier’s accuracy rate when testing these participants. Second, although our model is capable of learning more features as the number of participants increases, further studies with larger datasets are required to verify its value.

Conclusion

Our proposed LR-CNN model exhibited better test accuracy than existing methods that are commonly used for classification. Such improvements in the accuracy of extracting features from EEG data will be useful for the development of BCI systems.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author Contributions

YW contributed to the conception of the study, assisted with data analysis, and participated in constructive discussions. YS and DZ wrote code and performed the experiments. MS contributed significantly to data analysis and manuscript preparation. DZ performed the data analyses and wrote the manuscript. QL was responsible for the overall experimental design. ZZ and JW checked the manuscript and made valuable suggestions on the data processing process. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by Shenzhen Overseas Innovation Team Project (grant number: KQTD20180413181834876) and Jilin Scientific and Technological Development Program (grant number: 20200802004GH).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We would like to thank all individuals who participated in the initial experiments from which raw data were collected.

References

Abibullaev, B., and Zollanvari, A. (2021). A Systematic Deep Learning Model Selection for P300-Based Brain-Computer Interfaces. IEEE Trans. Syst. Man Cybernet. Syst. 52, 2744–2756. doi: 10.1109/TSMC.2021.3051136


Akram, F., Han, S. M., and Kim, T. S. (2015). An efficient word typing P300-BCI system using a modified T9 interface and random forest classifier. Comput. Biol. Med. 56, 30–36. doi: 10.1016/j.compbiomed.2014.10.021


Amini, Z., Abootalebi, V., and Sadeghi, M. T. (2013). Comparison of Performance of Different Feature Extraction Methods in Detection of P300. Biocybern. Biomed. Eng. 33, 3–20. doi: 10.1016/S0208-5216(13)70052-4


Bernat, E. M., Malone, S. M., Williams, W. J., Patrick, C. J., and Iacono, W. G. (2007). Decomposing delta, theta, and alpha time–frequency ERP activity from a visual oddball task using PCA. Int. J. Psychophysiol. 64, 62–74. doi: 10.1016/j.ijpsycho.2006.07.015


Borra, D., Fantozzi, S., and Magosso, E. (2021). A Lightweight Multi-Scale Convolutional Neural Network for P300 Decoding: analysis of Training Strategies and Uncovering of Network Decision. Front. Hum. Neurosci. 15:655840. doi: 10.3389/fnhum.2021.655840


Bottou, L. (2010). “Large-Scale Machine Learning with Stochastic Gradient Descent,” in Proceedings of COMPSTAT’2010, eds Y. Lechevallier and G. Saporta (Princeton, NJ: Physica-Verlag HD).


Brumback, T., Arbel, Y., Donchin, E., and Goldman, M. S. (2012). Efficiency of responding to unexpected information varies with sex, age, and pubertal development in early adolescence. Psychophysiology 49, 1330–1339. doi: 10.1111/j.1469-8986.2012.01444.x


Carlson, S. R., and Iacono, W. G. (2006). Heritability of P300 amplitude development from adolescence to adulthood. Psychophysiology 43, 470–480. doi: 10.1111/j.1469-8986.2006.00450.x


Cecotti, H., and Gräser, A. (2011). Convolutional Neural Networks for P300 Detection with Application to Brain-Computer Interfaces. IEEE Trans. Pattern Anal. Mach. Intell. 33, 433–445. doi: 10.1109/TPAMI.2010.125


Dinteren, R., Arns, M., Jongsma, M. L., and Kessels, R. P. (2014). P300 Development across the Lifespan: A Systematic Review and Meta-Analysis. PLoS One 9:e87347. doi: 10.1371/journal.pone.0087347


Ditthapron, A., Banluesombatkul, N., Ketrat, S., Chuangsuwanich, E., and Wilaiprasitporn, T. (2019). Universal Joint Feature Extraction for P300 EEG Classification Using Multi-Task Autoencoder. IEEE Access 7, 68415–68428. doi: 10.1109/ACCESS.2019.2919143


Dodia, S., Edla, D. R., Bablani, A., Ramesh, D., and Kuppili, V. (2019). An Efficient EEG based Deceit Identification Test using Wavelet Packet Transform and Linear Discriminant Analysis. J. Neurosci. Methods 314, 31–40. doi: 10.1016/j.jneumeth.2019.01.007


Duchi, Y. J., and Singer, Y. (2009a). “Efficient learning using forward-backward splitting,” in Proceedings of the 22nd International Conference on Neural Information Processing Systems, 495–503.


Duchi, Y. J, and Singer, Y. (2009b). Efficient Online and Batch Learning Using Forward Backward Splitting. J. Mach. Learn. Res. 10, 2899–2934.


Farwell, L. A., and Donchin, E. (1988). Talking off the top of your head: toward a mental prosthesis utilizing event-related brain potentials. Electroencephalogr. Clin. Neurophysiol. 70, 510–523. doi: 10.1016/0013-4694(88)90149-6


Feng, B., and Zhang, S. R. (2019). Channel Automatic Selection Algorithm for P300 Signal with Group Sparsity Bayesian Logistic Regression. J. Northeastern Univ. 40, 1245–1251.


Guan, S., Zhao, K., and Yang, S. N. (2019). Motor Imagery EEG Classification Based on Decision Tree Framework and Riemannian Geometry. Comput. Intell. Neurosci. 2019:5627156. doi: 10.1155/2019/5627156


Guney, S., Arslan, S., Duru, A. D., and Duru, D. G. (2021). Identification of Food/Nonfood Visual Stimuli from Event-Related Brain Potentials. Appl. Bionics Biomech. 2021:6472586. doi: 10.1155/2021/6472586


Hochberg, L. R., Serruya, M., Friehs, G. M., Mukand, J. A., Saleh, M., Caplan, A. H., et al. (2006). Neuronal ensemble control of prosthetic devices by a human with tetraplegia. Nature 442, 164–171. doi: 10.1038/nature04970


Hoffmann, U., Vesin, J. M., Ebrahimi, T., and Diserens, K. (2008). An efficient P300-based brain-computer interface for disabled subjects. J. Neurosci. Methods 167, 115–125. doi: 10.1016/j.jneumeth.2007.03.005


Hongzhi, Q., Minpeng, X., Dong, M., Baikun, W., Zhipeng, L., Tao, Y., et al. (2012). Study of channel selection based on AdaBoost SVM in P300 Speller. CHIN. J. Sci. Instr. 26, 55–57.


Hu, L., and Zhang, Z. (2020). Evolving EEG signal processing techniques in the age of artificial intelligence. Brain Sci. Adv. 6, 159–161.


Hu, X., Chen, J., Wang, F., and Zhang, D. (2019). Ten challenges for EEG-based affective computing. Brain Sci. Adv. 5, 1–20.


Joshi, R. B., Goel, P., Sur, M., and Murthy, H. (2018). “Single Trial P300 Classification Using Convolutional LSTM and Deep Learning Ensembles Method,” in Intelligent Human Computer Interaction. IHCI 2018. Lecture Notes in Computer Science, ed. U. Tiwary (Cham: Springer).


Kim, H. D. (2017). Predicting Product Demands for Large-scale Chain Stores with FTRL-Proximal Linear Regression. J. Adv. Inform. Technol. Converg. 7, 35–42.


Krusienski, D. J., Sellers, E. W., Mcfarland, D. J., Vaughan, T. M., and Wolpaw, J. R. (2008). Toward enhanced P300 speller performance. J. Neurosci. Methods 167, 15–21. doi: 10.1016/j.jneumeth.2007.07.017


Kshirsagar, G., and Londhe, N. (2019). Weighted Ensemble of Deep Convolution Neural Networks for Single-Trial Character Detection in Devanagari-Script-Based P300 Speller. IEEE Trans. Cogn. Dev. Syst. 12, 551–560. doi: 10.1109/TCDS.2019.2942437


Kshirsagar, G. B., and Londhe, N. D. (2019). Improving Performance of Devanagari Script Input-Based P300 Speller Using Deep Learning. IEEE Trans. Biomed. Eng. 66, 2992–3005. doi: 10.1109/TBME.2018.2875024


Kundu, S., and Ari, S. (2017). P300 Detection with Brain–Computer Interface Application Using PCA and Ensemble of Weighted SVMs. IETE J. Res. 64, 406–414. doi: 10.1080/03772063.2017.1355271


Kundu, S., and Ari, S. (2020). P300 based character recognition using convolutional neural network and support vector machine. Biomed. Signal Process. Control 55:101645. doi: 10.1016/j.bspc.2019.101645


Li, F., Li, X., Wang, F., Zhang, D., Xia, Y., and He, F. (2020). A Novel P300 Classification Algorithm Based on a Principal Component Analysis-Convolutional Neural Network. Appl. Sci. 10:1546. doi: 10.3390/app10041546


Li, J. (2020). Thoughts on neurophysiological signal analysis and classification. Brain Sci. Adv. 6, 210–223.


Li, Q., Lu, Z., Gao, N., and Yang, J. (2019). Optimizing the performance of the visual P300-speller through active mental tasks based on color distinction and modulation of task difficulty. Front. Hum. Neurosci. 13:130. doi: 10.3389/fnhum.2019.00130


Li, X., Chen, X., Yan, Y., Wei, W., and Wang, Z. J. (2014). Classification of EEG Signals Using a Multiple Kernel Learning Support Vector Machine. Sensors 14, 12784–12802. doi: 10.3390/s140712784


Li, Z., Zhang, L., Zhang, F., Gu, R., Peng, W., and Hu, L. (2020). Demystifying signal processing techniques to extract resting-state EEG features for psychologists. Brain Sci. Adv. 6, 189–209.


Lipton, Z. C., Berkowitz, J., and Elkan, C. J. (2015). A Critical Review of Recurrent Neural Networks for Sequence Learning. arXiv. [preprint]. doi: 10.48550/arXiv.1506.00019


Liu, H., Li, Y., and Wang, S. (2021). Exploiting sparse representation in the P300 speller paradigm. Tsinghua Sci. Technol. 26, 440–451.


Liu, S., Song, Y., Zhang, M., Zhao, J., Yang, S., and Hou, K. (2019). An identity authentication method combining liveness detection and face recognition. Sensors 19:4733.


Liu, T., Wang, L., Suo, D., Zhang, J., Wang, K., Wang, J., et al. (2022). Resting-State Functional MRI of Healthy Adults: temporal Dynamic Brain Coactivation Patterns. Radiology. 2022:211762. doi: 10.1148/radiol.211762


Lu, Z., Gao, N., Liu, Y., and Li, Q. (2018). “The Detection of P300 Potential Based on Deep Belief Network,” in 2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), (Piscataway: IEEE).


Lu, Z., Li, Q., Gao, N., and Yang, J. (2020). Time-varying networks of ERPs in P300-speller paradigms based on spatially and semantically congruent audiovisual bimodality. J. Neural Eng. 17:046015.


Lu, Z., Li, Q., Gao, N., Yang, J., and Bai, O. (2019). A novel audiovisual P300-speller paradigm based on cross-modal spatial and semantic congruence. Front. Neurosci. 13:1040. doi: 10.3389/fnins.2019.01040


Luck, S. J. (2005). An Introduction to The Event-Related Potential Technique. Cambridge: MIT Press.


Maddula, R. K., Stivers, J., Mousavi, M., Ravindran, S., and Sa, V. (2017). “Deep recurrent convolutional neural networks for classifying P300 bci signals,” in 7th Graz Brain-Computer Interface Conference 2017, (Graz: Institute of Neural Engineering).


Martin-Smith, P., Ortega, J., Asensio-Cubero, J., Gan, J. Q., and Ortiz, A. (2017). A supervised filter method for multi-objective feature selection in EEG classification based on multi-resolution analysis for BCI. Neurocomputing 250, 45–56. doi: 10.1016/j.neucom.2016.09.123


Masud, U., Baig, M. I., Akram, F., and Kim, T. S. (2018). “A P300 brain computer interface based intelligent home control system using a random forest classifier,” in 2017 IEEE Symposium Series on Computational Intelligence (SSCI), (Honolulu, HI: IEEE).


McMahan, H. B. (2011). Follow-the-Regularized-Leader and Mirror Descent: equivalence Theorems and L1 Regularization. Proc. Mach. Learn. Res. 15, 525–533.


McMahan, H. B., Holt, G., Sculley, D., Young, M., Ebner, D., Grady, J., et al. (2013). “Ad Click Prediction: a View from the Trenches,” in Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), (New York: Association for Computing Machinery).


Oralhan, Z. (2020). 3D Input Convolutional Neural Networks for P300 Signal Detection. IEEE Access 8, 19521–19529. doi: 10.1109/ACCESS.2020.2968360


Park, Y., and Chung, W. (2018). “BCI classification using locally generated CSP features,” in 2018 6th International Conference on Brain-Computer Interface (BCI), (Piscataway: IEEE), 2572–7672. doi: 10.1109/TNSRE.2019.2922713


Rakotomamonjy, A., and Guigue, V. (2008). BCI competition III: dataset II-ensemble of SVMs for BCI P300 speller. IEEE Trans. Biomed. Eng. 55, 1147–1154. doi: 10.1109/TBME.2008.915728


Rashid, M., Sulaiman, N., Mustafa, M., Khatun, S., and Bari, B. S. (2019). “The Classification of EEG Signal Using Different Machine Learning Techniques for BCI Application,” in Robot Intelligence Technology and Applications. RiTA 2018. Communications in Computer and Information Science, eds J. H. Kim, H. Myung, and S. M. Lee (Singapore: Springer).


Shan, H., Liu, Y., and Stefanov, T. (2018). “A Simple Convolutional Neural Network for Accurate P300 Detection and Character Spelling in Brain Computer Interface,” in Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, (Stockholm: IJCAI), 1604–1610.


Vařeka, L., and Mautner, P. (2017). Stacked Autoencoders for the P300 Component Detection. Front. Neurosci. 11:302. doi: 10.3389/fnins.2017.00302


Wang, B., Niu, Y., Miao, L., Cao, R., Yan, P., Guo, H., et al. (2017). Decreased Complexity in Alzheimer’s Disease: resting-State fMRI Evidence of Brain Entropy Mapping. Front. Aging Neurosci. 2017:378. doi: 10.3389/fnagi.2017.00378


Xiao, L. (2010). Dual Averaging Methods for Regularized Stochastic Learning and Online Optimization. J. Mach. Learn. Res. 11, 2543–2596.


Xu, L., Xu, M., Ke, Y., An, X., Liu, S., and Ming, D. (2020). Cross-Dataset Variability Problem in EEG Decoding With Deep Learning. Front. Hum. Neurosci. 14:103. doi: 10.3389/fnhum.2020.00103


Xu, L., Xu, M., Ma, Z., Wang, K., Jung, T. P., and Ming, D. (2021). Enhancing transfer performance across datasets for brain-computer interfaces using a combination of alignment strategies and adaptive batch normalization. J. Neural Eng. 18:0460e5. doi: 10.1088/1741-2552/ac1ed2


Yan, T., Wang, W., Yang, L., Chen, K., Chen, R., and Han, Y. (2018). Rich club disturbances of the human connectome from subjective cognitive decline to Alzheimer’s disease. Theranostics 8, 3237–3255. doi: 10.7150/thno.23772


Yildirim, A., and Halici, U. (2014). “Analysis of dimension reduction by PCA and AdaBoost on spelling paradigm EEG data,” in International Conference on Biomedical Engineering & Informatics, (Hangzhou: IEEE).


Zhao, J., Zhao, D., Shi, L., Kuang, Z., Jing, W., and Wang, H. (2022). Multilayer weighted integrated self-learning algorithm for automatic diagnosis of epileptic electroencephalogram signals. Comput. Intell. 38, 3–19.


Keywords: brain-computer interface, convolutional neural network, electroencephalogram, event-related potential, logistic regression, P300

Citation: Li Q, Wu Y, Song Y, Zhao D, Sun M, Zhang Z and Wu J (2022) A P300-Detection Method Based on Logistic Regression and a Convolutional Neural Network. Front. Comput. Neurosci. 16:909553. doi: 10.3389/fncom.2022.909553

Received: 31 March 2022; Accepted: 13 May 2022;
Published: 16 June 2022.

Edited by:

Tianyi Yan, Beijing Institute of Technology, China

Reviewed by:

Minpeng Xu, Tianjin University, China
Ou Bai, Florida International University, United States

Copyright © 2022 Li, Wu, Song, Zhao, Sun, Zhang and Wu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Qi Li, liqi@cust.edu.cn; Zhilin Zhang, zhangzhilin@siat.ac.cn
