An EEG-based attention recognition method: fusion of time domain, frequency domain, and non-linear dynamics features

Chen, Di; Huang, Haiyun; Bao, Xiaoyu; Pan, Jiahui; Li, Yuanqing

doi:10.3389/fnins.2023.1194554

ORIGINAL RESEARCH article

Front. Neurosci., 12 July 2023

Sec. Perception Science

Volume 17 - 2023 | https://doi.org/10.3389/fnins.2023.1194554

An EEG-based attention recognition method: fusion of time domain, frequency domain, and non-linear dynamics features

Di Chen^1,2

Haiyun Huang^2,3

Xiaoyu Bao^1,2

Jiahui Pan^2,3

Yuanqing Li^1,2^*

¹School of Automation Science and Engineering, South China University of Technology, Guangzhou, China
²Research Center for Brain-Computer Interface, Pazhou Laboratory, Guangzhou, China
³School of Software, South China Normal University, Foshan, China

Introduction: Attention is a complex cognitive function of human brain that plays a vital role in our daily lives. Electroencephalogram (EEG) is used to measure and analyze attention due to its high temporal resolution. Although several attention recognition brain-computer interfaces (BCIs) have been proposed, there is a scarcity of studies with a sufficient number of subjects, valid paradigms, and reliable recognition analysis across subjects.

Methods: In this study, we proposed a novel attention paradigm and feature fusion method to extract features, which fused time domain features, frequency domain features and nonlinear dynamics features. We then constructed an attention recognition framework for 85 subjects.

Results and discussion: We achieved an intra-subject average classification accuracy of 85.05% ± 6.87% and an inter-subject average classification accuracy of 81.60% ± 9.93%, respectively. We further explored the neural patterns in attention recognition, where attention states showed less activation than non-attention states in the prefrontal and occipital areas in α, β and θ bands. The research explores, for the first time, the fusion of time domain features, frequency domain features and nonlinear dynamics features for attention recognition, providing a new understanding of attention recognition.

1. Introduction

Attention is a crucial cognitive process that allows individuals to selectively focus on specific aspects of their environment while filtering out irrelevant information, thereby enabling effective adaptation to their surroundings (Petersen and Posner, 2012). Poor attention and concentration skills can contribute to mental health problems such as anxiety and depression. If left unaddressed, these difficulties can develop into more severe conditions such as attention deficit hyperactivity disorder (ADHD) (Chen et al., 2019). Attention recognition is an emerging research area that provides a window to monitor and understand people's attention states. It shows significant potential application value in the fields of medicine (Moghaddari et al., 2020), military operations (Berka et al., 2004), and preventing fatigue while driving (Luo et al., 2019). Existing research methods for attention recognition have focused on psychological behavior scale tests, such as digital cancelation task (D-CAT) (Fliege et al., 2009) and simple reaction time (SRT) Krupski and Boyle (1978), Combined Raven's Test (CRT) (Wang et al., 1989), Shure grid test scale, and Conners et al. (1998). However, these methods are limited in their ability to provide real-time results of a user's attention states. Therefore, researchers turned to explore attention recognition based on neurophysiological signals with temporal information, such as heart rate, skin electricity, electroencephalogram (EEG), functional magnetic resonance imaging (fMRI), or multimodal methods, which captured physiological changes related to attention states. Compared to other peripheral physiological signals, EEG signal, with their high temporal resolution, provided more information about attention and showed great potential in the field of attention recognition (Andrillon et al., 2021).

Research on attention recognition based on EEG signal is of significant practical importance in brain–computer interface (BCI) applications. In a study by Hamadicharef Hamadicharef et al. (2009), they used a combination of temporal filters, spatial filters, and Fisher linear discriminant to classify attention states within five subjects and achieved an accuracy of 89.4%. Mohammadpour built an EEG-based BCI, which successfully recognized four levels of attention in five individuals with an accuracy of 63.5% (Mohammadpour and Mozaffari, 2017). Ac achieved an accuracy of 91.72% within subjects using time-frequency features and SVM in 2019 (Acı et al., 2019), while in 2021, Wang wan obtained an accuracy of 95.36% ± 2.31% for two attention levels within subjects using dynamical complexity (Wan et al., 2021). In light of the preceding information, studies in EEG-based attention recognition typically have fewer subjects than studies in other EEG fields (Zheng et al., 2017; Gao et al., 2021), which results in insufficient generalizablation on unseen data and obstacles in inter-subject studies.

Attention recognition paradigms based on EEG commonly involve the use of cues to prompt subjects to enter a state of attention or relaxation. The state of attention is typically associated with a task state, while the state of relaxation is considered a task-independent state. Tasks used to induce a state of attention include breath counting Braboszcz and Delorme (2011); Hosseini and Guo (2019), reading comprehension (Li et al., 2011), mental arithmetic (Hamadicharef et al., 2009), imagination (Ke et al., 2014), and Stroop test (Kawashima et al., 2023). However, how to induce participants under specific cognitive load and enhance their attention is still a challenging work.

Brain waves can be divided into different frequency bands, including δ (0.5–4Hz), θ (4–8Hz), α (8–13Hz), β (13–30Hz), and γ (30–50Hz), each of which is associated with specific physiological functions. Previous research showed that these frequency bands can reflect attention needs, emotional states, and cognitive processes (Rao, 2013). For example, studies by Ray demonstrated that EEG activities are related to attention (Ray and Cole, 1985). Klimesch and other researchers found that α wave amplitudes are smaller when individuals focused on mental arithmetic tasks Klimesch et al. (1993). Despite these findings, the specific neural patterns underlying attention-related EEG activities still require further investigation.

The application of EEG-based BCI in attention recognition is currently in its nascent stages. Researchers attempted to employ feature extraction methods such as power spectrum or non-linear dynamics (including approximate entropy and sample entropy) to identify attention levels. However, the application of attention recognition across subjects is hindered due to the lack of significant datasets, effective paradigms, and comprehensive feature analysis.

In this study, we first proposed a novel attention paradigm based on mental arithmetic tasks and built an EEG dataset of 85 subjects for attention recognition. Second, we proposed a composite EEG-based feature that took time domain, frequency domain, and non-linear dynamic features into consideration and constructed an attention recognition framework both across and within subjects. The best intra-subject accuracy and inter-subject accuracy are 85.05% ± 6.87% and 81.60% ± 9.93%, respectively. Furthermore, we explored neural patterns within attention and non-attention states and found that attention states showed less activation than non-attention states in the prefrontal and occipital areas across α, β, and θ bands.

This study is organized into five sections. Materials and Methods are presented in Section 2. Experimental results are presented in Section 3. Discussion is available in Section 4. Conclusion is presented in Section 5.

2. Materials and methods

In this section, we collected EEG data, conducted a preprocessing of raw EEG data, fused three types of features, and classified features for both intra- and inter-subject attention recognition. Additionally, we analyzed common neural patterns between attention and non-attention across subjects. Figure 1 depicts the attention recognition analysis framework, which encompasses EEG acquisition, data preprocessing, feature extraction, and classification.

FIGURE 1

Figure 1. Attention recognition framework is comprised of four integral components, namely data acquisition, data preprocessing, feature extraction, and classification.

2.1. Data acquisition

2.1.1. Equipment

Our study collected EEG signals using a 32-channel Neuroscan amplifier in accordance with the international 10–20 system (Gao et al., 2021). The signals were sampled at a rate of 250Hz and band-pass filtered between 0.1 and 50 Hz. Figure 2 displays the layout of EEG electrodes on the cap. To obtain high-quality data, we ensured that the impedance of each electrode was below 5 kΩ. The experiment employed a 22-inch external screen as a monitor and utilized the computer with a 32-bit Windows 7 system to store EEG data and run interface programs.

FIGURE 2

Figure 2. EEG cap layout for 32 channels.

2.1.2. Subjects

Eighty-five subjects with healthy visual and cognitive abilities were recruited from various universities in Guangzhou for this experiment. The subjects had an average age of 25.3 ± 2.4, with 45 male subjects and 40 female subjects. According to the Edinburgh Handedness Inventory (Robinson, 2021), all of these subjects were right-handed, and none had prior experience with attention-related BCI experiments. Moreover, all subjects were informed about the content and purpose of the study, and informed consent was obtained.

2.1.3. Paradigm

In the absence of a standard experimental paradigm for attention recognition, we proposed our own based on mental arithmetic and resting tasks for our study. Mental arithmetic tasks in our experiment were able to induce cognitive load on participants and improved their attention (Chin et al., 2018). Mental arithmetic tasks require cognitive resources such as working memory, attention control, and executive function (Hester and Garavan, 2005). When performing mental arithmetic, individuals need to retrieve numerical information from long-term memory, hold that information in working memory, manipulate that information to perform calculations, and monitor their progress toward a solution (Grabner and De Smedt, 2011).

The subjects were seated in a quiet room, and their brain activity was measured using an EEG acquisition device during the experiment. Figure 3 depicts the experimental paradigm, which contains 20 trials. Each trial includes a 3s cue, a 60s task, and a 10s rest period. Subjects were asked to prepare to enter the attention or non-attention state based on the screen cue during the cue period. The attention cue is depicted in Figure 3b, while the non-attention cue is depicted in Figure 3c. During the attention target state, subjects were instructed to keep doing mental arithmetic, which followed the rhythm of the screen, while a random number (possibly positive or negative) appeared on the screen and were continuously subtracted by 3 over time, as shown in Figure 3d. During the non-attention state, a fixed plus sign appeared on the screen as shown in Figure 3e; meanwhile, the subjects were asked to rest quietly with their eyes open. At the end of each trial, the subjects would be given 10 seconds to rest, called rest period. Each experimental session contained 10 attention states and 10 non-attention states, with the order randomized.

FIGURE 3

Figure 3. Attention paradigm: (a) the recording protocol, (b) attention cue in screen, (c) non-attention cue in screen, (d) mental arithmetic task, (e) non-attention task, and (f) a segmentation diagram.

2.2. Data preprocessing

The raw EEG signals were first filtered by a finite impulse response (FIR) band-pass filter between 0.5 Hz and 50 Hz to reduce noise and extract relevant information. Next, 15,000-point data (60s) for each trial in each channel were equally cut into 6 epochs of equal length, each containing 2,500-point data (10 s), as illustrated in Figure 3f. By the above operations, the data of each subject containing 20 trials were transformed into 120 epochs, of which 60 epochs corresponded to the labels of attention states and the other 60 epochs corresponded to the labels of non-attention states. To ensure the quality of the data, epochs with high amplitude or significant myoelectricity were removed from the dataset. After preprocessing, a total of 10,188 epochs were obtained from 85 subjects.

2.3. Feature extraction

It is now generally accepted that advanced cognition in the brain is often associated with time–frequency and non-linear dynamic features of EEG (Klimesch et al., 1993, 1998; Chun et al., 2011). To comprehensively investigate the relationship between these features and attention states, we exhaustively enumerated and extracted time domain features, frequency domain features, and non-linear dynamic features from the preprocessed EEG data, as described in Table 1. To simplify notation, we utilized T_l, F_l, and D_l to represent a time domain feature, a frequency domain feature, and a non-linear dynamics feature, respectively, with the subscript l denoting the order of the feature in Table 1. Additionally, we used $T_{l}^{m}$ , $F_{l}^{m}$ , and $D_{l}^{m}$ to signify the m-th dimension of the time domain feature T_l, frequency doamin feature F_l, and non-linear dynamic features D_l, respectively. For example, $F_{4}^{3}$ denotes the third dimension of power spectrum for frequency bands F₄.

TABLE 1

Table 1. List of representative EEG features extracted in our works.

2.3.1. Time domain features

As EEG signals are time-series signals, time domain features (such as mean T₂, skewness T₄, root-mean squared value T₆, standard deviation T₁₀, and number of zero-crossings T₁₂) have great advantages in expressing the amplitude, time scale, and complexity of signals. Numerous studies have shown that these features can distinguish different mental states (Vourkas et al., 2000; Wang and Guan, 2008). Furthermore, time domain features have the added advantage of low computational complexity and real-time calculation capability (Hu et al., 2016). Therefore, it is worth exploring time domain features for attention recognition based on EEG. Specifically, some time domain features used in this study are defined as follows.

The formulas of Hjorth parameter (activity (h₁), mobility (h₂), and complexity (h₃)) are defined as follows:

\begin{array}{l} h_{1} = σ_{x}^{2}, & (1) \end{array}

\begin{array}{l} h_{2} = σ_{d} / σ_{x}, & (2) \end{array}

\begin{array}{l} h_{3} = \frac{σ_{d d}}{σ_{d}} / \frac{σ_{d}}{σ_{x}} = σ_{d d} / σ_{x} & (3) \end{array}

where $σ_{x}^{2}$ is the variance of the signal, σ_d is the standard deviation of the first derivative of the signal, and σ_dd is the standard deviation of the second derivative of the signal.

2.3.2. Frequency domain features

Frequency domain features are commonly used in EEG research and have shown great potential in advanced cognitive recognition, such as emotion recognition (Huang W. et al., 2021). In this study, we selected the common EEG rhythms of δ, θ, α, β, and γ as the target frequency bands for analysis. Several frequency domain features were employed in this study, and their partial definitions are presented below.

The power spectrum density (PSD) of these bands F₄, along with their respective ratios, was utilized as features in the upcoming study. Assuming that X_k represents the Fourier transform of the time series x[n], the relevant PSD P is defined as follows:

\begin{array}{l} P = \sum_{n = 0}^{N - 1} | x^{2} | = \frac{1}{N} \sum_{k = 0}^{N - 1} | X_{k}^{2} | . & (4) \end{array}

Median frequency F₉ represents the frequency point that divides the power spectrum band of a signal into two equal parts (Thongpanja et al., 2013). This can be expressed by the following equation:

\begin{array}{l} \sum_{j = 1}^{MF} P_{j} = \sum_{j = MF}^{M} P_{j} = \frac{1}{2} \sum_{j = 1}^{M} P_{j}, & (5) \end{array}

where P_j denotes the power spectrum at frequency bin j, and MF denotes median frequency. The frequency band is from 1 to M, where 1 < MF<M. According to a previous study on the application of median frequency to EEG (Gudmundsson et al., 2005), median frequency of 10 frequency bands (0.5–2 Hz, 2–4 Hz, 4–5 Hz, 5–7 Hz, 7–10 Hz, 10–13 Hz, 13–15 Hz, 15–20 Hz, 20–30 Hz, and 30–40 Hz) were calculated in our study.

Discrete wavelet transform can be defined as follows (Blanco et al., 1998):

\begin{array}{l} C (j, k) = \int_{- \infty}^{\infty} x (t) \frac{1}{\sqrt{2^{j}}} ψ (\frac{t - 2^{j} k}{2^{j}}) d t, & (6) \end{array}

where 2^jk and 2^j represent the time positioning and scale coefficients respectively, and ψ(t) represents the mother wavelet function. The energy of each resolution level j = 1, ⋯ , J by wavelet coefficients can be (Candra et al., 2015) defined as follows:

\begin{array}{l} E_{j} = \sum_{k = 1}^{N} | C_{j, k} |^{2}, k = 1, \dots, N, & (7) \end{array}

where N is the number of wavelet coefficients in each decomposition layer. E_j can also be called Wavelet Coef Energy F₉. In this study, we used the mother wavelet Daubechies with a decomposition level of 6, which means J = 6.

2.3.3. Non-linear dynamic features

The theory of non-linear dynamics opened up a new window for understanding EEG. One of the non-linear estimates of dynamic EEG activity is complexity analysis. Among all complexity analysis methods, entropy proved to be a useful and robust estimation method for evaluating the regularity or predictability of EEG. The 17 features presented in Table 1 for each channel were calculated in preparation for the next step. Here are some definitions for non-linear dynamic features.

Teager–Kaiser energy D₁₄ is a non-linear energy tracking method can calculate the instantaneous energy of non-stationary signals (Solnik et al., 2010). For the case of the discrete signals x[n], the Teager–Kaiser energy ψ can be expressed as (O'Toole et al., 2014; Badani et al., 2017):

\begin{array}{l} ψ (x [n]) = x^{2} [n] - x [n - 1] x [n + 1] . & (8) \end{array}

We used the mean and standard deviation of Teager–Kaiser energy as features for the wavelet transform coefficients of decomposition level 6, which contained seven sets coefficients. This resulted in 14 (2 × 7) dimensional features.

To normalize the energy E_j of resolution level j, as calculated from Equation 7, the energy of the fixed resolution level j is compared with the total energy of the signal E_t:

\begin{array}{l} p_{j} = \frac{E_{j}}{E_{t}}, & (9) \end{array}

where E_t represents the sum of all frequency bands energy, E_j represents the energy of the fixed resolution level j, and p_j represents the proportion of E_j to E_t. According to Rosso et al. (2001), wavelet entropy (D₁₃) H_j can be defined as follows:

\begin{array}{l} H_{j} = - \sum p_{j} ln p_{j} . & (10) \end{array}

2.3.4. Feature fusion

Feature-level fusion involves integrating low-level or intermediate-level features extracted from different sources or modalities into a single representation before further analysis or decision-making (Cai et al., 2020). It aims to capture comprehensive and discriminative information provided by multiple features to enhance the overall representation and improve subsequent processing tasks (Chin et al., 2014).

We directly concatenated and fused the features extracted from time domain, frequency domain, and non-linear dynamics analysis methods. Considering that time domain features, frequency-domain features, and non-linear features can be represented as $T \in ℝ^{T_{d}}$ , $F \in ℝ^{F_{d}}$ , and $D \in ℝ^{D_{d}}$ respectively, the fusion features after concatenation can be represented as follows:

\begin{array}{l} F e a t u r e_{f u s i o n} = [T_{1}, T_{2}, \dots, T_{T_{d}}, F_{1}, F_{2}, \dots, F_{F_{d}}, D_{1}, D_{2}, \dots, D_{D_{d}}], & (11) \end{array}

where T_d, F_d, and D_d represent the dimension of time domain features, frequency domain features, and non-linear dynamic features, respectively.

After extracting 12-dimensional time domain features, 48-dimensional frequency domain features, and 63-dimensional non-linear dynamic features, we concatenated them to form fusion features, resulting in a total of 123 dimensions.

2.4. Classification

To valid the effectiveness of these extracted features, we used three common classification methods, including random forest (RF), decision tree (DT), and support vector machine (SVM) to build our attention recognition framework.

RF classifiers have been shown to be highly effective in small EEG data sets as demonstrated by the studies conducted by Amin et al. (2017), Lotte et al. (2018). In this study, the random forest classifier held 100 evaluators and used Gini impurity to measure the quality of a split as criterion. For DT, the criterion and splitter were set to information entropy and best. Linear kernel with a penalty parameter C of 2 and the kernel function coefficients of 0.2 was used for constructing the SVM classifiers.

We employed 5-fold cross-validation (CV) for intra-subject classification, dividing the training set and testing set strictly in the order of time. For inter-subject classification, leave-one-subject-out (LOSO) CV was employed.

3. Results

3.1. Performance of intra-subject attention recognition

In this section, we compared four types of features, namely time domain features, frequency domain features, non-linear dynamic features, and fusion features, in three different classifiers: RF, DT, and SVM. It should be noted that fusion features involve time domain, frequency domain, and non-linear dynamic features.

For each type of feature and each classifier, we calculated the mean accuracy based on a 5-fold CV approach, resulting in 85 mean accuracies in total. We then performed paired-sample t-tests to compare the accuracies of fusion features with those of the other types of features.

Figure 4 displays the mean and standard deviation of intra-subject accuracies for four types of features and three classifiers among 85 subjects. Fusion features demonstrate excellent performance, achieving accuracies (%) of 85.1, 78.7, and 79.8 using RF, DT, and SVM, respectively. For all classifiers (RF, DT, and SVM), the accuracies of fusion features are significantly greater than time domain features, frequency domain features, and non-linear dynamic features. RF performs the best among the three classifiers with different features, which is consistent with previous research findings that RF performs well on small datasets (Amin et al., 2017). The average accuracies (%) using RF are 81.4, 84.8, 84.0, and 85.1 for time domain features, frequency domain features, non-linear dynamic features, and fusion features, respectively.

FIGURE 4

Figure 4. Average intra-subject accuracies using different types of features and different classifiers. The ^* and ^** indicate that the intra-subject accuracies of fusion features are significantly higher than those of the compared type of features with p < 0.05 and p < 0.01, respectively.

3.2. Performance of inter-subject attention recognition

In terms of inter-subject analysis, we fixed features and classifiers as those in the intra-subject analysis. For each feature type and each classifier, we obtained 85 accuracies using LOSO CV and performed paired-sample t-tests to compare the accuracies of fusion features with those of other types of features. Figure 5 presents the inter-subject results obtained using different types of features and different classifiers, showing that RF and SVM were found to perform relatively well. Using RF, the average inter-subject accuracies (%) are 75.6, 78.7, 78.1, and 80.0 for the time domain features, frequency domain features, non-linear dynamic features, and fusion features, respectively. The best result (81.6%) of the average accuracies is achieved by SVM for distinguishing fusion features across subjects.

FIGURE 5

Figure 5. Average inter-subject accuracies using different types of features and different classifiers. The ^* and ^** indicate that the inter-subject accuracies of fusion features are significantly higher than those of the compared types of features with p < 0.05 and p < 0.01, respectively.

3.3. Assessment of our methods against baseline methods

Our study demonstrates the superior performance of our proposed method for the attention task, as compared to three baseline methods: PSD-SVM (Huang H. et al., 2021), Dynamical Complexity-XGBoost (Wan et al., 2021), and STFT-SVM (Acı et al., 2019). While PSD-SVM uses PSD features in the δ, θ, α, β, and γ bands and applies SVM for classification, Dynamical Complexity-XGBoost employs Multiscale Approximate Entropy, Sample Entropy, and Fuzzy Entropy as features and uses Extreme Gradient Boosting (XGBoost) for classification. Additionally, STFT-SVM utilizes the short-time Fourier transform (STFT) with a Blackman window to calculate smoothed time-dependent power spectra as features, which are then classified using SVM.

Our methods surpass the performance of baseline methods in both intra-subject and inter-subject classification, as evidenced by the results presented in Table 2. These findings demonstrate that our methods outperform the baseline methods.

TABLE 2

Table 2. Average intra-subject and inter-subject accuracies using our methods (Fusion Feature-RF and Fusion Feature-SVM), PSD-SVM, Dynamical Complexity-XGBoost, and STFT-SVM.

3.4. Individual feature analysis and classification performance evaluation

To determine the effectiveness of features, we conducted an individual feature analysis by evaluating the classification performance. The results were calculated for intra-subject classification by RF and inter-subject classification by SVM, as depicted in Figure 6.

FIGURE 6

Figure 6. Average intra-subject accuracies by RF and inter-subject accuracies by SVM in different features. The horizontal axis represents accuracy, while the vertical axis represents different features.

In the intra-subject experiment, numerous time domain, frequency domain, and non-linear dynamic features demonstrate an accuracy exceeding 70%. This finding suggests that the majority of the features calculated in Section 2 are effective within subjects. In the inter-subject experiment, only a few features exhibit a accuracy exceeding 70%, such as power spectrum for frequency bands F₄, wavelet entropy D₁₃, and Teager–kaiser energy D₁₄. These features demonstrate robustness across subjects, indicating their effectiveness as more reliable features. The highest inter-subject accuracy (%) achieved by an individual feature is 75.24 ± 11.53. However, our proposed fusion feature demonstrates a significantly higher accuracy (%) of 81.60 ± 9.93. This result further confirms the effectiveness of our fusion feature method.

3.5. Neural patterns

First, we classified PSD features from different frequency bands (δ, θ, α, β, and γ) within and across subjects, respectively. The inter-subject and intra-subject average accuracies with features from different frequency bands are shown in Tables 3, 4, respectively. The average accuracies in Tables 3, 4 indicate that the PSD calculated from δ, θ, α, β, and γ bands exhibits distinct separability, which suggests that δ, θ, α, β, and γ oscillations of brain activity are related to the processing of attention states (Brown, 1970; Ray and Cole, 1985; Klimesch et al., 1993; Klimesch, 1999; Prinzel et al., 2001).

TABLE 3

Table 3. Average intra-subject accuracies (%) of three classifiers for features from different frequency bands.

TABLE 4

Table 4. Average inter-subject accuracies (%) of three classifiers for features from different frequency bands.

Additionally, time–frequency analysis with Morlet wavelets Cohen (2019) was employed in electrode position Fz in an experiment, as depicted in Figure 7. This figure illustrates the distinct patterns observed for different attention states. Notably, the analysis reveals that frequencies below 30 Hz exhibit significantly lower response energy during the attention state compared to the non-attention state. These findings further support the relationship between attention states and frequency bands such as α, β, and θ.

FIGURE 7

Figure 7. Time–frequency analysis by Morlet wavelets of the electrode position Fz in all epochs for one subject. The white line represents the labels of attention states corresponding to time.

To further explore neural patterns associated with attention and non-attention states across all participants, we calculated the topographical maps of power features by averaging the power features over all epochs in all subjects for each frequency band between attention states and non-attention states. We then normalized the features by Z-Score for all epochs within each frequency band for each subject. Figure 8 depicts the topographical maps of the power features corresponding to the attention state an the non-attention state. The results demonstrate the existence of neural patterns associated with attention and non-attention states.

FIGURE 8

Figure 8. Topographical maps of power features in the canonical frequency bands. From left to right: δ, θ, α, β, and γ bands; from top to bottom: non-attention and attention.

Although the neural patterns of attention states and non-attention states are similar, greater activation iobserved in the prefrontal areas for both. In the α, β, and θ bands, the lateral prefrontal and occipital areas exhibit less activation during attention states than non-attention states. During non-attention states, there are significant higher β responses in both prefrontal and occipital regions. The existing studies (Klimesch et al., 1998; Egner and Gruzelier, 2004) have showed that the changes in EEG features are closely related to the degree of attention, with varying degrees of amplitude and power of individual rhythmic brain waves. For instance, when participants were in the state of attention, their EEG signals exhibited a significant decrease in α and β waves (Prinzel et al., 2001). Conversely, during non-attention processing, the energy of β and α responses was increased. These findings on neural patterns are consistent with previous attention studies (Ray and Cole, 1985; Klimesch, 1999; Kelly et al., 2003; Swartwood et al., 2003; K Binienda et al., 2011).

4. Discussion

In this study, we first designed a novel attention experiment paradigm and collected a dataset consisting of 85 subjects. Next, we extracted and fused time domain, frequency domain, and non-linear dynamic features. These features were then classified to construct a complete attention recognition framework. Additionally, we suggested that conducting a separate analysis of the differences in features and channels at the group level may be useful in distinguishing between attention and non-attention states.

The following discussions will be divided into six parts. First, we compared and analyzed the different features in different channels of attention and non-attention states, which is useful in the construction of our attention recognition framework. Second, we conducted a group level analysis on connectivity estimators of the attention and non-attention states. Third, we also discussed the neural patterns of attention and non-attention states. Fourth, we analyzed and compared different paradigms for attention recognition. Fifth, we conducted an advantages analysis on our method. Last, we described the limitations of this study and future research perspectives.

4.1. A group level analysis on the features of the attention and non-attention states

In our study, we conducted an analysis of the differences in the different types of features across channels between attention and non-attention states at the group level. Specifically, we collected features extracted from each channel of different epochs for each subject. We then divided the features into two parts for attention and non-attention states, respectively. After removing outliers, we averaged each part to obtain the average features, which represented the average level of a specific feature for a given channel and subject for attention and non-attention states.

To analyze whether there were significant differences in the average features between attention and non-attention states at the group level, paired t-tests were implemented on the average features in the time domain, frequency domain, and non-linear dynamic features, respectively. It is assumed that the average features of attention and non-attention do not significantly differ at the group level. The p-value of 0.05 is used as a significance level. This means that if the p>0.05, the null hypothesis is true; otherwise false. The results of the t-tests in terms of time domain features, frequency domain features, and non-linear dynamic features are, respectively, shown in Figures 9–11.

FIGURE 9

Figure 9. p-value of average features of time domain features paired t-test between attention and non-attention. The vertical axis represents 30 channels. From top to bottom, the channel names are as follows: Fp1, Fp2, F7, F3, Fz, F4, F8, FT7, FC3, FCz, FC4, FT8, T7, C3, Cz, C4, T8, TP7, CP3, CPz, CP4, TP8, P7, P3, Pz, P4, P8, O1, Oz, and O2. The horizontal axis represents the 24 dimensional time domain features, which are consistent with Table 1.

FIGURE 10

Figure 10. p-value of average features of frequency domain features paired t-test between attention and non-attention. The vertical axis is the same as Figure 9. The horizontal axis represents the 48-dimensional frequency domain features, which is consistent with Table 1. The 14-dimensional PSD features F₄ represent δ, θ, α, β, γ, δ/θ, δ/α, δ/β, δ/γ, θ/α, θ/β, θ/γ, α/β, α/γ, and β/γ from left to right.

FIGURE 11

Figure 11. p-value of average features of non-linear dynamic features paired t-test between attention and non-attention. The vertical axis is the same as Figure 9. The horizontal axis represents the 60 dimensional non-linear dynamic features, which are consistent with Table 1.

Regarding the average features of time domain features, it was found that for most of the channels corresponding to the feature skewness T₄ and decorrelation time T₁₁, the p-values were much less than 0.05, indicating a relatively significant difference. This finding is consistent with the results of previous studies that have investigated the relationship between skewness and cognition in EEG signals Davis et al. (2020). Additionally, Figure 9 shows that prefrontal channels such as Fp1, Fp2, F7, F3, Fz, and F4 exhibit more significant features relative to other channels.

Compared to the average features of time domain features, the average features of frequency domain features performed better in the t-test. The p-values of the average features of F₄ (power spectrum density) of θ, α, β, and power spectral density ratio δ/γ, δ/θ, δ/α, δ/β, δ/γ, θ/α, θ/β, θ/γ, α/β, and α/γ were less than 0.05 in frontal, occipital, and temporal brain regions, as shown in Figure 10. These significant differences at the group level explain the superior performance of frequency domain features compared to time domain features in Figures 4, 5. This finding is consistent with previous studies that cognitive tasks can enhance the power of eeg, particularly in visual cortex Fitzgibbon et al. (2004).

As indicated in Figure 11, the average features of non-linear dynamic features are very special, and most of the non-linear features are significant for part of the channels. This finding is consistent with previous attention studies that have found that the value of entropy decreases with a decrease of attention states (Li et al., 2012). For instance, the features D₁₄ (Teager–kaiser energy) exhibit a highly significant difference between attention and non-attention states, as shown in Figure 11, which indicates that Teager–kaiser energy D₁₄ is an excellent feature. Furthermore, the first eight dimensions of Teager–kaiser energy exhibit more significant channels and a deeper degree than the last six dimensions. According to the content in Section 2, the first eight dimensions correspond to Teager–kaiser energy of the low-frequency portion of wavelet decomposition, while the last six dimensions correspond to Teager–kaiser energy of the high-frequency portion of wavelet decomposition. This highlights that the contrast between attention and non-attention states in low-frequency portion is much more pronounced than in high-frequency portion, which is consistent with previous research (Fiebelkorn and Kastner, 2019).

Taken together, some average features from time domain, frequency domain, and non-linear dynamics exhibit significant differences at the group level, which further validates the effectiveness of our framework for attention recognition based on EEG.

4.2. A group level analysis on connectivity estimators of the attention and non-attention states

Brain functional connectivity (FC) elucidating the statistical dependencies and directed information flows unveils the functions and intricate interactions of diverse brain regions (Cao et al., 2022). We estimated correlations between different channels for each epoch of each participant to construct FC matrices and separately averaged the FC matrices of attention and non-attention states for each participant. This allowed us to obtain the average FC matrices for attention and non-attention states. We then conducted paired-sample t-tests and corrected using the false discovery rate (FDR) method for the average FC matrices corresponding to the two states. Figure 12 shows significant differences in brain FCs between attention and non-attention states across different frequency bands at the group level.

FIGURE 12

Figure 12. Significant differences in FC estimators across frequency bands (p < 0.05, FDR-corrected).

From Figure 12, it is evident that the significant differences in brain FCs between attention and non-attention states are primarily concentrated in the low-frequency bands, including δ, θ, lower α, and upper α (Zoefel et al., 2011) frequency bands. Within δ and θ frequency bands, there are significantly different connectivities distributed in various brain regions, including the left and right temporal lobes, parietal lobes, prefrontal regions, and occipital lobes. Within lower α and upper α frequency bands, the connectivities are mainly concentrated in the left and right temporal lobes. Overall, as the frequency range increases, there is a decrease in the number of significant connectivities, which further demonstrates that lower frequencies are more capable of characterizing changes in attention states. This is consistent with our discussion in section 4.1.

Furthermore, by comparing the differences in relatively low-frequency connectivities between the left and right hemispheres, we found that the number of significant connectivities in the right hemisphere at the group level was significantly higher than that in the left hemisphere. This indicates that the right hemisphere interacts more closely with information during attention changes, which is consistent with previous studies showing significant hemispheric asymmetry and lateralization toward the right hemisphere in the attention process of individuals (Bartolomeo and Malkinson, 2019).

4.3. Neural patterns analysis on the attention and non-attention states

In Section 3, we observed that neural patterns for attention and non-attention states exist according to Tables 3, 4, Figure 8. In this section, we analyzed these neural patterns of attention and non-attention states in greater detail.

Figure 13 shows the time–frequency analysis using Morlet wavelets for all epochs recorded from electrodes Fp1, Fp2, Fz, F3, F4, T3, T4, FT7, FT8, O1, Oz, and O2 in an experiment. As demonstrated by Figures 7 and 13, the time–frequency analysis reveals different patterns for different attention states. Specifically, the responses of low-frequency oscillations during attention states are lower than during non-attention states, especially in the temporal lobes and prefrontal regions. Additionally, the neural patterns remain relatively stable over time for each epoch within the experiment.

FIGURE 13

Figure 13. Time–frequency analysis by Morlet wavelets of the electrode position Fp1, Fp2, Fz, F3, F4, T3, T4, FT7, FT8, O1, Oz, and O2 in all epochs for one subject a red color indicates a high amplitude).

As shown in Figure 8, the average PSD in the θ, α, and β bands across all subjects exhibits distinct differences between attention and non-attention states, whereas this in the δ do not. This implies that the 4–30 Hz frequency bands are more closely associated with attention than the other frequency bands in EEG signals, which is consistent with prior research (Prinzel et al., 2001). For the δ band, compared with non-attention states, the neural patterns of attention states have a significantly lower response in the prefrontal. For the θ, β, and α bands, compared with non-attention states, the neural patterns of attention states have a significantly lower response in the lateral prefrontal and occipital areas.

4.4. A comparative analysis on paradigms

As described in Section 1, the tasks in paradigms of attention recognition include the Stroop test, breath counting, and reading comprehension. The Stroop test is inherently short in duration (Kawashima et al., 2023), which makes it difficult for participants to sustain their attention over time. The task of counting the number of breaths can easily lead to mental wandering (Braboszcz and Delorme, 2011). Additionally, reading tasks are influenced by different materials, leading to variations in attention and concentration levels among individuals (Li et al., 2011). To address these limitations, we proposed mental arithmetic tasks as an attention task in our paradigm. Mental arithmetic tasks have a longer duration and are not affected by different materials. Our framework's results provide strong evidence for the effective characterization of attention and non-attention states under our proposed paradigm. First, the excellent classification results in Figures 4, 5 demonstrate that attention states and non-attention states can be distinguished accurately with our paradigm. Second, different activation maps in Figure 8 between attention and non-attention states further support the efficacy of our proposed paradigm in characterizing attention and non-attention states. Together, these findings highlight the potential value of our paradigm in advancing research and understanding of attention states.

4.5. An advantages analysis on our method

First, our method can not only be used for intra-subject attention recognition but also for inter-subject attention recognition, making the application more convenient. Second, we can see that our proposed fusion feature method achieves the accuracy (%) of 85.05 ± 6.87 in intra-subject attention recognition and 81.60 ± 9.93 in inter-subject attention recognition. Compared with other attention classification methods, it achieves better classification results both within and across subjects, making the results of attention recognition more accurate.

4.6. Limitations and future study

There are three limitations to this study. First, this study analyzed the EEG features between attention states and non-attention states offline and did not perform some online validation. Thus, we will perform online validation for the framework presented in this study in future article. Second, in this study, all EEG channels (30 channels) were used to calculate the features, and it is difficult to collect 30 channels of EEG data for the complex variety of application scenarios of attention recognition (such as hospitals and schools). Therefore, reducing the number of channels for attention recognition is also an important direction for our future study. Third, in this study, for attention and non-attention states, we analyzed and classified the manual features based on EEG wthin and across subjects, and it is necessary to design an end-to-end network framework on large dataset in future study.

5. Conclusion

Attention recognition is of great importance in various fields such as medicine and industry. However, a reliable inter-subject attention recognition framework that can be effective is still missing. This study proposed a novel attention experiment paradigm, built a dataset of 85 subjects, fused three types of features, and classified features for attention recognition based on EEG. Eighty-five subjects participated in our experiment, and the experimental results demonstrated the validity of our paradigm and analysis methods with an average intra-subject attention recognition accuracy of 85.05% and an average inter-subject attention recognition accuracy of 81.60%. Additionally, our frequency band features analysis revealed neural patterns of attention and non-attention states, where attention states showed less activation than non-attention states in the prefrontal and occipital areas in α, β, and θ bands. Furthermore, we identified the features that exhibited signification corresponding channels between attention and non-attention states. These findings may be useful for understanding attention recognition based on EEG and may guide future study in this area.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving human participants were reviewed and approved by Ethics Committee of Affiliated Brain Hospital of Guangzhou Medical University, Guangzhou, China. The patients/participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

HH and YL built the attention paradigm and did the experiment in this study. DC did the study analysis and wrote the study with help from HH, XB, JP, and YL. All authors contributed to the article and approved the submitted version.

Funding

This study was supported in part by the STI 2030–Major Projects under Grant 2022ZD0208900 and in part by the Key Realm Research and Development Program of Guangzhou, China, under Grant 202007030007.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Acı, ,̧ C. İ., Kaya, M., and Mishchenko, Y. (2019). Distinguishing mental attention states of humans via an EEG-based passive BCI using machine learning methods. Expert Syst. Appl. 134, 153–166. doi: 10.1016/j.eswa.2019.05.057

PubMed Abstract | CrossRef Full Text | Google Scholar

Amin, H. U., Mumtaz, W., Subhani, A. R., Saad, M. N. M., and Malik, A. S. (2017). Classification of EEG signals based on pattern recognition approach. Front. Comput. Neurosci. 11, 103. doi: 10.3389/fncom.2017.00103

PubMed Abstract | CrossRef Full Text | Google Scholar

Andrillon, T., Burns, A., Mackay, T., Windt, J., and Tsuchiya, N. (2021). Predicting lapses of attention with sleep-like slow waves. Nat. Commun. 12, 3657. doi: 10.1038/s41467-021-23890-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Badani, S., Saha, S., Kumar, A., Chatterjee, S., and Bose, R. (2017). “Detection of epilepsy based on discrete wavelet transform and teagerkaiser energy operator,” in 2017 IEEE Calcutta Conference (CALCON). Kolkata: IEEE, 164-167. doi: 10.1109/CALCON.2017.8280717

PubMed Abstract | CrossRef Full Text | Google Scholar

Bai, D., Qiu, T., and Li, X. (2007). The sample entropy and its application in EEG based epilepsy detection. J. Biomed. Eng. 24, 200–205.

PubMed Abstract | Google Scholar

Barry, R. J., Kirkaikul, S., and Hodder, D. (2000). EEG alpha activity and the ERP to target stimuli in an auditory oddball paradigm. Int. J. Psychophysiol. 39, 39–50. doi: 10.1016/S0167-8760(00)00114-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Bartolomeo, P., and Malkinson, T. S. (2019). Hemispheric lateralization of attention processes in the human brain. Curr. Opin. Psychol. 29, 90–96. doi: 10.1016/j.copsyc.2018.12.023

PubMed Abstract | CrossRef Full Text | Google Scholar

Berka, C., Levendowski, D. J., Cvetinovic, M. M., Petrovic, M. M., Davis, G., Lumicao, M. N., et al. (2004). Real-time analysis of EEG indexes of alertness, cognition, and memory acquired with a wireless EEG headset. Int. J. Hum. Comput. Interact. 17:151–170. doi: 10.1207/s15327590ijhc1702_3

CrossRef Full Text | Google Scholar

Blanco, S., Figliola, A., Quiroga, R. Q., Rosso, O., and Serrano, E. (1998). Time-frequency analysis of electroencephalogram series. III. wavelet packets and information cost function. Phys. Rev. E 57, 932. doi: 10.1103/PhysRevE.57.932

CrossRef Full Text | Google Scholar

Borbely, A. A., and Neuhaus, H. U. (1979). Sleep-deprivation: effects on sleep and EEG in the rat. J. Comparat. Physiol. 133, 71–87. doi: 10.1007/BF00663111

PubMed Abstract | CrossRef Full Text | Google Scholar

Braboszcz, C., and Delorme, A. (2011). Lost in thoughts: neural markers of low alertness during mind wandering. Neuroimage. 54, 3040–3047. doi: 10.1016/j.neuroimage.2010.10.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Brown, B. B. (1970). Recognition of aspects of consciousness through association with EEG alpha activity represented by a light signal. Psychophysiology. 6, 442–452. doi: 10.1111/j.1469-8986.1970.tb01754.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Cai, H., Qu, Z., Li, Z., Zhang, Y., Hu, X., and Hu, B. (2020). Feature-level fusion approaches based on multimodal eeg data for depression recognition. Informat. Fusi. 59, 127–138. doi: 10.1016/j.inffus.2020.01.008

CrossRef Full Text | Google Scholar

Candra, H., Yuwono, M., Handojoseno, A., Chai, R., Su, S., and Nguyen, H. T. (2015). “Recognizing emotions from EEG subbands using wavelet analysis,” in 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). Milano: IEEE, 6030–6033. doi: 10.1109/EMBC.2015.7319766

PubMed Abstract | CrossRef Full Text | Google Scholar

Cao, J., Zhao, Y., Shan, X., Wei, H.-,l., Guo, Y., Chen, L., et al. (2022). Brain functional and effective connectivity based on electroencephalography recordings: a review. Hum. Brain Mapp. 43, 860–879. doi: 10.1002/hbm.25683

PubMed Abstract | CrossRef Full Text | Google Scholar

Capurro, A., Diambra, L., Lorenzo, D., Macadar, O., Martín, M. T., Mostaccio, C., et al. (1998). Tsallis entropy and cortical dynamics: the analysis of EEG signals. Physica A. 257, 149–155. doi: 10.1016/S0378-4371(98)00137-X

CrossRef Full Text | Google Scholar

Chen, H., Song, Y., and Li, X. (2019). Use of deep learning to detect personalized spatial-frequency abnormalities in eegs of children with ADHD. J. Neural Eng. 16, 066046. doi: 10.1088/1741-2552/ab3a0a

PubMed Abstract | CrossRef Full Text | Google Scholar

Chin, Y. J., Ong, T. S., Teoh, A. B. J., and Goh, K. (2014). Integrated biometrics template protection technique based on fingerprint and palmprint feature-level fusion. Informat. Fusion. 18, 161–174. doi: 10.1016/j.inffus.2013.09.001

CrossRef Full Text | Google Scholar

Chin, Z. Y., Zhang, X., Wang, C., and Ang, K. K. (2018). “EEG-based discrimination of different cognitive workload levels from mental arithmetic,” in 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). Honolulu, HI: IEEE, 1984–1987. doi: 10.1109/EMBC.2018.8512675

PubMed Abstract | CrossRef Full Text | Google Scholar

Chun, M. M., Golomb, J. D., and Turk-Browne, N. B. (2011). A taxonomy of external and internal attention. Annu. Rev. Psychol. 62, 73–101. doi: 10.1146/annurev.psych.093008.100427

PubMed Abstract | CrossRef Full Text | Google Scholar

Cohen, M. X. (2019). A better way to define and describe morlet wavelets for time-frequency analysis. Neuroimage. 199, 81–86. doi: 10.1016/j.neuroimage.2019.05.048

PubMed Abstract | CrossRef Full Text | Google Scholar

Conners, C. K., Sitarenios, G., Parker, J. D., and Epstein, J. N. (1998). The revised Conners' Parent Rating Scale (CPRS-R): factor structure, reliability, and criterion validity. J. Abnorm. Child Psychol. 26, 257–268. doi: 10.1023/A:1022602400621

PubMed Abstract | CrossRef Full Text | Google Scholar

Davis, J. J., Schbeler, F., Ji, S., and Kozma, R. (2020). Discrimination between brain cognitive states using shannon entropy and skewness information measure, in 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC). Toronto, ON: IEEE, 4026–4031. doi: 10.1109/SMC42975.2020.9283315

CrossRef Full Text | Google Scholar

Delorme, A., Makeig, S., and Sejnowski, T. (2001). “Automatic artifact rejection for EEG data using high-order statistics and independent component analysis,” in Proceedings of the Third International ICA Conference. Princeton: Citeseer, 9–12.

Google Scholar

Demanuele, C., James, C. J., and Sonuga-Barke, E. J. (2007). Distinguishing low frequency oscillations within the 1/f spectral behaviour of electromagnetic brain signals. Behav. Brain Funct. 3, 1–14. doi: 10.1186/1744-9081-3-62

PubMed Abstract | CrossRef Full Text | Google Scholar

Dumermuth, G., and Molinari, L. (1987). Spectral analysis of the EEG. Neuropsychobiology. 17, 85–99. doi: 10.1159/000118345

PubMed Abstract | CrossRef Full Text | Google Scholar

Egner, T., and Gruzelier, J. H. (2004). EEG biofeedback of low beta band components: frequency-specific effects on variables of attention and event-related brain potentials. Clini. Neurophysiol. 115, 131–139. doi: 10.1016/S1388-2457(03)00353-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Esteller, R., Echauz, J., Tcheng, T., Litt, B., and Pless, B. (2001). “Line length: an efficient feature for seizure onset detection,” in 2001 Conference Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, volume 2. Istanbul: IEEE, 1707–1710. doi: 10.1109/IEMBS.2001.1020545

PubMed Abstract | CrossRef Full Text | Google Scholar

Esteller, R., Vachtsevanos, G., Echauz, J., and Lilt, B. (1999). “A comparison of fractal dimension algorithms using synthetic and experimental data,” in 1999 IEEE International Symposium on Circuits and Systems (ISCAS). Orlando, FL: IEEE, 199–202. doi: 10.1109/ISCAS.1999.778819

CrossRef Full Text | Google Scholar

Fiebelkorn, I. C., and Kastner, S. (2019). A rhythmic theory of attention. Trends Cogn. Sci. 23, 87–101. doi: 10.1016/j.tics.2018.11.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Fitzgibbon, S., Pope, K., Mackenzie, L., Clark, C., and Willoughby, J. (2004). Cognitive tasks augment gamma EEG power. Clini. Neurophysiol. 115, 1802–1809. doi: 10.1016/j.clinph.2004.03.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Fliege, H., Becker, J., Walter, O. B., Rose, M., Bjorner, J. B., and Klapp, B. F. (2009). Evaluation of a computer adaptive test for the assessment of depression (D'CAT) in clinical application. Int. J. Methods Psychiatr. Res. 18, 23–36. doi: 10.1002/mpr.274

PubMed Abstract | CrossRef Full Text | Google Scholar

Gao, W., Yu, T., Yu, J.-G., Gu, Z., Li, K., Huang, Y., et al. (2021). Learning invariant patterns based on a convolutional neural network and big electroencephalography data for subject-independent P300 brain-computer interfaces. IEEE Trans. Neural Syst. Rehabil. Eng. 29, 1047–1057. doi: 10.1109/TNSRE.2021.3083548

PubMed Abstract | CrossRef Full Text | Google Scholar

Grabner, R. H., and De Smedt, B. (2011). Neurophysiological evidence for the validity of verbal strategy reports in mental arithmetic. Biol. Psychol. 87, 128–136. doi: 10.1016/j.biopsycho.2011.02.019

PubMed Abstract | CrossRef Full Text | Google Scholar

Grieszbach, G., and Schack, B. (1993). Adaptive quantile estimation and its application in analysis of biological signals. Biom. J. 35, 165–179. doi: 10.1002/bimj.4710350207

PubMed Abstract | CrossRef Full Text | Google Scholar

Gudmundsson, S., Runarsson, T. P., and Sigurdsson, S. (2005). “Automatic sleep staging using support vector machines with posterior probability estimates,” in International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06), volume 2. Vienna: IEEE, 366–372. doi: 10.1109/CIMCA.2005.1631496

CrossRef Full Text | Google Scholar

Hamadicharef, B., Zhang, H., Guan, C., Wang, C., Phua, K. S., Tee, K. P., et al. (2009). “Learning EEG-based spectral-spatial patterns for attention level measurement,” in 2009 IEEE International Symposium on Circuits and Systems. Seoul: IEEE, 1465–1468. doi: 10.1109/ISCAS.2009.5118043

CrossRef Full Text | Google Scholar

Hester, R., and Garavan, H. (2005). Working memory and executive function: the influence of content and load on the control of attention. Memory & Cognit. 33:221–233. doi: 10.3758/BF03195311

PubMed Abstract | CrossRef Full Text | Google Scholar

Hjorth, B. (1970). EEG analysis based on time domain properties. Electroencephalogr. Clin. Neurophysiol. 29, 306–310. doi: 10.1016/0013-4694(70)90143-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Hosseini, S., and Guo, X. (2019). “Deep convolutional neural network for automated detection of mind wandering using EEG signals,” in Proceedings of the 10th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics. New York: Association for Computing Machinery, 314–319. doi: 10.1145/3307339.3342176

CrossRef Full Text | Google Scholar

Hu, B., Li, X., Sun, S., and Ratcliffe, M. (2016). Attention recognition in EEG-based affective learning research using CFS+KNN algorithm. IEEE/ACM Trans Comput Biol. Bioinform. 15, 38–45. doi: 10.1109/TCBB.2016.2616395

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, H., Cai, Y., Feng, X., and Li, Y. (2021). An electroencephalogram-based study of resting-state spectrogram and attention in tinnitus patients. J. Biomed. Eng. 38, 492–497. doi: 10.7507/1001-5515.202012015

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, W., Wu, W., Lucas, M. V., Huang, H., Wen, Z., and Li, Y. (2021). “Neurofeedback training with an electroencephalogram-based brain-computer interface enhances emotion regulation,” in IEEE Transactions on Affective Computing (IEEE).

Google Scholar

Inouye, T., Shinosaki, K., Sakamoto, H., Toi, S., Ukai, S., Iyama, A., et al. (1991). Quantification of EEG irregularity by use of the entropy of the power spectrum. Electroencephalogr. Clin. Neurophysiol. 79:204–210. doi: 10.1016/0013-4694(91)90138-T

PubMed Abstract | CrossRef Full Text | Google Scholar

K Binienda, Z. A., Beaudoin, M. T., Thorn, B., et al. (2011). Analysis of electrical brain waves in neurotoxicology: gamma-hydroxybutyrate. Curr. Neuropharmacol. 9:236–239. doi: 10.2174/157015911795017209

PubMed Abstract | CrossRef Full Text | Google Scholar

Kannathal, N., Acharya, U. R., Lim, C. M., and Sadasivan, P. (2005). Characterization of EEG a comparative study. Comput. Methods Programs Biomed. 80, 17–23. doi: 10.1016/j.cmpb.2005.06.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Kawashima, I., Nagahama, T., Kumano, H., Momose, K., and Tanaka, S. C. (2023). Pavlovian-based neurofeedback enhances meta-awareness of mind-wandering. Neural Networks. 158, 239–248. doi: 10.1016/j.neunet.2022.11.024

PubMed Abstract | CrossRef Full Text | Google Scholar

Ke, Y., Chen, L., and Fu, L. (2014). Visual attention recognition based on nonlinear dynamical parameters of EEG. Biomed. Mater. Eng. 24, 349–355. doi: 10.3233/BME-130817

PubMed Abstract | CrossRef Full Text | Google Scholar

Kelly, S., Dockree, P., Reilly, R., and Robertson, I. (2003). “EEG alpha power and coherence time courses in a sustained attention task,” in First International IEEE EMBS Conference on Neural Engineering, 2003. Conference Proceedings. Capri: IEEE, 83–86. doi: 10.1109/CNE.2003.1196761

CrossRef Full Text | Google Scholar

Kharbouch, A., Shoeb, A., Guttag, J., and Cash, S. S. (2011). An algorithm for seizure onset detection using intracranial EEG. Epilepsy Behav. 22, S29–S35. doi: 10.1016/j.yebeh.2011.08.031

PubMed Abstract | CrossRef Full Text | Google Scholar

Klimesch, W. (1999). EEG alpha and theta oscillations reflect cognitive and memory performance: a review and analysis. Brain Res. Rev. 29, 169–195. doi: 10.1016/S0165-0173(98)00056-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Klimesch, W., Doppelmayr, M., and Russegger, H. (1998). Induced alpha band power changes in the human EEG and attention. Neurosci. Lett. 244, 73–76. doi: 10.1016/S0304-3940(98)00122-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Klimesch, W., Schimke, H., and Pfurtscheller, G. (1993). Alpha frequency, cognitive load and memory performance. Brain Topogr. 5, 241–251. doi: 10.1007/BF01128991

PubMed Abstract | CrossRef Full Text | Google Scholar

Krupski, A., and Boyle, P. R. (1978). An observational analysis of children's behavior during a simple-reaction-time task: the role of attention. Child Dev. 6, 340–347. doi: 10.2307/1128696

CrossRef Full Text | Google Scholar

Li, D., Li, X., Liang, Z., Voss, L. J., and Sleigh, J. W. (2010). Multiscale permutation entropy analysis of EEG recordings during sevoflurane anesthesia. J. Neural Eng. 7, 046010. doi: 10.1088/1741-2560/7/4/046010

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, W., Ming, D., Xu, R., Ding, H., Qi, H., and Wan, B. (2012). “Research on visual attention classification based on EEG entropy parameters,” in World Congress on Medical Physics and Biomedical Engineering May 26-31, 2012, Beijing, China. Berlin: Springer, 1553–1556. doi: 10.1007/978-3-642-29305-4_408

CrossRef Full Text | Google Scholar

Li, Y., Li, X., Ratcliffe, M., Liu, L., Qi, Y., and Liu, Q. (2011). “A real-time EEG-based BCI system for attention recognition in ubiquitous environment,” in Proceedings of 2011 International Workshop on Ubiquitous Affective Awareness and Intelligent Interaction (New York, NY: IOP Publishing), 33–40. doi: 10.1145/2030092.2030099

CrossRef Full Text | Google Scholar

Lotte, F., Bougrain, L., Cichocki, A., Clerc, M., Congedo, M., Rakotomamonjy, A., et al. (2018). A review of classification algorithms for EEG-based brain computer interfaces: a 10 year Update Univ. S C. Dep. Music. 15, 031005. doi: 10.1088/1741-2552/aab2f2

PubMed Abstract | CrossRef Full Text | Google Scholar

Luo, H., Qiu, T., Liu, C., and Huang, P. (2019). Research on fatigue driving detection using forehead EEG based on adaptive multi-scale entropy. Biomed. Signal Process. Control 51, 50–58. doi: 10.1016/j.bspc.2019.02.005

CrossRef Full Text | Google Scholar

Lykken, D., Tellegen, A., and Thorkelson, K. (1974). Genetic determination of EEG frequency spectra. Biol. Psychol. 1, 245–259. doi: 10.1016/0301-0511(74)90001-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Mardi, Z., Ashtiani, S. N. M., Mikaili, M., et al. (2011). EEG-based drowsiness detection for safe driving using chaotic features and statistical tests. J. Medical Signals Sens. 1, 130. doi: 10.4103/2228-7477.95297

PubMed Abstract | CrossRef Full Text | Google Scholar

Márton, L., Brassai, S. T., Bakó, L., and Losonczi, L. (2014). Detrended fluctuation analysis of EEG signals. Procedia Technol. 12, 125–132. doi: 10.1016/j.protcy.2013.12.465

PubMed Abstract | CrossRef Full Text | Google Scholar

Moghaddari, M., Lighvan, M. Z., and Danishvar, S. (2020). Diagnose ADHD disorder in children using convolutional neural network based on continuous mental task EEG. Comput. Methods Programs Biomed. 197, 105738. doi: 10.1016/j.cmpb.2020.105738

PubMed Abstract | CrossRef Full Text | Google Scholar

Mohammadpour, M., and Mozaffari, S. (2017). “Classification of EEG-based attention for brain computer interface,” in 2017 3rd Iranian Conference on Intelligent Systems and Signal Processing (ICSPIS) (IEEE), 34–37. doi: 10.1109/ICSPIS.2017.8311585

CrossRef Full Text | Google Scholar

Mormann, F., Andrzejak, R. G., Elger, C. E., and Lehnertz, K. (2007). Seizure prediction: the long and winding road. Brain. 130, 314–333. doi: 10.1093/brain/awl241

PubMed Abstract | CrossRef Full Text | Google Scholar

O'Toole, J. M., Temko, A., and Stevenson, N. (2014). “Assessing instantaneous energy in the EEG: a nonnegative, frequencyweighted energy operator,” in 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Chicago, IL: IEEE, 3288–3291. doi: 10.1109/EMBC.2014.6944325

PubMed Abstract | CrossRef Full Text | Google Scholar

Päivinen, N., Lammi, S., Pitkänen, A., Nissinen, J., Penttonen, M., and Grönfors, T. (2005). Epileptic seizure detection: a nonlinear viewpoint. Comput. Methods Programs Biomed. 79, 151–159. doi: 10.1016/j.cmpb.2005.04.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Papadelis, C., Kourtidou-Papadeli, C., Bamidis, P. D., Chouvarda, I., Koufogiannis, D., Bekiaris, E., et al. (2006). “Indicators of sleepiness in an ambulatory EEG study of night driving,” in 2006 International Conference of the IEEE Engineering in Medicine and Biology Society (Shahrood: IEEE), 6201–6204. doi: 10.1109/IEMBS.2006.259614

PubMed Abstract | CrossRef Full Text | Google Scholar

Petersen, S. E., and Posner, M. I. (2012). The attention system of the human brain: 20 years after. Annu. Rev. Neurosci. 35, 73–89. doi: 10.1146/annurev-neuro-062111-150525

PubMed Abstract | CrossRef Full Text | Google Scholar

Pollock, V., Schneider, L., and Lyness, S. (1990). EEG amplitudes in healthy, late-middle-aged and elderly adults: normality of the distributions and correlations with age. Electroencephalogr. Clin. Neurophysiol. 75, 276–288. doi: 10.1016/0013-4694(90)90106-T

PubMed Abstract | CrossRef Full Text | Google Scholar

Prinzel, L. J., Pope, A. T., Freeman, F. G., Scerbo, M. W., and Mikulka, P. J. (2001). “Empirical analysis of EEG and erps for psychophysiological adaptive task allocation,” in NASA Technical Report TM. Available online at: https://ntrs.nasa.gov/citations/20010060403

Google Scholar

Rao, R. P. (2013). Brain-Computer Interfacing: an Introduction. Cambridge: Cambridge University Press. doi: 10.1017/CBO9781139032803

CrossRef Full Text | Google Scholar

Ray, W. J., and Cole, H. W. (1985). EEG alpha activity reflects attentional demands, and beta activity reflects emotional and cognitive processes. Science. 228, 750–752. doi: 10.1126/science.3992243

PubMed Abstract | CrossRef Full Text | Google Scholar

Roberts, S. J., Penny, W., and Rezek, I. (1999). Temporal and spatial complexity measures for electroencephalogram based brain-computer interfacing. Medi. Biol. Eng. Comput. 37, 93–98. doi: 10.1007/BF02513272

PubMed Abstract | CrossRef Full Text | Google Scholar

Robinson, J. (2021). “Edinburgh handedness inventory,” in Encyclopedia of Autism Spectrum Disorders. Berlin: Springer, 1600–1604. doi: 10.1007/978-3-319-91280-6_877

CrossRef Full Text | Google Scholar

Rosso, O. A., Blanco, S., Yordanova, J., Kolev, V., Figliola, A., Schürmann, M., et al. (2001). Wavelet entropy: a new tool for analysis of short duration brain electrical signals. J. Neurosci. Methods. 105, 65–75. doi: 10.1016/S0165-0270(00)00356-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Schwender, D., Daunderer, M., Mulzer, S., Klasing, S., Finsterer, U., and Peter, K. (1996). Spectral edge frequency of the electroencephalogram to monitor “depth” of anaesthesia with isoflurane or propofol. Br. J. Anaesth. 77, 179–184. doi: 10.1093/bja/77.2.179

PubMed Abstract | CrossRef Full Text | Google Scholar

Solnik, S., Rider, P., Steinweg, K., DeVita, P., and Hortobgyi, T. (2010). Teager kaiser energy operator signal conditioning improves EMG onset detection. Eur. J. Appl. Physiol. 110, 489–498. doi: 10.1007/s00421-010-1521-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Spasic, S., Kesic, S., Kalauzi, A., and Saponjic, J. (2011). Different anesthesia in rat induces distinct inter-structure brain dynamic detected by higuchi fractal dimension. Fractals. 19, 113–123. doi: 10.1142/S0218348X1100521X

CrossRef Full Text | Google Scholar

Srinivasan, V., Eswaran, C., and Sriraam, N. (2007). Approximate entropy-based epileptic EEG detection using artificial neural networks. IEEE Trans. Inf. Technol. Biomed. 11, 288–295. doi: 10.1109/TITB.2006.884369

PubMed Abstract | CrossRef Full Text | Google Scholar

Swartwood, J. N., Swartwood, M. O., Lubar, J. F., and Timmermann, D. L. (2003). EEG differences in ADHD-combined type during baseline and cognitive tasks. Pediatr. Neurol. 28, 199–204. doi: 10.1016/S0887-8994(02)00514-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Teixeira, C., Direito, B., Feldwisch-Drentrup, H., Valderrama, M., Costa, R., Alvarado-Rojas, C., et al. (2011). EPILAB: a software package for studies on the prediction of epileptic seizures. J. Neurosci. Methods 200, 257–271. doi: 10.1016/j.jneumeth.2011.07.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Thongpanja, S., Phinyomark, A., Phukpattaranont, P., and Limsakul, C. (2013). Mean and median frequency of EMG signal to determine muscle force based on time-dependent power spectrum. Elektronika ir Elektrotechnika. 19, 51–56. doi: 10.5755/j01.eee.19.3.3697

CrossRef Full Text | Google Scholar

Tong, S., Bezerianos, A., Malhotra, A., Zhu, Y., and Thakor, N. (2003). Parameterized entropy analysis of EEG following hypoxic-ischemic brain injury. Physics Letters A. 314, 354–361. doi: 10.1016/S0375-9601(03)00949-6

CrossRef Full Text | Google Scholar

Van Hese, P., Philips, W., De Koninck, J., Van de Walle, R., and Lemahieu, I. (2001). “Automatic detection of sleep stages using the EEG,” in 2001 Conference Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Istanbul: IEEE, 1944–1947.

PubMed Abstract | Google Scholar

Vourkas, M., Micheloyannis, S., and Papadourakis, G. (2000). “Use of ann and hjorth parameters in mental-task discrimination,” in 2000 First International Conference Advances in Medical Signal and Information Processing (IEE Conf. Publ. No. 476). Bristol: IET, 327–332. doi: 10.1049/cp:20000356

CrossRef Full Text | Google Scholar

Wan, W., Cui, X., Gao, Z., and Gu, Z. (2021). Frontal EEG-based multi-level attention states recognition using dynamical complexity and extreme gradient boosting. Front. Hum. Neurosci. 15, 673955. doi: 10.3389/fnhum.2021.673955

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, D., Qian, M., Fang, Y., Lai, C., Li, D., and Chen, G. (1989). Revision on the Combined Raven's test for the Rural in China. Psychol. Sci. 5, 004.

PubMed Abstract | Google Scholar

Wang, Y., and Guan, L. (2008). Recognizing human emotional state from audiovisual signals. IEEE Transact. Multimed. 10, 936–946. doi: 10.1109/TMM.2008.927665

CrossRef Full Text | Google Scholar

Zheng, W.-L., Zhu, J.-Y., and Lu, B.-L. (2017). Identifying stable patterns over time for emotion recognition from EEG. IEEE Transact. Affective Comp. 10, 417–429. doi: 10.1109/TAFFC.2017.2712143

PubMed Abstract | CrossRef Full Text | Google Scholar

Zoefel, B., Huster, R. J., and Herrmann, C. S. (2011). Neurofeedback training of the upper alpha frequency band in EEG improves cognitive performance. Neuroimage. 54, 1427–1431. doi: 10.1016/j.neuroimage.2010.08.078

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: electroencephalogram (EEG), brain-computer interfaces (BCIs), attention recognition, valid paradigm, intra-subject, inter-subject, neural patterns

Citation: Chen D, Huang H, Bao X, Pan J and Li Y (2023) An EEG-based attention recognition method: fusion of time domain, frequency domain, and non-linear dynamics features. Front. Neurosci. 17:1194554. doi: 10.3389/fnins.2023.1194554

Received: 29 March 2023; Accepted: 22 June 2023;
Published: 12 July 2023.

Edited by:

Yingzi Lin, Northeastern University, United States

Reviewed by:

Jing Jin, East China University of Science and Technology, China
Geng Peng, Shijiazhuang Tiedao University, China
Mohammad Ashraful Amin, International Centre for Diarrhoeal Disease Research (ICDDR), Bangladesh

Copyright © 2023 Chen, Huang, Bao, Pan and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yuanqing Li, YXV5cWxpQHNjdXQuZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.