A novel method for optimizing epilepsy detection features through multi-domain feature fusion and selection

Kong, Guanqing; Ma, Shuang; Zhao, Wei; Wang, Haifeng; Fu, Qingxi; Wang, Jiuru

doi:10.3389/fncom.2024.1416838

ORIGINAL RESEARCH article

Front. Comput. Neurosci., 19 November 2024

Volume 18 - 2024 | https://doi.org/10.3389/fncom.2024.1416838

This article is part of the Research TopicEpilepsy Redefined: Cutting-Edge Machine Learning and Software Tools for Diagnosis and TreatmentView all 5 articles

A novel method for optimizing epilepsy detection features through multi-domain feature fusion and selection

Guanqing Kong^1,2

Shuang Ma^2,3^*

Wei Zhao^1,2

Haifeng Wang^2,3

Qingxi Fu^1,2

Jiuru Wang³

¹Health and Medical Big Data Laboratory, Linyi People's Hospital, Linyi, China
²Shandong Open Laboratory of Data Innovation Application, Linyi People's Hospital Health and Medical Big Data Center, Linyi, China
³School of Information Science and Engineering, Linyi University, Linyi, China

Background: The methods used to detect epileptic seizures using electroencephalogram (EEG) signals suffer from poor accuracy in feature selection and high redundancy. This problem is addressed through the use of a novel multi-domain feature fusion and selection method (PMPSO).

Method: Discrete Wavelet Transforms (DWT) and Welch are used initially to extract features from different domains, including frequency domain, time-frequency domain, and non-linear domain. The first step in the detection process is to extract important features from different domains, such as frequency domain, time-frequency domain, and non-linear domain, using methods such as Discrete Wavelet Transform (DWT) and Welch. To extract features strongly correlated with epileptic classification detection, an improved particle swarm optimization (PSO) algorithm and Pearson correlation analysis are combined. Finally, Support Vector Machines (SVM), Artificial Neural Networks (ANN), Random Forest (RF) and XGBoost classifiers are used to construct epileptic seizure detection models based on the optimized detection features.

Result: According to experimental results, the proposed method achieves 99.32% accuracy, 99.64% specificity, 99.29% sensitivity, and 99.32% score, respectively.

Conclusion: The detection performance of the three classifiers is compared using 10-fold cross-validation. Surpassing other methods in detection accuracy. Consequently, this optimized method for epilepsy seizure detection enhances the diagnostic accuracy of epilepsy seizures.

1 Introduction

In traditional medical epilepsy diagnosis, medical experts rely on their personal experience to visually inspect patients' EEG signals (Tatum, 2014). It is time consuming and analytically demanding to detect epilepsy manually. Medical experts have difficulties interpreting EEG signals due to the non-stationary nature of the signals, which may cause human interpretation errors and disagreements (Oliva and Rosa, 2019). As a result, computer-based methods have gradually replaced traditional medical detection methods, helping medical experts to identify epilepsy-related events in EEG recordings (Li et al., 2020; Vargas et al., 2021; Ramakrishnan and Murugavel, 2019; Türk and Özerdem, 2019).

There are four main challenges involved in implementing automatic epilepsy detection and classification: data preprocessing, feature extraction, feature selection, and classifier design. Recent epilepsy classifications have become increasingly dependent on feature selection, and an efficient feature selection method improves classification accuracy significantly. A low computational efficiency has resulted from manually extracting features from large EEG datasets, even as many developers continue to improve algorithms based on machine learning.

Previously, EEG signal features were extracted manually, using methods such as Wavelet Transform (WT), Short-Time Fourier Transform (STFT), and others to categorize various electrocardiograms and EEG signals. Machine learning, along with the continuous development of artificial intelligence, has spurred the development of automatic epilepsy detection technology. With the introduction of algorithms with decomposition, signal correlation, feature engineering, and other features, detection time and classification accuracy have been shortened and improved (Liu et al., 2021).

In addition to this, the following two unresolved problems exist in the current epilepsy detection research: (1) the time-domain features extracted from the original signals are not sufficient to be used as a feature set for epilepsy detection alone; and (2) the extracted features suffer from the problem of irrelevance and independence from epilepsy detection. A number of studies have proposed methods for solving the above problems, including ICFS, AsyLnCPSO-GA, and GA (Khalid et al., 2014; Wei et al., 2020; Prasetiyowati et al., 2020; Mursalin et al., 2017; Gao et al., 2020; Wang et al., 2022; Omidvar et al., 2021). When using the selected features for classification, the existing methods also have low accuracy and high redundancy.

In order to solve the first problem, this work proposes a method for extracting multidomain features from raw EEG datasets by using discrete wavelet transforms (DWT) and Welch methods to extract time-domain (TD), frequency-domain (FD), time-frequency-domain (TFD), and non-linear features from raw EEG datasets. Due to the large number of extracted features, it leads to overfitting of the classifier and the performance of the classifier is greatly reduced, so it is important to consider that the number of features should be proportional to the cost of training and prediction of the classifier, and the features with high relevance needed by the classifier should be selected (Khalid et al., 2014). For the second question, a feature optimization method (PMPSO) is developed that uses the modified particle swarm algorithm (MPSO) and Pearson's correlation coefficient to select features that are both relevant and independent. Compared with the standard particle swarm optimization algorithm (PSO), MPSO has made improvements in convergence speed and global search capabilities. MPSO introduces a key shrinkage factor ϕ. MPSO realizes efficient search on feature subsets and uses classification accuracy as the fitness function for evaluation. In order to further improve the accuracy of feature selection, MPSO combines the Pearson correlation coefficient to perform a second screening of the initially selected features. By calculating the linear correlation between features and removing redundant features with strong correlation, the final feature set has better independence. This not only reduces the training time of the model, but also improves the overall efficiency of epilepsy detection. By removing irrelevant elements, these techniques can positively affect the performance of constructing classifiers (Wei et al., 2020; Prasetiyowati et al., 2020), using feature selection techniques to compare and improve on different classifiers. The main contributions of this paper can be summarized in the following two points:

The EEG signal is decomposed into sub-bands after fusion using DWT, Welch and STFT methods to extract 35 features in a variety of fields. As significant features in various fields, these features improve classification accuracy greatly.

This paper proposes a new method for feature optimization called PMPSO. This efficient feature optimization method combines the improved Particle Swarm Optimization algorithm (MPSO) with the Pearson correlation coefficient. It aims to eliminate features that are unrelated to epilepsy and those with strong correlations among themselves. The final feature vector is the most representative and optimal.

The remaining sections of this paper are organized as follows: Section 2 introduces some of the developed epilepsy detection methods in related work; Section 3 presents the proposed automatic epilepsy detection method in this paper; Section 4 provides the classification experimental results using this method; finally, Section 5 describes the conclusions drawn from the research and outlines future work.

2 Related word

In this section, the work on seizure detection over the past two decades is discussed. This leads to the epilepsy detection method proposed in this paper by analyzing the current state of the work.

2.1 Related research on feature selection methods

Feature extraction methods based on raw EEG signals were still mostly manual during this period, with techniques such as WT and STFT widely applied for electrocardiogram and EEG classification. The method proposed by Mursalin et al. (2017) combines an improved Correlation-based Feature Selection (ICFS) with a Random Forest classifier to detect epilepsy. The ICFS was used to select prominent features from the time domain, frequency domain, and entropy-based features for RF classification, with 98.45% accuracy. Based on Approximate Entropy and Recurrence Quantification Analysis combined with Convolutional Neural Networks, Gao et al. (2020) presented an automated method for epilepsy EEG recordings, achieving sensitivity, specificity, and accuracy of 98.84, 99.35, and 99.26%, respectively. According to Wang et al. (2022), an improved PSO and Genetic Algorithm were combined to determine the optimal combination of features for epilepsy seizure detection in a hybrid model. By utilizing a novel Asynchronous Learning Factor Particle Swarm Optimization (AsyLnCPSO) and GA for feature selection, a classification accuracy of 95.35% was achieved. In Omidvar et al. (2021), 55 statistical and entropy-based features were extracted from raw EEG signals using DWT. Using GA for feature selection, they achieved improved accuracy, sensitivity, and specificity of 98.7, 97.5, and 100%, respectively. Haputhanthri et al. (2019, 2020) selected the FS4 feature set based on the correlated feature selection algorithm (CFS), which provided a relatively high level of accuracy when compared to other methods because it contained the mean and standard deviation of the five channels (FT9, P3, Oz, TP9, and FC2). Compared to the PMPSO method proposed in this study, the correlation between features is ignored, and the selected feature set has high redundancy and low independence.

The above-mentioned methods for feature selection have a significant impact on epilepsy detection. These methods still select feature sets that have redundant features and are not optimal. This work proposes a PMPSO method that consists of two feature selection techniques, which can be used to select independent feature sets that have strong correlations and the most representative attributes.

2.2 Related research on feature extraction methods

Sriraam and Raghu (2017) extracted 26 features from time domains, frequency domains, information theory, and statistics. On these features, Wilcoxon rank-sum tests were applied based on a 95% significance level, and an optimized SVM classifier reached the highest sensitivity, specificity, and accuracy, respectively, of 94.56, 89.74, and 92.15%. The method proposed by Oliva and Rosa (2021) was based on the binary fusion of three domain features (frequency, time-frequency, and nonlinear), producing a total of 105 features for multiclass classification. The work of Xiong et al. (2022) exploited Pearson correlation coefficients, mutual information, and permutation disalignment index to construct a three-layer network, extracting similar features in each network, and optimizing them based on an improved genetic algorithm. In the CHB-MIT database, the method achieved AC, SP, SE, and F1 of 97.26, 97.55, 96.89, and 97.11%, respectively. In the Siena scalp database, AC, SP, SE, and F1 reached 98.88, 99.13, 98.36, and 98.75%.

In the above-mentioned related works, good results have been achieved in the detection of epilepsy. Although the selected features have strong relevance to epilepsy detection, the correlation between them has not been reflected, and some redundant features remain. Based on the discussion of the above related works, epilepsy seizure detection still has redundant features. To address the existing challenges, this study constructs a comprehensive dual-feature selection method based on multi-task learning. The method extracts crucial features from multiple domains of EEG signals, which directly impacts classification accuracy. Preprocessed EEG signals are decomposed into five subbands (Gamma, Beta, Alpha, Theta, and Delta) using DWT. Specific potential features related to EEG signals' non-linear and dynamic structure are obtained from each subband. The logarithmic sum, mean, mean power, standard deviation, and ratio of absolute mean are extracted. Additionally, Welch's method calculates spectral density estimation features in different frequency bands, extracting 35 features belonging to different domains from the original EEG signals. Then, the feature selection optimization method (PMPSO) combining the MPSO method improved by PSO and the Pearson correlation coefficient is used to select features with high correlation and strong independence. The MPSO method introduces a shrinkage factor ϕ in PSO to overcome its limitation of fast convergence speed but easy to fall into local optimality, so that particles can search collaboratively in the local area, thereby optimizing features with high correlation. The Pearson correlation coefficient is used to perform a secondary screening of these selected features to remove redundant features with strong correlation and further enhance the independence of the features. Finally, SVM, RF, ANN, and XGBoost classifiers classify epilepsy patients, healthy individuals, and epilepsy seizure detection.

3 Material and method

This chapter introduces the proposed epileptic seizure detection system in four parts: raw data preprocessing, feature extraction, feature selection, and classification. Raw data is segmented and filtered in the preprocessing stage. DWT and Welch methods are then applied to EEG segments to extract features from different frequency subbands, resulting in a feature set of 35 features, including important features from various domains, greatly improving classification accuracy. With the PMPSO method, features with strong correlation and independence are selected, forming a representative optimal subset. Finally, the chosen feature subset is serially concatenated to form a feature vector used for training multiple classifiers. Figure 1 illustrates the overall architecture of the proposed method, and the following subsections detail each part of the system.

Figure 1

Figure 1. The architectural diagram of the proposed method.

3.1 EEG dataset

In this study, three electroencephalogram (EEG) datasets from different sources were used, namely the epilepsy dataset of the University of Bonn and the CHB-MIT scalp EEG dataset of Boston Children's Hospital. The performance of the PMPSO method was evaluated on these datasets, and the robustness and applicability of the method in different application scenarios were verified.

3.1.1 University of Bonn Epilepsy Dataset

In this work, the epilepsy dataset from the University of Bonn (http://epileptologie-bonn.de/cms/upload/workgroup/lehnertz/eegdata.html) (Andrzejak et al., 2001) was utilized. The dataset comprises EEG data from five epilepsy patients and five healthy individuals, organized into five subsets labeled A to E. Each subset contains 100 single-channel EEG segments with a continuous duration of 23.6 s, containing 4,097 data points. For healthy, interictal, and seizure periods, 200, 200, and 100 data were available, respectively. After undergoing 12-bit analog-to-digital conversion, the data was continuously written to disk at a sampling frequency of 173.61 Hz (Andrzejak et al., 2001). Potential interferences such as muscle artifacts and eye movement artifacts were removed from the data. Figure 2 illustrates EEG contrasts during healthy states, interictal intervals, and ictal periods.

Figure 2

Figure 2. Comparison of EEGs between healthy states, interictal intervals, and ictal periods.

The EEG data collected from subsets N, F, and S are from hippocampal structures and different electrode positions in epilepsy patients' lesions. Interictal EEG subsets N and F are of epileptic patients, and iCtal EEG subset S is of epileptic patients. The Z and O subsets were obtained from five healthy subjects during awake and relaxed states, with Z representing an eyes-open situation and O representing an eyes-closed situation. Table 1 shows sample EEG recordings for five datasets.

Table 1

Table 1. Description of the Bonn University Epilepsy Dataset.

3.1.2 CHB-MIT

This study used the CHB-MIT (https://physionet.org/content/chbmit/1.0.0/), a publicly available scalp EEG database developed by researchers at Boston Children's Hospital and Massachusetts Institute of Technology (Shoeb, 2009). The dataset contains EEG records of 23 pediatric patients with intractable epilepsy, including 5 males with an age range of 3–22 years and 18 females with an age range of 1.5–19 years (Goldberger et al., 2000). The records were labeled by experienced clinicians. The EEG records of each subject contained 9–42 EDF files with a total duration of ~983 h, including 198 epileptic seizure events. All EEG signals were recorded using the international 10-20 bipolar system with a sampling rate of 256 Hz and a resolution of 16 bits. For more details about the CHB-MIT database, see the study by Goldberger et al. (2000), and the relevant case details are shown in Table 2.

Table 2

Table 2. Detailed description of the CHB-MIT database.

3.2 Pre-processing

The purpose of this section is to provide a detailed overview of how raw EEG signals are preprocessed. Epileptic seizures and healthy states cannot be distinguished in some studies. Because EEG signals are relatively weak, they are easily disturbed by external factors or human physiological activities. Consequently, it is impossible to distinguish between epileptic seizures and signals from a healthy state, which may adversely affect experimental results (Riccio et al., 2024; Handa et al., 2023; Li et al., 2023; Pandey et al., 2022). In order to ensure data quality and accuracy, a series of preprocessing operations are carried out before extracting features from EEG signals.

Firstly, linear filters are employed to process the EEG signals. A simple fourth-order Butterworth bandpass filter with a range of 0.5–70 Hertz is included. This filter enhances the signal quality by eliminating unwanted frequency components. To suppress interference from power lines, a notch filter at 50 Hertz is also employed.

Besides mitigating artifacts caused by various factors, it is imperative to address the issue of limited data size as well. Continuous EEG data is usually very large, and the available data for the epilepsy data sample is only 500 data instances. Therefore, the long EEG data is segmented into shorter segments using a segmentation strategy. An overlap of 64 data points on the time axis is used with this strategy, using a fixed-size window of 1,024. With this approach, unstable EEG fragments are segmented into shorter, pseudo-stable EEG segments that have similar statistical characteristics. Expanded data for healthy, interictal and seizure periods to 5,700, 5,700, and 2,850.

Finally, the segmented 14,250 EEG fragments are divided into training, validation, and test sets with proportions of 90, 5, and 5%, respectively. The purpose is to facilitate the subsequent training and evaluation of the model. The preprocessing steps provide a reliable foundation for our research by enhancing the quality and applicability of the data.

3.3 Feature extraction

Feature engineering is an essential component in the detection of epileptic seizures. EEG signals during healthy states, ictal periods, and interictal intervals can only be distinguished by extracting significant features from them. As shown in Figure 3, various feature extraction methods are employed in this chapter to extract key features denoted as f₁, f₂, , , f_c from multiple domains. Multi-domain features are extracted by converting the original signal into a format suitable for multi-domain feature extraction, which helps extract key features in each domain. By adding diversity to the feature set, classification accuracy is significantly improved.

Figure 3

Figure 3. Flowchart of multi-domain feature extraction.

The feature extraction process encompasses multiple feature sets from different domains, each having distinct physical and statistical significance. It includes, but is not limited to, time-domain, frequency-domain, time-frequency-domain, and other domain-specific features. As a result of this diverse feature set, our classifier is able to capture various aspects of EEG signals, thereby enabling a comprehensive understanding and differentiation of EEG signals in different states.

3.3.1 Time-frequency domain feature extraction

A time-frequency domain feature extraction is achieved using DWT in this section. Multiple time and frequency scales are used in this method to represent signals in the time-frequency domain through approximation coefficients and detail coefficients. Signal variations can be more accurately described with this approach (Ibrahim et al., 2018). By analyzing the time and frequency information of the signal, extracted time-frequency domain features provide a comprehensive and integrated way to determine the signal's properties. The formula for calculating DWT is shown in equation:

D W T (i, j) = \frac{1}{\sqrt{| 2^{i} |}} \int_{- \infty}^{\infty} x (t) ψ (\frac{t - 2^{i} k}{2^{j}})

Among them, where i represents the frequency band range of the coefficient, j represents the position of the wavelet coefficient on the time axis or spatial location, x(t) represents the original EEG signal, ψ(.) represents the wavelet function (mother wavelet), and k is the variable of integration representing the integration across the entire time axis. The primary objective of applying DWT to the EEG signal x[n] for time-frequency analysis is to extract sub-signals in five different frequency ranges: Delta, theta, alpha, beta, and gamma. In this process, low-pass filters h[n] and high-pass filters g[n] are used to generate wavelet coefficients, which are transformed sub-bands. We obtain the approximation coefficient A₁ and the detail coefficient D₁ at the first level. Next, the same procedure is applied to the approximation coefficient A₁ of the first level to obtain the coefficients for the next level. The resulting coefficients D₁,D₂,D₃,D₄,D₅, and A₅ are used to represent EEG sub-bands, as illustrated in Figure 4. This figure depicts a simple diagram of the decomposition of the EEG signal into five coefficients using DWT.

Figure 4

Figure 4. Principle of DWT sub-bands decomposition.

Produce an efficient feature vector by calculating 5 features from each decomposition sub-band. The extracted feature vectors can be reduced in dimensionality by utilizing statistics on discrete wavelet coefficients (Kandaswamy et al., 2004). In this work, the following statistical features were computed using the DWT:

(1) Logarithmic sum of wavelet coefficients (LSWT)

The logarithmic sum of wavelet coefficients refers to taking the logarithm of the absolute values of wavelet coefficients. This aids in capturing information about the signal across different frequency sub-bands. Logarithmic sums can be computed using the following formula:

\begin{array}{l} L S W T = \sum_{j = 1}^{N} l n (| D W T (i, j) |) \end{array}

In the aforementioned equation, DWT(i, j) represents the wavelet coefficient, where i indicates the frequency band range of the coefficient, j indicates where it occurs in space or on the time axis, and N indicates the total energy of the wavelet coefficients in the subband.

(2) The average of the absolute values of coefficients in each subband (MEAN)

The average of the absolute values of coefficients in each subband helps understand the average amplitude of the signal in different frequency ranges. This enables identification of frequency components with significant amplitudes. The formula for calculating the average of the absolute values of coefficients is shown in equation:

M E A N = \frac{1}{N} \sum_{j = 1}^{N} | D W T (i, j) |

(3) The mean power of wavelet coefficients in each subband (ABS)

The mean power feature refers to the energy distribution of the signal in the frequency domain. It distinguishes the levels of energy within different frequency ranges in the signal. The formula for calculating the mean power of wavelet coefficients is shown in equation:

A B S = \frac{1}{N} \sum_{j = 1}^{N} {(D W T (i, j))}^{2}

(4) The standard deviation of coefficients in each subband (STD)

The amplitude distribution and fluctuation characteristics of different types of signals vary from frequency sub-band to frequency sub-band. Standard deviation features help identify and distinguish different types of signals by capturing the characteristics of these amplitude changes. The formula for calculating the standard deviation of coefficients is shown in equation:

S T D = \sqrt{\frac{1}{N} \sum_{j = 1}^{N} {(D W T (i, j) - M E A N)}^{2}}

(5) The ratio of the absolute average values of adjacent subbands (RAT)

Signal frequency changes can be identified by comparing the absolute average values of adjacent subbands. When the ratio is higher, it indicates that there are more pronounced frequency changes between adjacent subbands, while when it is lower, it indicates relatively small changes in frequency. The calculation formula for the ratio of absolute average values of adjacent subbands is given by equation:

R A T = \frac{\sum_{j = 1}^{N} | D W T (i, j) |}{\sum_{j = 1}^{N} | D W T (i, j + 1) |}

3.3.2 Frequency domain feature extraction

Frequency domain features are extracted using the Welch method in this section. An EEG signal's power spectral density (PSD) can be calculated by using this method. Brihadiswaran et al. (2019) summarize different techniques of feature extraction such as statistical feature extraction and entropy based techniques. Compared to these techniques, Welch's method is more suitable for processing EEG signals for epilepsy detection, the extracted features provide information about the energy distribution of the signal in different frequency ranges, which provides a more accurate understanding of the frequency characteristics of the signal (Zhang and Parhi, 2016), with the advantages of fast computation and multi-window selection. Following are the steps to calculate the PSD of EEG signal segments in different frequency bands using Welch's period gram method (Welch, 1967):

First, the EEG brainwave signal x(n) with a length of N is divided into L segments, each segment with a length of M, where N = ML. The calculation formula for each segment of EEG signal x_i(n) is shown in equation:

\begin{array}{l} x_{i} (n) = x (n + i M - M), 0 \leq n \leq M, 1 \leq i \leq L \end{array}

Then, use a window function w(n) to mitigate the impact of spectral leakage caused by the edges of time windows on EEG brainwave segments. Calculate the power spectrum of each segment of data using the discrete Fourier transform. The calculation formula is shown in equation:

\begin{array}{l} P_{i} (w) = \frac{1}{U} | \sum_{n = 0}^{M - 1} x_{i} (n) w (n) e^{- j w n} |^{2}, i = 1, 2,,, L - 1 \end{array}

Where $U = \frac{1}{M} \sum_{n = 0}^{M - 1} w^{2} (n)$ is the normalization factor, and w(n) is the window function.

Finally, the power spectra of all segments are averaged to obtain the power spectrum of the entire signal. This reduces the variance of each power measurement. The calculation formula is shown in:

\begin{array}{l} P (w) = \frac{1}{L} \sum_{i = 0}^{L} P_{i} (w) \end{array}

This work divided EEG brainwave segments into five frequency bands: delta (0.1–4 Hz), theta (4–8 Hz), alpha (8–12 Hz), beta (12–30 Hz), and gamma (30–70 Hz). The PSD calculation formula for the i_th frequency band is shown in equation:

\begin{array}{l} P_{i} = log \sum_{ω \in b a n d i} P (ω) \end{array}

As a result of this process, the PSD distribution of EEG signal segments in different frequency ranges can be obtained, which allows a better understanding of the signal's frequency characteristics. The method is widely used in signal processing, spectrum analysis, and frequency domain feature extraction, especially when reducing noise and obtaining smooth spectral estimates are crucial. As shown in Algorithm 1, the pseudocode for extracting PSD features from each sub-band frequency range using the Welch method in this study includes Delta, Theta, Alpha, Beta, and Gamma.

Algorithm 1

Algorithm 1. Welch spectrum estimation algorithm.

The calculation of PSD in different frequency ranges follows a methodology similar to the formal description in lines 5–12 of Algorithm 1. Lines 1–3 of Algorithm 1 initialize the parameters. Starting from line 5, the algorithm iteratively applies the window function to each segment of data, preparing for the execution of the Fast Fourier Transform (FFT) to obtain the frequency domain representation of the data. The squared magnitude of the FFT result is calculated in line 10. Finally, the spectral estimates for all segments are accumulated and averaged to obtain the final spectral estimate.

3.4 Feature selection

A detailed introduction to the Feature Optimization PMPSO method is provided in this chapter, which is employed to solve the problem of inadequately comprehensive feature selection. A total of 35 features are extracted for each EEG brain signal segment, including 30 time-frequency domain features and five power spectral density features. With the increasing number of features extracted from EEG signals, many irrelevant and redundant pieces of information are present in the time-frequency domain features, resulting in dimensionality catastrophe and a significant impact on the performance of the classifier. An algorithm for selecting features is therefore crucial. In order to reduce feature dimensions and eliminate redundancy, the focus is on selecting those EEG features that most effectively reflect the pre-seizure state. As a result, the classifier's performance and generalization ability are improved. Previous research has only considered one aspect of the correlation, either the correlation between features and seizure occurrence or the correlation among features. Therefore, this section proposes a feature selection method based on MPSO and Pearson correlations. With this method, the MPSO algorithm selects features that are highly correlated with epilepsy detection, thereby minimizing the impacts of irrelevant features on classification results and reducing network overfitting. The Pearson correlation coefficient is employed to calculate the correlation between features. The smaller the correlation between features, the greater the independence of features, leading to a more comprehensive and effective feature set. EEG signals from epileptic patients are better measured with this method (Zhong et al., 2023). In the end, the optimal epilepsy features are obtained after rigorous screening using both methods.

3.4.1 The MPSO algorithm selects features based on their correlation

A MPSO algorithm is presented in this subsection for selecting features with strong correlations. It simulates an individual searching for the best solution in a multi-dimensional space based on the individual best solution (pbest) and the global best solution (gbest), using the PSO algorithm. The particles in PSO represent birds in a flock that move through the search space at a velocity and position. It is the velocity that determines the speed of movement, while the position determines the direction. The individual best is determined independently by each particle in the search space, and this information is shared with all particles. To find the global optimum, particles compare their individual bests with the global best among all particles. Particle speed and position are adjusted based on their individual bests and the current global best. As the iterative process continues, particles collaborate and compete to come up with better solutions.

It is prone to getting stuck in local optima due to the fast convergence of the PSO algorithm. In order to address this issue, a constriction factor ϕ is introduced to limit the range of the factors c₁ and c₂, which control the updating of particle velocity. This helps to reduce the adverse effects that improper learning factor setting may have on the algorithm. The particles will also be able to conduct collaborative search in their immediate vicinity as a result of this. The contraction factor ϕ can be expressed mathematically as follows:

\begin{array}{l} φ = \frac{2}{| 2 + 4 c_{1} - \sqrt{2 c_{2}^{2} - 4 c_{1}} |} \end{array}

Therefore, the updated velocity formula after optimization is given by equation:

\begin{array}{l} v_{i}^{t} (k + 1) = w v_{i}^{t} (k) + φ [c_{1} r_{1} (p B e s t_{i}^{t} (k) - x_{i}^{t} (k)) \\ + c_{2} r_{2} (g B e s t_{i}^{t} (k) - x_{i}^{t} (k))] \end{array}

Here, $v_{i}^{t} (k)$ is the updated velocity of the i particle in the t dimension after the k iteration. φ is the constriction factor, c₁ and c₂ are the learning factors, r₁ and r₂ are random numbers between 0 and 1 for the k iteration, $p B e s t_{i}^{t} (k)$ is the individual best solution of the i particle in the t dimension after the k iteration, $g B e s t_{i}^{t} (k)$ is the global best solution in the t dimension among all particles after the k iteration, $x_{i}^{t} (k)$ is the current position of the i particle in the t dimension after the k iteration.

In this section, the MPSO algorithm is introduced for finding the optimal feature vector set for epilepsy detection in the feasible space. In Algorithm 2, the optimization process in the feasible space is explained and the algorithm pseudocode is provided. A similar process takes place in the MPSO for computing the optimal solution pbest as described in lines 3–16 of Algorithm 2. In each iteration, fitness values are computed for each particle in steps 3–8, along with individual and global bests. Each particle's position and velocity are then updated in steps 10–17.

Algorithm 2

Algorithm 2. Improved particle swarm optimization (MPSO).

MPSO algorithm is applied to feature optimization in the following manner: First, the feature values are sorted in the order of D_i_LSWT, D_i_MEAN, D_i_ABS, D_i_STD, D_i_RAT, Delta, Theta, Alpha, Beta and Gamma, where iε{1, 2, 3, 4, 5, 6}. A particle swarm optimization algorithm maps particles to binary representations of feature selection statuses. Each extracted feature has two conditions: selected and unselected, represented by 0 and 1, respectively. A binary vector of length 30 composed of 0s and 1s represents each result. As an example, the particle [000011000000100000000000000001] indicates the selection of D₅_LSWT, D₆_LSWT, D₁_ABS and power_gamma. An algorithm's fitness function is the classifier, and its fitness value is the classification accuracy of each feature combination (Wang et al., 2022). Figure 5 illustrates the detailed process. Each particle's feature values and mapped selection status are initially initialized, with pbest representing the historical best candidate solution for a single particle and gbest representing the population's best candidate solution. These parameters are updated in two scenarios:

(1) The classification performance of the new particle is better than pbest/gbest;

(2) The classification performance of the new particle is the same as pbest/gbest, but the number of features in its corresponding feature subset is smaller, the solution size is smaller.

Figure 5

Figure 5. The flowchart of the application of the MPSO algorithm in feature optimization.

After reaching the maximum iteration count or finding the global optimum, the iteration ends, and the final feature set selection result is passed on to the next method.

3.4.2 Filtering independent features

The Pearson correlation coefficient is primarily used in this subsection to select features for the second round of analysis. By eliminating features with strong correlations, enhancing feature independence, removing redundant features, and reducing the training time of the model, epilepsy detection becomes more efficient.

The Pearson correlation analysis is widely used to determine the strength and direction of a linear relationship between two variables. Based on the concept of covariance, a correlation coefficient r is calculated by dividing two variables' covariances by the product of their standard deviations, resulting in a range of −1 to 1. The formula for calculating the Pearson correlation coefficient r is shown in equation:

\begin{array}{l} r = \frac{\sum_{i = 1}^{n} (X_{i} - \bar{X}) (Y_{i} - \bar{Y})}{\sqrt{\sum_{i = 1}^{n} {(X_{i} - \bar{X})}^{2}} \sqrt{\sum_{i = 1}^{n} {(Y_{i} - \bar{Y})}^{2}}} \end{array}

The correlation coefficient r has a range of values between [−1, +1], and X and Y represent two features. There is a negative correlation when the value is negative, a positive correlation when the value is positive, and no correlation when the value is zero. In general, the closer the correlation coefficient is to 0, the weaker the correlation; the closer it is to −1 or +1, the stronger the correlation.

3.5 Applying PMPSO optimized features in the classifier

In this section, we apply the optimal feature vectors extracted using the PMPSO feature optimization method to four different classifiers: ANN, SVM, RF and XGBoost. An ANN consists of an input layer, a hidden layer with 19 neural units, and an output layer with three nodes. For discrete prediction, the softmax output with cross-entropy loss is used, and for real-value prediction, the linear output with square loss is used.

In classification and regression analysis, SVM is a supervised learning method for analyzing data and identifying patterns (Kumar et al., 2017; Vapnik and Cortes, 1995). SVM classification involves separating data points using a hyperplane for input classification (Vapnik and Cortes, 1995). A SVM focuses on support vectors, the data points closest to the decision boundary, which makes it less susceptible to outliers and noise. Complex data can be handled well by SVM because of this property. In order to improve the accuracy of the three-class epilepsy problem, a fifth-order polynomial function is used with adjusted key parameters γ and c. Parameter γ controls the influence range of a single training example on the classification boundary. Parameter c balances correct classification and margin maximization, set to γ = 0.1 and c = 1.

The RF classifier is a machine learning model based on the bagging concept, introduced by Breiman (2001), incorporating additional randomness. A RF model consists of multiple simple decision tree predictors, each of which produces an output based on a set of predictor values. A decision tree is simultaneously constructed by RF by using different bootstrap samples, changing how classification or regression trees are traditionally constructed (Breiman, 2001).

The XGBoost classifier is a tree boosting method. It builds a strong classifier by gradually building multiple weak classifiers (usually decision trees) and combining their predictions. Each step reduces the weight of the previous round of incorrect predictions, allowing the model to gradually learn data points that are difficult to classify. Compared with traditional gradient boosting methods, XGBoost further improves performance through a variety of optimization techniques (Chen and Guestrin, 2016).

4 Results and discussion

An analysis of the design and implementation of the proposed epilepsy detection model is presented in this chapter. To work the impact of design on multi-domain feature extraction and PMPSO's performance metrics, experiments were conducted. A statistical analysis of the features extracted using DWT and Welch methods is included in the testing of multi-domain feature extraction. Experiments on the MPSO algorithm for filtering correlations between features and Pearson correlation analysis for determining feature independence are included in PMPSO. Using the random forest classifier, ablation experiments showed that the two proposed methods greatly improved seizure detection efficiency. Four classifiers were used to classify the selected optimal feature vector: ANN, RF, SVM and XGBoost.

4.1 Statistical analysis of features

Under this subsection, 35 features extracted from different domains are analyzed statistically using methods such as DWT and Welch. DWT is initially used to decompose the original EEG signal. A high-pass filter g[n] and a low-pass filter h[n] are used to decompose the original EEG signal into five subbands, represented by coefficients D1, D2, D3, D4, and A4. A number of statistical features are computed for each subband, including LSWT, Mean, ABS, STD, and Ratio. The scatterplot of the proposed features is shown below in Figure 6A, which randomly selects five of the extracted features as a comparison. It is clearer from the scatterplot in Figure 6A that the three categories are separated more clearly. However, a few cluster closer together. The features extracted from seizures are distinctly different from those extracted from healthy and inter-seizure periods.

Figure 6

Figure 6. (A) Scatter plot of features extracted using the DWT method, (B) scatter plot of features extracted using the Welch method.

A PSD is extracted using the Welch method, which estimates frequency-dependent features and aids in understanding static properties that capture both static and dynamic attributes of time-evolving properties, serving as a seizure detection feature (Harpale and Vinayak, 2021). To minimize the loss of edge information during data processing, overlapping time windows are used. For each window, the discrete Fourier transform is applied to calculate the periodicity of the signal. Finally, each period gram is squared before being averaged, reducing the variance of the power spectral density measurements. Figure 6A presents the scatter plot of the calculated average PSD frequency features. In Figure 6B, interictal and ictal periods are clearly distinguished, with little overlap between the two, but the health category is difficult to distinguish.

4.2 Application results of MPSO

In this section, features that are closely related to the study are selected using the improved Particle Swarm Optimization algorithm (MPSO). In each iteration of the particle swarm, when updating the particle's position and velocity, the MPSO algorithm introduces a contraction factor ϕ to limit the range of learning factors c₁ and c₂, effectively controlling the change of the velocity vector. Brihadiswaran et al. (2019) summarize some commonly used feature selection techniques such as correlation-based feature selection (CFS), analysis of variance (ANOVA), PCA, and input selection and test training (TWIST). From a multi-domain feature set, the improved MPSO method selects more features relevant for epilepsy detection. By using the contraction factor ϕ, the MPSO method mitigates the effects of improper learning factor settings on the algorithm's performance. The particles are also encouraged to search for solutions collaboratively in the local area. As a result, the MPSO algorithm has a stronger exploratory power and is less likely to fall into local optima than the PSO algorithm.

Using the feature set F as the input for the MPSO algorithm, F = {D_i_LSWT, D_i_Mean, D_i_ABS, D_i_STD, D_i_Ratio, power_delta, power_theta, power_alpha, power_beta, power_gamma}, where i∈{1, 2, 3, 4, 5, 6}, the classification results of the classifier are used as the fitness functions for each feature. The correlation weights of each feature for epilepsy detection are calculated after multiple iterations of optimization, as shown in Figure 7.

Figure 7

Figure 7. The importance of features in MPSO algorithm.

As shown in Figure 7, D₅_ABS and power_theta have the highest importance for epilepsy classification when using the MPSO method. However, LSWT, Mean, Ratio features computed for each sub-band have a very low percentage of contribution to epilepsy classification and have a significant inhibitory effect. For the next module, the top 10 features with strong correlations are retained based on their correlation weights. In this work, the following 10 features are retained for Pearson correlation analysis: D₅_ABS, power_theta, D₁_ABS, D₂_ABS, D₂_STD, D₆_STD, D₁_STD, power_gamma, D₃_STD, D₅<uscore>STD.

4.3 Application results of Pearson correlation analysis

Pearson r represents the correlation coefficient between variable X and variable Y in this section. The value of r ranges from −1 to 1, and the expression for r is as follows:

r = {\begin{matrix} 1, p e r f e c t p o s i t i v e c o r r e l a t i o n \\ - 1, p e r f e c t n e g a t i v e c o r r e l a t i o n \\ 0, n o l i n e a r r e l a t i o n s h i p \end{matrix}

Whenever the correlation coefficient r is close to 1, it indicates a strong linear relationship between two variables, X and Y. In contrast, when the absolute value of the correlation coefficient is close to 0, it indicates that the two variables have no linear relationship, indicating that their variations are not related. A correlation coefficient's sign also indicates the direction of the relationship between variables.

In this work, correlation coefficient r was calculated using pairwise combinations of the 10 features mentioned above. Since the number of base features is large, we set the threshold δ at 0.6. A feature was removed from the feature set if its calculated correlation coefficients r between feature X and feature Y exceeded a particular threshold δ. By eliminating that feature from subsequent calculations, computational resources were saved. Table 3 shows the correlation coefficient r calculated using Pearson correlation analysis. First, the correlation coefficient r values between D₅_ABS and power_theta, D₃_STD, and D₅_STD are all above threshold δ. This indicates that this feature's independence is weak, so it is excluded from the feature vector. The features selected later are similar. Finally, a feature vector consisting of D₂_ABS, D₅_STD, power_gamm, power_theta and D₁_ABS was selected as input for the classifier.

Table 3

Table 3. The calculation of the correlation coefficient r between features.

4.4 Performance evaluation

In order to evaluate the effectiveness of the developed algorithm for distinguishing seizure and interictal states, four evaluation metrics will be used to assess its performance. These metrics are SE, SP, AC, and F1. The definitions of these evaluation metrics are as follows:

SE refers to the proportion of actual positive instances that the model correctly identifies.

\begin{array}{l} S E = \frac{T P}{T P + F N} \times 100 % \end{array}

SP refers to the proportion of actual negative instances that the model correctly identifies among all true negative instances. Specificity describes the model's ability to distinguish negative instances.

\begin{array}{l} S P = \frac{T N}{T N + F P} \times 100 % \end{array}

AC is the ratio of the number of samples correctly predicted by the model to the total number of samples in all instances. These parameters are defined as follows:

\begin{array}{l} A C = \frac{T P + T N}{T P + F N + T N + F P} \times 100 % \end{array}

The F1 score is a comprehensive metric for evaluating model performance, commonly used in binary or multiclass classification problems. It combines two key performance metrics: precision and sensitivity. The F1 score is calculated using the following formula:

\begin{array}{l} F 1 = \frac{2^{*} (A C^{*} S E)}{(A C + S E)} \times 100 % \end{array}

Where TP is true positive, FN is false negative, TN is true negative, and FP is false positive. These performance metrics are used to evaluate the performance of the proposed model in this study.

4.5 Ablation study

This work aims to use the PMPSO feature optimization method, which consists of MPSO, Pearson, and RF classifiers. In order to analyze the effectiveness of each module in PMPSO, an ablation study was conducted on the Bonn University Epilepsy Dataset, as shown in Figure 8. Specifically, the study derived the following four model variants.

(1) Classification using only the Random Forest classifier.

(2) RF + MPSO: MPSO without Pearson correlation analysis, combined with the Random Forest classifier.

(3) RF + Pearson: Pearson correlation analysis without MPSO, combined with the Random Forest classifier.

(4) RF + PMPSO: Training MPSO and Pearson correlation analysis together.

Figure 8

Figure 8. The results of ablative experiments with different methods.

The following conclusions can be drawn from the ablation study shown in Figure 8 and Table 4. First of all, MPSO can improve classification performance, demonstrating the need to model epileptic seizures in conjunction with feature vectors. In addition, the Pearson correlation analysis improves classification efficiency by removing redundant features from the feature vector by comparing RF and RF + Pearson. In conclusion, PMPSO significantly improves performance in epileptic seizure detection compared to the other three variants in the ablation study. The proposed feature optimization method shows significant performance improvement in epileptic seizure detection.

Table 4

Table 4. The results of the ablation study.

4.6 Comparison with baseline methods

In this section, we will compare the PMPSO feature optimization method for epilepsy detection to the baseline PSO method. In the experiment, PMPSO not only introduced the shrinkage factor ϕ to dynamically adjust the search range of the particle swarm, avoiding the problem that the traditional PSO algorithm easily falls into local optimality during the optimization process, but also enhanced the speed and position update process by optimizing the speed and position update process. Global search capabilities of the model. This improvement enables PMPSO to more effectively balance the exploration and exploitation processes, resulting in faster convergence and fewer iterations of feature selection. Compared with the baseline PSO, PMPSO shows stronger robustness and efficiency in feature selection. In addition, PMPSO performs secondary feature screening combined with Pearson correlation coefficient to remove redundant features and improve the independence of features. This dual optimization strategy ensures that feature subsets are more relevant and reduces the possibility of overfitting, thereby significantly improving model performance in classification tasks.

According to Table 5, the improved MPSO method significantly outperforms the baseline PSO method in terms of classification accuracy. With a post-selection feature subset size of 5, PMPSO improves classification accuracy to 98.59%, a 9.09% improvement over the baseline PSO. The precision, recall, and F1 score of PMPSO are also higher than those of baseline PSO, indicating that the feature subset selected by PMPSO can improve the classifier's overall performance. Through its improved feature selection strategy, PMPSO can more efficiently select the features that have a higher contribution to the classification task, improving the model's performance.

Table 5

Table 5. Comparison of classification performance of baseline PSO methods for MPSO and PMPSO feature optimization methods.

This Table 6 compares the computational efficiency of PMPSO and baseline PSO methods. Despite the 59.8 s average computation time for PMPSO, which is slightly longer than baseline PSO (47.2 s) mainly due to the dynamic adjustment strategy and the additional shrinkage factor ϕ. In contrast, PMPSO (Dong et al., 2023) requires fewer iterations to complete feature selection than baseline PSO (Sun et al., 2022) suggesting that its optimization process is more efficient and is able to complete feature selection in fewer iterations. Since the sharp stop strategy is implemented during the training process, it is evident from the table that the PMPSO method achieves higher accuracy in fewer training epochs than the baseline PSO.

Table 6

Table 6. Comparison of computational efficiency of baseline PSO methods for MPSO and PMPSO feature optimization methods.

4.7 Comparison with state-of-the-art feature selection techniques

In order to verify the effectiveness of the proposed PMPSO method in the feature selection task, this experiment selected the genetic algorithm (GA) and the mutual information-based feature selection method (MIS) for experimental comparison. These methods are widely used in feature selection problems and can effectively reduce redundant features and improve the performance of the classifier. In the epilepsy dataset of the University of Bonn and under the same experimental conditions, PMPSO, GA and MIS were used for feature selection, and the selected features were input into the classifier. In order to ensure the fairness of the experiment, the parameter settings and optimization processes of all methods were kept consistent. The classifier used random forest (RF), and the performance of the model was evaluated by 10-fold cross validation. The experimental evaluation indicators include classification accuracy, the number of features after feature selection, and the running time of the algorithm.

Table 7 shows the performance of the three feature selection methods under different evaluation indicators. From the experimental results, it can be seen that the PMPSO method performs well in classification accuracy, reaching an accuracy of 98.59%, which is significantly better than GA (96.35%) and MIS (95.48%). In addition, the number of features selected by the PMPSO method is relatively small, only 5 features, while GA and MIS select 12 and 15 features, respectively. This shows that PMPSO can effectively remove redundant features while retaining key features, thereby improving the generalization ability of the model. In terms of running time, the average calculation time of PMPSO is 59.8 s, which is better than GA's 85.7 s and slightly higher than MIS's 72.3 s. PMPSO shows significant advantages in accuracy, feature subset size and running time, reflecting its unique innovation and practical application value in feature selection tasks. By introducing an improved particle swarm optimization algorithm, PMPSO demonstrates stronger robustness and higher feature selection efficiency in epilepsy detection tasks, making it an effective supplement and improvement to existing feature selection technology.

Table 7

Table 7. Performance comparison of different feature selection methods.

4.8 Multi-model classification experiments

In this experiment, in order to verify the effectiveness of the PMPSO feature optimization method in epilepsy detection, a multi-model classification experiment was designed, using the Bonn University Epilepsy Dataset and the Boston Children's Hospital CHB-MIT Dataset. The PMPSO method was applied to three common classification models: artificial neural network (ANN), support vector machine (SVM), random forest (RF) and XGBoost. Due to computing resource limitations, the training set of the SVM model was reduced to 2,000 samples on the University of Bonn and CHB-MIT datasets, while the ANN, RF and XGBoost models used the full dataset. A comprehensive experiment was conducted on the three-classification task of epilepsy detection (healthy, interictal, and epileptic seizure).

The experimental results are shown in Table 8. The experimental results on the Bonn University dataset show that the PMPSO method performs very well in different classifiers. The accuracy of ANN, SVM, RF and XGBoost models reached 98.11, 98.25, 98.59 and 99.32% respectively. Among them, the XGBoost model performed best in various evaluation indicators, with a specificity of 99.64%, a sensitivity of 99.29% and an F1 score of 99.32%. In order to enhance the statistical credibility of the results, this paper calculated the 95% confidence interval for each model. The 95% confidence interval of the classification accuracy of the XGBoost model is (98.61, 99.75), while the 95% confidence intervals of the accuracy of the RF, ANN and SVM models are (97.80, 99.38), (97.20, 99.02), and (97.30, 99.18), respectively, which further proves the stability of the PMPSO method.

Table 8

Table 8. Classification results of the PMPSO feature optimization method.

Experiments on the CHB-MIT dataset further validated the robustness of PMPSO. The original EEG signal of each patient was used as the input signal of the algorithm. The classification results for each patient were calculated and the average evaluation parameters were reported, as shown in Table 9. On the EEG data of 23 pediatric patients with intractable epilepsy, the accuracy on the RF classifier was 98.42%, the specificity was 98.83%, the sensitivity was 98.61%, and the F1 score was 98.57%. The confidence interval of the accuracy of the RF classifier on the CHB-MIT dataset is (97.90, 98.95), which shows the consistency of the performance of this method on different patient data.

Table 9

Table 9. Classification results of PMPSO feature optimization method on CHB-MIT dataset.

Additionally, confusion matrix plots were generated using evaluation metrics, illustrating the accuracy, sensitivity, and specificity of the four classifiers. In Figures 9, 10, you can clearly see the specific classification of health, epileptic seizures, and interictal periods. The current work on epilepsy detection is compared to recent research in Tables 10, 11. Using different feature extraction and selection methods, we significantly improved classification accuracy and efficiency under the same dataset conditions. Compared to other algorithms, the proposed algorithm shows substantial improvements in performance, as shown in Tables 10, 11.

Figure 9

Figure 9. Confusion matrix plots for (A) RF, (B) ANN, (C) SVM, and (D) XGBoost classification using the Bonn epilepsy dataset.

Figure 10

Figure 10. Confusion matrix of the RF classifier using the CHB-MIT dataset.

Table 10

Table 10. Performance comparison of different methods on the Bonn database.

Table 11

Table 11. Comparison of state-of-the-art epileptic seizure detection methods evaluated using the CHB-MIT dataset.

5 Conclusion

By focusing on feature extraction and feature selection, this paper explores an effective approach to epilepsy seizure detection using EEG data. First, 35 measurement features were extracted using methods such as DWT and Welch. Thereafter, a novel feature optimization method, Particle Swarm Optimization with Modified Shrinkage Factor (PMPSO), was developed to select features that are more relevant to epilepsy detection, reducing feature redundancy and enhancing detection efficiency and accuracy. A four-classifier approach was used to evaluate selected feature subsets: ANN, SVM, RF, and XGBoost. Finally, the results were assessed through 10-fold cross-validation. The experiments demonstrated better performance than previous works that overlooked feature selection and relied on deep learning methods. The introduced model benefited from the application of computational techniques in feature selection, enhancing the signal processing aspect of machine learning methods. The results validate the proposed approach by significantly reducing the computational workload while achieving comparable results through a substantial reduction in the number of features extracted from EEG segments.

For future work, the following plans are outlined: (1) Proposing feature selection techniques more suitable for epilepsy seizure detection; (2) Further reducing the number of features while maintaining or improving classification accuracy, aiming to use the minimum number of features for optimal classification efficiency; (3) Evaluating the model on epilepsy signal databases with more channels.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Ethics statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent from the patients/participants or patients/participants' legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.

Author contributions

GK: Conceptualization, Methodology, Writing – original draft. SM: Methodology, Writing – original draft. WZ: Funding acquisition, Methodology, Project administration, Writing – review & editing. HW: Data curation, Investigation, Supervision, Writing – review & editing. QF: Investigation, Methodology, Writing – review & editing. JW: Data curation, Investigation, Methodology, Supervision, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This project was supported by Shandong Province Science and Technology Small and Medium Enterprises Innovation Ability Enhancement Project of China (No. 2023TSGC0449), Introduction and Cultivation Program for Young Innovative Talents of Universities in Shandong Province of China (2021QCYY003), Linyi Key Research and Development Project (No. 2023YX0041), and the Science and Technology Development Foundation of Affiliated Hospital of Xuzhou Medical University (XYFM202225).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Andrzejak, R. G., Lehnertz, K., Rieke, C., Mormann, F., David, P., and Elger, C. E. (2001). Indications of nonlinear deterministic and finite dimensional structures in time series of brain electrical activity: dependence on recording region and brain state. Phys. Rev. E 64:061907. doi: 10.1103/PhysRevE.64.061907

PubMed Abstract | Crossref Full Text | Google Scholar

Bhanot, N., Mariyappa, N., Anitha, H., Bhargava, G. K., Velmurugan, J., and Sinha, S. (2020). Seizure detection and epileptogenic zone localisation on heavily skewed MEG data using RUSBoost machine learning technique. Int. J. Neurosci. 132, 963–974. doi: 10.1080/00207454.2020.1858828

PubMed Abstract | Crossref Full Text | Google Scholar

Breiman, L. (2001). Random forests. Mach. Learn. 45, 5–32. doi: 10.1023/A:1010933404324

Crossref Full Text | Google Scholar

Brihadiswaran, G., Haputhanthri, D., Gunathilaka, S., Meedeniya, J., and Jayarathna, S. (2019). EEG-based processing and classification methodologies for autism spectrum disorder: a review. J. Comp. Sci. 15, 1161–1183. doi: 10.3844/jcssp.2019.1161.1183

PubMed Abstract | Crossref Full Text | Google Scholar

Cai, H., Yan, Y., Liu, G., Cai, J., Cheok, A. D., Liu, N., et al. (2024). WKLD-based feature extraction for diagnosis of epilepsy based on EEG. IEEE Access 12, 69276–69287. doi: 10.1109/ACCESS.2024.3401568

Crossref Full Text | Google Scholar

Chen, T., and Guestrin, C. (2016). “XGBoost: a scalable tree boosting system,” in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '16), 785–794.

Google Scholar

Cimr, D., Fujita, H., Busovsky, D., and Cimler, R. (2024). Enhancing EEG signal analysis with geometry invariants for multichannel fusion. Inf. Fus. 102:102023. doi: 10.1016/j.inffus.2023.102023

Crossref Full Text | Google Scholar

Dong, F., Yuan, Z., Wu, D., Jiang, L., Liu, J., and Hu, W. (2023). Novel seizure detection algorithm based on multi-dimension feature selection. Biomed. Signal Process. Control 84:104747. doi: 10.1016/j.bspc.2023.104747

Crossref Full Text | Google Scholar

Gao, X., Yan, X., Gao, P., Gao, X., and Zhang, S. (2020). Automatic detection of epileptic seizure based on approximate entropy, recurrence quantification analysis and convolutional neural networks. Artif. Intell. Med. 102:101711. doi: 10.1016/j.artmed.2019.101711

PubMed Abstract | Crossref Full Text | Google Scholar

Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., et al. (2000). PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. Circulation 101, e215–e220. doi: 10.1161/01.CIR.101.23.e215

PubMed Abstract | Crossref Full Text | Google Scholar

Handa, P., Handa, P., Mathur, M., and Goel, N. (2023). EEG datasets in machine learning applications of epilepsy diagnosis and seizure detection. SN Comp. Sci. 4:437. doi: 10.1007/s42979-023-01958-z

Crossref Full Text | Google Scholar

Haputhanthri, D., Brihadiswaran, G., Gunathilaka, S., Meedeniya, D., Jayarathna, S., Jaime, M., et al. (2020). Integration of facial thermography in EEG-based classification of ASD. Int. J. Autom. Comput. 17, 837–854. doi: 10.1007/s11633-020-1231-6

Crossref Full Text | Google Scholar

Haputhanthri, D., Brihadiswaran, G., Gunathilaka, S., Meedeniya, D., Jayawardena, Y., and Jayarathna, S. (2019). “An EEG based channel optimized classification approach for autism spectrum disorder,” in 2019 Moratuwa Engineering Research Conference (MERCon) (IEEE).

Google Scholar

Harpale, V., and Bairagi, V. (2021). An adaptive method for feature selection and extraction for classification of epileptic EEG signal in significant states. J. King Saud Univ. Comput. Inform. Sci. 33, 668–676. doi: 10.1016/j.jksuci.2018.04.014

Crossref Full Text | Google Scholar

Harpale, V., and Vinayak, B. (2021). An adaptive method for feature selection and extraction for classification of epileptic EEG signal in significant states. J. King Saud Univ. 33, 668–676. doi: 10.1016/j.jksuci.2018.04.014

Crossref Full Text | Google Scholar

Ibrahim, S., Djemal, R., and Alsuwailem, A. (2018). Electroencephalography(EEG) signal processing for epilepsy and autism spectrum disorder diagnosis. Biocybern. Biomed. Eng. 38, 16–26. doi: 10.1016/j.bbe.2017.08.006

Crossref Full Text | Google Scholar

Kandaswamy, A., Kumar, C. S., Ramanathan, R. P., Jayaraman, S., and Malmurugan, N. (2004). Neural classification of lung sounds using wavelet coefficients. Comput. Biol. Med. 34, 523–537. doi: 10.1016/S0010-4825(03)00092-1

PubMed Abstract | Crossref Full Text | Google Scholar

Khalid, S., Khalil, T., and Nasreen, S. (2014). “A survey of feature selection and feature extraction techniques in machine learning,” in 2014 Science and Information Conference (IEEE), 372–378.

Google Scholar

Kumar, N., Alam, K., and Siddiqi, A. H. (2017). Wavelet transform for classification of EEG signal using SVM and ANN. Biomed. Pharmacol. J. 10, 2061–2069. doi: 10.13005/bpj/1328

PubMed Abstract | Crossref Full Text | Google Scholar

Li, C., Zhou, W., Liu, G., Zhang, Y., Geng, M., Liu, Z., et al. (2021). Seizure onset detection using empirical mode decomposition and common spatial pattern, IEEE Trans. Neural Syst. Rehabil. Eng. 29, 458–467. doi: 10.1109/TNSRE.2021.3055276

PubMed Abstract | Crossref Full Text | Google Scholar

Li, Y., Liu, Y., Cui, W.-G., Guo, Y.-Z., Huang, H., and Hu, Z.-Y. (2020). Epileptic seizure detection in EEG signals using a unified tem-poral-spectral squeeze-and-excitation network. IEEE Trans. Neural Syst. Rehabil. Eng. 28, 782–794. doi: 10.1109/TNSRE.2020.2973434

PubMed Abstract | Crossref Full Text | Google Scholar

Li, Y., Yang, Y., Zheng, Q., Liu, Y., Wang, H., Song, S., et al. (2023). Dynamical graph neural network with attention mechanism for epilepsy detection using single channel EEG. Med. Biol. Eng. Comp. 62, 307–326. doi: 10.1007/s11517-023-02914-y

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, G., Xiao, R., Xu, L., and Cai, J. (2021). Minireview of epilepsy detection techniques based on electroencephalogram signals. Front. Syst. Neurosci. 15:685387. doi: 10.3389/fnsys.2021.685387

PubMed Abstract | Crossref Full Text | Google Scholar

Majzoub, S., Fahmy, A., Sibai, F., Diab, M., and Mahmoud, S. (2023). Epilepsy detection with multi-channel EEG signals utilizing AlexNet. Circ. Syst. Signal Process. 42, 6780–6797. doi: 10.1007/s00034-023-02423-1

Crossref Full Text | Google Scholar

Mouleeshuwarapprabu, R., and Kasthuri, N. (2023). Feature extraction and classification of EEG signal using multilayer perceptron. J. Elect. Eng. Technol. 18, 3171–3178. doi: 10.1007/s42835-023-01508-w

Crossref Full Text | Google Scholar

Mursalin, M., Zhang, Y., Chen, Y., and Chawla, N. V. (2017). Automated epileptic seizure detection using improved correlation-based feature selection with random forest classifier. Neurocomputing 241, 204–214. doi: 10.1016/j.neucom.2017.02.053

Crossref Full Text | Google Scholar

Oliva, J. T., and Rosa, J. L. G. (2019). Classification for EEG report gen-eration and epilepsy detection. Neurocomputing 335, 81–95. doi: 10.1016/j.neucom.2019.01.053

Crossref Full Text | Google Scholar

Oliva, J. T., and Rosa, R. J. L. (2021). Binary and multiclass classifiers based on multitaper spectral features for epilepsy detection. Biomed. Signal Process. Control 66:102469. doi: 10.1016/j.bspc.2021.102469

Crossref Full Text | Google Scholar

Omidvar, M., Zahedi, A., and Bakhshi, H. (2021). EEG signal processing for epilepsy seizure detection using 5-level Db4 discrete wavelet transform, GA-based feature selection and ANN/SVM classifiers. J. Ambient Intell. Humaniz. Comput. 12, 10395–10403. doi: 10.1007/s12652-020-02837-8

Crossref Full Text | Google Scholar

Pandey, S. K., Janghel, R. R., Mishra, P. K., and Ahirwal, M. K. (2022).Automated epilepsy seizure detection from EEG signal based on hybrid CNN and LSTM model.Signal, Image and Video Processing 17, 1113–1122. doi: 10.1007/s11760-022-02318-9

Crossref Full Text | Google Scholar

Prasetiyowati, M. I., Maulidevi, N. U., and Surendro, K. (2020). “The speed and accuracy evaluation of random forest performance by selecting features in the transformation data,” in Proceedings of the 2020 The 9th International Conference on Informatics, Environment, Energy and Applications. IEEA 2020 (New York, NY: Association for Computing Machinery), 25–130.

Google Scholar

Ramakrishnan, S., and Murugavel, A. S. M. (2019). Epileptic seizure detection using fuzzy-rules-based sub-band specific features and layered multi-class svm. Pattern Anal. Appl. 22, 1161–1176 doi: 10.1007/s10044-018-0691-6

Crossref Full Text | Google Scholar

Riccio, C., Martone, A., Zazzaro, G., and Pavone, L. (2024). Training datasets for epilepsy analysis: preprocessing and feature extraction from electroencephalography time series. Data 9:61. doi: 10.3390/data9050061

Crossref Full Text | Google Scholar

Sairamya, N. J., Subathra, M. S. P., Suviseshamuthu, E. S., and Thomas George, S. (2021). A new approach for automatic detection of focal EEG signals using wavelet packet decomposition and quad binary pattern method. Biomed. Signal Process. Control 63:102096. doi: 10.1016/j.bspc.2020.102096

Crossref Full Text | Google Scholar

Shoeb, A. H. (2009). Application of Machine Learning to Epileptic Seizure Onset Detection and Treatment (Ph.D. thesis). Massachusetts Institute of Technology.

Google Scholar

Song, Y., Fan, C., and Mao, X. (2024). Optimization of epilepsy detection method based on dynamic EEG channel screening. Neural Netw. 172:106119. doi: 10.1016/j.neunet.2024.106119

PubMed Abstract | Crossref Full Text | Google Scholar

Sriraam, N., and Raghu, S. (2017). Classification of focal and non focal epileptic seizures using multi-features and SVM classifier. J. Med. Syst. 41:160. doi: 10.1007/s10916-017-0800-x

PubMed Abstract | Crossref Full Text | Google Scholar

Sun, Q., Liu, Y., Li, S., and Wang, C. (2022). Automatic epileptic seizure detection using PSO-based feature selection and multilevel spectral analysis for EEG signals. J. Sens. 2022, 1–16. doi: 10.1155/2022/8667606

Crossref Full Text | Google Scholar

Tatum, W. O. (2014). Handbook of EEG interpretation, 2nd Edn. New York, NY: Springer, 376.

Google Scholar

Türk, Ö., and Özerdem, M. S. (2019). Epilepsy detection by using scalogram based convolutional neural network from EEG signals. Brain Sci. 9:115. doi: 10.3390/brainsci9050115

PubMed Abstract | Crossref Full Text | Google Scholar

Vapnik, V., and Cortes, C. (1995). Support vector networks. Mach. Learn. 20, 273–297. doi: 10.1007/BF00994018

Crossref Full Text | Google Scholar

Vargas, D. L. D., Oliva, J. T., and Teixeira, M. (2021). “Uma abordagem baseada em redes neurais artificiais sobre o espectro de potência de eletroencefalogramas para o auxílio médico na classificação de crises epiléticas,” in Anais do Simpósio Brasileiro de Computação Aplicada à Saúde (SBCAS). SBC, 141–152

Google Scholar

Wang, R., Wang, H., Shi, L., Han, C., and Che, Y. (2022). Epileptic seizure detection using geometric features extracted from SODP shape of EEG signals and AsyLnCPSO-GA. Entropy 24:1540. doi: 10.3390/e24111540

PubMed Abstract | Crossref Full Text | Google Scholar

Wei, G., Zhao, J., Feng, Y., He, A., and Yu, J. (2020). A novel hybrid feature selection method based on dynamic feature importance. Appl. Soft. Comput. 93:106337. doi: 10.1016/j.asoc.2020.106337

Crossref Full Text | Google Scholar

Welch, P. (1967). The use of fast Fourier transform for the estimation of power spectra: a method based on time averaging over short, modified periodograms. IEEE Trans. Audio Electroacoust. 15, 70–73. doi: 10.1109/TAU.1967.1161901

Crossref Full Text | Google Scholar

Xiong, Y., Dong, F., Wu, D., Jiang, L., Liu, J., and Li, B. (2022). Seizure detection based on improved genetic algorithm optimized multilayer network. IEEE Access 10:8134381354. doi: 10.1109/ACCESS.2022.3196004

Crossref Full Text | Google Scholar

Zarei, A., and Asl, B. M. (2021). Automatic seizure detection using orthogonal matching pursuit, disrete wavelet transform, and entropy based features of EEG signals, Comput. Biol. Med. 131:104250. doi: 10.1016/j.compbiomed.2021.104250

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, Z., and Parhi, K. K. (2016). Low-complexity seizure prediction from iEEG/sEEG using spectral power and ratios of spectral power. IEEE Trans. Biomed.Circuits Syst.10, 693–706. doi: 10.1109/TBCAS.2015.2477264

PubMed Abstract | Crossref Full Text | Google Scholar

Zhong, L., Wan, J., Yi, F., He, S., Wu, J., Huang, Z., et al. (2023).Epileptic prediction using spatiotemporal information combined with optimal features strategy on EEG. Front. Neurosci. 17:4005. doi: 10.3389/fnins.2023.1174005

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: feature selection, feature fusion, discrete wavelet transform, Welch, particle swarm optimization, Pearson correlation analysis

Citation: Kong G, Ma S, Zhao W, Wang H, Fu Q and Wang J (2024) A novel method for optimizing epilepsy detection features through multi-domain feature fusion and selection. Front. Comput. Neurosci. 18:1416838. doi: 10.3389/fncom.2024.1416838

Received: 21 May 2024; Accepted: 28 October 2024;
Published: 19 November 2024.

Edited by:

Yuhua Li, Cardiff University, United Kingdom

Reviewed by:

Dulani Meedeniya, University of Moratuwa, Sri Lanka
Mohammadali Charoosaei, Manchester Metropolitan University, United Kingdom

Copyright © 2024 Kong, Ma, Zhao, Wang, Fu and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shuang Ma, MjIwODU0MDQyMDA3QGx5dS5lZHUuY24=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.