- 1School of Integrated Circuits and Electronics, Beijing Institute of Technology, Beijing, China
- 2Department of Neurology, Xuanwu Hospital, Capital Medical University, Beijing, China
- 3College of Information Science and Technology, Jinan University, Guangzhou, China
- 4Faculty of Science and Technology, University of Macau, Taipa, Macau SAR, China
- 5School of Computer Science and Engineering, South China University of Technology, Guangzhou, China
Epilepsy is a chronic brain disease that causes persistent and severe damage to the physical and mental health of patients. Daily effective prediction of epileptic seizures is crucial for epilepsy patients especially those with refractory epilepsy. At present, a large number of deep learning algorithms such as Convolutional Neural Networks and Recurrent Neural Networks have been used to predict epileptic seizures and have obtained better performance than traditional machine learning methods. However, these methods usually transform the Electroencephalogram (EEG) signal into a Euclidean grid structure. The conversion suffers from loss of adjacent spatial information, which results in deep learning models requiring more storage and computational consumption in the process of information fusion after information extraction. This study proposes a general Graph Convolutional Networks (GCN) model architecture for predicting seizures to solve the problem of oversized seizure prediction models based on exploring the graph structure of EEG signals. As a graph classification task, the network architecture includes graph convolution layers that extract node features with one-hop neighbors, pooling layers that summarize abstract node features; and fully connected layers that implement classification, resulting in superior prediction performance and smaller network size. The experiment shows that the model has an average sensitivity of 96.51%, an average AUC of 0.92, and a model size of 15.5 k on 18 patients in the CHB-MIT scalp EEG dataset. Compared with traditional deep learning methods, which require a large number of parameters and computational effort and are demanding in terms of storage space and energy consumption, this method is more suitable for implementation on compact, low-power wearable devices as a standard process for building a generic low-consumption graph network model on similar biomedical signals. Furthermore, the edge features of graphs can be used to make a preliminary determination of locations and types of discharge, making it more clinically interpretable.
1. Introduction
Epilepsy is a brain disorder caused by abnormal brain activity. There are approximately 50 million people with epilepsy of all ages around the world (Organization, 2019). It is worth noting that these patients are at three times the risk of premature death than the general population, with sudden death in epilepsy, status epilepticus, unintentional injuries, and suicide being the most common causes. The leading and potentially preventable cause of death in people with epilepsy. In addition, people with epilepsy often have cognitive, psychological, and interpersonal impairments. Electroencephalogram (EEG) is widely used in healthcare (Duan et al., 2012) and epilepsy-related departments, especially in primary hospitals, and has become the key research data for epilepsy diagnosis and treatment due to its low cost, ease of use, tolerable movement restrictions, painless and comfortable procedure, and long-term monitoring. Although many advances have been made in epilepsy detection, there are few studies related to epilepsy prediction due to its complexity and difficulty. However, effective daily prediction of seizures is an important support for timely administration of fast-acting drugs, avoiding unintended harm, and relieving tension.
Developments in the field of seizure prediction have been greatly driven by the establishment of seizure EEG databases, the holding of international epilepsy prediction competitions, and the standardization of performance evaluation of algorithms (Kuhlmann et al., 2018). In terms of algorithms, deep learning has become the most popular research method for seizure prediction. However, traditional deep learning models are often used, such as Convolutional Neural Networks (CNN) (Rosas-Romero et al., 2019; Sharan and Berkovsky, 2020; Wang et al., 2020; Li et al., 2021d; Ozdemir et al., 2021; Usman et al., 2021), Recurrent Neural Networks (RNN) (Tsiouris et al., 2018; Li et al., 2021d; Usman et al., 2021), and new deep learning models, such as multi-view CNN (Liu et al., 2019), multi-time scale CNN (Qi et al., 2021), semi-expanded CNN (Hussein et al., 2021), and Transformer (Hussein et al., 2022). Can only process Euclidean grid data, often EEG data or the feature is represented as a real number matrix. However, this representation does not take into account the structural connectivity characteristics of EEG data, so a large number of model parameters requires more storage and computational consumption when automatically extracting features. Concerning this issue, some researchers use pruning (Zhao et al., 2021a), Neural Architecture Search (NAS) (Lee et al., 2020; Dong et al., 2021; Li et al., 2021a; Xue et al., 2021; Yang et al., 2021c; Zhao et al., 2021a; Wang et al., 2022), and other methods to simplify CNNs. Although a simplified model can eventually be obtained, it is still challenging to ensure the performance of the prediction.
Graphs are a way of representing entity relationships and structured data that are ubiquitous in the real world, such as social networks, business networks, biological networks, transportation networks, and knowledge graphs. A graph consists of some nodes and edges connecting these nodes and contains rich potential value by its complex structure. Human brains are massively complex networks of functional and structural domains, and graph theory provides a new perspective on their analysis. Figure 1 (Bronstein et al., 2017) visually shows the difference between Euclidean and non-Euclidean domains (manifolds and graphs). Specifically, the flatness of Euclidean spaces means that certain operations require many dimensions and complex computations to perform, whereas non-Euclidean spaces may perform these operations more flexibly with fewer dimensions. The graph structure breaks the uniform distribution of the commonly used Euclidean grid structure and can better represent the structural connections and functional realization of brains (Wein et al., 2021). It has been used in the diagnosis of autism spectrum disorders (Yang et al., 2021a), fMRI analysis (Li et al., 2019b, 2021b), brain network analysis (Wu et al., 2021; Royer et al., 2022), brain-computer interface decoding (Feng et al., 2021; Che et al., 2022), emotion classification (Liu et al., 2022), and epilepsy detection (Zhao et al., 2021b). Studies of graph networks for seizure prediction are rare and often focus on complex variants of graph networks while ignoring the model's scale (Lian et al., 2020) or building models that are too large (Dissanayake et al., 2021b; Li et al., 2021c). Lian et al. (2020) proposed a global-local graph convolutional network (Kipf and Welling, 2016) for robust seizure prediction. Compared with the sEEG signal, the processed iEEG signal has more channels, higher sampling frequency, and less artifact interference. While providing richer and cleaner information for seizure prediction, the premise of surgically implanting a signal collector also limits the application scope of the algorithm and ignores the need for a reduced model size in practical application scenarios. Furthermore, they believed that it is almost impossible to build a proper general prior map to predict seizures, so they proposed to use deep learning to build the prior graph. Li et al. (2021c) proposed a spatiotemporal spectral hierarchical graph convolutional network with semi-supervised active learning for seizure prediction. They used sEEG and also fully utilized deep learning to synthesize prior graphs, for which there was a problem of building a model that was too large. A GCN seizure predictor using sEEG was also proposed by Dissanayake et al. (2021b). They use deep learning to further synthesize the prior graph based on the physical connection derived graph, and the problem of the large model size still exists. However, mobile portable seizure predictors loaded with reduced-scale models are particularly important for patients with refractory epilepsy who are inoperable or ineffective for surgical resection.
Figure 1. Example of the four Laplacian eigenfunctions ϕ0, ..., ϕ3 on a Euclidean domain (1D line, top left) and non-Euclidean domains (human shape modeled as a 2D manifold, top right; and Minnesota road graph, bottom). In the Euclidean case, the result is the standard Fourier basis comprising sinusoids of increasing frequency. The citation requested by third-party rights holders is as follows: ©[2017] IEEE.
In this article, each sample is modeled as a graph, with nodes representing channels and undirected edges representing spatial and functional connections. Use simple and general spatial and functional edge features to build prior graphs, and then use GCN architecture to extract and aggregate features to predict epileptic seizures. While ensuring excellent performance, our method reduces the size of the model so that it can better meet the needs of practical applications of wearable devices. Therefore, it can be used as a standard process to build a generic low-consumption graph network of biomedical signals. In clinical practice, the graph feature we use can serve as an aid in the initial determination of discharge location and type, thus increasing medical interpretability.
2. Methodology
2.1. Data preprocessing
Currently, The CHB-MIT Scalp EEG Database is one of the few free public datasets of continuous long-term seizures. The dataset marked the onset and end of seizures, and these marks can be used to define the “ictal” for seizure detection. But for the seizure prediction task, we need to define and distinguish between preictal and interictal periods. Due to the small number of subjects, we did not use the EEG from the American Epilepsy Society Seizure Prediction Challenge, but we adopted its rules for dividing pre-ictal and inter-ictal periods because of its relative openness and authority. As shown in Figure 2, 1 h before the seizure is selected as pre-ictal data. It needs to be stressed that pre-ictal data is shifted forward by 5 min as a whole, which allows sufficient predictive time for the patient and avoids the effects of small errors in the onset of a seizure. Moreover, inter-ictal data is limited to 4 h before or after a seizure.
A total of 18 bipolar montages are selected to reduce the effect of noise: Fp1-F7, F7-T7, T7-P7, P7-O1, Fp1-F3, F3-C3, C3-P3, P3- O1, Fp2-F4, F4-C4, C4-P4, P4-O2, Fp2-F8, F8-T8, T8-P8, P8-O2, Fz-Cz, and Cz-Pz. Then, a 1Hz high-pass filter is used to remove low frequency noise, and a 50 Hz and 100 Hz notch filter are used to remove operating frequency noise. Min-Max Normalization is then used to normalize signals to a range of values between [0, 1] or [−1, 1]. Finally, signals are sliced by a fixed length of time (60 s) without overlapping slides.
2.2. Feature extraction
Because the graph consists of some nodes and edges that connect these nodes, extracted features can be classified as node features and edge features. The Band Energy, Hjorth parameters, Higher Order Crossings, 1-order difference, 2-order difference, Differential Entropy, and Fractal Dimension are features of nodes, and the geodesic distance and spectral correlation are features of edges.
The Band Energy reflects the level of activity of the cerebral cortex. We calculate the energy of six frequency bands with wavelet packet decomposition. Six frequency bands are Delta (1–4 Hz), Theta (4–7.5 Hz), Alpha (7.5–13 Hz), Lower Beta (13–16 Hz), Higher Beta (16–30 Hz), and Gamma (30–40 Hz).
The Hjorth parameters (Hjorth, 1970) reflect shape features in the time domain, in which activity measures the deviation of the signal amplitude, mobility measures the change in signal slope, and complexity measures the number of standard slopes on an amplitude of signals.
where μs denotes the mean value of signal s(n), s′(n) denotes the first derivative of signal s(n), and var denotes the variance of signal s(n).
Higher order crossings (HOC) use the number of times the signal passes through the zero point to reflect the degree of signal oscillation (Petrantonakis and Hadjileontiadis, 2009).
1-order difference is the difference between consecutive adjacent two items in the discrete function, which reflects the dynamic relationship between two adjacent frames. 2-order difference is the difference between the 1-order differences of consecutive adjacent two items in the discrete function, which reflects the dynamic relationship between the three adjacent frames.
Differential entropy (DE) is a generalized form of Shannon information entropy on continuous variables.
Fractal dimension (FD) can be used to represent the complexity of time domain signals, we have chosen Higuchi Fractal Dimension (Higuchi, 1988), Katz Fractal Dimension (Esteller et al., 2001), and Petrosian fractal dimension (Petrosian, 1995).
Geodesic distances reflect the spatial relationship between scalp electrodes by calculating distances in a spherical coordinate system.
where r is the sphere radius, (xi, yi, zi) and (xj, yj, zj) are two points on the surface of the sphere.
Spectral correlation reflects the energy correlation between scalp electrodes.
where Sij is the cross-spectral density of lead i and lead j, Sii and Sjj are the power spectral density of lead i and lead j.
2.3. Graph convolutional networks
Given a graph , where is a set with n nodes and is a set of edges. Extract the band powers and Hjorth parameters as node features. The adjacent matrix A∈ℝn×n represents the connection relationship between nodes, which can be used as the edge features. Compute the geodesic distance and spectral correlation to represent the adjacency matrix.
The Graph Fourier Transformation (GFT) is an extension of the discrete Fourier transform to graph, transforming graphs from spatial domain to spectral domain. Signals at all nodes of the graph are ϕ:V → ℝn where ϕi is the value at node vi. For a graph ϕ, the GFT and the inverse GFT are computed as:
where U is an orthogonalized eigenvector matrix of Laplacian Matrix L of an undirected graph G.
Convolutions on graphs are an extension of convolution, transforming graphs from spatial domain to spectral domain, and then transforming from spectral-domain back to the spatial domain after performing the convolution operation in the spectral domain. The calculation can be divided into three steps (Ma, 2021):
1. GFT:
2. Convolutions:
3. Inverse Graph Fourier Transform (IGFT):
where θ is the corresponding spectral domain convolution kernel obtained by the spatial domain filter g through the Fourier transform of graphs, and gθ is a diagonal matrix whose diagonal elements are θ.
Graph Convolutional Networks (GCN) (Kipf and Welling, 2016) is a simplified network based on ChebyNet (Hammond et al., 2011) proposed by Thomas Kipf. To circumvent the problem of local structure overfitting of ChebyNet on graphs with large node degree distribution, GCN limits the layer-wise convolutional operation to 1:
where Λ is the eigenvalue matrix of Laplacian matrix L, is the Chebyshev polynomial of , and Θ∈ℝc×d is a convolution kernel parameters matrix. To further simplify gθ, approximate maximum eigenvalue λmax≈2:
Then, the filtered signal is:
where Lsym is the Symmetric Normalized Laplacian Matrix, D is the Degree Matrix of graphs, A is the adjacency matrix of graphs, X∈ℝn×c is the initial attribute matrix of all (n) nodes, each node has c-dimensional attributes, and Y∈ℝn×d is the output of graph convolution. Let Θ = Θ0 = −Θ1:
In order to solve the numerical instability, gradient explosion, and dispersion problems that may be caused by multiple iterations, use the renormalization trick: , then:
Consequently, the layer-wise propagation rule is:
where Hl is the matrix of activations in the lth layer, and Wl is a layer-specific weight matrix. σ(·) is an activation function.
3. Experiments
3.1. Datasets
The CHB-MIT Scalp EEG Database (Shoeb, 2009) is a collection of EEG recordings from 23 pediatric subjects with intractable seizures. We finally select 18 available subjects because of the rare cluster seizures and poor time continuity. Moreover, all signals are sampled at 256 Hz. Files were recorded using the international 10–20 system of EEG electrode positions, most of which have 23 channels.
3.2. Architecture for EEG seizure prediction
As shown in Figure 3, we build a GCN architecture for epilepsy prediction, which has seven layers: three GCN convolution layers, one global average pooling layer, and three fully connected (FC) layers. The number of channels in GCNConv is 32, 64, and 128, and the number of neurons in FC layers is 128, 32, and 16. At the top of each GCN convolutional layer, Batch Normalization (BN), and Leaky ReLU activation functions are used to refine node features. Then, the global average pooling layer summarizes the node features of each channel to generate high-level features of the entire graph information. After processing in FC layers, the label probability and class prediction are output through the Sigmoid activation function.
3.3. Model training
To obtain more reliable results, we used the Leave One Out Cross Validation (LOOCV) method for each subject. During training, we choose the Adam optimizer whose learning rate is initially set to 0.01 and decreases by 10% after each epoch. We also use the Synthetic Minority Oversampling Technique (SMOTE) (Chawla et al., 2002) and Gradient Harmonizing Mechanism (GHM) (Li et al., 2019a) to ease the numerical imbalance between the two types of samples. We processed each subject's EEG sample by SMOTE and GHM. The ratios of inter-ictal and pre-ictal samples of 18 subjects are as follows: 2:1, 6:1, 17:1, 2:1, 7:1, 15:1, 9:1, 3:1, 6:1, 3:1, 1:1, 2:1, 10:1, 7:1, 2:1, 4:1, 3:1, 1:1, and 6:1. After SMOTE and GHM, all subjects had balanced ratios of 1:1. In addition, we use a 50% probability of randomly discarding neurons in the fully connected layer to avoid overfitting.
3.4. Performance metrics
Seizure prediction performance was measured by the mean and SD of sensitivity and AUC on the test set. The sensitivity is defined as follows:
where the True Positive (TP) and False Negative (FN) can be calculated from the confusion matrix. Furthermore, the Receiver Operating Characteristic Curve (ROC Curve) is obtained by plotting TPR as the horizontal coordinate and FPR as the vertical coordinate at different probability thresholds. The ROC curve considers both sensitivity and specificity and can remain almost constant under different sample distributions. The Area Under ROC Curve (AUC) can be used to quantitatively describe the ROC curve, and classifiers with a higher AUC score perform better.
4. Results
4.1. Feature selection results
To control the computational cost of the model, we combine the node features in pairs. Table 1 shows the results of the GCN model for feature combinations in the CHB-MIT dataset. Combination Band Energy and Hjorth and Combination Hjorth and FD not only have a sensitivity of over 90% but also have an AUC of over 0.9. Considering the physical significance of sensitivity in seizure prediction, we finally selected the combination Band Energy and Hjorth with better sensitivity as the node feature.
4.2. GCN results
Table 2 shows the results of the GCN model for Band Energy and Hjorth in the CHB-MIT dataset. The mean sensitivity and AUC for all subjects are 96.51 and 0.92%, and their SD of them are 1.70 and 0.01. The mean sensitivity and AUC of 10 subjects (sub01, sub10, sub11, sub13, sub14, sub16, sub18, sub19, sub20, and sub21) exceed 95%. The mean sensitivity of one subject (sub05) is below 0.9. The mean AUC of six subjects (sub04, sub05, sub06, sub07, sub09, and sub23) are all below 0.9.
Figure 4A shows the seizure performance of different patients individually and Figure 4B shows the overall prediction performance of all the 18 patients.
Figure 4. Performance of the GCN model for seizure prediction. (A) ROC curves of each subject, (B) ROC curves of all subjects.
4.3. Comparison results
Table 3 shows the comparison between our method and other methods, all of which are evaluated on the CHB-MIT Scalp EEG dataset. It is difficult to directly judge which method is best because of the differences in the patients selected, problem definition, evaluation methods, and performance metrics among the methods. Among all methods, our method uses the longest fixed pre-ictal time (60 min) and a medium number of patients. With these premises, it obtains the second highest sensitivity and the fifth high AUC, which is on par with the performance of similar state-of-the-art methods. Also promising is the achievement of the smallest model size, which can effectively reduce storage space and energy consumption, thus meeting the practical needs of wearable devices better.
5. Discussion
5.1. Performance degradation
It is worth noting that the goal of predictive models is to have the ability to apply to new samples, namely generalization ability. The training set is only a partial sample of the sample space. If the sampled training set cannot well reflect the characteristics of the sample space, it is difficult to expect that the model learned from the training set can perform satisfactorily on the unknown test set. In practice, it is generally assumed that all samples are independently and identically distributed. However, EEG is categorized as non-stationary data, and seizure prediction methods suffer from the data drift/covariate drift problem, where the data distribution and characteristics change over time. It can be observed in Figure 4A that there are significant differences in the AUC curves of sub06, and the AUC curves of sub14 and sub20 are completely consistent. Correspondingly, in Table 2, it can be observed that the SD of the prediction results of sub06 is the largest among all patients, and that of sub14 and sub20 is the smallest. By comparing Figures 5–7, it can be found that the distribution of the training set and test set of sub06 is significantly different, and both sub14 and sub20 are identically distributed. This is why the same model architecture has significantly different results between different patients.
Furthermore, within the perspective of raw data, the ratio of sample sizes in the two categories is approximately 17:1, 15:1, and 9:1 for sub04, sub07, and sub09 among 6 subjects with unsatisfactory performance. Besides, most seizures recorded in sub05, sub07, sub09, and sub23 are so concentrated that the inter-ictal data between seizures is missing. In addition, Sub06 recordings have many discontinuities in time.
5.2. Performance comparison
Li et al. (2021c) used GCN and LOOCV, and was the closest one in terms of our method. The AUC and longest up-front duration of this method are slightly better than our method, which means that possible subsequent seizures are predicted at an earlier time without increasing too many false alarms. But this also makes it slightly less sensitive than our method, which may miss early warnings for some seizures. Furthermore, to enhance the prediction performance, the method uses a complex deep learning sub-network to synthesize the prior map, which results in a much larger model size than ours, limiting its portable use in everyday life.
5.3. Seizure focus
Inter-ictal epileptiform discharges and focal seizures are characterized by a large number of neurons disrupting normal neuronal activity with synchronous discharges, and scalp inter-ictal EEGs are characterized by a large number of neurons collectively superimposed on synchronous oscillations of postsynaptic potentials (Kibler and Durand, 2011). Thus, the electrodes around the epileptogenic zone can record a similar increase in energy, and the corresponding EEG signal is similarly changed at the sites that are reached through the neural network. The strong correlation of functional features in edge features of the graph network can suggest that electrodes correspond to abnormal discharge sites in the patient's inter-ictal period. For temporal lobe epilepsy and frontal lobe epilepsy, which are the most common types of epilepsy, most cases of temporal lobe epilepsy can be roughly located according to EEG interpretation (Blume et al., 2001). In these patients, the strength of correlation can assist clinicians to find the seizure focus, as a supplement to manual reading EEG which mainly depended on the difference in amplitude, frequency, and time sequence of the interictal discharges. Besides, the underlying pathological mechanisms also can be explored by analyzing the characteristics of different subtypes of epilepsy, applying to the seizure origin and the evolution of the typical partial seizures in patients, since the similarity of energy changes was analyzed at the level of total channels not at the level of EEG indicators.
It is important to note that in patients with frontal lobe epilepsy, there are some limitations to scalp EEG. First, inter-ictal discharges in frontal lobe epilepsy are usually widely distributed involving multiple electrodes, and scalp EEG is difficult to detect in patients with deep epileptogenic zones (40% of patients with frontal lobe epilepsy do not record discharges) (Quesney, 1991; Salanova et al., 1993; Bautista et al., 1998), even in patients with medial frontal lobe epilepsy who have focal abnormal discharges at electrodes in the central zone (Blume, 1996).
For example, in the inter-ictal EEG of sub01, based on normal background activity, there are focal paroxysmal abnormalities (spikes and slow waves) issued on both sides of the prefrontal area in the inter-ictal EEG, and intermittent midline rhythms may be common. As shown in Figure 3, there is a strong spectral correlation in the bilateral forehead leads. Additionally, because of the same multifocal issuance in frontal lobe epilepsy, weaker spectral correlations exist in the other leads as well. In the EEG of sub09, unilateral or bilateral synchronous or asynchronous temporal lobe spikes and/or slow waves with conduction to the prefrontal region were observed in the inter-ictal EEG, as well as the existence of discontinuous rhythmic delta wave activity in the occipital region, suggesting that this patient may have temporal as well as occipital lobe epilepsy.
Compared with other CNNs, which can only clinically interpret the amplitude of features, GCN can additionally interpret the connection relationship between features because it retains rich edge features, thus expanding its clinical application. A Chord diagram of functional edge features is introduced in this article to visualize the Spectral correlation. Inspired by the revised Circular Graph (RCG) (Zhao et al., 2019), the visualization draws a circular plot where the montage names are placed on the circumference of the circle. The Spectral correlation values between two montages are drawn as lines connecting the two montages, with their width representing the strength of the energy correlation between two montages.
The strong spectral correlation results in Figure 8 could indicate similar energy activity in the bilateral occipital leads, which coincides with occipital lobe localization. In contrast, the discharge in the temporal lobe is not shown in Figure 8, probably due to the more restricted discharge in this area and the large difference in energy recorded by the surrounding electrodes.
6. Conclusion
The usual conversion of EEG data to a common Euclidean grid structure for processing leads to the loss of adjacent information. To process graph-structured data with richer structural information and to simplify the model size, we propose a new smaller and computationally simple GCN-based architecture for seizure prediction. The proposed method uses only models with a scale of 15.5 k, which matches the state-of-the-art performance of the CHB-MIT scalp EEG dataset. The results show that our method can be regarded as a standard procedure to build a general low-power graph network model for processing similar biomedical signals, which is easier to meet the requirements of EEG-based low-power wearable devices. In addition, graph network functional edge features have different sensitivities for different patients, different discharge locations, and types. This means that in the actual epileptic seizure prediction, we can not only obtain the final prediction result but also make preliminary speculation and judgment on the discharge location and type of the patient to enhance its medical interpretability.
Data availability statement
Publicly available datasets were analyzed in this study. This data can be found here: PhysioNet, https://physionet.org/, doi: 10.13026/C2K01R.
Ethics statement
Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent from the patients/participants or patients/participants' legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.
Author contributions
MJ presents the project idea and completes the modeling, experiments, and writing of this manuscript. WL analyses the results from a clinical medical perspective, while QW and ZZ contribute to the design of the study and the editing of this manuscript. JD, LC, and CC contribute to the editing of this manuscript. All authors read and approved the final manuscript.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Bautista, R. E. D., Spencer, D. D., and Spencer, S. S. (1998). Eeg findings in frontal lobe epilepsies. Neurology 50, 1765–1771. doi: 10.1212/WNL.50.6.1765
Blume, W. T.. (1996). Noninvasive electroencephalography in supplementary sensorimotor area epilepsy. Adv. Neurol. 70, 309–317.
Blume, W. T., Holloway, G. M., and Wiebe, S. (2001). Temporal epileptogenesis: localizing value of scalp and subdural interictal and ictal eeg data. Epilepsia 42, 508–514. doi: 10.1046/j.1528-1157.2001.02700.x
Bronstein, M. M., Bruna, J., LeCun, Y., Szlam, A., and Vandergheynst, P. (2017). Geometric deep learning: going beyond euclidean data. IEEE Signal Process Mag. 34, 18–42. doi: 10.1109/MSP.2017.2693418
Chawla, N. V., Bowyer, K. W., Hall, L. O., and Kegelmeyer, W. P. (2002). Smote: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357. doi: 10.1613/jair.953
Che, X., Zheng, Y., Chen, X., Song, S., and Li, S. (2022). Decoding color visual working memory from eeg signals using graph convolutional neural networks. Int. J. Neural Syst. 32, 2250003. doi: 10.1142/S0129065722500034
Daoud, H., and Bayoumi, M. A. (2019). Efficient epileptic seizure prediction based on deep learning. IEEE Trans. Biomed. Circ. Syst. 13, 804–813. doi: 10.1109/TBCAS.2019.2929053
Dissanayake, T., Fernando, T., Denman, S., Sridharan, S., and Fookes, C. (2021a). Deep learning for patient-independent epileptic seizure prediction using scalp eeg signals. IEEE Sens. J. 21, 9377–9388. doi: 10.1109/JSEN.2021.3057076
Dissanayake, T., Fernando, T., Denman, S., Sridharan, S., and Fookes, C. (2021b). Geometric deep learning for subject-independent epileptic seizure prediction using scalp eeg signals. IEEE J. Biomed. Health Inform. 26, 527–538.
Dong, H., Chen, D., Zhang, L., Ke, H., and Li, X. (2021). Subject sensitive eeg discrimination with fast reconstructable cnn driven by reinforcement learning: a case study of asd evaluation. Neurocomputing 449, 136–145. doi: 10.1016/j.neucom.2021.04.009
Duan, J., Chen, C., Pun, S. H., Wan, F., Mak, P. U., Hu, Y., et al. (2012). “A wearable wireless general purpose bio-signal acquisition prototype system for home healthcare,” in 2012 International Conference on Biomedical Engineering and Biotechnology (Macau: IEEE), 1176–1179.
Esteller, R., Vachtsevanos, G., Echauz, J., and Litt, B. (2001). A comparison of waveform fractal dimension algorithms. IEEE Trans. Circ. Syst. I Fundament. Theory Appl. 48, 177–183. doi: 10.1109/81.904882
Feng, N., Hu, F., Wang, H., and Zhou, B. (2021). Motor intention decoding from the upper limb by graph convolutional network based on functional connectivity. Int. J. Neural Syst. 31, 2150047. doi: 10.1142/S0129065721500477
Hammond, D. K., Vandergheynst, P., and Gribonval, R. (2011). Wavelets on graphs via spectral graph theory. Appl. Comput. Harmon. Anal. 30, 129–150. doi: 10.1016/j.acha.2010.04.005
Higuchi, T.. (1988). Approach to an irregular time series on the basis of the fractal theory. Physica D 31, 277–283. doi: 10.1016/0167-2789(88)90081-4
Hjorth, B.. (1970). Eeg analysis based on time domain properties. Electroencephalogr. Clin. Neurophysiol. 29, 306–310. doi: 10.1016/0013-4694(70)90143-4
Hussein, R., Lee, S., and Ward, R. (2022). Multi-channel vision transformer for epileptic seizure prediction. Biomedicines 10, 1551. doi: 10.3390/biomedicines10071551
Hussein, R., Lee, S., Ward, R., and McKeown, M. J. (2021). Semi-dilated convolutional neural networks for epileptic seizure prediction. Neural Networks 139, 212–222. doi: 10.1016/j.neunet.2021.03.008
Kibler, A. B., and Durand, D. M. (2011). Orthogonal wave propagation of epileptiform activity in the planar mouse hippocampus in vitro. Epilepsia 52, 1590–1600. doi: 10.1111/j.1528-1167.2011.03125.x
Kipf, T. N., and Welling, M. (2016). Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907. doi: 10.48550/arXiv.1609.02907
Kuhlmann, L., Lehnertz, K., Richardson, M. P., Schelter, B., and Zaveri, H. P. (2018). Seizure prediction–ready for a new era. Nat. Rev. Neurol. 14, 618–630. doi: 10.1038/s41582-018-0055-2
Lee, S.-H., Chen, T.-Y., Hsien, Y.-T., and Cao, L.-R. (2020). “A music recommendation system for depression therapy based on EEG,” in 2020 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-Taiwan) (Taoyuan: IEEE), 1–2.
Li, B., Liu, Y., and Wang, X. (2019a). Gradient harmonized single-stage detector. Proc. AAAI Conf. Artif. Intell. 33, 8577–8584. doi: 10.1609/aaai.v33i01.33018577
Li, C., Zhang, Z., Song, R., Cheng, J., Liu, Y., and Chen, X. (2021a). Eeg-based emotion recognition via neural architecture search. IEEE Trans. Affect. Comput. doi: 10.1109/TAFFC.2021.3130387
Li, X., Dvornek, N., Zhou, Y., Zhuang, J., Ventola, P., and Duncan, J. (2019b). “Graph neural network for interpreting task-fmri biomarkers,” in Medical Image Computing and Computer-assisted Intervention: MICCAI. International Conference on Medical Image Computing and Computer-assisted Intervention, Vol. 11768 (Shenzhen), 485–493.
Li, X., Zhou, Y., Dvornek, N., Zhang, M., Gao, S., Zhuang, J., et al. (2021b). Braingnn: Interpretable brain graph neural network for fmri analysis. Med. Image Anal. 74, 102233. doi: 10.1016/j.media.2021.102233
Li, Y., Liu, Y., Guo, Y.-Z., Liao, X.-F., Hu, B., and Yu, T. (2021c). Spatio-temporal-spectral hierarchical graph convolutional network with semisupervised active learning for patient-specific seizure prediction. IEEE Trans. Cybern. doi: 10.1109/TCYB.2021.3071860
Li, Z., Fields, M., Panov, F., Ghatan, S., Yener, B., and Marcuse, L. (2021d). Deep learning of simultaneous intracranial and scalp eeg for prediction, detection, and lateralization of mesial temporal lobe seizures. Front. Neurol. 12, 705119. doi: 10.3389/fneur.2021.705119
Lian, Q., Qi, Y., Pan, G., and Wang, Y. (2020). Learning graph in graph convolutional neural networks for robust seizure prediction. J. Neural Eng. 17, 035004. doi: 10.1088/1741-2552/ab909d
Liu, C.-L., Xiao, B., Hsaio, W.-H., and Tseng, V. S. (2019). Epileptic seizure prediction with multi-view convolutional neural networks. IEEE Access 7, 170352–170361. doi: 10.1109/ACCESS.2019.2955285
Liu, H., Zhang, J., Liu, Q., and Cao, J. (2022). Minimum spanning tree based graph neural network for emotion classification using eeg. Neural Networks 145, 308–318. doi: 10.1016/j.neunet.2021.10.023
Ma, T.. (2021). “Graph neural networks,” in Graph Neural Networks: Foundations and Frontiers (Beijing: Publishing House of Electronics Industry), 21–22.
Organization W. H.. (2019). “Epilepsy: a public health imperative,” in Epilepsy: A Public Health Imperative.
Ozdemir, M. A., Cura, O. K., and Akan, A. (2021). Epileptic eeg classification by using time-frequency images for deep learning. Int. J. Neural Syst. 31, 2150026. doi: 10.1142/S012906572150026X
Petrantonakis, P. C., and Hadjileontiadis, L. J. (2009). Emotion recognition from eeg using higher order crossings. IEEE Trans. Inf. Technol. Biomed. 14, 186–197. doi: 10.1109/TITB.2009.2034649
Petrosian, A.. (1995). “Kolmogorov complexity of finite sequences and recognition of different preictal EEG patterns,” in Proceedings Eighth IEEE Symposium on Computer-Based Medical Systems (Lubbock, TX: IEEE), 212–217.
Qi, Y., Ding, L., Wang, Y., and Pan, G. (2021). Learning robust features from nonstationary brain signals by multi-scale domain adaptation networks for seizure prediction. IEEE Trans. Cogn. Dev. Syst. doi: 10.1109/TCDS.2021.3100270
Quesney, L.. (1991). Preoperative electroencephalographic investigation in frontal lobe epilepsy: electroencephalographic and electrocorticographic recordings. Can. J. Neurol. Sci. 18, 559–563. doi: 10.1017/S0317167100032698
Rosas-Romero, R., Guevara, E., Peng, K., Nguyen, D. K., Lesage, F., Pouliot, P., et al. (2019). Prediction of epileptic seizures with convolutional neural networks and functional near-infrared spectroscopy signals. Comput. Biol. Med. 111, 103355. doi: 10.1016/j.compbiomed.2019.103355
Royer, J., Bernhardt, B. C., Larivière, S., Gleichgerrcht, E., Vorderwülbecke, B. J., Vulliémoz, S., et al. (2022). Epilepsy and brain network hubs. Epilepsia 63, 537–550. doi: 10.1111/epi.17171
Salanova, V., Morris, H. H., Van Ness, P. C., Lüders, H., Dinner, D., and Wyllie, E. (1993). Comparison of scalp electroencephalogram with subdural electrocorticogram recordings and functional mapping in frontal lobe epilepsy. Arch. Neurol. 50, 294–299. doi: 10.1001/archneur.1993.00540030058015
Sharan, R. V., and Berkovsky, S. (2020). “Epileptic seizure detection using multi-channel EEG wavelet power spectra and 1-d convolutional neural networks,” in 2020 42nd Annual International Conference of the IEEE Engineering in Medicine &Biology Society (EMBC) (Montreal, QC: IEEE), 545–548.
Shoeb, A. H.. (2009). Application of machine learning to epileptic seizure onset detection and treatment (Ph.D. thesis). Massachusetts Institute of Technology.
Tsiouris, K. M., Pezoulas, V. C., Zervakis, M., Konitsiotis, S., Koutsouris, D. D., and Fotiadis, D. I. (2018). A long short-term memory deep learning network for the prediction of epileptic seizures using eeg signals. Comput. Biol. Med. 99, 24–37. doi: 10.1016/j.compbiomed.2018.05.019
Usman, S. M., Khalid, S., and Bashir, S. (2021). A deep learning based ensemble learning method for epileptic seizure prediction. Comput. Biol. Med. 136, 104710. doi: 10.1016/j.compbiomed.2021.104710
Wang, G., Wang, D., Du, C., Li, K., Zhang, J., Liu, Z., et al. (2020). Seizure prediction using directed transfer function and convolution neural network on intracranial EEG. IEEE Trans. Neural Syst. Rehabil. Eng. 28, 2711–2720. doi: 10.1109/TNSRE.2020.3035836
Wang, H., Zhu, X., Chen, P., Yang, Y., Ma, C., and Gao, Z. (2022). A gradient-based automatic optimization cnn framework for eeg state recognition. J. Neural Eng. 19, 016009. doi: 10.1088/1741-2552/ac41ac
Wein, S., Malloni, W. M., Tomé, A. M., Frank, S. M., Henze, G.-I., Wüst, S., et al. (2021). A graph neural network framework for causal inference in brain networks. Sci. Rep. 11, 1–18. doi: 10.1038/s41598-021-87411-8
Wu, D., Li, X., and Feng, J. (2021). Connectome-based individual prediction of cognitive behaviors via graph propagation network reveals directed brain network topology. J Neural Eng. 18, 0460a0463. doi: 10.1088/1741-2552/ac0f4d
Xue, Y., Jiang, P., Neri, F., and Liang, J. (2021). A multi-objective evolutionary approach based on graph-in-graph for neural architecture search of convolutional neural networks. Int. J. Neural Syst. 31, 2150035. doi: 10.1142/S0129065721500350
Yang, C., Wang, P., Tan, J., Liu, Q., and Li, X. (2021a). Autism spectrum disorder diagnosis using graph attention network based on spatial-constrained sparse functional brain networks. Comput. Biol. Med. 139, 104963. doi: 10.1016/j.compbiomed.2021.104963
Yang, X., Zhao, J., Sun, Q., Lu, J., and Ma, X. (2021b). An effective dual self-attention residual network for seizure prediction. IEEE Trans. Neural Syst. Rehabil. Eng. 29, 1604–1613. doi: 10.1109/TNSRE.2021.3103210
Yang, Y., Gao, Z., Li, Y., and Wang, H. (2021c). A cnn identified by reinforcement learning-based optimization framework for EEG-based state evaluation. J. Neural Eng. 18, 046059. doi: 10.1088/1741-2552/abfa71
Zhang, Y., Guo, Y., Yang, P., Chen, W., and Lo, B. (2019). Epilepsy seizure prediction on eeg using common spatial pattern and convolutional neural network. IEEE J. Biomed. Health Inform. 24, 465–474. doi: 10.1109/JBHI.2019.2933046
Zhao, S., Yang, J., and Sawan, M. (2021a). Energy-efficient neural network for epileptic seizure prediction. IEEE Trans. Biomed. Eng. 69, 401–411. doi: 10.1109/TBME.2021.3095848
Zhao, Y., Dong, C., Zhang, G., Wang, Y., Chen, X., Jia, W., et al. (2021b). Eeg-based seizure detection using linear graph convolution network with focal loss. Comput. Methods Programs Biomed. 208, 106277. doi: 10.1016/j.cmpb.2021.106277
Keywords: seizure prediction, EEG, GCN, geometric deep learning, wearable devices
Citation: Jia M, Liu W, Duan J, Chen L, Chen CLP, Wang Q and Zhou Z (2022) Efficient graph convolutional networks for seizure prediction using scalp EEG. Front. Neurosci. 16:967116. doi: 10.3389/fnins.2022.967116
Received: 17 July 2022; Accepted: 08 July 2022;
Published: 01 August 2022.
Edited by:
Adeel Razi, Monash University, AustraliaReviewed by:
Ramy Hussein, Stanford University, United StatesDuong Nhu, Monash University, Australia
Copyright © 2022 Jia, Liu, Duan, Chen, Chen, Wang and Zhou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Qun Wang, wq99102@bit.edu.cn; Zhiguo Zhou, zhiguozhou@bit.edu.cn