- 1School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ, United States
- 2Department of Neurology, Division of Child Neurology, University of North Carolina, Chapel Hill, NC, United States
We evaluated whether integration of expert guidance on seizure onset zone (SOZ) identification from resting state functional MRI (rs-fMRI) connectomics combined with deep learning (DL) techniques enhances the SOZ delineation in patients with refractory epilepsy (RE), compared to utilizing DL alone. Rs-fMRI was collected from 52 children with RE who had subsequently undergone ic-EEG and then, if indicated, surgery for seizure control (n = 25). The resting state functional connectomics data were previously independently classified by two expert epileptologists, as indicative of measurement noise, typical resting state network connectivity, or SOZ. An expert knowledge integrated deep network was trained on functional connectomics data to identify SOZ. Expert knowledge integrated with DL showed a SOZ localization accuracy of 84.8 ± 4.5% and F1 score, harmonic mean of positive predictive value and sensitivity, of 91.7 ± 2.6%. Conversely, a DL only model yielded an accuracy of <50% (F1 score 63%). Activations that initiate in gray matter, extend through white matter, and end in vascular regions are seen as the most discriminative expert-identified SOZ characteristics. Integration of expert knowledge of functional connectomics can not only enhance the performance of DL in localizing SOZ in RE but also lead toward potentially useful explanations of prevalent co-activation patterns in SOZ. RE with surgical outcomes and preoperative rs-fMRI studies can yield expert knowledge most salient for SOZ identification.
1 Introduction
The World Health Organization estimates that ~50 million individuals worldwide are affected by epilepsy (1). Within this population, medically refractory epilepsy (RE) constitutes about 30%, where patients have not achieved seizure control for at least 12 months despite adequate trials of two tolerated and appropriately chosen anti-epileptic medications. RE significantly impacts the quality of life of those affected. The most successful approach for addressing RE involves surgical ablation, resection, or disconnection of brain regions associated with seizure genesis, seizure onset zone (SOZ) (2–4). Recent research emphasizes the importance of early diagnosis and surgical intervention to mitigate developmental complications and reduce the risk of sudden deaths (5). Despite advances in surgical interventions, for focal onset RE such as mesial temporal lobe epilepsy (TLE), a large number of patients (30–40%) still suffer from continued debilitating seizures (6) and post-surgery developmental impairments (7, 8). Achieving a seizure-free surgical outcome is contingent upon accurately localizing the SOZ (9).
The gold-standard technique for localization of SOZ uses invasive intracranial electroencephalography (ic-EEG), which requires implantation of depth electrodes (10). However, concordance between ic-EEG spike density and the SOZ is observed in only 56% of patients (11), due to sub-optimal lead implantation. As such, ic-EEG lead placement must be guided by determining the expected SOZ using non-invasive brain imaging modalities (12, 13). One non-invasive method, resting state functional magnetic resonance imaging (rs-fMRI), uses blood oxygen level-dependent (BOLD) correlations measured at rest to map functional connectomics (14). It is an effective measure of the plasticity of large-scale networks induced by repeated and synchronized co-activation of brain regions (15) caused by debilitating seizures. Independent component analysis (ICA) of rs-fMRI has been shown to have 90% agreement with ic-EEG determined SOZs (6, 11), and the usage of rs-fMRI-guided ic-EEG to locate and surgically alter the SOZ has shown significant improvement in seizure-free surgical outcomes for RE pediatric patients without any increase in developmental risks (6, 16).
One of the challenges in relating ICA's resulting independent components (ICs) of rs-fMRI to SOZ is that abnormal BOLD correlations are variable across individuals, with activation foci varying across temporal, parietal, frontal lobes, hippocampus, and cortex (17). Furthermore, over 50% of the data comprises noise ICs, with the remaining 40–45% attributed to resting state network (RSN), leaving only a small percentage, around 5–10%, associated with SOZ ICs. Currently, a set of rules identified by a consortium of experts is meticulously applied with expert analysis of hundreds of ICs, rendering the pre-surgical screening process extremely time-consuming and not easily replicable (16). This calls for a need of a more streamlined, automated, and replicable pre-surgical screening process for SOZ localization. The existing methodology, relying on rules outlined by experts and involving detailed manual analysis of numerous ICs, is time-intensive, subjective, and lacks reproducibility. Given the demonstrated capability of large-scale supervised statistical approaches, such as deep learning (DL), to identify abnormal patterns within complex datasets, recent advances have shown their application for SOZ localization from rs-fMRI data (18, 19). However, a limited study on 14 subjects with refractory TLE has shown poor positive predictive value (PPV) of 52% (±3.9%) for a brain parcellation to be associated with SOZ (19). Moreover, the identified SOZs do not conform to the disease characteristics, as bilateral SOZ was identified for patients with unilateral focal TLE (19). Our prior research has demonstrated that automating the expert rules outlined in (16) and then implementing them in a carefully structured sequential manner result in a PPV of 65% (±7.8%), surpassing the performance of statistical approaches (17).
One of the primary reasons for poor PPV of statistical approaches is that abnormal BOLD correlations, meaning non-noise, not normal resting state fMRI network, and thus is interpreted to be pathological, form <10% of the rs-fMRI ICs from ICA (17, 19). This triggers the fundamental Achillies heel of classification science, that is, class imbalance (20), where the statistical approach lacks enough pathological data to effectively distinguish abnormal patterns amidst significant individual variation. A pure statistical approach like DL overlooks the valuable expert knowledge encoded in terms of rules applied to real data by experts and validated by the successful seizure-free outcomes upon surgical resection/ablation of expert-identified SOZ. In this study, we aimed to investigate whether integration of expert knowledge of SOZ characteristics combined with data-driven statistical supervised learning approaches could improve the identification accuracy of SOZ compared to purely statistical machine learning approach, for subjects with RE. We validated identification accuracy by comparing against manual evaluation by two independent experts and subsequent ic-EEG-based SOZ identification, and in surgical patients, with surgical outcomes in terms of Engel scores. Additionally, we tested which knowledge component contributed most in improving DL performance of SOZ identification. Finally, we utilized the expert knowledge contributions to generate clinically relevant explanations of SOZ identification result.
1.1 Contribution
This study aims to automate localization of the SOZ using DL and expert knowledge, with the primary goal of facilitating the non-invasive assessment of iEEG lead placement by the surgical team. The main contributions of the study include:
• Demonstrating that the integration of AI with expert knowledge on SOZ characteristics results in superior automated SOZ localization compared to relying solely on AI techniques for the same purpose.
• Illustrating through knowledge ablation study that the expert knowledge of activations originating in gray matter, extending through white matter, and concluding in vascular regions, is identified as the most discriminative expert knowledge among expert-identified SOZ features.
• Validation of SOZ localization accuracy on a large dataset of 52 patients across various age ranges and gender.
1.2 Related works
Recent research can be broadly categorized into two main areas, as outlined in Table 1: epilepsy detection (21–23), which entails classifying patients as either epileptic or non-epileptic based on EN identification, and SOZ localization (1, 6, 17), the primary focus of this study. Table 1 presents a comparative analysis of recent studies, considering factors such as the number of subjects, the proportion of the RE subgroup, age range, and the types of ICs identified. Within this domain, evaluations encompass diverse metrics such as concordance with iEEG, agreement with expert-identified SOZ, and consistency with physician assessments.
The reported results column in Table 1 presents the evaluation metrics from the original manuscripts for each study. Various manual techniques for identifying the SOZ involve expert-defined rules based on specific spatio-temporal characteristics of BOLD signals captured by rs-fMRI. Boerwinkle et al. (6) explored the agreement between the epileptogenic zone (EZ) identified through rs-fMRI and the SOZ identified using iEEG data. They employed prevalence-adjusted bias-adjusted kappa (PABAK) on a cohort of 40 patients, revealing a concordance rate of 89%. This study highlighted the limitations of previous approaches that focused on the most abnormal brain region for SOZ localization. However, no study is reported on automation of expert sorted ICA-based SOZ classification in this study. Gil et al. (24) manually studied 21 patients with extratemporal focal epilepsy to identify SOZ-related ICs in fMRI data using the general linear model-derived EEG-fMRI time courses associated with epileptic activity. Lee et al. (25) also manually investigated the functional connectivity changes in the ENs from rs-fMRI data using intrinsic connectivity contrast (ICC) to evaluate the non-invasive pre-surgical diagnostic potential for SOZ localization. The agreement of fMRI-IC with intracranial EEG SOZ was 72.4%.
The first automation attempts were from Hunyadi et al. (1), who present a set of SOZ spatial and temporal features used to train a least-squares support vector machine (LS-SVM). Evaluation on 18 RE patients showed sub-optimal results. DL was first explored by Nozais et al. (18) to classify RSN ICs on non-RE patients and reported an accuracy of 92%. However, they did not pursue SOZ identification. Luckett et al. (26) used 2,132 healthy control data for training of 3D CNN and tested it on temporal lobe epilepsy to detect the whole hemisphere of seizure onset. The training data were synthetically altered in randomly lateralized regions which helped in detection of biological SOZ's hemisphere. Note that ICs were not used here, so this study detected the whole brain hemisphere of seizure onset rather than the brain region pointing toward the SOZ. Their primary findings suggested the ICA guided by their technique has the potential to identify epilepsy-related ICs in patients with focal epilepsy. Naresh et al. (19) explored deep graph neural networks using the T1-weighted images from rs-fMRI along with diffusion MRI (dMRI) measurements. Study on 14 subjects showed a sensitivity of 40% and precision of 52% while an accuracy of 88%. Not only the precision was sub-optimal, the identified SOZs did not align with the expected disease characteristics as bilateral SOZs were identified for patients diagnosed with unilateral focal temporal lobe epilepsy, rendering it irrelevant for pre-surgical screening. Zhang et al. (27) proposed ICA-based automated method using unsupervised algorithm to localize the SOZ. SOZ ICs were screened based on peripheral noise IC removal, asymmetry, and temporal features (excluding IC outside of frequency band 0.01–0.1 Hz). Consistency with the resection surgery on 10 patients was reported. If we assume consistency as true positive (TP), failure as FN and success in rejecting non-SOZ IC as true negative (TN), and failure to reject non-SOZ ICs as false positive (FP), then the results indicate significant FPs. Banerjee et al. (17) are the most recent study in the automation of SOZ localization. It uses six expert features combined from Boerwinkle et al. (6) and Hunyadi et al. (1). This technique reports high FPs.
Knowledge integration into DL models has been recently explored in many domains (28, 29) including medical imaging (30) for diagnosis, lesion, or organ segmentation with great success rate. Expert knowledge can be integrated in two broad ways (28): (a) scientific knowledge, through mathematical models as performed in molecular dynamics analysis, or (b) experiential knowledge, through logic rules. The current study falls in the second category. To the best of our knowledge, this is the first study exploring experiential knowledge integration with DL in epilepsy surgical planning.
2 Materials and methods
2.1 Participants
A total of 52 consecutive patients with quality data of average age 8 years 8 months (±5 years 4 months) were retrospectively studied who were diagnosed to have RE based on International League Against Epilepsy (ILAE) criteria (31), from our previously published IRB-approved retrospective cohort data who had ic-EEG and surgery. The evaluation involved rs-fMRI, continuous video monitoring while electroencephalography (EEG) is being performed, and anatomical 3T MRI as a part of standard MRI SOZ localization protocol followed at Phoenix Children's hospital (PCH) for epilepsy surgery evaluation. From this published study, the three imaging modalities were independently reviewed by two blinded experts, a neurologist, and a neurosurgeon, to determine the SOZ location in both anatomical MRI and rs-fMRI. For rs-fMRI, each expert sorted independent components (ICs) into three categories: NOISE, resting state network (RSN) and SOZ (Henceforth, class labels are denoted by capitalized, bold, and italicized text). In cases where there was any disagreement, a third reviewer was consulted for the final determination.
Subsequently, each patient was subjected to ic-EEG-based monitoring, which was independent of the rs-fMRI monitoring result. A clear indication of SOZ through observation of ic-EEG spikes determined the confirmed candidacy of the patient for surgical resection, ablation, or disconnection. For all patients, the SOZ ICs identified from rs-fMRI were manually verified by the experts. Hence, the expert manual denotation of an IC as NOISE, RSN, or SOZ, supported by ic-EEG and/or surgical outcome, is considered as ground truth in this research.
For patients that did undergo surgery, the surgical location was determined by the expert epilepsy surgery conference team informed by the non-invasive imaging, including the expert-identified rs-fMRI-based SOZ location, and ic-EEG monitoring result. Validation of the manual SOZ determination was performed through evaluation of seizure-free outcomes after surgical alteration of the rs-fMRI identified SOZ corroborated with ic-EEG.
The patients were divided into three age groups to evaluate the effect of age on the SOZ localization performance. Patients in all age groups varied across many demographic and clinical characteristics (Table 2). Surgical outcomes were evaluated using Engel scores where Engel I meant seizure free, and Engel II meant at-most one debilitating seizure in the first year after surgery.
Table 2. Demographic and clinical characteristics of participants, including sex distribution, age at onset, surgical procedures, seizure frequencies, seizure outcomes, and ethnicity breakdown.
2.2 Data acquisition and processing
The MRI images were obtained using a 3T MRI unit, Ingenuity Philips Medical systems, equipped with a 32-channel head coil. The rs-fMRI settings were configured with a TR 2,000 ms, TE 30 ms, matrix size 80 × 80, flip angle 80°, and a total number of 46 slices. Each slice had a thickness of 3.4 mm without any gaps, and the in-plane resolution was set to 3 × 3 mm. The acquisition process involved interleaved acquisition, with a grand total of 600 volumes obtained across two 10-min runs, culminating in a total acquisition time of 20 min. MELODIC tool (32) was employed to analyze the rs-fMRI and extract ICs using ICA (16). Preprocessing consisted of discarding the initial five volumes to remove T1 saturation effects, applying a high-pass filter at 100s, correcting for slice time, implementing spatial smoothing with a full-width at half maximum of 1 mm, and addressing motion artifacts through MCFLIRT (33), while excluding non-brain structures. Linear registration was done between the individual functional scans and the patient's high-resolution anatomical scan (34) which was further refined using boundary-based registration (35).
2.3 Computational approach overview
The SOZ localization approach utilized two types of models: (a) deep supervised classification model or DL model and (b) expert knowledge integration (EKI) model. The approach combined the result of these two models following three steps (Figure 1):
Figure 1. Overview of the proposed SOZ IC localization. (Top panel) Preprocessing the data by reducing the image dimensions to alleviate computational overhead. (Second panel-top) training involves relabeling RSN and SOZ as non-noise components. (Second panel-bottom) These components are then subjected to CNN. Additionally, we establish an expert knowledge integration model (EKI), which is trained based on the extracted expert knowledge from RSN, and SOZ components. (Third panel) testing involves classification task of rs-fMRI ICs into three categories: NOISE, RSN, and SOZ using both DL and expert knowledge. (Bottom panel) Localization of SOZ involves identification of biggest cluster among a patient's SOZ slices. The operator ◦ denotes dot product.
Step 1 Preprocessing: DL model used a labeled set of ICs, where each IC, IL is labeled as RSN, SOZ, or NOISE. DL model complexity drastically increases with input size potentially resulting in more data requirement to avoid under-fitting. Moreover, DL model expects all ICs to be of the same size. Hence, in the preprocessing step, we resized each labeled IC, IL of size 709 × 1,006 × 3, to an image of size 270 × 470 × 3. for use by the DL model, while IL was used by the EKI. gave the most optimized DL model with the best accuracy for the given dataset, determined through hyperparameter tuning (36).
Step 2 Training: In this phase, each RSN or SOZ IC was relabeled as . The DL was trained to recognize NOISE or classes. In parallel, each RSN and SOZ IC was passed through the expert knowledge feature extraction mechanism, and a weight optimization was applied to obtain the best linear combination of expert knowledge components that were most discriminative between RSN and SOZ.
Step 3 Testing: A test patient's IC, I, was passed through DL and EKI models. EKI provided a confidence score ρ for I being SOZ. If DL categorized IR as , I retained EKI labels. However, if DL labeled IR as NOISE, I's classification then depended on ρ. Only if ρ > 0.9, I was marked as SOZ, else it retained DL label of NOISE. Having marked SOZ ICs, the SOZ was localized by designating the largest activation cluster, extracted using Density-Based Spatial Clustering of Applications with Noise (DBSCAN) (37).
2.3.1 Training phase: noise detection using labels through supervised learning
The ICs IL were relabeled to form IL{N} where ICs were either NOISE ICs or (RSN/SOZ). Five strategies with an 80-20 train/test split of the entire data were tested to classify IL{N} into the new class categories: (a) 2D convolution neural network (CNN), (b) multilayer perceptron similar to (18), (c) transfer learning using VGG-16 Imagenet model (38), (d) problem reduction by treating the BOLD timeseries as images (39), and (e) vision transformer (ViT). Validation result showed that the 2D CNN had the best precision and sensitivity in determining NOISE ICs (comprehensive results in Supplementary Document). Consequently, we opted for the utilization of the 2D CNN for the classification of noise ICs. The hyperparameters of the 2D CNN were obtained using the Keras-Tuner's hyperband algorithm, with the objective of minimizing the validation loss. The hyperparameter tuning process involved exploring various configurations:
• Number of convolution layers: [3, 4, 5],
• Number of units or filters per convolution layer: 32–512, with a default of 128,
• Number of neurons in the dense layer: 192–1,024, with a step of 256,
• Learning rates: 0.01, 0.001, or 0.0001,
• Dropout rates: 0.2, 0.33, 0.4, 0.5, or 0.66,
The optimized hyperparameter values, determined through the Keras-Tuner, were as follows: three convolution layers, with 64, 64, and 256 3 × 3 filters in each respective layer; a dense fully connected layer with 704 neurons; a learning rate of 0.0001; and a dropout rate of 0.33. These specific hyperparameter values were selected to enhance the performance of the 2D CNN in accurately classifying noise ICs. Keras's image data generator was used to create batches of both NOISE and IC images. IC images were resized from I to IR using “flow from directory” method. “Binary cross-entropy” loss function along with “Adam” optimizer was chosen. Potential overfitting was addressed using dropout regularization and “early_stopping” strategy. Activation function “ReLU” was chosen for input and hidden layers and “Sigmoid” function for the output layer. Given the characteristics of our dataset, which features dark backgrounds and required extraction of sharp features while controlling variance and computational complexity, we inserted a max pooling layer of 2 × 2 after every convolution layer (36).
2.3.2 Training phase: expert knowledge on rs-fMRI IC
Expert epileptologists use the RSN, NOISE, and SOZ indicators to manually sort the ICs (Figure 2) as compiled from the studies by (1) and (16). In our methodology, we encoded the SOZ specific expert knowledge into the SOZ localization mechanism. This phase was subsequently divided into two steps.
Figure 2. Three types of information are encoded in rs-fMRI: NOISE, RSN, and SOZ. Each of these categories adheres to specific rules that define their classification.
(a) Extracting brain slices: Brain slices were derived from RSN and SOZ ICs through template matching. We used the Montreal Neurological Institute's 152 brain template (MNI152) for this purpose. With the help of the coordinates given by template matching, we extracted brain slices which enabled the subsequent extraction of features guided by expert knowledge.
(b) Extraction of expert knowledge: The expert knowledge about SOZ characteristics (Figure 2) is represented using the following features, Fex:
1. Fex(1) Number of clusters: SOZ IC ideally has one cluster of activation that spreads asymmetrically in one hemisphere, whereas an RSN IC consists of multiple (at least 2) clusters of activation which are spread symmetrically across the two brain hemispheres.
2. Fex(2) Activation extended to ventricles: A SOZ has activation extended from gray matter toward ventricles through the white matter.
3. Fex(3) Dominant frequencies: SOZ's BOLD signal power spectra exhibit dominant frequencies >6 Hz.
4. Fex(4) Sparsity in frequency domain: The rs-fMRI SOZ power spectrum is sparse with dominant frequency much more spread out throughout the spectrum than RSN.
The abovementioned features were extracted using the following method to form the feature vector Fex for each IC, IL.
Fex extraction method:
Fex(1) Number of clusters: From each IC, brain slices were extracted (Figure 3). From each slice, the number of clusters was estimated using DBSCAN (37). This approach had two adjustable parameters: neighborhood, which defined the distance metric and a value called ϵ, and vmin, which determined the minimum number of neighboring voxels. Voxels with more than vmin neighbors within the ϵ distance were considered core points and formed a cluster. Voxels that were not core points but were within ϵ distance of a core point were classified as border points and assigned to the nearest core point's cluster. All other points were disregarded. Clusters were formed by combining core points that were within ϵ distance of each other. Additionally, we set a threshold of 135 pixels, counting only those clusters that surpassed this threshold to determine the total number of clusters. The output of this step was the number of clusters in each IC slice (Figure 3).
Fex(2) Activation extended to ventricles: To identify the activation of the SOZ that extended from gray matter toward the ventricles through the white matter, a Sobel filter-based edge detection technique was applied, which extracted the contours for each slice, with the white matter exhibiting the most prominent contour within the slice (17). To obtain the ventricular regions, we applied edge detection to determine brain boundary. The ventricles run throughout the brain; however, they are less prominent in the slices that are toward the brain surface. Hence, we selected slices that are near the base of the brain. In these slices, the ventricle is more prominent and interrupts the continuity of the image. As such the contour detection gives multiple brain boundary contours, identified through the Sobel filter. The ventricular regions were within the convex hull of the brain boundary contours but did not intersect any brain boundary (Figure 3). Subsequently, a comprehensive analysis was conducted to determine whether the larger clusters (with a size exceeding 135 pixels) had the overlapping with the white matter and extension toward the ventricles. In the overlapping process, from each slice of an IC, both clusters and contours were obtained. The presence of an overlapping cluster could potentially impede the contour detection algorithm, hindering the extraction of white matter and blood vessel contours. In the initial pass through the ICs, we obtained a version of each slice devoid of clusters, serving as a basis for contour identification. The algorithm then underwent a subsequent pass through each slice of an IC, detecting clusters and evaluating their intersection with the white matter.
Fex(3) Dominant frequencies and Fex(4) sparsity in frequency domain: For temporal SOZ characteristics, ICs were analyzed for activelet and sine dictionary sparsity in their time courses. For calculating the sparsity in activelet basis, the BOLD signal was divided into windows of length 256 samples. From every window, four levels of activelet transformation coefficients using the “a trous” algorithm with exponential-spline wavelets were extracted (1). The Gini Index metric was used for activelet coefficients and sine dictionary sparsity evaluation in the frequency band of 0.01–0.1 Hz.
2.3.3 Training phase: balanced dataset creation
The data distribution is composed of ~51% NOISE, 43% RSN, and merely 5% SOZ occurrences. To overcome class imbalance after the feature extraction process, synthetic SOZ features were created using SMOTE (40). Given the constraint of a restricted quantity of available SOZ ICs, ~5 ICs per subject, SMOTE identified authentic SOZ IC samples within the feature space and performed linear interpolation of features.
2.3.4 Training phase: expert knowledge combination logic
Ambiguity is inherent in expert knowledge, given the significant individual variance in seizure onset characteristics. Hence, Fex could not be used in isolation to represent expert knowledge, and a carefully crafted combination was necessary. We utilized the subset of ICs, IL{R, S} that is labeled RSN or SOZ to configure a linear combination logic for the expert knowledge vector for an IC I that gave the best discriminative power between these two classes. For each IL{R, S}, we defined yi = −1 if it was RSN and yi = 1 if it was SOZ. We derived an expert knowledge weight vector ωex, of size |Fex| × 1 that:
where is the expert knowledge vector of the ith IC in ILR, S, is the L2 norm of a vector, and ° denotes the dot product operator (Figure 3).
2.3.5 Testing phase: SOZ localization approach
To obtain the robust estimate of our approach's performance on patient's data, we employed a leave-one-out cross-validation strategy, wherein each patient was given an opportunity to represent the entirety of the test datasets. This method of cross-validation resulted in the most variance and a tight confidence interval through this approach, indicated robust performance across all patients. All rs-fMRI ICs of the test subject, I, went through a dual assessment using pre-trained DL model with IR and EKI model with I. The DL model classified IR as either NOISE or . In parallel, EKI model assigned SOZ or RSN labels to the ICs I based on their confidence score , where ωex is the weight configuration from the training phase of the EKI model. The test IC I is assigned the label SOZ under two conditions:
a) IR was classified as by DL model, but I was classified as SOZ by EKI model, or
b) IR was classified as NOISE by DL model, but I was classified as SOZ with a classification score ρ > 0.9.
Otherwise, I was not marked as SOZ. At this point, the knowledge component with the highest contribution, , in determining the SOZ was subsequently highlighted as the rationale/explanation behind selecting a specific IC as the SOZ.
The output of this step was a set ISOZ of ICs that were marked as SOZ. The SOZ ICs were then further processed through the brain slice extraction and DBSCAN mechanism to obtain a set of clusters . The localized SOZ was the largest cluster in each IC in ISOZ,
3 Results
We evaluated: (a) efficacy of our approach, its variation across age and sex and compare with state-of-the-art techniques, (b) significance of localized SOZ through correlation with surgical outcomes, and (c) knowledge ablation to show relative importance of spatial and temporal expert knowledge in SOZ identification.
3.1 Comparative techniques
We chose the following categories for comparison with our proposed approach supervised learning with both labels and expert knowledge (SLLEK):
1) Supervised learning with labels using CNN (SLL-CNN): We utilized a 2D CNN-based deep learning technique for comparison, solely using the labeled dataset in a supervised manner without incorporating any form of expert knowledge encoding. We also implemented cost-sensitive learning in CNN to ensure equal significance across all three classes during gradient updates.
2) Supervised learning with labels using ViT (SLL-ViT): We employed another DL approach of vision transformer (ViT) for our comparative analysis. This methodology also relied on the labeled dataset, embracing a supervised learning paradigm without integrating any explicit encoding of expert knowledge. To optimize the model's performance on our dataset, we leveraged Optuna to identify the most effective hyperparameters. Furthermore, to address class imbalance issue within the dataset, we set the weight parameter of the loss function to the computed class weights. This is particularly crucial when certain classes are underrepresented (SOZ in our case), as it helps the model to give more emphasis to the minority classes, preventing them from being overshadowed by the majority classes. Additionally, in order to prevent ViT from suffering from gradient explosion and gradient vanishing issues, we implemented gradient clipping and batch normalization, respectively.
3) Statistical pattern learning with expert knowledge (SLEK): This methodology was inspired by Hunyadi et al. (1) which uses expert-guided features to facilitate model learning. To ensure an unbiased comparison with our own approach, we also applied the Synthetic Minority Over-sampling Technique (SMOTE) to generate ICs endowed with SOZ features, thereby achieving balance among the three classes (implementation details in the Supplementary Document).
4) Unsupervised learning with expert knowledge (ULEK): This approach was inspired by EPIK (17), which employs a cascade of six expert rules in a waterfall technique for IC classification (detailed implementation in the Supplementary Document).
3.2 Evaluation metrics
We employed a 2-fold approach: (a) We assessed the agreement between SOZ-labeled ICs using our technique and the surgically targeted SOZ location for each Engel score group for 25 patients with available surgical resection/ablation outcomes in our dataset. (b) For all 52 patients, we validated the accuracy of generated labels from various approaches against the expert's sorted labels. The evaluation was conducted using commonly employed metrics such as accuracy, precision, and sensitivity (1, 25, 27).
3.3 Statistical methods
Statistical methods were utilized to derive the significance of (a) the effect of age and sex on the SOZ identification performance and (b) the difference in standard metrics among algorithms.
For the first aim, we utilized a mixed effects model, incorporating age and sex as predictors, along with their combined effect, and a random effect on the patient.
For the second aim, we computed the variance of the evaluation metrics across various subsets of test data obtained through categorization by age and sex. The variance of each metric in the techniques that are closest in performance to SLLEK showed < 10% difference. Moreover, we utilized the Kolmogorov-Smirnov (KS) test (41) to verify that the distribution of evaluation metrics across the subsets of test data came from a normal distribution with significance value α < 0.05. The p-value of the KS test is provided in Supplementary Table 3 with a high p-value indicating that the KS test could not reject the null hypothesis that the data came from a normal distribution. Since the variance of the evaluation metrics for the closest methods is similar and the metrics across test data subsets fit normal distribution, we utilized a one-sided t-test to evaluate the statistical significance of the difference between our approach and other comparative techniques. The 95% confidence p-values are provided in Table 3.
3.4 Performance evaluation
Our approach (SLLEK) outperformed the other techniques across all evaluation metrics, as illustrated in Table 3 for the given patient population, warranting further investigation. The results encompassed standard metrics evaluations, considering variations in age and sex.
We observed the impact of incorporating expert knowledge with DL in our approach, quantified as the difference between SLLEK and SLL (EoK). The last column in Table 3 provides the statistical significance of the difference between SLLEK and comparative techniques implemented on our dataset.
SLLEK exhibited high sensitivity, indicating a low false negative (FN) rate compared to other methods. Proficiency in accurately identifying the correct SOZ ICs suggests that expert knowledge integration with DL enhances the SOZ ICs identification and warrants further exploration.
SLLEK demonstrated higher accuracy, precision, and sensitivity across all age groups and sex distributions. In contrast, SLL and SLEK exhibited significant variability based on age and sex. ULEK emerged as the second-best performer after SLLEK. The p-values presented in Table 3 highlight the statistically significant differences between SLLEK and all other comparative techniques. Nevertheless, statistically, there is an insignificant difference between SLLEK and ULEK.
Comparison with state-of-the-art computer vision technique ViT is also presented in Table 3. As the results show, ViT did not perform good for SOZ localization. This observation aligns with the understanding that ViTs may face challenges in generalizing well with smaller datasets. It is noteworthy that CNNs, in contrast, exhibit better generalization on smaller datasets, yielding better accuracy. This is attributed to the inherent capability of CNNs to excel in learning from limited data (42, 43).
Overall, the outcomes suggest that our approach has the potential to enhance the manual sorting workflow for the surgical team, positioning it as a promising and effective tool in detecting SOZ for pediatric RE patients.
3.5 Performance with surgical outcomes
Of the 25 subjects who had surgery to remove rs-fMRI determined SOZ, 16 (64%) achieved seizure freedom (Engel I), and 7 (28%) experienced significantly reduced postoperative seizure frequency (Engel II). This indicated that the removed regions likely represented a substantial portion of the epileptogenic network.
SLLEK showed the highest sensitivity of 93.3% for patients undergoing minimally invasive ablation surgery, making it a promising option in such cases. For patients undergoing resection, SLLEK maintained a consistent sensitivity of 85.7%, outperforming other techniques for the dataset.
Furthermore, when analyzing patients with Engel I outcome, SLLEK exhibited a 93% agreement with expert sorting, reinforcing its suitability and reliability as a pre-surgical screening tool (Table 4).
3.6 Knowledge ablation studies
SLLEK's performance could be attributed to the influence of each expert knowledge component on the accuracy of SOZ identification. To better understand its capabilities, we assessed the impact of removing specific knowledge components (Table 5) from SLLEK in relation to standard metrics used in Table 3.
SLLEK without temporal features: The BOLD signal temporal features were removed one by one from the expert knowledge model of SLLEK. We created two unique configurations: (a) SLLEK without activelet domain sparsity and (b) SLLEK without sine domain sparsity. Table 5 reveals no significant impact on metrics, indicating that removing temporal features had limited effect on the classification of patient's ICs with SLLEK.
SLLEK without spatial features: The spatial features were removed one by one from the expert knowledge model to create two unique configurations: (a) SLLEK without the number of clusters, and (b) SLLEK without white matter overlap. An 11% reduction in accuracy and a 6% drop in F1 score were noted when the number of clusters feature was removed from the analysis. However, when the white matter overlap feature was omitted, a substantial 41% decrease in accuracy and a 27% reduction in the F1 score were observed. These findings underscore the pivotal role of white matter overlap as the most influential feature in the identification of the SOZ.
4 Discussions and limitations
The results suggest an approach that combines expert features, and AI for SOZ localization may possess the capability to generate connectivity classifications that align with ic-EEG and surgical outcomes. This stems from the design where expert knowledge integration model facilitates the derivation of weight contributions for each expert feature for SOZ identification. This provision not only enables explanations for the selection of an IC for SOZ but also amplifies its potential as an advanced tool for SOZ identification in clinical contexts. By furnishing transparent rationales for its classifications, our approach may equip the surgical team with invaluable insights.
The interplay between the deep supervised classification model and expert knowledge integration components in the SOZ localization approach is instrumental in achieving superior localization accuracy. The DL model excels in discerning noise images, as evidenced by its classification capabilities. Our PCH dataset of 52 patients had a total of 5,616 IC images where only 5.6% of these images represented SOZ ICs, while 51.1% were attributed to noise ICs, and 43.1% to RSN ICs. Due to such high data imbalance, where majority class is 16.6 times more prevalent than minority class, commonly used imbalanced data handling techniques such as cost-sensitive training failed to provide good performance as seen in SLL-CNN or SLL-ViT. Similarly, under-sampling the majority classes to balance the data would have resulted in significant information loss from the majority class, potentially resulting in overall performance loss.
Due to balanced data between Noise and RSN ICs, DL could learn their distribution. However, due to the limited availability of SOZ ICs in the dataset, traditional DL techniques faced challenges in learning the intricate features of these rare events from such a small subset of SOZ data. To overcome this limitation, a need arose for a methodology that could leverage the wealth of expert knowledge on SOZ characteristics available in the literature review. Additionally, relying solely on expert knowledge also exhibits a sub-optimal outcome as it struggles to capture the intricate details of brain networks, possibly because of the overlapping characteristics between noise and SOZ ICs. For instance, an activation located in the white matter is associated with noise, whereas an activation originating in the gray matter, extending into the white matter and reaching the ventricles, is indicative of SOZ. The minimal overlap of activation on gray matter can sometimes make these SOZ activations appear as a noise. This is a domain where DL excels in encoding the nuances of noise ICs more effectively, benefiting from a slightly larger data of noise ICs to learn and represent their characteristics. This unique integration of DL for noise IC classification and EKI for SOZ IC classification addresses the performance limitations inherent in relying solely on either DL or EKI strategies, offering a more robust and comprehensive solution to SOZ IC identification.
While our dataset is one of the largest in recent literature for pediatric patients with RE, a larger study is necessary to address the potential impact of variability in fMRI preprocessing and motion correction techniques, which can differ across centers. Before being used with minimal expert supervision, further testing of this technique in real-world settings is necessary, considering its intended application in local epilepsy care centers.
5 Conclusion
The most effective treatment for RE is surgical resection or ablation of the SOZ which requires accurate localization to avoid functional brain network damage and developmental impairments. While rs-fMRI, a non-invasive imaging technique, holds promise for SOZ localization and guiding iEEG lead placement, its clinical integration is hindered by the lack of expertise in manual seizure onset analysis. Additionally, manual sorting of ICs obtained from rs-fMRI data using ICA is a challenging and subjective task as only a small fraction (<5%) of the ICs is related to the SOZ. This makes the process time-consuming and limits the reproducibility and availability of this non-invasive technique. Accurate, automated, and reproducible SOZ localization is imperative for successful surgical treatment of RE while avoiding functional brain network damage and resultant developmental impairments. This study shows how expert knowledge can be integrated with powerful supervised learning approaches to automate SOZ localization. Reliable performance on a large dataset of children with RE, stratified across age, sex, and corroboration with 1-year post operative Engel outcomes for rs-fMRI-guided surgery increases confidence of potential for clinical integrability of the approach. The activations initiating in gray matter, extending through white matter, and ending in vascular regions were seen as the most discriminative expert identified SOZ characteristics. The prospect of automating SOZ localization using advanced AI techniques and existing expert knowledge not only addresses existing challenges in manual analysis but also suggests a transformative shift toward more accessible, trustworthy, and reproducible clinical application in epilepsy care. In future, a multi-center study to evaluate general applicability of the technique irrespective of scanning protocols and measurement devices is contemplated.
Data availability statement
The data analyzed in this study is subject to the following licenses/restrictions: The dataset includes a pediatric population, access to it requires a data sharing agreement and ethics review by Phoenix Children's Hospital (PCH). Requests to access these datasets should be directed to YWJhbmVyajMmI3gwMDA0MDthc3UuZWR1.
Ethics statement
The studies involving humans were approved by Phoenix Children Hospital (PCH), Institutional Review Board (IRB 20-358). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants' legal guardians/next of kin.
Author contributions
PK: Software, Validation, Methodology, Writing—original draft. AB: Investigation, Software, Methodology, Validation, Writing—original draft. VB: Writing—review & editing. SG: Investigation, Methodology, Project administration, Validation, Writing—review & editing.
Funding
The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.
Acknowledgments
The authors are thankful to Phoenix Children Hospital, Sarah Wyckoff and Bethany Sussman for rs-fMRI data collection and subject de-identification.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2023.1324461/full#supplementary-material
References
1. Hunyadi B, Tousseyn S, Dupont P, Van Huffel S, De Vos M, Van Paesschen W. A prospective fMRI-based technique for localizing the epileptogenic zone in presurgical evaluation of epilepsy. NeuroImage. (2015) 107:329–39. doi: 10.1016/j.neuroimage.2015.03.011
2. Kwan P, Sander JW. The natural history of epilepsy: an epidemiological view. J Neurol Neurosurg Psychiatry. (2004) 75:1376–81. doi: 10.1136/jnnp.2004.045690
3. Kwan P, Brodie MJ. Definition of refractory epilepsy: defining the indefinable? Lancet Neurol. (2010) 9:27–9. doi: 10.1016/S1474-4422(09)70304-7
4. Varatharajah Y, Berry B, Cimbalnik J, Kremen V, Van Gompel J, Stead M, et al. Integrating artificial intelligence with real-time intracranial EEG monitoring to automate interictal identification of seizure onset zones in focal epilepsy. J Neural Eng. (2018) 15:051002. doi: 10.1088/1741-2552/aac960
5. Sillanpää M, Shinnar S. Long-term mortality in childhood-onset epilepsy. N Engl J Med. (2010) 363:2522–9. doi: 10.1056/NEJMoa0911610
6. Boerwinkle VL, Mohanty D, Foldes ST, Guffey D, Minard CG, Vedantam A, et al. Correlating resting-state functional magnetic resonance imaging connectivity by independent component analysis-based epileptogenic zones with intracranial electroencephalogram localized seizure onset zones and surgical outcomes in prospective pediatric intractable epilepsy study. Brain Connect. (2017) 7:424–42. doi: 10.1089/brain.2016.0479
7. Perucca P, Dubeau F, Gotman J. Intracranial electroencephalographic seizure-onset patterns: effect of underlying pathology. Brain. (2014) 137:183–96. doi: 10.1093/brain/awt299
8. Proix T, Jirsa VK, Bartolomei F, Guye M, Truccolo W. Predicting the spatiotemporal diversity of seizure propagation and termination in human focal epilepsy. Nat Commun. (2018) 9:1088. doi: 10.1038/s41467-018-02973-y
9. West S, Nolan SJ, Cotton J, Gandhi S, Weston J, Sudan A, et al. Surgery for epilepsy. Cochrane Database Syst Rev. (2015) 7:CD010541. doi: 10.1002/14651858.CD010541.pub2
10. Nagahama Y, Schmitt AJ, Nakagawa D, Vesole AS, Kamm J, Kovach CK, et al. Intracranial EEG for seizure focus localization: evolving techniques, outcomes, complications, and utility of combining surface and depth electrodes. J Neurosurg. (2018) 130:1180–92. doi: 10.3171/2018.1.JNS171808
11. Jobst BC, Bartolomei F, Diehl B, Frauscher B, Kahane P, Minotti L, et al. Intracranial EEG in the 21st century. Epilepsy Curr. (2020) 20:180–8. doi: 10.1177/1535759720934852
12. Chakraborty AR, Almeida NC, Prather KY, O'Neal CM, Wells AA, Chen S, et al. Resting-state functional magnetic resonance imaging with independent component analysis for presurgical seizure onset zone localization: a systematic review and meta-analysis. Epilepsia. (2020) 61:1958–68. doi: 10.1111/epi.16637
13. Khoo A, Martin L, de Tisi J, O'Keeffe AG, Sander JW, Duncan JS. Cost of pre-surgical evaluation for epilepsy surgery: a single-center experience. Epilepsy Res. (2022) 182:106910. doi: 10.1016/j.eplepsyres.2022.106910
14. Smith SM, Vidaurre D, Beckmann CF, Glasser MF, Jenkinson M, Miller KL, et al. Functional connectomics from resting-state fMRI. Trends Cogn Sci. (2013) 17:666–82. doi: 10.1016/j.tics.2013.09.016
15. Guerra-Carrillo B, Mackey AP, Bunge SA. Resting-state fMRI: a window into human brain plasticity. Neuroscientist. (2014) 20:522–33. doi: 10.1177/1073858414524442
16. Boerwinkle VL, Cediel EG, Mirea L, Williams K, Kerrigan JF, Lam S, et al. Network targeted approach and postoperative resting state functional mri are associated with seizure outcome. Ann Neurol. (2019) 86:344. doi: 10.1002/ana.25547
17. Banerjee A, Kamboj P, Wyckoff SN, Sussman BL, Gupta SKS, Boerwinkle VL. Automated seizure onset zone locator from resting-state functional mri in drug-resistant epilepsy. Front Neuroimaging. (2022) 1:1007668. doi: 10.3389/fnimg.2022.1007668
18. Nozais V, Boutinaud P, Verrecchia V, Gueye MF, Hervé PY, Tzourio C, et al. Deep learning-based classification of resting-state fmri independent-component analysis. Neuroinformatics. (2021) 19:619–37. doi: 10.1007/s12021-021-09514-x
19. Nandakumar N, Hsu D, Ahmed R, Venkataraman A. DeepEZ: a graph convolutional network for automated epileptogenic zone localization from resting-state fMRI connectivity. IEEE Trans Biomed Eng. (2023) 70:216–27. doi: 10.1109/TBME.2022.3187942
20. Branco P, Torgo L, Ribeiro RP. A survey of predictive modeling on imbalanced domains. ACM Comput Surveys. (2016) 49:1–50. doi: 10.1145/2907070
21. Bharath RD, Panda R, Raj J, Bhardwaj S, Sinha S, Chaitanya G, et al. Machine learning identifies “rsfMRI epilepsy networks” in temporal lobe epilepsy. Eur Soc Radiol. (2019) 29:3496–505. doi: 10.1007/s00330-019-5997-2
22. Lopes R, Lina J, Fahoum F, Gotman J. Detection of epileptic activity in fMRI without recording the EEG. Neuroimage. (2012) 60:1867–79. doi: 10.1016/j.neuroimage.2011.12.083
23. Nguyen R, Smyth M, Zhu L, Pao L, Swisher S, Kennady E, et al. A comparison of machine learning classifiers for pediatric epilepsy using resting-state functional MRI latency data. Biomed Rep. (2021) 15:77. doi: 10.3892/br.2021.1453
24. Gil F, Padilla N, Soria-Pastor S, Setoain X, Boget T, Rumiá J, et al. Beyond the epileptic focus: functional epileptic networks in focal epilepsy. Cereb Cortex. (2020) 30:2338–57. doi: 10.1093/cercor/bhz243
25. Lee H, Arora J, Papademetris X, Tokoglu F, Negishi M, Scheinost D, et al. Altered functional connectivity in seizure onset zones revealed by fMRI intrinsic connectivity. Neurology. (2014) 83:2269–77. doi: 10.1212/WNL.0000000000001068
26. Luckett P, Maccotta L, Lee J, Park K, Dosenbach UF, Ances B, et al. Deep learning resting state functional magnetic resonance imaging lateralization of temporal lobe epilepsy. Epilepsia. (2022) 63:1542–52. doi: 10.1111/epi.17233
27. Zhang CH, Lu Y, Brinkmann B, Welker K, Worrell G, He B. Lateralization and localization of epilepsy-related hemodynamic foci using presurgical fMRI. Clin Neurophysiol. (2015) 126:27–38. doi: 10.1016/j.clinph.2014.04.011
28. Cui Z, Gao T, Talamadupula K, Ji Q. Knowledge-augmented deep learning and its applications: a survey. arXiv:221200017. (2022). doi: 10.1109/TNNLS.2023.3338619
29. Hossain S, Kamboj P, Maity A, Azuma T, Banerjee A, Gupta S. EdGCon: auto-assigner of iconicity ratings grounded by lexical properties to aid in generation of technical gestures. In: Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing. New York, NY: Association for Computing Machinery (2023). p. 3–10. doi: 10.1145/3555776.3577623
30. Xie X, Niu J, Liu X, Chen Z, Tang S, Yu S. A survey on incorporating domain knowledge into deep learning for medical image analysis. Med Image Anal. (2021) 69:101985. doi: 10.1016/j.media.2021.101985
31. Berg AT, Berkovic SF, Brodie MJ, Buchhalter J, Cross JH, van Emde Boas W, et al. Revised terminology and concepts for organization of seizures and epilepsies: report of the ILAE commission on classification and terminology, 2005–2009. Epilepsia. (2010) 51:676–85. doi: 10.1111/j.1528-1167.2010.02522.x
32. Beckmann CF, Smith SM. Probabilistic independent component analysis for functional magnetic resonance imaging. IEEE Trans Med Imaging. (2004) 23:137–52. doi: 10.1109/TMI.2003.822821
33. Jenkinson M, Bannister P, Brady M, Smith S. Improved optimization for the robust and accurate linear registration and motion correction of brain images. NeuroImage. (2002) 17:825–41. doi: 10.1006/nimg.2002.1132
34. Jenkinson M, Smith S. A global optimisation method for robust affine registration of brain images. Med Image Anal. (2001) 5:143–56. doi: 10.1016/S1361-8415(01)00036-6
35. Greve DN, Fischl B. Accurate and robust brain image alignment using boundary-based registration. NeuroImage. (2009) 48:63–72. doi: 10.1016/j.neuroimage.2009.06.060
37. Celebi ME, Aslandogan YA, Bergstresser PR. Mining biomedical images with density-based clustering. In: International Conference on Information Technology: Coding and Computing (ITCC). IEEE (2005). p. 163–8. doi: 10.1109/ITCC.2005.196
38. Talo M, Yildirim O, Baloglu UB, Aydin G, Acharya UR. Convolutional neural networks for multi-class brain disease detection using MRI images. Comput Med Imag Graph. (2019) 78:101673. doi: 10.1016/j.compmedimag.2019.101673
39. Sobahi N, Ari B, Cakar H, Alcin OF, Sengur A. A new signal to image mapping procedure and convolutional neural networks for efficient schizophrenia detection in EEG recordings. IEEE Sensors. (2022) 22:7913–9. doi: 10.1109/JSEN.2022.3151465
40. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. (2002) 16:321–57. doi: 10.1613/jair.953
41. Chakravarti IM, Laha RG, Roy J. Handbook of Methods of Applied Statistics. Vol. 2. New York, NY: Wiley (1967).
42. Maurício J, Domingues I, Bernardino J. comparing vision transformers and convolutional neural networks for image classification: a literature review. Appl Sci. (2023) 13:5521. doi: 10.3390/app13095521
Keywords: seizure onset zone, preoperative evaluation, deep learning, rs-fMRI, expert knowledge, focal pharmaco-resistant epilepsy, pre-surgical
Citation: Kamboj P, Banerjee A, Boerwinkle VL and Gupta SKS (2024) The expert's knowledge combined with AI outperforms AI alone in seizure onset zone localization using resting state fMRI. Front. Neurol. 14:1324461. doi: 10.3389/fneur.2023.1324461
Received: 19 October 2023; Accepted: 13 December 2023;
Published: 11 January 2024.
Edited by:
Matteo Spezialetti, University of L'Aquila, ItalyReviewed by:
Amirmasoud Ahmadi, Max Planck Institute for Biological Intelligence, GermanyDaniele Lozzi, University of L'Aquila, Italy
Copyright © 2024 Kamboj, Banerjee, Boerwinkle and Gupta. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Payal Kamboj, cGthbWJvaiYjeDAwMDQwO2FzdS5lZHU=