- 1Maintenance Department, PETRONAS Gas Berhad, Kerteh, Terengganu, Malaysia
- 2Centre for Intelligent Signal and Imaging Research (CISIR), Electrical and Electronic Engineering Department, Universiti Teknologi PETRONAS, Bandar Seri Iskandar, Perak, Malaysia
- 3Department of Data Science and Artificial Intelligence, School of Engineering and Technology, Sunway University, Petaling Jaya, Selangor, Malaysia
- 4Centre for Subsurface Imaging, Department of Geosciences, Universiti Teknologi PETRONAS, Bandar Seri Iskandar, Perak, Malaysia
- 5Center of Excellence in Intelligent Engineering Systems (CEIES), Department of Electrical and Computer Engineering, Faculty of Engineering, King Abdulaziz University, Jeddah, Saudi Arabia
Introduction: Excessive alcohol consumption negatively impacts physical and psychiatric health, lifestyle, and societal interactions. Chronic alcohol abuse alters brain structure, leading to alcohol use disorder (AUD), a condition requiring early diagnosis for effective management. Current diagnostic methods, primarily reliant on subjective questionnaires, could benefit from objective measures.
Method: The study proposes a novel EEG-based classification approach, focusing on effective connectivity (EC) derived from resting-state EEG signals in combination with support vector machine (SVM) algorithms. EC estimation is performed using the partial directed coherence (PDC) technique. The analysis is conducted on an EEG dataset comprising 35 individuals with AUD and 35 healthy controls (HCs). The methodology evaluates the efficacy of connectivity features in distinguishing between AUD and HC and subsequently develops and assesses an EEG classification technique using EC matrices and SVM.
Result: The proposed methodology demonstrated promising performance, achieving a peak accuracy of 94.5% and an area under the curve (AUC) of 0.988, specifically using frequency bands 29, 36, 45, 46, and 52. Additionally, feature reduction techniques applied to the PDC adjacency matrices in the gamma band further improved classification outcomes. The SVM-based classification achieved an accuracy of 96.37 ± 0.45%, showcasing enhanced performance through the utilization of reduced PDC adjacency matrices.
Discussion: These results highlight the potential of the developed algorithm as a robust diagnostic tool for AUD detection, enhancing precision beyond subjective methods. Incorporating EC features derived from EEG signals can inform tailored treatment strategies, contributing to improved management of AUD.
1 Introduction
The economic and social consequences of alcohol use disorder (AUD) is one of the significant issues in today's society. It has a significant impact on the brain functionality, especially with heavy and chronic use. It can impair cognitive skills and alter brain structure by shrinking the white and gray matter of the brain, which leads to balance, speech, and coordination problems (Nunes et al., 2019). The severity of the effects varies depending on the amount and frequency of the consumption and the duration of alcohol use. Furthermore, alterations in particular regions of the brain can impact the likelihood of developing AUD (Lim and et al., 2021). AUD is a neurological condition that hinders the capacity to cease or regulate alcohol intake in a moderate manner (Jarcuskova et al., 2024). Hence, early diagnosis of AUD is one of the critical factors in successful treatment and therapy management of AUD subjects before it prolongs to a critical stage. A healthcare provider or addiction specialist will evaluate the symptoms and provide a proper diagnosis. The diagnosis identifies the signs and symptoms of AUD before it progresses to more severe stages. Early detection and treatment can increase the likelihood of recovery by preventing further the development of AUD (Masroor et al., 2024).
Mental health professionals use the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) to diagnose and categorize psychiatric disorders, including AUD (Källmén et al., 2019). It is divided into sections that list each disorder's diagnostic standards and codes. In order to aid in the diagnostic process, it also contains information on differential diagnosis, cultural considerations, and assessment tools. However, it has also received criticism for its reliance on categorical diagnoses and disregard for contextual and personal factors that could have an impact on mental health. Thus, professionals often use DSM-5 with other assessment tools to provide a comprehensive evaluation of a person's mental health.
In this work, effective connectivity (EC) will be used to classify AUD subjects from healthy controls (HCs). The EC of a brain node, which is derived from a model of neuronal integration that specifies the mechanisms of neuronal coupling, is the influence that a node has over another under a network model of causal dynamics. Numerous research investigations have explored the effects of AUD on neural connectivity, employing neuroimaging methods like electroencephalography (EEG) and functional magnetic resonance imaging (fMRI) (van Oort et al., 2023). The precuneus and lateral parietal regions of the brain, which are involved in the integration of information relating to the perception of the environment, were found to have reduced EC in individuals with AUD, according to a study (Khan et al., 2021a). Another study demonstrated that individuals with AUD exhibit less intense EC in the Default Mode Network (DMN) regions compared to the healthy individuals (Khan et al., 2021c). Low performance on the task and the severity of alcohol consumption were both strongly associated with the decreased connectivity. Hence, analyzing EEG-recorded brain activity patterns can reveal the relationships between different brain regions and their causal effects on other regions. Overall, these studies collectively suggest that EC can be significantly impacted by AUD, which contributes to the behavioral and cognitive deficits associated with the disorder.
Finally, recent advancements in neuroscience research have placed an increasing emphasis on network patterns of brain dynamics (Hou and et al., 2022; Khan et al., 2023; Shen et al., 2024). Hence, EC matrices are used for training and testing convolutional neural network (CNN) for the best classification result. The estimation of EC as a connectivity structure for classification can be done using mathematical techniques such as partial directed coherence (PDC), or dynamic causal modeling (DCM). Then, the 3D EC connectivity structure is transformed into 2D images, which serve as input to the CNN (Khan et al., 2021c).
The role of EC as an input to CNN offers a more precise analysis of the EEG data by distinguishing between direct and indirect connections and identifying the precise direction of information flow. Additionally, research has demonstrated that EC of EEG signals in the frequency bands could serve as a promising biomarker for the diagnosis of AUD (Khan et al., 2021c). Thus, the capabilities of EC in carrying the features of AUD subject and its performance for diagnosis will be investigated. The objectives of this study is to develop a robust and accurate machine learning algorithm for AUD detection using EC matrices derived from resting-state EEG signals.
This paper employs PDC adjacency matrices derived from EEG recordings to classify individuals with AUD and HC. These matrices serve as input for a machine learning model, SVM. The adjacency matrices are created through PDC calculations across 64 frequency bands, evenly divided within the range of 0.1 Hz to 40 Hz.
The paper is structured in the following manner: Section 2 covers related studies on brain connectivity and its application in classifying various neurological conditions. Section 3 focuses on the methods utilized to create and assess an EEG classification algorithm for AUD detection, involving PDC adjacency matrices and SVM. The result and discussion of the research are explained in Section 4. Lastly, Section 5 will conclude this paper.
2 EEG brain connectivity for detection of neurological conditions—its related studies
The activity of brain cells communicating via electrical impulse can be measured and recorded by EEG and is shown on recording in the forms of wavy lines (Khan et al., 2021b; Al-Hiyali et al., 2019). The most common application of an EEG is to identify and evaluate epilepsy (Wang et al., 2020; Wen et al., 2022), a brain disorder that frequently results in recurrent seizures. A lot of past research utilizes EC from EEG recordings in analyzing neurological disorders such as depression, autism, and epilepsy (Khan et al., 2022; Parker and et al., 2018; Rolls and et al., 2018). It shows that EC has a good potential in determining AUD and the methods could be extended by using CNN. EEG can aid in the diagnosis by supplying more details about the many types of epilepsy, its various therapies, and the underlying triggers. Similarly, the recorded EEG data of human brain can also be used to identify other disorders such as the AUD (Anuragi and Singh Sisodia, 2019; Khosla et al., 2020). In order to extract and analyze the desired information from the EEG signal, the analysis of EEG will make use of mathematical signal analysis techniques and computing technologies.
Deep learning techniques have proven beneficial in classifying neurophysiological signals, including EEG, for the diagnosis of AUD. A study applied a deep CNN to EEG signals and calculated the EC within the DMN brain regions, achieving an impressive overall accuracy of 92.7% for AUD classification (Khan et al., 2021c).
Furthermore, the ability of wavelet scattering techniques (WST) based features extracted from multichannel EEG signals were explored for AUD classification using SVM classifier (Buriro and et al., 2021). According to articles, WST-based shows a convincing alternative to CNN. This reflects WST's characteristics, which include high-frequency information durability, susceptibility to signal deformation, and connectivity to the occipital and parietal regions.
Then, in another study for automated AUD identification, SVM with rabial basis function (SVM-RBF) kernel was developed (Anuragi et al., 2020). The FBSE-EWT (Fourier-Bessel series expansion with based empirical wavelet transform) was used to derive entropy values from 112 EEG recordings of healthy and AUD subjects. FBSE-EWT decomposes EEG signals into sub-band signals with 20 significant extracted features. Using “Leave-one-sample-out” cross validation for performance evaluation, it achieves an accuracy of 98.8%.
On the other hand, two deep learning techniques were compared for the classification of AUD EEG signals (Farsi et al., 2021). The experiment consists of 77 subjects diagnosed with AUD and 45 healthy participants. Firstly, the extracted features of EEG using Principal Component Analysis (PCA) based feature extraction are used as input to “long term short memory” (LTSM) and is compared to second methods which the raw EEG data is directly fed to Artificial Neural Network (ANN) as an input for the classification of AUD. The experimental findings show that the second technique produces an average classification accuracy of 93% using a 10-cross validation strategy, whereas the first method achieves an accuracy of 86%. Therefore, deep learning and machine learning approaches have demonstrated potential in effectively categorizing EEG signals for diagnosing AUD. Its robust and data-driven approach enables it to identify subtle differences between EEG recordings of individuals with AUD and HCs, by automatically extracting relevant features and learning from large datasets.
In Dafters et al. (1999) investigate whether there is a correlation between quantitative EEG variables (spectral power and coherence) and cognitive/mood variables, and level of prior use of methylenedioxymethamphetamine (MDMA or Ecstasy). Pearson correlation analyses were employed to investigate the association between various measures and the subjects' consumption of MDMA over the past 12 months. The use of MDMA showed a positive correlation with absolute power in the alpha frequency, where alpha power is known to have an inverse relationship with mental function. Additionally, decreased coherence levels, linked to dysfunctional connectivity, have been previously associated with brain disorders such as dementia.
To investigate whether the neuronal network undergoes dynamic changes before and during the transition to an EEG epileptic discharge, Varotto et al. (2012) estimate EEG connectivity patterns in photosensitive (PS) patients with idiopathic generalized epilepsy. The results revealed that individuals with PS exhibit abnormal EEG hyperconnectivity, particularly affecting the anterior cortical regions. This hyperconnectivity pattern is observed both during resting conditions and intermittent photic stimulation (IPS).
The PDC technique of Baccalá and Sameshima (2001) is a widely used method for the problem. The three subjects of Pascual-Marqui et al. (2014) are: (1) To show that the PDC can misrepresent the frequency response under plausible realistic conditions, thus defeating the main purpose for which the measure was developed; (2) To provide a solution to this problem, namely the isolated effective coherence (iCoh), which consists of estimating the partial coherence under a multivariate autoregressive model, followed by setting all irrelevant associations to zero, other than the particular directional association of interest; and (3) To show that adequate iCoh estimators can be obtained from non-invasively computed cortical signals based on exact low resolution electromagnetic tomography (eLORETA) applied to scalp EEG recordings.
Networks of functional and effective connectivity were described using a spatial filter approach called the dynamic imaging of coherent sources (DICS) followed by the renormalized partial directed coherence (RPDC) (Muthuraman et al., 2015). The suggested approach demonstrates efficacy in generating efficient features for the detection of Motor Imagery (MI) tasks. It holds significant promise for application in Brain-Computer Interface (BCI) systems (Liang et al., 2016).
The aim of Japaridze et al. (2016) was to investigate the neuronal networks underlying background oscillations of epileptic encephalopathy with continuous spikes and waves during slow sleep (CSWS). The examination of information flow within this network indicates that the medial parietal cortex, the precuneus, and the thalamus function as central hubs, directing information flow to other areas, notably the temporal and frontal cortex. The pivotal roles of the precuneus and thalamus in the hierarchical organization of the network associated with the background EEG suggest the importance of vigilance fluctuations in the generation of CSWS. The high temporal resolution of the EEG is utilized to estimate the EC within the DMN (Khan et al., 2021a). The differences in causal effects within the DMN between control subjects and individuals with AUD exhibit a noticeable correlation with symptoms commonly linked to AUD. These symptoms include cognitive and memory impairments, executive control issues, and attention deficiencies (Balconi et al., 2024). In Saeedi et al. (2021) propose an EEG-based deep learning framework that automatically discriminates MDD patients from HCs. The fusion of CNN-LSTM architecture effectively captures both spatial and temporal relations within brain connectivity.
Phang et al. (2019) propose a deep CNN framework for classification of EEG-derived brain connectome in schizophrenia (SZ). The framework combines various connectivity features that improves performances over single-domain CNN. Event-related potentials were collected from 45 SZ patients and 30 HCs during a learning task, and then a combination of PDC effective and phase lag index (PLI) functional connectivity were used as features to train a SVM classifier with leave-one-out cross-validation for classification of SZ from HCs (Zhao et al., 2021). The results suggest that the majority of the chosen EC features are significantly increased in patients with SZ compared to HCs, whereas all selected functional connectivity features demonstrate a decrease in SZ patients.
3 Methodology
This section provides an overview of the methodology used in this work, covering the overall approach to develop the AUD classification using EC of EEG, description of the EEG dataset, the method to determine the EC matrices from 19-channel EEG, performance measures, and lastly the approach of classification using EC and SVM.
3.1 Overall approach
The general approach for development of AUD classification using EC obtained using the PDC is illustrated in Figure 1. Method based on SVM is employed for assessing the AUD classification of EC calculated at 64 bands of frequency. The performance of the classification method is evaluated based on accuracy, sensitivity, specificity, and precision metrics.
Figure 1. Overall methodology for development of AUD classification technique using PDC adjacency matrix (PDCaM). The PDCaM is estimated using electrode Cz as the reference, hence giving 18 × 18 PDCaM.
3.2 EEG signals dataset and preprocessing
The EEG dataset was taken from Universiti Malaya Medical Center (UMMC), Universiti Teknologi PETRONAS (UTP), and Bingkor Clinic in Sabah which was also used in Khan et al. (2021c).
EEG recordings of 35 individuals with AUD and 35 HCs are used in this study. All AUD subjects have met the criteria according to DSM-IV for alcohol addiction. UMMC and Bingkor Clinic, Kota Kinabalu, Sabah, provided the dataset of 15 HC and 35 AUD subjects. The remaining 20 HC data was obtained at UTP.
The EEG data was recorded in the morning under eye-close resting-state conditions for 5 min. Following the placement of 10–20, 19 electrodes were placed on the scalp utilizing two devices, the BrainMaster Discovery 24 EEG with 256 Hz sampling rate and amplitude in microvolts, and the Enobio system with 500 Hz sampling rate and amplitude in nanovolts.
Furthermore, pre-processing steps are crucial for the analysis of EEG data to obtain accurate and reliable results. Initially, to normalize the data, EEG recordings from the Enobio system were down sampled to 256 Hz and the amplitude was converted to microvolts. Then, the normalized signals were filtered using bandpass filter (0.1–70 Hz) and notch filter (50 Hz). Artifacts removals caused by sensory motions are then applied using EEGLAB software to reduce noise and enhance the underlying brain activity. The extraction of the PDC adjacency matrices is performed on every 2-sec duration of the pre-processed EEG as illustrated in Figure 1. Figure 2 provides information on the number of PDC adjacency matrices for each AUD/HC subject.
Figure 2. Number of PDC adjacency matrix for 35 AUD and 35 HC subjects, giving a total of 4,380 × 324 and 4,827 × 324 feature vectors for AUD and HC, respectively.
3.3 Partial directed coherence adjacency matrix
PDC is a frequency domain EC technique which is based on the multivariate autoregressive model (MVAR) modeling and partial coherence (Astolfi et al., 2006). For S simultaneous recording of 19-channel EEG signals, the time-series representation of the signals is
The corresponding autoregressive model of order ρ is
Here, is zero-mean multivariate Gaussian white process and Cζ is a S×S coefficient matrix, evaluate at every lag, ζ, given by
If C(ω) is the frequency domain equivalent of coefficient matrix Cζ, then the Fourier transform of cvw between 2 electrodes, v and w can be obtained as,
The 19 × 19 PDCaM, from electrode w to electrode v, is
where denotes the vw-th elements of the matrix, , and (.)H represents the conjugate transpose. The MATLAB coding for the calculation of PDC adjacency matrix is available at Omidvarnia (2020).
The adjacency matrix is obtained from EC of 19-channel EEG where it encodes the connectivity of nodes. EC plays an important role as it represents the causal influence of the brain regions. This is because EC is utilized to determine the directional interdependencies of EEG data and create an asymmetric adjacency matrix, which provides a more accurate representation of the interconnections between EEG channels. Using Cz as the reference node, PDC was used to compute the PDCaM at all 64 frequency bands producing 18 × 18 dimensions of adjacency matrix from the continuous 2-sec segments of the pre-processed EEG data. Each of the 64-band frequencies has a bandwidth equal to (70–0.1) Hz/64 = 1.092 Hz. Thus, 18 × 18 dimensions represent the pairwise connectivity between 18 channels, excluding one reference channel.
For visualization, the plot of the 324-connectivity values are provided in Figure 3. These values obtained using Equation 5, are normalized to be between 0 and 1. For more meaningful visualization, 2D plots of the PDCaM for AUD and HC are prepared for two selected frequency bands, as shown in Figure 4.
Figure 3. Plot of 324 PDCaM values for 2-second EEG signals from one AUD subject and one HC subject.
Figure 4. Image colorplot of 150-sample × 324-length feature vector of PDCaM for one AUD and one HC subject at 2 freqeuncy bands, 36 and 46.
In Figure 4, the PDCaM of 150 samples × 324-length feature vectors is plot for one AUD subject and HC subject and plot using a “colormap” to gain a better visualization of the data. The image are generated for 2 frequency bands, 36 and 46. As observed, the series of PDCaM values of the AUD subject has deeper blue and yellow colors than the more uniform pattern for HC subject, indicating potential differences in connectivity patterns. While the “colormap” visualization of the PDCaM may provide some initial insights, a comprehensive analysis of the EEG signals is necessary to accurately classify the subjects and draw meaningful conclusions.
3.4 Performance evaluation of the classification model
Using a confusion matrix, the performance quality will be assessed depending on how well the classification model can distinguish between subjects with AUD and HC. Illustration of the confusion matrix, where AUD is the positive class is given in Figure 5. As a measure of its classification performance, the model's output will provide the number of true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN). Accuracy, precision, sensitivity, and specificity performance matrices for the classification model will also be computed using the confusion matrix.
3.5 AUD vs. HC classification of 64-band PDCaM using SVM
The evaluation of the PDCaM in discriminating between AUD and HC is performed using SVM technique (Kaul and Raina, 2022; Roy and Chakraborty, 2023). The experiment is conducted for EC feature vectors generated at 64 frequency bands. Notably, each band is a bandwidth of (70–0.1) Hz/64 = 1.092 Hz. Figure 6 illustrates the method for evaluation of EC at all 64 frequency bands. This analysis serves as a supplementary investigation to gain additional insights on the PDCaM for classification of AUD vs. HC. Subsequently, the matrices for each frequency bands across all subjects were combined, resulting in a final matrix with dimensions equal to 4, 380 × 324 and 4, 827 × 324 feature vectors for AUD and HC, respectively.
SVM is chosen for its effectiveness in discerning different patterns or categories in data, as it excels in maximizing the margin between classes, thereby enhancing its ability to generalize well to unknown data. Hence, SVM is frequently used in several fields, such as text recognition (Kaur and Singh, 2023), natural language processing (Hussain et al., 2023), and medical signal or image classification (Asri et al., 2020; Verma and Sharan, 2023; Ali et al., 2022). Here, the SVM parameters are obtained via Bayesian optimization to ensure the best SVM is utilized in classification of the EC feature vector. This process aimed to identify the most suitable model parameters, including the kernel function, scale, and box constraints, for training classifiers for all 64 frequency bands.
Evaluation across the 64 frequency bands aimed to explore the distinctiveness of the AUD and HC data across the different frequency bands and determine the extent to which it contributes to classification accuracy. The performance of the SVM model was evaluated based on the obtained accuracy values, providing insights into the effectiveness of using the 64 frequency bands for AUD classification.
4 Result and discussion
The result of this study is presented as follows. Firstly, Section 4.1 visualizes the Partial-Directed Coherence matrices of one subject with AUD and one HC subject, across 4 different frequency bands. The initial exploration involves employing SVM in Section 4.2 to evaluate the discriminative capacity of the 19-channel PDCaM through classification based on 64 frequency bands. Then, the performance of the top-5 accuracy from the 64 bands is further evaluated, using confusion matrix and ROC plots. Lastly, the classification using brain rhythm is investigated and compared with a previous study.
4.1 Scatter plot of PDC adjacency matrix feature vectors
Firstly, to assess the distinguishability of AUD and HC using PDCaM data, a 3D scatter plot is employed. Figure 7 displays a 3D scatter plot generated using t-distributed Stochastic Neighbor Embedding (t-SNE) to visualize the feature vectors distribution of AUD and HC subjects. The plots are shown at 4 different frequency bands out of the 64 bands, showing the first three main features extracted via the t-SNE technique.
Figure 7. 3D visualization of AUD and HC PDCaM feature vectors using t-SNE technique, selected for 4 frequency bands; 16, 32, 48, and 64.
T-SNE is a technique employed for dimensionality reduction, facilitating the visualization of high-dimensional data in a lower-dimensional space, commonly 2D or 3D. It proves advantageous in capturing intricate patterns within complex datasets, as it conserves the local structure and relationships among data points. From Figure 7, the scatter plot illustrates the first three main features extracted via the t-SNE technique. It is evident that the two classes of AUD and HC are distinguishable based on the cluster plot.
To further explore this analysis, a machine learning algorithm called SVM is employed to assess the accuracy of AUD detection for each frequency band. SVM provides valuable insights into the classification performance and helps understand how well the algorithm can distinguish between the two classes based on frequency-band PDCaM data.
4.2 Classification of AUD vs. HC PDC adjacency matrix using SVM
This section outline the evaluation of PDCaM in EEG recordings in the following manner: initially, the classification is conducted across all 64 frequency bands; subsequently, a thorough assessment is performed focusing on the top 5 frequency bands, and finally, an evaluation is carried out based on 5 distinct brain rhythms.
4.2.1 Classification of AUD vs. HC using 64-band PDC adjacency matrix
Here, the PDCaM will be examined based on different frequency bands to explore its potential in the classification of AUD vs. HC across different frequency ranges. This analysis will provide valuable insights into the effectiveness of using frequency-specific PDCaM data for distinguishing between AUD and HC.
As presented in Figure 8, the accuracy of classification using SVM remains consistent across the 64 frequency bands, exhibiting an average accuracy of 92.3%. Additionally, Figure 8 shows the best accuracy of 94.6% using band 36 and the other 4 best accuracy of 94.5% are from band 29, 45, 46, and 52. These findings suggest that the SVM algorithm performs reliably in classifying AUD across various frequency bands, with the accuracy remaining stable.
Figure 8. Classification accuracy of SVM across all 64 frequency bands taking PDC adjacency matrix as its input. The average value of the accuracy is 92.3%.
4.2.2 Evaluation of the top-5 frequency bands
For better insight on the performance of the top-5 frequency band as determined from the previous section, the PDCaM is further trained and tested on SVM to generate the confusion matrix and the ROC, as shown in Figures 9, 10. The hyperparameters of the SVM are obtained via Bayesian optimization. Notably, the SVM for the 5 bands utilized Gaussian as its kernel except for band 46 which is polynomial type.
Figure 9. Confusion matrix of test PDCaM for the top-5 frequency bands; 29, 36, 45, 46, and 52. The average accuracy for these bands is 94.5%.
Figure 10. ROC plot for AUD indicating the SVM's optimal operating point, TPR and AUC top-5 frequency bands; 29, 36, 45, 46, and 52. The inset table lists the TPR and AUC values for the 5 bands.
The plot of the confusion matrix as depicted in Figure 9 provides a comprehensive assessment of the classification results by illustrating the number of correctly classified instances and the misclassifications for each class. The results indicate that the trained SVM model, utilizing PDCaM as its input, demonstrates a high level of accuracy in classifying AUD. The model achieves an impressive accuracy rate of 94.5%, highlighting its effectiveness in accurately distinguishing between individuals with AUD and HCs. Notably, the average AUC of all 5 frequency bands observed in Figure 10 is >0.98 for both classes indicating that the developed model exhibits strong performance in distinguishing between HC and AUD.
Moreover, the ROC plots in Figure 10 show the optimal operating points for the trained SVM, generated for AUD as the positive class. For AUD classification, the model's optimal operating point lies at true positive rate (TPR) values between 0.923 to 0.935. The obtained results demonstrate that the classifier has achieved an impressive TPR, accurately identifying a significant number of positive samples representing instances belonging to the AUD. The FPR is alternatively known as 1-specificity and signifies the proportion of negative samples that are incorrectly classified as positive. Additionally, as FPR values are significantly low, it indicates that the classifier performs exceptionally well by not misclassifying any negative samples as positive, thus demonstrating its outstanding performance. These findings suggest that the classifier effectively distinguishes between positive and negative samples and possesses a high likelihood of correctly identifying positive cases.
4.2.3 Classification of PDC adjacency matrix across 5 brain rhythms
Lastly, the accuracy of AUD detection is further analyzed in terms of 5 different brain rhythms. Table 1 presents the classification accuracy achieved using PDCaM across various brain rhythms, alongside their corresponding frequency ranges and frequency bands. The table further compares these accuracy values with those obtained from a previous study that employed the same dataset (Khan et al., 2021c). However, the previous study employed a 3D CNN instead of the SVM that is used in this study.
Table 1. Comparison of accuracy across brain rhythms: current study vs. previous study (Khan et al., 2021c).
The results in Table 1 demonstrate that all brain rhythms of the current study exhibit high accuracy in classifying AUD. The gamma band provides the highest accuracy, followed by alpha and theta. However, the lower frequency bands, delta and theta, exhibit lower accuracy. This observation could be attributed to various factors, such as the complexity of AUD-related brain activities in those frequency ranges. Further investigations are required to gain deeper insights into the reasons behind this phenomenon.
On the other hand, the accuracy of brain rhythms in the previous study is lower than the accuracy achieved in the current study. Although the highest brain rhythm, beta, had the highest accuracy in the previous study, its performance is surpassed by all brain rhythms in the current study. The comparison enables observation of performance differences between the machine learning and deep learning architectures in classifying AUD based on PDCaM for different brain rhythms. These findings highlight the effectiveness and potential advantages of the developed algorithm for AUD detection using PDCaM, particularly in frequency-based classification with SVM.
4.3 Classification of reduced PDC adjacency matrix across 5 brain rhythms
It is noteworthy to emphasize that employing a greater number of feature vectors, such as in the context of classification with the five brain rhythms, results in improved accuracy. However, this comes at the cost of a significant increase in the length of the feature vector. Specifically, for the beta and gamma bands, the feature vector length expands to 5,509. Training the SVM on such a large feature vector will require large memory and computational power. In an effort to reduce the feature vector of PDCaM, a statistically significant test using the ANOVA technique is employed. The plot of AUD vs. HC p-values for all 18 × 18 connectivites are given in Figure 11.
Figure 11. Boxplot of p-values evaluated for all 18 × 18 connectivities of the PDCaM. The yellow boxes highlight the connectivies with p-value < 0.01, i.e., 99.9% significance in discriminating between AUD vs. HC. The number of connectivities with p-value < 0.01 (yellow boxes) is 147. The electrodes are labeled numerically from 1 to 18, corresponding to the following electrode positions: FP1, FP2, F3, F4, C3, C4, P3, P4, O1, O2, F7, F8, T3, T4, T5, T6, FZ, and PZ, respectively.
For further analysis of Figures 11, 12 has been generated to identify significant connections and their relationship to brain functions. Consistent with a prior research (Khan et al., 2021c), four electrodes–C3, C4, P3, and P4–exhibit significant connectivity with at least 13 other brain nodes. This finding aligns with the known impact of alcohol on the brain's communication pathways, which disrupt the brain's functionality. The electrodes C3 and C4, positioned over the left and right central areas of the brain, respectively, capture activity from the primary motor cortex and the somatosensory cortex. These regions play a critical role in motor planning, execution, and sensory processing. Similarly, electrodes P3 and P4 are associated with cognitive processing, integral to the processing of vestibular and proprioceptive information. The significant connectivity of these electrodes underscores the widespread functional disruptions caused by alcohol, particularly in regions essential for motor and cognitive integration.
Figure 12. Significant PDCaM connections (p-value < 0.01) among all 18 electrodes, extracted from Figure 11, for further analysis of affected brain functions in AUD subjects. The arrow denotes the direction of connectivity.
We further evaluate the classification accuracy as we reduced the feature according to the p-value and the results are tabulated and plot in Figure 13. Based on the table shown in Figure 13, clearly utilization of full 324 connectivities has resulted in the best accuracy across all 5 brain rhythms but not in the case of beta and gamma. Notably, at a p-value of 0.04, which corresponds to a 96% confidence level, the accuracies for beta and gamma exceed those achieved with the full-length feature set. This trend is anticipated since the best accuracies with the full feature set are 95.93% and 96.27% for beta and gamma, respectively. If the investigation into feature reduction continues for p-values greater than 0.04, we may expect better accuracies than those of the full feature set for delta, theta, and alpha to occur at p-values of 0.09, 0.08, and 0.07, respectively. However, selecting higher p-values is not desirable as it increases the feature vector length and may degrade the performance of the classifier.
Figure 13. Tabulated classification accuracy evaluated for reduced feature vectors with p-value between 0.01 to 0.04 (top) and its corresponding barplot (bottom). The length of feature vector for the original PDCaM is 324 and this value is reduced to between 147 to 210 as the p-value is increased from 0.01 to 0.04. The number in bracket in the 1st column of the table (left) indicate the number of frequency bands as per listed in Table 1. The length of the feature vector for each cases is Nfb×lf.
Based on the feature reduction experiment, gamma demonstrated a significant advantage by achieving the highest accuracy of 96.37% while using only 64.8% (210 features) of the full 324-length feature set. This accuracy is slightly higher compared to its performance with the full feature length.
5 Conclusions
In conclusion, this research paper presents a new approach for AUD detection by utilizing PDCaM of resting-state EEG signals and SVM classifier. The investigation is rooted in the understanding that abnormal brain patterns are linked to neurological disorders, making PDCaM a promising means to represent brain networks in various subjects. The derived EC is based on the PDC from EEG recordings of 19-channel EEG recording, producing 18 × 18 PDCaM, since Cz is used as the reference node. Then, the proposed EEG classification algorithm incorporates PDCaM and SVM to accurately classify AUD subjects from HCs. By analyzing the features of EEG signals from the AUD dataset, the algorithm's development for AUD detection is systematically achieved. Notably, the PDCaM is transformed into a row vector of length 324 features to facilitate the training and testing of the SVM model.
The AUD classification using PDCaM and SVM is tested across 64 frequency bands achieving an impressive accuracy of 94.6% and a high AUC of >0.98. This shows that the proposed PDCaM+SVM classification model has a high TPR while maintaining a low FPR. In other words, the model is effective in correctly identifying positive cases while minimizing the misclassification of negative cases.
Furthermore, a brain rhythm-based AUD detection is conducted across 5 different bands using SVM, revealing a high accuracy between 91% to 96.27%. Notably, the gamma bands exhibited the highest accuracy among the analyzed brain rhythms.
The proposed PDCaM-based approach for detecting AUD using resting-state EEG signals and SVM classifiers demonstrates significant potential for clinical application. Achieving high accuracy (94.6%) and AUC (>0.98), the model shows promise as a reliable, non-invasive diagnostic tool for distinguishing AUD subjects from healthy controls. Its ability to analyze specific brain rhythms, particularly the gamma band, highlights its potential for both early detection and monitoring of AUD-related neurophysiological abnormalities.
For real-world implementation, several challenges must be addressed. Scaling the methodology to larger and more diverse populations is crucial to validate its robustness and generalizability. Additionally, computational demands arising from high-dimensional PDCaM feature vectors require optimization through efficient algorithms and feature selection to enable deployment on portable or edge devices. Automating the pipeline from EEG preprocessing to classification is another critical step to facilitate its integration into clinical workflows.
Broader implications of this study include its applicability to monitoring treatment efficacy and its potential extension to other neurological or psychiatric disorders with similar EEG abnormalities. By addressing scalability, computational efficiency, and automation, the findings of this research pave the way for impactful applications in diagnosing and managing AUD. This could ultimately improve patient outcomes through targeted interventions and personalized care strategies.
Data availability statement
The data analyzed in this study is subject to the following licenses/restrictions: the data used in this study has been acquired from CISIR's data repository. It will be made available for use upon reasonable request from the corresponding author after signing a formal data sharing and usage agreement. Requests to access these datasets should be directed to Norashikin Yahya, bm9yYXNoaWtpbl95YWh5YUB1dHAuZWR1Lm15.
Ethics statement
The studies involving humans were approved by Ethics Committee of University Malaya, Malaysia. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
AM: Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing. NY: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. DK: Conceptualization, Data curation, Methodology, Software, Validation, Writing – original draft, Writing – review & editing, Formal analysis, Investigation. NM: Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing. NB: Formal analysis, Project administration, Resources, Supervision, Validation, Visualization, Writing – review & editing. AA: Formal analysis, Methodology, Project administration, Resources, Validation, Visualization, Writing – review & editing. MA: Funding acquisition, Project administration, Resources, Validation, Visualization, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This project was funded by the Deanship of Scientific Research (DSR) at King Abdulaziz University, Jeddah, Saudi Arabia, under grant no. (GPIP-783-135-2024).
Acknowledgments
The authors gratefully acknowledge the DSR for its technical and financial support.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Gen AI was used in the creation of this manuscript.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Al-Hiyali, M. I., Ishak, A. J., Harun, H. R., Ahmad, S. A., and Sulaiman, W. A. W. (2019). Examination of prefrontal cortex activity after EEG-neurofeedback stimulation in overweight cases. Int. J. Integr. Eng. 11:4536.
Ali, I., Mughal, N., Khand, Z. H., Ahmed, J., and Mujtaba, G. (2022). Resume classification system using natural language processing and machine learning techniques. Mehran Univ. Res. J. Eng. Technol. 41, 65–79. doi: 10.22581/muet1982.2201.07
Anuragi, A., and Singh Sisodia, D. (2019). Alcohol use disorder detection using EEG signal features and flexible analytical wavelet transform. Biomed. Signal Process. Control 52, 384–393. doi: 10.1016/j.bspc.2018.10.017
Anuragi, A., Sisodia, D. S., and Pachori, R. B. (2020). Automated alcoholism detection using fourier-bessel series expansion based empirical wavelet transform. IEEE Sens. J. 20, 4914–4924. doi: 10.1109/JSEN.2020.2966766
Asri, M. F. M., Yahya, N., and Elamvazuthi, I. (2020). Automatic arrhythmia detection algorithm using statistical and autoregressive model features. Platform J. Eng. 4, 41–49. doi: 10.61762/pajevol4iss1art5898
Astolfi, L., Cincotti, F., Mattia, D., Marciani, M., Baccala, L., Fallani, F., et al. (2006). Assessing cortical functional connectivity by partial directed coherence: simulations and application to real data. IEEE Trans. Biomed. Eng. 53, 1802–1812. doi: 10.1109/TBME.2006.873692
Baccalá, L. A., and Sameshima, K. (2001). Partial directed coherence: a new concept in neural structure determination. Biol. Cybern. 84, 463–474. doi: 10.1007/PL00007990
Balconi, M., Angioletti, L., and Crivelli, D. (2024). Integrating EEG biomarkers in the neurocognitive screening of executive functions in substance and behavioral addiction. Front. Psychiatry 15:1472565. doi: 10.3389/fpsyt.2024.1472565
Buriro, A. B., Ahmed, B., Baloch, G., Ahmed, J., Shoorangiz, R., Weddell, S. J., et al. (2021). Classification of alcoholic EEG signals using wavelet scattering transform-based features. Comput. Biol. Med. 139:104969. doi: 10.1016/j.compbiomed.2021.104969
Dafters, R., Duffy, F., O'Donnell, P., and Bouquet, C. (1999). Level of use of 3,4-methylenedioxymethamphetamine (MDMA or ecstasy) in humans correlates with EEG power and coherence. Psychopharmacology 145, 82–90. doi: 10.1007/s002130051035
Farsi, L., Siuly, S., Kabir, E., and Wang, H. (2021). Classification of alcoholic EEG signals using a deep learning method. IEEE Sens. J. 21, 3552–3560. doi: 10.1109/JSEN.2020.3026830
Hou, Y., Jia, S., Lun, X., Hao, Z., Shi, Y., Li, Y., et al. (2022). GCNs-net: a graph convolutional neural network approach for decoding time-resolved EEG motor imagery signals. IEEE Trans. Neural Netw. Learn. Syst. 35, 7312–7323. doi: 10.1109/TNNLS.2022.3202569
Hussain, F., Tbarki, K., and Ksantini, R. (2023). GAN-based one-class classification SVM for real-time medical image intrusion detection. Int. J. Comput. Digit. Syst. 13, 625–641. doi: 10.12785/ijcds/130150
Japaridze, N., Muthuraman, M., Dierck, C., von Spiczak, S., Boor, R., Mideksa, K. G., et al. (2016). Neuronal networks in epileptic encephalopathies with csws. Epilepsia 57, 1245–1255. doi: 10.1111/epi.13428
Jarcuskova, D., Pallayova, M., Carnakovic, S., Frajka, M., Fidmik, J., and Bednarova, A. (2024). Clinical characteristics of adults with alcohol dependence syndrome comorbid with antisocial personality disorder: a cross-sectional study. Front. Psychiatry 15:1397009. doi: 10.3389/fpsyt.2024.1397009
Källmén, H., Elgán, T. H., Wennberg, P., and Berman, A. H. (2019). Concurrent validity of the alcohol use disorders identification test (AUDIT) in relation to alcohol use disorder (AUD) severity levels according to the brief dsm-5 AUD diagnostic assessment screener. Nord. J. Psychiatry 73, 397–400. doi: 10.1080/08039488.2019.1642382
Kaul, A., and Raina, S. (2022). Support vector machine versus convolutional neural network for hyperspectral image classification: a systematic review. Concurr. Comput. 34:e6945. doi: 10.1002/cpe.6945
Kaur, P., and Singh, R. K. (2023). A review on optimization techniques for medical image analysis. Concurr. Comput. 35:e7443. doi: 10.1002/cpe.7443
Khan, D. M., Kamel, N., Muzaimi, M., and Hill, T. (2021a). Effective connectivity for default mode network analysis of alcoholism. Brain Connect. 11, 12–29. doi: 10.1089/brain.2019.0721
Khan, D. M., Masroor, K., Jailani, M. F. M., Yahya, N., Yusoff, M. Z., and Khan, S. M. (2022). Development of wavelet coherence EEG as a biomarker for diagnosis of major depressive disorder. IEEE Sens. J. 22, 4315–4325. doi: 10.1109/JSEN.2022.3143176
Khan, D. M., Yahya, N., Kamel, N., and Faye, I. (2021b). Automated diagnosis of major depressive disorder using brain effective connectivity and 3D convolutional neural network. IEEE Access 9, 8835–8846. doi: 10.1109/ACCESS.2021.3049427
Khan, D. M., Yahya, N., Kamel, N., and Faye, I. (2021c). Effective connectivity in default mode network for alcoholism diagnosis. IEEE Trans. Neural Syst. Rehabil. Eng. 29, 796–808. doi: 10.1109/TNSRE.2021.3075737
Khan, D. M., Yahya, N., Kamel, N., and Faye, I. (2023). A novel method for efficient estimation of brain effective connectivity in EEG. Comput. Methods Progr. Biomed. 228:107242. doi: 10.1016/j.cmpb.2022.107242
Khosla, A., Khandnor, P., and Chand, T. (2020). A comparative analysis of signal processing and classification methods for different applications based on EEG signals. Biocyber. Biomed. Eng. 40, 649–690. doi: 10.1016/j.bbe.2020.02.002
Liang, S., Choi, K.-S., Qin, J., Wang, Q., Pang, W.-M., and Heng, P.-A. (2016). Discrimination of motor imagery tasks via information flow pattern of brain connectivity. Technol. Health Care 24, S795–S801. doi: 10.3233/THC-161212
Lim, Y., Beane-Ebel, J. E., Tanaka, Y., Ning, B., Husted, C. R., Henderson, D. C., et al. (2021). Exploration of alcohol use disorder-associated brain miRNA-mRNA regulatory networks. Transl. Psychiatry 11:504. doi: 10.1038/s41398-021-01635-w
Masroor, K., Khan, D. M., Khan, H. R., Yahya, N., Arshad, K., and Assaleh, K. (2024). Deciphering default mode network dynamics in alcohol use disorder through directed transfer function analysis of EEG. IEEE Access 12, 192090–192100. doi: 10.1109/ACCESS.2024.3518916
Muthuraman, M., Moliadze, V., Mideksa, K. G., Anwar, A. R., Stephani, U., Deuschl, G., et al. (2015). EEG-MEG integration enhances the characterization of functional and effective connectivity in the resting state network. PLoS ONE 10:e0140832. doi: 10.1371/journal.pone.0140832
Nunes, P. T., Kipp, B. T., Reitz, N. L., and Savage, L. M. (2019). Aging with alcohol-related brain damage: critical brain circuits associated with cognitive dysfunction. Int. Rev. Neurobiol. 148, 101–168. doi: 10.1016/bs.irn.2019.09.002
Omidvarnia, A. (2020). Time-varying EEG connectivity: a time-frequency approach. Available at: https://www.mathworks.com/matlabcentral/fileexchange/33721-time-varying%-EEGconnectivity-a-time-frequency-approach (accessed May 8, 2023).
Parker, C. S., Clayden, J. D., Cardoso, M. J., Rodionov, R., Duncan, J. S., Scott, C., et al. (2018). Structural and effective connectivity in focal epilepsy. Neuroimage Clin. 17, 943–952. doi: 10.1016/j.nicl.2017.12.020
Pascual-Marqui, R. D., Biscay, R. J., Bosch-Bayard, J., Lehmann, D., Kochi, K., Kinoshita, T., et al. (2014). Assessing direct paths of intracortical causal information flow of oscillatory activity with the isolated effective coherence (icoh). Front. Hum. Neurosci. 8:448. doi: 10.3389/fnhum.2014.00448
Phang, C.-R., Ting, C.-M., Noman, F., and Ombao, H. (2019). Classification of EEG-based brain connectivity networks in schizophrenia using a multi-domain connectome convolutional neural network. arXiv preprint arXiv:1903.08858. doi: 10.48550/arXiv.1903.08858
Rolls, E. T., Cheng, W., Gilson, M., Qiu, J., Hu, Z., Ruan, H., et al. (2018). Effective connectivity in depression. Biol. Psychiatry Cogn. Neurosci. Neuroimag. 3, 187–197. doi: 10.1016/j.bpsc.2017.10.004
Roy, A., and Chakraborty, S. (2023). Support vector machine in structural reliability analysis: a review. Reliab. Eng. Syst. Saf. 183:109126. doi: 10.1016/j.ress.2023.109126
Saeedi, A., Saeedi, M., Maghsoudi, A., and Shalbaf, A. (2021). Major depressive disorder diagnosis based on effective connectivity in EEG signals: A convolutional neural network and long short-term memory approach. Cogn. Neurodyn. 15, 239–252. doi: 10.1007/s11571-020-09619-0
Shen, G., Chen, Y.-H., Wu, Y., Jiahui, H., Fang, J., Jiayi, T., et al. (2024). Exploring core symptoms of alcohol withdrawal syndrome in alcohol use disorder patients: a network analysis approach. Front. Psychiatry 15:1320248. doi: 10.3389/fpsyt.2024.1320248
van Oort, J., Diazgranados, N., George, D. T., Horneffer, Y., Schwandt, M., Goldman, D., et al. (2023). Preliminary evidence for changes in frontoparietal network connectivity in the early abstinence period in alcohol use disorder: a longitudinal resting-state functional magnetic resonance imaging study. Front. Psychiatry 14:1185770. doi: 10.3389/fpsyt.2023.1185770
Varotto, G., Visani, E., Canafoglia, L., Franceschetti, S., Avanzini, G., and Panzica, F. (2012). Enhanced frontocentral EEG connectivity in photosensitive generalized epilepsies: a partial directed coherence study. Epilepsia 53, 359–367. doi: 10.1111/j.1528-1167.2011.03352.x
Verma, S., and Sharan, A. (2023). “Incorporating semantics for text classification in biomedical domain,” in Proceedings of the International Health Informatics Conference: IHIC 2022 (Springer), 185–197. doi: 10.1007/978-981-19-9090-8_17
Wang, S., Wang, S., Zhang, S., and Wang, Y. (2020). Time-resnext for epilepsy recognition based on EEG signals in wireless networks. J. Wirel. Commun. Netw. 2020:195. doi: 10.1186/s13638-020-01810-5
Wen, Y., Zhang, Y., Wen, L., Cao, H., Ai, G., Gu, M., et al. (2022). A 65nm/0.448mw EEG processor with parallel architecture SVM and lifting wavelet transform for high-performance and low-power epilepsy detection. Comput. Biol. Med. 144:105366. doi: 10.1016/j.compbiomed.2022.105366
Keywords: alcohol use disorder (AUD), effective connectivity, electroencephalography (EEG), partial directed coherence (PDC), support vector machine (SVM)
Citation: Mohd Nazri AK, Yahya N, Khan DM, Mohd Radzi NZ, Badruddin N, Abdul Latiff AH and Abdulaal MJ (2025) Partial directed coherence analysis of resting-state EEG signals for alcohol use disorder detection using machine learning. Front. Neurosci. 18:1524513. doi: 10.3389/fnins.2024.1524513
Received: 08 November 2024; Accepted: 23 December 2024;
Published: 10 January 2025.
Edited by:
Furqan Zahoor, King Faisal University, Saudi ArabiaReviewed by:
Muhammad Danish Mujib, NED University of Engineering and Technology, PakistanSaif Ur Rehman Khan, Central South University, China
Nazabat Hussain, University of South-Eastern Norway, Norway
Copyright © 2025 Mohd Nazri, Yahya, Khan, Mohd Radzi, Badruddin, Abdul Latiff and Abdulaal. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Norashikin Yahya, bm9yYXNoaWtpbl95YWh5YUB1dHAuZWR1Lm15