Ensemble graph neural network model for classification of major depressive disorder using whole-brain functional connectivity

Venkatapathy, Sujitha; Votinov, Mikhail; Wagels, Lisa; Kim, Sangyun; Lee, Munseob; Habel, Ute; Ra, In-Ho; Jo, Han-Gue

doi:10.3389/fpsyt.2023.1125339

ORIGINAL RESEARCH article

Front. Psychiatry, 23 March 2023

Sec. Computational Psychiatry

Volume 14 - 2023 | https://doi.org/10.3389/fpsyt.2023.1125339

This article is part of the Research TopicClinical Application of Machine Learning Methods in Psychiatric DisordersView all 12 articles

Ensemble graph neural network model for classification of major depressive disorder using whole-brain functional connectivity

Sujitha Venkatapathy¹

Mikhail Votinov^2,3

Lisa Wagels^2,3

Sangyun Kim⁴

Munseob Lee⁴

Ute Habel^2,3

In-Ho Ra¹

Han-Gue Jo¹^*

¹School of Computer Information and Communication Engineering, Kunsan National University, Gunsan, Republic of Korea
²Department of Psychiatry, Psychotherapy and Psychosomatics, Medical Faculty, Uniklinik RWTH Aachen University, Aachen, Germany
³Research Center Juelich, Institute of Neuroscience and Medicine: JARA-Institute Brain Structure Function Relationship (INM 10), Juelich, Republic of Korea
⁴AI Convergence Research Section, Electronics and Telecommunications Research Institute, Gwangju, Republic of Korea

Major depressive disorder (MDD) is characterized by impairments in mood and cognitive functioning, and it is a prominent source of global disability and stress. A functional magnetic resonance imaging (fMRI) can aid clinicians in their assessments of individuals for the identification of MDD. Herein, we employ a deep learning approach to the issue of MDD classification. Resting-state fMRI data from 821 individuals with MDD and 765 healthy controls (HCs) is employed for investigation. An ensemble model based on graph neural network (GNN) has been created with the goal of identifying patients with MDD among HCs as well as differentiation between first-episode and recurrent MDDs. The graph convolutional network (GCN), graph attention network (GAT), and GraphSAGE models serve as a base models for the ensemble model that was developed with individual whole-brain functional networks. The ensemble's performance is evaluated using upsampling and downsampling, along with 10-fold cross-validation. The ensemble model achieved an upsampling accuracy of 71.18% and a downsampling accuracy of 70.24% for MDD and HC classification. While comparing first-episode patients with recurrent patients, the upsampling accuracy is 77.78% and the downsampling accuracy is 71.96%. According to the findings of this study, the proposed GNN-based ensemble model achieves a higher level of accuracy and suggests that our model produces can assist healthcare professionals in identifying MDD.

1. Introduction

Depression is a major source of disability and disease burden worldwide, affecting about 264 million people. Major depressive disorder (MDD) is a serious psychological problem that can cause people to feel sad, lose interest, become listless, and have trouble thinking (1). Individuals who have suffered from MDD typically have difficulty adjusting together with their society. They have a low opinion of themselves, which ultimately leads to a decline in their performance at work. MDD can cause serious emotional problems and suicidal thoughts and behavior if it is not adequately recognized and treated (2). In patients suffering from MDD, abnormalities in large-scale brain connections have been identified more frequently in recent years. Depressed people revealed significantly disrupted connections between the task-related regions of the brain throughout a variety of task-directed functions, such as working memory, executive control, facial emotion perception, and impulse control (3, 4).

In recent years, research on MDD has focused on brain structure and function using morphological or neurobiological features. Functional magnetic resonance imaging (fMRI), magnetoencephalography (MEG), electroencephalography (EEG), and positron emission tomography (PET) are the common physiological methods employed in comparing people with MDD to healthy controls (HCs) (5). Researchers have found that patients with MDD have abnormal communication among the functional brain networks using functional connectivity (FC) of resting-state fMRI (R-fMRI), which detects synchronized and desynchronized spontaneous activity within anatomically diverse networks (6–8). In the present study, whole brain FC is extracted from R-fMRI data in order to determine whether or not the subject is MDD, and for classification between first-episode and recurrent MDD.

Machine learning (ML) methods are of increasing interest for the medical industry at present, and it has emerged as an essential part of the diagnosis and treatment of conditions pertaining to oncology, neurology, and cardiology. The process involved in a typical deep learning pipeline for the identification of MDD can be highlighted as follows: region of interest (ROI) extraction from R-fMRI, functional connectivity matrix generation, graph construction, deep learning model training, and classification (9). Figure 1 depicts the processes required in identifying MDD from HCs.

FIGURE 1

Figure 1. The process of MDD and HC classif ication: 1. Recording R-fMRI signal, 2. Extracting time series of regions of interest, 3. Calculating correlation matrices of the time series, 4. Converting correlations in graph representations, 5. Applying deep learning to classify between patients and controls.

This study presents a high-performance graph neural network (GNN)-based deep learning method for classifying individuals with MDD using R-fMRI data. In recent years, graph neural network (GNN) has become increasingly popular in graph-based learning. GNN become the optimal deep learning approach for analyzing graph-structured information. GNN algorithm combines node attributes, edge attributes, and graph topology by embedding node characteristics in a neural network and transferring data through the graph's edges. GNN can work well on non-euclidean domains and this is in contrast to traditional convolutional neural networks, which are limited to accepting only euclidean inputs (10). GNN has replaced older ML approaches due to their greater performance in analyzing graph-based information (11, 12).

Studies on MDD have progressed in recent years understanding changes of brain structure and function using morphological or neurobiological features. The studies summarized here used a number of machine learning algorithms, such as support vector machines (SVM), logistic regression, and neural networks, to differentiate between MDD and HCs using fMRI data. Especially, resting-state functional connectivity characteristics of the entire brain were studied in MDD. In order to distinguish individuals with MDD from controls, several studies (13–16) employed SVM-based multivariate pattern analysis (MVPA) techniques, achieving a better classification accuracy. However, there are limitations to this approach that stem from small sample sizes, scanner variability, and the absence of a comprehensive independent data set. By computing the Hurst exponents of resting-state networks, researchers examined their long-term memory for distinguishing depressive patients from HCs (17). Scale-free dynamics of depression-related brain activity were seen as describing the long-term memory of resting-state networks. An SVM-based classifier was used to test the data with a leave-one-out cross-validation (Loocv) method. Others studied the effects of MDD and schizophrenia on whole brain R-fMRI using SVM based MVPA in Yu et al. (18), Lois and Wessa (19), Zhu et al. (20), and Li et al. (21). Again, the dimensionality reduction technique relied on the Loocv strategy due to the small sample size. However, it is essential to evaluate the classification performance of these perspectives using a larger sample of subjects. Others showed that whole-brain R-fMRI connectivity may effectively predict antidepressant medication status in people with serious MDD (22). Medication-naive patients were distinguished from controls by the use of a trained linear SVM classifier based on MVPA technique. A different MVPA strategy based on linear, radial basis function (RBF)-SVM classification with the elastic net feature selection technique could accurately distinguished MDD patients from control subjects (23). The hyper-networks in this study were built using an elastic net and the group lasso technique. Hyper-edge, brain area, and average metric analyses suggested that the hyper-networks built with elastic net and group lasso differed structurally (24). Further, functional connection density measures derived by R-fMRI have shown to be successful to determine the relationship between the changes in resting-state activities and the responses to electroconvulsive therapy in 23 patients with MDD (25). The neural indices that were discovered as classification criteria were entered into linear SVM based MVPA, which was then used to categorize MDD patients. In another study, the identification of MDD among subjects, was explored via different static and dynamic connection metrics retrieved from R-fMRI (26). In this study a feature vector for classification was built by combining features from static and dynamic techniques. To determine the final predictor performance, a Loocv procedure was implemented. Differential sub-graph entropy and dynamic connectivity characteristics have been used in an SVM classifier to distinguish between people with MDD and HCs. A sliding-window approach was implemented to determine functional connectivity in the context of dynamic processes (27). The dynamic functional connectivity matrices were then employed as features in a non-linear SVM model to differentiate MDD patients from controls. Analysis techniques based on the minimum spanning tree need the computation of measurable qualities and the selection of these attributes as features in the classification. Using two feature types to measure two aspects of the network, a multi-kernel SVM classification method (28) allows the use of both brain region features and subgraph features. All studies discussed above made use of an approach that is based on linear SVM, multi-kernel SVM and RBF-SVM to differentiate between MDD patients and HCs.

Some other studies have used different ML methods, such as regression and neural networks, to differentiate between individuals with MDD and HCs. Using partial least squares regression on R-fMRI data, researchers developed a low-dimensional representation that links symptoms to brain activity and predicts clinical measures (29). R-fMRI connectivity in another study was calculated by employing the automated anatomical labeling layout with a partial correlation method (30). The calculations of the metrics and the classification analysis were performed in the frame of neural network. However, the efficacy and sorting of selected features, as well as sample size, kind of classifiers, and distribution of data, all have a role in determining the appropriate amount of features. For example, when developing a classifier for melancholy MDD (31), it has been shown that is it crucial to identify vitally relevant functional connections. It is for such cases recommended to use logistic regression to evaluate the uniformity vs. heterogeneity connectivity hypotheses.

The concept of deep learning has recently received significant attention. Notably, graph-based techniques, such as GNN, have been used to investigate detailed node pair in imaging/nonimaging characteristics among participants, with the goal of identifying significant phenotypes for clinical identification. Successfully applying a whole-brain data-driven approach with R-fMRI, confirmed the use of effective connectivity for MDD detection by calculating its measures via a group sparse representation and a structured equation modeling approach (32). Successful integration of effective connectivity and nonimaging phenotypic information allowed the use of spectral graph convolutional networks (GCN) based on a population graph to differentiate drug-naive MDD patients from HCs. Using functional connectivity as a characteristic, Ktena et al. (33) trained a spectral GCN with subjects as nodes. The spectral GCN was used to diagnose the problem by grouping the nodes into their respective categories. Others used a mutual multi-scale triplet GCN (34) for the purpose of analyzing static FC and structural connectivity with the intention of identifying brain disorders. Further a spatio-temporal GCN framework was created to train discriminative features from FC measures for the automated identification and treatment response prediction of MDD (35). The GCN model was developed to every participant's whole-brain functional network in order to differentiate MDD patients from HCs, recognize the most important regions making a contribution to classifying, and investigate the association between structural features of salient regions and clinical features (36).

In this research, our main aim is to employ an ensemble-based GNN framework to perform the primary classification analysis between MDD and HCs as well as subgroup analysis between first-episode and recurrent (REC). Instead of using a single unified GNN model to learn representations for all of the nodes in a large graph, it is better to use ensemble learning methods (37) to improve classification performance. With ensemble learning, many fundamental classifiers are combined to boost the predictive power of the model. Therefore, to enhance the efficiency of scalable GNN, we propose a GNN-based ensemble model that creates customized models.

1.1. Aims

This study provides a methodology for MDD analysis and classification using brain functional networks derived from R-fMRI data. Since the brain is a complex network system, the present study analyses the R-fMRI as the whole brain functional structure rather than individual FCs. In this research, our main aim is to employ an ensemble- based GNN framework to perform the primary classification. Our main scientific contributions are as follows:

• Developing a robust ensemble-based GNN model that takes R-fMRI data for the detection of MDD. A created novel ensemble model is used for identifying individuals with MDD from HCs as well as to perform analysis between two sub groups of MDD patients, namely first episode drug nive (FEDN) and recurrent (REC) MDD patients.

• GCN, GAT, and GraphSAGE models are created as the base line models for the ensemble model, for improving classification accuracy. A developed ensemble model is trained using individual whole-brain functional networks.

• Methods of upsampling and downsampling are employed to achieve balanced sample size. A 10-fold enumeration is used to refine the classification process. Empirical investigations with a large sample size showed that our model is more accurate and beneficial for classification of MDD compared to other models that are currently available.

2. Materials and methods

2.1. Subjects

R-fMRI data from the REST-meta-MDD collaboration (6), which contained 25 datasets totaling 2428 persons (1300 MDD patients and 1128 HCs from 17 hospitals), was used in the present study. There have been 562 patients with MDD who were experiencing their first episode of the disorder, as well as 282 patients with MDD who have been experiencing recurring episodes of the disorder. According to a previous publication on the dataset (6), we used criteria such as missing data, low-quality spatial normalization, insufficient coverage, noticeable head movement, and sites with fewer than 10 subjects to exclude. This was produced in a sample of 821 people with MDD and 765 HCs from 16 different sites. Drug consumption data was submitted by 527 patients; 219 of these individuals are currently using first episode MDD patients without medication treatment was defined as FEDN, and REC is the MDD patients with recurrent episode regardless of medication status. Two research groups (Sites 5 and 13) contributed data on 117 FEDN patients and 72 patients with REC MDD, five research groups (Sites 4, 5, 9, 13, and 16) contributed data on 227 FEDN patients and 388 HCs, and six research groups (Sites 3, 5, 7, 12, 13, and 14) contributed statistics on 189 patients with REC and 423 HCs. The studies involving human participants data were reviewed and approved by the Institutional Review Board of Kunsan National University.

2.2. Preprocessing

Data from R-fMRI and structural MRI were acquired and the DPARSF toolkit (38) was used to perform preprocessing procedure. Slice timing correction, head motion correction, normalization, and the elimination of confounds were the main preprocessing procedures. Dosenbach's atlas was used as a reference point during the process of segmenting the entire brain into 160 distinct regions of interest (ROI) (39). The voxel-level BOLD values were extracted and averaged across all ROIs. The Pearson correlation coefficient of the related time series was used to assess FC between each pair of ROIs. Finally, the correlation estimates were transformed using Fisher's z-transform to generate FC matrix in the range of 160 × 160 for each subject (40).

2.3. Methods

The overall process of the GNN-based ensemble model is shown in Figure 2. The FC matrix the whole brain is initially depicted as a weighted undirected graph G(N, E), where N and E are collections of nodes and edges. Nodes are the 160 brain regions identified by the ROIs, and their characteristics are the matrix representation of the functional connection between them. The link between nodes are represented by a weighted adjacency matrix (A). Each node is linked to its nearest neighbors using a k-nearest neighbors (KNN) technique to establish edges (36). In order to construct the GNN-based ensemble model, the GCN, GAT, and SAGE models are used as the base models. The core operation of an ensemble model is to combine out features of base models and apply the softmax activation function to translate final scalers into predicted probabilities of the each class.

FIGURE 2

Figure 2. Overall framework of the proposed method for MDD classification. Ensemble model is designed to create meta model from the out features of each base models. GAT, graph attention network; GCN, graph convolutional network; SAGE, GraphSAGE technique, ReLU, Rectified Linear Unit; ELU, Exponential Linear Unit, MDD, major depressive disorder; HC, healthy controls.

2.3.1. Base models

First, the FC matrix was represented as a graph structure, along with an adjacency matrix and node characteristics. The GAT model uses a graph attention layer to learn the node representation, followed by an attention pooling layer and a classification layer to retrieve the node representation and perform the task of learning. We began by stacking three GAT layers with exponential linear unit (eLU) activation functions, then moved on to a global mean pooling layer, then a dropout layer, and finally fed that information into a classification layer. The head size is 8 and the dropout rate is 0.5 in the GAT model. Input layer, graph convolutional hidden layer, fully connected layer, and global average pooling layer are the components that make up GCN model. After each hidden layer, there is a rectified linear unit (ReLU) activation function. The dropout rate is 0.3 in the GCN model. We also use GraphSAGE with three layers (K = 3) as a base model. The dimensions of the node embeddings are the same as the size of the hidden units that are used in each GraphSAGE layer, which is 64. The ReLU activation function and a dropout rate of 0.3 are used in each of the GraphSAGE layers. Also, Every model has a weight decay value of 5 x 10⁻⁴ and a learning rate of 0.01. Each model in GNN, such as GCN, GAT, and GraphSAGE, is trained with the same set of node features, edge features, weights, and learning rate. Figure 3 illustrates the processes involved in creating a base model.

FIGURE 3

Figure 3. Structure of the base model.

2.3.2. Ensemble model

We use the ensemble-based GNN model in order to determine the essential features that contribute to the prediction of MDD. The suggested ensemble-GNN procedure is depicted in the flowchart described in Figure 4. We construct a GNN-based ensemble model using GCN, GAT, and GraphSAGE as building blocks. Both the node features and the adjacency matrix are fed into the base models, and the features' identities are then extracted from the respective models. Each model's predicted output features are fed into the ensemble model. Then, for class prediction, we add fully connected layers with a softmax activation function. The cross-entropy loss function is put into effect to this extent. The adam optimizer is used to find the optimal values for each of the model's parameters.

FIGURE 4

Figure 4. Flowchart to show the process of ensemble model.

2.4. MDD identification and evaluation

MDD classification employs ensemble-based GNN supervised learning classifiers, and this classification makes it simple to compare the efficacy of various machine learning strategies for processing fMRI data. Both training and testing are required for supervised learning classification. With the help of the samples' class labels, the classifier identifies a decision boundary that divides the input space during the training phase. Once the decision function has been calculated using the training set, it may be applied to unseen testing data to infer the corresponding class label. In order to reduce the overfitting problem and to offer a reliable and generalizable classification performance evaluation, the effectiveness of the classification framework is evaluated using the 10-fold cross validation scheme. In order to balance the sample size, oversampling is accomplished by copying data from minority classes, whereas undersampling is carried out by selecting data from majority classes. The performance of the classification system is measured and analyzed based on its accuracy (ACC), specificity (SPE), sensitivity (SEN), and area under the curve (AUC). The diagnostic accuracy of a classifier can be measured with the use of receiver operating characteristic, which is a curve that is generated by graphing the true positive rate against the false positive rate. The process of determining classification ACC, SPE, and SEN is denoted as,

\begin{array}{l} A C C = \frac{T P + T N}{T P + F P + F N + T N} & (1) \end{array}

\begin{array}{l} S P E = \frac{T N}{T N + F P} & (2) \end{array}

\begin{array}{l} S E N = \frac{T P}{T P + F N} & (3) \end{array}

In this case, TP indicates a successful classification of positive samples, TN indicates a successful classification of negative samples, and FP indicates incorrect classification of negative samples as positive, and FN indicates incorrect negative classification.

3. Results

In this section, we validate the efficiency of the suggested MDD identification approach by analyzing the following scenarios: (a) using FC as features; (b) using GCN, GAT, and GraphSAGE as base learners; (c) using an ensemble classification model. We used a 10-fold cross-validation, and in that cross-training, we split the samples of all MDD and HCs into 10 groups. Each time the method is modified, one unit is chosen as the testing dataset for assessing the performance of the model, while the remaining 9 units are used as the training dataset. By stratifying the 10-fold cross-validation, we are able to keep the percentage of samples from each class in every fold equal across the entire sample. In this case, the samples are not balanced, so in order to create samples that are balanced, random upsampling is performed on the majority classes, and random downsampling is performed on the minority classes. For the primary analysis, there were a total of 1,586 participants included in our method (821 patients with MDD and 765 HCs). Based on the clinical data for the patients who are included, 243 were FEDN patients, and 203 are REC patients.

3.1. MDD vs. HC classification

The ensemble model attained an accuracy of 71.8% for upsampling and 70.4% for downsampling when it came to classifying MDD and HC. When using upsampling, the ensemble model achieves an AUC of 76.53%, while using downsampling, it achieves an AUC of 71.27%. Specificity and sensitivity values for upsampling are 74.96 and 68.23%, respectively, whereas the values for downsampling are 67.27 and 72.88%. The findings for upsampling and downsampling based on a 10-fold cross validation for MDD and HC classification are given in Figure 5.

FIGURE 5

Figure 5. 10-Fold cross validation for MDD and HC samples depicting accuracy, sensitivity, specificity and the area under the curve. (A) Upsampling and (B) Downsampling.

3.2. FEDN vs. HC classification

FEDN could be distinguished from HCs with a classification accuracy of 88.93% for upsampling and 64.17% for downsampling. Classification of FEDN patients with HC achieved an upsampling AUC of 85.75% and a downsampling AUC of 62.25%. Upsampling has a specificity and sensitivity of 89% and 85.79%, whereas downsampling has a specificity and sensitivity of 60.31 and 61.36%. The findings from classifying FEDN and HCs are shown in Table 1.

TABLE 1

Table 1. Ensemble model performance for FEDN vs. HC classification.

3.3. REC vs. HC classification

Classification accuracy for upsampling REC with HC is 91.6%, whereas classification accuracy for downsampling REC patients with HC is 68.78%. Using upsampling, we are able to classify REC patients as distinct from HC with an AUC of 88.24%, while using downsampling, we are only able to reach an AUC of 67.11 %. The respective results for specificity and sensitivity while upsampling are 93.15 and 87.20%, while they are 60.96 and 66.92% when downsampling. Table 2 describes the results produced from the classification of REC and HCs.

TABLE 2

Table 2. Ensemble model performance for REC vs. HC classification.

3.4. FEDN vs. REC classification

The FEDN and the REC are discriminated from each other by the use of a subgroup analysis. There is a 77.78% accuracy rate when upsampling FEDN against REC and a 71.96% rate when downsampling. When we compared the existing approach (36) to the subgroup analysis, we found that the classification performance for the characterization of recurring patients was greater than that of FEDN. Using upsampling, we achieved an AUC of 75.19%, whereas using downsampling, we achieved an AUC of 71.77%. Upsampling yields results of 72.81 and 81.91% for specificity and sensitivity, whereas downsampling yields results of 71.52 and 71.56%. The outcomes of the FEDN and REC classifications are shown in Table 3.

TABLE 3

Table 3. Ensemble model performance under FEDN vs. REC classification.

4. Discussion

We proposed an ensemble-based GNN method for automatic MDD identification using whole brain functional network features. Using a large open source dataset, the current study employed an ensemble based GNN to classify MDD as well as to classify FEDN with REC, and the resulting upsampling classification performance outperformed typical machine learning approaches by around, 71.18% and 77.78%, respectively. In the analysis process, initially, separate base models are created, and then the classification performance of each model is examined. Models such as GCN, GAT, and GraphSAGE are employed as base line models. The GAT approach is utilized during the training of the model, which resulted in an upsampling accuracy of 66.24% when comparing MDD to HC and 71.67% when comparing FEDN to REC. The GCN technique is also separately applied during the training process of the model, which led to an upsampling accuracy of 64.72% while correlating MDD to HC and 73.58% while comparing FEDN to REC. Also, the GraphSAGE model alone was employed when training the model, which produced in an upsampling accuracy of 64.47 % for MDD among HC classification and 72.78% for FEDN with REC classification. In addition to this, we used an all-individual model to analyze subgroups such as FEDN with HC and REC with HC. The results of the individual base models such as GCN, GAT, and GraphSAGE are listed in Table 4. The GNN-based ensemble model is developed to improve the classification accuracy of primary analysis as well as subgroup analysis in analyzing MDD. Already, the base models are trained independently, and while some findings indicate that GCN produces better results, other findings show that GAT or GraphSAGE produces better outcomes. That indicates that no one model achieves better results for all classes. Because of this, a combined model is developed to produce more accurate results for the whole sample. Figure 6 displays the results of a comparison between the base model and the ensemble model.

TABLE 4

Table 4. Base model performance under different group analysis.

FIGURE 6

Figure 6. Comparison analysis between the individual model and the ensemble model. (A) Upsampling and (B) Downsampling.

In the majority of the earlier investigations, the ML algorithm was employed to differentiate between all MDD and HCs (20, 41–43). To identify brain disorders, researchers have created a number of deep learning techniques, including BrainNetCNN (44) and discriminative/generative long short-term memory (45). However, the sample size that they employed for the investigation was comparatively small. The use of GNN to distinguish between MDD and HCs has been proven effective in a small number of studies. Some previous research (32, 34, 46) has shown that the graph convolution technique can be used to distinguish disorders in patients with HC. This is in contrast to others, who employed MVPA of static or dynamic functional connectivity in the brain network (47, 48), which neglected topological elements that could provide key clues for diagnosis. The ensemble-GNN algorithm combines the results of numerous GNN classifiers into a single model in order to minimize the impact of overfitting. In addition, by using ensemble-GNN, the deviation caused by a single classifier can be reduced, resulting in improved reliability. When applied to imbalanced datasets with the same number of learning epochs, ensemble-GNN obtains a higher classification accuracy than a single classifier does. We took primary analyses into consideration such as all MDD with HCs as well as subgroup analyses including FEDN among HCs, REC with HCs, and FEDN along with REC. In addition, We also analyzed the model using the Craddock and Automated Anatomical Labeling (AAL) atlases. Our ensemble GNN model achieves an accuracy of 74.75% for the AAL atlas and 73.37% for the Craddock atlas when classifying MDD vs. HC upsampling data. Tables 2, 4 of the Supplementary material contain the primary and all of the sub class analysis results for the AAL and Craddock atlas, respectively.

5. Limitations

Some limitations should be taken into account when evaluating the present findings. We were not able to reach the classification performance in the case of MDD with HC categorization when compared with the previous approach (36). However, our methods achieve higher performance in subgroup classification. In order to solve the problem of imbalanced class representation, we tried both random sampling. Upsampling gives better results, but it can add a redundant samples to the model, which slows down training and vulnerable to overfitting. In our approach, the overfitting problem is reduced by utilizing cross-validation; however, training speed is not taken into account. Also, with respect to the population, most MDD was females which may have a confounding effect on MDD classification. However, it should be noted that the same GNN model structures could also successfully classify between MDD subgroups, FEDN vs. REC, implying that MDD classification is not entirely dependent on the sex effect. Further studies that aim to test this effect requires a much larger sample size for controlling sex or adequate matching.

6. Conclusion

In this study, we effectively created an ensemble model based on GNN for classifying MDD by utilizing R-fMRI data. In particular, we investigated a sub-group analysis between FEDN with REC. The proposed model that employs whole brain functional connectivity classifies MDD patients and healthy individuals with high accuracy. In order to improve the overall performance of the ensemble model, we used three different GNN base models under 10-fold cross-validation. Based on large dataset and a number of different of validation techniques, an ensemble model could classify MDD and HCs with a feasible accuracy of 71.18% for upsampling and 70.14% for downsampling. When compared to earlier approaches, the findings that these methods yield in the subgroup analysis are higher. This method achieves an accuracy of 77.78% for upsampling and 71.96% for downsampling when applied to an analysis of FEDN and REC. The findings of this validation suggest that our model produces a feasible application to assist healthcare professionals in identifying MDD.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving human participants were reviewed and approved by the Institutional Review Board of Kunsan National University. Written informed consent from the was not required to participate in this study in accordance with the national legislation and the institutional requirements.

Author contributions

SV, MV, LW, SK, ML, UH, I-HR, and H-GJ contributed to conception and implementation of the study. SV wrote the first draft. All authors contributed to review, editing, and approved the submitted version.

Funding

This work was supported by Electronics and Telecommunications Research Institute (ETRI) grant funded by the Korean government (23ZK1100, Honam region regional industry-based ICT convergence technology advancement support project), and by the FZJ-NST Bilateral Cooperation Programme funded by the Forschungszentrum Jülich and the National Research Council of Science & Technology (Global-22-001).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2023.1125339/full#supplementary-material

References

1. Yang H, Chen X, Chen ZB, Li L, Li XY, Castellanos FX, et al. Disrupted intrinsic functional brain topology in patients with major depressive disorder. Mol Psychiatry. (2021) 26:7363–71. doi: 10.1038/s41380-021-01247-2

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Rafiei A, Zahedifar R, Sitaula C, Marzbanrad F. Automated detection of major depressive disorder with EEG signals: a time series classification using deep learning. IEEE Access. (2022) 10:73804–17. doi: 10.1109/ACCESS.2022.3190502

CrossRef Full Text | Google Scholar

3. Müller VI, Cieslik EC, Serbanescu I, Laird AR, Fox PT, Eickhoff SB. Altered brain activity in unipolar depression revisited: meta-analyses of neuroimaging studies. JAMA Psychiatry. (2017) 74:47–55. doi: 10.1001/jamapsychiatry.2016.2783

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Cai H, Qu Z, Li Z, Zhang Y, Hu X, Hu B. Feature-level fusion approaches based on multimodal EEG data for depression recognition. Inf Fusion. (2020) 59:127–38. doi: 10.1016/j.inffus.2020.01.008

CrossRef Full Text | Google Scholar

5. Shao X, Sun S, Li J, Kong W, Zhu J, Li X, et al. Analysis of functional brain network in MDD based on improved empirical mode decomposition with resting state EEG data. IEEE Trans Neural Syst Rehabil Eng. (2021) 29:1546–56. doi: 10.1109/TNSRE.2021.3092140

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Yan CG, Chen X, Li L, Castellanos FX, Bai TJ, Bo QJ, et al. Reduced default mode network functional connectivity in patients with recurrent major depressive disorder. Proc Natl Acad Sci USA. (2019) 116:9078–83. doi: 10.1073/pnas.1900390116

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Li G, Liu Y, Zheng Y, Li D, Liang X, Chen Y, et al. Large-scale dynamic causal modeling of major depressive disorder based on resting-state functional magnetic resonance imaging. Hum Brain Mapp. (2020) 41:865–81. doi: 10.1002/hbm.24845

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Kaiser RH, Whitfield-Gabrieli S, Dillon DG, Goer F, Beltzer M, Minkel J, et al. Dynamic resting-state functional connectivity in major depression. Neuropsychopharmacology. (2016) 41:1822–30. doi: 10.1038/npp.2015.352

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Khan DM, Masroor K, Jailani MFM, Yahya N, Yusoff MZ, Khan SM. Development of wavelet coherence EEG as a biomarker for diagnosis of major depressive disorder. IEEE Sens rs J. (2022) 22:4315–25. doi: 10.1109/JSEN.2022.3143176

CrossRef Full Text | Google Scholar

10. Bronstein MM, Bruna J, LeCun Y, Szlam A, Vandergheynst P. Geometric deep learning: going beyond euclidean data. IEEE Signal Process Mag. (2017) 34:18–42. doi: 10.1109/MSP.2017.2693418

CrossRef Full Text | Google Scholar

11. Kim BH, Ye JC. Understanding graph isomorphism network for rs-fMRI functional connectivity analysis. Front Neurosci. (2020) 14:630. doi: 10.3389/fnins.2020.00630

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Li X, Zhou Y, Dvornek N, Zhang M, Gao S, Zhuang J, et al. Braingnn: interpretable brain graph neural network for fmri analysis. Med Image Anal. (2021) 74:102233. doi: 10.1016/j.media.2021.102233

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Bhaumik R, Jenkins LM, Gowins JR, Jacobs RH, Barba A, Bhaumik DK, et al. Multivariate pattern analysis strategies in detection of remitted major depressive disorder using resting state functional connectivity. Neuroimage Clin. (2017) 16:390–8. doi: 10.1016/j.nicl.2016.02.018

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Craddock RC, Holtzheimer PE III, Hu XP, Mayberg HS. Disease state prediction from resting state functional connectivity. Mag Reson Med. (2009) 62:1619–28. doi: 10.1002/mrm.22159

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Zeng LL, Shen H, Liu L, Wang L, Li B, Fang P, et al. Identifying major depression using whole-brain functional connectivity: a multivariate pattern analysis. Brain. (2012) 135:1498–507. doi: 10.1093/brain/aws059

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Guo S, Yu Y, Zhang J, Feng J. A reversal coarse-grained analysis with application to an altered functional circuit in depression. Brain Behav. (2013) 3:637–48. doi: 10.1002/brb3.173

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Wei M, Qin J, Yan R, Li H, Yao Z, Lu Q. Identifying major depressive disorder using Hurst exponent of resting-state brain networks. Psychiatry Res Neuroimaging. (2013) 214:306–12. doi: 10.1016/j.pscychresns.2013.09.008

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Yu Y, Shen H, Zeng LL, Ma Q, Hu D. Convergent and divergent functional connectivity patterns in schizophrenia and depression. PLoS ONE. (2013) 8:e68250. doi: 10.1371/journal.pone.0068250

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Lois G, Wessa M. Differential association of default mode network connectivity and rumination in healthy individuals and remitted MDD patients. Soc Cogn Affect Neurosci. (2016) 11:1792–801. doi: 10.1093/scan/nsw085

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Zhu X, Yuan F, Zhou G, Nie J, Wang D, Hu P, et al. Cross-network interaction for diagnosis of major depressive disorder based on resting state functional connectivity. Brain Imaging Behav. (2021) 15:1279–89. doi: 10.1007/s11682-020-00326-2

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Li J, Chen H, Fan F, Qiu J, Du L, Xiao J, et al. White-matter functional topology: a neuromarker for classification and prediction in unmedicated depression. Transl Psychiatry. (2020) 10:1–10. doi: 10.1038/s41398-020-01053-4

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Qin J, Shen H, Zeng LL, Jiang W, Liu L, Hu D. Predicting clinical responses in major depression using intrinsic functional connectivity. Neuroreport. (2015) 26:675–80. doi: 10.1097/WNR.0000000000000407

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Geng X, Xu J, Liu B, Shi Y. Multivariate classification of major depressive disorder using the effective connectivity and functional connectivity. Front Neurosci. (2018) 12:38. doi: 10.3389/fnins.2018.00038

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Guo H, Li Y, Xu Y, Jin Y, Xiang J, Chen J. Resting-state brain functional hyper-network construction based on elastic net and group lasso methods. Front Neuroinform. (2018) 12:25. doi: 10.3389/fninf.2018.00025

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Wang J, Wei Q, Yuan X, Jiang X, Xu J, Zhou X, et al. Local functional connectivity density is closely associated with the response of electroconvulsive therapy in major depressive disorder. J Affect Disord. (2018) 225:658–64. doi: 10.1016/j.jad.2017.09.001

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Sen B, Cullen KR, Parhi KK. Classification of adolescent major depressive disorder via static and dynamic connectivity. IEEE J Biomed Health Inform. (2020) 25:2604–14. doi: 10.1109/JBHI.2020.3043427

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Yan B, Xu X, Liu M, Zheng K, Liu J, Li J, et al. Quantitative identification of major depression based on resting-state dynamic functional connectivity: a machine learning approach. Front Neurosci. (2020) 14:191. doi: 10.3389/fnins.2020.00191

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Guo H, Yan P, Cheng C, Li Y, Chen J, Xu Y, et al. fMRI classification method with multiple feature fusion based on minimum spanning tree analysis. Psychiatry Res Neuroimaging. (2018) 277:14–27. doi: 10.1016/j.pscychresns.2018.05.001

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Yoshida K, Shimizu Y, Yoshimoto J, Takamura M, Okada G, Okamoto Y, et al. Prediction of clinical depression scores and detection of changes in whole-brain using resting-state functional MRI data with partial least squares regression. PLoS ONE. (2017) 12:e0179638. doi: 10.1371/journal.pone.0179638

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Guo H, Cheng C, Cao X, Xiang J, Chen J, Zhang K. Resting-state functional connectivity abnormalities in first-onset unmedicated depression. Neural Regenerat Res. (2014) 9:153. doi: 10.4103/1673-5374.125344

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Ichikawa N, Lisi G, Yahata N, Okada G, Takamura M, Hashimoto Ri, et al. Primary functional brain connections associated with melancholic major depressive disorder and modulation by antidepressants. Sci Rep. (2020) 10:1–12. doi: 10.1038/s41598-020-73436-y

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Jun E, Na KS, Kang W, Lee J, Suk HI, Ham BJ. Identifying resting-state effective connectivity abnormalities in drug-naïve major depressive disorder diagnosis via graph convolutional networks. Hum Brain Mapp. (2020) 41:4997–5014. doi: 10.1002/hbm.25175

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Ktena SI, Parisot S, Ferrante E, Rajchl M, Lee M, Glocker B, et al. Metric learning with spectral graph convolutions on brain connectivity networks. Neuroimage. (2018) 169:431–42. doi: 10.1016/j.neuroimage.2017.12.052

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Yao D, Sui J, Wang M, Yang E, Jiaerken Y, Luo N, et al. A mutual multi-scale triplet graph convolutional network for classification of brain disorders using functional or structural connectivity. IEEE Trans Med Imaging. (2021) 40:1279–89. doi: 10.1109/TMI.2021.3051604

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Kong Y, Gao S, Yue Y, Hou Z, Shu H, Xie C, et al. Spatio-temporal graph convolutional network for diagnosis and treatment response prediction of major depressive disorder from functional connectivity. Hum Brain Mapp. (2021) 42:3922–33. doi: 10.1002/hbm.25529

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Qin K, Lei D, Pinaya WH, Pan N, Li W, Zhu Z, et al. Using graph convolutional network to characterize individuals with major depressive disorder across multiple imaging sites. EBioMedicine. (2022) 78:103977. doi: 10.1016/j.ebiom.2022.103977

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Shi S, Qiao K, Yang S, Wang L, Chen J, Yan B. Boosting-GNN: boosting algorithm for graph networks on imbalanced node classification. Front Neurorob. (2021) 15:154. doi: 10.3389/fnbot.2021.775688

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Yan C, Zang Y. DPARSF: a MATLAB toolbox for "pipeline" data analysis of resting-state fMRI. Front Syst Neurosci. (2010) 4:13. doi: 10.3389/fnsys.2010.00013

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Dosenbach NU, Nardos B, Cohen AL, Fair DA, Power JD, Church JA, et al. Prediction of individual brain maturity using fMRI. Science. (2010) 329:1358–61. doi: 10.1126/science.1194144

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Achard S, Bullmore E. Efficiency and cost of economical brain functional networks. PLoS Comput Biol. (2007) 3:e17. doi: 10.1371/journal.pcbi.0030017

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Guo M, Wang T, Zhang Z, Chen N, Li Y, Wang Y, et al. Diagnosis of major depressive disorder using whole-brain effective connectivity networks derived from resting-state functional MRI. J Neural Eng. (2020) 17:056038. doi: 10.1088/1741-2552/abbc28

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Nakano T, Takamura M, Ichikawa N, Okada G, Okamoto Y, Yamada M, et al. Enhancing multi-center generalization of machine learning-based depression diagnosis from resting-state fMRI. Front Psychiatry. (2020) 11:400. doi: 10.3389/fpsyt.2020.00400

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Yamashita A, Sakai Y, Yamada T, Yahata N, Kunimatsu A, Okada N, et al. Generalizable brain network markers of major depressive disorder across multiple imaging sites. PLoS Biol. (2020) 18:e3000966. doi: 10.1371/journal.pbio.3000966

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Kawahara J, Brown CJ, Miller SP, Booth BG, Chau V, Grunau RE, et al. BrainNetCNN: convolutional neural networks for brain networks; towards predicting neurodevelopment. NeuroImage. (2017) 146:1038–49. doi: 10.1016/j.neuroimage.2016.09.046

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Dvornek NC, Li X, Zhuang J, Duncan JS. Jointly discriminative and generative recurrent neural networks for learning from fMRI. In: International Workshop on Machine Learning in Medical Imaging. Springer (2019). p. 382–90.

PubMed Abstract | Google Scholar

46. Parisot S, Ktena SI, Ferrante E, Lee M, Guerrero R, Glocker B, et al. Disease prediction using graph convolutional networks: application to autism spectrum disorder and Alzheimer's disease. Med Image Anal. (2018) 48:117–30. doi: 10.1016/j.media.2018.06.001

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Hultman R, Ulrich K, Sachs BD, Blount C, Carlson DE, Ndubuizu N, et al. Brain-wide electrical spatiotemporal dynamics encode depression vulnerability. Cell. (2018) 173:166–80. doi: 10.1016/j.cell.2018.02.012

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Ramirez-Mahaluf JP, Roxin A, Mayberg HS, Compte A. A computational model of major depression: the role of glutamate dysfunction on cingulo-frontal network dynamics. Cereb Cortex. (2017) 27:660–79. doi: 10.1093/cercor/bhv249

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: major depressive disorder, deep learning, graph neural network, ensemble model, functional connectivity

Citation: Venkatapathy S, Votinov M, Wagels L, Kim S, Lee M, Habel U, Ra I-H and Jo H-G (2023) Ensemble graph neural network model for classification of major depressive disorder using whole-brain functional connectivity. Front. Psychiatry 14:1125339. doi: 10.3389/fpsyt.2023.1125339

Received: 16 December 2022; Accepted: 02 March 2023;
Published: 23 March 2023.

Edited by:

Zhen Zhou, University of Pennsylvania, United States

Reviewed by:

Dongren Yao, Massachusetts Eye and Ear Infirmary and Harvard Medical School, United States
Wenming Zhao, First Affiliated Hospital of Anhui Medical University, China

Copyright © 2023 Venkatapathy, Votinov, Wagels, Kim, Lee, Habel, Ra and Jo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Han-Gue Jo, aGdqb0BrdW5zYW4uYWMua3I=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Ensemble graph neural network model for classification of major depressive disorder using whole-brain functional connectivity

1. Introduction

1.1. Aims

2. Materials and methods

2.1. Subjects

2.2. Preprocessing

2.3. Methods

2.3.1. Base models

2.3.2. Ensemble model

2.4. MDD identification and evaluation

3. Results

3.1. MDD vs. HC classification

3.2. FEDN vs. HC classification

3.3. REC vs. HC classification

3.4. FEDN vs. REC classification

4. Discussion

5. Limitations

6. Conclusion

Data availability statement

Ethics statement

Author contributions

Funding

Conflict of interest

Publisher's note

Supplementary material

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good