Multiple sclerosis clinical forms classification with graph convolutional networks based on brain morphological connectivity

Chen, Enyi; Barile, Berardino; Durand-Dubief, Françoise; Grenier, Thomas; Sappey-Marinier, Dominique

doi:10.3389/fnins.2023.1268860

METHODS article

Front. Neurosci. , 18 January 2024

Sec. Neurodegeneration

Volume 17 - 2023 | https://doi.org/10.3389/fnins.2023.1268860

This article is part of the Research Topic Computational Modeling and Machine Learning Methods in Neurodevelopment and Neurodegeneration: from Basic Research to Clinical Applications View all 11 articles

Multiple sclerosis clinical forms classification with graph convolutional networks based on brain morphological connectivity

$\r\nEnyi Chen$ Enyi Chen¹

Berardino Barile¹

Françoise Durand-Dubief^1,2

Thomas Grenier¹

Dominique Sappey-Marinier^1,3^*

¹CREATIS, CNRS UMR 5220, INSERM U1294, Université de Lyon, Université Claude Bernard-Lyon 1, INSA Lyon, Lyon, France
²Service de Sclérose en Plaques, des Pathologies de la Myéline et Neuro-Inflammation, Groupement Hospitalier Est, Hôpital Neurologique, Bron, France
³CERMEP - Imagerie du Vivant, Université de Lyon, Bron, France

Multiple Sclerosis (MS) is an autoimmune disease that combines chronic inflammatory and neurodegenerative processes underlying different clinical forms of evolution, such as relapsing-remitting, secondary progressive, or primary progressive MS. This identification is usually performed by clinical evaluation at the diagnosis or during the course of the disease for the secondary progressive phase. In parallel, magnetic resonance imaging (MRI) analysis is a mandatory diagnostic complement. Identifying the clinical form from MR images is therefore a helpful and challenging task. Here, we propose a new approach for the automatic classification of MS forms based on conventional MRI (i.e., T1-weighted images) that are commonly used in clinical context. For this purpose, we investigated the morphological connectome features using graph based convolutional neural network. Our results obtained from the longitudinal study of 91 MS patients highlight the performance (F1-score) of this approach that is better than state-of-the-art as 3D convolutional neural networks. These results open the way for clinical applications such as disability correlation only using T1-weighted images.

1 Introduction

Multiple sclerosis (MS) is a chronic autoimmune inflammatory and demyelinating disease of the central nervous system. While its etiology is still unknown (Polman et al., 2011), MS is the first cause of non-traumatic neurological disability in young adults, affecting about 2.8 million people worldwide (Goodin, 2014). Often starting with a preliminary clinical isolated syndrome (CIS) involving a large heterogeneity of clinical symptoms such as weak limbs, blurred vision, dizziness, fatigue, or tingling sensations, the disease may evolve along two main clinical courses. In 85% of patients, the disease starts as a relapsing-remitting course (RRMS, noted RR), with the occurrence of relapses. These RRMS patients can evolve over time into a non-systematic secondary-progressive course (SPMS, noted SP). In the 15% remaining patients, the disease evolves as primary-progressive MS (PPMS, noted PP) which corresponds to a steadily worsening of symptoms over time without any relapses (Lublin et al., 2014). The current McDonald diagnostic criteria for MS combine clinical assessment, imaging, and laboratory findings (Thompson et al., 2018). Despite such clinical classification, the status and the evolution of each patient could be very different from one to another, leading more and more to individual therapeutic approaches. Thus, to propose personalized medical care and therapy, the neurologist needs to better predict the disease evolution based on early clinical, biological, and imaging markers available from disease onset.

Magnetic Resonance Imaging (MRI) is the most effective tool for the diagnosis of MS and for monitoring the disease modifying treatment. Conventional MRI provides T1-weighted (T1w), T2-weighted (T2w) and FLAIR images allowing the detection and follow-up of white matter (WM) lesions for clinical care (Mure et al., 2016). These conventional sequences allow the quantification of whole brain, WM or gray matter (GM) atrophy using dedicated software. More advanced MRI sequences such as diffusion-weighted imaging (DWI) and diffusion tensor imaging (DTI) have been developed to provide more sensitive markers of the inflammation processes occurring in WM and leading to T1- and T2-lesions. Several metrics of DTI such as the fractional anisotropy and the mean diffusion enable the detection of micro-architectural alterations in WM lesions as well as in normal-appearing WM (Jutten et al., 2019).

More recently, graph theory methods have been used to model brain network organization (Rubinov and Sporns, 2010; Guo et al., 2017). These graph models consist of nodes, based on the parcellation of brain GM regions, and edges, determined by the underlying links between the network nodes. In brain structural connectivity, these links are defined by the extraction of WM fibers using DTI tractography (Hagmann et al., 2007). Previously, Kocevar et al. (2016) have demonstrated an interest of such approaches for the classification of MS clinical profiles using Machine Learning (ML) methods, while Marzullo et al. (2019) improved the classification performance by a first approach using a Deep Learning (DL) model.

However, DTI data used for structural connectivity modeling require long acquisition time and complex processing techniques, which limits its applicability in clinical practice. Nevertheless, brain connectivity can also be obtained from conventional MRI by measuring morphological metrics of the GM on T1w images (Raamana and Strother, 2018). Indeed, several imaging investigations have shown that GM atrophy is present early in MS (Durand-Dubief et al., 2012; Eshaghi et al., 2018). Narayana et al. (2013) has found significant cortical thinning in RRMS patients compared to healthy subjects. Hence, the GM degeneration used in brain morphological connectivity models could provide a sensitive marker of the disease evolution. In such graphs, nodes represent GM areas obtained from the GM tissue parcellation, while edges represent a degree of (dis-)similarity between nodes features like GM thickness or curvature (MacDonald et al., 2000). Such approach has been recently used in Alzheimer's Disease (AD), showing that GM network measures predicted hippocampal atrophy rates in preclinical AD, in contrast to other AD biomarkers (Dicks et al., 2020). Also, Mahjoub et al. (2018) proposed to use morphological connectivity to discriminate late mild cognitive impairment from AD patients. Several studies of GM morphological network were used in Autism Spectrum Disorder (ASD) patients. Kong et al. (2019) proposed an auto-encoder-based deep neural network to identify ASD patients from typical controls, while Corps and Rekik (2019) used morphological networks to estimate the ASD patients' age and deduce the age-related cortical regions. In MS, Muthuraman et al. (2016) analyzed morphological GM thickness networks to classify CIS and RRMS patients using the Support Vector Machine model, obtaining a good level of accuracy. Meanwhile, several studies used graph metrics of GM networks to characterize MS patients. Hawkins et al. (2020) found reduced global efficiency and a more random network in RRMS subjects with cognitive impairment. Likewise, lower node degree and connectivity density were found by Rimkus et al. (2019) in MS patients with cognitive impairment. Rocca et al. (2021) combined functional connectivity and GM network to predict clinical worsening in MS, confirming that GM atrophy is an important predictor for the conversion from RRMS to SPMS. By using the source-based morphometry approach to decompose the cortical thickness map into different patterns, Steenwijk et al. (2016) have further shown that several anatomical patterns are strongly associated with clinical dysfunction in MS patients. Meanwhile, several studies also addressed the problem of age/gender and cortical thickness correlation, and removed their effects before further analysis. Eshaghi et al. (2016) fitted the linear regression between age and GM measurements and took only the residual part to classify MS cohort from neuromyelitis optical patients. Given the graph nature of brain connectivity, the use of graph neural network (GNN) to process such data is an evitable path. GNN allows us to deal with the heterogeneity of input data by capturing the message passing across nodes (Bronstein et al., 2021). More specifically, graph convolutional network (GCN), a reimplementation of convolution concept on GNN, is now ubiquitous in solving problems on non-euclidean data.

In the meantime, the application of convolutional neural network (CNN) has proven its strong ability in computer vision, especially in the biomedical image processing field. Leclerc et al. (2019) has successfully delineated cardiac structure on ultrasound images through an encoder-decoder-based model. 3D-CNN, a particular type of CNN, has been widely used in medical context since a huge amount of medical images were acquired and reconstructed in 3 dimensions. Various studies have focused on disease detection from anatomical neuroimaging (Wargnier-Dauchelle et al., 2023). Huang et al. (2019) have built a VGG-like CNN to adapt 3D image challenge for the purpose of Alzheimer's Disease (AD) classification using both T1w-MRI and FDG-PET modalities for a better outcome. Folego et al. (2020) have adapted LeNet, VGGNet, GoogLeNet, and ResNet in 3D domain to the aim of AD detection. Flaus et al. (2022) has proposed a 3D sequential ResNet to enhance PET images for better visualization of brain lesions. A transparent CNN framework proposed by Eitel et al. (2019) has revealed the decision process of CNN in the diagnosis of MS and pointed out more disease-relevant features in MR images. Optic nerve lesions, one of the first manifestations of MS, can be detected by the 3D-CNN model designed by Marti-Juan et al. (2022).

In this study, we proposed to use GCN for the classification of MS clinical forms based only on the measurement of GM morphological feature (thickness) obtained from T1w-MRI. The impacts of different methodological parameters such as the spatial resolution of the GM parcellation atlases and the level of different graph thresholds were compared. Finally, in order to demonstrate the interest of GCN for MS clinical forms classification, we compared the GCN with a classic 3D-CNN approach.

2 Materials and methods

Our method was divided into three steps: (i) cortical feature extraction using FreeSurfer (Fischl, 2012); (ii) generation of brain morphological graphs using distance computation and threshold; and (iii) clinical forms classification using GCN.

2.1 MRI acquisition and data

The MS patient group (AMSEP) consists of 42 RR, 28 SP, and 21 PP participants included in a longitudinal MRI study. CIS patients (n = 12) were included in the RR patient group, in accordance with our clinical expert. Patients (n = 3) with change in clinical forms have been removed from the MS group. The patients underwent MR scans on a 1.5T Siemens Sonata system using an 8-channel head-coil at the Lyon CERMEP imaging platform, including a sagittal millimetric 3D-T1 MPRAGE (magnetization prepared rapid gradient echo-MPRAGE) sequence [(TR/TE/TI) = 1970/3.93/1100 ms, flip angle = 15°, field of view (FOV) = 256 × 256 mm, slice thickness = 1 mm, voxel size = 1 × 1 × 1 mm]. Table 1 provides information on the clinical data in further detail. During the first 3 years, MRI exams were performed every 6 months, and every year during the following years. These make up a MS patient dataset of 660 scans in total as detailed in Table 1. A healthy control (HC) group of 21 subjects following the AMSEP protocol was included in this study.

Table 1

Table 1. MS cohort description of 660 scans including relapsing-remitting (RRMS), primary-progressive (PPMS), and secondary-progressive (SPMS) patients.

Another HC group of 314 scans from the IXI dataset (http://brain-development.org/ixi-dataset/) was introduced for the training process (noted IXI). These healthy subjects underwent MR scans on a 1.5T Philips Gyroscan Intera system using a T1w sequence (TR/TE = 9813/4603 ms, flip angle = 8°, 192 phase encoding steps, reconstruction diameter = 240 mm). These make up a HC dataset of 335 scans in total as detailed in Table 2.

Table 2

Table 2. Healthy controls cohort description of 335 T1-weighted MRI including 21 healthy controls (HC-AMSEP) acquired with the same protocol as MS cohort and 314 healthy controls (HC-IXI) obtained from the open-access IXI dataset.

2.2 Classification using graph-based convolutional network

As we explore the ability of cortical anatomical changes to identify MS forms, we extract features related to the shape of cortical regions. With such features, we then build a graph reflecting shape similarities between cortical regions and use the graph matrix to train the GCN. The full pipeline of the proposed network is shown in Figure 1.

Figure 1

Figure 1. Proposed pipeline for GCN classification. The upper steps illustrate the cortical gray matter regions segmentation from T1w-MRI and parcellation using three atlases, the region feature extraction (thickness) and its vector values. The bottom steps describe the graph construction followed by the GCN classification network. Four threshold levels are applied on graphs (0, 60, 70, 80%), leading to four graphs per atlas. In summary, 12 networks are trained separately (3 atlases, 4 threshold levels) on 660 scans.

2.2.1 Feature extraction

In order to obtain features of cortical regions, the brain GM was first segmented (Figure 1), the cortical surface was parcellated into N regions using a dedicated brain atlas. Morphological features of each region can thus be calculated and represented as a vector of values.

Automatic segmentation of GM and cortical surface reconstruction were performed on all T1w-MRI using FreeSurfer v6.0.0 image analysis suite (Fischl, 2012), a neuroimaging toolkit for human brain analysis. This includes 31 preprocessing steps such as motion correction, intensity normalization, skull stripping and non-linear registration. All FreeSurfer processing steps were done on the Virtual Imaging Platform (Glatard et al., 2013), the 1,001 images were processed simultaneously and it took 6 h per image on average. The input T1w-MRI brain was resampled onto an average brain (fsaverage) generated from 40 subjects using the Buckner dataset. The Buckner dataset is a subset of a large structural dataset created by the Buckner Lab, it was specifically selected for the intermediate processing step of FreeSurfer. The obtained cortical surface consists of a mesh with 163842 vertices. All outputs were smoothed at full-width/half-max (FWHM) value of 10 mm.

These smoothed outputs are then parcellated. In order to study the impact of the number of cortical regions N, three different atlases were used for brain parcellation and graph generation, namely the Desikan-Killiany (Desikan et al., 2006) with N = 68 regions, Destrieux (Destrieux et al., 2010) with N = 148 regions and Glasser (Glasser et al., 2016) with N = 360 regions. The cortex parcellation of the average template brain is demonstrated in Figure 2.

Figure 2

Figure 2. Representation of the cortical parcellation of the three atlases: (A) Desikan-Killiany; (B) Destrieux; (C) Glasser.

More specifically, a region number i (with i = 1…N) was assigned to each vertex according to the atlas chosen by registering the patient's brain mesh to the template brain. As mainly used in brain connectivity studies (reference), the cortical thickness was chosen as the morphological feature and calculated for each region.

Since each region feature is a vector of thousands of elements on average, we summarize the distribution of the thickness values within one region i by a vector x_i ∈ ℝ⁴ containing the mean value, the standard deviation, the skewness, and the kurtosis: x_i = (μ_i, σ_i, γ_i, k_i). We called the feature matrix X ∈ ℝ^N×4 the combination of the N vectors x_i.

2.2.2 Age and gender normalization

Since women and men have different cortical atrophy manifestations with age (Narayana et al., 2013), we proposed two methods to normalize x_i: a proportional normalization and a residual normalization. For the proportional normalization, we first calculated the average cortical thickness of the whole brain of all MS patients and healthy subjects from the IXI dataset. Then, we performed a linear regression between age and cortical thickness as:

\begin{array}{l} C t h = a * a g e + b \end{array}

where Cth is the average cortical thickness of one person. Two different sets of coefficients (a_f, b_f) and (a_m, b_m) were calculated for healthy women and men respectively. If the slope represents the normal aging effect, we applied this slope to the MS patients group to correct the effect of age and sex. All MS patients' measurements were brought to the age of 20. Thus, the corrected thickness Cth₂₀ of a patient can be expressed as:

\begin{array}{l} C t h_{20} = a * 20 + b^{'} = a * 20 + C t h - a * a g e \end{array}

Therefore, the adjusted feature vector $x_{i}^{'}$ of each region with proportional correction with coefficient $α = \frac{C t h_{20}}{C t h}$ can be represented as: $x_{i}^{'} = (α μ_{i}, α σ_{i}, γ_{i}, k_{i})$ . The modified vectors were then used to calculate the new proportional normalized graphs following the same procedure as described above.

Inspired by the work of Eshaghi et al. (2016), we also proposed to adjust each cortical region for the effect of age and gender. For every brain region i of the healthy cohort, we fitted a linear regression where age was the regressor and the four attributes of the region were dependent variables. Therefore, for the four values of the feature vector, we have:

\begin{array}{l} μ_{i} = a_{i}^{(μ)} * a g e + b_{i}^{(μ)} \end{array}

\begin{array}{l} σ_{i} = a_{i}^{(σ)} * a g e + b_{i}^{(σ)} \end{array}

\begin{array}{l} γ^{(i)} = a_{i}^{(γ)} * a g e + b_{i}^{(γ)} \end{array}

\begin{array}{l} k_{i} = a_{i}^{(k)} * a g e + b_{i}^{(k)} \end{array}

We then estimated the residual of each variable that was inexplicable by the healthy linear regression model: $r_{i}^{(μ)} = {\hat{μ}}_{i} - μ_{i} = a_{i}^{(μ)} * a g e + b_{i}^{(μ)} - μ_{i}$ for example in the case of average cortical thickness measure. The residual feature vector of one region became: $r i = (r_{i}^{(μ)}, r_{i}^{(σ)}, r_{i}^{(γ)}, r_{i}^{(k)})$ . The residual vectors were also used to calculate the residual graphs that were further used in the GCN classification. Notice that these regressions are performed for both males and females separately.

2.2.3 Graph generation

A graph G is a mathematical representation of a complex system and is defined by a collection of nodes V and edges E between pairs of nodes with the possibility to assign a weighted value w for each edge:

\begin{array}{l} G = (V, E, w) \end{array}

Therefore, a brain can be described as a graph, with each brain region being represented by a node x_i, or $x_{i}^{'}$ and ri in case of normalization. Here, we associate four attributes (mean value μ, standard deviation σ, skewness γ, and kurtosis k) to each node. The graph representation of brain morphological connectivity was defined as the dissimilarity across brain regions. We propose to compare two distances to calculate the region-wise connections. The first one is the Mahalanobis distance d_M:

\begin{array}{l} d_{M} (x_{i}, x_{j}) = {({(x_{i} - x_{j})}^{T} S^{- 1} (x_{i} - x_{j}))}^{1 / 2} \end{array}

with S the covariance matrix of samples x_i and x_j.

The second studied distance is the Taxicab (or Manhattan) distance d_T:

\begin{array}{l} d_{T} (x_{i}, x_{j}) = \sum_{k = 1}^{4} | x_{i}^{k} - x_{j}^{k} | \end{array}

where $x_{i}^{k}$ is the kth dimension of the vector x_i.

The adjacent matrix A ∈ ℝ^N×N is computed for all distances between x_i and x_j: A(i, j)_X = d(x_i, x_j).

Using both X and A, we generate weighted and undirected graphs. The edge weights are given by the adjacent matrix.

Thresholds were used to counteract the impact of the redundant information given by the brain adjacent matrix. A fixed rejection quantile τ is used as a threshold value to remove the lowest distances and thus maintains the same graph density for each subject.

For graph availability, the reader can refer to Section 5.

2.2.4 GCN classification

Graph convolutional networks were used as they exploit input data through graph structure. As a dimension reduction tool, graph representation can largely reduce input data size from 12 MB to 130 KB on average in our case. Intuitively speaking, brain network topology is an alternative method of image analysis. Sporns (2018) have confirmed the importance of graph theory for the understanding of brain structure. Based on our previous results using brain structural graph analysis (Marzullo et al., 2019), we explore a new approach using brain morphological graph.

For the graph G = (V, E, w), the algorithm takes the adjacent matrix A and the associated node features matrix X as input. The layer-wise propagation rule is defined as follows (Kipf and Welling, 2017):

\begin{array}{l} H^{(l + 1)} = σ ({\tilde{D}}^{- \frac{1}{2}} Ã {\tilde{D}}^{- \frac{1}{2}} H^{(l)} W^{(l)}) \end{array}

Where Ã is the sum of A with the identity matrix I, $\tilde{D}$ is the corresponding diagonal degree matrix and the adjacent matrix is normalized by the step ${\tilde{D}}^{- \frac{1}{2}} Ã {\tilde{D}}^{- \frac{1}{2}}$ . W^l represents the trainable weight over each layer. The RELU activation function σ(x) = max(0, x) is chosen for σ.

2.2.5 GCN architecture

The proposed GCN classification model was composed of 3 GCN layers followed by a global mean pool layer with a dropout rate of 0.3 to prevent overfitting. The proposed structure is shown in Figure 3. This led to 8835 trainable parameters.

Figure 3

Figure 3. The overall structure of the proposed graph-based convolutional network. N is the number of regions according to the atlas chosen. Four represents the four elements of the feature vector per region. Input of the network consists of one adjacency matrix (N*N) and one feature matrix (N*4) per patient. The network starts with three graph convolutional layers of 64 filters each, then gathered into a vector using a global mean pooling. Two fully connected layers are used to obtain the classification into three classes (RR, PP, SP).

2.3 Classification using 3D convolutional neural network

To validate our GCN against classically used CNN architectures, we implemented a 3D-CNN architecture using a similar architecture by replacing graph convolutional layers with classical convolutional layers. The output of a filter of a 3D convolutional layer with kernel W of size (f_hxf_wxf_dxf_c) can be expressed as follows:

\begin{array}{l} z_{i, j, k} = b + \sum_{p = 0}^{f_{h} - 1} \sum_{q = 0}^{f_{w} - 1} \sum_{r = 0}^{f_{d} - 1} \sum_{c = 0}^{f_{c} - 1} x_{i^{'}, j^{'}, k^{'}, c} . W_{p, q, r, c} \end{array}

with

\begin{array}{r} i^{'} = i + p - ⌊ f_{h} / 2 ⌋ a n d j^{'} = j + q - ⌊ f_{w} / 2 ⌋ a n d \\ k^{'} = k + r - ⌊ f_{d} / 2 ⌋ \end{array}

Therefore, a 3D-CNN model was constituted of three 3D convolutional layer sets, including a 3D convolutional layer (kernel of 3 × 3 × 3), followed by a max pooling layer (subsampling spatial support by 2 × 2 × 2) and then a batch normalization layer. The tensor is then flattened and used as input of two consecutive fully connected layers of 128 and 2 neurons, respectively. These made up of 22,548,122 trainable parameters of the CNN network.

Before using a deep neural network to classify the 3D MRI, all scans were pre-processed using the brain extraction tool (BET) of FMRIB Software Library in order to eliminate non-brain structures. Then, the 3D-CNN image classification network predicts the class (RR, SP, or PP) of the T1w image of a patient's brain used as input. The architecture used is summarized in Figure 4. To prevent over-fitting, a dropout (Srivastava et al., 2014) rate of 0.3 is applied after the flattening layer.

Figure 4

Figure 4. The overall structure of the proposed 3D-CNN network. It starts with three convolutional layers of 16, 32, and 64 filters respectively, each convolution layer followed by a max pooling layer. The tensor is then flattened and two fully connected layers are used to obtain the classification into three classes (RR, PP, SP).

As it is known that CNN classification needs numerous data to perform well, we compared its performance with the classification results using a graph-based neural network.

2.4 Experimental settings

According to our previous study using brain morphological connectivity (Barile et al., 2022), 4 threshold levels τ ∈ {0, 0.6, 0.7, 0.8} were applied to the adjacent matrix computed using the 3 atlases and the 2 distances. Thus, each GCN classification is carried out in 72 different ways, and one for CNN.

For both network architectures, the MS images were divided into two datasets: approximately 80% of scans used for training and 20% of the scans used only for testing, i.e., to evaluate the performance of networks. To avoid the impacts of repetition of the same patient, we carefully grouped all time points of one patient in the same train or test set using the stratified group k-fold technique. The exams of the same patient won't be in the train set and test set simultaneously.

The precision, recall, and the F1-score were used to assess both algorithms' effectiveness. To provide a more thorough assessment of the two models, cross-validation using five-folds was performed.

From hyperparameters manual optimization, we use the Adam optimizer with a learning rate of 0.001 for GCN and the Stochastic Gradient Descent optimizer with a learning rate of 0.001 for 3D-CNN.

GCN was trained on one GPU (NVIDIA GeForce RTX 3060), and CNN was trained on one NVIDIA RTX A5000. All experiments were done using PyTorch.

For code availability, the reader can refer to Section 5.

3 Results

In this section, we first present the GCN classification tasks and then the results without age and gender normalization to allow the comparison with 3D-CNN classification results. Second, the GCN classification results with age and sex normalization are presented.

3.1 Clinical forms classification tasks

Six classification tasks related to clinical needs were implemented: (1) RR vs. PP; (2) RR vs. SP; (3) PP vs. SP; (4) RR vs. PP+SP; (5) RR vs. PP vs. SP; (6) MS vs. HC. For this last task, the train set consists of 619 MS scans and 290 randomly selected scans from the IXI dataset. For the test set, 42 scans were selected from the MS group (24 RRMS, 10 PPMS, 8 SPMS) along with the 21 HC-AMSEP scans from the same study and 24 HC-IXI scans from the IXI dataset. For the other tasks, only the MS patients dataset was used. A five-fold stratified cross-validation scheme was applied for all tasks.

3.2 GCN classification

3.2.1 Without normalization

F1-score of the three atlases (Desikan-Killiany, Destrieux, Glasser), four rejection rates and two distance calculation approaches were compared as shown in Tables 3, 4. Precision and Recall measures of corresponding experiments were included in Supplementary material.

Table 3

Table 3. F1-scores (mean value ± standard deviation) of clinical forms classification using GCN based on Mahalanobis graph for three parcellation atlases and four threshold levels τ.

Table 4

Table 4. F1-scores (mean value ± standard deviation) of clinical forms classification using GCN based on Taxicab graph for three parcellation atlases and four threshold levels τ.

Comparing classification results task by task, the best result was always found using Mahalanobis instead of Taxicab distance for the dissimilarity measurement. The classification of RR vs. PP gave the best result when an 80% rejection rate was applied to the Destrieux atlas with an F1-score of 72.5%. The separation between RR and SP patients provides an F1-score of 72.2% using an 80% rejection rate on the Glasser atlas. By grouping the PP and SP in a neurodegenerative group, the binary classification of RR vs. PP+SP reached an F1-score of 68.9%. The best three classes classification was obtained using an 80% rejection rate on the Glasser atlas with an F1-score of 64.2%. The optimal PP/SP splitting leading to an F1-score of 53.1% was obtained using the Glasser atlas and a rejection rate of 70%. Finally, all GCN classification networks can achieve a great result on MS vs. HC task (100% F1-score on the predefined unseen test dataset). Atlas-wise speaking, for Mahalanobis distance measurement, a 60% rejection rate gave the best result on the Desikan-Killiany atlas, while an 80% rejection rate yielded the best outcome on both Destrieux and Glasser atlases. For Taxicab distance measurement, a 70% rejection rate gave the best result on the Desikan-Killiany atlas, the graph without rejection generated the best on the Destrieux atlas, and a 60% rejection rate achieved the best performance on the Glasser atlas.

3.2.2 With normalization

In order to correct for age and gender, two normalization methods have been carried out. The results obtained using three atlases and two distance methods are shown in Tables 5–8. The best RR/PP separation can be found when the residual normalization was carried out to the Desikan-Killiany atlas with a threshold of 80%. The proportional normalization method applied to the Glasser atlas with an 80% rejection rate generated the best results of RR vs. SP, RR vs. PP+SP, and RR vs. PP vs. SP with F1-scores 71.1, 67.8, and 62.1% respectively. The best result of PP/SP classification can be found in residual normalization on the Desikan-Killiany atlas (rejection rate = 0) with an F1-score of 64.2%. For the proportional normalization method, the best overall result can be found using the Glasser atlas with 80% threshold. The best overall result for the residual normalization method was carried out by the same atlas with 60% threshold.

Table 5

Table 5. F1-scores (mean value ± standard deviation) of clinical forms classification using GCN based on Mahalanobis age-gender proportional adjusted graph for three parcellation atlases and four threshold levels τ.

Table 6

Table 6. F1-scores (mean value ± standard deviation) of clinical forms classification using GCN based on Taxicab age-gender proportional adjusted graph for three parcellation atlases and four threshold levels τ.

Table 7

Table 7. F1-scores (mean value ± standard deviation) of clinical forms classification using GCN based on Mahalanobis age-gender residual adjusted graph for three parcellation atlases and four threshold levels τ.

Table 8

Table 8. F1-scores (mean value ± standard deviation) of clinical forms classification using GCN based on Taxicab age-gender residual adjusted graph for three parcellation atlases and four threshold levels τ.

3.3 Comparing CNN and GCN

The results of the comparison between 3D-CNN classification and GCN without normalization are shown in Table 9. Comparing RR individually with PP and SP, 3D-CNN returned an F1-score of 72.1% and 69.7% respectively, which are slightly lower than GCN results. The separation between the RR and PP+SP groups on the F1-score was greater than that of the GCN technique at 70.7%. The 3D-CNN method generated a similar result on the multi-class classification task with an F1-score of 63.9%. Finally, 3D-CNN achieved a lower result than GCN for the PP vs. SP partition with a 49.5% F1-score. Overall, the best results were obtained using GCN over 3D-CNN while implementing an 80% rejection rate on the Glasser atlas and the Mahalanobis distance.

Table 9

Table 9. Best F1-scores (mean value ± standard deviation) of clinical forms classification using 3D-CNN and GCN [three datasets: non-normalized (NN) graph, proportional normalized (PN) graph, and residual normalized (RN) graph].

4 Discussion

Graph Convolutional Network is an innovative approach for the classification of clinical forms in multiple sclerosis. While functional and structural connectivities were previously used and provided good results (Ktena et al., 2018; Marzullo et al., 2019), they were constrained by the small size of the database available in clinical routine. To overcome this limitation, one approach is to develop a morphological connectivity method requiring only anatomical T1w MRI for brain studies. In order to test such a hypothesis, we developed a complete pipeline using morphological connectivity and graph convolutional networks. To our knowledge, this is the first attempt to use this approach for the classification of MS clinical forms. Brain graphs were established based on Desikan-Killiany, Destrieux, and Glasser atlases, for GM parcellation. Rejection rates of 60, 70, and 80% were applied to connectivity graphs to preserve solely main differences across brain regions. Morphological connectivity data were fed into GCN while 3D brain images were loaded in 3D-CNN to compare the two classification approaches.

First, non-normalized GCN was compared to 3D-CNN, which was unable to normalize age or gender based on image data. Generally speaking, GCN has outperformed 3D-CNN on 4 out of 5 predefined tasks when the threshold/atlas pair was carefully chosen. For the task RR vs. PP+SP, the F1-score generated by GCN was slightly weaker than the result of 3D-CNN with a 1.8 percentage point. However, it requires more computation resources to train a simple 3 convolutional layers network. In our case, GCN only took 5 h for network training while achieving a better result than 3D-CNN which took more than a week on the same computer. The proposed pipeline has gained in computation time thanks to its dimension-reduction ability. Instead of working on 256 × 256 × 256 volumetric images, the graph approach allowed us to use the adjacent matrix of size 360 × 360 in the most complex case.

The comparison of the two classification networks has also given us insights into the medical image processing field. In general, clinical image classification tasks can be easily affected by acquisition changes (manufacturers, centers, MR field, etc.). In particular, CNNs are sensitive to intensity changes with the use of convolution layers. To address this problem, CNN classification networks must be trained on a large number of images that represent both the variability of the acquisition process and the diversity of the patients. Since most medical datasets are composed of a small number of patients, CNN doesn't usually generate well due to its data-thirsty characteristic. In contrast, GCN can be trained on brain graph features that are less sensitive to image intensity changes. Indeed, cortical thinning is an important biomarker of the MS neurodegenerative process that is visible in T1w images (Narayana et al., 2013). With a brain graph generated from cortical thickness, these small changes in the brain were well-captured by the proposed GCN pipeline. Our pipeline returns a clearer relation between brain atrophy and clinical forms, compared to the 3D-CNN approach, which could be improved by using Grad-CAM (Selvaraju et al., 2020) or similar methods.

Second, normalized GCN was used to classify MS clinical forms. This is essential for clinical forms classification. Binary and multi-class classifications were performed between the three clinical forms (RR, PP, SP). The result of normalized GCN showed that GCN can return satisfactory results on binary classification between MS clinical courses. More specifically, the automatic separation of inflammatory forms from neurodegenerative forms, RR vs. SP and PP groups, has been carried out. The best F1-score was found when separating RR from PP patients, and a good result was also obtained in the RR/SP classification task. On one hand, RR patients present relapses corresponding to focal inflammatory processes. On the other hand, SP and PP patients share the experience of progressive clinical evolution, associated or not with inflammatory activity, resulting from degenerative phenomena of the gray matter. Thus, by grouping SP and PP patients, an adequate result was found when the finest atlas (Glasser) was applied.

The three-class classification is a difficult multi-class categorization task which is further worsened by the imbalanced data distribution. Nevertheless, a promising result was obtained using the Glasser parcellation atlas with a high rejection rate, indicating the advantage of dimension reduction when facing complex brain data such as our case.

Classification of SP and PP was the hardest binary classification task to be accomplished. this is partially due to the small amount of PP cases. Indeed, SP and PP are two neurodegenerative forms sharing similar pathological processes. Moreover, PP is a starting clinical form that can be divided into subclasses depending on the level of disability. With an EDSS score ranging from 2 to 7.5, our PP population is composed of both early and late stages of the disease. The latter ones are more relevant and probably more similar to SP patients as shown in the disease duration at scan. This large variability of disability scores reflects different progressions of the disease and thus different stages of brain alterations. Thus, the SP and some PP patients may share MRI phenotypes which makes the classification difficult, and perhaps even unnecessary.

Achieving good results, the binary classification of HC vs. MS patients was not our primary goal. In general, MS patients can be easily distinguished from healthy subjects in both clinical and imaging ways. In our experience, an F1-score of 100% was observed in all GCN outputs, meaning that all combinations of atlases and thresholds provided enough information for the classification task. Similar results were obtained in the previous work of Marzullo et al. (2019) on brain structural connectivity. Marzullo et al. (2019) has performed the test of HC vs. CIS+RR (24/253) and the test of HC vs. SP+PP (24/325) and achieved the best result (F-measure = 1), demonstrating an evident difference between HC and MS brain morphological and structural networks, respectively.

To further compare our work with other studies, we analyzed the results obtained from Marzullo et al. (2019) and Barile et al. (2022). Apart from the binary classification of HC vs. MS patients, Marzullo et al. (2019) have also tested the separation between early and progressive forms of MS (CIS+RR vs. SP+PP: 253/325) obtaining the highest F-measure at 0.99. Since CIS subjects are included in the RR group in our study, we can compare the previous result with our classification task of RR vs. SP+PP (299/361), leading to an F1-score of 0.678. This strong difference in performance demonstrates that white matter inflammation introduced significant information that facilitates the classification of clinical forms in MS. In contrast, the work of Barile et al. (2022) was performed on GM morphological connectivity. Three similar tasks were reported: (1) CIS+RR vs. PP; (2) CIS+RR vs. SP; (3) CIS+RR vs. SP+PP. By employing the same pipeline of graph generation and atlas (Glasser) and an ensemble of machine learning methods, they have obtained an F1-score of 0.661 (0.12), 0.654 (0.12), 0.648 (0.11) for the three tasks, respectively. In our study, we obtained better F1 scores of 0.671 (0.117), 0.711 (0.107), 0.678 (0.063) for the same tasks. This gain in performance (higher F1-score and reduced standard deviation) demonstrated the interest of brain graph convolutional networks.

Taxicab distance is an L1-norm metric that is generally preferred over Euclidean distance for high-dimension data analysis (Aggarwal et al., 2001). However, since every dimension (mean, standard deviation, skewness, kurtosis) has the same attribution in the calculation of Taxicab distance, our feature vector of four dimensions could not have the same impact on the final value due to the difference in magnitude. In such cases, Mahalanobis distance can overcome the problem while removing redundant information from correlated variables. Since distance measurement was included as edge weight in the input data of GCN, the choice can surely affect the final result. Thus, it is not surprising to observe a better result with Mahalanobis distance supporting the graph generation.

Finally, this work presents several methodological limitations. First the classification results were biased by the class imbalance of the database and the insufficient number of patients. Since the current database consists of a series of multiple MR scans per patient, it does not cover enough variability of the disease, meaning a lack of global vision of the disease. Hence, even if we carefully stop the network training before overfitting, it is hard to extract sufficient features of each MS clinical course to classify an unseen patient by the proposed network, resulting in bad output in some cases. Nevertheless, our cohort study had no bias related to the protocol acquisition, which is unique, guaranteeing the homogeneity of the data. In contrast, a multi-center study is more variable and therefore requires a precise study and corrections of bias.

5 Conclusion

Although studies on MS mainly focus on white matter and lesion analysis, morphological change in gray matter is a non-negligible aspect of the disease. A full pipeline was proposed in this study for the classification of MS clinical forms. It starts from automatic GM segmentation and surface parcellation, followed by GM thickness analysis using three different granularity of atlases, two different distance measurements, and two different age-gender normalization methods. Thus, a brain resulted in a morphological connectivity graph accompanied by a feature matrix per graph. Four rejection rates corresponding to noise elimination were applied to the graph. A graph convolutional network was performed on these graphs to exploit the hidden information behind GM morphological features. In parallel, a classic 3D convolutional neural network was applied to the brain MRI directly for comparison. The best results were generated by proportional GCN that trained on Glasser parcellation-based graphs with Mahalanobis distance measurement and 80% rejection rate. In future studies, to fully exploit its capacity for clinical image analysis, our method can be implemented on a larger database to predict patients' disease evolution and obtain the correlation between images' information and patients' disability. However, to work with such a heterogeneous study will require developing more advanced graph networks (i.e., with attention) to limit biases such as gender, age and acquisition systems.

Data availability statement

The sources code and graph data supporting the conclusions of this article can be found at: https://gitlab.in2p3.fr/thomas.grenier/msgcn-classification.

Ethics statement

The studies involving humans were approved by Local Ethics Committee (CPP Sud-Est IV) and French National Agency for Medicine and Health Products Safety (ANSM). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

EC: Conceptualization, Investigation, Software, Writing—original draft, Data curation, Formal analysis, Methodology, Validation, Visualization, Writing—review & editing. BB: Software, Writing—review & editing, Data curation. FD-D: Funding acquisition, Supervision, Writing—review & editing, Validation, Visualization. TG: Methodology, Supervision, Validation, Writing—review & editing, Conceptualization, Formal analysis, Investigation, Visualization. DS-M: Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Validation, Writing—review & editing, Resources.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. EC was founded by the LABEX PRIMES (ANR-11-LABX-0063) of Université de Lyon, within the program “Investments for the Future” operated by the French National Research Agency (ANR).

Acknowledgments

Part of the results presented in this work were achieved using the FreeSurfer application (Fischl, 2012) through the Virtual Imaging Platform (Glatard et al., 2013), which uses the resources provided by the biomed virtual organization of the EGI infrastructure. This work was done within the framework of Observatoire Français de la Sclérose en Plaques (OFSEP), a national cohort supported by a grant provided by the French State and handled by the French National Research Agency (ANR) within the framework of the “Investments for the Future” program, under the reference ANR-10-COHO-002.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnins.2023.1268860/full#supplementary-material

References

Aggarwal, C. C., Hinneburg, A., and Keim, D. A. (2001). “On the surprising behavior of distance metrics in high dimensional space,” in Database Theory - ICDT 2001 (Berlin; Heidelberg). doi: 10.1007/3-540-44503-X_27

Crossref Full Text | Google Scholar

Barile, B., Ashtari, P., Stamile, C., Marzullo, A., Maes, F., Durand-Dubief, F., et al. (2022). Classification of multiple sclerosis clinical profiles using machine learning and grey matter connectome. Front. Robot. AI 9, 926255. doi: 10.3389/frobt.2022.926255

PubMed Abstract | Crossref Full Text | Google Scholar

Bronstein, M. M., Bruna, J., Cohen, T., and Veličković, P. (2021). Geometric deep learning: grids, groups, graphs, geodesics, and gauges. arXiv preprint arXiv:2104.13478. doi: 10.48550/arXiv.2104.13478

Crossref Full Text | Google Scholar

Corps, J., and Rekik, I. (2019). Morphological brain age prediction using multi-view brain networks derived from cortical morphology in healthy and disordered participants. Sci. Rep. 9, 9676. doi: 10.1038/s41598-019-46145-4

PubMed Abstract | Crossref Full Text | Google Scholar

Desikan, R. S., Segonne, F., Fischl, B., Quinn, B. T., Dickerson, B. C., Blacker, D., et al. (2006). An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage 31, 968–980. doi: 10.1016/j.neuroimage.2006.01.021

PubMed Abstract | Crossref Full Text | Google Scholar

Destrieux, C., Fischl, B., Dale, A., and Halgren, E. (2010). Automatic parcellation of human cortical gyri and sulci using standard anatomical nomenclature. Neuroimage 53, 1–15. doi: 10.1016/j.neuroimage.2010.06.010

PubMed Abstract | Crossref Full Text | Google Scholar

Dicks, E., van der Flier, W. M., Scheltens, P., Barkhof, F., and Tijms, B. M. (2020). Single-subject gray matter networks predict future cortical atrophy in preclinical Alzheimer's disease. Neurobiol. Aging 94, 71–80. doi: 10.1016/j.neurobiolaging.2020.05.008

PubMed Abstract | Crossref Full Text | Google Scholar

Durand-Dubief, F., Belaroussi, B., Armspach, J. P., Dufour, M., Roggerone, S., Vukusic, S., et al. (2012). Reliability of longitudinal brain volume loss measurements between 2 sites in patients with multiple sclerosis: comparison of 7 quantification techniques. Am. J. Neuroradiol. 33, 1918–1924. doi: 10.3174/ajnr.A3107

PubMed Abstract | Crossref Full Text | Google Scholar

Eitel, F., Soehler, E., Bellmann-Strobl, J., Brandt, A. U., Ruprecht, K., Giess, R. M., et al. (2019). Uncovering convolutional neural network decisions for diagnosing multiple sclerosis on conventional MRI using layer-wise relevance propagation. Neuroimage Clin. 24, 102003. doi: 10.1016/j.nicl.2019.102003

PubMed Abstract | Crossref Full Text | Google Scholar

Eshaghi, A., Marinescu, R. V., Young, A. L., Firth, N. C., Prados, F., Cardoso, M. J., et al. (2018). Progression of regional grey matter atrophy in multiple sclerosis. Brain 141, 1665–1677. doi: 10.1093/brain/awy088

PubMed Abstract | Crossref Full Text | Google Scholar

Eshaghi, A., Wottschel, V., Cortese, R., Calabrese, M., Sahraian, M. A., Thompson, A. J., et al. (2016). Gray matter MRI differentiates neuromyelitis optica from multiple sclerosis using random forest. Neurology 87, 2463–2470. doi: 10.1212/WNL.0000000000003395

PubMed Abstract | Crossref Full Text | Google Scholar

Fischl, B. (2012). Freesurfer. Neuroimage 62, 774–781. doi: 10.1016/j.neuroimage.2012.01.021

Crossref Full Text | Google Scholar

Flaus, A., Deddah, T., Reilhac, A., Leiris, N. D., Janier, M., Merida, I., et al. (2022). Pet image enhancement using artificial intelligence for better characterization of epilepsy lesions. Front. Med. 9, 1042706. doi: 10.3389/fmed.2022.1042706

PubMed Abstract | Crossref Full Text | Google Scholar

Folego, G., Weiler, M., Casseb, R. F., Pires, R., and Rocha, A. (2020). Alzheimer's disease detection through whole-brain 3D-CNN MRI. Front. Bioeng. Biotechnol. 8, 534592. doi: 10.3389/fbioe.2020.534592

PubMed Abstract | Crossref Full Text | Google Scholar

Glasser, M. F., Coalson, T. S., Robinson, E. C., Hacker, C. D., Harwell, J., Yacoub, E., et al. (2016). A multi-modal parcellation of human cerebral cortex. Nature 536, 171–178. doi: 10.1038/nature18933

PubMed Abstract | Crossref Full Text | Google Scholar

Glatard, T., Lartizien, C., Gibaud, B., Silva, R. F. D., Forestier, G., Cervenansky, F., et al. (2013). A virtual imaging platform for multi-modality medical image simulation. IEEE Trans. Med. Imaging 32, 110–118. doi: 10.1109/TMI.2012.2220154

PubMed Abstract | Crossref Full Text | Google Scholar

Goodin, D. S. (2014). “Chapter 11: The epidemiology of multiple sclerosis: insights to disease pathogenesis,” in Multiple Sclerosis and Related Disorders, volume 122 of Handbook of Clinical Neurology, ed D. S. Goodin (Elsevier), 231–266. doi: 10.1016/B978-0-444-52001-2.00010-8

PubMed Abstract | Crossref Full Text | Google Scholar

Guo, Y., Nejati, H., and Cheung, N. M. (2017). “Deep neural networks on graph signals for brain imaging analysis,” in 2017 IEEE International Conference on Image Processing (ICIP) (Beijing). doi: 10.1109/ICIP.2017.8296892

Crossref Full Text | Google Scholar

Hagmann, P., Kurant, M., Gigandet, X., Thiran, P., Wedeen, V. J., Meuli, R., et al. (2007). Mapping human whole-brain structural networks with diffusion MRI. PLoS ONE 2, e597. doi: 10.1371/journal.pone.0000597

PubMed Abstract | Crossref Full Text | Google Scholar

Hawkins, R., Shatil, A. S., Lee, L., Sengupta, A., Zhang, L., Morrow, S., et al. (2020). Reduced global efficiency and random network features in patients with relapsing-remitting multiple sclerosis with cognitive impairment. Am. J. Neuroradiol. 41, 449–455. doi: 10.3174/ajnr.A6435

PubMed Abstract | Crossref Full Text | Google Scholar

Huang, Y., Xu, J., Zhou, Y., Tong, T., Zhuang, X., and the Alzheimer's Disease Neuroimaging Initiative (ADNI) (2019). Diagnosis of Alzheimer's disease via multi-modality 3D convolutional neural network. Front. Neurosci. 13, 509. doi: 10.3389/fnins.2019.00509

PubMed Abstract | Crossref Full Text | Google Scholar

Jutten, K., Mainz, V., Gauggel, S., Patel, H. J., Binkofski, F., Wiesmann, M., et al. (2019). Diffusion tensor imaging reveals microstructural heterogeneity of normal-appearing white matter and related cognitive dysfunction in glioma patients. Front. Oncol. 9, 536. doi: 10.3389/fonc.2019.00536

PubMed Abstract | Crossref Full Text | Google Scholar

Kipf, T. N., and Welling, M. (2017). “Semi-supervised classification with graph convolutional networks,” in 5th International Conference on Learning Representations, ICLR 2017 (Toulon).

Google Scholar

Kocevar, G., Stamile, C., Hannoun, S., Cotton, F., Vukusic, S., Durand-Dubief, F., et al. (2016). Graph theory-based brain connectivity for automatic classification of multiple sclerosis clinical courses. Front. Neurosci. 10, 478. doi: 10.3389/fnins.2016.00478

PubMed Abstract | Crossref Full Text | Google Scholar

Kong, Y., Gao, J., Xu, Y., Pan, Y., Wang, J., and Liu, J. (2019). Classification of autism spectrum disorder by combining brain connectivity and deep neural network classifier. Neurocomputing 324, 63–68. doi: 10.1016/j.neucom.2018.04.080

Crossref Full Text | Google Scholar

Ktena, S. I., Parisot, S., Ferrante, E., Rajchl, M., Lee, M., Glocker, B., et al. (2018). Metric learning with spectral graph convolutions on brain connectivity networks. Neuroimage 169, 431–442. doi: 10.1016/j.neuroimage.2017.12.052

PubMed Abstract | Crossref Full Text | Google Scholar

Leclerc, S., Smistad, E., Pedrosa, J., Ostvik, A., Cervenansky, F., Espinosa, F., et al. (2019). Deep learning for segmentation using an open large-scale dataset in 2d echocardiography. IEEE Trans. Med. Imaging 38, 2198–2210. doi: 10.1109/TMI.2019.2900516

PubMed Abstract | Crossref Full Text | Google Scholar

Lublin, F. D., Reingold, S. C., Cohen, J. A., Cutter, G. R., Sorensen, P. S., Thompson, A. J., et al. (2014). Defining the clinical course of multiple sclerosis: the 2013 revisions. Neurology 83, 278–86. doi: 10.1212/WNL.0000000000000560

Crossref Full Text | Google Scholar

MacDonald, D., Kabani, N., Avis, D., and Evans, A. C. (2000). Automated 3-d extraction of inner and outer surfaces of cerebral cortex from mri. Neuroimage 12, 340–356. doi: 10.1006/nimg.1999.0534

PubMed Abstract | Crossref Full Text | Google Scholar

Mahjoub, I., Mahjoub, M. A., Rekik, I., Weiner, M., Aisen, P., Petersen, R., et al. (2018). Brain multiplexes reveal morphological connectional biomarkers fingerprinting late brain dementia states. Sci. Rep. 8, 1–14. doi: 10.1038/s41598-018-21568-7

PubMed Abstract | Crossref Full Text | Google Scholar

Marti-Juan, G., Frias, M., Garcia-Vidal, A., Vidal-Jordana, A., Alberich, M., Calderon, W., et al. (2022). Detection of lesions in the optic nerve with magnetic resonance imaging using a 3d convolutional neural network. Neuroimage Clin. 36, 103187. doi: 10.1016/j.nicl.2022.103187

PubMed Abstract | Crossref Full Text | Google Scholar

Marzullo, A., Kocevar, G., Stamile, C., Durand-Dubief, F., Terracina, G., Calimeri, F., et al. (2019). Classification of multiple sclerosis clinical profiles via graph convolutional neural networks. Front. Neurosci. 13, 594. doi: 10.3389/fnins.2019.00594

PubMed Abstract | Crossref Full Text | Google Scholar

Mure, S., Grenier, T., Guttmann, C. R. G., Cotton, F., and Benoit-Cattin, H. (2016). “Classification of multiple sclerosis lesion evolution patterns a study based on unsupervised clustering of asynchronous time-series,” in 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI) (Prague), 1315–1319. doi: 10.1109/ISBI.2016.7493509

Crossref Full Text | Google Scholar

Muthuraman, M., Fleischer, V., Kolber, P., Luessi, F., Zipp, F., and Groppa, S. (2016). Structural brain network characteristics can differentiate cis from early rrms. Front. Neurosci. 10, 14. doi: 10.3389/fnins.2016.00014

PubMed Abstract | Crossref Full Text | Google Scholar

Narayana, P. A., Govindarajan, K. A., Goel, P., Datta, S., Lincoln, J. A., Cofield, S. S., et al. (2013). Regional cortical thickness in relapsing remitting multiple sclerosis: a multi-center study. Neuroimage Clin. 2, 120–131. doi: 10.1016/j.nicl.2012.11.009

PubMed Abstract | Crossref Full Text | Google Scholar

Polman, C. H., Reingold, S. C., Banwell, B., Clanet, M., Cohen, J. A., Filippi, M., et al. (2011). Diagnostic criteria for multiple sclerosis: 2010 revisions to the Mcdonald criteria. Ann. Neurol. 69, 292–302. doi: 10.1002/ana.22366

PubMed Abstract | Crossref Full Text | Google Scholar

Raamana, P. R., and Strother, S. C. (2018). graynet: single-subject morphometric networks for neuroscience connectivity applications. J. Open Source Softw. 3, 924. doi: 10.21105/joss.00924

Crossref Full Text | Google Scholar

Rimkus, C. M., Schoonheim, M. M., Steenwijk, M. D., Vrenken, H., Eijlers, A. J., Killestein, J., et al. (2019). Gray matter networks and cognitive impairment in multiple sclerosis. Multiple Scler. J. 25, 382–391. doi: 10.1177/1352458517751650

PubMed Abstract | Crossref Full Text | Google Scholar

Rocca, M. A., Valsasina, P., Meani, A., Pagani, E., Cordani, C., Cervellin, C., et al. (2021). Network damage predicts clinical worsening in multiple sclerosis: a 6.4-year study. Neurol. Neuroimmunol. NeuroInflam. 8, e1006. doi: 10.1212/NXI.0000000000001006

PubMed Abstract | Crossref Full Text | Google Scholar

Rubinov, M., and Sporns, O. (2010). Complex network measures of brain connectivity: uses and interpretations. Neuroimage 52, 1059–1069. doi: 10.1016/j.neuroimage.2009.10.003

PubMed Abstract | Crossref Full Text | Google Scholar

Selvaraju, R. R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., and Batra, D. (2020). Grad-CAM: visual explanations from deep networks via gradient-based localization. Int. J. Comput. Vis. 128, 336–359. doi: 10.1007/s11263-019-01228-7

Crossref Full Text | Google Scholar

Sporns, O. (2018). Graph theory methods: applications in brain networks. Dialog. Clin. Neurosci. 20, 111–121. doi: 10.31887/DCNS.2018.20.2/osporns

PubMed Abstract | Crossref Full Text | Google Scholar

Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., and Salakhutdinov, R. (2014). Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958. Available online at: https://dl.acm.org/doi/10.5555/2627435.2670313

Google Scholar

Steenwijk, M. D., Geurts, J. J., Daams, M., Tijms, B. M., Wink, A. M., Balk, L. J., et al. (2016). Cortical atrophy patterns in multiple sclerosis are non-random and clinically relevant. Brain 139(Pt 1), 115–126. doi: 10.1093/brain/awv337

PubMed Abstract | Crossref Full Text | Google Scholar

Thompson, A. J., Banwell, B. L., Barkhof, F., Carroll, W. M., Coetzee, T., Comi, G., et al. (2018). Diagnosis of multiple sclerosis: 2017 revisions of the mcdonald criteria. Lancet Neurol. 17, 162–173. doi: 10.1016/S1474-4422(17)30470-2

PubMed Abstract | Crossref Full Text | Google Scholar

Wargnier-Dauchelle, V., Grenier, T., Durand-Dubief, F., Cotton, F., and Sdika, M. (2023). A weakly supervised gradient attribution constraint for interpretable classification and anomaly detection. IEEE Trans. Med. Imaging 42, 3336–3347. doi: 10.1109/TMI.2023.3282789

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: multiple sclerosis, graph convolutional network, CNN, classification, brain morphological connectivity, gray matter thickness

Citation: Chen E, Barile B, Durand-Dubief F, Grenier T and Sappey-Marinier D (2024) Multiple sclerosis clinical forms classification with graph convolutional networks based on brain morphological connectivity. Front. Neurosci. 17:1268860. doi: 10.3389/fnins.2023.1268860

Received: 28 July 2023; Accepted: 18 December 2023;
Published: 18 January 2024.

Edited by:

Roberto Maffulli, Italian Institute of Technology (IIT), Italy

Reviewed by:

Fulvia Palesi, University of Pavia, Italy
Noemi Montobbio, University of Genoa, Italy

Copyright © 2024 Chen, Barile, Durand-Dubief, Grenier and Sappey-Marinier. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dominique Sappey-Marinier, ZG9taW5pcXVlLnNhcHBleS1tYXJpbmllckB1bml2LWx5b24xLmZy

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Multiple sclerosis clinical forms classification with graph convolutional networks based on brain morphological connectivity

1 Introduction

2 Materials and methods

2.1 MRI acquisition and data

2.2 Classification using graph-based convolutional network

2.2.1 Feature extraction

2.2.2 Age and gender normalization

2.2.3 Graph generation

2.2.4 GCN classification

2.2.5 GCN architecture

2.3 Classification using 3D convolutional neural network

2.4 Experimental settings

3 Results

3.1 Clinical forms classification tasks

3.2 GCN classification

3.2.1 Without normalization

3.2.2 With normalization

3.3 Comparing CNN and GCN

4 Discussion

5 Conclusion

Data availability statement

Ethics statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Publisher's note

Supplementary material

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good