- 1CREATIS (UMR 5220 CNRS & U1294 INSERM), Université Claude Bernard Lyon1, INSA-Lyon, Université de Lyon, Lyon, France
- 2Department of Electrical Engineering, KU Leuven, Leuven, Belgium
- 3BIP SpA, Milan, Italy
- 4Department of Mathematics and Computer Science, University of Calabria, Rende, Italy
- 5Hôpital Neurologique, Service de Neurologie, Hospices Civils de Lyon, Bron, France
- 6CERMEP–Imagerie du Vivant, Université de Lyon, Lyon, France
Purpose: The main goal of this study is to investigate the discrimination power of Grey Matter (GM) thickness connectome data between Multiple Sclerosis (MS) clinical profiles using statistical and Machine Learning (ML) methods.
Materials and Methods: A dataset composed of 90 MS patients acquired at the MS clinic of Lyon Neurological Hospital was used for the analysis. Four MS profiles were considered, corresponding to Clinical Isolated Syndrome (CIS), Relapsing-Remitting MS (RRMS), Secondary Progressive MS (SPMS), and Primary Progressive MS (PPMS). Each patient was classified in one of these profiles by our neurologist and underwent longitudinal MRI examinations including T1-weighted image acquisition at each examination, from which the GM tissue was segmented and the cortical GM thickness measured. Following the GM parcellation using two different atlases (FSAverage and Glasser 2016), the morphological connectome was built and six global metrics (Betweenness Centrality (BC), Assortativity (r), Transitivity (T), Efficiency (Eg), Modularity (Q) and Density (D)) were extracted. Based on their connectivity metrics, MS profiles were first statistically compared and second, classified using four different learning machines (Logistic Regression, Random Forest, Support Vector Machine and AdaBoost), combined in a higher level ensemble model by majority voting. Finally, the impact of the GM spatial resolution on the MS clinical profiles classification was analyzed.
Results: Using binary comparisons between the four MS clinical profiles, statistical differences and classification performances higher than 0.7 were observed. Good performances were obtained when comparing the two early clinical forms, RRMS and PPMS (F1 score of 0.86), and the two neurodegenerative profiles, PPMS and SPMS (F1 score of 0.72). When comparing the two atlases, slightly better performances were obtained with the Glasser 2016 atlas, especially between RRMS with PPMS (F1 score of 0.83), compared to the FSAverage atlas (F1 score of 0.69). Also, the thresholding value for graph binarization was investigated suggesting more informative graph properties in the percentile range between 0.6 and 0.8.
Conclusion: An automated pipeline was proposed for the classification of MS clinical profiles using six global graph metrics extracted from the GM morphological connectome of MS patients. This work demonstrated that GM morphological connectivity data could provide good classification performances by combining four simple ML models, without the cost of long and complex MR techniques, such as MR diffusion, and/or deep learning architectures.
1 Introduction
An important issue in neuroscience is the characterization of human brain structure and function, and their alteration in brain diseases. Today, the neurologist’s challenge is to define disease phenotypes based on their underpinning mechanisms and to predict the disease evolution. Multiple Sclerosis (MS), which is the most common chronic immune-mediated disabling neurological disease affecting the central nervous system (Steinman, 1996; Goodin, 2014), is primary classified according to clinical symptoms. In about 85% of cases, disease onset is characterized by a first acute episode, called Clinically Isolated Syndrome (CIS), it evolves to a Relapsing-Remitting course (RRMS) followed by a Secondary-Progressive course (SPMS) The remaining 15% of MS starts directly from a Primary-Progressive course (PPMS) (Ghasemi et al., 2016; Lublin et al., 2014; McDonald et al., 2001). However, the clinical course of the disease and the risk for developing permanent disability are very different from one patient to another (Scalfari et al., 2010; Lynch et al., 2005). MS is characterized by pathological processes including inflammation and demyelination, leading to lesions, predominantly in White Matter (WM) tissue, that can be detected by conventional T2-weighted MRI (Compston and Coles, 2008). Nevertheless, lesions are also present in Grey Matter (GM) as initially demonstrated by histochemical studies (Geurts and Barkhof, 2008; Calabrese et al., 2013), and more recently by high field MRI (Bruschi et al., 2020; Tallantyre et al., 2009). These findings, as well as the measurements of GM atrophy (Filippi, 2015; Durand-Dubief et al., 2012), have confirmed the neurodegenerative hypothesis in MS (Ghasemi et al., 2016; Steenwijk et al., 2016; Preziosa et al., 2017). Kuceyeski et al., 2018 showed that GM atrophy is associated with the highest prediction accuracy of the patient’s future processing speed in MS. More recently, cortical thickness has been recognized as an early marker of neurodegeneration in MS (Cruz-Gomez, 2021).
In the last decade, Artificial Intelligence (AI) approaches have been increasingly applied within the medical field, hoping to increase diagnostic performance and improve treatment. Specifically, Machine Learning (ML) is a data-driven approach which covers a very broad set of methods. Indeed, ML aims to extract possibly complex relations among available data and generate predictions for an event. A wide range of ML applications have been proposed in the literature. A systematic review of the applications of ML methods in autoimmune diseases is proposed by Stafford et al. (2020). In a recent review paper, Segato et al. (2020) highlighted that AI methods nowadays are among the most widely used analytical tools, while classical ML approaches, such as support vector machines and random forest, are still widely used. Notwithstanding, in the field of MS, ML approaches have often focused on automatic examination of MRI images to classify disease at the time of onset or to predict evolution of clinically isolated forms (Kaka et al., 2020; Afzal et al., 2020). Such an example is proposed by Jackson et al. (2020), where a Genetic Model of MS Severity (GeM-MSS) was used for the evaluation of MS disability progression considering a cohort of 426 MS patients and obtaining a Root Mean Squared Error (RMSE) of 0.46 and a correlation with MS-DSS score of 0.21. Kolčava et al. (2020) proposed a classification task for predicting a second clinical event in 64 MS patients with a CIS and based on a multicenter MRI dataset. Logistic regression and Cox proportional hazards regression models were used, obtaining a sensitivity and specificity score of 84% and 63% respectively. In Table 1 a tabular review of the most relevant studies of ML application in the field of MS is provided. For a thorough review of ML applications in MS we also refer to Vázquez-Marrufo et al. (2021) and Garavand et al. (2022). Notwithstanding, DL models are becoming increasingly more important in biomedical diagnostics. For example, for the task of Epileptic Seizures Detection, multiple DL methods have been implemented and summarized in Shoeibi et al. (2021a). Also, in Shoeibi et al. (2021c), various intelligent DL-based methods for automated Schizophrenia diagnosis were described using electroencephalography signals (EEG). Interestingly, in the context of MS, multiple DL techniques are proposed for the task of disease detection, segmentation and classification using MRI data and summarized in Shoeibi et al. (2021b).
TABLE 1. Overview of the most relevant studies in the field of MS analysis from the literature using MRI data.
Graph theory represents a new and powerful approach for characterizing brain networks by providing both global and local metrics (Rubinov and Sporns, 2010; Guo et al., 2017), using either functional MRI or diffusion tensor imaging (DTI). Recently, Tozlu et al. (2021) demonstrated that dynamic functional and structural connectome metrics outperformed results obtained from conventional MRI clinical data when discriminating MS patients by impairment level. Previously, Kocevar et al. (2016) demonstrated the interest of DTI structural connectivity for the classification of MS clinical profiles using ML methods. Marzullo et al. (2019) improved the classification performance by using a CNN model. Schiavi et al. (2022) used structural disconnection for the classification of multiple sclerosis patients considering 55 MS patients and 24 healthy controls. Five different classifiers were used reaching an accuracy in the range between 64.5% and 91.1%. However, fMRI and DTI data, used for connectivity modeling, require long acquisition time and the use of complex processing techniques, strongly limiting their applicability in clinical practice. Nevertheless, brain connectivity can also be obtained from conventional MRI by measuring different morphological metrics of the GM on T1-weighted images (Raamana and Strother. 2018). In such graphs, nodes represent GM areas obtained from the GM tissue parcellation, while edges represent a degree of (dis-)similarity between nodes using features like GM thickness (MacDonald et al., 2000). Indeed, these morphological graphs follow a small-world topology at a macroscale level, characterized by a high degree of local clustering and short path-lengths linking individual network nodes (He et al., 2007). Such an approach has been recently used in Alzheimer’s Disease (AD), showing that a GM network measures predicted hippocampal atrophy rates within individuals with preclinical AD, in contrast to other AD biomarkers (Dicks et al., 2020). In MS, Muthuraman et al. (2016) analyzed morphological GM thickness networks in order to classify CIS and RRMS patients using Support Vector Machines model, obtaining good level of accuracy.
The main goal of this study is to investigate whether cortical thickness atrophy, in patients affected by MS, represents a discriminative biomarker for MS profiling. Our hypothesis is based on previous studies where the GM thickness morphometric feature was demonstrated to be one of the most important biomarkers characterizing MS patients (Durand-Dubief et al., 2012). Also, focusing on network changes and not on local structural properties may represent an important step forward for a better discrimination of the MS clinical profiles. In order to test such an hypothesis, an ensemble of four ML models was used for the classification of MS patients based on the morphological GM connectivity using the GM thickness feature. The graph characterization was based on six global graph metrics, describing the topological behavior of the connectome. Two different spatial resolutions were considered for GM regional parcellation, which allows to test the initial hypothesis in two different settings (i.e., high and low spatial resolution). Also, with the aim of investigating whether the choice of the parcellation atlas has an impact on our initial hypothesis, the results obtained from the statistical and ML analysis were compared. To our knowledge, this is the first attempt to characterize brain networks of MS patients based on their GM atrophy and to perform statistical analysis and automatic classification of MS clinical profiles using two parcellation approaches. An ensemble of 4 ML models was considered in order to improve the binary classification task, while taking into account the thresholding impact on the topological architecture of the GM network.
2 Materials and methods
2.1 Study population and MRI acquisition
In this study, 90 MS patients were examined at different time points, every 6 months during the first 3 years and then every year for the following 4 years. Patients were recruited at the MS clinic of Lyon Neurological Hospital and underwent a MR examination at the CERMEP MRI department, on a 1.5T S Sonata system (Siemens Medical Solution, Erlangen, Germany) using an 8-channel head-coil. The MR protocol included the acquisition of a sagittal 3D-T1 sequence (1 × 1 × 1 mm3, TE/TR = 4/2000 ms). A total of six-hundred-fifty-two scans were obtained, corresponding to 12 CIS, 30 RRMS, 28 SPMS, 20 PPMS. To better clarify the dataset used in this study, Table 2 reports the summary information for all the MS patients. This study was approved by the local Ethics Committee (CPP Sud-Est IV) and the French national agency for medicine and health products safety (ANSM). Written informed consent was obtained from all patients prior to study initiation.
TABLE 2. Summary information of the dataset partition into MS clinical profiles (CIS, RR, SP, PP) and healthy subjects. Average values for Age (at first scan) and Disease Duration (DD) with standard deviation in parentheses is reported. Median values of EDSS is provided along with range of variation in parentheses. Percentage of female patients is reported.
2.2 GM connectivity generation
Starting from 3D T1-weighted images, several preprocessing steps were applied to the anatomical image of each patient and based on the Freesurfer v6.0.0 image analysis suite (Reutera et al., 2012). A detailed description of all the preprocessing steps can be found in Hanganu et al. (2015). In particular, the skull was removed from the brain image and motion correction as well as registration and intensity normalization were performed in order to reduce noise and make all the MRI scans comparable, avoiding pixel intensity artifacts that may distort the final analysis. Additionally, the MRI image was resampled into a 3D coordinate system called the Talairach space. It allows to map the location of brain structures such that MRI scans with different size and overall shape of the brain are mapped using comparable 3D coordinates. To better visualize the impact of the preprocessing, Figure 1 is shown to illustrate the effect for a random subject sampled from our cohort of MS patients. Finally, segmentation of the anatomical tissue, including cortical and sub-cortical GM segmentation was performed. Each voxel was classified into one of four classes, such as WM, cortical GM, sub-cortical GM and cerebro-spinal fluid (CSF). The cortical thickness morphometric feature was calculated for each GM pixel and clustered in brain regions based on the used parcellation atlas. The exact pipeline used to build the morphological connectome was described in Raamana and Strother (2017). The reliability of the pipeline was validated using histological (Rosas et al., 2002) and manual measurements (Rosas et al., 2002) demonstrating high performances across different scanner and field of strengths (Reutera et al., 2012). Additionally, previous studies have performed similar analysis demonstrating the robustness of the implemented pipeline (Muthuraman et al., 2016; Kuperberg et al., 2003). The GM parcellation task was performed using two different atlases, the FSAverage (Rosenke et al., 2017; Fischl et al., 1999) and the Glasser 2016 (Glasser et al., 2016), providing 68 and 360 brain regions (nodes), respectively. The GM morphological connectome of each patient was obtained comparing the morphometric features (i.e., GM thickness) of each brain region, using the Manhattan distance formulation (Craw, 2017), and expressed by a full squared symmetric matrix
FIGURE 1. Visual comparison between the anatomical T1w image modality before (A) and after (B) preprocessing.
FIGURE 2. Intuitive representation of the pipeline implemented for morphological connectome data generation. Starting from the magnetic resonance T1-weighted image of a patient, the GM tissue was first segmented and then parcelled in different anatomical brain regions (ROIs) based on a specific atlas used as template (FSAverage or Glasser 2016). Additionally, from the segmented tissue, the thickness morphometric statistic was calculated for each GM region. A ROI to ROI binary comparison was performed for all possible combinations of GM regions (one to one comparison). The Manhattan distance formulation was used as dissimilarity metric. The results were organized in a squared symmetric adjacency matrix (i.e. connectome) where each entry represent the (dis-)similarity between two ROIs. From the connectome, six global metrics were calculated for the analysis of graph characterization.
2.3 Graph binarization
In order to perform the brain morphological connectivity analysis, graphs were binarized to remove the weakest connections generated by the morphometric GM thickness comparison. Henceforth, a percentile threshold 0 < τ < 1 was imposed and a corresponding binarized unweighted and undirected graph
For the analysis, we followed the same approach proposed by Kocevar et al. (2016). The Coefficient of Variation (CV) was used as measure of variability.
2.4 Graph metrics estimation
In this study, six global graph features were calculated based on the binarized connectome
3 Statistical and machine learning analysis
3.1 Statistical analysis
MS clinical profiles were compared based on their global graph metrics using Generalized Linear Model (GLM) with binomial family and logistic link (Agresti, 2021). Stata16 statistical programming language (StataCorp, 2021) was used along with the command xtmelogit for GLM model fitting. The longitudinal aspect of the dataset was taken into account by considering both mixed and random effects. The evaluation of statistical differences between MS clinical profiles, for each global graph metric, was performed controlling for age and gender as confounding factors. The longitudinal aspect of the dataset was taken into account by the random effect, which model the time component, while the remaining regressors were included as fixed effects. The tests were computed with a level of significance of 5%.
3.2 Classification of MS clinical profiles
Four different ML models were considered in this study for the classification task, namely Logistic Regression (LR) (Cox, 1958), Random Forest (RF) (Breiman, 2001), Support Vector Machine (SVM) (Cortes and Vapnik, 1995) and Adaptive Boosting (AdaBoost) (Schapire, 2013) models. The predictive analysis was performed using Python 3.6 programming language along with scikit-learn package v0.24.2, while the networkx package v2.2 was used for global graph metrics calculation. The 4 ML models were chosen as they are widely used, easy to train and tune with a minimal number of hyperparameters to optimize during cross validation (Witten et al., 2011) and provide built-in feature importance for the prediction task, with the exception of SVM in case a non-linear kernel is used (i.e., Radial Basis Function Kernel). Additionally, simple models are usually preferred with small datasets to avoid overfitting the training set (Ghojogh and Crowley, 2019). Moreover, the different underlying assumptions, used for each classifier, may provide interesting insights for the classification performance. A combination of many different predictors can often improve predictions (Sollich and Krogh, 1995), and in statistics this idea has been investigated extensively (Granger, 1989). In order to boost the performances and exploit the individual properties of each classifier, we combined them into a higher level ensemble model through majority voting. To better clarify the ML pipeline, Figure 3 was proposed. Predictive performances were compared considering the non-parametric Wilcoxon matched-pairs signed-rank test.
FIGURE 3. Schematic representation of the Machine Learning pipeline implemented for the classification of MS clinical profiles. Starting from the adjacency matrix representation of the brain connectome for a patient, six global graph metrics (Betweenness Centrality (BC), Assortativity (r), Transitivity (T), Efficiency (Eg), Modularity (Q) and Density (D) were calculated and used as input to the four Machine Learning models (Logistic Regression, Random Forest, Support Vector Machine and AdaBoost) independently trained. The four models were then combined in a final ensemble model by majority voting and then used for binary classification.
3.3 Statistical metrics
In this work, five statistical parameters were used for the predictive analysis, such as accuracy, precision, recall, and F1. Such metrics are based on the analysis of true positive (TP), true negative (TN), false positive (FP), and false negative (FN) instances classified during the test phase (Gorunescu, 2011). Formally speaking, accuracy is defines as the ratio between the number of correct assessments and the number of all assessments
3.4 Model optimization
In order to optimize the models, a grid search strategy was employed. In particular, the following hyperparameters were optimized. For the LR model a regularization parameter in the interval between 1 and 100 was used, while for the RF model, the number of trees were optimized in the range between 10 and 500 and the max depth of each tree between 1 and 10. For the SVM model, a radial basis kernel (RBF) was used due to its good performance, as demonstrated in previous studies (Kocevar et al., 2016), along with a regularization parameter and kernel coefficient in the range between 1 and 100. Finally, for the AdaBoost model, the number of trees was tuned in the range between 10 and 500, with a learning rate between 0.1 and 3. Additionally, the thresholding value (τ), used for graph binarization, as explained in Section 2.3, was also considered as an additional hyperparameter along the entire percentile range. In order to avoid data leakage, the search for the optimal threshold was only performed on the training set. The final results reported in Section 4 refers to the performance obtained on the hold-out test sets.
4 Results
4.1 Descriptive statistical analysis of GM connectivity
Morphological GM connectivity was characterized using six global graph metrics and two different atlases for parcellation. By measuring the degree of variability (i.e. CV coefficient) along different percentile thresholding values, for each graph metrics (Figure 4), a similar behavior was observed with both atlases. This first result showed that the spatial resolution of parcellation did not have a strong impact on the variability of graph topology. Second, we observed that the smallest CV score was obtained for a τ value comprised between 0.7 and 0.8 for the graph metrics BC and r. These metrics play an important role in describing the information flow of the network and the stability of centralized hubs. Also, the variability of the modularity (Q) metric was coherent and stable between the two atlases only in the range of 0.6–0.8. This result is important as this metric is crucial for maintaining the small-world property characterizing the human brain (Sporns and Betzel, 2016). For the remaining three global metrics, namely Eg, T and D, an opposite trend was observed. For Eg and T metrics an exponential increase in the CV coefficient was observed for a value of τ greater than 0.7. Overall, these results suggest that the information flow between nodes far apart inside the network, and possibly pertaining to different modular clusters, start to be compromised for higher thresholding values, due to the reduction of intra-cluster connections and lower number of triangular motifs (reduced local transitivity). Notwithstanding, a stable value of CV in the range of τ between 0.4 and 0.7 can be noticed for the Eg metric. Finally, the D metric increased steadily until a value of τ equal to 0.7 from which an increased rate was observed, particularly for the Glasser 2016 atlas which provides high spatial resolution parcellation. Therefore, we concluded that good topological characteristics could be observed for a value of τ in the range between 0.6 and 0.8. For this reason, the statistical analysis in Section 4.2 was performed considering a binary thresholding value of 0.7 as a reasonable compromise since it represents the central value of the suggested range.
FIGURE 4. Variability study over the entire percentile threshold range, using two different atlases [FSAverage () and Glasser 2016 ()]. Each block defines the Coefficient of Variation (% CV), expressed in percentage values (y-axis), at each specific percentile threshold in the range [0–1] (x-axis) for a defined global graph feature. For visual purposes, the upper and lower threshold values were cut off at a convenient value in order to avoid image flattening.
4.2 Descriptive statistical analysis of MS clinical profiles
Statistical analysis of the four MS clinical profiles was performed by characterizing graphs with six global metrics calculated using two atlases, FSAverage and Glasser 2016. As shown in Figure 5 and Figure 6, significant differences were detected between the MS clinical profiles for most of the graph metrics, especially considering the higher spatial resolution (Glasser 2016) atlas. This is evident when comparing the Transitivity (T) and Modularity (Q) metrics, for which the Glasser 2016 atlas reported significant differences for almost all binary comparisons, with the exception of the CIS and RRMS groups. This result showed that a more refined parcellation may be helpful in the discrimination of clinical profiles. Considering the Betweenness Centrality (BC) metric, a significant reduction was observed comparing patients in the beginning stages of the disease, such as CIS and RRMS, with progressive MS courses (PPMS and SPMS). Conversely, for r, Eg and D, reduced mean values were observed for CIS and RRMS patients compared to progressive courses (p < 0.01). When comparing CIS with progressive courses, significant differences (p < 0.05) were detected for almost all metrics, with the exception of Q using the FSAverage atlas. However, only small differences were observed between CIS and RRMS patients. Additionally, significant differences (p < 0.05) were observed between PPMS and SPMS groups, especially when the Glasser 2016 atlas was used. From these results, it is possible to conclude that GM thickness was more significantly altered in progressive and degenerative courses, compared to earlier MS stages. The brain networks of PPMS and SPMS patients exhibited a reduced number of centralized nodes, leading to randomized distribution of the information transfer.
FIGURE 5. Boxplot comparison between four MS clinical profiles over six global graph metrics using FSAverage atlas. Differences between clinical profiles were determined employing a generalized mixed effect model with age and sex as controlling factors (*p
FIGURE 6. Boxplot comparison between four MS clinical profiles over six global graph metrics using Glasser 2016 atlas. Differences between clinical profiles were determined employing a generalized mixed effect model with age and sex as controlling factors (*p
4.3 Classification of MS Clinical profiles
From the descriptive statistical analysis, significant differences were detected, suggesting good degree of separation between MS clinical profiles. Thus, in order to evaluate the predictive performance of the six global graph metrics, the 4 ML models were trained separately and subsequently combined in an ensemble model, by late integration using majority voting, as described in Section 4.2. The classification task was performed using the two atlases separately and the results compared in Tables 3, 4. Overall, comparable results were observed between the two atlases. Moreover, high classification performance scores (F1 > 0.7 for both atlases) were obtained comparing patients in the primary stage of the disease (CIS), with progressive courses (PPMS and SPMS). When considering the AUC score, a similar classification performance (0.75) between CIS and PPMS was observed with the FSAverage atlas while lower score (0.67) was obtained using the Glasser 2016 atlas, when comparing CIS and SPMS groups. These findings suggested that a more detailed GM parcellation may lead to less informative connectomes and a reduced classification performance. Interestingly, when comparing CIS with RRMS patients, no significant difference (p > 0.05) in performance was observed between the two atlases (FSAverage; 0.66 and Glasser 2016; 0.58). This result showed the limit of GM connectome to discriminate patients in the early stages of the disease, as CIS and RRMS profiles are mainly subject to inflammation and less degeneration leading to GM atrophy especially when a large (less refined) parcellation strategy was considered. Notwithstanding, the high level of imbalance between the two groups as well as the low number of CIS patients considered in this study may play a substantial role. Conversely, when comparing RRMS patients with PPMS, first, significant differences were detected comparing the FSAverage and Glasser 2016 atlases (p < 0.05), with an AUC score of 0.70 and 0.84 respectively, conform with the statistical analysis in Section 4.2. Surprisingly, when comparing RRMS with SPMS patients, poor classification performances were detected, especially when the Glasser 2016 atlas was considered (AUC score of 0.62). This result is probably due to the high variability between the two clinical profiles, as observed from the boxplot analysis in Figure 5 and Figure 6. This result did not significantly differ from the one obtained using the FSAverage (AUC score of 0.67) atlas. For the comparison between progressive courses, a good level of classification (F1 score of 0.72) was obtained using the Glasser 2016 atlas, compared to the less refined FSAverage atlas (F1 score of 0.66). Also, the AUC scores were similar reporting a value of 0.67 and 0.72 for the FSAverage and Glasser 2016 atlas, respectively. Interestingly, the performance, as expressed by the F1 score and AUC scores, obtained from the two parcellation strategies, provided an interesting insight for the use of a more refined parcellation approach, although no differences were detected (p > 0.05). As an additional ablation study, in comparison with previous reports (Muthuraman et al., 2016; Kocevar et al., 2016), the classification results obtained using only the SVM model were reported for both atlases in Tables 5, 6. Of particular relevance is the fact that the SVM model alone outperformed the ensemble model when the RRMS group was compared with progressive courses (PPMS: AUC score of 0.79 and SPMS: AUC score of 0.71), using the FSAverage atlas. However, when Glasser 2016 atlas was used, this result was not confirmed when RRMS and PPMS patients were compared (F1 score of 0.58 and AUC score of 0.59). A second experiment was also performed in order to investigate the classification performances obtained by comparing early stages of MS patients (CIS and PPMS) with progressive MS patients (PPMS and SPMS). For this comparison, Figures 7, 8 are proposed, which depict the Receiver Operating Characteristic (ROC) curves for both atlases and for each of the folds in the cross-validation. From these Figures, it can be observed that both atlases display a high variability between the folds. Hence, no firm conclusions can be drawn when comparing the two parcellation strategies. Notwithstanding, the lower performances obtained for the FSAverage atlas are mainly due to two out of 10 folds, namely fold 3 and 7, for which very low classification performance of the ensemble model was obtained. As shown in Tables 7, 8, multiple MS profile comparisons were considered using the FSAverage and Glasser 2016 atlas, respectively. Notwithstanding, good level of classification performances (AUC) were observed considering the CIS and RR group combined, with respectively PPMS (FSAverage: 0.65; Glasser 2016: 0.69) and SPMS (FSAverage: 0.66; Glasser 2016: 0.69) patients. However, no statistical differences were observed between the two atlases. Conversely, when comparing RRMS patients with progressive courses, significant differences (p < 0.05) were observed between the two atlases with higher score (F1 and AUC scores of 0.78) obtained with FSAverage. Also, it is interesting to compare this result with the one obtained when considering CIS and RRMS together against progressive MS courses. In this case, an F1 score of 0.65 was obtained for both atlases, suggesting that the inclusion of CIS patients happened to be harmful for the classification performance, probably due to the high variability and the low number of patients in the CIS group.
TABLE 3. Mean (st.dev) of the predictive performances of the ensemble model across the ten folds of the cross validation using FSAverage atlas.
TABLE 4. Mean (st.dev) of the predictive performances of the ensemble model across the ten folds of the cross validation using Glasser 2016 atlas.
TABLE 5. Ablation study: Mean (st.dev) of the predictive performances of the SVM model across the ten folds of the cross validation using FSAverage atlas.
TABLE 6. Ablation study: Mean (st.dev) of the predictive performances of the SVM model across the ten folds of the cross validation using Glasser 2016 atlas.
FIGURE 7. Receiver Operating Characteristic Curves (ROC) curve of the ensemble model for all 10-folds of the cross validation using the FSAverage atlas.
FIGURE 8. Receiver Operating Characteristic Curves (ROC) curve of the ensemble model for all 10-folds of the cross validation using the Glasser 2016 atlas.
TABLE 7. Ablation study: Mean (st.dev) of the predictive performances of the ensemble model across the ten folds of the cross validation using FSAverage atlas and multiclass binary comparisons.
TABLE 8. Ablation study: Mean (st.dev) of the predictive performances of the ensemble model across the ten folds of the cross validation using Glasser 2016 atlas and multiclass binary comparisons.
5 Discussion
GM atrophy is known to occur in MS, even at the earliest stages of the disease (Eshaghi et al., 2018) and may vary between brain regions. Since GM atrophy is probably less affected by inflammation, these GM alterations may provide a more reliable marker of neurodegeneration in MS. Moreover, measuring the cortical thickness may constitute a more sensitive marker than traditional volume-based measures as previously demonstrated (Narayana et al., 2013; Nygaard et al., 2015).
5.1 Classification of MS forms
Recalling that our initial hypothesis was that of evaluating the discrimination power of cortical thickness atrophy, graph theory techniques were employed for testing such an hypothesis. Specifically, the power of graph representation was combined with the anatomical GM thickness feature. Six of the most important global graph metrics were used and based on a previous study (Kocevar et al., 2016). Also, an ML analysis was performed using an ensemble of four different ML models. A previous study attempted similar tasks (Muthuraman et al., 2016) by characterizing GM network data considering only CIS and RRMS profiles, thereby reaching good level of accuracy (97%) using an SVM model. The present study extended the comparison to all MS phenotypes and proposed a fully automated pipeline to generate GM connectivity graphs, using only the anatomical images. To our knowledge, this is the first attempt to classify all the clinical MS phenotypes considering both a statistical and ML approach. Our pipeline is particularly relevant for use in clinical practice since our method is only based on classical anatomical T1w images, which represent the most common modality in clinical applications because of its fast and cheap acquisition. From the obtained results, when comparing CIS with progressive courses (PPMS and SPMS), good level of classification performances were obtained, in agreement with our initial hypothesis as well as a previous study (Muthuraman et al., 2016). In fact, CIS represents the first stage of the disease, usually characterized by tissue inflammation, while PPMS and SPMS correspond to the progressive evolution of the disease where severe GM tissue degeneration occurs. Notwithstanding, poor predictive performances were obtained comparing CIS and RRMS clinical profiles, particularly when the FSAverage atlas was used. Thus, our initial hypothesis does not hold for MS patients in the primary inflammatory stages of the disease, meaning that thickness alone was not able to discriminate patients according to these two MS profiles. However, the lack of discrimination between the two early stages of MS may be explained by the reduced number of patients in the CIS group and the high level of imbalance with respect to the RRMS profile, which represents the most common MS group (Zahoor et al., 2017). Additionally, interesting results were obtained comparing multiple global metrics. In particular, patients in the early inflammatory stages of the disease (CIS and RRMS) showed a more assortative brain structure compared with patients in progressive stages (PPMS and SPMS). Such a measure evaluates the behavior of strongly correlated cortical regions, in terms of cortical thickness. From a clinical standpoint, a value closer to zero may be indicative of patterns of cortical degeneration between highly correlated GM regions (i.e. ROIs). In fact, cortical degeneration in specific regions of the GM tissue reduces the thickness properties of such regions compared to those not-affected, reducing the overall assortativity of the network. The exact opposite was found in previous studies in which connectome data was obtained from WM streamline tractography (Kocevar et al., 2016). This result is expected due to the opposite information provided by GM and WM graphs. Indeed, while the former measures the degree of dissimilarity between 2 GM cortical regions (dissociative measure), the latter measures the axonal interconnectivity between two brain regions by counting the number of streamline fibers obtained from tractography (associative measure), thus providing opposite interpretations.
As long as node centrality and modularity measures are concerned, a reduction of such metrics was observed in patients with progressive courses, especially when high resolution parcellation was considered (Glasser 2016). These results may suggest that the small-world property of the human brain was altered. Advances in connectomics and network neuroscience have found that the small-worldness of brain networks is associated with efficient communication (Bullmore and Sporns, 2012). Thus, from a clinical standpoint, such observations suggest that when small-worldness is disrupted, the communication between different GM regions becomes less efficient, in agreement with results already found in the literature (He et al., 2009; Fleischer et al., 2017). Interestingly, once again this result is in agreement with the WM analysis performed by Kocevar et al. (2016) and with our initial hypothesis. It is important to notice that, these aspects were taken into account by the ML model. In fact, for the FSAverage atlas, BC and r turned out to be the overall most important metrics, pointing to strong predictive discrimination. Moreover, a higher Eg and D were observed in later stages of MS, consistent with the concept of progressive neural loss and consequently of structural hubs, thereby increasing the randomness of the global network (Rimkus et al., 2018). All these findings are coherent with our initial hypothesis stating that pathological cortical thickness alteration may represent an important biomarker for measuring the degree of degeneration between different MS profiles. Additionally, due to the high number of statistically significant differences between clinical profiles, observed for both atlases (Figure 5 and Figure 6), it is reasonable to question whether single global graph metrics can naively classify MS patients in their respective clinical profiles. However, from the boxplot analysis and from the results reported in Table 9, we can conclude that a trivial binary thresholding cannot discriminate MS profiles and more sophisticated ML models are required.
TABLE 9. Mean (st.dev) values of global metrics [Betweenness Centrality (BC), Assortativity (r), Transitivity (T), Efficiency (Eg), Modularity (Q) and Density (D)] were calculated on binarized graphs for τ=0.7 using FSAverage and Glasser 2016 atlas.
5.2 Atlas comparison
Two different parcellation atlases were considered in this study in order to investigate the impact of the spatial resolution of the GM regions. From the analysis performed in Section 4, only the comparison between RRMS and PPMS resulted in significant differences in performance between the two atlases. The lack of significant differences in all the remaining binary comparisons might be explained by the low number of patients and the high variability associated with the obtained results. Notwithstanding, clear evidence in favor of the high spatial resolution atlas (Glasser 2016) can be noticed with generally higher classification scores in terms of both F1 and AUC scores. Conform with the statistical analysis, the Glasser 2016 atlas showed high degree of variation between MS clinical profiles, most likely due to the larger size of the connectome due to more precise GM parcellation. Notwithstanding, the morphological features extracted from each GM region are subject to larger variability since a reduced number of pixels was considered for creating each link inside the connectome. For this reason, a larger number of patients might be needed in order to confirm our results.
5.3 Limitations of the study
This work has also some limitations. First, the reduced number of patients in our dataset, especially for the CIS group, may provide some degree of uncertainty in the generalization results. This has led to an imbalanced dataset when comparing the CIS group with other clinical profiles. In order to tackle the problem, weights were imposed to the cost functions of each ML model in order to regain balance during models optimization, reducing the likelihood to overfit the majority class. With a large enough sample size, a multi-class classification approach might be considered and under/over sampling techniques could be applied for the classification of MS patients across the four MS groups, which may offer a useful tool for clinical applications. However, the aim of this study was more limited to binary classification and focused to the question of whether GM tissue degeneration, combined with the connectome representation, can discriminate MS subgroups. Thus, in order to provide comparable results with previous works, in this study, binary comparisons between MS clinical profiles were considered. Second, the conditions followed in this study for binary thresholding represent our best attempt to empirically solve the binarization problem without inducing bias in the process. The statistical analysis was in fact performed independently from the predictive classification task, in which the thresholding value was considered as an additional hyper-parameter, optimized during cross validation. Good properties were observed for thresholding values between 0.6 and 0.8. It is important to notice that, the study of GM connectome thresholding remains an open issue in the literature. In this work, we provided some empirical justifications to the thresholding rule applied for graph characterization based on the analysis performed in Section 4.1. Additionally, the statistical results remain unchanged for different nearby thresholding values ensuring robustness around the thresholding neighbours. Third, the classification results might be improved by including lesion filling during the GM segmentation, which has demonstrated to increase the accuracy of cortical thickness measurements in MS patients (Magon et al., 2014). However, this approach requires lesions to be segmented by experts, which represents a time consuming and expensive procedure. Automated lesion segmentation procedures are today available, although imprecise segmentation may hinder the improvement provided by lesion filling, thus limiting its applicability in clinical practice.
6 Conclusion
Although MS is mainly considered as an inflammatory and demyelinating WM disease, it also exhibits extensive GM involvement and neuro-degenerative processes. An automated pipeline was proposed in this study to characterize GM graphs, extracted from T1w MRI, using morphological features such as GM thickness. The statistical analysis revealed that significant differences were present between multiple global metrics, highlighting the importance of GM connectome graphs. The analysis was performed using two different resolution atlases, showing slightly higher classification performances with the more refined GM parcellation. The results obtained in this work are of great interest considering that only the anatomical T1w image was needed for classification of MS clinical profiles, which represents the most common MRI modality in clinical applications. To the best of our knowledge, no other studies performed classification of MS clinical profiles only considering the GM tissue degeneration measured by the morphological thickness atrophy. Notwithstanding, the use of more advanced Deep Learning (DL) methods may provide an additional improvement to the baseline results proposed in this work. In particular, advances in new deep learning architectures, such as Transformer-based models and other deep learning based architectures exploiting the attention mechanisms, have demonstrated already impressive results which are worth exploring. Besides deep learning applications, connectome data analysis represents a new interesting field for studying the architectural organization of the human brain network. We plan to extend the present analysis considering a multi-view kernel-based tensor factorization approach for the fusion of multiple morphometric features extracted from the GM tissue, such as GM curvature and area. Finally, the information obtained from the WM and GM tissue may be combined in order to enhance the classification performance of the ML model. Such an approach can be performed by embedding the WM and GM connectomes in a unified graph representation, exploiting the complementary information provided by different brain tissue types. Notwithstanding, such an approach requires the acquisition of DTI data, which is less exploited for clinical applications. Also, the study of longitudinal changes in MRI biomarkers across the four groups represents an interesting evolution of the present work that we aim to investigate.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors upon reasonable request.
Ethics statement
The studies involving human participants were reviewed and approved by CPP Sud-Est IV and French national agency for medicine and health products safety (ANSM). The patients/participants provided their written informed consent to participate in this study.
Author contributions
BB conceptualized the analysis, performed the ML analysis and wrote the manuscript. PA, AM, CS, and SV supervised the ML analysis. FD-D coordinated the study and supervised the MRI acquisitions. DS-M coordinated the study and supervised the image acquisition and processing. DS-M and SV supervised the manuscript writing.
Funding
This study was funded by the following projects: European Research Council within the grant 813120-2018 of Marie Sklodowska-Curie Innovative Training Networks (ITN) of the Horizon 2020 through the INSPiRE-MED project. French National Research Agency (ANR) within the national program “Investissements d’Avenir” through the OFSEP project (ANR-10-COHO-002). Also, this research received funding from the Flemish Government (AI Research Program). FM, PA, and SV are affiliated to KU Leuven institute for AI, B-3000, Leuven, Belgium.
Conflict of interest
Author CS was employed by the company BIP SpA.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Afzal, H., Luo, S., Ramadan, S., Lechner-Scott, J., Amin, M. R., Li, J., et al. (2021). Automatic and robust segmentation of multiple sclerosis lesions with convolutional neural networks. CMC-COMPUTERS Mater. CONTINUA 66 (1), 977–991.
Afzal, H., Luo, S., Ramadan, S., and Lechner-Scott, J. (2020). The emerging role of artificial intelligence in multiple sclerosis imaging. Mult. Scler. 28 (6), 849–858. doi:10.1177/1352458520966298
Barile, B., Marzullo, A., Stamile, C., Durand-Dubief, F., and Sappey-Marinier, D. (2021a). Data augmentation using generative adversarial neural networks on brain structural connectivity in multiple sclerosis. Comput. Methods Programs Biomed. 206, 106113. doi:10.1016/j.cmpb.2021.106113
Barile, B., Marzullo, A., Stamile, C., Durand-Dubief, F., and Sappey-Marinier, D. (2022). Ensemble learning for multiple sclerosis disability estimation using brain structural connectivity. Brain Connect. 12 (5), 476–488. doi:10.1089/brain.2020.1003
Barile, B., Marzullo, A., Stamile, C., Durand-Dubief, F., and Sappey-Marinier, D. (2021b). Tensor factorization of brain structural graph for unsupervised classification in multiple sclerosis. Int. Conf. Pattern Recognit. (ICPR), 5052–5059.
Barnes, J. (1969). Graph theory and social networks: A technical comment on connectedness and connectivity. Sociology 3 (2), 215–232. doi:10.1177/003803856900300205
Bouwer, I. (1972). On edge but not vertex transitive regular graphs. J. Comb. Theory Ser. B 12 (1), 32–40. doi:10.1016/0095-8956(72)90030-5
Brichetto, G., Bragadin, M., Fiorini, S., Battaglia, M. A., Konrad, G., Ponzio, M., et al. (2020). The hidden information in patient-reported outcomes and clinician-assessed outcomes: Multiple sclerosis as a proof of concept of a machine learning approach. Neurol. Sci. 41, 459–462. doi:10.1007/s10072-019-04093-x
Bruschi, N., Boffa, G., and Inglese, M. (2020). Ultra-high-field 7-T MRI in multiple sclerosis and other demyelinating diseases:from pathology to clinical practice. Eur. Radiol. Exp. 4 (59). doi:10.1186/s41747-020-00186-x
Bullmore, E., and Sporns, O. (2009). Complex brain networks: Graph theoretical analysis of structural and functional systems. Nat. Rev. Neurosci. 10 (3), 186–198. doi:10.1038/nrn2575
Bullmore, E., and Sporns, O. (2012). The economy of brain network organization. Nat. Rev. Neurosci. 13 (5), 336–349. doi:10.1038/nrn3214
Calabrese, M., Favaretto, A., Martini, V., and Gallo, P. (2013). Grey matter lesions in MS: From histology to clinical implications. Prion 7 (1), 20–27. doi:10.4161/pri.22580
Compston, A., and Coles, A. (2008). Multiple sclerosis. Lancet 372, 1502–1517. doi:10.1016/s0140-6736(08)61620-7
Cortes, C., and Vapnik, V. (1995). Support-vector networks. Mach. Learn. 20, 273–297. doi:10.1007/bf00994018
Cox, D. (1958). The regression analysis of binary sequences. J. R. Stat. Soc. Ser. B 20 (2), 215–232. doi:10.1111/j.2517-6161.1958.tb00292.x
Craw, S. (2017). Manhattan distance Encyclopedia of machine learning and data mining. Boston, MA: Springer.
Cruz-Gomez, A., Forero, L., Lozano-Soto, E., Cano-Cano, F., Sanmartino, F., Rashid-Lopez, R., et al. (2021). Cortical thickness and serum NfL explain cognitive dysfunction in newly diagnosed patients with multiple sclerosis. Neurol. Neuroimmunol. Neuroinflamm. 8, e1074. doi:10.1212/nxi.0000000000001074
Dicks, E., Flier, W., Scheltens, P., Barkhof, F., and Tijms, B. M. (2020). Single-subject gray matter networks predict future cortical atrophy in preclinical Alzheimer’s disease. Neurobiol. Aging 94, 71–80. doi:10.1016/j.neurobiolaging.2020.05.008
Durand-Dubief, F., Belaroussi, B., Armspach, J., Dufour, M., Roggerone, S., Vukusic, S., et al. (2012). Reliability of longitudinal brain volume loss measurements between 2 sites in patients with multiple sclerosis: Comparison of 7 quantification techniques. AJNR. Am. J. Neuroradiol. 33 (10), 1918–1924. doi:10.3174/ajnr.a3107
Ek, B., VerSchneider, C., and Narayan, D. (2015). Global efficiency of graphs. AKCE Int. J. Graphs Comb. 12 (1), 1–13. doi:10.1016/j.akcej.2015.06.001
Eshaghi, A., Marinescu, R., Young, A., Firth, N. C., Prados, F., Jorge Cardoso, M., et al. (2018). Progression of regional grey matter atrophy in multiple sclerosis. Brain 141 (6), 1665–1677. doi:10.1093/brain/awy088
Filippi, M. (2015). MRI measures of neurodegeneration in multiple sclerosis: Implications for disability, disease monitoring, and treatment. J. Neurol. 262, 1–6. doi:10.1007/s00415-014-7340-9
Fischl, B., Sereno, M., Tootell, R., and Dale, A. M. (1999). High-resolution intersubject averaging and a coordinate system for the cortical surface. Hum. Brain Mapp. 8, 272–284. doi:10.1002/(sici)1097-0193(1999)8:4<272:aid-hbm10>3.0.co;2-4
Fleischer, V., Groger, A., Koirala, N., Droby, A., Muthuraman, M., Kolber, P., et al. (2017). Increased structural white and grey matter network connectivity compensates for functional decline in early multiple sclerosis. Mult. Scler. 23 (3), 432–441. doi:10.1177/1352458516651503
Fleischer, V., Muthuraman, M., Anwar, A., Gonzalez-Escamilla, G., Radetz, A., Gracien, R. M., et al. (2020). Continuous reorganization of cortical information flow in multiple sclerosis: A longitudinal fmri effective connectivity study. Sci. Rep. 10 (1), 806–811. doi:10.1038/s41598-020-57895-x
Garavand, A., Samadbeik, M., and Aslani, N. (2022). The applications of machine learning algorithms in multiple sclerosis: A systematic review. Acta Med. Iran. 60, 5. doi:10.18502/acta.v60i5.9551
Geurts, J., and Barkhof, F. (2008). Grey matter pathology in multiple sclerosis. Lancet Neurol. 7 (9), 841–851. doi:10.1016/s1474-4422(08)70191-1
Ghasemi, N., Razavi, S., and Nikzad, E. (2016). Multiple sclerosis: Pathogenesis, symptoms, diagnoses and cell-based therapy. Cell J. 19 (1), 1–10. doi:10.22074/cellj.2016.4867
Ghojogh, B., and Crowley, M. (2019). The theory behind overfitting, cross validation, regularization, bagging, and boosting: Tutorial. arXiv 1905.12787.
Glasser, M., Coalson, T., Robinson, E., Hacker, C. D., Harwell, J., Yacoub, E., et al. (2016). A multi-modal parcellation of human cerebral cortex. Nature 536, 171–178. doi:10.1038/nature18933
Goodin, D. (2014). The epidemiology of multiple sclerosis: Insights to disease pathogenesis. Handb. Clin. Neurol. 122, 231–266. doi:10.1016/B978-0-444-52001-2.00010-8
Gorunescu, F. (2011). Classification performance evaluation. Data Min. Concepts, Models Tech., 319–330.
Guo, Y., Nejati, H., and Cheung, N. (2017). Deep neural networks on graph signals for brain imaging analysis. IEEE International Conference on Image Processing (ICIP) , 3295–3299.
Hanganu, A., Flak, M., Chang, L., Siebner, H., Moeller, F., Siniatchkin, M., et al. (2015). Cortical thickness changes associated with photoparoxysmal response. Brain Topogr. 28, 702–709. doi:10.1007/s10548-014-0353-y
He, Y., Chen, Z., and Evans, A. (2007). Small-world anatomical networks in the human brain revealed by cortical thickness from MRI. Cereb. Cortex 17, 2407–2419. doi:10.1093/cercor/bhl149
He, Y., Dagher, A., Chen, Z., Charil, A., Zijdenbos, A., Worsley, K., et al. (2009). Impaired small-world efficiency in structural cortical networks in multiple sclerosis associated with white matter lesion load. Brain 132 (12), 3366–3379. doi:10.1093/brain/awp089
Jackson, K., Sun, K., Barbour, C., Hernandez, D., Kosa, P., Tanigawa, M., et al. (2020). Genetic model of ms severity predicts future accumulation of disability. Ann. Hum. Genet. 84 (1), 1–10. doi:10.1111/ahg.12342
Jannoud, I. (2014). On understanding centrality in directed citation graph. ARPN J. Eng. Appl. Sci. 9 (8).
Kaka, H., Zhang, E., and Khan, N. (2020). Artificial intelligence and deep learning in neuroradiology: Exploring the new frontier. Can. Assoc. Radiol. J. 72, 35–44. doi:10.1177/0846537120954293
Kocevar, G., Stamile, C., Hannoun, S., Cotton, F., Vukusic, S., Durand-Dubief, F., et al. (2016). Graph theory-based brain connectivity for automatic classification of multiple sclerosis clinical courses. Front. Neurosci. 10, 478. doi:10.3389/fnins.2016.00478
Kolčava, J., Kočica, J., Hulová, M., Dusek, L., Horakova, M., Kerkovsky, M., et al. (2020). Conversion of clinically isolated syndrome to multiple sclerosis: A prospective study. Multiple Scler. Relat. Disord. 44, 102262. doi:10.1016/j.msard.2020.102262
Kuceyeski, A., Monohan, E., Morris, E., Fujimoto, K., Vargas, W., and Gauthier, S. (2018). Baseline biomarkers of connectome disruption and atrophy predict future processing speed in early multiple sclerosis. NeuroImage Clin. 19, 417–424. doi:10.1016/j.nicl.2018.05.003
Kuperberg, G., Broome, M., McGuire, P., David, A., Eddy, M., Ozawa, F., et al. (2003). Regionally localized thinning of the cerebral cortex in schizophrenia. Schizophr. Res. 60 (9), 199–200. doi:10.1016/s0920-9964(03)81127-1
Lancichinetti, A., and Fortunato, S. (2009). Community detection algorithms: A comparative analysis. Phys. Rev. E 80 (5), 056117. doi:10.1103/physreve.80.056117
Lublin, F., Reingold, S., Cohen, J., Cutter, G. R., Sorensen, P. S., Thompson, A. J., et al. (2014). Defining the clinical course of multiple sclerosis: The 2013 revisions. Neurology 83 (3), 278–286. doi:10.1212/wnl.0000000000000560
Lynch, S., Parmenter, B., and Denney, D. (2005). The association between cognitive impairment and physical disability in multiple sclerosis. Mult. Scler. 11, 469–476. doi:10.1191/1352458505ms1182oa
MacDonald, D., Kabani, N., Avis, D., and Evans, A. C. (2000). Automated 3-D extraction of inner and outer surfaces of cerebral cortex from MRI. Neuroimage 12, 340–356. doi:10.1006/nimg.1999.0534
Magon, S., Gaetano, L., Chakravarty, M., Lerch, J., Naegelin, Y., Stippich, C., et al. (2014). White matter lesion filling improves the accuracy of cortical thickness measurements in multiple sclerosis patients: A longitudinal study. BMC Neurosci. 15, 106. doi:10.1186/1471-2202-15-106
Marzullo, A., Kocevar, G., Stamile, C., Durand-Dubief, F., Terracina, G., Calimeri, F., et al. (2019). Classification of multiple sclerosis clinical profiles via graph convolutional neural networks. Front. Neurosci. 13, 594. doi:10.3389/fnins.2019.00594
McDonald, W., Compston, A., Edan, G., Goodkin, D., Hartung, H. P., Lublin, F. D., et al. (2001). Recommended diagnostic criteria for multiple sclerosis: Guidelines from the international panel on the diagnosis of multiple sclerosis. Ann. Neurol. 50, 121–127. doi:10.1002/ana.1032
Muthuraman, M., Fleischer, V., Kolber, P., Luessi, F., Zipp, F., and Groppa, S. (2016). Structural brain network characteristics can differentiate CIS from early RRMS. Front. Neurosci. 10, 14. doi:10.3389/fnins.2016.00014
Narayana, P., Govindarajan, K., Goel, P., Datta, S., Lincoln, J. A., Cofield, S. S., et al. (2013). Regional cortical thickness in relapsing remitting multiple sclerosis: A multi-center study. NeuroImage Clin. 2, 120–131. doi:10.1016/j.nicl.2012.11.009
Newman, M. (2002). Assortative mixing in networks. Phys. Rev. Lett. 89 (20), 208701. doi:10.1103/physrevlett.89.208701
Newman, M. (2006). Modularity and community structure in networks. Proc. Natl. Acad. Sci. U. S. A. 103 (23), 8577–8582. doi:10.1073/pnas.0601602103
Nygaard, G. O., Walhovd, K. B., Sowa, P., Chepkoech, J. L., Bjornerud, A., Due-Tonnessen, P., et al. (2015). Cortical thickness and surface area relate to specific symptoms in early relapsing–remitting multiple sclerosis. Mult. Scler. 21 (4), 402–414. doi:10.1177/1352458514543811
Pinto, M., Oliveira, H., Batista, S., Cruz, L., Pinto, M., Correia, I., et al. (2020). Prediction of disease progression and outcomes in multiple sclerosis with machine learning. Sci. Rep. 10, 21038. doi:10.1038/s41598-020-78212-6
Preziosa, P., Pagani, E., Mesaros, S., Riccitelli, G. C., Dackovic, J., Drulovic, J., et al. (2017). Progression of regional atrophy in the left hemisphere contributes to clinical and cognitive deterioration in multiple sclerosis: A 5-year study. Hum. Brain Mapp. 38, 5648–5665. doi:10.1002/hbm.23755
Pruenza, C., Solano, M., Díaz, J., Arroyo, R., and Izquierdo, G. (2019). Model for prediction of progression in multiple sclerosis. IJIMAI 5 (6), 48–53. doi:10.9781/ijimai.2019.06.005
Raamana, P., and Strother, S. (2018). graynet: single-subject morphometric networks for neuroscience connectivity applications. J. Open Source Softw. 3 (30), 924. doi:10.21105/joss.00924
Raamana, P., and Strother, S. (2017). Impact of spatial scale and edge weight on predictive power of cortical thickness networks. bioRxiv , 170381
Reutera, M., Schmansky, N., ans, B., and Fischl, H. R. (2012). Within-subject template estimation for unbiased longitudinal image analysis. Neuroimage 61, 1402–1418. doi:10.1016/j.neuroimage.2012.02.084
Rimkus, C., Schoonheim, M., Steenwijk, M., Vrenken, H., Eijlers, A. J., Killestein, J., et al. (2018). Gray matter networks and cognitive impairment in multiple sclerosis. Mult. Scler. 25, 382–391. doi:10.1177/1352458517751650
Rosa, F. L., Fartaria, M., Kober, T., Richiardi, J., Granziera, C., and Thiran, J. P. (2018). Shallow vs deep learning architectures for white matter lesion segmentation in the early stages of multiple sclerosis. International MICCAI Brainlesion Workshop , 142–151.
Rosas, A., Liu, A., Hersch, S., Glessner, M., Ferrante, R. J., Salat, D. H., et al. (2002). Regional and progressive thinning of the cortical ribbon in huntington’s disease. Neurology 58 (5), 695–701. doi:10.1212/wnl.58.5.695
Rosenke, M., Weiner, K., Barnett, M., Zilles, K., Amunts, K., Goebel, R., et al. (2017). A cross-validated cytoarchitectonic atlas of the human ventral visual stream. NeuroImage 170, 257–270. doi:10.1016/j.neuroimage.2017.02.040
Rubinov, M., and Sporns, O. (2010). Complex network measures of brain connectivity: Uses and interpretations. Neuroimage 52 (3), 1059–1069. doi:10.1016/j.neuroimage.2009.10.003
Scalfari, A., Neuhaus, A., Degenhardt, A., Rice, G. P., Muraro, P. A., Daumer, M., et al. (2010). The natural history of multiple sclerosis: A geographically based study 10: Relapses and long-term disability. Brain 133, 1914–1929. doi:10.1093/brain/awq118
Schapire, R. (2013). Explaining adaboost. Empir. Inference 37–52, 37–52. doi:10.1007/978-3-642-41136-6_5
Schiavi, S., Azzari, A., Mensi, A., Graziano, N., Daducci, A., Bicego, M., et al. (2022). Classification of multiple sclerosis patients based on structural disconnection: A robust feature selection approach. J. Neuroimaging 32, 647–655. doi:10.1111/jon.12991
Seccia, R., Gammelli, D., Dominici, F., Romano, S., Landi, A. C., Salvetti, M., et al. (2020). Considering patient clinical history impacts performance of machine learning models in predicting course of multiple sclerosis. PLoS ONE 15, e0230219. doi:10.1371/journal.pone.0230219
Segato, A., Marzullo, A., Calimeri, F., and De Momi, E. (2020). Artificial intelligence for brain diseases: A systematic review. Apl. Bioeng. 4, 041503. doi:10.1063/5.0011697
Shoeibi, A., Khodatars, M., Ghassemi, N., Jafari, M., Moridian, P., Alizadehsani, R., et al. (2021a). Epileptic Seizures detection using deep learning techniques: A review. Int. J. Environ. Res. Public Health 18 (11), 5780. doi:10.3390/ijerph18115780
Shoeibi, A., Khodatars, M., Jafari, M., Moridian, P., Rezaei, M., Alizadehsani, R., et al. (2021b). Applications of deep learning techniques for automated multiple sclerosis detection using magnetic resonance imaging: A review. Comput. Biol. Med. 136, 104697. doi:10.1016/j.compbiomed.2021.104697
Shoeibi, A., Sadeghi, D., Moridian, P., Ghassemi, N., Heras, J., Alizadehsani, R., et al. (2021c). Automatic diagnosis of schizophrenia in eeg signals using cnn-lstm models. Front. Neuroinform. 15, 777977. doi:10.3389/fninf.2021.777977
Shrwan, R., and Gupta, A. (2021). Classification of pituitary tumor and multiple sclerosis brain lesions through convolutional neural networks. IOP Conf. Ser. Mat. Sci. Eng. 1049, 012014. doi:10.1088/1757-899x/1049/1/012014
Simpson, S., Bowman, F., and Laurienti, P. (2013). Analyzing complex functional brain networks: Fusing statistics and network science to understand the brain. Stat. Surv. 7, 1–36. doi:10.1214/13-ss103
Sollich, P., and Krogh, A. (1995). Learning with ensembles: How over-fitting can be useful. Proceedings of the 8th International Conference on Neural Information Processing Systems (NIPS’95). MIT Press, 190–196.
Sporns, O., and Betzel, R. (2016). Modular brain networks. Annu. Rev. Psychol. 67, 613–640. doi:10.1146/annurev-psych-122414-033634
Stafford, I., Kellermann, M., Mossotto, E., Beattie, R. M., MacArthur, B. D., and Ennis, S. (2020). A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases. npj Digit. Med. 3, 30. doi:10.1038/s41746-020-0229-3
Stamile, C., Marzullo, A., and Deusebio, E. (2021). Graph Machine Learning: Take graph data to the next level by applying machine learning techniques and algorithms. Birmingham, United kingdom: Packt Publishing Ltd.
StataCorp, (2021). Stata statistical software: Release 16. College Station, TX: StataCorp LLC. Available at: https://www.stata.com/support/faqs/resources/citing-software-documentation-faqs/.
Steenwijk, M., Geurts, J., Daams, M., Tijms, B. M., Wink, A. M., Balk, L. J., et al. (2016). Cortical atrophy patterns in multiple sclerosis are non-random and clinically relevant. Brain 139, 115–126. doi:10.1093/brain/awv337
Steinman, L. (1996). Multiple sclerosis: A coordinated immunological attack against myelin in the central nervous system. Cell 85, 299–302. doi:10.1016/s0092-8674(00)81107-1
Tallantyre, E., Morgan, P., Dixon, J., Al-Radaideh, A., Brookes, M. J., Evangelou, N., et al. (2009). A comparison of 3T and 7T in the detection of small parenchymal veins within MS lesions. Invest. Radiol. 44, 491–494. doi:10.1097/rli.0b013e3181b4c144
Thedchanamoorthy, K., Piraveenan, M., Kasthuriratna, D., and Senanayake, U. (2014). Node assortativity in complex networks: An alternative approach. Procedia Comput. Sci. 29, 2449–2461. doi:10.1016/j.procs.2014.05.229
Tozlu, C., Jamison, K., Gauthier, S., and Kuceyeski, A. (2021). Dynamic functional connectivity better predicts disability than structural and static functional connectivity in people with multiple sclerosis. Front. Neurosci. 15, 763966. doi:10.3389/fnins.2021.763966
Vázquez-Marrufo, M., Sarrias-Arrabal, E., García-Torres, M., Martín-Clemente, R., and Izquierdo, G. (2021). A systematic review of the application of machine-learning algorithms in multiple sclerosis. Neurologia 3, S0213–S4853.
Witten, I., Frank, E., and Hall, M. (2011). Data mining: Practical machine learning tools and techniques. Morgan Kaufmann Publishers, Elsevier, 0123748569.
Ye, Z., George, A., Wu, A., Niu, X., Lin, J., Adusumilli, G., et al. (2020). Deep learning with diffusion basis spectrum imaging for classification of multiple sclerosis lesions. Ann. Clin. Transl. Neurol. 7 (5), 695–706. doi:10.1002/acn3.51037
Yperman, J., Becker, T., Valkenborg, D., Popescu, V., Hellings, N., Wijmeersch, B. V., et al. (2020). Machine learning analysis of motor evoked potential time series to predict disability progression in multiple sclerosis. BMC Neurol. 20 (1), 105–115. doi:10.1186/s12883-020-01672-w
Zahoor, I., Asimi, R., Haq, E., and Wani, I. (2017). Demographic and clinical profile of multiple sclerosis in kashmir: A short report. Mult. Scler. Relat. Disord. 13, 103–106. doi:10.1016/j.msard.2017.02.018
Zhao, Y., Wang, T., Bove, R., Cree, B., Henry, R., Lokhande, H., et al. (2020). Ensemble learning predicts multiple sclerosis disease course in the summit study. npj Digit. Med. 3, 135. doi:10.1038/s41746-020-00338-8
Keywords: multiple sclerosis, brain connectivity, grey matter, machine learning, artificial intelligence–AI
Citation: Barile B, Ashtari P, Stamile C, Marzullo A, Maes F, Durand-Dubief F, Van Huffel S and Sappey-Marinier D (2022) Classification of multiple sclerosis clinical profiles using machine learning and grey matter connectome. Front. Robot. AI 9:926255. doi: 10.3389/frobt.2022.926255
Received: 22 April 2022; Accepted: 18 August 2022;
Published: 13 October 2022.
Edited by:
Sigal Berman, Ben-Gurion University of the Negev, IsraelReviewed by:
Francesco Crescenzo, AULSS 9 Scaligera, ItalyAfshin Shoeibi, K.N.Toosi University of Technology, Iran
Copyright © 2022 Barile, Ashtari, Stamile, Marzullo, Maes, Durand-Dubief, Van Huffel and Sappey-Marinier. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Dominique Sappey-Marinier, dominique.sappey-marinier@univ-lyon1.fr