The impact of harmonization on radiomic features in Parkinson’s disease and healthy controls: A multicenter study

Tafuri, Benedetta; Lombardi, Angela; Nigro, Salvatore; Urso, Daniele; Monaco, Alfonso; Pantaleo, Ester; Diacono, Domenico; De Blasi, Roberto; Bellotti, Roberto; Tangaro, Sabina; Logroscino, Giancarlo

doi:10.3389/fnins.2022.1012287

ORIGINAL RESEARCH article

Front. Neurosci. , 10 October 2022

Sec. Brain Imaging Methods

Volume 16 - 2022 | https://doi.org/10.3389/fnins.2022.1012287

This article is part of the Research Topic New Challenges and Future Perspectives in Brain Imaging Methods View all 15 articles

The impact of harmonization on radiomic features in Parkinson’s disease and healthy controls: A multicenter study

$\r\nBenedetta Tafuri,*$ Benedetta Tafuri^1,2*

Angela Lombardi^3,4

Salvatore Nigro^1,5

Daniele Urso^1,6

Alfonso Monaco³

Ester Pantaleo^3,4

Domenico Diacono³

Roberto De Blasi^1,7

Roberto Bellotti^3,4

Sabina Tangaro^3,8†

Giancarlo Logroscino^1,2*†

¹Dipartimento di Ricerca Clinica in Neurologia, Centro per le Malattie Neurodegenerative e l’Invecchiamento Cerebrale, Pia Fondazione Cardinale G. Panico, Università degli Studi di Bari Aldo Moro, Lecce, Italy
²Dipartimento di Scienze Mediche di Base, Neuroscienze e Organi di Senso, Università degli Studi di Bari Aldo Moro, Bari, Italy
³Istituto Nazionale di Fisica Nucleare (INFN), Sezione di Bari, Bari, Italy
⁴Dipartimento Interateneo di Fisica M. Merlin, Università degli Studi di Bari Aldo Moro, Bari, Italy
⁵Istituto di Nanotecnologia, Consiglio Nazionale delle Ricerche (CNR-NANOTEC), Lecce, Italy
⁶Department of Neurosciences, King’s College London, Institute of Psychiatry, Psychology and Neuroscience, London, United Kingdom
⁷Dipartimento di Radiologia, Pia Fondazione Cardinale G. Panico, Lecce, Italy
⁸Dipartimento di Scienze del Suolo, Della Pianta e degli Alimenti, Università degli Studi di Bari Aldo Moro, Bari, Italy

Radiomics is a challenging development area in imaging field that is greatly capturing interest of radiologists and neuroscientists. However, radiomics features show a strong non-biological variability determined by different facilities and imaging protocols, limiting the reproducibility and generalizability of analysis frameworks. Our study aimed to investigate the usefulness of harmonization to reduce site-effects on radiomics features over specific brain regions. We selected T1-weighted magnetic resonance imaging (MRI) by using the MRI dataset Parkinson’s Progression Markers Initiative (PPMI) from different sites with healthy controls (HC) and Parkinson’s disease (PD) patients. First, the investigation of radiomics measure discrepancies were assessed on healthy brain regions-of-interest (ROIs) via a classification pipeline based on LASSO feature selection and support vector machine (SVM) model. Then, a ComBat-based harmonization approach was applied to correct site-effects. Finally, a validation step on PD subjects evaluated diagnostic accuracy before and after harmonization of radiomics data. Results on healthy subjects demonstrated a dependence from site-effects that could be corrected with ComBat harmonization. LASSO regressor after harmonization was unable to select any feature to distinguish controls by site. Moreover, harmonized radiomics features achieved an area under the receiving operating characteristic curve (AUC) of 0.77 (compared to AUC of 0.71 for raw radiomics measures) in distinguish Parkinson’s patients from HC. We found a not-negligible site-effect studying radiomics of HC pre- and post-harmonization of features. Our validation study on PD patients demonstrated a significant influence of non-biological noise source in diagnostic performances. Finally, harmonization of multicenter radiomic data represent a necessary step to make analysis pipelines reliable and replicable for multisite neuroimaging studies.

Introduction

Radiomics is a challenging development area in imaging field that is greatly capturing interest of radiologists and neuroscientists (Kumar et al., 2012; Gillies et al., 2016; Salvatore et al., 2019; Guiot et al., 2022). Allowing quantitative radiographic phenotyping over several types of magnetic resonance imaging (MRI) acquisition, radiomic analysis has been proposed as a primary task to improve knowledge about diagnosis, prognosis and predictions of pharmaceutical response in oncology and neurodegenerative diseases (Mayerhoefer et al., 2020). Moreover, thanks to its capability to extract engineered measures from specific regions of interest (ROIs), radiomics has shown to be a useful approach for characterizing and classifying patients with pathological conditions (Gillies et al., 2016; Feng and Ding, 2020). Indeed, many oncological applications have demonstrated the radiomics ability to capture intra-tumoral heterogeneity in a non-invasive way. Concerning neurodegenerative diseases, instead, recent studies on Alzheimer’s (AD) and Parkinson’s diseases (PD) have highlighted the potentiality of radiomics to detect abnormalities beyond standard morphological imaging markers. In particular, radiomics approach had achieved interesting results in distinguishing patients with PD from controls (Cao et al., 2020; Liu et al., 2020) and from atypical parkinsonian syndromes (Tupe-Waghmare et al., 2021). Moreover, associations between radiomics measures and clinical variables have been described in both cross-sectional and longitudinal studies (Feng et al., 2018; Salmanpour et al., 2022).

Despite the outstanding results, radiomics features have showed a strong dependence from different research facilities or different acquisition protocols, limiting the reproducibility and generalizability of the proposed frameworks especially for application on multi-site dataset (Nieuwenhuis et al., 2017). Recent studies have addressed this issue using an intensity normalization step before the feature extraction (Nyul et al., 2000; Shinohara et al., 2014; Reinhold et al., 2018; Dewey et al., 2019). However, the elimination of non-biological variability caused by site-effects represents a not-trivial problem that makes sometimes the normalization approach ineffective for application on multi-scanner datasets (Eshaghzadeh Torbati et al., 2021; Li et al., 2021). Therefore, latest applications in the field of oncology have proposed an additional step of feature harmonization based on ComBat method (Crombé et al., 2020; Da-Ano et al., 2020; Li et al., 2021; Mali et al., 2021), originally implemented as batch-effect correction method for microarray expression data (Johnson et al., 2007). This approach has been also applied to classical morphometric properties such as cortical thickness, cortical surface area and subcortical volumes in brain MRI removing scan effect and increasing the power and statistical significance of the results (Fortin et al., 2018; Pomponio et al., 2020; Radua et al., 2020; Eshaghzadeh Torbati et al., 2021).

In the current study, we investigated the effectiveness of normalization and harmonization approaches to reduce site-effects on radiomics features from healthy brain ROIs. At first, we extracted radiomics features on T1-weighted MRI images of healthy subjects collected from different acquisition sites in the context of the Parkinson’s Progression Markers Initiative (PPMI), sponsored by the Michael J. Fox Foundation, evaluating the sensitivity to site-related effects. In a second step, normalization and harmonization models defined on healthy subjects were applied on patients with PD to evaluate the classification performance pre- and post- site-effect correction.

Materials and methods

Participants

Data used in the preparation of this study were obtained from the PPMI database.¹ For up-to-date information on the study, visit ppmi-info.org. The T1-weighted MR images selected for this study were acquired using a 1.5–3 Tesla scanner from different manufactures (Philips, GE, Siemens). Acquisition protocols from each site are reported in Supplementary Table 1.

Magnetic resonance imaging pre-processing

Structural MR images were segmented using the recon-all script included in Freesurfer v6.0.² After removal of non-brain tissue and bias of each structural brain image, we used the non-uniform intensity corrected image (nu.mgz) in the Freesurfer space to compute radiomics features. An additional step of intensity normalization was performed using the Z-Score method by centering each pre-processed T1w volume at the mean with standard deviation. To identify and characterize the site effect on radiomic features, we extracted the ROIs using the Desikan–Killiany atlas cortical parcelation from the individual subcortical segmentation image (aparc + aseg.mgz) (Desikan et al., 2006). Then, we thresholded each brain parcelation using FSL (Smith et al., 2004) tools to extract the binary masks for the next radiomics analysis.

Radiomics features extraction and harmonization

Data used for our analysis was collected from three ROIs, namely Caudate, Putamen and Thalamus, for both hemispheres, as a set of subcortical brain regions pertinent to PD (Shimohama et al., 2003; Halliday, 2009). For each ROI, we defined a set of 88 radiomic features, including 18 first-order features to describe voxel intensity distribution within image mask, and 70 second-level textural measures to highlight spatial distribution of voxels through four different matrices: 24 features from Gray Level Co-occurrence Matrices (GLCM), 16 from Gray Level Run Length Matrices (GLRLM), 14 measures from Gray Level Dependence Matrices (GLDM) and 16 features from Gray Level Size Zone Matrices (GLSZM) (detailed information about extracted features are reported in Supplementary Table 2; Zwanenburg et al., 2020). The extraction procedure was implemented using Pyradiomics, an open-source Python package (Van Griethuysen et al., 2017).

The multicenter harmonization was performed using ComBat algorithm (Johnson et al., 2007) for location (mean) and scale (variance) adjustments of data due to the site differences between subjects. Particularly, this approach assume that the batch effects can be modeled out by standardizing means and variances across batches. We applied the generalized additive model (GAM) of ComBat, also called NeuroHarmonize, that considered sex and non-linear age effects as covariates in the input data (Pomponio et al., 2020). More in details, this method combines the ComBat harmonization pipeline (Fortin et al., 2017, 2018), with the GAM (Hastie and Tibshirani, 1986). The former aims to remove unwanted sources of variability due to site differences, while preserving the variability due to other biological significant covariates; the latter introduces a penalized non-linear term to better take into account the age effects and capture also non-linearities in age-related differences in radiomic feature. In contrast to a general linear model approach that includes site as a fixed effect covariate, the GAM of ComBat considers only age and sex as covariates to control for during harmonization. This approach assumes that for a given site, the effects across features derive from a common distribution, and thus borrows information across features to shrink estimates toward a common mean. In addition to removing additive site effects, ComBat also corrects multiplicative site effects by removing heteroscedasticity of model errors across site. In our framework, NeuroHarmonize was implemented using Empirical Bayes framework, which is useful for harmonizing multiple features, such as brain regional measures. The estimation of the site hyperparameters (γ as an additive batch effect affecting the measurement, δ as a multiplicative batch effect) of the prior distribution for site-effect correction was conducted considering only healthy controls (HC). Of note, to ensure unbiased results, the harmonization parameters was calculated over control subjects in the training set of each cross-validation fold and then applied on the remaining subjects in train and test folds to correct the site-effect. The Python implementation of harmonization framework was found at https://github.com/rpomponio/neuroHarmonize (Pomponio et al., 2020).

Radiomics modeling

To characterize site effects on radiomics features, a “site vs. site” classification model was built in Leave-One-Out cross validation (LOOCV) considering only HC. To this end, classification performances were first evaluated using raw data. Next, we evaluated the impact of image normalization and ComBat harmonization on the classification performances. For each classification model, at each inner loop of LOOCV, we firstly reduced the burden of high dimensionality of radiomics set of features using least absolute shrinkage and selection operator (LASSO) (Tibshirani, 1996; Friedman et al., 2010). Therefore, the optimal penalty parameter of LASSO was defined via minimization of “Binomial Deviance” and features with non-zero regression coefficients were retained. Then, for classification purpose, we trained a radial basis Support Vector Machines (SVM) model (Cortes and Vapnik, 1995; Chang and Lin, 2011) for each binary site-classifier on previously selected radiomics features.

As second step of our framework, we implemented the same pipeline for PD detection. This procedure was implemented in a ten-times repeated 10-fold cross validation setting. At each of 100 bootstraps, nine folds was used to define a LASSO regression model to select optimal radiomics features. The selected features were saved in a vector to further analysis (Lombardi et al., 2020). These features were also used to train a radial basis SVM. At each iteration, we tested the predictive power of the model by using the excluded fold. The whole pipeline is illustrated in Figure 1.

FIGURE 1

Figure 1. Processing pipeline.

Statistical analysis

Demographic and clinical information of the dataset were provided with descriptive statistics (mean ± SD). Group differences in age, sex, MoCA (Montreal Cognitive Assessment), UPDRS-III (Unified Parkinson’s Disease Rating Scale) scales and H&Y (Hoehn and Yahr) stage were investigated through Chi-square test, one-way analysis of variance (ANOVA) and Kruskal–Wallis ANOVA followed by post-hoc comparisons. For all analyses, the corrected significance threshold was set at p < 0.05 after Bonferroni’s correction for multiple comparisons. Statistical analysis was performed by using R software (Version 3.6.3: R Foundation for Statistical Computing, Vienna, Austria).

The Area Under the receiving operating characteristic Curves (AUCs) were used as evaluation metric for our “site vs. site” models. Classification performances for HC vs. PD models were evaluated by accuracy, sensitivity and specificity, mediated over the 100 bootstraps of classification. Finally, the diagnostic capabilities of the radiomics signatures were evaluated with Receiver Operating Characteristic (ROC) curve analysis.

To assess the stability of radiomics features selected by LASSO regression over HC vs. PD model, we used a frequency-based criterion. For each round of the bootstraps, we stored as relevant features only those corresponding to non-zero weights assigned by LASSO. Subsequently, we selected as most stable radiomics features those that occurred in at least 95° percentile of the frequency vector.

Results

Demographic and clinical data

We selected MR images from seven sites of the PPMI database according with the number of enrolled HC. Table 1 reports all demographic and clinical details for each clinical site included in our study. No statistical difference was found between HC from clinical sites in age, sex, H&Y score and MoCA scales.

TABLE 1

Table 1. Demographic and clinical details of healthy controls for each site.

For each site, we also selected a set of age- and sex-matched PD. Data is reported in Table 2. As expected, we found significant differences in H&Y score, UPDRS-III and MoCA scales (p-values < 0.001) between HC and PD patients.

TABLE 2

Table 2. Demographic and clinical details of patients with Parkinson’s disease and healthy controls.

Characterization and correction of the “Site Effects” on healthy controls

As first step, we studied the variability of radiomics features from healthy ROIs in different acquisition sites. Figure 2 presents, respectively, bivariate scatter plots of the first two principal components (dim) from a principal component analysis (PCA) (Figure 2A) and a histogram-based representation on one textural feature (i.e., GLCM-Correlation) (Figure 2B) for no pre-processed/normalized/harmonized approaches. As expected, a large proportion of the variation by site was corrected by harmonization of data. In Figure 3, we also report AUCs results from each binary comparison across HC from different sites. Classification performance of raw radiomics features (Figure 3, left panel) showed optimal discriminative power for all comparisons, except for Site 19 vs. Site 52. Similar results were obtained using radiomics features calculated from normalized MRI (Figure 3, right panel). To evaluate pairwise differences between the models, we also performed a Wilcoxon signed rank test obtaining a p-value of 0.035.

FIGURE 2

Figure 2. (A) Plots of the first 2 principal components (dim) from principal component analysis (PCA), colored by site; (B) histograms of example radiomic feature for raw, normalized, and harmonized processing.

FIGURE 3

Figure 3. Area under the receiving operating characteristic curves (AUCs) for “site vs. site” classification on healthy controls. Left panel reports performances for not pre-processed radiomics features; right panel reports results using radiomics features from normalized MRI.

After harmonization of radiomics features, LASSO regressions were unable to select any features for prediction of the outcomes. Indeed, LOOCV plots for each pairwise site comparison resulted in a penalty factor that shrinked all regression coefficients to zero (see Supplementary Figure 1).

Parkinson’s disease classification

As second step of our analysis, we evaluated goodness of classification of PD from HC in our three different radiomics approaches, namely without any pre-processing prior to radiomics features calculation, with normalization of image before radiomics computations and with harmonization from site-effect of radiomics features. We report results of each model in Table 3. Figure 4 also shows the corresponding ROCs for trained SVM. Respect to raw and normalized radiomics implementations, harmonization of features determined an increased classification power of the radiomics model.

TABLE 3

Table 3. Performances of PD classification model from different pre-processed radiomics features.

FIGURE 4

Figure 4. Receiver operating characteristic (ROC) curves for three different radiomics features processing.

As shown in Figure 5, we found five features over 95th percentile as most stable, with a predominance of radiomics measures in the right thalamus, involving energy features as measure of the magnitude of voxel values in an image, and “Gray Level Non-Uniformity Normalized” measure, quantifying the variability of gray-level intensity values in the image. Moreover, we found features in putamen, bilaterally, as most frequent predictors.

FIGURE 5

Figure 5. Frequency of radiomics features for PD vs. HC classification. Red line defines 95th percentile beyond which we choose the most stable measures.

Discussion

This study demonstrated the sensitivity of radiomic features to site-effects in multicenter neuroimaging study. We firstly investigated the problem of data variability due to non-biological effects on healthy brain regions. Then, implementing a ComBat-based harmonization procedure of radiomics features, we modelized site-related noise source reducing differences across healthy subjects. Lastly, as validation task, we evaluated the effectiveness of our harmonization approach for classification of PD patients.

The small sample size made necessary some methodological choices. First of all, it was mandatory to use a feature selection method to avoid a course of dimensionality problem due to the imbalance between the number of radiomic features and the sample size (Koutroumbas and Theodoridis, 2008; Zollanvari et al., 2020). On the other hand, a feature selection method such as LASSO was preferred over other feature reduction methods (PCA, LDA, etc.) in order to achieve a more explainable model (Lombardi et al., 2021). Indeed, the cross-validated optimization of penalty factor for LASSO feature selection allowed to define the most important predictors over the radiomics features, also guaranteeing the interpretability of the model in the clinical/radiological field (Lombardi et al., 2022). Overall, the implementation of a leave-one-out cross-validation procedure for site-vs.-site classification (compared to k-fold cross-validation as a bias-variance tradeoff) was used to guarantee approximately unbiased results over our small sample size. Concerning site-effects on radiomic features, PCA and histogram-based representation of radiomics measures highlighted the need of correcting for site effects before performing further analyses. Indeed, this allowed to distinguish HC from each site with high accuracy. Similar results were observed for normalized images. By contrast, after application of NeuroHarmonize algorithm, no subset of features could be identified to differentiate HCs each other. On note, only comparison between subjects of Site 19 and Site 52 reported an AUC close to random choice without using the harmonization approach, probably due to common scanner and protocol parameters used in MRI acquisition.

These findings demonstrated an effective dependency of radiomics features from scanner and acquisition protocol that could not be eliminated with normalization of image intensity but only using a ComBat-based algorithm. Our results were in line with previous findings in radiomics, over both oncology and neuroimaging, that have demonstrated a better standardization capabilities of ComBat-based models compared with different intensity normalization techniques, such as normalization (Z-Score), WhiteStripe and Ravel (Nyul et al., 2000; Shinohara et al., 2014; Reinhold et al., 2018; Dewey et al., 2019; Eshaghzadeh Torbati et al., 2021; Li et al., 2021).

As further result, harmonization pipeline applied to PD patients allowed to improve prediction performance with respect to raw data, suggesting that site noise factor might affect classification performance in multicenter study using radiomic features. Moreover, classification performance obtained in the current study using thalamus, caudate and putamen overcame results reported in previous radiomics studies using similar region-based approach for PD classification (Liu et al., 2020; Tupe-Waghmare et al., 2021). Specifically, Tupe-Waghmare et al. (2021) achieved 0.72 of AUC highlighting the T1 radiomics of substantia nigra as most predictive features. On the other hand, Liu et al. (2020) studied radiomics of putamen and caudate separately on T2w MRI with a 0.77 of AUC of caudate model. Our study further confirmed the impact of right thalamus, besides the involvement of more classical neostriatal regions (caudate + putamen) (Sikiö et al., 2015), in PD pathophysiology, highlighting at the same time its usefulness as diagnostic marker for PD.

Our work has some limitations. Firstly, the limited sample size for each site could produce unstable LASSO regression results, as well as a possible overfitting in SVM training, that we have tried to overcome with a bootstrapped 10-fold CV. Overall, cohorts of subjects from other international neuroimaging studies can be added to solve these issues guarantying greater generalizability of the results. Second, we applied only one type of normalization and harmonization techniques limiting other possible comparisons and further optimizations of performances. Future works can consider more complex intensity normalization methods, such as RAVEL or WhiteStripe, and recently developed alternative versions of ComBat with improved flexibility (M-ComBat) and robustness (B-ComBat) (Da-Ano et al., 2020).

The harmonization of MRI data represents a crucial problem in several medical imaging applications due to non-biological effects determined by different acquisition sites, scanners and multiparametric sequences. Our study aimed to assess the variability of radiomics features extracted on T1w MR images collected in a multicentric context. We found a not-negligible site-effect comparing radiomics features of HC pre- and post-harmonization pipeline. Moreover, our study demonstrated a significant influence of scan noise in distinguishing controls from PD patients. Overall, harmonization of radiomic features represents a necessary requirement for reliable and replicable analysis frameworks in multicenter study.

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found here: https://www.ppmi-info.org/access-data-specimens/download-data.

Ethics statement

The PPMI study was approved by the local Institutional Review Boards of all participating sites (https://www.ppmi-info.org/about-ppmi/ppmi-clinical-sites/) and written informed consent for imaging data and clinical questionnaires was obtained from each participant at the time of enrollment

Author contributions

BT: conceptualization, formal analysis, methodology, software, visualization, and roles/writing – original draft. AL: conceptualization, methodology, validation, supervision, and writing – review and editing. SN: methodology, project administration, supervision, validation, and writing – review and editing. DU: data curation, resources, and writing – review and editing. AM, EP, and DD: data curation and writing – review and editing. RD: validation, visualization, and writing – review and editing. RB: project administration and writing – review and editing. ST: conceptualization, project administration, supervision, and writing – review and editing. GL: funding acquisition, conceptualization, project administration, supervision, and writing – review and editing. All authors contributed to the article and approved the submitted version.

Funding

This project was supported in part by the Michael J. Fox Foundation, including use of data available from the PPMI—a public-private partnership—funded by the Michael J. Fox Foundation for Parkinson’s Research and funding partners (listed at https://www.ppmi-info.org/about-ppmi/who-we-are/study-sponsors) and supported with the founding of Regione Puglia and CNR for Tecnopolo per la Medicina di Precisione. D.G.R. n. 2117 of 21.11.2018 (B84I18000540002).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnins.2022.1012287/full#supplementary-material

Footnotes

References

Cao, X., Wang, X., Xue, C., Zhang, S., Huang, Q., and Liu, W. (2020). A radiomics approach to predicting Parkinson’s disease by incorporating whole-brain functional activity and gray matter structure. Front. Neurosci. 14:751. doi: 10.3389/fnins.2020.00751

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, C., and Lin, C. (2011). LIBSVM: A library for support vector machines. ACM Trans. Intell. Syst. Technol. 2:27.

Google Scholar

Cortes, C., and Vapnik, V. (1995). Support-vector networks. Mach. Learn. 20, 273–297. doi: 10.1007/BF00994018

CrossRef Full Text | Google Scholar

Crombé, A., Kind, M., Fadli, D., Le Loarer, F., Italiano, A., Buy, X., et al. (2020). Intensity harmonization techniques influence radiomics features and radiomics-based predictions in sarcoma patients. Sci. Rep. 10:15496. doi: 10.1038/s41598-020-72535-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Da-Ano, R., Masson, I., Lucia, F., Doré, M., Robin, P., Alfieri, J., et al. (2020). Performance comparison of modified ComBat for harmonization of radiomic features for multicenter studies. Sci. Rep. 10:10248. doi: 10.1038/s41598-020-66110-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Desikan, R. S., Ségonne, F., Fischl, B., Quinn, B. T., Dickerson, B. C., Blacker, D., et al. (2006). An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage 31, 968–980. doi: 10.1016/j.neuroimage.2006.01.021

PubMed Abstract | CrossRef Full Text | Google Scholar

Dewey, B. E., Zhao, C., Reinhold, J. C., Carass, A., Fitzgerald, K. C., Sotirchos, E. S., et al. (2019). DeepHarmony: A deep learning approach to contrast harmonization across scanner changes. Magn. Reson. Imaging 64, 160–170. doi: 10.1016/j.mri.2019.05.041

PubMed Abstract | CrossRef Full Text | Google Scholar

Eshaghzadeh Torbati, M., Minhas, D. S., Ahmad, G., O’Connor, E. E., Muschelli, J., Laymon, C. M., et al. (2021). A multi-scanner neuroimaging data harmonization using RAVEL and ComBat. Neuroimage 245:118703. doi: 10.1016/j.neuroimage.2021.118703

PubMed Abstract | CrossRef Full Text | Google Scholar

Feng, F., Wang, P., Zhao, K., Zhou, B., Yao, H., Meng, Q., et al. (2018). Radiomic features of hippocampal subregions in Alzheimer’s disease and amnestic mild cognitive impairment. Front. Aging Neurosci. 10:290. doi: 10.3389/fnagi.2018.00290

PubMed Abstract | CrossRef Full Text | Google Scholar

Feng, Q., and Ding, Z. (2020). MRI radiomics classification and prediction in Alzheimer’s disease and mild cognitive impairment: A review. Curr. Alzheimer Res. 17, 297–309. doi: 10.2174/1567205017666200303105016

PubMed Abstract | CrossRef Full Text | Google Scholar

Fortin, J.-P., Cullen, N., Sheline, Y. I., Taylor, W. D., Aselcioglu, I., Cook, P. A., et al. (2018). Harmonization of cortical thickness measurements across scanners and sites. Neuroimage 167, 104–120. doi: 10.1016/j.neuroimage.2017.11.024

PubMed Abstract | CrossRef Full Text | Google Scholar

Fortin, J.-P., Parker, D., Tunç, B., Watanabe, T., Elliott, M. A., Ruparel, K., et al. (2017). Harmonization of multi-site diffusion tensor imaging data. Neuroimage 161, 149–170. doi: 10.1016/j.neuroimage.2017.08.047

PubMed Abstract | CrossRef Full Text | Google Scholar

Friedman, J., Hastie, T., and Tibshirani, R. (2010). Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22.

PubMed Abstract | Google Scholar

Gillies, R. J., Kinahan, P. E., and Hricak, H. (2016). Radiomics: Images are more than pictures. Radiology 278, 563–577. doi: 10.1148/radiol.2015151169

PubMed Abstract | CrossRef Full Text | Google Scholar

Guiot, J., Vaidyanathan, A., Deprez, L., Zerka, F., Danthine, D., Frix, A.-N., et al. (2022). A review in radiomics: Making personalized medicine a reality via routine imaging. Med. Res. Rev. 42, 426–440. doi: 10.1002/med.21846

PubMed Abstract | CrossRef Full Text | Google Scholar

Halliday, G. M. (2009). Thalamic changes in parkinson’s disease. Parkinsonism Relat. Disord. 15(Suppl. 3) S152–S155. doi: 10.1016/S1353-8020(09)70804-1

CrossRef Full Text | Google Scholar

Hastie, T., and Tibshirani, R. (1986). Generalized additive models. Stat. Sci. 1, 297–310. doi: 10.1214/ss/1177013604

CrossRef Full Text | Google Scholar

Johnson, W. E., Li, C., and Rabinovic, A. (2007). Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostat. Oxf. Engl. 8, 118–127. doi: 10.1093/biostatistics/kxj037

PubMed Abstract | CrossRef Full Text | Google Scholar

Koutroumbas, K., and Theodoridis, S. (2008). Pattern recognition. Cambridge, MA: Academic Press.

Google Scholar

Kumar, V., Gu, Y., Basu, S., Berglund, A., Eschrich, S. A., Schabath, M. B., et al. (2012). Radiomics: The process and the challenges. Magn. Reson. Imaging 30, 1234–1248. doi: 10.1016/j.mri.2012.06.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Y., Ammari, S., Balleyguier, C., Lassau, N., and Chouzenoux, E. (2021). Impact of preprocessing and harmonization methods on the removal of scanner effects in brain MRI radiomic features. Cancers 13:3000. doi: 10.3390/cancers13123000

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, P., Wang, H., Zheng, S., Zhang, F., and Zhang, X. (2020). Parkinson’s disease diagnosis using neostriatum radiomic features based on T2-weighted magnetic resonance imaging. Front. Neurol. 11:248. doi: 10.3389/fneur.2020.00248

PubMed Abstract | CrossRef Full Text | Google Scholar

Lombardi, A., Amoroso, N., Bellantuono, L., Bove, S., Comes, M. C., Fanizzi, A., et al. (2022). Accurate evaluation of feature contributions for sentinel lymph node status classification in breast cancer. Appl. Sci. 12:7227. doi: 10.3390/app12147227

CrossRef Full Text | Google Scholar

Lombardi, A., Amoroso, N., Diacono, D., Monaco, A., Tangaro, S., and Bellotti, R. (2020). Extensive evaluation of morphological statistical harmonization for brain age prediction. Brain Sci. 10:364. doi: 10.3390/brainsci10060364

PubMed Abstract | CrossRef Full Text | Google Scholar

Lombardi, A., Diacono, D., Amoroso, N., Monaco, A., Tavares, J. M., Bellotti, R., et al. (2021). Explainable deep learning for personalized age prediction with brain morphology. Front. Neurosci. 15:674055. doi: 10.3389/fnins.2021.674055

PubMed Abstract | CrossRef Full Text | Google Scholar

Mali, S. A., Ibrahim, A., Woodruff, H. C., Andrearczyk, V., Müller, H., Primakov, S., et al. (2021). Making radiomics more reproducible across scanner and imaging protocol variations: A review of harmonization methods. J. Pers. Med. 11:842. doi: 10.3390/jpm11090842

PubMed Abstract | CrossRef Full Text | Google Scholar

Mayerhoefer, M. E., Materka, A., Langs, G., Häggström, I., Szczypiñski, P., Gibbs, P., et al. (2020). Introduction to radiomics. J. Nucl. Med. 61, 488–495. doi: 10.2967/jnumed.118.222893

PubMed Abstract | CrossRef Full Text | Google Scholar

Nieuwenhuis, M., Schnack, H. G., van Haren, N. E., Lappin, J., Morgan, C., Reinders, A. A., et al. (2017). Multi-center MRI prediction models: Predicting sex and illness course in first episode psychosis patients. Neuroimage 145, 246–253. doi: 10.1016/j.neuroimage.2016.07.027

PubMed Abstract | CrossRef Full Text | Google Scholar

Nyul, L. G., Udupa, J. K., and Zhang, X. (2000). New variants of a method of MRI scale standardization. IEEE Trans. Med. Imaging 19, 143–150. doi: 10.1109/42.836373

CrossRef Full Text | Google Scholar

Pomponio, R., Erus, G., Habes, M., Doshi, J., Srinivasan, D., Mamourian, E., et al. (2020). Harmonization of large MRI datasets for the analysis of brain imaging patterns throughout the lifespan. Neuroimage 208:116450. doi: 10.1016/j.neuroimage.2019.116450

PubMed Abstract | CrossRef Full Text | Google Scholar

Radua, J., Vieta, E., Shinohara, R., Kochunov, P., Quidé, Y., Green, M. J., et al. (2020). Increased power by harmonizing structural MRI site differences with the ComBat batch adjustment method in ENIGMA. Neuroimage 218:116956. doi: 10.1016/j.neuroimage.2020.116956

PubMed Abstract | CrossRef Full Text | Google Scholar

Reinhold, J. C., Dewey, B. E., Carass, A., and Prince, J. L. (2018). Evaluating the impact of intensity normalization on MR image synthesis. Available online at: http://arxiv.org/abs/1812.04652 (accessed February 28, 2022).

Google Scholar

Salmanpour, M. R., Shamsaei, M., Hajianfar, G., Soltanian-Zadeh, H., and Rahmim, A. (2022). Longitudinal clustering analysis and prediction of Parkinson’s disease progression using radiomics and hybrid machine learning. Quant. Imaging Med. Surg. 12, 906–919. doi: 10.21037/qims-21-425

PubMed Abstract | CrossRef Full Text | Google Scholar

Salvatore, C., Castiglioni, I., and Cerasa, A. (2019). Radiomics approach in the neurodegenerative brain. Aging Clin. Exp. Res. 33, 1709–1711. doi: 10.1007/s40520-019-01299-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Shimohama, S., Sawada, H., Kitamura, Y., and Taniguchi, T. (2003). Disease model: Parkinson’s disease. Trends Mol. Med. 9, 360–365. doi: 10.1016/s1471-4914(03)00117-5

CrossRef Full Text | Google Scholar

Shinohara, R. T., Sweeney, E. M., Goldsmith, J., Shiee, N., Mateen, F. J., Calabresi, P. A., et al. (2014). Statistical normalization techniques for magnetic resonance imaging. Neuroimage Clin. 6, 9–19. doi: 10.1016/j.nicl.2014.08.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Sikiö, M., Holli-Helenius, K. K., Harrison, L. C. V., Ryymin, P., Ruottinen, H., Saunamäki, T., et al. (2015). MR image texture in Parkinson’s disease: A longitudinal study. Acta Radiol. Stockh. Swed. 1987, 97–104. doi: 10.1177/0284185113519775

PubMed Abstract | CrossRef Full Text | Google Scholar

Smith, S. M., Jenkinson, M., Woolrich, M. W., Beckmann, C. F., Behrens, T. E. J., Johansen-Berg, H., et al. (2004). Advances in functional and structural MR image analysis and implementation as FSL. Neuroimage 23(Suppl. 1) S208–S219. doi: 10.1016/j.neuroimage.2004.07.051

PubMed Abstract | CrossRef Full Text | Google Scholar

Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B Methodol. 58, 267–288.

Google Scholar

Tupe-Waghmare, P., Rajan, A., Prasad, S., Saini, J., Pal, P. K., and Ingalhalikar, M. (2021). Radiomics on routine T1-weighted MRI can delineate Parkinson’s disease from multiple system atrophy and progressive supranuclear palsy. Eur. Radiol. 31, 8218–8227. doi: 10.1007/s00330-021-07979-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Griethuysen, J. J. M., Fedorov, A., Parmar, C., Hosny, A., Aucoin, N., Narayan, V., et al. (2017). Computational radiomics system to decode the radiographic phenotype. Cancer Res. 77, e104–e107. doi: 10.1158/0008-5472.CAN-17-0339

PubMed Abstract | CrossRef Full Text | Google Scholar

Zollanvari, A., James, A. P., and Sameni, R. (2020). A theoretical analysis of the peaking phenomenon in classification. J. Classif. 37, 421–434. doi: 10.1007/s00357-019-09327-3

CrossRef Full Text | Google Scholar

Zwanenburg, A., Leger, S., Vallières, M., and Löck, S. (2020). Image biomarker standardisation initiative. Radiology 295, 328–338. doi: 10.1148/radiol.2020191145

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: radiomics analysis, ComBat, multi-site harmonization, structural MRI, Parkinson’s disease

Citation: Tafuri B, Lombardi A, Nigro S, Urso D, Monaco A, Pantaleo E, Diacono D, De Blasi R, Bellotti R, Tangaro S and Logroscino G (2022) The impact of harmonization on radiomic features in Parkinson’s disease and healthy controls: A multicenter study. Front. Neurosci. 16:1012287. doi: 10.3389/fnins.2022.1012287

Received: 05 August 2022; Accepted: 20 September 2022;
Published: 10 October 2022.

Edited by:

Dahua Yu, Inner Mongolia University of Science and Technology, China

Reviewed by:

Lydia Yahia Cherif, Institut du cerveau et de la moelle, INSERM, France
Johann Faouzi, INSERM U1127 Institut du Cerveau et de la Moelle Épinière (ICM), France

Copyright © 2022 Tafuri, Lombardi, Nigro, Urso, Monaco, Pantaleo, Diacono, De Blasi, Bellotti, Tangaro and Logroscino. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Benedetta Tafuri, YmVuZWRldHRhLnRhZnVyaUB1bmliYS5pdA==; Giancarlo Logroscino, Z2lhbmNhcmxvLmxvZ3Jvc2Npbm9AdW5pYmEuaXQ=

^†These authors share last authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

The impact of harmonization on radiomic features in Parkinson’s disease and healthy controls: A multicenter study

Introduction

Materials and methods

Participants

Magnetic resonance imaging pre-processing

Radiomics features extraction and harmonization

Radiomics modeling

Statistical analysis

Results

Demographic and clinical data

Characterization and correction of the “Site Effects” on healthy controls

Parkinson’s disease classification

Discussion

Data availability statement

Ethics statement

Author contributions

Funding

Conflict of interest

Publisher’s note

Supplementary material

Footnotes

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good