Radiomics and machine learning applied to STIR sequence for prediction of quantitative parameters in facioscapulohumeral disease

Colelli, Giulia; Barzaghi, Leonardo; Paoletti, Matteo; Monforte, Mauro; Bergsland, Niels; Manco, Giulia; Deligianni, Xeni; Santini, Francesco; Ricci, Enzo; Tasca, Giorgio; Mira, Antonietta; Figini, Silvia; Pichiecchio, Anna

doi:10.3389/fneur.2023.1105276

ORIGINAL RESEARCH article

Front. Neurol., 24 February 2023

Sec. Applied Neuroimaging

Volume 14 - 2023 | https://doi.org/10.3389/fneur.2023.1105276

This article is part of the Research Topic: Deep Learning for MRI-Based Brain Network Analysis: Novel Methods, Discoveries, and Applications

Giulia Colelli^1,2,3^*^†

Leonardo Barzaghi^1,2^†

Matteo Paoletti²

Mauro Monforte⁴

Niels Bergsland^5,6

Giulia Manco²

Xeni Deligianni^7,8

Francesco Santini^7,8

Enzo Ricci⁴

Giorgio Tasca^4,9

Antonietta Mira^10,11

Silvia Figini^12,13^‡

Anna Pichiecchio^2,14^‡

¹Department of Mathematics, University of Pavia, Pavia, Italy
²Neuroradiology Department, Advanced Imaging and Radiomics Center, IRCCS Mondino Foundation, Pavia, Italy
³INFN, Group of Pavia, Pavia, Italy
⁴UOC di Neurologia, Fondazione Policlinico Universitario A. Gemelli IRCCS, Rome, Italy
⁵Department of Neurology, Jacobs School of Medicine and Biomedical Sciences, Buffalo Neuroimaging Analysis Center, University of Buffalo, The State University of New York, Buffalo, NY, United States
⁶IRCCS, Fondazione Don Carlo Gnocchi ONLUS, Milan, Italy
⁷Department of Radiology, University Hospital Basel, Basel, Switzerland
⁸Basel Muscle MRI, Department of Biomedical Engineering, University of Basel, Basel, Switzerland
⁹John Walton Muscular Dystrophy Research Centre, Newcastle University and Newcastle Hospitals NHS Foundation Trusts, Newcastle upon Tyne, United Kingdom
¹⁰Data Science Lab, Università della Svizzera italiana, Lugano, Switzerland
¹¹Department of Science and High Technology, University of Insubria, Como, Italy
¹²Department of Political and Social Sciences, University of Pavia, Pavia, Italy
¹³BioData Science Center, IRCCS Mondino Foundation, Pavia, Italy
¹⁴Department of Brain and Behavioural Sciences, University of Pavia, Pavia, Italy

Purpose: Quantitative muscle MRI (qMRI) is a valuable, non-invasive tool for assessing disease involvement and progression in neuromuscular disorders, as it can detect even subtle changes in muscle pathology. The aim of this study is to evaluate the feasibility of using a conventional short-tau inversion recovery (STIR) sequence to predict fat fraction (FF) and water T2 (wT2) in skeletal muscle, by introducing a radiomic workflow with standardized feature extraction combined with machine learning algorithms.

Methods: Twenty-five patients with facioscapulohumeral muscular dystrophy (FSHD) were scanned at calf level using a conventional STIR sequence and qMRI techniques. We applied and compared three different radiomics workflows (WF1, WF2, WF3), combined with seven machine learning regression algorithms (linear, ridge, and lasso regression, tree, random forest, k-nearest neighbor, and support vector machine), on conventional STIR images to predict FF and wT2 for six calf muscles.

Results: The combination of WF3 and k-nearest neighbor proved to be the best predictive model of qMRI parameters, with a mean absolute error of about ±5 pp for FF and ±1.8 ms for wT2.

Conclusion: This pilot study demonstrated the feasibility of predicting qMRI parameters in a cohort of FSHD subjects starting from a conventional STIR sequence.

1. Introduction

Muscle magnetic resonance imaging (mMRI) has been increasingly used in recent years as a powerful diagnostic tool to evaluate disease involvement and progression in several neuromuscular disorders (1–3). mMRI can demonstrate selective patterns of damage distribution in terms of both fat replacement and muscle edema (4, 5). Facioscapulohumeral muscular dystrophy (FSHD) is a genetic muscle disorder that causes slowly progressive and asymmetric weakness of the facioscapulohumeral, abdominal, paraspinal, and lower leg muscles (6–9) in both pediatric and adult patients. mMRI of FSHD has relied on the acquisition of conventional sequences such as T1-weighted (T1w) and short-tau inversion recovery (STIR) sequences, which support the qualitative detection of anatomical changes in muscle size or shape, particularly those related to fat replacement and muscle edema (or edema-like changes) (10, 11), and reveal widespread involvement of both the upper girdle and the lower limbs (12, 13). The use of mMRI has made it possible to propose a distinctive model of FSHD disease evolution, in which patients undergo muscle-selective involvement with an early hyperintense signal on STIR sequences related to edema/inflammation, followed by fatty replacement of single muscles, particularly evident on T1w images (14). Recently, the use of STIR signal intensity as a longitudinal marker of inflammation suppression in FSHD has been questioned, because an increase in STIR signal has been reported in FSHD patients during immunosuppressive treatment (15). As in other neuromuscular diseases, semi-quantitative visual scales, e.g., the Mercuri and Fischer scales (16, 17), have been applied to support and improve the evaluation of morphological changes in muscles.

The recent development and implementation of quantitative MRI (qMRI) in the field of neuromuscular diseases has made it possible to go beyond conventional and semi-quantitative approaches by assessing quantitative parameters (e.g., the percentage of fat replacement in the muscle, the so-called fat fraction, FF) that have been correlated both with transcriptome signatures (DUX4 and PAX7 signatures) and with clinical tests (e.g., the Ricci clinical severity score) (18). The development of qMRI techniques has therefore improved the non-invasive applicability of muscle imaging in the diagnostic process and follow-up of muscle disorders (19). Indeed, neither clinical outcomes nor conventional muscle MRI techniques are deemed sensitive enough to track muscle changes in slowly progressing diseases (3). qMRI is considered a valuable tool for monitoring even fine changes in neuromuscular disease and its longitudinal progression because it delivers quantitative information such as muscle FF and muscle water T2 (wT2) relaxation time. wT2 is a non-specific marker of disease activity because it is sensitive to the presence of leaky membranes, muscle fiber necrosis, edema, inflammation, or denervation (20). Dixon imaging and multi-echo T2 spin-echo sequences are the most commonly used qMRI methods to compute FF and wT2 (3). To date, qMRI methods require custom-tailored sequences provided by vendors on the MRI scanner, resulting in high-cost implementations. Recently, radiomics compliant with the Image Biomarker Standardization Initiative (IBSI, https://ibsi.readthedocs.io/en/latest/) has proved to be a powerful tool for extracting quantitative information from MRI images, becoming a new asset in the diagnostic field (21). It can identify the main patterns of a disease through the mathematical extraction of pixel intensity distributions and spatial interrelationships.

Radiomics quantifies textural information that, once dimensionally reduced (22, 23), can be combined with machine learning (ML) algorithms to predict neuromuscular quantitative biomarkers such as FF and wT2 with good predictive power (24). Standardized feature extraction can also help to overcome possible limitations due to the presence of fat in the evaluation of wT2 biomarkers through exponential fitting. However, it is still unclear whether and how radiomics could be applied to conventional STIR images and combined with ML algorithms to predict FF and wT2. Moreover, it remains unexplored whether the predictive power of ML algorithms on conventional STIR images could be improved by defining new radiomic features as an alternative to those provided by commercial radiomic feature extraction software (25).

The STIR sequence is most likely available in all MRI centers and has a very competitive acquisition time compared with qMRI sequences. In this study, we aim to investigate whether different radiomics workflows and machine learning algorithms can be applied to a conventional STIR sequence to predict quantitative parameters in skeletal muscle.

2. Materials and methods

Twenty-five FSHD patients (10 females, age range: 19–60 y) and six healthy volunteers (HCs) (5 females, age range: 47–63 y) were scanned on a 3T MRI scanner (Magnetom Skyra, Siemens Healthcare, Erlangen, Germany) using integrated spine and body surface coils. The acquisition volume was centered on the calf, with the last acquired slice located 6 cm proximal to the upper limit of the patella. The MRI protocol included a 3D 6-point multi-echo gradient-echo (MEGE) sequence [52 slices, slice thickness (TH) = 5.0 mm, distance factor (DF) = 20%, resolution = 1 × 1 × 5 mm³, TR/TE = 35 ms/1.7–9.2 ms, scan time = 15 min], a multi-echo spin-echo (MESE) sequence [7 slices, TH = 10 mm, DF = 300%, resolution = 1.2 × 1.2 × 10 mm³, TR/TE = 4,100 ms/10.9–185.3 ms, 17 echoes, scan time = 5.13 min], and a 2D STIR sequence [50 slices, TH = 5.0 mm, DF = 20%, resolution = 1 × 1 × 5 mm³, TR/TE = 4,200/82 ms, TI = 230 ms, scan time = 3.40 min]. An example of a STIR image is shown in Figure 1. Pre-processing steps were performed on the STIR images to ensure that features were extracted from grayscale values harmonized across patients. In particular, all images were pre-processed with 3DSlicer (26), using N4 Bias Field Correction to correct low-frequency intensity non-uniformity and Histogram Matching to normalize the grayscale values.

Figure 1. Example of axial STIR image of an FSHD subject at calf level. Image acquired at Neuroradiology Department of IRCCS Mondino Foundation.

A single slice from the medial calf level of each FSHD patient was selected from the first-echo MEGE images, because the SNR of the first echo is higher than that of the other echoes. Each selected slice was automatically segmented (27) into six regions of interest (ROIs), one for each calf muscle, i.e., Soleus (S), Medial and Lateral Gastrocnemius (MG, LG), Anterior Tibialis (TA), Extensor Digitorum Longus (ELD), and Peroneus Longus (Pe). The ROIs were co-registered to the medial calf slice of MESE and STIR using the linear registration command 'flirt' of the FSL software (28). A single trained operator with 3 years of experience manually corrected each ROI after the automatic segmentation of the MEGE images and after the co-registration to the MESE and STIR images (Figure 2).

Figure 2. Segmentation flow from MEGE to MESE and STIR images. Automatic segmentation was performed on MEGE slice followed by manual correction for 6 ROIs: Soleus (Red), Medial and Lateral Gastrocnemius (Green, Dark Blue), Anterior Tibialis (Yellow), Extensor Digitorum Longus (Light Blue), Peroneus Longus (Pink). Then the ROIs were co-registered and manually corrected both on MESE and STIR images.

For each subject and each muscle, radiomic feature extraction and ML prediction were performed on the mid-calf slice of the STIR image because it shows all calf muscles with a cross-sectional area (CSA) wide enough to ensure the extraction of a robust pixel intensity distribution (29). Fifty-six radiomic features were extracted for each muscle, averaging the left and right sides. In particular, we extracted 25 first-order statistical features concerning voxel intensity distributions (e.g., CONVENTIONAL_mean, CONVENTIONAL_std, CONVENTIONAL_max, CONVENTIONAL_Q1); 26 second-order statistical features highlighting voxel spatial relationships, such as the gray level co-occurrence matrix (GLCM) features (e.g., GLCM_Correlation, GLCM_Entropy_log10) and the gray level zone length matrix (GLZLM) features (e.g., GLZLM_LZE, GLZLM_LGZE, GLZLM_HGZE); and 5 shape-related features concerning size and geometric properties [e.g., SHAPE_Volume(mL), SHAPE_Volume(vx)] (25). Finally, the ground truth FF and wT2 values, against which the ML predictions were compared, were calculated with the Fatty Riot algorithm (30) from the mid-calf MEGE slice and with extended phase graph (EPG) signal simulation (two-component model, for both water and fat) (31, 32) from the mid-calf MESE slice, respectively.

2.1. Dataset, dimensionality reduction, and machine learning algorithms

We compared the performance of three different workflows in predicting calf muscle FF and wT2 values. The first workflow, inspired by the work of Felisaz et al. (24), predicts FF and wT2 by combining radiomic features extracted with the LIFEx software (25), principal component analysis (PCA) (33), and ML regression models. The second workflow uses the same feature extraction and ML models as the first but explores a new dimensionality reduction technique (23) as an alternative to PCA, to verify whether it improves the prediction of neuromuscular quantitative parameters. The third workflow relies neither on LIFEx features nor on any dimensionality reduction technique. Instead, two STIR-based features are defined as markers of muscle fat percentage and muscle inflammation. These two features are used as predictors in the ML models to test whether they improve the predictive performance for FF and wT2.

2.2. Workflow 1

Feature extraction was performed using the IBSI-compliant LIFEx software v.7.1.0 in order to extract shape-related features, which account for size and geometric properties; first-order statistical features, which concern voxel intensity distributions; and second-order statistical features, which highlight voxel spatial relationships. In particular, a 2D extraction was performed on each ROI corresponding to the six calf muscles (left and right sides were averaged). We therefore obtained six datasets, one for each calf muscle. On each dataset, principal component analysis (PCA) (33) was performed to obtain lower-dimensional data while preserving as much of the data variation as possible. Six principal components, which in our case retain about 90% of the variance, were identified, and each data point was projected onto them. For each muscle dataset we implemented the parametric linear (34), ridge (35), and lasso (36) regression algorithms and the non-parametric k-nearest neighbor (KNN) (37), support vector machine (SVM) (38), tree (39), and random forest (RF) (40) algorithms. A k-fold cross-validation resampling approach with k = 5 was applied to the PCA-reduced dataset. This procedure gives a more realistic performance evaluation of each machine learning model by fitting the same statistical model several times on randomly obtained subsets of approximately equal size.
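
The PCA step described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes NumPy is available, standardizes the features before the decomposition (the text does not state whether this was done), and uses synthetic data in place of the LIFEx feature table. The function name `pca_reduce` and the 4-factor toy dataset are hypothetical.

```python
import numpy as np

def pca_reduce(X, variance_to_keep=0.90):
    """Project the rows of X onto the fewest principal components that
    together explain at least `variance_to_keep` of the total variance."""
    X = np.asarray(X, dtype=float)
    # Standardize each feature (assumption: scaling is not described in the text).
    std = X.std(axis=0, ddof=1)
    std[std == 0] = 1.0
    Z = (X - X.mean(axis=0)) / std
    # SVD of the standardized data: the rows of Vt are the principal axes.
    _, S, Vt = np.linalg.svd(Z, full_matrices=False)
    explained = S**2 / np.sum(S**2)
    cumulative = np.cumsum(explained)
    # Smallest number of components whose cumulative share reaches the threshold.
    n_components = min(int(np.searchsorted(cumulative, variance_to_keep)) + 1, len(S))
    scores = Z @ Vt[:n_components].T
    return scores, explained[:n_components]

# Synthetic stand-in for one muscle dataset: 25 subjects x 56 features,
# generated from 4 latent factors plus a small amount of noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(25, 4))
X = latent @ rng.normal(size=(4, 56)) + 0.1 * rng.normal(size=(25, 56))
scores, ratio = pca_reduce(X)
print(scores.shape, round(float(ratio.sum()), 3))
```

On real radiomic features the number of retained components depends on the data; the text reports six components for about 90% of the variance.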

2.3. Workflow 2

The starting point was the 2D extraction of texture features from the pre-processed STIR image, as described in WF1. To reduce the dimensionality of the dataset we used the concept of information imbalance described in Glielmo et al. (23). More precisely, since explicit features are available, performing feature selection or dimensionality reduction is in our case equivalent to finding the most suitable distance measure between data points. This is because each choice of features naturally gives rise to a different distance function computed through the Euclidean norm (23). We therefore designed a feature selection algorithm that selects the subset of features minimizing the information imbalance with respect to each of the two targets, the neuromuscular biomarkers FF and wT2, separately. The information imbalance Δ was defined through its estimate on a dataset with N points (23):

\begin{array}{lr} \Delta (A \to B) \approx \frac{2}{N} \langle r^{B} \mid r^{A} = 1 \rangle & (1) \end{array}

where A is the radiomic feature space and B is the space associated with the FF or wT2 biomarker; r^B and r^A are the ranks of each pair of points in spaces B and A, respectively, calculated according to the distances d_B and d_A, each a Euclidean norm defined in the corresponding space. The estimate is based on the average rank in B of the pairs of points that are nearest neighbors (r^A = 1) in A (23). Thus, information imbalance quantifies the relative information content of one distance measure with respect to another using the widespread idea of local neighborhoods. A low value of Δ (A→B) means that the combination of certain features can predict a specific neuromuscular biomarker. Figure 3 shows, for the Soleus, the minimum information imbalance Δ (A→B) achievable with a specific subset of radiomic features for the two biomarkers wT2 and FF. For each muscle, we optimized the information imbalance with respect to the targets FF and wT2 separately and selected the subspace of radiomic features corresponding to the associated minimum Δ. The resulting datasets for each muscle and each biomarker were used as input for the machine learning algorithms. As in WF1, parametric and non-parametric algorithms were implemented using k-fold cross-validation resampling.
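
As an illustration of Equation (1), the following sketch estimates Δ(A→B) with a brute-force nearest-neighbor search in pure Python. It is a simplified reading of the estimator in (23), not the authors' code: the function names and the synthetic data are hypothetical, and the subset search over the 56 features is not shown.

```python
import math
import random

def euclidean(u, v):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def information_imbalance(space_a, space_b):
    """Estimate Delta(A -> B) = (2 / N) * <r^B | r^A = 1>.

    space_a and space_b are lists of N vectors describing the same
    N points in two different spaces."""
    n = len(space_a)
    total_rank = 0
    for i in range(n):
        others = [j for j in range(n) if j != i]
        # Nearest neighbor of point i in space A (rank 1 in A).
        nn_a = min(others, key=lambda j: euclidean(space_a[i], space_a[j]))
        # Rank of that same neighbor when distances are measured in space B.
        d_b = euclidean(space_b[i], space_b[nn_a])
        rank_b = 1 + sum(1 for j in others if euclidean(space_b[i], space_b[j]) < d_b)
        total_rank += rank_b
    # (2 / N) times the mean rank, where the mean is total_rank / N.
    return 2.0 * total_rank / (n * n)

random.seed(0)
target = [[random.uniform(0, 50)] for _ in range(40)]   # stand-in for a biomarker
informative = [[2.0 * t[0] + 1.0] for t in target]      # feature that tracks the target
noise = [[random.random()] for _ in range(40)]          # feature unrelated to the target
delta_informative = information_imbalance(informative, target)
delta_noise = information_imbalance(noise, target)
print(delta_informative, delta_noise)
```

In this toy case the informative feature preserves every nearest neighbor, so Δ reaches its lower bound 2/N, whereas the unrelated feature gives a value close to 1. A feature subset with low Δ is therefore a good candidate predictor, which is the selection criterion described above.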

Figure 3. Optimized information imbalance for blocks of features for the Soleus muscle. The y-axis reports the optimized information imbalance values, calculated using Equation (1), as a function of the subsets of radiomic features (x-axis). (A) Optimized imbalance with respect to the target biomarker FF (top) and (B) with respect to wT2 (bottom).

2.4. Workflow 3

We defined two STIR-based radiomic features to be used as an alternative to the conventional textural features of WF1 and WF2. We used these new features as the only covariates in the ML algorithms to test whether the prediction performance of the ML models could be improved over that obtained with the previously described workflows. First, we applied the same segmentation method used for the FSHD patients to the pre-processed STIR images of each healthy control (HC). In particular, six contiguous mid-calf slices from the HCs were segmented to ensure robust pixel statistics for the grayscale intensity distributions. Then, two reference limits, the Upper Limit (UL) and the Lower Limit (LL), were defined as follows. Inspired by Dahlqvist et al. (41), UL was defined for each calf muscle by extracting a pixel-wise histogram of the signal intensity distribution from all slices. The six muscle-wise ULs were set at the mean μ of the associated pixel intensity distribution plus 2 standard deviations (S.D.) σ:

\begin{array}{lr} \mathrm{UL}_{i} = \mu_{i} + 2 \sigma_{i} & (2) \end{array}

with i indexing the six calf muscles.

Because fat suppression in the STIR sequence is non-uniform, LL was calculated as a representative value of the fat signal intensity. To extract LL, subcutaneous fat (average thickness at the medial level in HCs of about 10.5 mm) was manually outlined in the HC slices. In particular, the pixel-wise histogram of the signal intensity distribution was extracted from the subcutaneous fat ROIs of all slices, and LL was set as the mode of this distribution. In this way, we could calculate a more realistic representative value of fat intensity, limiting the contribution of blood vessels present in the subcutaneous fat, which tend to shift the mean of the distribution toward higher values because of the hyperintense STIR signal of blood.

The LL and muscle-wise UL values were then used as reference limits to quantify, for every FSHD patient, the fat infiltration grade (FFG) and the muscle edema grade (MEG), expressed as the number of pixels below LL and above UL, respectively, as a percentage of the total pixels in each calf muscle. FFG and MEG were then used as covariates in the ML models to predict FF and wT2, respectively. Specifically, the muscle-wise FFG and MEG values were collected into separate datasets according to calf muscle and neuromuscular biomarker and used as input for the machine learning algorithms.
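
The two WF3 features can be sketched in a few lines of Python. This is an illustrative reading of the definitions above, not the authors' routine: the function names are hypothetical, the sample standard deviation is assumed for Equation (2), the comparisons are taken as strict, and the pixel values are made-up toy numbers.

```python
from statistics import mean, mode, stdev

def upper_limit(hc_muscle_pixels):
    """UL = mean + 2 S.D. of the healthy-control pixel intensities of one muscle."""
    return mean(hc_muscle_pixels) + 2 * stdev(hc_muscle_pixels)

def lower_limit(hc_fat_pixels):
    """LL = mode of the healthy-control subcutaneous fat pixel intensities."""
    return mode(hc_fat_pixels)

def grades(patient_pixels, ll, ul):
    """Return (FFG, MEG): percentage of pixels below LL and above UL."""
    n = len(patient_pixels)
    ffg = 100.0 * sum(1 for p in patient_pixels if p < ll) / n
    meg = 100.0 * sum(1 for p in patient_pixels if p > ul) / n
    return ffg, meg

# Toy intensities (arbitrary units), for illustration only.
hc_muscle = [100, 110, 120, 130, 140]
hc_fat = [30, 32, 32, 35, 32, 40, 31]
patient_muscle = [20, 25, 31, 90, 110, 125, 150, 160, 180, 200]

ul = upper_limit(hc_muscle)
ll = lower_limit(hc_fat)
ffg, meg = grades(patient_muscle, ll, ul)
print(ll, round(ul, 2), ffg, meg)
```

In practice the pixel lists would come from the segmented ROIs of the pre-processed STIR images, with one UL per muscle and a single LL from the subcutaneous fat.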

As in WF1, we implemented both parametric and non-parametric models using k-fold cross-validation as the resampling approach. WF3 has the advantage of testing the prediction accuracy of neuromuscular biomarkers with two features that are easy to compute with a stand-alone Python routine, without relying on commercial texture software or any dimensionality reduction technique.

2.5. ML models performance evaluation

For each of the aforementioned workflows, model performance was estimated by calculating, for each muscle and each ML algorithm, the mean absolute error (MAE):

\begin{array}{lr} \mathrm{MAE}_{j} = \frac{\sum_{i = 1}^{N} | y_{i} - \bar{y}_{i} |}{N} & (3) \end{array}

where N is the number of observations, y_i is the target value, ȳ_i is the predicted value, the index j refers to the different calf muscles, and the index i runs over the observations associated with each muscle. Furthermore, the mean MAE over the cross-validation folds ( $\overline{\mathrm{MAE}}$ ) was defined as:

\begin{array}{lr} \overline{\mathrm{MAE}}_{j} = \frac{\sum_{k = 1}^{5} \mathrm{MAE}_{j}^{(k)}}{5} & (4) \end{array}

where MAE_j^{(k)} is the MAE of muscle j computed on fold k and the index k runs over the 5 folds.
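
Equations (3) and (4) can be made concrete with a small pure-Python sketch that evaluates a KNN regressor with 5-fold cross-validation on a single covariate, as in WF3. Everything here is an assumption made for illustration: the number of neighbors (k = 3), the fold assignment, the function names, and the synthetic data are not taken from the study.

```python
import random

def knn_predict(train_x, train_y, x, k=3):
    """Average the targets of the k training points closest to x (scalar feature)."""
    nearest = sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - x))[:k]
    return sum(train_y[i] for i in nearest) / len(nearest)

def mae(y_true, y_pred):
    """Mean absolute error, as in Equation (3)."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def cross_validated_mae(x, y, n_folds=5, k=3, seed=0):
    """Return the per-fold MAE values and their mean, as in Equation (4)."""
    idx = list(range(len(x)))
    random.Random(seed).shuffle(idx)
    folds = [idx[f::n_folds] for f in range(n_folds)]  # folds of near-equal size
    fold_maes = []
    for test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in idx if i not in held_out]
        tx = [x[i] for i in train_idx]
        ty = [y[i] for i in train_idx]
        preds = [knn_predict(tx, ty, x[i], k) for i in test_idx]
        fold_maes.append(mae([y[i] for i in test_idx], preds))
    return fold_maes, sum(fold_maes) / n_folds

# Synthetic example: 25 subjects, one covariate loosely related to the target.
rng = random.Random(1)
covariate = [rng.uniform(0, 60) for _ in range(25)]
target = [0.8 * v + rng.gauss(0, 2) for v in covariate]
fold_maes, mean_mae = cross_validated_mae(covariate, target)
print([round(m, 2) for m in fold_maes], round(mean_mae, 2))
```

With 25 subjects each fold holds out five observations, so every per-fold MAE is an average over only five errors; this is one reason the mean over folds is reported rather than a single split.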

To measure the variability of the volume and ground truth distributions, we also calculated the coefficients of variation (CVs), defined as:

\begin{array}{lr} \mathrm{CV}_{i} = \frac{\sigma_{i}}{\mu_{i}} & (5) \end{array}

where the index i runs over the muscles, and σ_i and μ_i are the S.D. and mean of the associated distribution, respectively. Thus, the CVs of the muscle-wise ground truth FF and wT2 quantify the variability of the ground truth values on which the ML models were tested, and the CVs of volume quantify the variability of muscle size across subjects.

Moreover, we explored whether the $\overline{\mathrm{MAE}}$ of the predictions shows a linear or monotonic dependency on the CV values of muscle volume and ground truth parameters, using the Pearson (ρ_P) and Spearman (ρ_S) correlation coefficients.
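
For reference, the CV of Equation (5) and the two correlation coefficients can be computed as in the following pure-Python sketch. The sample standard deviation is assumed for the CV, the helper names are hypothetical, the numbers are toy values, and no significance test is computed.

```python
from statistics import mean, stdev

def coefficient_of_variation(values):
    """CV = S.D. / mean, as in Equation (5) (sample S.D. assumed)."""
    return stdev(values) / mean(values)

def pearson(x, y):
    """Pearson correlation coefficient: strength of a linear relationship."""
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den

def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    r = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for t in range(i, j + 1):
            r[order[t]] = (i + j) / 2 + 1
        i = j + 1
    return r

def spearman(x, y):
    """Spearman coefficient: Pearson correlation of the ranks (monotonic relationship)."""
    return pearson(ranks(x), ranks(y))

# Toy data: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5]
y = [1, 4, 9, 16, 25]
print(round(pearson(x, y), 4), spearman(x, y))
print(round(coefficient_of_variation([2, 4, 4, 4, 5, 5, 7, 9]), 4))
```

The toy data show why both coefficients are useful: a monotonic but non-linear relationship gives a Spearman coefficient of 1 while the Pearson coefficient stays below 1.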

3. Results

Tables 1–3 report the FF $\overline{\mathrm{MAE}}$ for the three workflows (WF1, WF2, and WF3), calculated for each muscle and each ML algorithm. Similarly, Tables 4–6 report the $\overline{\mathrm{MAE}}$ for wT2. The boxplots in Figure 4 show the FF and wT2 $\overline{\mathrm{MAE}}$ distributions for each muscle and workflow (WF1, WF2, and WF3). The discrepancy between the ground truth values and the ML-predicted values is expressed in percentage points (pp) for FF and in milliseconds (ms) for wT2.

Table 1. Workflow 1: evaluation of the predictive performance of the ML models: mean absolute discrepancy ( $\overline{\mathrm{MAE}}$ expressed in pp) between the muscle-wise fat fraction gold standard values from the Fatty Riot algorithm and the values predicted by the ML algorithms.

Table 2. Workflow 2: evaluation of the predictive performance of the ML models: mean absolute discrepancy ( $\overline{\mathrm{MAE}}$ expressed in pp) between the muscle-wise fat fraction gold standard values from the Fatty Riot algorithm and the values predicted by the ML algorithms.

Table 3. Workflow 3: evaluation of the predictive performance of the ML models: mean absolute discrepancy ( $\overline{\mathrm{MAE}}$ expressed in pp) between the muscle-wise fat fraction gold standard values from the Fatty Riot algorithm and the values predicted by the ML algorithms.

Table 4. Workflow 1: evaluation of the predictive performance of the ML models: mean absolute discrepancy ( $\overline{\mathrm{MAE}}$ expressed in ms) between the muscle-wise water T2 gold standard values from the EPG signal simulation algorithm and the values predicted by the ML algorithms.

Table 5. Workflow 2: evaluation of the predictive performance of the ML models: mean absolute discrepancy ( $\overline{\mathrm{MAE}}$ expressed in ms) between the muscle-wise water T2 gold standard values from the EPG signal simulation algorithm and the values predicted by the ML algorithms.

Table 6. Workflow 3: evaluation of the predictive performance of the ML models: mean absolute discrepancy ( $\overline{\mathrm{MAE}}$ expressed in ms) between the muscle-wise water T2 gold standard values from the EPG signal simulation algorithm and the values predicted by the ML algorithms.

Figure 4. FF and wT2 boxplots. Muscle-wise boxplots [first quartile (Q1) to third quartile (Q3), with the median shown as an orange line] (A) for FF (top), expressed in percentage points (pp), and (B) for wT2 (bottom), expressed in ms. Three boxplots are given for each muscle, corresponding to WF1 (blue), WF2 (green), and WF3 (red). The highest accuracy corresponds to the red dots (FF and wT2 boxplots), which represent the KNN prediction performance.

As can be inferred from the boxplots in Figure 4, each workflow resulted in a mean FF and wT2 prediction performance of ±20 pp and ±6 ms (averaged values) for the anterior compartment muscles and of ±15 pp and ±6 ms for the posterior compartment, respectively. Figure 5 shows the mean prediction performance, averaged over all calf muscles, for each ML algorithm and workflow. The KNN algorithm proved to be the best predictive model when combined with WF3, both for FF [ $\overline{\mathrm{MAE}}$ ±5 pp (S.D. 1.8 pp)] and for wT2 [ $\overline{\mathrm{MAE}}$ ±1.8 ms (S.D. 0.7 ms)]. By contrast, linear regression (LR) combined with WF2 showed the worst accuracy in estimating FF [±36 pp (S.D. 38.2 pp)] and wT2 [±10.9 ms (S.D. 9.4 ms)].

Figure 5. Algorithm-wise prediction performance for FF and wT2. (A) FF (top) and (B) wT2 (bottom) prediction performance, averaged over all muscles and shown as a function of the implemented ML algorithms. For each ML model, three mean prediction accuracies are shown, one per workflow: blue (WF1), green (WF2), and red (WF3).

Figure 6 reports the CV_i of FF and wT2 for each calf muscle. Similarly, the muscle volume CVs, reported in Figure 7, account for inter-subject muscle shape variability. The ground truth CVs range from 0.45 to 0.99 for FF and from 0.04 to 0.22 for wT2, whereas the volume CVs range from 0.30 to 0.42 (Figures 6, 7).

Figure 6. FF and wT2 gold standard boxplots. Muscle-wise boxplots (first quartile (Q1) to third quartile (Q3) and median value in orange line) (A) for FF (top) and (B) wT2 (bottom) gold standard values with CV listed in the legend.

Figure 7. Volume size boxplots. Muscle-wise volume boxplots (first quartile (Q1) to third quartile (Q3) and median value in orange line). Muscle-wise mean volume size is reported in round brackets on x-axis, CV is listed in legend.

Table 7 shows no significant correlation between the KNN $\overline{\mathrm{MAE}}$ and the CVs of either the ground truth or the volume values. Thus, KNN prediction appeared to be independent of inter-subject muscle shape variability (volume CVs) and of the ground truth variability ranges (CVs of FF and wT2). Furthermore, linear and monotonic correlations were also tested between the KNN $\overline{\mathrm{MAE}}$ and the mean muscle volume, to examine whether KNN prediction depends on calf muscle size. In our cohort, the mean volume values of the calf muscles were: S ≈ 1743.1 mm³, MG ≈ 987.5 mm³, LG ≈ 585.9 mm³, TA ≈ 458.4 mm³, ELD ≈ 295.8 mm³, Pe ≈ 534.6 mm³. The Pearson and Spearman coefficients showed no significant correlation for either the FF $\overline{\mathrm{MAE}}$ [ρ_P = 0.66 (0.22) and ρ_S = 0.52 (0.36)] or the wT2 $\overline{\mathrm{MAE}}$ [ρ_P = 0.12 (0.83) and ρ_S = 0.08 (0.87)]. Therefore, KNN prediction also appeared to be independent of calf muscle size.

Table 7. Pearson and Spearman correlation coefficients between volume CVs, ground truth CVs, and the KNN $\overline{\mathrm{MAE}}$ of the predicted neuromuscular parameters.

4. Discussion

In this study, we explored the possibility of predicting the fat fraction and water T2 of calf muscles in FSHD subjects starting from a conventional STIR sequence and applying three different workflows, which combine radiomics, dimensionality reduction methods, and ML models. To the authors' knowledge, this is the first attempt to predict qMRI parameters from STIR imaging, whereas radiomic feature extraction from STIR images has already been used with ML to classify disease groups or autoantibodies in patients with idiopathic inflammatory myopathies (IIMs) (42). The three workflows resulted in a comparable mean prediction performance of about ±20 pp for FF and about ±6 ms for wT2, with the exception of the LR and KNN models. According to our results, KNN was the best predictive model for both FF and wT2. More specifically, the algorithm-wise performance highlights the combination of KNN and WF3 as the best predictor for both FF (±5 pp) and wT2 (±1.8 ms). The muscle-wise analysis of the prediction performance also shows that the KNN mean prediction performance has almost no dependency on either the size of the muscles or inter-subject muscle shape variability. We investigated these hypotheses by calculating, for each muscle, the mean volume and the volume CV. Despite the differences in both the muscle-wise mean volume values and the volume CVs, no significant Pearson or Spearman correlation was found with the KNN $\overline{\mathrm{MAE}}$; KNN predicted wT2 and FF with a mean error of approximately ±1.8 ms and ±5 pp, respectively.

Furthermore, the combination of a small sample size and a high CV of the ground truth distributions may have negatively affected the ML training step and consequently compromised prediction performance. However, the KNN predictions appeared to have no dependency on the CV of the ground truth values used for training the ML algorithms. In contrast to the good predictive power of KNN, the worst-performing model was LR combined with WF2. We surmise that LR + WF2 may be unable to capture the complex relationship between the predictors and the target variable, as suggested by the wider error bars. The main limitation of the current study is related to STIR sequence artifacts, such as low-signal-intensity banding artifacts and high-signal-intensity areas without proper fat suppression (43), which may ultimately affect the FF prediction. Nevertheless, we used this non-uniform fat signal component to identify fat pixels in the image, which were used to extract the conventional radiomic features (WF1, WF2) and to define the FFG feature (WF3). Conversely, STIR imaging is particularly suitable for detecting muscle edema patterns (41), which may be easily captured by radiomic features. Furthermore, in all workflows this study focused on predicting the mean value of FF and wT2 averaged over the left and right sides. Since FSHD is an asymmetric muscular dystrophy, a more in-depth analysis that also takes into account the laterality of the ROIs could further improve the prediction model. Moreover, to expand the applicability of the current results, we aim to conduct further studies enrolling larger cohorts of subjects with different muscular dystrophies and exploring other skeletal muscle districts (e.g., paravertebral muscles).

In conclusion, our study showed that conventional STIR imaging can potentially be used to predict quantitative muscle MRI parameters by applying radiomics combined with ML models. In particular, the KNN algorithm combined with WF3 was the best predictor for both FF and wT2. The proposed radiomic workflows could contribute to a wider application of a relatively common imaging technique such as STIR to rapidly estimate quantitative parameters of skeletal muscle, without the need to acquire long and complex advanced qMRI sequences.

Data availability statement

The data analyzed in this study is subject to the following licenses/restrictions. Requests to access these datasets should be directed to Direzione Scientifica, dirsci@mondino.it.

Ethics statement

The studies involving human participants were reviewed and approved by the Ethics Committee of Pavia and the Ethics Committee of Fondazione Policlinico Universitario A. Gemelli. The patients/participants provided their written informed consent to participate in this study.

Author contributions

GC and LB: conceptualization, methodology, software, and writing—review and editing. MP: data curation, supervision, and writing—review and editing. MM, ER, and GT: resources and writing—review and editing. NB: software and review. GM: data curation. XD and FS: resources, writing—review and editing, and supervision. AM and SF: validation, writing—review and editing, and supervision. AP: supervision, project administration, and writing—review and editing. All authors contributed to the article and approved the submitted version.

Funding

This research was funded by the Ministry of Health, Italy [RC 1048 2017-2019], [RC 2020-2021], [RC 2022], and [RF 2016-02362914].

Acknowledgments

We thank the patients for their collaboration. The authors would also like to thank Alessandro Laio (SISSA Trieste, Italy) for his fruitful support in the development of workflow 2 and Chiara Bonizzoni for revising the segmentation maps.

Conflict of interest

FS receives consulting fees from Hoffmann-La Roche AG. AP has received honoraria for consultancy and Advisory Board participation from Sanofi Genzyme and Amicus Ther.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Author disclaimer

The content of this paper is in the PhD dissertation of one of the authors (GC): GC (2022) Artificial Intelligence, Mathematical Modeling and Magnetic Resonance Imaging for Precision Medicine in Neurology and Neuroradiology. [PhD dissertation]. [Pavia (PV)]: University of Pavia (44).

References

1. Paoletti M, Pichiecchio A, Cotti Piccinelli S, Tasca G, Berardinelli AL, Padovani A, Filosto M. Advances in quantitative imaging of genetic and acquired myopathies: clinical applications and perspectives. Front Neurol. (2019) 10:78. doi: 10.3389/fneur.2019.00078

2. Diaz-Manera J, Llauger J, Gallardo E, Illa I. Muscle MRI in muscular dystrophies. Acta Myologica. (2015) 34:2–3.

3. Carlier PG, Marty B, Scheidegger O, Loureiro de Sousa P, Baudin PY, Snezhko E, Vlodavets D. Skeletal muscle quantitative nuclear magnetic resonance imaging and spectroscopy as an outcome measure for clinical trials. J Neuromusc Dis. (2016) 3:1–28. doi: 10.3233/JND-160145

4. Hollingsworth KG. Quantitative MRI in muscular dystrophy: an indispensable trial endpoint? Neurology. (2014) 83:956–7. doi: 10.1212/WNL.0000000000000785

5. Costa F, Di Primio GA, Schweitzer ME. Magnetic resonance imaging of muscle disease: a pattern-based approach. Muscle Nerve. (2012) 46:465–81. doi: 10.1002/mus.23370

6. Andersen G, Dahlqvist JR, Vissing CR, Heje K, Thomsen C, Vissing J. MRI as outcome measure in facioscapulohumeral muscular dystrophy: 1-year follow-up of 45 patients. J Neurol. (2017) 264:438–44. doi: 10.1007/s00415-016-8361-3

7. Tawil R, Kissel JT, Heatwole C, Pandya S, Gronseth G, Benatar M. Evidence-based guideline summary: evaluation, diagnosis, and management of facioscapulohumeral muscular dystrophy: report of the Guideline Development, Dissemination, and Implementation Subcommittee of the American Academy of Neurology and the Practice Issues Review Panel of the American Association of Neuromuscular and Electrodiagnostic Medicine. Neurology. (2015) 85:357–64. doi: 10.1212/WNL.0000000000001783

8. Tawil R, Van Der Maarel SM, Tapscott SJ. Facioscapulohumeral dystrophy: the path to consensus on pathophysiology. Skeletal Muscle. (2014) 4:1–15. doi: 10.1186/2044-5040-4-12

9. Dahlqvist JR, Vissing CR, Thomsen C, Vissing J. Severe paraspinal muscle involvement in facioscapulohumeral muscular dystrophy. Neurology. (2014) 83:1178–83. doi: 10.1212/WNL.0000000000000828

10. Reimers CD, Schedel H, Fleckenstein JL, Nägele M, Witt TN, Pongratz DE, Vogl TJ. Magnetic resonance imaging of skeletal muscles in idiopathic inflammatory myopathies of adults. J Neurol. (1994) 241:306–14. doi: 10.1007/BF00868438

11. Mercuri E, Pichiecchio A, Counsell S, Allsop J, Cini C, Jungbluth H, Bydder G. A short protocol for muscle MRI in children with muscular dystrophies. Eur J Paed Neurol. (2002) 6:305–7. doi: 10.1053/ejpn.2002.0617

12. Gerevini S, Scarlato M, Maggi L, Cava M, Caliendo G, Pasanisi B, Morandi L. Muscle MRI findings in facioscapulohumeral muscular dystrophy. Eur Radiol. (2016) 26:693–705. doi: 10.1007/s00330-015-3890-1

13. Fatehi F, Salort-Campana E, Le Troter A, Bendahan D, Attarian S. Muscle MRI of facioscapulohumeral dystrophy (FSHD): A growing demand and a promising approach. Revue Neurologique. (2016) 172:566–71. doi: 10.1016/j.neurol.2016.08.002

14. Monforte M, Laschena F, Ottaviani P, Bagnato MR, Pichiecchio A, Tasca G, Ricci E. Tracking muscle wasting and disease activity in facioscapulohumeral muscular dystrophy by qualitative longitudinal imaging. J Cachexia Sarcopenia Muscle. (2019) 10:1258–65. doi: 10.1002/jcsm.12473

15. Wang LH, Johnstone LM, Bindschadler M. Adapting MRI as a clinical outcome measure for a facioscapulohumeral muscular dystrophy trial of prednisone and tacrolimus: case report. BMC Musculoskelet Disord. (2021) 22:56. doi: 10.1186/s12891-020-03910-1

16. Mercuri E, Pichiecchio A, Allsop J, Messina S, Pane M, Muntoni F. Muscle MRI in inherited neuromuscular disorders: past present and future. J Int Soc Mag Reson Med. (2007) 25:433–40. doi: 10.1002/jmri.20804

17. Fischer D, Kley RA, Strach K, Meyer C, Sommer T, Eger K, Olivé M. Distinct muscle imaging patterns in myofibrillar myopathies. Neurology. (2008) 71:758–65. doi: 10.1212/01.wnl.0000324927.28817.9b

18. Van den Heuvel A, Lassche S, Mul K, Greco A, San León Granado D, Heerschap A, et al. Facioscapulohumeral dystrophy transcriptome signatures correlate with different stages of disease and are marked by different MRI biomarkers. Sci Rep. (2022) 12:1426. doi: 10.1038/s41598-022-04817-8

19. Janssen B, Voet N, Geurts A, van Engelen B, Heerschap A. Quantitative MRI reveals decelerated fatty infiltration in muscles of active FSHD patients. Neurology. (2016) 86:1700–7. doi: 10.1212/WNL.0000000000002640

20. Locher N, Wagner B, Balsiger F, Scheidegger O. Quantitative water T2 relaxometry in the early detection of neuromuscular diseases: a retrospective biopsy-controlled analysis. Eur Radiol. (2022) 32:7910–7. doi: 10.1007/s00330-022-08862-9

21. Van Timmeren JE, Alkadhi CDT, Baessler B. Radiomics in medical imaging—“how-to” guide and critical reflection. Insights Imag. (2020) 11:1. doi: 10.1186/s13244-020-00887-2

22. Abdi H, Williams LJ. Principal component analysis. Wiley Interdiscip Rev Comput Stat. (2010) 2:433–59. doi: 10.1002/wics.101

23. Glielmo A, Zeni C, Cheng B, Csányi G, Laio A. Ranking the information content of distance measures. PNAS Nexus. (2022) 1:039. doi: 10.1093/pnasnexus/pgac039

24. Felisaz PF, Colelli G, Ballante E, Solazzo F, Paoletti M, Germani G, Pichiecchio A. Texture analysis and machine learning to predict water T2 and fat fraction from non-quantitative MRI of thigh muscles in Facioscapulohumeral muscular dystrophy. Eur J Radiol. (2021) 134:109460. doi: 10.1016/j.ejrad.2020.109460

25. Nioche C, Orlhac F, Boughdad S, Reuzé S, Goya-Outi J, Robert C, et al. LIFEx: a freeware for radiomic feature calculation in multimodality imaging to accelerate advances in the characterization of tumor heterogeneity. Cancer Res. (2018) 78:4786–9. doi: 10.1158/0008-5472.CAN-18-0125

26. Fedorov A, Beichel R, Kalpathy-Cramer J, Finet J, Fillion-Robin JC, Pujol S, Kikinis R. 3D Slicer as an image computing platform for the quantitative imaging network. Mag Resonan Imag. (2012) 30:1323–41. doi: 10.1016/j.mri.2012.05.001

27. Agosti A, Shaqiri E, Paoletti M, Solazzo F, Bergsland N, Colelli G, Pichiecchio A. Deep learning for automatic segmentation of thigh and leg muscles. Mag Res Mat Physics Biol Med. (2022) 35:467–83. doi: 10.1007/s10334-021-00967-4

28. Woolrich MW, Jbabdi S, Patenaude B, Chappell M, Makni S, Behrens T, Smith SM. Bayesian analysis of neuroimaging data in FSL. Neuroimage. (2009) 45:S173–86. doi: 10.1016/j.neuroimage.2008.10.055

29. Arpan I, Forbes SC, Lott DJ, Senesac CR, Daniels MJ, Triplett WT, Vandenborne K. T2 mapping provides multiple approaches for the characterization of muscle involvement in neuromuscular diseases: a cross-sectional study of lower leg muscles in 5–15-year-old boys with Duchenne muscular dystrophy. NMR Biomed. (2013) 26:320–8. doi: 10.1002/nbm.2851

30. Smith DS, Berglund J, Kullberg J, Ahlström H, Avison MJ, Welch EB. Optimization of fat-water separation algorithm selection and options using image-based metrics with validation by ISMRM fat-water challenge datasets. In: Proceedings of the 21st Annual Meeting of the International Society for Magnetic Resonance in Medicine, Salt Lake City, Utah. (2013) (Vol. 2413).

31. Weigel M. Extended phase graphs: dephasing RF pulses and echoes-pure and simple. J Mag Res Imag. (2015) 41:266–95. doi: 10.1002/jmri.24619

32. Santini F, Deligianni X, Paoletti M, Solazzo F, Weigel M, De Sousa PL, Bergsland N. Fast open-source toolkit for water T2 mapping in the presence of fat from multi-echo spin-echo acquisitions for muscle MRI. Front Neurol. (2021) 12:630387. doi: 10.3389/fneur.2021.630387

33. Jolliffe IT. Principal Component Analysis for Special Types of Data. New York: Springer (2002) (pp. 338-372).

34. Friedman J, Hastie T, Tibshirani R. The Elements of Statistical Learning (Vol. 1). New York: Springer Series in Statistics (2001).

35. Hoerl AE, Kennard RW. Ridge regression: Biased estimation for nonorthogonal problems. Technometrics. (1970) 12:55–67.

36. Tibshirani R. Regression shrinkage and selection via the lasso. J Royal Stat Soc Series B. (1996) 58:267–88.

37. Cover T, Hart P. Nearest neighbor pattern classification. IEEE Transact Inform Theory. (1967) 13:21–7.

38. Drucker H, Burges CJ, Kaufman L, Smola A, Vapnik V. Support vector regression machines. Adv Neural Inform Process Systems. (1996) 9:5.

39. Breiman L, Friedman JH, Olshen RA, Stone CJ. Classification and regression trees. Routledge. (2017).

40. Breiman L. Random forests. Mach Learn. (2001) 45:5–32.

41. Dahlqvist JR, Widholm P, Leinhard OD, Vissing J. MRI in neuromuscular diseases: an emerging diagnostic tool and biomarker for prognosis and efficacy. Ann Neurol. (2020) 88:669–81. doi: 10.1002/ana.25804

42. Nagawa K, Suzuki M, Yamamoto Y, Inoue K, Kozawa E, Mimura T, et al. Texture analysis of muscle MRI: machine learning-based classifications in idiopathic inflammatory myopathies. Sci Rep. (2021) 11:9821. doi: 10.1038/s41598-021-89311-3

43. Ulbrich EJ, Sutter R, Aguiar RF, Nittka M, Pfirrmann CW. STIR sequence with increased receiver bandwidth of the inversion pulse for reduction of metallic artifacts. AJR Am J Roentgenol. (2012) 199:W735–42. doi: 10.2214/AJR.11.8233

44. Colelli G. Artificial Intelligence, Mathematical Modeling and Magnetic Resonance Imaging for Precision Medicine in Neurology and Neuroradiology. [PhD dissertation]. [Pavia (PV)]: University of Pavia (2022). Available online at: https://hdl.handle.net/11571/1468414 (accessed January 20, 2023).

Keywords: radiomics, machine learning, muscle MRI, STIR, FSHD

Citation: Colelli G, Barzaghi L, Paoletti M, Monforte M, Bergsland N, Manco G, Deligianni X, Santini F, Ricci E, Tasca G, Mira A, Figini S and Pichiecchio A (2023) Radiomics and machine learning applied to STIR sequence for prediction of quantitative parameters in facioscapulohumeral disease. Front. Neurol. 14:1105276. doi: 10.3389/fneur.2023.1105276

Received: 22 November 2022; Accepted: 30 January 2023;
Published: 24 February 2023.

Edited by:

Jordi Diaz-Manera, University of Newcastle, United Kingdom

Reviewed by:

Teresa Gerhalter, University Hospital Erlangen, Germany
Jorge Alonso-Pérez, Hospital Santa Creu i Sant Pau, Spain
José Verdú-Díaz, Newcastle University, United Kingdom

Copyright © 2023 Colelli, Barzaghi, Paoletti, Monforte, Bergsland, Manco, Deligianni, Santini, Ricci, Tasca, Mira, Figini and Pichiecchio. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Giulia Colelli, giulia.colelli@mondino.it; giuliacolelli693@gmail.com

^†These authors share first authorship

^‡These authors share last authorship
