- 1Department of Radiology, Jing’an District Center Hospital of Shanghai, Fudan University, Shanghai, China
- 2Department of Neurology, Huashan Hospital, Fudan University, Shanghai, China
- 3Department of Neurology, Jing’an District Center Hospital of Shanghai, Fudan University, Shanghai, China
Objectives: This study aimed to assess the feasibility of a machine learning-based radiomics tools to discriminate between Limb-girdle muscular dystrophy R2 (LGMDR2) and immune-mediated necrotizing myopathy (IMNM) using lower-limb muscle magnetic resonance imaging (MRI) examination.
Methods: After institutional review board approval, 30 patients with genetically proven LGMDR2 (12 females; age, 34.0 ± 11.3) and 45 patients with IMNM (28 females; age, 49.2 ± 16.6) who underwent lower-limb MRI examination including T1-weighted and interactive decomposition water and fat with echos asymmetric and least-squares estimation (IDEAL) sequences between July 2014 and August 2022 were included. Radiomics features of muscles were obtained, and four machine learning algorithms were conducted to select the optimal radiomics classifier for differential diagnosis. This selected algorithm was performed to construct the T1-weighted (TM), water-only (WM), or the combined model (CM) for calf-only, thigh-only, or the calf and thigh MR images, respectively. And their diagnostic performance was studied using area under the curve (AUC) and compared to the semi-quantitative model constructed by the modified Mercuri scale of calf and thigh muscles scored by two radiologists specialized in musculoskeletal imaging.
Results: The logistic regression (LR) model was the optimal radiomics model. The performance of the WM and CM for thigh-only images (AUC 0.893, 0.913) was better than those for calf-only images (AUC 0.846, 0.880) except the TM. For “calf + thigh” images, the TM, WM, and CM models always performed best (AUC 0.953, 0.907, 0.953) with excellent accuracy (92.0, 84.0, 88.0%). The AUCs of the Mercuri model of the calf, thigh, and “calf + thigh” images were 0.847, 0.900, and 0.953 with accuracy (84.0, 84.0, 88.0%).
Conclusion: Machine learning-based radiomics models can differentiate LGMDR2 from IMNM, performing better than visual assessment. The model built by combining calf and thigh images presents excellent diagnostic efficiency.
1. Introduction
Limb-girdle muscular dystrophy R2 (LGMDR2) (previously called LGMD2B) is genetically determined muscular dystrophy with autosomal recessive transmission due to mutations in the DYSF gene leading to sarcolemma repair abnormalities and secondary inflammatory activation (1). Its clinical features predominantly include progressive weakness and atrophy of proximal limb muscles (2). Immune-mediated necrotizing myopathy (IMNM) is a lately identified subtype of idiopathic inflammatory myopathies (IIMs), which are a group of autoimmune diseases usually characterized by auto-antibodies including 3-hydroxy-3-methylglutaryl-coenzyme A reductase (HMGCR) or the signal recognition particle (SRP) (3). Moreover, its primary clinical manifestation is rapidly progressive muscle weakness (3). In recent years, many researchers noticed that LGMDR2 most frequently mimics IMNM because they both have onset in adulthood and have similar clinicopathological features, including progressive proximal muscle weakness, elevated creatine kinase (CK) and scattered myonecrosis or inflammatory infiltrates on a muscle biopsy, which leads quickly to clinical misdiagnosis (4, 5). Nevertheless, they differ in their management. Patients with LGMDR2 depend on supportive care only, while IMNM patients respond to early aggressive immunotherapies (3, 6). Thus, timely and accurate discrimination between two diseases is essential for predicting clinical courses and making treatment decisions.
Muscle MRI is capable of clearly demonstrating muscle edema, atrophy or fat replacement, becoming an indispensable tool for diagnosing myopathies, and therefore it contributes to researching the particular pattern of muscle involvement in various muscle diseases (7–9). Although previous studies have compared characteristic muscle MRI performance in LGMDs-and IIM-patients (5, 10, 11), the similarity and complexity of these patterns pose great challenges for radiological diagnosis. Mercuri et al. (12) published the Mercuri scale that has been frequently used to grade intramuscular fatty infiltration based on conventional T1-weighted sequence, but the method depends on the human visual system, which is subjective and does not enable quantification. Thus, the quantitative assessments with MRI need to be considered in the early diagnosis and further monitoring of myopathies.
Radiomics is a novel and attractive data analysis technique that can mine a mass of high-throughput quantitative image features from standard medical imaging, reflecting subtle pathophysiologic features often imperceptible to humans (13, 14). With an increasing incidence of muscle diseases and a further understanding of the relationship between muscle imaging and pathophysiology currently, attention has gradually been paid to radiomics analysis of musculoskeletal imaging (15). To date, a few studies have applied radiomics assessments to myopathy, which has shown promising results regarding the ability of this technique to help in the differentiation between neurogenic and myogenic diseases using muscle ultrasound (16), or in the diagnosis of IIM (11) and inherited myopathy using muscle MRI (16).
However, studies are yet to differentiate between LGMDR2 and IMNM by exploiting radiomics analysis. We hypothesize that the MRI-based radiomics tool could discriminate LGMDR2 from IMNM. The purpose of this study was to establish machine learning-based MRI radiomics models and evaluate the capability of these to distinguish between LGMDR2 and IMNM, as well as compare with the Mercuri model based on the assessment of radiologists.
2. Materials and methods
2.1. Patients
This retrospective study was approved by the Institutional Review Board of Huashan Hospital, Fudan University (KY2019-409), and exempt from informed consent. Data from LGMDR2 or IMNM patients who underwent lower-limb muscle MRI in our Radiology Department between July 2014 and August 2022 was retrieved. Diagnoses of LGMDR2 were determined based on clinical evaluation and genetic testing confirmed by pure or compound heterozygous mutation in the DYSF gene. For diagnoses of IMNM, results of clinical judgment, antibody testing, and muscle biopsy were used, as suggested by the European Neuromuscular Centre International Workshop diagnostic criteria (17). Patients who underwent calf and thigh MR examination including T1-weighted sequence and T2-weighted IDEAL sequence, were enrolled. The main exclusion criteria were as follows: (I) insufficient diagnosis data; (II) high grade of muscle atrophy (difficulty in segmentation); (III) lacking images of thigh or calf; (IV) incomplete sequences. As illustrated in Figure 1, 30 patients were included in the LGMDR2 group, and 45 patients were enrolled in the IMNM group after careful selection.
2.2. MRI acquisitions
All MRI examinations were performed by a 3.0-Tesla scanner (Discovery MR750; GE Healthcare, Chicago, Illinois) equipped with a GE 8-channel body coil. The lower-limb muscle imaging protocol included an axial T1-weighted FSE sequence (flip angle: 111°; repetition time/echo time: 676/14 ms; acquisition matrix: 320 × 224; field of view [FOV]: 380 × 228 mm; slice thickness/spacing: 6/18 mm) and an axial T2-weighted FSE-IDEAL sequence (flip angle: 111°; repetition time/echo time: 2545/85 ms; acquisition matrix: 320 × 256; FOV: 380 × 228 mm; slice thickness/spacing: 6/18 mm). The thigh images were obtained at levels from the lesser trochanter through the femoral condyles, and the calf images were obtained at levels from the tibiofibular joint to the soleus myotendinous junction. FSE-IDEAL sequence, an innovative 3-point Dixon technique, effectively separates the fat signal from the water signal to reconstruct water-only and fat-only images (18).
2.3. Image assessment and segmentation
Two radiologists (Reader 1 and 2) with over 10 years of experience in musculoskeletal MRI participated in slice selection and segmentation using ITK-SNAP (version 3.8.0; www.itksnap.org) software. Muscle MR images from each patient have been imported into the software, and then the water-only images were automatically aligned to T1-weighted images. For each subject, three slices were selected from the proximal, middle, and distal bilateral thighs, respectively, and two from the proximal and middle of the bilateral calves. This selection ensured that a two-dimensional region of interest (ROI) would involve the following muscles: thigh: rectus femoris (RFEM), vastus medialis (VME), vastus intermedius (VIN), vastus lateralis (VLA), sartorius (SAR), gracilis (GR), adductor longus (AL), adductor magnus (AM), semimembranosus (SMB), semitendinosus (SMT), biceps femoris (HBF); and calf: tibialis anterior (TA), peroneal muscle (PER), tibialis posterior (TP), soleus (SOL), gastrocnemius (medial head [GASM] and lateral head [GASL]). With the water-only images as references, they individually and manually segmented the ROI on the T1-weighted images of every patient that covered the global muscle area of each selected slice and excluded the epimysium (Figure 2). Reyngoudt et al. reported that the MRI fat fraction analysis with global muscle segment in the calf or thigh has shown the capability to be nearly as sensitive as individual muscle ROIs in most muscular diseases, which is an easier and faster segmentation method (19). Subsequently, the delineated ROIs based on the T1-weighted images were copied to the same location of the water-only images. And minor revision was performed to the ROIs to erase unnecessary regions, and add missed muscle areas on the water-only images. The ROIs of two regions, including thigh and calf ROIs, were then obtained. Reader 1 performed muscle segmentation on the same image slices again with a minimum interval of 2 months. The intraclass correlation coefficient (ICC) were calculated to evaluate the reliability and repeatability of features by different segmentation.
Figure 2. Lower-limb muscle MR image selection and segmentation. (A,B) Two images showed the coronal T1-weighted calf and thigh images. For each patient, five slices in different areas were selected for radiomics analysis: three slices of thighs and two slices of calves for each sequence (T1-weighted and water-only). The middle showed the selected slices of axial T1-weighted and water-only images, respectively. (C,D) The global area of the muscles in each selected slice was segmented as calf or thigh ROI (red-shaded area), excluding the epimysium.
The two radiologists individually evaluated the T1-weighted images of the above-mentioned muscles and graded muscular fatty infiltration based on the modified Mercuri scale score (12, 20, 21) as follows: score 0, normal appearance; score 1, punctate T1 hyperintense lesion; score 2, scattered T1 hyperintense lesion (<30% muscle bulk); score 3, small confluent lesion of T1 hyperintensity (30–60% muscle bulk); score 4, large confluent lesion of T1 hyperintensity (>60% muscle bulk) and score 5, diffuse T1 hyperintense in the global muscle. Both radiologists were blinded to the clinical information (e.g., name, age, sex, clinical diagnosis), and evaluations were performed at the same levels where the ROIs were segmented on the T1-weighted images.
2.4. Radiomics feature extraction
Before the feature extraction, all MR images were resampled to the same resolution (3 × 3 × 3 mm) with linear interpolation to avoid data heterogeneity bias. Radiomics features were extracted from the thigh and calf ROIs of the T1-weighted and water-only images, respectively, using the “PyRadiomics” package in Python 3.9.13 (www.python.org). For each patient, 2,024 radiomics calf or thigh features were extracted from calf-only or thigh-only ROIs (each ROI including 1,012 T1-weighted and 1,012 water-only features, respectively). An early fusion was used to combine T1-weighted or water-only features of calf and thigh images into “calf + thigh” combined features (22). The extracted features were divided into three sets (23, 24): (1) First-order features describe the distribution of voxel intensities. (2) Texture features represent the second and high-order spatial heterogeneity of the intensity level containing gray level dependence matrix (GLDM), gray level co-occurrence matrix (GLCM), gray level run length matrix (GLRLM), gray level size zone matrix (GLSZM), and neighborhood gray-tone difference matrix (NGTDM). (3) And transform features from three filters including local binary pattern (LBP), gradient and wavelet. The gradient filter was to replace all voxel values in an area with the local gradient value of the image (e.g., maximum-minimum value). And the LBP features represent a comparison of central pixels and their surrounding pixels. For wavelet transforms, each image was transformed in the x, y, and z directions using a low or high band pass filter.
2.5. Feature selection
After all the features were normalized by the Z score, the selection was performed in the following four steps, illustrated in Figure 3. Firstly, only features with ICC ≥ 0.81 (excellent stability) were retained. Next, Mann Whitney U-test was conducted to choose the significant features with a p-value less than 0.05 (Figure 3A). Then, for features with high repeatability, Spearman’s rank correlation coefficient was also used to calculate the correlation between features (Figure 3B), and only one of the features with a correlation coefficient greater than 0.9 between any two features is retained (25). Finally, the most minor absolute shrinkage and selection operator (LASSO) regression model with 10-fold cross-validation was used to select the remaining features. Depending on the regulation weight *λ*, LASSO shrinks all regression coefficients toward zero and precisely sets the coefficients of many irrelevant features to zero (26) (Figures 3C,D). The retained features with non-zero coefficients were retained for regression model fitting. Subsequently, we obtained the most representative features for each patient which were used to establish classification models (Table 1).
Figure 3. The process of radiomics feature selection in the “calf + thigh” images for the TM. (A) The statistical distribution of the extracted radiomics features. The p-value > 0.05 of radiomics features was removed. (B) The heatmap of radiomics features after correlation filtering. The correlation coefficient [C] ranges between −1 and +1. The higher the C, the more redundant the feature was. (C) The coefficient profile plot and (D) the 10-fold cross-validation plot of the LASSO regression model. The vertical lines on (C,D) determine the optimal *λ*.
2.6. Model building and evaluation
All patients are divided into a training and a test cohort by a 7:3 ratio. Different radiomics models were developed and tested, respectively, to differentiate LGMDR2 and IMNM based on the following four machine learning classification algorithms: LR, k-Nearest Neighbors (kNN), random forest (RF), and eXtreme Gradient Boosting (XGBoost). The radiomics T1-weighted sequence model (TM), the water-only sequence model (WM), and the “combined” sequence model (the fusion of T1-weighted and water-only sequences, CM) were built with calf, thigh and “calf + thigh” muscular images separately based on different selected features. The Mercuri diagnosis model for differentiating LGMDR2 and IMNM were built with the four machine learning classifiers by the Mercuri score in each muscle of the calf-only, thigh-only and “calf + thigh” images, respectively. Their respective area under the receiver operating characteristic (ROC) curve (AUC) values and accuracy (ACC) were used to evaluate the performance of each model. And the machine learning classifier with the highest AUC of the test cohort was used to construct the optimal prediction models. The AUC, accuracy, sensitivity, specificity, and F1 measure were used to evaluate the classification performance and goodness of fit of the various models. All model building was developed in Python.1
2.7. Statistical analysis
All statistical analysis were performed using SPSS (version 23.0; SPSS Inc., Chicago, IL, United States). Differences in fatty infiltration scores for each muscle of thighs and calves between LGMDR2 and IMNM based on the Mercuri scale scores were assessed in terms of the Mann Whitney U-test. A statistically significant difference was considered at p < 0.05. ICC was used to evaluate intra-observer and inter-observer agreements for radiomics features obtained from different segmentation, interpreted as follows: 1.0, perfect; 0.81–0.99, almost perfect; 0.61–0.80, substantial; 0.41–0.60, moderate; 0.21–0.40, fair; and 0.20 or less, slight agreement (27). Inter-observer agreement of Mercuri scale assessment was performed using Cohen’s kappa analysis, and the kappa value >0.80 was considered to represent almost perfect agreement (28).
3. Result
3.1. Patient characteristics
Thirty patients with genetically proven LGMDR2 and 45 with IMNM (15 with the anti-SRP autoantibody, 15 with the anti-HMGCR autoantibody, and 15 with seronegative IMNM) were selected from one hundred patients who underwent muscle MRI examinations for myopathy between July 2014 and August 2022. Patients with LGMDR2 were younger than those with IMNM (p < 0.024; LGMDR2 patients: median ± standard deviation, 34.03 ± 11.266; IMNM patients: median ± standard deviation; 49.20 ± 16.583). The IMNM patients showed a greater female prevalence than the LGMDR2 patients, while there was no significant difference between the two diseases (p < 0.059). LGMDR2 had the most apparent fat infiltration in the posterior muscle group of the thigh and calf. In IMNM, the posterior group of thigh muscle was also more heavily affected by fat than any other muscle group. Most of the muscles in the thighs and calves except sartorius, gracilis and tibialis posterior showed higher scores of fat substitution in LGMDR2 than IMNM (p < 0.05).
3.2. Intra-observer and Inter-observer agreement evaluation
Mercuri scores of each muscle graded separately by two radiologists showed almost perfect agreement, and the Kappa value ranged from 0.826 to 0.933. Features derived from the ROIs segmented separately by two radiologists showed almost perfect agreement, and the mean ICC values in the intra-observer agreement ranged from 0.895 to 0.917 and from 0.833 to 0.873 in the inter-observer reproducibility test (Table 2). Thus, all statistical analyses were based on the results of the first feature extraction segmented by reader 1. For calf-only ROIs, 834 T1-weighted and 874 water-only features had excellent intra-reader and inter-reader reproducibilities (ICC ≥ 0.81), respectively, with 770 T1-weighted and 800 water-only features for thigh-only ROIs.
3.3. Radiomics feature selection and optimal machine learning algorithm
In this study, radiomics features were extracted from the different ROIs. After excluding features with ICC < 0.81, p-value > 0.05 in U-test statistical test (Figure 3A) and Pearson correlation coefficients >0.9, a heatmap of selected features depicts little redundancy (Figure 3B). All features with non-zero coefficients were chosen for model building after the LASSO regression analysis, and then 16 T1-weighted and 9 water-only features were finally retained for calf-only ROIs, 8 and 14 for thigh-only ROIs, respectively (Table 1). The AUC and accuracy of the radiomics TM, WM, CM and the Mercuri model constructed by four machine learning algorithms are listed in Table 3. The LR algorithm performed the highest AUC (0.953) and accuracy (92.0%) in the TM for “calf + thigh” images. Therefore, LR was regarded as the optimal machine-learning algorithm for subsequent model analysis. Also, the LR classifier is the best option for Mercuri model. Figure 3 shows the feature selection process in the TM for “calf + thigh” images. And the positive coefficients features contribute most to the best model after LASSO analysis.
3.4. Performance and clinical application of different models
A radiomics TM, WM along with CM, and the Mercuri model were built for calf, thigh and “calf + thigh” images separately. The performances of the radiomics and Mercuri models are illustrated in Figure 4. For calf images, the TM achieved a sensitivity of 75.0% with an accuracy of 87.0%, the WM 80.0% with 81.6%, the CM 80.0% with 88.0%, and the Mercuri model 70.0% with 84.0%. For thigh images, the TM achieved a sensitivity of 90.0% with an accuracy of 80.0%, the WM 80.0% with 88.0%, the CM 90.0% with 88.0%, and the Mercuri model 80.0% with 84.0%. For calf and thigh images, the TM achieved a sensitivity of 100% with an accuracy of 92.0%, the WM 90.0% with 84.0%, the CM 90.0% with 88.0%, and the Mercuri model 100% with 88.0%. Based on the ROC analysis, the performance of the radiomics models for the calf images was moderate, and the AUCs of the TM, WM, and CM were 0.892, 0.846, and 0.880. The models for thigh images showed good performance (AUCTM 0.827; AUCWM 0.893; AUCCM 0.913). The models for the “calf + thigh” images had better performance than those for calf and thigh images (AUCTM 0.953; AUCWM 0.907; AUCCM 0.953). The AUCs of the Mercuri model for the calf, thigh, and “calf + thigh” images were 0.847, 0.900, and 0.953 (Figure 5).
Figure 4. The performances of all models. The accuracy, sensitivity, specificity, and F1 measures of the prediction models were based on the calf, thigh and “calf + thigh” muscular images.
Figure 5. The discrimination of radiomics and the Mercuri scale model. (A–C) Graphs show Receiver operating characteristic (ROC) curves and area under curve (AUC) of the three radiomics models for calf, “calf + thigh” images. (D) The ROC and AUCs of the Mercuri model for calf, thigh and “calf + thigh” images.
4. Discussion
The present study proves that radiomics analysis of MR images can be employed in calf, thigh and “calf + thigh” muscles to discriminate two myopathies, LGMDR2 and IMNM, that can be misdiagnosed in clinical practice. Based on several machine learning classification algorithms, the various MRI-based radiomics models were constructed to differentiate these two diseases, and the LR classifier was selected as the optimal machine learning model with the highest accuracy and AUC. We compared the performance of the TM, WM, and CM based on the LR model, and the models for the integration of calf and thigh muscular images showed better performance than those for calf-only or thigh-only images. Among these, TM for “calf + thigh” images presented the best value. Moreover, the AUC values of the radiomics models were higher or equal to those of the Mercuri model obtained from a visual analysis of the radiologists, revealed higher discrimination efficiency and gained great clinical net benefits, which can help clinical differential diagnosis of these two diseases. This study identifies that radiomics could be applied to muscle MRI to diagnose myopathies, even with limited cases. At present, our study is the only one that uses machine learning-based radiomics analysis of lower-limb muscle MRI to differentiate LGMDR2 and IMNM.
Muscle MRI has been developed as an essential tool for diagnosing myopathies because it can be used either for global assessment of each muscle or for compositional assessment noninvasively to assess the characteristic radiological patterns (8, 10). Traditionally, T1-weighted sequence and STIR sequence were used for muscle fatty infiltration and edema in the MRI evaluation of myopathy (29, 30). Previous studies on using the Mercuri scale based on the T1-weighted sequence to identify the different patterns of myopathies suggested that this visual scoring could be advantageous in conveniently applied in clinical practice, especially in primary care hospitals (8, 31). Some studies reported selectively involved pattern characterization of muscles evaluated by the Mercuri scale in LGMDR2 and IMNM patients (8, 9) and compared the distributions and characteristics of intramuscular fatty infiltration of lower limbs between these two diseases (5). However, different muscular disorders may have various patterns and degrees of muscular fatty infiltration, edema or atrophy (7). Therefore, the diversity of patterns and the similarities in many patterns can complicate the radiological diagnosis and cause limited inter-observer reproducibility, which could be affected by the skill of each radiologist (32). And Verdú-Díaz et al. built machine learning models using the Mercuri score to classify 10 different MDs, and the models showed almost perfect accuracy (33). Compared to this study, the main strength of radiomics technique using MRI data is that subjective visual evaluation is not required for its analysis. Although some authors believe that the radiomics approach may achieve excellent performance in the radiologic diagnosis of muscles, this method has seldom been applied to the study of myopathies (34, 35). To our knowledge, our results imply that the radiomics features seem to include more details contributing to the further diagnosis of muscular pathological changes than visual analysis.
Radiomics analysis of MR images is a noninvasive tool that can reveal the underlying pathophysio-logical changes. Several authors have confirmed the potential of radiomics analysis of muscle MRI to be used to diagnose various muscular diseases. In experimental studies of animal models with muscle disorders, radiomics analysis could be used to distinguish mouse models covering different muscular dystrophic phenotypes (36), and radiomics features enabled differentiation between the normal and the golden retriever muscular dystrophy dogs with different disease progression (37). Regarding human subjects, Lee et al. demonstrated a correlation between the radiomics features of T1-weighted images and fat fraction obtained by an MRI quantitative evaluation method, the DIXON technique. Furthermore, these features differed between Charcot–Marie-Tooth patients and controls (34). Akinci et al. reported that the radiomics models of various muscles performed better than the fat fraction (FF) models to differentiate cerebral palsy from healthy children to show that the radiomics features of muscles are more sensitive than FF (38). Nagawa et al. classified different disease subgroups in idiopathic inflammatory myopathies (IIMs) patients, including dermatomyositis, amyopathic dermatomyositis and polymyositis based on a machine learning model constructed by muscle MRI radiomics features to conclude that machine learning-based radiomics model of muscle MRI is a valuable objective tool for radiologic differential diagnosis (11).
Identifying optimal machine learning algorithms is crucial to ensure the high efficiency and reliability of radiomics models; thus, we trained four different classifiers in this study. LR performed best of all classifiers because complex models needed more training samples (39). The optimal machine learning classifiers built the TM, WM, and CM. Water-only images from the IDEAL technique rather than the conventional STIR sequence were included and extracted features in our study. Compared to the commonly performed STIR sequence, water-only images reconstructed by FSE-IDEAL allow equal or better fat suppression (40), which is more beneficial for assessing edema. Carlier and Quijano-Roy highlighted that T1-weighted images are comparable to the fat-only images obtained with the Dixon method, giving similar qualitative information (41). In addition, we used T1-weighted rather than fat-only images in our study because of a better comparison between the radiomics analysis and the visual grade system based on the Mercuri scale. Therefore, we have chosen the T1-weighted sequence and water-only sequence according to the data available to us. The radiomics features in each model were from calf, thigh and “calf + thigh” images, respectively. The difference in the distribution and severity of muscular fatty infiltration and atrophy between LGMDR2 and IMNM can be clearly shown in the T1-weighted sequence (5). Water-only images, including the water signal without fat interference, can exhibit edema of muscle and muscular fascia (18). As for the calf-only images, the accuracy of the TM, WM, and CM constructed based on LR model were 87.0, 81.6, and 88.0% in differentiating LGMDR2 from IMNM. The corresponding AUCs were 0.892, 0.846, and 0.880, respectively. As for the thigh-only images, the CM, TM, and WM accuracy was 80.0, 88.0, and 88.0% in differentiating LGMDR2 from IMNM. The corresponding AUCs were 0.827, 0.893, and 0.913, respectively. As for the integration of calf and thigh images, the accuracy of the TM, WM, and CM constructed based on LR model was 92.0, 84.0, and 88.0% in differentiating LGMDR2 from IMNM. The corresponding AUCs were 0.953, 0.907, and 0.953, respectively. In comparison, the accuracy of the Mercuri model for calf, thigh, and “calf + thigh” images were 84.0, 84, and 88%, while the corresponding AUCs were 0.847, 0.900, and 0.953. The results demonstrated that the radiomics models had higher or equal diagnostic performance, compared with Mercuri model constructed based on visual analysis, but this radiomics tool achieved more diagnosis accuracy, consistent with the study of Lee et al. (34). The results also suggested that compared with the water-only sequence, the T1-weighted sequence of calf-only or “calf + thigh” muscles had higher potential application in distinguishing these two diseases. Perhaps this is because some patients with LGMDR2 were not at the early stages of the disease, while calf muscles were involved by fatty infiltration and acute muscle edema in thigh muscles were partial relief. Yang et al. reported that patients with LGMDR2 had a longer interval of onset to first visit than patients with IMNM (5). Besides, some residual fat signal in water-only images may result in mistaking the fat signal for an edema signal (Figure 2), making it difficult to rely on muscle edema to identify the two diseases, which could explain the lower AUC of the water-only model in this study.
Moreover, this study further confirmed the requirement to analyze muscle MR images in two areas: thighs and calves. Thus, for calf-only or thigh-only images, the performance of the models is unsatisfactory. Although with a deeper understanding of LGMDR2 and IMNM recently, the result of our radiomics models and the Mercuri analysis reflects diagnostic difficulties, even for musculoskeletal imaging specialists, in distinguishing the two diseases based on MR images acquired from only the calf or only the thigh area without the help of clinical information. MR muscle images of a single area could have low accuracy, particularly in myopathies with minimal fatty infiltration or edema. In addition, the performance of the WM and CM for thigh-only images (AUC 0.893, 0.913) was better than those for calf-only images (AUC 0.846, 0.880) except the TM, and the reason might be the different patterns of muscle involvement between LGMDR2 and IMNM. A proximal lower-limb dominant involvement was reported in IMNM patients (42), while LGMDR2 patients involved at least one calf muscle (8). Also, muscle edema with the adductor magnus was evident in IMNM patients, but often absent in LGMDR2 patients (5). Thus, this result emphasized again that combined calf and thigh images to analyze myopathies could provide more diagnostic information.
Several limitations exist in this study. Firstly, the main limitation is that the sample was a small and imbalanced number and should be increased to build a more generalizable radiomics model. Also, further research is to detect whether radiomics analysis could be applied to differentiate a variety of myopathies besides LGMDR2 and IMNM. Secondly, the patients need help to stand the long image acquisition process, makes it difficult to perform whole-body MRI, which forces us to lose a large amount of muscle data from other areas, especially the body trunk and shoulders. Thirdly, considering clinical, serological, and pathological data of myopathies may have a large capacity for diagnosis, it is crucial to build a comprehensive classification model based on the combination of radiomics features and these data. Moreover, the visual score of the degree of muscle edema that might improve the efficacy of the visual model should have been considered. Fourthly, we did not screen these two myopathies according to the duration and severity of the disease. Thus, patients with advanced myopathy with marked fatty replacement or edema may affect the reproducibility of the radiomic model. To reach a practical utility, our radiomics model should be further trained with the various degrees of fatty infiltration or edema of muscle, respectively. Then, although this radiomics tool may differentiate LGMDR2 from the IMNM patients, its role in differentiating the two entities is limited. Because fatty replacement is common in many hereditary myopathies, including various LGMD subtypes. Lastly, this radiomics model did not rely on visual assessment, but it relied on subjective manual segmentation, which may be avoided by further using automatic muscle segmentation.
5. Conclusion
In conclusion, the radiomics analysis using the combination of calf and thigh muscle MR images can effectively differentiate LGMDR2 and IMNM, and the models constructed by the LR machine learning classifier has the highest AUC, with better performance than Mercuri visual grade system. The approach should be further applied to a large cohort of patients with a broader range of myopathies to optimize and validate this suggested model.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by the Institutional Review Board of Huashan Hospital, Fudan University (KY2019-409). The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because our study is the retrospective study, and we only used their MRI images.
Author contributions
PW: conceptualization, investigation, and writing—original draft. HZ: data curation and writing—review and editing. QX: conceptualization, methodology, and funding acquisition. JL: methodology and visualization. SL: resources and methodology. XG: data curation. ZL: supervision, funding acquisition, and writing—review and editing. DY: conceptualization, data curation, and writing—review and editing. All authors contributed to the article and approved the submitted version.
Funding
This research was funded by Medical Discipline Construction Plan of Shanghai Jing’an District Health Commission (grant number 2021PY01), special project of medical innovation research of Shanghai Science and Technology Innovation Action Plan (grant number 21Y11910600), the Scientific Research Foundation of Shanghai Municipal Health Commission (grant number 202140244), and the Medical Scientific Research Foundation of Jing’an District (grant number 2019MS01).
Acknowledgments
We sincerely thank the OnekeyAI platform and its developers for Code consultation of the study.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Footnotes
References
1. Han, R. Muscle membrane repair and inflammatory attack in dysferlinopathy. Skelet Muscle. (2011) 1:10. doi: 10.1186/2044-5040-1-10
2. Urtizberea, JA, Bassez, G, Leturcq, F, Nguyen, K, Krahn, M, and Levy, N. Dysferlinopathies. Neurol India. (2008) 56:289–97. doi: 10.4103/0028-3886.43447
3. Lundberg, IE, Fujimoto, M, Vencovsky, J, Aggarwal, R, Holmqvist, M, Christopher-Stine, L, et al. Idiopathic inflammatory myopathies. Nat Rev Dis Primers. (2021) 7:86. doi: 10.1038/s41572-021-00321-x
4. Selva-O’Callaghan, A, Trallero-Araguás, E, Milisenda, JC, and Grau-Junyent, JM. Differential diagnosis of necrotizing myopathy. Curr Opin Rheumatol. (2021) 33:544–53. doi: 10.1097/BOR.0000000000000836
5. Yang, M, Ji, S, Xu, L, Zhang, Q, Li, Y, Gao, H, et al. The Clinicopathological distinction between immune-mediated necrotizing myopathy and limb-girdle muscular dystrophy R2: key points to prevent misdiagnosis. J Clin Med. (2022) 11:6566. doi: 10.3390/jcm11216566
6. Nigro, V, Aurino, S, and Piluso, G. Limb girdle muscular dystrophies: update on genetic diagnosis and therapeutic approaches. Curr Opin Neurol. (2011) 24:429–36. doi: 10.1097/WCO.0b013e32834aa38d
7. Aivazoglou, LU, Guimarães, JB, Link, TM, Costa, MAF, Cardoso, FN, de Mattos Lombardi Badia, B, et al. Mr imaging of inherited myopathies: a review and proposal of imaging algorithms. Eur Radiol. (2021) 31:8498–512. doi: 10.1007/s00330-021-07931-9
8. Diaz-Manera, J, Fernandez-Torron, R, Llauger, J, James, MK, Mayhew, A, Smith, FE, et al. Muscle MRI in patients with dysferlinopathy: pattern recognition and implications for clinical trials. J Neurol Neurosurg Psychiatry. (2018) 89:1071–81. doi: 10.1136/jnnp-2017-317488
9. Pinal-Fernandez, I, Casal-Dominguez, M, Carrino, JA, Lahouti, AH, Basharat, P, Albayda, J, et al. Thigh muscle MRI in immune-mediated Necrotising myopathy: extensive Oedema, early muscle damage and role of anti-Srp autoantibodies as a marker of severity. Ann Rheum Dis. (2017) 76:681–7. doi: 10.1136/annrheumdis-2016-210198
10. Hsu, W-C, Lin, Y-C, Chuang, H-H, Yeh, K-Y, Chan, WP, and Ro, L-S. A muscle biosignature differentiating between limb-girdle muscular dystrophy and idiopathic inflammatory myopathy on magnetic resonance imaging. Front Neurol. (2021) 12:783095. doi: 10.3389/fneur.2021.783095
11. Nagawa, K, Suzuki, M, Yamamoto, Y, Inoue, K, Kozawa, E, Mimura, T, et al. Texture analysis of muscle MRI: machine learning-based classifications in idiopathic inflammatory myopathies. Sci Rep. (2021) 11:9821. doi: 10.1038/s41598-021-89311-3
12. Mercuri, E, Pichiecchio, A, Counsell, S, Allsop, J, Cini, C, Jungbluth, H, et al. A short protocol for muscle MRI in children with muscular dystrophies. Eur J Paediatr Neurol. (2002) 6:305–7. doi: 10.1053/ejpn.2002.0617
13. Lambin, P, Leijenaar, RTH, Deist, TM, Peerlings, J, de Jong, EEC, van Timmeren, J, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol. (2017) 14:749–62. doi: 10.1038/nrclinonc.2017.141
14. Quan, G, Ban, R, Ren, J-L, Liu, Y, Wang, W, Dai, S, et al. Flair and Adc image-based Radiomics features as predictive biomarkers of unfavorable outcome in patients with acute ischemic stroke. Front Neurosci. (2021) 15:730879. doi: 10.3389/fnins.2021.730879
15. De Certaines, JD, Larcher, T, Duda, D, Azzabou, N, Eliat, PA, and Escudero, LM. Application of texture analysis to muscle MRI: 1-what kind of information should be expected from texture analysis? EPJ. Nonlinear Biomed Phys. (2015) 3:1–14. doi: 10.1140/s40366-015-0017-1
16. Sogawa, K, Nodera, H, Takamatsu, N, Mori, A, Yamazaki, H, Shimatani, Y, et al. Neurogenic and myogenic diseases: quantitative texture analysis of muscle us data for differentiation. Radiology. (2017) 283:492–8. doi: 10.1148/radiol.2016160826
17. Allenbach, Y, Mammen, AL, Benveniste, O, and Stenzel, W. 224th Enmc international workshop:: Clinico-Sero-pathological classification of immune-mediated necrotizing myopathies Zandvoort, the Netherlands, 14-16 October 2016. Neuromuscul Disord. (2018) 28:87–99. doi: 10.1016/j.nmd.2017.09.016
18. Takasu, M, Tani, C, Sakoda, Y, Ishikawa, M, Tanitame, K, Date, S, et al. Iterative decomposition of water and fat with Echo asymmetry and least-squares estimation (Ideal) imaging of multiple myeloma: initial clinical efficiency results. Eur Radiol. (2012) 22:1114–21. doi: 10.1007/s00330-011-2351-8
19. Reyngoudt, H, Marty, B, Boisserie, J-M, Le Louër, J, Koumako, C, Baudin, P-Y, et al. Global versus individual muscle segmentation to assess quantitative MRI-based fat fraction changes in neuromuscular diseases. Eur Radiol. (2021) 31:4264–76. doi: 10.1007/s00330-020-07487-0
20. Mercuri, E, Cini, C, Pichiecchio, A, Allsop, J, Counsell, S, Zolkipli, Z, et al. Muscle magnetic resonance imaging in patients with congenital muscular dystrophy and Ullrich phenotype. Neuromuscul Disord. (2003) 13:554–8. doi: 10.1016/S0960-8966(03)00091-9
21. Stramare, R, Beltrame, V, Dal Borgo, R, Gallimberti, L, Frigo, AC, Pegoraro, E, et al. MRI in the assessment of muscular pathology: a comparison between limb-girdle muscular dystrophies, hyaline body myopathies and myotonic dystrophies. Radiol Med. (2010) 115:585–99. doi: 10.1007/s11547-010-0531-2
22. Tang, W-J, Kong, Q-C, Cheng, Z-X, Liang, Y-S, Jin, Z, Chen, L-X, et al. Performance of Radiomics models for tumour-infiltrating lymphocyte (Til) prediction in breast Cancer: the role of the dynamic contrast-enhanced (Dce) MRI phase. Eur Radiol. (2022) 32:864–75. doi: 10.1007/s00330-021-08173-5
23. Pyradiomics. (2016). Available at: https://pyradiomics.readthedocs.io/en/latest/features.html.
24. Zwanenburg, A, Vallières, M, Abdalah, MA, Aerts, HJWL, Andrearczyk, V, Apte, A, et al. The image biomarker standardization initiative: standardized quantitative Radiomics for high-throughput image-based phenotyping. Radiology. (2020) 295:328–38. doi: 10.1148/radiol.2020191145
25. Liu, X-F, Lu, J-J, Li, M-D, Li, Y, Zeng, A-R, and Qiang, J-W. Prediction of pre-eclampsia by using Radiomics nomogram from gestational hypertension patients. Front Neurosci. (2022) 16:961348. doi: 10.3389/fnins.2022.961348
26. Sauerbrei, W, Royston, P, and Binder, H. Selection of important variables and determination of functional form for continuous predictors in multivariable model building. Stat Med. (2007) 26:5512–28. doi: 10.1002/sim.3148
27. Landis, JR, and Koch, GG. The measurement of observer agreement for categorical data. Biometrics. (1977) 33:159–74. doi: 10.2307/2529310
28. Kim, HS, Yoon, YC, Choi, B-O, Jin, W, and Cha, JG. Muscle fat quantification using magnetic resonance imaging: case-control study of Charcot-Marie-tooth disease patients and volunteers. J Cachexia Sarcopenia Muscle. (2019) 10:574–85. doi: 10.1002/jcsm.12415
29. Warman Chardon, J, Díaz-Manera, J, Tasca, G, Bönnemann, CG, Gómez-Andrés, D, Heerschap, A, et al. Myo-MRI diagnostic protocols in genetic myopathies. Neuromuscul Disord. (2019) 29:827–41. doi: 10.1016/j.nmd.2019.08.011
30. Warman-Chardon, J, Diaz-Manera, J, Tasca, G, and Straub, V. 247th Enmc international workshop: muscle magnetic resonance imaging-implementing muscle MRI as a diagnostic tool for rare genetic myopathy cohorts. Hoofddorp, the Netherlands, September 2019. Neuromuscul Disord. (2020) 30:938–47. doi: 10.1016/j.nmd.2020.08.360
31. Feng, X, Luo, S, Li, J, Yue, D, Xi, J, Zhu, W, et al. Fatty infiltration evaluation and selective pattern characterization of lower limbs in limb-girdle muscular dystrophy type 2a by muscle magnetic resonance imaging. Muscle Nerve. (2018) 58:536–41. doi: 10.1002/mus.26169
32. Davis, WR, Halls, JE, Offiah, AC, Pilkington, C, Owens, CM, and Rosendahl, K. Assessment of active inflammation in juvenile dermatomyositis: a novel magnetic resonance imaging-based scoring system. Rheumatology (Oxford). (2011) 50:2237–44. doi: 10.1093/rheumatology/ker262
33. Verdú-Díaz, J, Alonso-Pérez, J, Nuñez-Peralta, C, Tasca, G, Vissing, J, Straub, V, et al. Accuracy of a machine learning muscle MRI-based tool for the diagnosis of muscular dystrophies. Neurology. (2020) 94:e1094–102. doi: 10.1212/WNL.0000000000009068
34. Lee, JH, Yoon, YC, Kim, HS, Kim, J-H, and Choi, B-O. Texture analysis using T1-weighted images for muscles in Charcot-Marie-tooth disease patients and volunteers. Eur Radiol. (2021) 31:3508–17. doi: 10.1007/s00330-020-07435-y
35. Mannil, M, Burgstaller, JM, Held, U, Farshad, M, and Guggenberger, R. Correlation of texture analysis of Paraspinal musculature on MRI with different clinical endpoints: lumbar stenosis outcome study (Lsos). Eur Radiol. (2019) 29:22–30. doi: 10.1007/s00330-018-5552-6
36. Martins-Bach, AB, Malheiros, J, Matot, B, Martins, PCM, Almeida, CF, Caldeira, W, et al. Quantitative T2 combined with texture analysis of nuclear magnetic resonance images identify different degrees of muscle involvement in three mouse models of muscle dystrophy: mdx, Largemyd and mdx/Largemyd. PLoS One. (2015) 10:e0117835. doi: 10.1371/journal.pone.0117835
37. Duda, D, Kretowski, M, Azzabou, N, and de Certaines, JD. MRI texture analysis for differentiation between healthy and Golden retriever muscular dystrophy dogs at different phases of disease evolution. Computer information systems and industrial management; 2015. Cham: Springer International Publishing (2015).
38. Akinci D’Antonoli, T, Santini, F, Deligianni, X, Garcia Alzamora, M, Rutz, E, Bieri, O, et al. Combination of quantitative MRI fat fraction and texture analysis to evaluate spastic muscles of children with cerebral palsy. Front Neurol. (2021) 12:633808. doi: 10.3389/fneur.2021.633808
39. Mao, B, Ma, J, Duan, S, Xia, Y, Tao, Y, and Zhang, L. Preoperative classification of primary and metastatic liver Cancer via machine learning-based ultrasound Radiomics. Eur Radiol. (2021) 31:4576–86. doi: 10.1007/s00330-020-07562-6
40. Brandão, S, Seixas, D, Ayres-Basto, M, Castro, S, Neto, J, Martins, C, et al. Comparing T1-weighted and T2-weighted three-point Dixon technique with conventional T1-weighted fat-saturation and short-tau inversion recovery (Stir) techniques for the study of the lumbar spine in a short-bore MRI machine. Clin Radiol. (2013) 68:e617–23. doi: 10.1016/j.crad.2013.06.004
41. Carlier, R-Y, and Quijano-Roy, S. Myoimaging in congenital myopathies. Semin Pediatr Neurol. (2019) 29:30–43. doi: 10.1016/j.spen.2019.03.019
Keywords: immune-mediated necrotizing myopathy, limb-girdle muscular dystrophy, radiomics, muscle MRI, machine learning, fatty infiltration
Citation: Wei P, Zhong H, Xie Q, Li J, Luo S, Guan X, Liang Z and Yue D (2023) Machine learning-based radiomics to differentiate immune-mediated necrotizing myopathy from limb-girdle muscular dystrophy R2 using MRI. Front. Neurol. 14:1251025. doi: 10.3389/fneur.2023.1251025
Edited by:
Kristl Claeys, University Hospitals Leuven, BelgiumReviewed by:
Adil Bashir, Auburn University, United StatesBitao Bu, Huazhong University of Science and Technology, China
Lotte Huysmans, KU Leuven Research & Development, Belgium
Copyright © 2023 Wei, Zhong, Xie, Li, Luo, Guan, Liang and Yue. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zonghui Liang, bGlhbmd6aEB2aXAuMTYzLmNvbQ==; Dongyue Yue, ZHl1ZTA5QGZ1ZGFuLmVkdS5jbg==
†These authors have contributed equally to this work