- 1Department of Neurosurgery, Tianjin Medical University General Hospital, Tianjin, China
- 2Department of Molecular Imaging and Nuclear Medicine, Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center for Cancer, Tianjin's Clinical Research Center for Cancer, Key Laboratory of Cancer Prevention and Therapy, Tianjin, China
- 3Department of Neurosurgery, Qilu Hospital of Shandong University Dezhou Hospital (Dezhou People’s Hospital), Shandong, China
Background: Accurate preoperative prediction of glioma is crucial for developing individualized treatment decisions and assessing prognosis. In this study, we aimed to establish and evaluate the value of integrated models by incorporating the intratumoral and peritumoral features from conventional MRI and clinical characteristics in the prediction of glioma grade.
Methods: A total of 213 glioma patients from two centers were included in the retrospective analysis, among which, 132 patients were classified as the training cohort and internal validation set, and the remaining 81 patients were zoned as the independent external testing cohort. A total of 7728 features were extracted from MRI sequences and various volumes of interest (VOIs). After feature selection, 30 radiomic models depended on five sets of machine learning classifiers, different MRI sequences, and four different combinations of predictive feature sources, including features from the intratumoral region only, features from the peritumoral edema region only, features from the fusion area including intratumoral and peritumoral edema region (VOI-fusion), and features from the intratumoral region with the addition of features from peritumoral edema region (feature-fusion), were established to select the optimal model. A nomogram based on the clinical parameter and optimal radiomic model was constructed for predicting glioma grade in clinical practice.
Results: The intratumoral radiomic models based on contrast-enhanced T1-weighted and T2-flair sequences outperformed those based on a single MRI sequence. Moreover, the internal validation and independent external test underscored that the XGBoost machine learning classifier, incorporating features extracted from VOI-fusion, showed superior predictive efficiency in differentiating between low-grade gliomas (LGG) and high-grade gliomas (HGG), with an AUC of 0.805 in the external test. The radiomic models of VOI-fusion yielded higher prediction efficiency than those of feature-fusion. Additionally, the developed nomogram presented an optimal predictive efficacy with an AUC of 0.825 in the testing cohort.
Conclusion: This study systematically investigated the effect of intratumoral and peritumoral radiomics to predict glioma grading with conventional MRI. The optimal model was the XGBoost classifier coupled radiomic model based on VOI-fusion. The radiomic models that depended on VOI-fusion outperformed those that depended on feature-fusion, suggesting that peritumoral features should be rationally utilized in radiomic studies.
1 Introduction
Glioma is a highly fatal disease that represents the most frequent form of primary cancer in the central nervous system (CNS), accounting for about 80% of all malignant tumors in the brain (1–3). Due to the same standardized treatment that can result in varying prognoses for different patients, it may be necessary to make specialized treatment decisions based on the tumor grade in clinical practice (4–6). Gliomas are categorized into grades I-IV in the Central Nervous System Midstream Classification of the World Health Organization (WHO) of 2021, with grades I-II being low-grade gliomas (LGG) and grades III-IV being high-grade gliomas (HGG) (7, 8). Accurate preoperative grading of gliomas is essential for assessing prognosis and developing individualized treatment plans, such as the extent of surgical resection, and the decision of postoperative chemoradiotherapy (9).
In contemporary clinical practice, gliomas are graded based on surgical or puncture histopathologic investigation (10). This diagnostic method is intrusive and slow, though. Furthermore, tissue biopsies from one area of the tumor might not be indicative of the histology of the entire tumor due to the recognized heterogeneity of gliomas and sampling error (11–13). High-precision noninvasive solutions that can offer preoperative grading information are therefore gaining popularity. Over the past decades, magnetic resonance imaging (MRI) has emerged as a crucial non-invasive diagnostic and assistant therapeutic technique for brain tumors, which is used to aid in differential diagnosis, guide treatment planning, and monitor therapy response (14–17). Nevertheless, competent radiologists may easily spot tumors from MRI sequences with the naked eye, gliomas are difficult to discriminate based on grade because of the variability and diversity of the tumors, which is undoubtedly a great challenge for imaging technology (18–20).
Radiomics, a burgeoning discipline, employing automated data mining algorithms to extract characteristics from medical images in a high-throughput manner, has been demonstrated notable advancements in the realm of medical imaging applications (21–24). These extracted features are then utilized by the machine to train itself and generate the anticipated desired output (25–27). Notably, recent progress has been achieved in the prediction of grading gliomas through the utilization of preoperative MRI scans and various machine learning methods. Gemini et al. evaluated the capacity of the Visually AcceSAble Rembrandt Images (VASARI) scoring system in predicting glioma grades and Isocitrate Dehydrogenase (IDH) status, with a possible application in machine learning (28). You et al. utilized traditional radiomics and the VASARI standard to construct a model determining glioma grade with an Area Under the Curve (AUC) of 0.966 (29). Wang et al. created and assessed a multiparametric MRI-based radiomics nomogram for predicting glioma grading (30). In recent years, deep learning has exhibited excellent performance with broader application prospects and deeper development in clinical applications. Voort et al. developed a single multi-task convolutional neural network that used preoperative MRI scans to predict the molecular subtype and grade of glioma, and the independent dataset evaluated that the approach achieved good performance and generalized well (31). Li et al. compared predictive models established by traditional radiomics and deep learning based on multiparametric MRI for grading gliomas and demonstrated that the latter performed better in most circumstances (32). While deep learning-based models have very respectable efficacy, it is commonly recognized as a black box that lacks satisfactory explanatory power. However, these machine learning approaches have focused mainly on the intratumoral region and neglected the role of the peritumoral environment in glioma grade.
The peritumoral environment holds great potential and may provide insightful information for clinical evaluation of the aggressive biological behavior of the tumor (33–35). Regarding intratumoral and peritumoral radiomic analysis, two main research approaches emerged. The first involved feature-fusion, where features from both intratumoral and peritumoral volumes of interest (VOIs) were separately extracted and then integrated. The second method was VOI-fusion, wherein the tumor region was expanded outward by a specific range to create a new VOI combining intra- and peri-tumoral areas. Radiomic features from this newly generated VOI were then extracted for subsequent analysis. For instance, Li et al. independently delineated the gross-tumor region and the peritumor region which was defined as the parenchyma that fell within a 2-cm distance to the tumor boundary (35). The radiomic features extracted from the two regions were merged and screened to build the radiomic model. Differently, Shi et al. expanded the originally segmented masks of VOIs by five radial distances outside the tumor at 1 mm intervals, creating five new VOIs (34). The findings indicated that the radiomic signature derived from peritumoral regions, specifically at dilated distances of 1 mm and 3 mm, demonstrated the most effective prediction performance in different MRI sequences, respectively. To date, although numerous studies have explored the radiomics of the peritumoral region, there has been a lack of a definitive study that has determined which among them is more persuasive and authoritative. The growth and infiltration of gliomas lead to the disruption of the blood-brain barrier. Consequently, there is a leakage of water, electrolytes, and proteins from peritumoral blood vessels, resulting in increased water content within the brain parenchyma, which contributes to the formation of peripheral edema (32, 36). Given that the tumor and the surrounding edematous area were closely interconnected, forming the microenvironment crucial for tumor cell growth and infiltration, it would be more reasonable to explore them as an integrated whole.
Drawing on the current state of research, the two methods were performed and compared in our study. We conducted an investigation focusing on the intratumor region and its surrounding peritumoral edema region of preoperative MRI scans to predict the grade of glioma. A total of 30 radiomic models, which depended on four different combinations of predictive features source (intratumoral VOI, peritumoral VOI, VOI-fusion, and feature-fusion), five different machine learning classifiers, and different MRI sequences, were established to select the optimal model, which was used for the construction of the nomogram for accurately predicting glioma grade, thereby assisting in the development of personalized treatment strategies for patients, ensuring they receive optimal benefits.
2 Materials and methods
2.1 Study population
All procedures involving human participants in this study adhered to the ethical guidelines outlined in the 1964 Declaration of Helsinki and its subsequent revisions, as well as other applicable ethical standards. The study was approved by the Ethics Committee of Tianjin Medical University General Hospital and Qilu Hospital of Shandong University Dezhou Hospital. Written informed consent was waived due to the retrospective nature of the study.
This study retrospectively analyzed 213 patients with cerebral gliomas from January 2019 to June 2023, who underwent preoperative MRI followed by surgery. Among them, 132 patients with glioma were from the Tianjin Medical University General Hospital (center 1), which was classified as the training cohort and internal validation set, and the remaining 81 patients were from the Qilu Hospital of Shandong University Dezhou Hospital (center 2), zoned as the independent external testing cohort. The inclusion and exclusion criteria are shown in Figure 1.
2.2 Pathological assessment
Pathologists with more than 10 years of experience graded the postoperative specimens, based on the 2021 WHO classification of CNS tumors, classifying gliomas into grades I-IV, with grades I-II being LGG and grades III-IV being HGG (19).
2.3 MRI protocol and image preprocessing
Every patient underwent MRI scans within two weeks before the surgery. All MRI studies were performed on the same type of scanners of the two centers, which were acquired using a 3.0 T scanner Discovery MR750 (GE Healthcare) with a 32-channel head coil. MRI acquisition parameters are summarized in Supplementary Table S1. The same MRI protocol was used for training and external testing sets. Patients were told not to move their heads during the scan to reduce the possible impact of head motion. The most useful anatomical multi-contrast MRI sequences included contrast-enhanced T1-weighted images and T2-weighted fluid-attenuated inversion recovery (Flair) images that were analyzed for further study.
For image preprocessing, the preoperative contrast-enhanced T1 images and T2 flair images were spatially aligned by using the rigid registration function of the well-validated Ants software from 3D Slicer (version: 5.2.2) software. Moreover, the quality of the registration was carefully inspected for alignment of the ventricular structures by a radiation oncologist. Then, the spacing of contrast-enhanced T1 and T2 flair images was resampled to 1×1×1 mm³.
2.4 Image segmentation and feature extraction
Manual segmentation of the contrast-enhanced T1 and T2 flair images of target lesions was performed using 3D Slicer (version: 5.2.2) software by two radiologists who possessed over five years of experience in a blinded manner to the study outcome, to eliminate unstable radiomic features and minimize inter-individual variability. Following clinical studies (8, 33), three VOIs have been delineated, the intratumoral VOI based on the contrast-enhanced T1 images, the peritumoral edema VOI including the intratumor VOI based on the T2 flair images, and the peritumoral edema VOI only. Given the spatial alignment between the contrast-enhanced T1 and T2 flair images, the delineated VOIs were shared across the two sequences.
In this study, the feature extraction was performed utilizing the Pyradiomics module in Python 3.7.0. A comprehensive set of 1288 quantitative radiomic features was extracted individually from each VOI for every sequence, obtaining a total of 7728 features. The available features were categorized as follows: the three-dimensional shape characteristics (n=14), the first-order statistical distribution of voxel intensities (n=252), and the texture features, which comprised gray-level co-occurrence matrix (GLCM) (n=308), gray-level dependence matrix (GLDM) (n=196), gray-level run length matrix (GLRLM) (n=224), gray level size zone matrix (GLSZM) (n=224), and neighborhood gray-tone difference matrix (NGTDM) (n=70).
2.5 Feature selection
Six distinct radiomic models were developed independently by employing extracted conventionally intratumoral radiomic features from two sequences and three kinds of VOIs. Firstly, the TRT1 model was a radiomic model based on contrast-enhanced T1-derived radiomic features from the intratumoral VOI. The TRT2 model was constructed by utilizing T2 flair-derived radiomic features from the intratumoral VOI. The TR model was established by integrating radiomic features originating from the intratumoral VOI of both sequences. Then, the PR model was based on the combined radiomic features from the peritumoral VOI of both sequences. The TPRVOI-fusion model was constructed by utilizing the radiomic features from the combining VOI of intratumoral and peritumoral VOIs of both sequences. The TPRfeature-fusion model was established by integrating radiomic features from the intratumoral VOI and peritumoral VOI of both sequences, separately.
Subsequently, Z-score normalization was applied to standardize the intensity range of each radiomic feature across various models, preventing the undue assignment of lower or higher weights to specific features. In the feature selection process, three steps were implemented for the training cohort. The ICC test was conducted between the datasets obtained by the two radiologists. The value exceeding 0.75 was deemed indicative of robust reproducibility and reliability, leading to the exclusion of features with ICC<0.75 from subsequent analysis (37, 38). Furthermore, Pearson’s rank correlation coefficient was employed to evaluate the correlation between feature pairs, with one feature randomly excluded from each pair exhibiting a correlation coefficient > 0.9. Lastly, the least absolute shrinkage and selection operator (LASSO) regression, coupled with 10-fold cross-validation, was utilized to identify informative features with non-zero coefficients and calculate the corresponding feature weights.
In addition to radiomic features, machine learning models based on predictive clinical parameters were also constructed, which was referred to as the Clinical model in the study. For the feature selection of clinical features, a two-step procedure was performed. First, univariate analysis was used to identify significant features with a p-value < 0.05. Then, the stepwise multivariate analysis was employed to determine the independent indicator with a p-value < 0.05, which was utilized as the predictive clinical parameters for the prediction of glioma grade.
2.6 Model construction
Then, classifiers including Logistic Regression (LR), Support Vector Machine (SVM), eXtreme Gradient Boosting (XGBoost), Decision Tree (DT), and Multilayer Perceptron (MLP), employed the features filtrated by Lasso feature screening. In total, 30 machine learning models were formulated by integrating the six distinct radiomic models derived from various sequences and VOIs with the five machine learning classifiers for predicting glioma grade.
To ensure the stability of the prediction models, we randomly divided 30% of the training cohort as an internal validation set, then repeated the steps 100 times and averaged the results as the final prediction result under the model. The performance of the model was then evaluated on the independent external testing cohort, which was not used during the development of the model.
To evaluate the effectiveness of these models, several indicators, such as the AUC, accuracy, sensitivity, specificity, Positive Predictive Value (PPV), and Negative Predictive Value (NPV), were computed to evaluate the performance of the models. Additionally, to demonstrate the precision and net benefit, both the calibration curve and the decision curve analysis were employed.
Additionally, a nomogram using a logistic regression algorithm involving the optimal radiomic model and significant clinical features was developed to provide a straightforward visual representation in clinical practice. The receiver operating characteristic (ROC) curve, calibration curve, and decision curve analysis were employed correspondingly.
2.7 Statistical analysis
Software version 3.7.0 of Python was used for statistical analysis. The p-value for statistical significance was fixed at 0.05, and all statistical tests were two-sided. For continuous variables, mean ± SD was applied to communicate data that followed a normal distribution; and counts and percentages (n, %) were utilized to convey data for categorical variables. To compare continuous and categorical variables, t-tests and Chi-square were wielded. The prediction result of each model was displayed on an ROC curve, and the prediction performance was evaluated by calculating the AUC, accuracy, sensitivity, specificity, PPV, and NPV. The Delong test was utilized to verify the significance of the AUC from different ROC curves. The Hosmer-Lemeshow test was employed to evaluate the fitting ability of the model. The entire workflow of this study is illustrated in Figure 2.
Figure 2 The workflow of intratumoral and peritumoral radiomics and clinical analysis for image acquisition, segmentation, feature extraction and selection, model comparison, and nomogram construction in our study.
3 Results
3.1 Patient characteristics
A total of 213 glioma patients fulfilled the requirements for admission from the two centers, 132 patients from center 1 were classified as the training cohort and internal validation set (84HGGs, 48LGGs, mean age 52 years), and the remaining 81 patients from center 2 were zoned as the independent external testing cohort (50HGGs, 31LGGs, mean age 52 years). Table 1 displays the variations in the clinical features of the two groups from different centers. It can be shown that the only factor that significantly distinguished HGG from LGG was age (p<0.001). The gender and tumor location were not proposed as potential predictors of glioma grade.
3.2 Machine learning models based on intratumoral radiomics
For intratumoral radiomics, a total of 2576 (1288 contrast-enhanced T1-based and 1288 T2 flair-based radiomic features) features were extracted from each tumor region, containing shape, first-order, and texture features. Hence the TRT1, TRT2, and TR models included 1288, 1288, and 2576 radiomic features respectively. The feature statistics of categories and distribution are presented in Supplementary Figure S1. After feature filtration of the ICC test and Pearson’s rank correlation coefficient, the LASSO with 10-fold cross-validation was employed to select significant features for building radiomic signatures. At last, 7 features were selected for the TRT1 model (Supplementary Figure S2A), 7 features were selected for the TRT2 model (Supplementary Figure S2B), and 6 features were selected for the TR model (Supplementary Figure S2C), with weighting coefficient severally. The coefficients and mean standard error (MSE) of the 10-fold validation are also exhibited in Supplementary Figure S2.
Then, 15 machine learning models were constructed based on the various radiomic models above-mentioned and machine learning classifiers (LR, SVM, XGBoost, DT, and MLP) to determine which model was optimal for glioma grade prediction. The results of internal validation with randomized division repeated 100 times are shown in Figures 3A–C for the TRT1, TRT2, and TR models in sequence. Among these five categories of machine learning models, the MLP-based TRT1 model, LR-based TRT2 model, and MLP-based TR model, which had the highest mean values of 0.76, 0.73, and 0.77, respectively, were considered the most stable models. In the external testing set, the performance of each category classifier on the TRT1 and TR model outperformed that of the TRT2 model respectively (Figure 3E, Supplementary Table S2). Ultimately, the MLP-based TR model (Figure 3F) with an AUC of 0.734 and MLP-based TRT1 model (Figure 3D) with an AUC of 0.733, were considered the optimal model in intratumoral radiomics. Although the Delong test showed a non-significantly statistic (p = 0.966) between these two models, the accuracy, sensitivity, and specificity of the MLP-based TR model were higher than the MLP-based TRT1 model. Hence, the TR model based on double-sequence MRI was chosen for further research and evaluation. The decision curves and calibration curves are also depicted in Supplementary Figure S4.
Figure 3 The performance of intratumoral radiomic models based on the different MRI sequences. The performance of the internal validation employed 30% of the training cohort randomly, and repeated the steps 100 times of TRT1 (A), TRT2 (B), and TR (C) models respectively. Comparison of ROC curves for the external test of TRT1 (D), TRT2 (E), and TR (F) models of five machine learning models. The models that the mean AUC of the internal validation throughout the 100 repetitions reached more than 0.7, were considered to be relatively stable and efficient. Comprehensively, the MLP-based TR model (AUC = 0.734 in external test) based on the two MRI sequences was considered the optimal model and selected for further analysis.
3.3 Machine learning models based on peritumoral radiomics
For peritumoral radiomics, the PR models contained 2576 radiomic features (Supplementary Figure S1). After feature selection, 8 features were finally selected for the PR model (Supplementary Figure S3A). Figure 4A shows the performance of 100-repetition randomized division internal validation. In the external testing cohort, only the AUC of the MLP-based PR model exceeded 0.7, indicating a lower predictive performance compared to the intratumoral radiomics (Figure 4D).
Figure 4 The performance of intratumoral and peritumoral radiomic models based on the two MRI sequences. The performance of the internal validation employed 30% of the training cohort randomly, and repeated the steps 100 times of PR (A), TPRVOI-fusion (B), and TPRfeature-fusion (C) models respectively. Comparison of ROC curves for the external test of PR (D), TPRVOI-fusion (E), and TPRfeature-fusion (F) models of five machine learning models. Ultimately, the XGBoost-based TPRVOI-fusion model (AUC = 0.805 in the external test) was identified as the best model to develop a nomogram.
3.4 Machine learning models based on intra- and peri-tumoral fusion radiomics
For intra- and peri-tumoral fusion radiomics, the TPRVOI-fusion models contained 2576 radiomic features, while the TPRfeature-fusion model included 5152 (Supplementary Figure S1), due to the different combinations of feature sources. After radiomic features dimensionality reduction, the LASSO regression finally selected 6 features for the TPRVOI-fusion model (Supplementary Figure S3B), and 9 features for the TPRfeature-fusion model (Supplementary Figure S3C). Analogically, the two radiomic models were combined with 5 kinds of classifiers previously mentioned to develop machine-learning models for predicting glioma grade. The 100-repetition randomized division internal validation was conducted to evaluate the model’s performance stability. Except for the DT model, the mean AUC of the other four machine learning models reached more than 0.7 in internal validation, which were considered to be relatively stable and efficient predicting models (Figures 4B, C). In the external testing, each categorical classifier on the TPRVOI-fusion model outperformed that of the TPRfeature-fusion model respectively (Table 2, Figures 4E, F). The XGBoost-based TPRVOI-fusion model with an AUC of 0.805 was considered the optimal model (Figure 4E) in all intratumoral and/or peritumoral radiomic models. Supplementary Figure S5 exhibits the decision curves and calibration curves of corresponding models.
Table 2 Each evaluation index of PR, TPRVOI-fusion, and TPRfeature-fusion models of five machine learning classifiers in the external test.
3.5 Nomogram construction
The predictive clinical parameters were chosen based on univariate and multivariate analyses. As shown in Table 3, age was found to be an independent predictor (OR 1.013; 95% CI 1.009-1.016; p<0.001) for glioma grade prediction. Then, the Clinical models were built based on the selected independent predictor and 5 kinds of classifiers.
Subsequently, in exploring the potential utility of the developed MRI-based intratumoral and peritumoral radiomic models for preoperative prediction of glioma grade, a nomogram was constructed by combining the clinical independent predictor with the optimal TPRVOI-fusion machine learning model (Figure 5A). The individual risk of being predicted as an HGG glioma was derived from the cumulative total points obtained, which allowed for the representation of the prediction model in a more simplified and comprehensive manner. Figure 5B illustrates the superior performance of the nomogram with an AUC of 0.825 (testing cohort), in comparison to both the Clinical model and TPRVOI-fusion radiomic model. After the Delong test, the nomogram was proved to significantly outperform both of the models (Figure 5C), and the net benefit in the DCA curve of the nomogram was higher than that of the two models at threshold probabilities in the testing cohort (Figure 5D). The Hosmer-Lemeshow test revealed favorable calibration of the nomogram (p = 0.089), suggesting alignment with an ideal fit without significant deviation (Figure 5E).
Figure 5 Clinical application of the nomogram constructed by radiomic model and clinical parameter in predicting the probability of being HGG for glioma patients (A). The total point was obtained by adding the scores located on the TPRVOI-fusion and age coordinate axis together, and the vertically corresponding value on the bottom line was the probability of being HGG. (B) Comparison of ROC curves of the Clinical model, TPRVOI-fusion radiomic model, and nomogram. (C) The Delong test among the three models. The nomogram with an AUC of 0.825 in the test cohort, significantly outperformed the other two models. The DCA (D) and calibration curves (E) of the three models. The nomogram had a higher net benefit in predicting glioma grade and represented an ideal fit.
4 Discussion
In this study, we developed variously dependable models to preoperatively predict grade glioma by using MRI images, which were constructed by radiomic features extracted from the intratumoral region and peritumoral edema region. The XGBoost machine learning classifier, incorporating features extracted from the combination of intratumoral and peritumoral VOIs of MRI, exhibited superior performance in distinguishing between LGG and HGG. The internal validation and independent external test underscored the robustness and generalizability of the model. Additionally, the radiomic models derived from VOI-fusion outperformed those derived from feature-fusion in our study, suggesting more extensive investigations into peritumoral radiomics were necessary to determine a more standardized research method and provide more theoretical support for radiomic studies. Finally, a nomogram based on the optimal machine learning model and clinical parameter was established to detect potential applications for predicting glioma grade in clinical practice.
MRI serves as a routine tool for preliminary diagnosing, treatment planning, and monitoring the treatment response of patients with glioma (39, 40). Recent studies have consistently demonstrated a robust association between radiomic features extracted from multiparametric MRI scans and various applications related to gliomas (2, 41). Kim et al. distinguished between glioblastoma and primary CNS lymphoma using multiparametric MRI sequences that included contrast-enhanced T1-weighted, T2-weighted, and diffusion-weighted imaging (42). Fifteen features were chosen from all imaging modalities, and the model rendered an AUC of 0.979, demonstrating the potent prediction potential of multi-parameter MRI. Nevertheless, the prediction of models based on a single MRI sequence was not performed and compared. In our study, we constructed and compared various radiomic models based on the intratumoral region of the single contrast-enhanced T1 sequence, the single T2 flair sequence, and the combination of two sequences. As expected, the radiomic models derived from dual-sequence MR imaging outperformed those solely based on single contrast-enhanced T1 or single T2 flair sequence. Hence, multicontrast MRI-based radiomics was poised to enhance the predictive capability for glioma grading compared to that of single MRI sequence.
Previous studies have demonstrated that the heterogeneity of gliomas extended beyond the tumor interior to include the peritumoral region, where approximately 90% of gliomas recurred (43, 44). Glioma cells interacted with molecules in the peritumoral area to cause hypoxia, angiogenesis, and tumor infiltration, which would ultimately accelerate the growth of gliomas (45). Consequently, the significant potential existed in the peritumoral environment, which could provide important information for evaluating the aggressive biological behavior of the tumor clinically (46, 47). In studies of other cancers, Ding et al. investigated the effect of peritumoral features for predicting sentinel lymph node metastasis in breast cancer (48). They created peritumoral regions by expanding tumor regions of interest at thicknesses of 2 mm, 4 mm, 6 mm, and 8 mm. By incorporating peritumoral features, the accuracy in the validation set increased from 0.704 to 0.796. Shan et al. developed a prediction model using peritumoral radiomic signatures extracted from a 2 cm peritumoral area, assessing its effectiveness in predicting the early recurrence of hepatocellular carcinoma post-curative treatment (49). In the validation cohort, ROC curves and decision curves revealed superior prediction efficiency and greater clinical benefits with the peritumoral model. Consequently, the extraction and integration of peritumoral and intratumoral features present a promising avenue. However, existing radiomic-based techniques for grading gliomas focused primarily on the interior of the tumor and less on the peritumoral environment.
Peripheral edema and peritumor in gliomas are two distinct concepts, the peritumor is typically an area within a specified radius around the tumor, whereas the peripheral edema around a glioma is irregular and frequently dispersed along the cerebral gyrus. Previous research has shown that the degree of peritumoral edema increases with the pathological grade and aggressiveness of glioma (14). It was demonstrated that individuals with severe edema (>10 mm) had mean or overall survival rates that were more than 50% lower than those with mild edema (8, 9, 44). Cheng et al. compared the predictive power of the peritumor and peripheral edema region for grading gliomas and found the most predictive features were extracted from the peritumor region within an immediate distance of 1 mm from the tumor core based on MRI scans (8). Notably, the study did not explore a predictive model based on the peripheral edema region in combination with the intratumoral region. Considering that the prognosis of gliomas was strongly correlated with the occurrence of peritumoral edema, we attempted to investigate the radiomic models based on peripheral edema for glioma grade.
Concerning the two main research approaches in intratumoral and peritumoral radiomic analysis, which was simplified as feature-fusion and VOI-fusion, definitive studies remained absent establishing a persuasive conclusion about which approach made more sense and produced more predictive features. Our study filled this gap by conducting a comparative analysis of the two methodologies for the first time. In the internal validation, the AUC of models based on both two methods showed no obvious differences, the difference in AUC was consistent across classifiers. In the external test, each categorical classifier on the VOI-fusion model outperformed that of the feature-fusion model respectively, which indicated that the model constructed by extracting features from the intratumoral and peritumoral regions as a whole yielded higher prediction efficiency. Regarding VOI fusion, the radiomic features, such as shape, first-order, and texture, extracted comprehensive information taking into account both intratumoral and peripheral edema region. Regarding feature fusion, individual peritumoral features exhibited minimal statistical variance in effectively distinguishing HGG from LGG, rendering them prone to elimination during screening. In our study, the constructed model just by feature fusion performed poorly in external tests, indicating unstable performance. This highlighted the need for further studies to unravel the intricacies of intratumoral and peritumoral radiomics.
We assessed 1288 radiomics features for every MRI sequence, and 7728 features in total, which was distinctly more than most recent findings and included all significant variables for radiomic analysis (30, 45). To identify the most optimal model for our dataset, we applied the six previously discussed categories of radiomic models across five classifiers: LR, SVM, XGBoost, DT, and MLP. This meticulous process ensured the exploration of fully optimized models best suited for our data. Meanwhile, previous studies have identified several clinical parameters crucial in distinguishing between LGG and HGG. Wang et al. selected clinical factors including age and sex as well as radiomic signature to develop a nomogram to predict glioma grading (30). It is well-recognized that HGG tends to be diagnosed in the elderly (31, 50). Consistently, we found that only the age parameter was statistically significant in predicting glioma grading, using the uni-multivariate analyses. Despite the lower predictive capacity of clinical model based solely on age feature compared to radiomic models, the nomogram amalgamating clinical parameter and radiomic models surpassed the predictive efficacy of either model in isolation. In terms of the predictive efficiency of various machine learning methods, Voort et al. utilized a deep learning model to predict the grade of glioma, achieving an AUC of 0.81 in the external testing cohort of 240 patients from 13 different institutes (31). Similarly, Li et al. distinguished LGG from HGG by developing deep convolutional neural network models, achieving an AUC of 0.89. In comparison, the nomogram in our study showed great performance with an AUC of 0.825 in the independent external testing cohort, which was equivalent to the state-of-the-art research aforementioned.
For all this, there were limitations in this study. First, our study required a larger sample size from more centers to make the findings more convincing. Second, only two MRI sequences were employed in this study. Some advanced parametric MRI scans, such as DWI and DTI, have shown powerful potential in tumor research, and new scanning techniques should be explored (7, 18, 36). Third, the VOIs in our study were manually annotated, which was time-consuming and laborious, and even prone to inaccurate annotation. Deep learning-based tumor segmentation methods are expected to be employed to improve the accuracy and reliability of image segmentation. Finally, our study lacked molecular subtyping of the samples, which was critical for the prognosis of gliomas, and planned to integrate such information in future studies.
5 Conclusion
In this work, we assessed the function of radiomic models of intratumoral and peripheral edema regions in MRI scans for predicting glioma grade and validated the methodology on an independent external test dataset, which provided a fresh viewpoint on the disease. The nomogram combined clinical parameter and the optimal radiomic model was efficient in glioma grade, and this non-invasive approach was expected to promote clinical research and guide the management of individualized glioma treatment.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Tianjin Medical University General Hospital’s Institutional Ethics Committee. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because The requirement for written informed consent forms from all patients taking part in the study was waived because of the retrospective investigation.
Author contributions
RT: Data curation, Writing – original draft. CS: Methodology, Writing – original draft. CW: Writing – original draft. TZ: Conceptualization, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by Scientific Research Program of Tianjin Municipal Science and Technology Bureau (NO. 19ZXDBSY00040).
Acknowledgments
We express our gratitude for the technical support offered by the OnekeyAI platform.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2024.1401977/full#supplementary-material
References
1. Abdel Razek AAK, Alksas A, Shehata M, AbdelKhalek A, Abdel Baky K, El-Baz A, et al. Clinical applications of artificial intelligence and radiomics in neuro-oncology imaging. Insights Imaging. (2021) 12:152. doi: 10.1186/s13244-021-01102-6
2. Singh G, Manjila S, Sakla N, True A, Wardeh AH, Beig N, et al. Radiomics and radiogenomics in gliomas: a contemporary update. Br J Cancer. Aug. (2021) 125:641–57. doi: 10.1038/s41416-021-01387-w
3. Siegel RL, Miller KD, Wagle NS, Jemal A. Cancer statistics, 2023. CA Cancer J Clin. (2023) 73:17–48. doi: 10.3322/caac.21763
4. Yi Z, Long L, Zeng Y, Liu Z. Current advances and challenges in radiomics of brain tumors. Front Oncol. (2021) 11:732196. doi: 10.3389/fonc.2021.732196
5. Liu D, Chen J, Hu X, Yang K, Liu Y, Hu G, et al. Imaging-genomics in glioblastoma: Combining molecular and imaging signatures. Front Oncol. (2021) 11:699265. doi: 10.3389/fonc.2021.699265
6. Lee DH, Park JE, Kim N, Park SY, Kim YH, Cho YH, et al. Tumor habitat analysis by magnetic resonance imaging distinguishes tumor progression from radiation necrosis in brain metastases after stereotactic radiosurgery. Eur Radiol. (2022) 32:497–507. doi: 10.1007/s00330-021-08204-1
7. Yang Y, Yan LF, Zhang X, Han Y, Nan HY, Hu YC, et al. Glioma grading on conventional MR images: A deep learning study with transfer learning. Front Neurosci. (2018) 12:804. doi: 10.3389/fnins.2018.00804
8. Cheng J, Liu J, Yue H, Bai H, Pan Y, Wang J. Prediction of glioma grade using intratumoral and peritumoral radiomic features from multiparametric MRI images. IEEE/ACM Trans Comput Biol Bioinform. (2022) 19:1084–95. doi: 10.1109/TCBB.2020.3033538
9. Li Y, Ammari S, Lawrance L, Quillent A, Assi T, Lassau N, et al. Radiomics-based method for predicting the glioma subtype as defined by tumor grade, IDH mutation, and 1p/19q codeletion. Cancers (Basel). (2022) 14. doi: 10.3390/cancers14071778
10. Hooper GW, Ginat DT. MRI radiomics and potential applications to glioblastoma. Front Oncol. (2023) 13:1134109. doi: 10.3389/fonc.2023.1134109
11. Calabrese E, Rudie JD, Rauschecker AM, Villanueva-Meyer JE, Clarke JL, Solomon DA, et al. Combining radiomics and deep convolutional neural network features from preoperative MRI for predicting clinically relevant genetic biomarkers in glioblastoma. Neurooncol Adv. (2022) 4:vdac060. doi: 10.1093/noajnl/vdac060
12. Park JE, Kim HS, Kim N, Park SY, Kim YH, Kim JH. Spatiotemporal heterogeneity in multiparametric physiologic MRI is associated with patient outcomes in IDH-wildtype glioblastoma. Clin Cancer Res. (2021) 27:237–45. doi: 10.1158/1078-0432.CCR-20-2156
13. Waqar M, Van Houdt PJ, Hessen E, Li KL, Zhu X, Jackson A, et al. Visualising spatial heterogeneity in glioblastoma using imaging habitats. Front Oncol. (2022) 12:1037896. doi: 10.3389/fonc.2022.1037896
14. Brancato V, Cerrone M, Lavitrano M, Salvatore M. Cavaliere C. A systematic review of the current status and quality of radiomics for glioma differential diagnosis. Cancers (Basel). (2022) 14. doi: 10.3390/cancers14112731
15. Beig N, Bera K, Prasanna P, Antunes J, Correa R, Singh S, et al. Radiogenomic-based survival risk stratification of tumor habitat on gd-T1w MRI is associated with biological processes in glioblastoma. Clin Cancer Res. (2020) 26:1866–76. doi: 10.1158/1078-0432.CCR-19-2556
16. Gillies RJ, Balagurunathan Y. Perfusion MR imaging of breast cancer: Insights using "Habitat imaging". Radiology. (2018) 288:36–7. doi: 10.1148/radiol.2018180271
17. Wu J, Cao G, Sun X, Lee J, Rubin DL, Napel S, et al. Intratumoral spatial heterogeneity at perfusion MR imaging predicts recurrence-free survival in locally advanced breast cancer treated with neoadjuvant chemotherapy. Radiology. (2018) 288:26–35. doi: 10.1148/radiol.2018172462
18. Kim M, Jung SY, Park JE, Jo Y, Park SY, Nam SJ, et al. Diffusion- and perfusion-weighted MRI radiomics model may predict isocitrate dehydrogenase (IDH) mutation and tumor aggressiveness in diffuse lower grade glioma. Eur Radiol. (2020) 30:2142–51. doi: 10.1007/s00330-019-06548-3
19. Zhou H, Xu R, Mei H, Zhang L, Yu Q, Liu R, et al. Application of enhanced T1WI of MRI radiomics in glioma grading. Int J Clin Pract. (2022) 2022:3252574. doi: 10.1155/2022/3252574
20. Wu J, Gong G, Cui Y, Li R. Intratumor partitioning and texture analysis of dynamic contrast-enhanced (DCE)-MRI identifies relevant tumor subregions to predict pathological response of breast cancer to neoadjuvant chemotherapy. J Magn Reson Imaging. (2016) 44:1107–15. doi: 10.1002/jmri.25279
21. Aoude LG, Wong BZY, Bonazzi VF, Brosda S, Walters SB, Koufariotis LT, et al. Radiomics biomarkers correlate with CD8 expression and predict immune signatures in melanoma patients. Mol Cancer Res. (2021) 19:950–6. doi: 10.1158/1541-7786.MCR-20-1038
22. Wang X, Yang W, Zhou Q, Luo H, Chen W, Yeung SJ, et al. The role of (18)F-FDG PET/CT in predicting the pathological response to neoadjuvant PD-1 blockade in combination with chemotherapy for resectable esophageal squamous cell carcinoma. Eur J Nucl Med Mol Imaging. (2022) 49:4241–51. doi: 10.1007/s00259-022-05872-z
23. Hannequin P, Decroisette C, Kermanach P, Berardi G, Bourbonne V. FDG PET and CT radiomics in diagnosis and prognosis of non-small-cell lung cancer. Transl Lung Cancer Res. (2022) 11:2051–63. doi: 10.21037/tlcr-22-158
24. Napel S, Mu W, Jardim-Perassi BV, Aerts H, Gillies RJ. Quantitative imaging of cancer in the postgenomic era: Radio(geno)mics, deep learning, and habitats. Cancer. (2018) 124:4633–49. doi: 10.1002/cncr.31630
25. Wang X, Xie T, Luo J, Zhou Z, Yu X, Guo X. Radiomics predicts the prognosis of patients with locally advanced breast cancer by reflecting the heterogeneity of tumor cells and the tumor microenvironment. Breast Cancer Res. (2022) 24. doi: 10.1186/s13058-022-01516-0
26. Cui L, Yu T, Kan Y, Dong Y, Luo Y, Jiang X. Multi-parametric MRI-based peritumoral radiomics on prediction of lymph-vascular space invasion in early-stage cervical cancer. Diagn Interv Radiol. (2022) 28:312–21. doi: 10.5152/dir.2022.20657
27. Stoyanova R, Chinea F, Kwon D, Reis IM, Tschudi Y, Parra NA, et al. An automated multiparametric MRI quantitative imaging prostate habitat risk scoring system for defining external beam radiation therapy boost volumes. Int J Radiat Oncol Biol Phys. (2018) 102:821–9. doi: 10.1016/j.ijrobp.2018.06.003
28. Gemini L, Tortora M, Giordano P, Prudente ME, Villa A, Vargas O, et al. Vasari scoring system in discerning between different degrees of glioma and IDH status prediction: A possible machine learning application? J Imaging. (2023) 9. doi: 10.3390/jimaging9040075
29. You W, Mao Y, Jiao X, Wang D, Liu J, Lei P, et al. The combination of radiomics features and VASARI standard to predict glioma grade. Front Oncol. (2023) 13:1083216. doi: 10.3389/fonc.2023.1083216
30. Wang Q, Li Q, Mi R, Ye H, Zhang H, Chen B, et al. Radiomics nomogram building from multiparametric MRI to predict grade in patients with glioma: A cohort study. J Magn Reson Imaging. Mar. (2019) 49:825–33. doi: 10.1002/jmri.26265
31. van der Voort SR, Incekara F, Wijnenga MMJ, Kapsas G, Gahrmann R, Schouten JW, et al. Combined molecular subtyping, grading, and segmentation of glioma using multi-task deep learning. Neuro Oncol. (2023) 25:279–89. doi: 10.1093/neuonc/noac166
32. Li Y, Wei D, Liu X, Fan X, Wang K, Li S, et al. Molecular subtyping of diffuse gliomas using magnetic resonance imaging: comparison and correlation between radiomics and deep learning. Eur Radiol. (2022) 32:747–58. doi: 10.1007/s00330-021-08237-6
33. Tian Y, Hua H, Peng Q, Zhang Z, Wang X, Han J, et al. Preoperative evaluation of gd-EOB-DTPA-enhanced MRI radiomics-based nomogram in small solitary hepatocellular carcinoma (</=3 cm) with microvascular invasion: A two-center study. J Magn Reson Imaging. (2022) 56:1459–72. doi: 10.1002/jmri.28157
34. Shi J, Dong Y, Jiang W, Qin F, Wang X, Cui L, et al. MRI-based peritumoral radiomics analysis for preoperative prediction of lymph node metastasis in early-stage cervical cancer: A multi-center study. Magn Reson Imaging. (2022) 88:1–8. doi: 10.1016/j.mri.2021.12.008
35. Li N, Wan X, Zhang H, Zhang Z, Guo Y, Hong D. Tumor and peritumor radiomics analysis based on contrast-enhanced CT for predicting early and late recurrence of hepatocellular carcinoma after liver resection. BMC Cancer. (2022) 22:664. doi: 10.1186/s12885-022-09743-6
36. Su C, Jiang J, Zhang S, Shi J, Xu K, Shen N, et al. Radiomics based on multicontrast MRI can precisely differentiate among glioma subtypes and predict tumour-proliferative behaviour. Eur Radiol. (2019) 29:1986–96. doi: 10.1007/s00330-018-5704-8
37. Xie D, Xu F, Zhu W, Pu C, Huang S, Lou K, et al. Delta radiomics model for the prediction of progression-free survival time in advanced non-small-cell lung cancer patients after immunotherapy. Front Oncol. (2022) 12:990608. doi: 10.3389/fonc.2022.990608
38. Chen K, Wang J, Li S, Zhou W, Xu W. Predictive value of 18F-FDG PET/CT-based radiomics model for neoadjuvant chemotherapy efficacy in breast cancer: a multi-scanner/center study with external validation. Eur J Nucl Med Mol Imaging. (2023) 50:1869–80. doi: 10.1007/s00259-023-06150-2
39. Gu D, Xie Y, Wei J, Li W, Ye Z, Zhu Z, et al. MRI-based radiomics signature: A potential biomarker for identifying glypican 3-positive hepatocellular carcinoma. J Magn Reson Imaging. (2020) 52:1679–87. doi: 10.1002/jmri.27199
40. Yan J, Zhang B, Zhang S, Cheng J, Liu X, Wang W, et al. Quantitative MRI-based radiomics for noninvasively predicting molecular subtypes and survival in glioma patients. NPJ Precis Oncol. (2021) 5:72. doi: 10.1038/s41698-021-00205-z
41. Xu C, Peng Y, Zhu W, Chen Z, Li J, Tan W, et al. An automated approach for predicting glioma grade and survival of LGG patients using CNN and radiomics. Front Oncol. (2022) 12:969907. doi: 10.3389/fonc.2022.969907
42. Kim Y, Cho HH, Kim ST, Park H, Nam D, Kong DS. Radiomics features to distinguish glioblastoma from primary central nervous system lymphoma on multi-parametric MRI. Neuroradiology. (2018) 60:1297–305. doi: 10.1007/s00234-018-2091-4
43. Bailo M, Pecco N, Callea M, Scifo P, Gagliardi F, Presotto L, et al. Decoding the heterogeneity of Malignant gliomas by PET and MRI for spatial habitat analysis of hypoxia, perfusion, and diffusion imaging: A preliminary study. Front Neurosci. (2022) 16:885291. doi: 10.3389/fnins.2022.885291
44. Shaheen A, Bukhari ST, Nadeem M, Burigat S, Bagci U, Mohy-Ud-Din H. Overall survival prediction of glioma patients with multiregional radiomics. Front Neurosci. (2022) 16:911065. doi: 10.3389/fnins.2022.911065
45. Li G, Li L, Li Y, Qian Z, Wu F, He Y, et al. An MRI radiomics approach to predict survival and tumour-infiltrating macrophages in gliomas. Brain. (2022) 145:1151–61. doi: 10.1093/brain/awab340
46. Kazerouni AS, Hormuth DA 2nd, Davis T, Bloom MJ, Mounho S, Rahman G, et al. Quantifying tumor heterogeneity via MRI habitats to characterize microenvironmental alterations in HER2+ Breast cancer. Cancers (Basel). (2022) 14. doi: 10.3390/cancers14071837
47. Algohary A, Shiradkar R, Pahwa S, Purysko A, Verma S, Moses D, et al. Combination of peri-tumoral and intra-tumoral radiomic features on bi-parametric MRI accurately stratifies prostate cancer risk: A multi-site study. Cancers. (2020) 12. doi: 10.3390/cancers12082200
48. Ding J, Chen S, Serrano Sosa M, Cattell R, Lei L, Sun J, et al. Optimizing the peritumoral region size in radiomics analysis for sentinel lymph node status prediction in breast cancer. Acad Radiol. (2022) 29 Suppl 1:S223–8. doi: 10.1016/j.acra.2020.10.015
49. Shan Q-Y, Hu H-T, Feng S-T, Peng Z-p, Chen S-I, Zhou Q, et al. CT-based peritumoral radiomics signatures to predict early recurrence in hepatocellular carcinoma after curative tumor resection or ablation. Cancer Imaging. (2019) 19. doi: 10.1186/s40644-019-0197-5
Keywords: MRI, glioma grade, radiomics, peritumoral features, nomogram
Citation: Tan R, Sui C, Wang C and Zhu T (2024) MRI-based intratumoral and peritumoral radiomics for preoperative prediction of glioma grade: a multicenter study. Front. Oncol. 14:1401977. doi: 10.3389/fonc.2024.1401977
Received: 16 March 2024; Accepted: 29 April 2024;
Published: 13 May 2024.
Edited by:
Camilla Russo, Santobono-Pausilipon Children’s Hospital, ItalyReviewed by:
Ying Zhuge, National Institutes of Health (NIH), United StatesMario Tortora, University of Naples Federico II, Italy
Mario Tranfa, University of Naples Federico II, Italy
Copyright © 2024 Tan, Sui, Wang and Zhu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Tao Zhu, emh1dGFvNUAxMjYuY29t
†These authors share first authorship