Skip to main content

ORIGINAL RESEARCH article

Front. Med., 27 June 2024
Sec. Precision Medicine
This article is part of the Research Topic Artificial Intelligence in Imaging, Pathology, and Genetic Analysis of Brain Tumor in the Era of Precision Medicine View all 3 articles

MRI radiomics-based interpretable model and nomogram for preoperative prediction of Ki-67 expression status in primary central nervous system lymphoma

Endong Zhao&#x;Endong Zhao1Yun-Feng Yang,&#x;Yun-Feng Yang2,3Miaomiao BaiMiaomiao Bai4Hao ZhangHao Zhang4Yuan-Yuan Yang,Yuan-Yuan Yang2,3Xuelin SongXuelin Song4Shiyun LouShiyun Lou4Yunxuan YuYunxuan Yu4Chao Yang
Chao Yang1*
  • 1Department of Radiology, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
  • 2Laboratory for Medical Imaging Informatics, Shanghai Institute of Technical Physics, Chinese Academy of Sciences, Shanghai, China
  • 3Laboratory for Medical Imaging Informatics, University of Chinese Academy of Sciences, Beijing, China
  • 4Department of Radiology, The Second Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China

Objectives: To investigate the value of interpretable machine learning model and nomogram based on clinical factors, MRI imaging features, and radiomic features to predict Ki-67 expression in primary central nervous system lymphomas (PCNSL).

Materials and methods: MRI images and clinical information of 92 PCNSL patients were retrospectively collected, which were divided into 53 cases in the training set and 39 cases in the external validation set according to different medical centers. A 3D brain tumor segmentation model was trained based on nnU-NetV2, and two prediction models, interpretable Random Forest (RF) incorporating the SHapley Additive exPlanations (SHAP) method and nomogram based on multivariate logistic regression, were proposed for the task of Ki-67 expression status prediction.

Results: The mean dice Similarity Coefficient (DSC) score of the 3D segmentation model on the validation set was 0.85. On the Ki-67 expression prediction task, the AUC of the interpretable RF model on the validation set was 0.84 (95% CI:0.81, 0.86; p < 0.001), which was a 3% improvement compared to the AUC of the nomogram. The Delong test showed that the z statistic for the difference between the two models was 1.901, corresponding to a p value of 0.057. In addition, SHAP analysis showed that the Rad-Score made a significant contribution to the model decision.

Conclusion: In this study, we developed a 3D brain tumor segmentation model and used an interpretable machine learning model and nomogram for preoperative prediction of Ki-67 expression status in PCNSL patients, which improved the prediction of this medical task.

Clinical relevance statement: Ki-67 represents the degree of active cell proliferation and is an important prognostic parameter associated with clinical outcomes. Non-invasive and accurate prediction of Ki-67 expression level preoperatively plays an important role in targeting treatment selection and patient stratification management for PCNSL thereby improving prognosis.

Highlights

• This is the first study to utilize radiomics to preoperatively predict Ki-67 expression status in primary central nervous system lymphoma;

• An interpretable machine learning algorithm framework is proposed to bridge the performance and interpretability gap of traditional classification algorithms;

• A 3D automated brain tumor segmentation model was developed to provide a convenient automated segmentation tool for subsequent brain tumor-related studies.

Introduction

PCNSL is a rare malignant tumor that involves only the Central nervous system (CNS) without lymphomas occurring elsewhere. PCNSL accounts for 3% of CNS tumors and more than 90% are diffuse large B cell lymphomas (1). The incidence of this disease has increased exponentially over the past few decades (2).

The Ki-67 proliferation index has been used as a surrogate marker for rapid growth and increased invasiveness in tumors, and an increasing number of researchers have attempted to predict its expression status in a variety of tumors by different methods, Ki-67 has emerged as one of the major predictive factors for tumor prognosis (313). Multiple studies have revealed the prognostic significance of Ki-67 in PCNSL, demonstrating significant independent predictive value (1421). Liu et al.’s research indicates that high Ki-67 expression (i.e., Ki-67 index ≥90%) is associated with poorer overall survival and progression-free survival in PCNSL (18, 22, 23). Currently, the conventional methods for detecting Ki-67 expression status are utilizing biopsy or surgery, but the risk of intracranial complications is high. Therefore, accurate preoperative noninvasive prediction of Ki-67 expression levels plays an important role in targeting therapeutic choices and patient management for PCNSL thereby improving prognosis.

Accurate semantic segmentation of medical images can help doctors pinpoint pathological areas and help in disease research (24, 25). nnU-NetV2 is a deep learning-based semantic segmentation method, which, as the most competitive medical image segmentation model, achieves optimal results in most public semantic segmentation tasks (26). Radiomics allows high-throughput extraction of key features of an image and utilizes these features in combination with machine learning algorithms to make predictions. Studies have shown that most complex and efficient machine learning models lack interpretability (27, 28). SHAP (29) is a method that provides an explanation for the model by calculating SHAP values to quantify the impact of each feature on the predicted results. In addition, multimodal information also adds to model predictions (30).

Therefore, this paper proposes an interpretable machine learning model that incorporates multimodal information such as clinical factors, image features, and radiomics features to compensate for the shortcomings of existing models in terms of performance or interpretability. In addition, the Nomogram method, which is commonly used in clinical research, is constructed for model comparison experiments. To achieve the goal of effectively predicting the Ki-67 expression status in PCNSL before surgery.

Materials and methods

Patients

This study was approved by the ethics committees of Medical Center 1 and Medical Center 2. The informed consent of the patient is waived, and the entire study follows the principles outlined in the Declaration of Helsinki.

Patients with primary central nervous system lymphoma attending Medical Center 1 from January 2017 to September 2023 and Medical Center 2 from February 2010 to June 2023 were retrospectively collected. Inclusion criteria: (1) The lesion was definitively confirmed by puncture or post-surgical pathology; (2) No other sites of lymphoma occurred; (3) No history of blood or immune system disease. Exclusion criteria: (1) Lack of clinical information and imaging; (2) Poor image quality, VOI difficult to outline; (3) Prior to undergoing MRI, the patient underwent interventions such as puncture, surgery, radiotherapy, and chemotherapy. A total of 92 patients were finally enrolled in the study and the pathological types are all diffuse large B-cell lymphoma, including 53 in the training set (Medical Center 1) and 39 in the external validation set (Medical Center 2). The enrollment flow chart is shown in Figure 1.

Figure 1
www.frontiersin.org

Figure 1. Patient enrollment flowchart.

Imaging protocol

The following MRI scanners were used: GE Signa HDXT 1.5 T 、Siemens Verio 3.0 T and Philips Ingenia CX 3.0 T. More detailed scanning parameters can be found in the Supplementary Appendix.

Image pre-processing

Patient information was first anonymized for all image sequences and then the original DICOM images of three different sequences (T1WI, T1CE, and T2WI) were converted to Neuroimaging Informatics Technology Initiative (NIFTI) format. The spatial resolution of all image sequences is resampled to [1 mm, 1 mm, 3 mm] by the nearest-neighbor interpolation algorithm, which ensures the consistency of the physical space in different images and removes part of the bias caused by different instruments. The MRI images were corrected for artifacts using the N4 bias field correction algorithm. Finally, the Min-max method was used to normalize the intensity of all images into the 0–255 interval. Preprocessing of all images was done using the Simple ITK algorithm library (https://simpleitk.org/).

Imaging features analysis

Analysis of the patient’s clinical information, MRI imaging features, and histopathology data. Includes analysis of patients’ clinical information, MRI image characteristics, and histopathologic data (31, 32). Included: (1) Age and Sex; (2) Tumor length: the maximum diameter of the tumor was measured on the image of the largest cross-section of the tumor; (3) Edema volume: the edema VOI volume parameter was calculated by the 3D Slicer’s calculation function; (4) Involvement of deep regions: whether the tumor invades periventricular regions, basal ganglia, brainstem, or cerebellum; (5) Cystic and necrosis: yes, no; (6) Tumor margins: regular, irregular; (7) Enhancement features: mass and patchy, indicates obvious solid enhancement without large areas of non-enhancing necrosis within. Ring enhancement refers to a circular ring of peripheral enhancement due to cystic degeneration and necrosis within the tumor, leading to no enhancement in the interior. (8) Enhanced signal: homogeneous, nonhomogeneous; (9) Midline shift: yes, no; (10) Morphological characteristics: Angular sign, the irregular enhancement lesions protrude to a certain direction, showing a sharp angle appearance; Incision sign, based on the T1CE images, there are umbilical concave or striated defects on the edge of the enhanced lesion; Butterfly sign, lesion involving the corpus callosum can infiltrate transcallosally, appearing as a symmetric “butterfly” appearance n T1CE imaging; (11) Ki-67index: the Ki-67 proliferation index was calculated using the percentage of cells staining positive for Ki-67.

Imaging features were evaluated by 2 radiologists with 3 years of experience in diagnostic CNS imaging. When disagreements in assessment arose, they were resolved by another radiologist with 15 years of experience in diagnostic CNS imaging. All radiologists were blinded to the patient’s histopathologic information when evaluating imaging features.

Immunohistochemical

Surgical specimens were fixed in 10% buffered formalin solution and then wax block embedded, sectioned, and stained with anti-Ki67 antibody. The antibody binds to the Ki67 protein and the positive cells show a brown complex. One thousand cells were randomly selected in the hot spot field for Ki-67-positive cell counting, and the percentage of all counted cells was the Ki-67 index. According to previous studies (18, 22, 23), 90% was used as the cutoff value for the Ki-67 index. Ki-67 index ≥90% was defined as high expression and less than 90% indicated low expression. There were 37 cases in the high-expression group and 55 cases in the low-expression group.

Research analysis workflow

The research pipeline of this paper is mainly divided into the following four modules: A. Raw Data Acquisition and Preprocessing; B. VOI Segmentation and Features Acquisition; C. Model Establishment; D. Evaluation of the model and interpretability analysis. The specific information is shown in Figure 2.

Figure 2
www.frontiersin.org

Figure 2. Flowchart for analyzing radiomics modeling studies. Raw data acquisition and preprocessing (A); VOI segmentation and features acquisition (B); Model establishment (C); Evaluation of the model and interpretability analysis (D).

The volume of interest segmentation and image registration

The preprocessed T2WI sequences were imported into 3Dslicer (version 5.0.2) software, and VOI segmentation of the tumor parenchyma and its peritumoral edema region was performed by a radiologist with 3 years of experience. The final obtained VOI is used as Ground Truth for training the 3D automatic segmentation model. From all images, twenty T2WI sequences were randomly taken and segmented by another radiologist with 3 years of experience to prepare the data for ICC calculation. The results of the segmentation are shown schematically in Figure 3. The first case depicts a tumor located in the left basal ganglia region, affecting deep brain tissue, with angular protrusions visible at the front edge and an overall nodular enhancement. The second case shows a tumor in the cerebellum, also within deep brain tissue, with a smooth tumor edge and a ring-shaped enhancement on the contrast-enhanced scan.

Figure 3
www.frontiersin.org

Figure 3. Schematic diagram of VOI segmentation and partial image features. The case in the first line depicts a tumor in the left basal ganglia region, which exhibits three imaging features (Involvement of deep regions, Angular sign, Enhancement features: Mass); The case in the second row shows tumors in the cerebellum, including Involvement of deep regions, Enhancement features (Ring), and no Angular sign. Involvement of deep regions: whether the tumor invades periventricular regions, basal ganglia, brainstem, or cerebellum. Angular sign:the irregular enhancement lesions protrude to a certain direction, showing a sharp angle appearance. Enhancement features (Mass): the tumor shows obvious massive. Enhancement features (Ring): there is no obvious enhancement inside the tumor, but obvious circular enhancement can be seen at the edges.

The 3D automatic segmentation model developed in this study is trained based on nnU-NetV2. nnU-NetV2 is an improved version of the U-Net model. U-Net is a classic convolutional neural network architecture for image segmentation, and nnU-Net builds upon it with enhancements, particularly excelling in medical image segmentation tasks. For detailed technical details about the nnU-NetV2 model, please refer to the Supplementary Appendix. The experiments were configured with Python 3.9.0, Pytorch 2.0.0, and cuda11.8 deep learning platforms, Windows 10 operating system, and NVIDIA GeForce RTX 4090 GPU. An efficient 3D segmentation model for brain tumors was obtained by using the 3d_fullres mode for the training setup of the network, and the pre-processed 3D MRI images and Ground Truth were inputted into the network for training. The model is based on a large number of public dataset training experiences to summarize some of the fixed hyperparameters and configuration experience, and can automatically configure the hyperparameters of the model for any new dataset’s characteristics (including Learning rate, Loss function, Architecture template, Optimizer, Data augmentation, etc.), to learn from the a priori experience of other medical datasets while avoiding the problem of model performance degradation due to the lack of experience (5). The final training and validation results of the model are evaluated using the DSC metric.

For image alignment, the Simple ITK algorithm library and ITK-Snap software (version: 4.0.1, www.itksnap.org) were used to align the images of T1WI and T1CE to T2WI. Mutual information was used as the Image similarity metric, Rigid was used as the Transformation model, and Nearest Neighbor Interpolation was used as the image interpolation method.

Feature extraction and screening

Based on Ground Truth and 3 MRI sequences (aligned T1WI sequence, aligned T1WI-enhanced sequence, T2WI sequence), using the Pyradiomics (https://pyradiomics.readthedocs.io/en/latest/) algorithm library (33) to extract 1762 radiomics features, all of which were compliant with the Image Biomarker Standardization Initiative (IBSI) standard (34). Radiomics features obtained from the three sets of sequences were combined to obtain a total of 5,286 features, and the intraclass correlation coefficient (ICC) was calculated based on duplicate outlined image sequences of 20 cases to remove unstable features (ICC < 0.85). Then Z-Score normalization was performed and redundant features were removed using Pearson or Spearman methods. Finally, the radiomics features were screened using the minimal-redundancy-maximal-relevance criterion (mRMR) feature screening algorithm. To further improve the prediction performance of the model, the fusion feature Rad-Score is computed using ElasticNet and added to the training process of the model. For detailed information on the mRMR method, the extracted radiomics feature categories and Rad Score, please refer to the Supplementary Appendix.

Univariate logistic regression analysis was performed on the clinical information and MRI image features, and the variables with p < 0.05 were selected for the multivariable logistic regression analysis, and finally the variables with p < 0.05 were selected as independent predictors of Ki-67 expression status and were used to construct the interpretable machine-learning model and nomogram.

Radiomics model construction

The construction steps of interpretable machine learning predictive models can be divided into three main parts: extraction and screening of structured features, training of complex machine learning models, and application of model interpretation methods. The screened multimodal factors such as multisequence radiomics features and coded clinically independent predictors were first combined and screened using the mRMR algorithm to obtain the 10 predictors with the most predictive value. The Synthetic Minority Oversampling Technique (SMOTE) (35) technique is used to balance the number of samples across labels in the training set to improve the upper bound of model performance. The Random Forest algorithm was chosen as the classifier for model training, and 5-fold cross-validation was used in the training process. The optimal parameter search was performed using GridSearchCV to obtain the best model. The SHAP method was used to provide an explanation for the prediction results of the final random forest model. The principles and advantages of SHAP model interpretable methods and SMOTE data augmentation methods can be found in the Supplementary Appendix.

In addition, independent predictors and the Rad-Score fusion feature screened by univariate and multivariable analyses were combined, and the multivariable logistic regression model was used for model training and the construction of a nomogram.

Statistics

SPSS 26.0 (https://www.ibm.com/spss) and Python 3.7 (https://www.python.org/) were used for statistical analysis. Measurements were tested for normality, and data that did not fit the normal distribution were expressed as median (interquartile spacing) M (P25, P75). Comparisons between groups were made by the Mann–Whitney U rank sum test. The chi-square test or Fisher’s exact probability method was used for counting data. Results of logistic regression analyses were expressed as odd ratio (OR) and 95% CI. p < 0.05 represents a statistical difference. Use “sklearn. Metrics,” “sklearn. Calibration,” and “matplotlib. Pyplot “to calculate the AUC, Sensitivity, and Specificity of the model and plot the calibration curve and decision curve of the model, respectively. The Delong test was used to compare the AUC of the models.

Results

Patient characteristics

Table 1 provides detailed results of the differential analysis of clinical factors and imaging characteristics. It can be found that the distribution of patient information was balanced between the training cohort and the external test cohort, with no statistically significant difference in any of the distributions (p > 0.05). In addition, Table 2 shows that the involvement of deep regions, enhancement feature, and angular sign were statistically significant (p < 0.05) in the Ki-67 high and low expression groups.

Table 1
www.frontiersin.org

Table 1. Results of the variability analysis of clinical factors and imaging features between the training set and the external test set.

Table 2
www.frontiersin.org

Table 2. Results of the differential analysis of clinical factors and imaging characteristics between the Ki-67 high and low expression groups.

Evaluation of 3D segmentation model based on nnU-NetV2

The configured Batch size for training the segmentation model is 2 and the Patch size is (48, 224, 192). A total of 1,000 epochs were trained to obtain the optimal model, and the average DSC value on the validation set was 0.85, which shows that nnU-NetV2 has a good segmentation efficacy for brain tumor VOI. The training iterations of the segmentation model are given in Figure 4.

Figure 4
www.frontiersin.org

Figure 4. Training iteration plot for 3D brain tumor segmentation model.

Performance of the two Ki-67 expression prediction models

A total of 21 unstable radiomics features were removed from all radiomics features by calculating the ICC values, and then 826 features were obtained after de-redundancy using Pearson and Spearman. The fusion feature Rad-Score is computed by the ElasticNet regression network.

For the training of the interpretable RF model, 10 features with optimal predictive value were obtained after combining clinical factors, imaging features, and radiomics features and filtering them using the mRMR algorithm (including Rad-Score, 6 radiomics features, and 3 imaging features). After training, an efficient interpretable RF classification model is obtained with AUC: 0.84 (95% CI, [0.81, 0.86]), Sensitivity: 0.929, and Specificity: 0.68 on the external test set. The ROC of the interpretable RF model on the external test set is shown in Figure 5A. The calibration curve is shown in Figure 5B, which shows that the model is well-calibrated with a good linear fit. Decision curve analysis (DCA) is shown in Figure 5C, which shows that the model has good clinical utility.

Figure 5
www.frontiersin.org

Figure 5. Receiver operating characteristic curve, calibration curves and decision curve analysis curves for the interpretable RF model and the multivariate logistic regression nomogram model. The ROC performance of the interpretable RF model and the multivariate logistic regression nomogram model on an external test set (A); Performance of calibration curves for two models (B); The DCA performance of the interpretable RF model (C); The DCA performance of the multivariate logistic regression nomogram model (D).

For the multivariate logistic regression nomogram model, three independent predictors associated with Ki-67 expression levels, namely, Deep involvement, Enhanced characteristics, and Sharp angle feature, were screened out from the imaging features and clinical factors using univariate and multivariate analyses, as shown in Table 3. Combining the Rad-Score fusion feature, a nomogram was constructed using a multivariate logistic regression model, as shown in Figure 6A. This model performed in the external test set with AUC:0.81 (95% CI, [0.78, 0.83]), Sensitivity: 0.79, Specificity: 0.72. The Receiver operating characteristic curve (ROC) of this model on the external test set is shown in Figure 5A, which shows a slight decrease in AUC compared to the effect of the interpretable RF model. The calibration curve is shown in Figure 5B, and the DCA is shown in Figure 5D. It can be seen that the model has a good calibration effect and clinical practicability.

Table 3
www.frontiersin.org

Table 3. Results of univariate and multivariate analysis of imaging characteristics and clinical factors.

Figure 6
www.frontiersin.org

Figure 6. The nomogram and SHAP interpretation plots. Multivariate logistic regression nomogram (A), SHAP model decision interpretation results (B,C). Where A: ‘Rad-score’, B: ‘Gray level co-occurrence matrix correlation’, C: ‘Average gray level intensity’, D:'Gray level dependence matrix dependence entropy’, E: ‘Gray level size zone small area high gray level emphasis’, F: ‘Gray level co-occurrence matrix maximal correlation coefficient’, G: 'Involvement of deep regions’, H: 'Enhancement feature’, I: 'Gray level size zone gray level non-uniformity’, J: 'Angular sign’.

Feature contribution evaluation by SHAP values

To assess the contribution of each feature to the model, the impact of each feature on the predicted results of the model was quantified using the SHAP method. Previous studies have shown that this method is highly suitable for explaining complex medical artificial intelligence models (3639). In Figure 6B, each scatter represents a sample, the horizontal axis represents the SHAP values, the vertical axis represents the features, and the position of each feature on the vertical axis indicates its importance, with higher positions indicating larger contributions; the position of the scatter point on the horizontal axis indicates the SHAP value of the feature, the closer the horizontal coordinate is to the centerline (usually zero), the less the feature contributes to the prediction result, and the further the horizontal coordinate is from the centerline, the more the feature contributes to the prediction result; the color of the scatter can be used to indicate the magnitude of the value of the feature. Figure 6C exhibits the SHAP values for different features, and it can be seen that Rad-Score, Gray level co-occurrence matrix maximal correlation coefficient, and Gray level size zone small area high gray level emphasis exhibit higher model contributions, with contribution values of 0.18, 0.05 and 0.03, respectively. For imaging features, the involvement of deep regions showed a relatively high contribution value (0.02).

Discussion

Ki-67 is a proliferating cell nuclear protein present in all phases of the cell cycle except G0 (40), the higher level indicates more cells in the dividing stage and more active cell division. Ki-67 is an important prognostic parameter associated with clinical outcomes, and accurate preoperative prediction allows for customized patient stratification and proper care. Mukai H’s study suggests that Ki-67 expression levels in patients with breast malignancies may be an indicator of a significant response to neoadjuvant chemotherapy (41). Therefore, the expression level of Ki-67 can also assist in guiding the choice of chemotherapy regimen.

The observation in this study that PCNSL patients in the high-expression group showed more atypical enhancement features, i.e., ring enhancement, was due to the fact that the high Ki-67 index represents excessive tumor growth and high invasiveness, which may lead to cystic degeneration and necrosis due to insufficient blood supply within the tumors; and hemorrhage due to destruction of vascular endothelial cells (42). The “Angular sign” is an intensification of the tumor at a certain level showing a sharp angle of prominence (32), it reflects the characteristics of the lymphoma itself, which has no envelope, is softer in texture, and is easily permeable. This sign appeared more frequently in the Ki-67 high-expression group, which may be due to the uneven growth rate of some parts of the tumor during its rapid growth. In addition, Involvement of deep regions indicates that the tumor growth invades deeper intracranial areas such as periventricular regions, basal ganglia, brainstem, and/or cerebellum [It is an important risk factor in the International Extranodal Lymphoma Study Group (IELSG) prognostic score (43), the present study similarly found that it was also an independent predictor of high Ki-67 expression levels (p < 0.05)].

In a previous study, Ouyang et al. demonstrated good discriminatory ability in preoperative prediction of the Ki-67 proliferation index in patients with meningiomas using the radiomics nomogram (44), but they only extracted imaging features of the tumor parenchyma. The tissues surrounding tumors likewise contain a vast amount of heterogeneous information (45), especially vasogenic edema around intracranial tumors, which are sites of altered specific molecular, cellular, biological, and radiological information. Studies in other different classification tasks and our previous studies have shown that the region of the tumor combined with peritumoral edema will effectively improve the diagnostic performance of classification models (46, 47). In this study, peritumoral edema was included in the region of interest together, aiming to maximize the accurate prediction of Ki-67 expression status.

Both the interpretable RF model and the multivariate logistic regression nomogram constructed in this study showed good results in the preoperative prediction of Ki-67 expression status in PCNSL patients. Comparatively, the interpretable RF model showed better predictive performance. Both models use radiomics features and imaging features of multi-sequence MRI images as training data, while the RF model, as a more complex machine learning model, has a higher upper limit of model performance than the logistic regression model, but the RF model’s interpretability is relatively weak, which limits its use for medical tasks (27, 28). This study combines the SHAP method with the RF model to analyze and quantify the impact of each feature on the model’s prediction results, complementing the interpretability of the RF model. In addition, Rad-Score, a more comprehensive and advanced feature, can help the model understand the data better, as can be seen in Figure 6B, where Rad-Score plays an important role in the decision-making process of the interpretable RF model. The SHAP results indicate that among the selected radiomic features, there are three first-order features describing voxel intensity distribution (Gaussian Laplacian operator and wavelet features) and three features quantifying image grayscale (gray-level co-occurrence matrix and gray-level size zone matrix) that are at the forefront. The Gaussian Laplacian operator is a two-dimensional isotropic measure of the image’s second-order spatial derivative, emphasizing regions in the image with rapid intensity changes, thus primarily used in edge detection tasks. Wavelet transform provides a localized analysis of signals in both time and frequency domains, refining the analysis of signals through operations such as dilation and translation at multiple scales, effectively extracting information. Gray-level co-occurrence matrix and gray-level size zone matrix, as texture features in radiomics, primarily describe voxel grayscale distribution and variations, contributing to better prediction of tumor heterogeneity. Additionally, three macroscopic MRI features exhibiting significant differences between enhancement features, involvement of deep regions, and angular sign surpass certain radiomic features, resulting in model benefit. Compared with the multivariate logistic regression nomogram model, this novel interpretable RF model balances the requirements of both model performance and interpretability, improving the prediction level of this task. In terms of automatic brain tumor segmentation, nnU-NetV2 achieves good segmentation performance by implementing automatic network architecture and training hyper-parameter configuration based on the training experience of several public medical databases. Compared to previous studies (10, 11, 13, 48), this study provides a novel interpretable machine learning radiomics framework that offers an efficient solution for the study of other medical tasks.

This study also has some limitations. Two medical center data were included in the study and were validated on an external validation set, but the amount of data from both centers was small. This is mainly because PCNSL is a low-prevalence disease and large-scale data sets are difficult to collect, and data from more medical centers will be included later to increase the data size for further research. In addition, the Ground Truth segmentation labels required for model training at the time of this study still need to be manually labeled by radiologists due to the lack of annotated public datasets relevant to this task. This segmentation work will be a preliminary technology accumulation for later related studies.

Conclusion

In conclusion, this study presents a novel non-invasive automated interpretable machine learning research framework. The effectiveness of radiomics for preoperative prediction of Ki-67 expression status in PCNSL was explored, and conventional imaging features were incorporated to improve model performance. In addition, an automated 3D brain tumor segmentation model was developed to prepare the segmentation tool for subsequent studies.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving humans were approved by this study was approved by the institutional Ethics Committee of the First Affiliated Hospital of Dalian Medical University (Approval No. PJ-KS-KY-2023-481) and the institutional Ethics Committee of the Second Affiliated Hospital of Dalian Medical University (Approval No. 2023–257). The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because this study is a retrospective non-interventional study that collected existing radiological data from patients without implementing any additional intervention measures.

Author contributions

EZ: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Software, Visualization, Writing – original draft. Y-FY: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Software, Visualization, Writing – original draft. MB: Validation, Visualization, Writing – original draft. HZ: Validation, Visualization, Writing – original draft. Y-YY: Supervision, Writing – review & editing. XS: Validation, Visualization, Writing – original draft. SL: Validation, Visualization, Writing – original draft. YY: Validation, Visualization, Writing – original draft. CY: Resources, Supervision, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2024.1345162/full#supplementary-material

Abbreviations

PCNSL, Primary central nervous system lymphoma; SHAP, SHapley Additive exPlanations; RF, Random Forest; DSC, Dice Similarity Coefficient; CNS, Central nervous system; NIFTI, Neuroimaging Informatics Technology Initiative; VOI, Volume of interest; IBSI, Image Biomarker Standardization Initiative; ICC, The intraclass correlation coefficient; mRMR, Minimal-redundancy-maximal-relevance criterion; SMOTE, Synthetic Minority Oversampling Technique; ROC, Receiver operating characteristic curve; DCA, Decision curve analysis.

References

1. Kasenda, B, Ferreri, AJM, Marturano, E, Forst, D, Bromberg, J, Ghesquieres, H, et al. First-line treatment and outcome of elderly patients with primary central nervous system lymphoma (PCNSL)—a systematic review and individual patient data meta-analysis. Ann Oncol. (2015) 26:1305–13. doi: 10.1093/annonc/mdv076

PubMed Abstract | Crossref Full Text | Google Scholar

2. Ferreri, AJM, Calimeri, T, Cwynarski, K, Dietrich, J, Grommes, C, Hoang-Xuan, K, et al. Primary central nervous system lymphoma. Nat Rev Dis Primers. (2023) 9:29. doi: 10.1038/s41572-023-00439-0

PubMed Abstract | Crossref Full Text | Google Scholar

3. Luo, X, Zheng, R, Zhang, J, He, J, Luo, W, Jiang, Z, et al. CT-based radiomics for predicting Ki-67 expression in lung cancer: a systematic review and meta-analysis. Front Oncol. (2024) 14:1329801. doi: 10.3389/fonc.2024.1329801

PubMed Abstract | Crossref Full Text | Google Scholar

4. Liu, L, Zhao, L, Jing, Y, Li, D, Linghu, H, Wang, H, et al. Exploring a multiparameter MRI-based radiomics approach to predict tumor proliferation status of serous ovarian carcinoma. Insights Imaging. (2024) 15:74. doi: 10.1186/s13244-024-01634-7

PubMed Abstract | Crossref Full Text | Google Scholar

5. Zhu, X, He, Y, Wang, M, Shu, Y, Lai, X, Gan, C, et al. Intratumoral and Peritumoral multiparametric MRI-based Radiomics signature for preoperative prediction of Ki-67 proliferation status in glioblastoma: a two-center study. Acad Radiol. (2024) 31:1560–71. doi: 10.1016/j.acra.2023.09.010

PubMed Abstract | Crossref Full Text | Google Scholar

6. Tabnak, P, HajiEsmailPoor, Z, Baradaran, B, Pashazadeh, F, and Aghebati Maleki, L. MRI-based Radiomics methods for predicting Ki-67 expression in breast Cancer: a systematic review and Meta-analysis. Acad Radiol. (2024) 31:763–87. doi: 10.1016/j.acra.2023.10.010

PubMed Abstract | Crossref Full Text | Google Scholar

7. Zhang, D, Zhang, XY, Lu, WW, Liao, JT, Zhang, CX, Tang, Q, et al. Predicting Ki-67 expression in hepatocellular carcinoma: nomogram based on clinical factors and contrast-enhanced ultrasound radiomics signatures. Abdom Radiol. (2024) 49:1419–31. doi: 10.1007/s00261-024-04191-1

PubMed Abstract | Crossref Full Text | Google Scholar

8. Feng, S, Zhou, D, Li, Y, Yuan, R, Kong, J, Jiang, F, et al. Prediction of Ki-67 expression in bladder cancer based on CT radiomics nomogram. Front Oncol. (2024) 14:1276526. doi: 10.3389/fonc.2024.1276526

PubMed Abstract | Crossref Full Text | Google Scholar

9. Chen, W, Lin, G, Chen, Y, Cheng, F, Li, X, Ding, J, et al. Prediction of the Ki-67 expression level in head and neck squamous cell carcinoma with machine learning-based multiparametric MRI radiomics: a multicenter study. BMC Cancer. (2024) 24:418. doi: 10.1186/s12885-024-12026-x

PubMed Abstract | Crossref Full Text | Google Scholar

10. Li, H, Liu, Z, Li, F, Shi, F, Xia, Y, Zhou, Q, et al. Preoperatively predicting Ki67 expression in pituitary adenomas using deep segmentation network and Radiomics analysis based on multiparameter MRI. Acad Radiol. (2023) 31:617–27. doi: 10.1016/j.acra.2023.05.023

PubMed Abstract | Crossref Full Text | Google Scholar

11. Yan, Y, Lin, XS, Ming, WZ, Chuan, ZQ, Hui, G, Juan, SY, et al. Radiomic analysis based on Gd-EOB-DTPA enhanced MRI for the preoperative prediction of Ki-67 expression in hepatocellular carcinoma. Acad Radiol. (2023) 31:859–69. doi: 10.1016/j.acra.2023.07.019

PubMed Abstract | Crossref Full Text | Google Scholar

12. Deng, S, Ding, J, Wang, H, Mao, G, Sun, J, Hu, J, et al. Deep learning-based radiomic nomograms for predicting Ki67 expression in prostate cancer. BMC Cancer. (2023) 23:638. doi: 10.1186/s12885-023-11130-8

PubMed Abstract | Crossref Full Text | Google Scholar

13. Liu, Y, He, C, Fang, W, Peng, L, Shi, F, Xia, Y, et al. Prediction of Ki-67 expression in gastrointestinal stromal tumors using radiomics of plain and multiphase contrast-enhanced CT. Eur Radiol. (2023) 33:7609–17. doi: 10.1007/s00330-023-09727-5

PubMed Abstract | Crossref Full Text | Google Scholar

14. Li, J, Tang, X, Luo, X, Liu, L, Li, D, and Yang, L. Clinicopathological analysis and specific discriminating markers of interleukin detection in cerebrospinal fluid with primary central nervous system lymphoma: results from a retrospective study. Ann Hematol. (2023) 102:2153–63. doi: 10.1007/s00277-023-05301-7

PubMed Abstract | Crossref Full Text | Google Scholar

15. Khan, B, Chong, I, Ostrom, Q, Ahmed, S, Dandachi, D, Kotrotsou, A, et al. Diffusion-weighted MR imaging histogram analysis in HIV positive and negative patients with primary central nervous system lymphoma as a predictor of outcome and tumor proliferation. Oncotarget. (2020) 11:4093–103. doi: 10.18632/oncotarget.27800

PubMed Abstract | Crossref Full Text | Google Scholar

16. Meyer, HJ, Schob, S, Münch, B, Frydrychowicz, C, Garnov, N, Quäschling, U, et al. Histogram analysis of T1-weighted, T2-weighted, and Postcontrast T1-weighted images in primary CNS lymphoma: correlations with histopathological findings-a preliminary study. Mol Imaging Biol. (2018) 20:318–23. doi: 10.1007/s11307-017-1115-5

PubMed Abstract | Crossref Full Text | Google Scholar

17. Zhang, L, Liu, X, Xu, X, Liu, W, Jia, Y, Chen, W, et al. An integrative non-invasive malignant brain tumors classification and Ki-67 labeling index prediction pipeline with radiomics approach. Eur J Radiol. (2023) 158:110639. doi: 10.1016/j.ejrad.2022.110639

PubMed Abstract | Crossref Full Text | Google Scholar

18. Cho, U, Oh, WJ, Hong, YK, and Lee, YS. Prognostic significance of high Ki-67 index and Histogenetic subclassification in primary central nervous system lymphoma. Appl Immunohistochem Mol Morphol. (2018) 26:254–62. doi: 10.1097/PAI.0000000000000424

PubMed Abstract | Crossref Full Text | Google Scholar

19. Zhang, Y, Zhang, Q, Wang, XX, Deng, XF, and Zhu, YZ. Value of pretherapeutic DWI in evaluating prognosis and therapeutic effect in immunocompetent patients with primary central nervous system lymphoma given high-dose methotrexate-based chemotherapy: ADC-based assessment. Clin Radiol. (2016) 71:1018–29. doi: 10.1016/j.crad.2016.05.017

PubMed Abstract | Crossref Full Text | Google Scholar

20. Chong, I, Ostrom, Q, Khan, B, Dandachi, D, Garg, N, Kotrotsou, A, et al. Whole tumor histogram analysis using DW MRI in primary central nervous system lymphoma correlates with tumor biomarkers and outcome. Cancers. (2019) 11:1506. doi: 10.3390/cancers11101506

PubMed Abstract | Crossref Full Text | Google Scholar

21. Schob, S, Münch, B, Dieckow, J, Quäschling, U, Hoffmann, KT, Richter, C, et al. Whole tumor histogram-profiling of diffusion-weighted magnetic resonance images reflects Tumorbiological features of primary central nervous system lymphoma. Transl Oncol. (2018) 11:504–10. doi: 10.1016/j.tranon.2018.02.006

PubMed Abstract | Crossref Full Text | Google Scholar

22. Liu, J, Wang, Y, Liu, Y, Liu, Z, Cui, Q, Ji, N, et al. Immunohistochemical profile and prognostic significance in primary central nervous system lymphoma: analysis of 89 cases. Oncol Lett. (2017) 14:5505–12. doi: 10.3892/ol.2017.6893

PubMed Abstract | Crossref Full Text | Google Scholar

23. Qi, Z, Duan, L, Yuan, G, Liu, J, Li, J, Li, G, et al. Clinical impact of the histopathological index and neuroimaging features status in primary central nervous system diffuse large B-cell lymphoma: a single-center retrospective analysis of 51 cases. Front Oncol. (2022) 12:769895. doi: 10.3389/fonc.2022.769895

PubMed Abstract | Crossref Full Text | Google Scholar

24. Aljabri, M, and AlGhamdi, M. A review on the use of deep learning for medical images segmentation. Neurocomputing. (2022) 506:311–35. doi: 10.1016/j.neucom.2022.07.070

Crossref Full Text | Google Scholar

25. Cheng, J, Tian, S, Yu, L, Gao, C, Kang, X, Ma, X, et al. ResGANet: residual group attention network for medical image classification and segmentation. Med Image Anal. (2022) 76:102313. doi: 10.1016/j.media.2021.102313

PubMed Abstract | Crossref Full Text | Google Scholar

26. Isensee, F, Jaeger, PF, Kohl, SAA, Petersen, J, and Maier-Hein, KH. nnU-net: a self-configuring method for deep learning-based biomedical image segmentation. Nat Methods. (2021) 18:203–11. doi: 10.1038/s41592-020-01008-z

PubMed Abstract | Crossref Full Text | Google Scholar

27. Gunning, D, Stefik, M, Choi, J, Miller, T, Stumpf, S, and Yang, GZ. XAI-Explainable artificial intelligence. Sci Robot. (2019) 4. doi: 10.1126/scirobotics.aay7120

PubMed Abstract | Crossref Full Text | Google Scholar

28. Peng, J, Zou, K, Zhou, M, Teng, Y, Zhu, X, Zhang, F, et al. An explainable artificial intelligence framework for the deterioration risk prediction of hepatitis patients. J Med Syst. (2021) 45:61. doi: 10.1007/s10916-021-01736-5

PubMed Abstract | Crossref Full Text | Google Scholar

29. Lundberg, SM, and Lee, S-I (2017) A unified approach to interpreting model predictions proceedings of the 31st international conference on neural information processing systems. Curran Associates Inc., Long Beach, California, USA, pp. 4768–4777

Google Scholar

30. Lipkova, J, Chen, RJ, Chen, B, Lu, MY, Barbieri, M, Shao, D, et al. Artificial intelligence for multimodal data integration in oncology. Cancer Cell. (2022) 40:1095–110. doi: 10.1016/j.ccell.2022.09.012

PubMed Abstract | Crossref Full Text | Google Scholar

31. Cheng, G, and Zhang, J. Imaging features (CT, MRI, MRS, and PET/CT) of primary central nervous system lymphoma in immunocompetent patients. Neurol Sci. (2019) 40:535–42. doi: 10.1007/s10072-018-3669-7

PubMed Abstract | Crossref Full Text | Google Scholar

32. Han, Y, Wang, ZJ, Li, WH, Yang, Y, Zhang, J, Yang, XB, et al. Differentiation between primary central nervous system lymphoma and atypical glioblastoma based on MRI morphological feature and signal intensity ratio: a retrospective multicenter study. Front Oncol. (2022) 12:811197. doi: 10.3389/fonc.2022.811197

PubMed Abstract | Crossref Full Text | Google Scholar

33. van Griethuysen, JJM, Fedorov, A, Parmar, C, Hosny, A, Aucoin, N, Narayan, V, et al. Computational Radiomics system to decode the radiographic phenotype. Cancer Res. (2017) 77:e104–7. doi: 10.1158/0008-5472.CAN-17-0339

PubMed Abstract | Crossref Full Text | Google Scholar

34. Zwanenburg, A, Vallières, M, Abdalah, MA, Aerts, HJWL, Andrearczyk, V, Apte, A, et al. The image biomarker standardization initiative: standardized quantitative Radiomics for high-throughput image-based phenotyping. Radiology. (2020) 295:328–38. doi: 10.1148/radiol.2020191145

PubMed Abstract | Crossref Full Text | Google Scholar

35. Chawla, NV, Bowyer, KW, Hall, LO, and Kegelmeyer, WP. SMOTE: synthetic minority over-sampling technique. J Artif Int Res. (2002) 16:321–57. doi: 10.1613/jair.953

Crossref Full Text | Google Scholar

36. Wang, Y, Lang, J, Zuo, JZ, Dong, Y, Hu, Z, Xu, X, et al. The radiomic-clinical model using the SHAP method for assessing the treatment response of whole-brain radiotherapy: a multicentric study. Eur Radiol. (2022) 32:8737–47. doi: 10.1007/s00330-022-08887-0

PubMed Abstract | Crossref Full Text | Google Scholar

37. Yang, H, Liu, H, Lin, J, Xiao, H, Guo, Y, Mei, H, et al. An automatic texture feature analysis framework of renal tumor: surgical, pathological, and molecular evaluation based on multi-phase abdominal CT. Eur Radiol. (2023) 34:355–66. doi: 10.1007/s00330-023-10016-4

PubMed Abstract | Crossref Full Text | Google Scholar

38. Ye, JY, Fang, P, Peng, ZP, Huang, XT, Xie, JZ, and Yin, XY. A radiomics-based interpretable model to predict the pathological grade of pancreatic neuroendocrine tumors. Eur Radiol. (2023) 34:1994–2005. doi: 10.1007/s00330-023-10186-1

PubMed Abstract | Crossref Full Text | Google Scholar

39. Yu, L, Yu, Z, Sun, L, Zhu, L, and Geng, D. A brain tumor computer-aided diagnosis method with automatic lesion segmentation and ensemble decision strategy. Front Med. (2023) 10:1232496. doi: 10.3389/fmed.2023.1232496

PubMed Abstract | Crossref Full Text | Google Scholar

40. Endl, E, and Gerdes, J. Posttranslational modifications of the KI-67 protein coincide with two major checkpoints during mitosis. J Cell Physiol. (2000) 182:371–80. doi: 10.1002/(SICI)1097-4652(200003)182:3<371::AID-JCP8>3.0.CO;2-J

PubMed Abstract | Crossref Full Text | Google Scholar

41. Mukai, H, Yamaguchi, T, Takahashi, M, Hozumi, Y, Fujisawa, T, Ohsumi, S, et al. Ki-67 response-guided preoperative chemotherapy for HER2-positive breast cancer: results of a randomised phase 2 study. Br J Cancer. (2020) 122:1747–53. doi: 10.1038/s41416-020-0815-9

PubMed Abstract | Crossref Full Text | Google Scholar

42. Johnson, BA, Fram, EK, Johnson, PC, and Jacobowitz, R. The variable MR appearance of primary lymphoma of the central nervous system: comparison with histopathologic features. AJNR Am J Neuroradiol. (1997) 18:563–72.

PubMed Abstract | Google Scholar

43. Ferreri, AJM, Blay, J-Y, Reni, M, Pasini, F, Spina, M, Ambrosetti, A, et al. Prognostic scoring system for primary CNS lymphomas: the international Extranodal lymphoma study group experience. J Clin Oncol. (2003) 21:266–72. doi: 10.1200/JCO.2003.09.139

PubMed Abstract | Crossref Full Text | Google Scholar

44. Ouyang, ZQ, He, SN, Zeng, YZ, Zhu, Y, Ling, BB, Sun, XJ, et al. Contrast enhanced magnetic resonance imaging-based radiomics nomogram for preoperatively predicting expression status of Ki-67 in meningioma: a two-center study. Quant Imaging Med Surg. (2023) 13:1100–14. doi: 10.21037/qims-22-689

PubMed Abstract | Crossref Full Text | Google Scholar

45. Wu, Y, Ma, Q, Fan, L, Wu, S, and Wang, J. An automated breast volume scanner-based intra-and Peritumoral Radiomics nomogram for the preoperative prediction of expression of Ki-67 in breast malignancy. Acad Radiol. (2023) 31:93–103. doi: 10.1016/j.acra.2023.07.004

PubMed Abstract | Crossref Full Text | Google Scholar

46. Fan, Y, Wang, X, Dong, Y, Cui, E, Wang, H, Sun, X, et al. Multiregional radiomics of brain metastasis can predict response to EGFR-TKI in metastatic NSCLC. Eur Radiol. (2023) 33:7902–12. doi: 10.1007/s00330-023-09709-7

PubMed Abstract | Crossref Full Text | Google Scholar

47. Kim, Y, Cho, HH, Kim, ST, Park, H, Nam, D, and Kong, DS. Radiomics features to distinguish glioblastoma from primary central nervous system lymphoma on multi-parametric MRI. Neuroradiology. (2018) 60:1297–305. doi: 10.1007/s00234-018-2091-4

PubMed Abstract | Crossref Full Text | Google Scholar

48. Moon, CM, Lee, YY, Kim, DY, Yoon, W, Baek, BH, Park, JH, et al. Preoperative prediction of Ki-67 and p53 status in meningioma using a multiparametric MRI-based clinical-radiomic model. Front Oncol. (2023) 13:1138069. doi: 10.3389/fonc.2023.1138069

Crossref Full Text | Google Scholar

Keywords: Ki-67, radiomics, interpretable model, primary central nervous system lymphoma, nomogram

Citation: Zhao E, Yang Y-F, Bai M, Zhang H, Yang Y-Y, Song X, Lou S, Yu Y and Yang C (2024) MRI radiomics-based interpretable model and nomogram for preoperative prediction of Ki-67 expression status in primary central nervous system lymphoma. Front. Med. 11:1345162. doi: 10.3389/fmed.2024.1345162

Received: 27 November 2023; Accepted: 11 June 2024;
Published: 27 June 2024.

Edited by:

Zekuan Yu, Fudan University, China

Reviewed by:

Shengjun Sun, Capital Medical University, China
Stathis Hadjidemetriou, University of Limassol, Cyprus

Copyright © 2024 Zhao, Yang, Bai, Zhang, Yang, Song, Lou, Yu and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chao Yang, ZHJ5YW5nY2hhb0AxNjMuY29t

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.